We improve a theoretical result of the article "On Exploiting Spectral Properties for Solving MDP with Large State Space" showing that their algorithm, which was proved to converge under some unrealistic assumptions, is actually guaranteed to converge always.
翻译:我们改进了一篇题为“利用光谱属性解决大型国家空间的MDP”的文章的理论结果,该文章表明,其算法被证明在某些不现实的假设下会趋同,但实际上保证始终会趋同。