We investigate convergence of decentralized fictitious play (DFP) in near-potential games, wherein agents preferences can almost be captured by a potential function. In DFP agents keep local estimates of other agents' empirical frequencies, best-respond against these estimates, and receive information over a time-varying communication network. We prove that empirical frequencies of actions generated by DFP converge around a single Nash Equilibrium (NE) assuming that there are only finitely many Nash equilibria, and the difference in utility functions resulting from unilateral deviations is close enough to the difference in the potential function values. This result assures that DFP has the same convergence properties of standard Fictitious play (FP) in near-potential games.
翻译:我们调查了在近潜在游戏中分散的虚拟游戏(DFP)的趋同,其中代理商的偏好几乎可以被潜在功能所捕捉。在DFP代理商对其它代理商的经验频率进行当地估计,这些估计最能回应这些估计,并在一个时间变化的通信网络中接收信息。 我们证明DFP所产生行动的经验频率集中在一个单一的Nash Equilibrium(NE)上,假设只有有限数量的Nash 平衡,而单方面偏差导致的效用功能差异与潜在功能值的差异非常接近。这保证了DFP在近潜在游戏中具有标准的Ficticious游戏(FP)的趋同性。