利用多机构加强多机构强化学习进行混合合作-竞争交流 (Mixed Cooperative-Competitive Communication Using Multi-Agent Reinforcement Learning)

By using communication between multiple agents in multi-agent environments, one can reduce the effects of partial observability by combining one agent's observation with that of others in the same dynamic environment. While a lot of successful research has been done towards communication learning in cooperative settings, communication learning in mixed cooperative-competitive settings is also important and brings its own complexities such as the opposing team overhearing the communication. In this paper, we apply differentiable inter-agent learning (DIAL), designed for cooperative settings, to a mixed cooperative-competitive setting. We look at the difference in performance between communication that is private for a team and communication that can be overheard by the other team. Our research shows that communicating agents are able to achieve similar performance to fully observable agents after a given training period in our chosen environment. Overall, we find that sharing communication across teams results in decreased performance for the communicating team in comparison to results achieved with private communication.

翻译：通过多代理人环境中多个代理人之间的交流,人们可以通过将一个代理人的观察与同一动态环境中的其他人的观察结合起来,减少部分可观察性的影响。虽然在合作环境中对交流学习进行了许多成功的研究,但在合作竞争混合环境中的交流学习也很重要,并带来其本身的复杂性,例如对方小组对通信进行旁听。在本文件中,我们将为合作环境设计的不同机构间学习(DIAL)应用到一种合作竞争环境。我们审视的是,一个小组的私人通信与另一个小组可以偷听的通信之间的性能差异。我们的研究表明,在我们选定的环境中,在特定培训期之后,通信代理人能够取得与完全可观察的代理人相似的性能。总的来说,我们发现,跨小组之间的交流导致通信团队的性能下降,而与私人通信取得的成果相比,通信小组的性能下降。