学习与完全未知的队友合作 (Learning to Cooperate with Completely Unknown Teammates) - 专知论文

会员服务 ·

0

学成 · TEAM · 知识 (knowledge) · 讲稿 · 相互独立的 ·

2022 年 5 月 6 日

Learning to Cooperate with Completely Unknown Teammates

翻译：学习与完全未知的队友合作

Alexandre Neves,Alberto Sardinha

from arxiv, 13 pages, 1 figure

A key goal of ad hoc teamwork is to develop a learning agent that cooperates with unknown teams, without resorting to any pre-coordination protocol. Despite a vast number of ad hoc teamwork algorithms in the literature, most of them cannot address the problem of learning to cooperate with a completely unknown team, unless it learns from scratch. This article presents a novel approach that uses transfer learning alongside the state-of-the-art PLASTIC-Policy to adapt to completely unknown teammates quickly. We test our solution within the Half Field Offense simulator with five different teammates. The teammates were designed independently by developers from different countries and at different times. Our empirical evaluation shows that it is advantageous for an ad hoc agent to leverage its past knowledge when adapting to a new team instead of learning how to cooperate with it from scratch.

翻译：特设团队工作的一个关键目标是开发一个与未知团队合作的学习代理机构,不诉诸任何协调前协议。尽管文献中有大量的特设团队工作算法,但其中大多数无法解决学习与完全未知团队合作的问题,除非它从零开始学习。这篇文章提出了一个新颖的方法,即与最先进的PLASTIC政策一起,利用转移学习来迅速适应完全未知的队友。我们用5个不同的队友测试我们在半场防御模拟器中的解决方案。队友是由不同国家和不同时期的开发商独立设计的。我们的实证评估表明,在适应新团队时,该特设代理机构利用过去的知识而不是学习如何从零开始与它合作,是有好处的。

0

相关内容

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

氮原子α位C-H键的官能团化研究

国家自然科学基金

0+阅读 · 2015年12月31日

卟啉配位聚合物与锚定分子有序自组装敏化太阳能电池

国家自然科学基金

0+阅读 · 2014年12月31日

一种无直流储能元件的电能传输控制新技术：相位和幅值可控交-交变换器

国家自然科学基金

0+阅读 · 2014年12月31日

液态锑金属阳极直接碳燃料电池反应机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

套子代数的Hochschild上同调及套的分类

国家自然科学基金

3+阅读 · 2014年12月31日

有序介孔壳层/纳米线多级结构的可控合成及其光解水催化与电化学储能

国家自然科学基金

0+阅读 · 2014年12月31日

ArnSnOm靶向催化二甲氧基碳酸双酚A二酯合成和缩聚及反应机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

[(p-Cymene)RuCl2]2催化的C-H键直接官能团化反应及机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

水滑石基可见光响应型复合光催化剂薄膜的研制及其光催化性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

等离子体强化多孔介质燃烧降解有机废气的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

Toward Zero Oracle Word Error Rate on the Switchboard Benchmark

Arxiv

0+阅读 · 2022年6月27日

Learning to Parallelize in a Shared-Memory Environment with Transformers

Arxiv

0+阅读 · 2022年6月26日

Iterated Reasoning with Mutual Information in Cooperative and Byzantine Decentralized Teaming

Arxiv

0+阅读 · 2022年6月24日

How is model-related uncertainty quantified and reported in different disciplines?

Arxiv

0+阅读 · 2022年6月24日

Optimizing Two-way Partial AUC with an End-to-end Framework

Arxiv

0+阅读 · 2022年6月23日

Federated Learning for RAN Slicing in Beyond 5G Networks

Arxiv

0+阅读 · 2022年6月22日

Adaptive Transfer Learning on Graph Neural Networks

Arxiv

14+阅读 · 2021年7月20日

Generalizing to Unseen Domains: A Survey on Domain Generalization

Arxiv

30+阅读 · 2021年3月10日

Learning in the Frequency Domain

Learning in the Frequency Domain

Arxiv

11+阅读 · 2020年3月12日

Meta-Transfer Learning for Zero-Shot Super-Resolution

Meta-Transfer Learning for Zero-Shot Super-Resolution

Arxiv

43+阅读 · 2020年2月27日

VIP会员

文章信息

相关主题

知识 (knowledge)

相互独立的

相关VIP内容

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《具备集体态势感知能力的深度强化学习智能体在超视距空战中的应用研究》最新文献

《美军条令文件：频谱管理操作技术》2025最新100页

反制小型无人机：一项重大挑战

《AI作战：将人机协作集成至实时、虚拟与建构环境（LVC）的建模与仿真》

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

相关论文

Toward Zero Oracle Word Error Rate on the Switchboard Benchmark

Arxiv

0+阅读 · 2022年6月27日

Learning to Parallelize in a Shared-Memory Environment with Transformers

Arxiv

0+阅读 · 2022年6月26日

Iterated Reasoning with Mutual Information in Cooperative and Byzantine Decentralized Teaming

Arxiv

0+阅读 · 2022年6月24日

How is model-related uncertainty quantified and reported in different disciplines?

Arxiv

0+阅读 · 2022年6月24日

Optimizing Two-way Partial AUC with an End-to-end Framework

Arxiv

0+阅读 · 2022年6月23日

Federated Learning for RAN Slicing in Beyond 5G Networks

Arxiv

0+阅读 · 2022年6月22日

Adaptive Transfer Learning on Graph Neural Networks

Arxiv

14+阅读 · 2021年7月20日

Generalizing to Unseen Domains: A Survey on Domain Generalization

Arxiv

30+阅读 · 2021年3月10日

Learning in the Frequency Domain

Learning in the Frequency Domain

Arxiv

11+阅读 · 2020年3月12日

Meta-Transfer Learning for Zero-Shot Super-Resolution

Meta-Transfer Learning for Zero-Shot Super-Resolution

Arxiv

43+阅读 · 2020年2月27日

相关基金

氮原子α位C-H键的官能团化研究

国家自然科学基金

0+阅读 · 2015年12月31日

卟啉配位聚合物与锚定分子有序自组装敏化太阳能电池

国家自然科学基金

0+阅读 · 2014年12月31日

一种无直流储能元件的电能传输控制新技术：相位和幅值可控交-交变换器

国家自然科学基金

0+阅读 · 2014年12月31日

液态锑金属阳极直接碳燃料电池反应机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

套子代数的Hochschild上同调及套的分类

国家自然科学基金

3+阅读 · 2014年12月31日

有序介孔壳层/纳米线多级结构的可控合成及其光解水催化与电化学储能

国家自然科学基金

0+阅读 · 2014年12月31日

ArnSnOm靶向催化二甲氧基碳酸双酚A二酯合成和缩聚及反应机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

[(p-Cymene)RuCl2]2催化的C-H键直接官能团化反应及机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

水滑石基可见光响应型复合光催化剂薄膜的研制及其光催化性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

等离子体强化多孔介质燃烧降解有机废气的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员