Uncoupled Learning Dynamics with $O(\log T)$ Swap Regret in Multiplayer Games - 专知 (Zhuanzhi) Papers

April 25, 2022

Uncoupled Learning Dynamics with $O(\log T)$ Swap Regret in Multiplayer Games

Ioannis Anagnostides, Gabriele Farina, Christian Kroer, Chung-Wei Lee, Haipeng Luo, Tuomas Sandholm

In this paper we establish efficient and uncoupled learning dynamics so that, when employed by all players in a general-sum multiplayer game, the swap regret of each player after $T$ repetitions of the game is bounded by $O(\log T)$, improving over the prior best bounds of $O(\log^4 (T))$. At the same time, we guarantee optimal $O(\sqrt{T})$ swap regret in the adversarial regime as well. To obtain these results, our primary contribution is to show that when all players follow our dynamics with a time-invariant learning rate, the second-order path lengths of the dynamics up to time $T$ are bounded by $O(\log T)$, a fundamental property which could have further implications beyond near-optimally bounding the (swap) regret. Our proposed learning dynamics combine in a novel way optimistic regularized learning with the use of self-concordant barriers. Further, our analysis is remarkably simple, bypassing the cumbersome framework of higher-order smoothness recently developed by Daskalakis, Fishelson, and Golowich (NeurIPS'21).
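To make the quantity bounded in the abstract concrete, the following is a minimal sketch of how swap regret could be measured for one player after the fact. It assumes the standard textbook definition (the best gain in hindsight over all maps from actions to actions, which decomposes into an independent best replacement for each action) and is not taken from the paper. The function names `swap_regret` and `external_regret`, and the representation of play as lists of mixed strategies and utility vectors, are illustrative assumptions.

```python
def swap_regret(strategies, utilities):
    """Swap regret of one player over T rounds.

    strategies[t][a] is the probability placed on action a in round t,
    utilities[t][a] is the utility action a would have earned in round t.
    Assumes the standard definition: for every action a, pick the single
    best replacement b in hindsight and sum the resulting gains.
    """
    d = len(strategies[0])
    total = 0.0
    for a in range(d):
        # Gain from rerouting all mass played on a to b; b == a gives 0,
        # so each term is non-negative.
        best = max(
            sum(x[a] * (u[b] - u[a]) for x, u in zip(strategies, utilities))
            for b in range(d)
        )
        total += best
    return total


def external_regret(strategies, utilities):
    """External regret: best single fixed action versus realized utility."""
    d = len(strategies[0])
    realized = sum(
        sum(xa * ua for xa, ua in zip(x, u))
        for x, u in zip(strategies, utilities)
    )
    best_fixed = max(sum(u[b] for u in utilities) for b in range(d))
    return best_fixed - realized


if __name__ == "__main__":
    # Hypothetical two-round, two-action history in which the player
    # always picks the action that turns out to be worse.
    x = [[1.0, 0.0], [0.0, 1.0]]
    u = [[0.0, 1.0], [1.0, 0.0]]
    print(swap_regret(x, u))      # 2.0
    print(external_regret(x, u))  # 1.0
```

In this toy history the swap regret (2.0) exceeds the external regret (1.0), which illustrates why swap regret is the stronger notion: the comparator may replace each action separately rather than committing to one fixed action.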
