Distributed Linear Bandits under Communication Constraints


November 4, 2022


Sudeep Salgia, Qing Zhao

We consider distributed linear bandits where $M$ agents learn collaboratively to minimize the overall cumulative regret incurred by all agents. Information exchange is facilitated by a central server, and both the uplink and downlink communications are carried over channels with fixed capacity, which limits the amount of information that can be transmitted in each use of the channels. We investigate the regret-communication trade-off by (i) establishing information-theoretic lower bounds on the required communications (in terms of bits) for achieving a sublinear regret order; and (ii) developing an efficient algorithm that achieves the minimum sublinear regret order offered by centralized learning using the minimum order of communications dictated by the information-theoretic lower bounds. For sparse linear bandits, we show that a variant of the proposed algorithm offers a better regret-communication trade-off by leveraging the sparsity of the problem.
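To make the communication constraint concrete, here is a minimal sketch of one way agents could send local parameter estimates to a server over a channel limited to a fixed number of bits per coordinate. It is not the authors' algorithm, which the abstract does not specify. The uniform quantizer, the bounding radius, and the function names `quantize`, `dequantize`, `server_average`, and `uplink_bits` are all assumptions made for illustration.

```python
import random


def quantize(vec, bits, radius=1.0):
    """Map each coordinate (clipped to [-radius, radius]) to an integer code of `bits` bits."""
    levels = 2 ** bits - 1
    step = 2 * radius / levels
    codes = []
    for x in vec:
        x = max(-radius, min(radius, x))  # clip so the code always fits in `bits` bits
        codes.append(round((x + radius) / step))
    return codes


def dequantize(codes, bits, radius=1.0):
    """Invert `quantize` up to a per-coordinate error of at most half a quantization step."""
    levels = 2 ** bits - 1
    step = 2 * radius / levels
    return [c * step - radius for c in codes]


def server_average(local_estimates, bits, radius=1.0):
    """Hypothetical server step: decode every agent's message and average coordinate-wise."""
    decoded = [dequantize(quantize(v, bits, radius), bits, radius) for v in local_estimates]
    m, d = len(decoded), len(decoded[0])
    return [sum(v[i] for v in decoded) / m for i in range(d)]


def uplink_bits(num_agents, dim, bits):
    """Total uplink cost of one round in which every agent sends one quantized vector."""
    return num_agents * dim * bits


if __name__ == "__main__":
    random.seed(0)
    theta = [0.5, -0.25, 0.1]  # assumed unknown parameter, for the demo only
    # Each of M = 4 agents holds a noisy local estimate of theta.
    local = [[t + random.gauss(0.0, 0.05) for t in theta] for _ in range(4)]
    for b in (2, 4, 8):
        print(b, uplink_bits(4, len(theta), b), server_average(local, b))
```

Running the demo with more bits per coordinate shrinks the quantization error of the aggregated estimate while increasing the total number of bits sent. This is a toy version of the regret-communication trade-off the abstract refers to.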

