Stochastic gradient descent (SGD) has taken the stage as the primary workhorse for large-scale machine learning, and it is often used with adaptive variants such as AdaGrad, Adam, and AMSGrad. This paper proposes an adaptive stochastic gradient descent method for distributed machine learning that can be viewed as the communication-adaptive counterpart of the celebrated Adam method, justifying its name CADA. The key components of CADA are a set of new rules, tailored to adaptive stochastic gradients, that can be implemented to save communication uploads. The new algorithms adaptively reuse stale Adam gradients, thus saving communication, while retaining convergence rates comparable to those of the original Adam. In numerical experiments, CADA delivers a substantial reduction in the total number of communication rounds.
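To make the communication-saving idea concrete, below is a minimal sketch of a CADA-style training loop: each worker uploads its fresh stochastic gradient only when it differs sufficiently from the last uploaded copy, and the server runs a standard Adam update on the aggregate of (possibly stale) gradients. The skip rule shown here, a fixed threshold `skip_tol` on the gradient change, is a simplified stand-in for the paper's actual adaptive rules, and all function and variable names are illustrative assumptions rather than the authors' implementation.

```python
# Hedged sketch of a communication-adaptive Adam ("CADA"-style) loop.
# NOTE: the skip rule (fixed threshold on the gradient change) is a
# simplified placeholder for the paper's adaptive rules; names are illustrative.
import numpy as np

def adam_step(w, g, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    """One standard Adam update on the server, given the aggregated gradient g."""
    m = b1 * m + (1 - b1) * g
    v = b2 * v + (1 - b2) * g * g
    m_hat = m / (1 - b1 ** t)          # bias-corrected first moment
    v_hat = v / (1 - b2 ** t)          # bias-corrected second moment
    return w - lr * m_hat / (np.sqrt(v_hat) + eps), m, v

def train(grad_fn, w, num_workers=4, steps=100, skip_tol=1e-3):
    m = np.zeros_like(w)
    v = np.zeros_like(w)
    stale = [np.zeros_like(w) for _ in range(num_workers)]  # last uploaded gradients
    uploads = 0
    for t in range(1, steps + 1):
        for i in range(num_workers):
            g_new = grad_fn(i, w)  # worker i's fresh stochastic gradient
            # Communication-adaptive rule (simplified): upload only if the
            # gradient changed enough since the last upload; otherwise the
            # server reuses the stale copy, saving one upload.
            if np.linalg.norm(g_new - stale[i]) > skip_tol:
                stale[i] = g_new
                uploads += 1
        g_agg = sum(stale) / num_workers  # aggregate possibly stale gradients
        w, m, v = adam_step(w, g_agg, m, v, t)
    print(f"uploads: {uploads} / {steps * num_workers} possible")
    return w

# Toy quadratic objective split across workers (illustrative only).
rng = np.random.default_rng(0)
A = [rng.standard_normal((8, 8)) for _ in range(4)]
grad_fn = lambda i, w: A[i].T @ (A[i] @ w)
w_final = train(grad_fn, np.ones(8))
```

In this toy run, late in training the gradients change slowly, so many per-worker uploads are skipped while the server still performs a full Adam step every round, which is the intuition behind the communication-round savings reported in the experiments.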