Machine learning models made up of millions or billions of parameters are often trained and served on large multi-GPU systems. As models grow in size and execute on more GPUs, the collective communication used by these applications becomes a bottleneck. Custom collective algorithms optimized for both particular network topologies and application-specific communication patterns can alleviate this bottleneck and thus help these applications scale. This paper introduces MSCCL, a system designed to make GPU communication programmable. MSCCL provides a data-oriented domain-specific language for writing custom collective communication algorithms and an optimizing compiler that lowers them to an executable form, which runs efficiently and flexibly in an interpreter-based runtime. We used MSCCL to write novel collective implementations for AllReduce and AllToAll that are up to 48% and 20% faster than optimized vendor implementations, respectively. We also demonstrate how directly implementing an application-specific collective called AllToNext in MSCCL results in a 14.5× speedup over the baseline.
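To make the data-oriented, chunk-level style of such a DSL concrete, the following is a minimal illustrative sketch of how a ring AllReduce schedule can be described as explicit per-chunk copy and reduce steps. The `Step` class and `ring_allreduce_schedule` function are hypothetical names for illustration only; they are not the MSCCL language or API.

```python
# Illustrative sketch only: a chunk-oriented description of a ring AllReduce
# schedule, in the spirit of a data-oriented collective DSL. These names are
# hypothetical and do NOT correspond to the actual MSCCL syntax.
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    src: int      # rank that sends
    dst: int      # rank that receives
    chunk: int    # chunk index being communicated
    reduce: bool  # True: receiver reduces into its buffer; False: plain copy


def ring_allreduce_schedule(num_ranks: int) -> list[Step]:
    """Build the communication schedule for a ring AllReduce.

    Each rank's input buffer is split into `num_ranks` chunks.
    Phase 1 (reduce-scatter): after num_ranks - 1 steps, rank r holds the
    fully reduced chunk (r + 1) % num_ranks.
    Phase 2 (all-gather): the reduced chunks are circulated so that every
    rank ends up with all of them.
    """
    steps: list[Step] = []
    # Phase 1: reduce-scatter around the ring.
    for s in range(num_ranks - 1):
        for r in range(num_ranks):
            chunk = (r - s) % num_ranks
            steps.append(Step(src=r, dst=(r + 1) % num_ranks,
                              chunk=chunk, reduce=True))
    # Phase 2: all-gather around the ring.
    for s in range(num_ranks - 1):
        for r in range(num_ranks):
            chunk = (r + 1 - s) % num_ranks
            steps.append(Step(src=r, dst=(r + 1) % num_ranks,
                              chunk=chunk, reduce=False))
    return steps


if __name__ == "__main__":
    for step in ring_allreduce_schedule(4):
        op = "reduce" if step.reduce else "copy"
        print(f"rank {step.src} -> rank {step.dst}: chunk {step.chunk} ({op})")
```

In this style, the algorithm is expressed purely as data movement over named chunks; a compiler and runtime such as those described above would then be responsible for mapping these steps onto channels, threadblocks, and the underlying interconnect.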