广义拉格朗日编码计算：一种弹性计算-通信权衡方法，可实现弹性、安全和私密计算 (Generalized Lagrange Coded Computing: A Flexible Computation-Communication Tradeoff for Resilient, Secure, and Private Computation) - 专知论文

会员服务 ·

0

编码计算 · 广义拉格朗日 · 弹性 · 广义 · 弹性计算 ·

2023 年 5 月 2 日

Generalized Lagrange Coded Computing: A Flexible Computation-Communication Tradeoff for Resilient, Secure, and Private Computation

翻译：广义拉格朗日编码计算：一种弹性计算-通信权衡方法，可实现弹性、安全和私密计算

Jinbao Zhu,Hengxuan Tang,Songze Li,Yijia Chang

We consider the problem of evaluating arbitrary multivariate polynomials over a massive dataset containing multiple inputs, on a distributed computing system with a master node and multiple worker nodes. Generalized Lagrange Coded Computing (GLCC) codes are proposed to simultaneously provide resiliency against stragglers who do not return computation results in time, security against adversarial workers who deliberately modify results for their benefit, and information-theoretic privacy of the dataset amidst possible collusion of workers. GLCC codes are constructed by first partitioning the dataset into multiple groups, then encoding the dataset using carefully designed interpolation polynomials, and sharing multiple encoded data points to each worker, such that interference computation results across groups can be eliminated at the master. Particularly, GLCC codes include the state-of-the-art Lagrange Coded Computing (LCC) codes as a special case, and exhibit a more flexible tradeoff between communication and computation overheads in optimizing system efficiency. Furthermore, we apply GLCC to distributed training of machine learning models, and demonstrate that GLCC codes achieve a speedup of up to $2.5\text{--}3.9\times$ over LCC codes in training time, across experiments for training image classifiers on different datasets, model architectures, and straggler patterns.

翻译：我们考虑在含有多个输入的大规模数据集上对任意多元多项式进行求值，使用主节点和多个工作节点的分布式计算系统。提出广义拉格朗日编码计算（GLCC）码，旨在同时提供对未能及时返回计算结果的停顿节点的弹性、针对恶意工作节点的安全性以及可能存在劫持的工人之间数据的信息理论隐私保护。GLCC编码首先将数据集分成多个组，然后使用精心设计的插值多项式对数据集进行编码，并向每个工人共享多个编码数据点，从而可以在主节点处消除跨组的干扰计算结果。特别地，GLCC码包含现有最先进的拉格朗日编码计算（LCC）码作为一种特殊情况，并在优化系统效率时展现出更灵活的通信和计算开销的权衡。此外，我们将GLCC应用于分布式机器学习模型的训练，并展示GLCC码在不同的数据集、模型架构和停顿节点模式的图像分类器训练实验中，训练时间可达到LCC码的2.5至3.9倍的加速。

1

相关内容

编码计算

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

77+阅读 · 2022年3月15日

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

【CVPR 2022】单黑箱和多黑箱预测的领域适应，DINE: Domain Adaptation from Single and Multiple Black-box Predictors

【CVPR 2022】单黑箱和多黑箱预测的领域适应，DINE: Domain Adaptation from Single and Multiple Black-box Predictors

专知会员服务

14+阅读 · 2022年3月12日

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

专知会员服务

105+阅读 · 2021年10月30日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

【SIGMOD2020-CMU】在内存中搜索树的顺序保持键压缩，Order-Preserving Key Compression for In-Memory Search Trees

【SIGMOD2020-CMU】在内存中搜索树的顺序保持键压缩，Order-Preserving Key Compression for In-Memory Search Trees

专知会员服务

15+阅读 · 2020年3月7日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知

5+阅读 · 2022年11月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新八篇网络节点表示相关论文—可扩展嵌入、对抗自编码器、图划分、异构信息、显式矩阵分解、深度高斯、图、随机游走

【论文推荐】最新八篇网络节点表示相关论文—可扩展嵌入、对抗自编码器、图划分、异构信息、显式矩阵分解、深度高斯、图、随机游走

专知

14+阅读 · 2018年3月30日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

半线性广义Tricomi方程Cauchy问题解的生命跨度估计研究

国家自然科学基金

0+阅读 · 2017年12月31日

面向众核处理器的HEVC并行编码关键技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向物理层网络编码通信的多进制LDPC码的编码调制设计

国家自然科学基金

0+阅读 · 2013年12月31日

HEVC的低复杂度和并行编码方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

云安全联盟认证与密钥协商

国家自然科学基金

1+阅读 · 2012年12月31日

容错存储系统的扩容问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

物理耦合软件时间行为建模和分析方法研究

国家自然科学基金

2+阅读 · 2011年12月31日

无线通信物理层网络编码与低复杂度迭代可译信道编码联合设计

国家自然科学基金

0+阅读 · 2009年12月31日

MIMO多跳无线网络信道分配算法与跨层优化机制

国家自然科学基金

0+阅读 · 2009年12月31日

虚拟机计算资源调度中关键技术的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Sparq: A Custom RISC-V Vector Processor for Efficient Sub-Byte Quantized Inference

Arxiv

0+阅读 · 2023年6月16日

On the testing of multiple hypothesis in sliced inverse regression

Arxiv

0+阅读 · 2023年6月16日

Communication-Efficient Federated Hypergradient Computation via Aggregated Iterative Differentiation

Arxiv

0+阅读 · 2023年6月16日

A flexible algorithm to offload DAG applications for edge computing

Arxiv

0+阅读 · 2023年6月15日

Your Room is not Private: Gradient Inversion Attack for Deep Q-Learning

Arxiv

0+阅读 · 2023年6月15日

Scaling Data-Constrained Language Models

Arxiv

0+阅读 · 2023年6月15日

The Effect of Length on Key Fingerprint Verification Security and Usability

Arxiv

0+阅读 · 2023年6月15日

TS Cache: A Fast Cache with Timing-speculation Mechanism Under Low Supply Voltages

Arxiv

0+阅读 · 2023年6月15日

PLAN: Variance-Aware Private Mean Estimation

Arxiv

0+阅读 · 2023年6月14日

Implicit Compressibility of Overparametrized Neural Networks Trained with Heavy-Tailed SGD

Arxiv

0+阅读 · 2023年6月13日

VIP会员

文章信息

相关主题

广义拉格朗日

相关VIP内容

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

77+阅读 · 2022年3月15日

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

【CVPR 2022】单黑箱和多黑箱预测的领域适应，DINE: Domain Adaptation from Single and Multiple Black-box Predictors

【CVPR 2022】单黑箱和多黑箱预测的领域适应，DINE: Domain Adaptation from Single and Multiple Black-box Predictors

专知会员服务

14+阅读 · 2022年3月12日

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

专知会员服务

105+阅读 · 2021年10月30日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

【SIGMOD2020-CMU】在内存中搜索树的顺序保持键压缩，Order-Preserving Key Compression for In-Memory Search Trees

【SIGMOD2020-CMU】在内存中搜索树的顺序保持键压缩，Order-Preserving Key Compression for In-Memory Search Trees

专知会员服务

15+阅读 · 2020年3月7日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

[ICML2025]当模型知识遇见扩散模型：扩散辅助的无数据图像合成及域与类别对齐

95页《深度研究DeepResearch的综合综述：系统、方法与应用》

【MIT博士论文】从数据到模型，再回到数据：构建可预测且可靠的机器学习系统”

何恺明CVPR最新讲座PPT上线《走向端到端生成建模》46页ppt

相关资讯

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知

5+阅读 · 2022年11月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新八篇网络节点表示相关论文—可扩展嵌入、对抗自编码器、图划分、异构信息、显式矩阵分解、深度高斯、图、随机游走

【论文推荐】最新八篇网络节点表示相关论文—可扩展嵌入、对抗自编码器、图划分、异构信息、显式矩阵分解、深度高斯、图、随机游走

专知

14+阅读 · 2018年3月30日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

Sparq: A Custom RISC-V Vector Processor for Efficient Sub-Byte Quantized Inference

Arxiv

0+阅读 · 2023年6月16日

On the testing of multiple hypothesis in sliced inverse regression

Arxiv

0+阅读 · 2023年6月16日

Communication-Efficient Federated Hypergradient Computation via Aggregated Iterative Differentiation

Arxiv

0+阅读 · 2023年6月16日

A flexible algorithm to offload DAG applications for edge computing

Arxiv

0+阅读 · 2023年6月15日

Your Room is not Private: Gradient Inversion Attack for Deep Q-Learning

Arxiv

0+阅读 · 2023年6月15日

Scaling Data-Constrained Language Models

Arxiv

0+阅读 · 2023年6月15日

The Effect of Length on Key Fingerprint Verification Security and Usability

Arxiv

0+阅读 · 2023年6月15日

TS Cache: A Fast Cache with Timing-speculation Mechanism Under Low Supply Voltages

Arxiv

0+阅读 · 2023年6月15日

PLAN: Variance-Aware Private Mean Estimation

Arxiv

0+阅读 · 2023年6月14日

Implicit Compressibility of Overparametrized Neural Networks Trained with Heavy-Tailed SGD

Arxiv

0+阅读 · 2023年6月13日

相关基金

半线性广义Tricomi方程Cauchy问题解的生命跨度估计研究

国家自然科学基金

0+阅读 · 2017年12月31日

面向众核处理器的HEVC并行编码关键技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向物理层网络编码通信的多进制LDPC码的编码调制设计

国家自然科学基金

0+阅读 · 2013年12月31日

HEVC的低复杂度和并行编码方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

云安全联盟认证与密钥协商

国家自然科学基金

1+阅读 · 2012年12月31日

容错存储系统的扩容问题研究

国家自然科学基金

0+阅读 · 2012年12月31日

物理耦合软件时间行为建模和分析方法研究

国家自然科学基金

2+阅读 · 2011年12月31日

无线通信物理层网络编码与低复杂度迭代可译信道编码联合设计

国家自然科学基金

0+阅读 · 2009年12月31日

MIMO多跳无线网络信道分配算法与跨层优化机制

国家自然科学基金

0+阅读 · 2009年12月31日

虚拟机计算资源调度中关键技术的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员