Graph neural networks (GNNs) have shown promising results in several domains, such as materials science, chemistry, and the social sciences. GNN models often contain millions of parameters and, like other neural network (NN) models, are typically trained by feeding only a fraction of the graphs in the training dataset at a time, in batches, to update the model parameters. The effect of batching algorithms on training time and model performance has been thoroughly explored for NNs but not yet for GNNs. We analyze two different batching algorithms for graph-based models, namely static and dynamic batching, on two datasets: the QM9 dataset of small molecules and the AFLOW materials database. Our experiments show that changing the batching algorithm can provide up to a 2.7x speedup, but that the fastest algorithm depends on the data, model, batch size, hardware, and number of training steps. Furthermore, for select combinations of batch size, dataset, and model, we observe significant differences in model learning metrics between the static and dynamic batching algorithms.
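To make the distinction concrete, the sketch below contrasts the two strategies on variable-size graphs: static batching groups a fixed number of graphs per batch (with each batch then padded to constant shapes), whereas dynamic batching packs a variable number of graphs until a size budget is reached. This is a minimal illustration only; the `Graph` container, the `node_budget` parameter, and the function names are assumptions for exposition, not the implementation evaluated in the paper.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator

@dataclass
class Graph:
    # Illustrative stand-in for a real graph object (e.g., a molecule from QM9).
    n_nodes: int
    n_edges: int

def static_batches(graphs: Iterable[Graph],
                   graphs_per_batch: int) -> Iterator[list[Graph]]:
    """Static batching (sketch): a fixed number of graphs per batch.

    Each batch would then be padded to a fixed node/edge budget so that
    compiled kernel shapes stay constant across training steps.
    """
    batch: list[Graph] = []
    for g in graphs:
        batch.append(g)
        if len(batch) == graphs_per_batch:
            yield batch
            batch = []
    if batch:  # emit the final, possibly smaller, batch
        yield batch

def dynamic_batches(graphs: Iterable[Graph],
                    node_budget: int) -> Iterator[list[Graph]]:
    """Dynamic batching (sketch): pack graphs until a node budget is hit.

    The number of graphs per batch varies, which reduces padding waste
    when graph sizes differ widely.
    """
    batch: list[Graph] = []
    used = 0
    for g in graphs:
        if batch and used + g.n_nodes > node_budget:
            yield batch
            batch, used = [], 0
        batch.append(g)
        used += g.n_nodes
    if batch:
        yield batch
```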