BlucGNN: 利用块环环重量矩阵实现高效的 GNN 加速 (BlockGNN: Towards Efficient GNN Acceleration Using Block-Circulant Weight Matrices) - 专知论文

会员服务 ·

0

Weight · GNN · 图 · CC · 可约的 ·

2021 年 4 月 13 日

BlockGNN: Towards Efficient GNN Acceleration Using Block-Circulant Weight Matrices

翻译：BlucGNN: 利用块环环重量矩阵实现高效的 GNN 加速

Zhe Zhou,Bizhao Shi,Zhe Zhang,Yijin Guan,Guangyu Sun,Guojie Luo

In recent years, Graph Neural Networks (GNNs) appear to be state-of-the-art algorithms for analyzing non-euclidean graph data. By applying deep-learning to extract high-level representations from graph structures, GNNs achieve extraordinary accuracy and great generalization ability in various tasks. However, with the ever-increasing graph sizes, more and more complicated GNN layers, and higher feature dimensions, the computational complexity of GNNs grows exponentially. How to inference GNNs in real time has become a challenging problem, especially for some resource-limited edge-computing platforms. To tackle this challenge, we propose BlockGNN, a software-hardware co-design approach to realize efficient GNN acceleration. At the algorithm level, we propose to leverage block-circulant weight matrices to greatly reduce the complexity of various GNN models. At the hardware design level, we propose a pipelined CirCore architecture, which supports efficient block-circulant matrices computation. Basing on CirCore, we present a novel BlockGNN accelerator to compute various GNNs with low latency. Moreover, to determine the optimal configurations for diverse deployed tasks, we also introduce a performance and resource model that helps choose the optimal hardware parameters automatically. Comprehensive experiments on the ZC706 FPGA platform demonstrate that on various GNN tasks, BlockGNN achieves up to $8.3\times$ speedup compared to the baseline HyGCN architecture and $111.9\times$ energy reduction compared to the Intel Xeon CPU platform.

翻译：近年来,图形神经网络(GNNS)似乎是用于分析非欧元图形数据的最先进的算法。通过应用深层学习从图形结构中提取高层次代表,GNNS在各种任务中达到了非常准确性和超强的概括化能力。然而,随着图形规模的不断增加,GNNS层的日益复杂和特性的提高,GNNS的计算复杂性成倍增长。如何实时推断GNS已成为一个具有挑战性的问题,对于一些资源有限的边缘计算平台来说尤其如此。为了应对这一挑战,我们建议BlockGNNNN是一个软件硬件联合设计方法,从图形结构中提取高层次代表高层次代表高层次代表高层次代表的GNNNN的加速性能。然而,在硬件设计层面,我们建议建立一个编织好的CirCoreCore架构,支持高效的区块-Cnurentral矩阵计算。在CirCore Core Core,我们推出一个新的BGNNNNNP6加速度模型, 比较各种GNNNNCS的模型, 以及优化的软化的硬性测试。

1

相关内容

Weight

【图神经网络导论】Intro to Graph Neural Networks，176页ppt

【图神经网络导论】Intro to Graph Neural Networks，176页ppt

专知会员服务

129+阅读 · 2021年6月4日

【NeurIPS 2020】图神经网络GNN架构设计

【NeurIPS 2020】图神经网络GNN架构设计

专知会员服务

85+阅读 · 2020年11月19日

图神经网络基准，37页ppt，NTU Chaitanya Joshi

图神经网络基准，37页ppt，NTU Chaitanya Joshi

专知会员服务

24+阅读 · 2020年8月22日

一份简单《图神经网络》教程，28页ppt

一份简单《图神经网络》教程，28页ppt

专知会员服务

127+阅读 · 2020年8月2日

【ICLR2020】图神经网络与图像处理，微分方程，27页ppt

【ICLR2020】图神经网络与图像处理，微分方程，27页ppt

专知会员服务

48+阅读 · 2020年6月6日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【WWW2020-MAGNN】异质图嵌入的集合图神经网络 MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding

【WWW2020-MAGNN】异质图嵌入的集合图神经网络 MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding

专知会员服务

116+阅读 · 2020年2月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【KDD2020】更深的图神经网络，Towards Deeper Graph Neural Networks

【KDD2020】更深的图神经网络，Towards Deeper Graph Neural Networks

专知

45+阅读 · 2020年7月22日

Graph Neural Network（GNN）最全资源整理分享

Graph Neural Network（GNN）最全资源整理分享

深度学习与NLP

339+阅读 · 2019年7月9日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

人工智能 | UAI 2019等国际会议信息4条

人工智能 | UAI 2019等国际会议信息4条

Call4Papers

6+阅读 · 2019年1月14日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

A Unified Lottery Ticket Hypothesis for Graph Neural Networks

Arxiv

2+阅读 · 2021年6月7日

Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach

Arxiv

0+阅读 · 2021年6月7日

Layered gradient accumulation and modular pipeline parallelism: fast and efficient training of large language models

Arxiv

0+阅读 · 2021年6月4日

MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution

Arxiv

0+阅读 · 2021年6月4日

Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations

Arxiv

0+阅读 · 2021年6月4日

CT-Net: Channel Tensorization Network for Video Classification

Arxiv

0+阅读 · 2021年6月3日

L^2-GCN: Layer-Wise and Learned Efficient Training of Graph Convolutional Networks

L^2-GCN: Layer-Wise and Learned Efficient Training of Graph Convolutional Networks

Arxiv

16+阅读 · 2020年3月30日

Auto-GNN: Neural Architecture Search of Graph Neural Networks

Auto-GNN: Neural Architecture Search of Graph Neural Networks

Arxiv

7+阅读 · 2019年9月10日

Accelerated Methods for Deep Reinforcement Learning

Accelerated Methods for Deep Reinforcement Learning

Arxiv

6+阅读 · 2019年1月10日

Hyperbolic Attention Networks

Arxiv

9+阅读 · 2018年5月24日

VIP会员

文章信息

相关主题

相关VIP内容

【图神经网络导论】Intro to Graph Neural Networks，176页ppt

【图神经网络导论】Intro to Graph Neural Networks，176页ppt

专知会员服务

129+阅读 · 2021年6月4日

【NeurIPS 2020】图神经网络GNN架构设计

【NeurIPS 2020】图神经网络GNN架构设计

专知会员服务

85+阅读 · 2020年11月19日

图神经网络基准，37页ppt，NTU Chaitanya Joshi

图神经网络基准，37页ppt，NTU Chaitanya Joshi

专知会员服务

24+阅读 · 2020年8月22日

一份简单《图神经网络》教程，28页ppt

一份简单《图神经网络》教程，28页ppt

专知会员服务

127+阅读 · 2020年8月2日

【ICLR2020】图神经网络与图像处理，微分方程，27页ppt

【ICLR2020】图神经网络与图像处理，微分方程，27页ppt

专知会员服务

48+阅读 · 2020年6月6日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【WWW2020-MAGNN】异质图嵌入的集合图神经网络 MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding

【WWW2020-MAGNN】异质图嵌入的集合图神经网络 MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding

专知会员服务

116+阅读 · 2020年2月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

数据要素发展报告(2025年)：附下载

人工智能代理提升战时舰船战备水平

【NeurIPS2025教程】大语言模型规划

NeurIPS 2025 教程：深度学习训练不稳定性的理论洞见

相关资讯

【KDD2020】更深的图神经网络，Towards Deeper Graph Neural Networks

【KDD2020】更深的图神经网络，Towards Deeper Graph Neural Networks

专知

45+阅读 · 2020年7月22日

Graph Neural Network（GNN）最全资源整理分享

Graph Neural Network（GNN）最全资源整理分享

深度学习与NLP

339+阅读 · 2019年7月9日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

人工智能 | UAI 2019等国际会议信息4条

人工智能 | UAI 2019等国际会议信息4条

Call4Papers

6+阅读 · 2019年1月14日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

A Unified Lottery Ticket Hypothesis for Graph Neural Networks

Arxiv

2+阅读 · 2021年6月7日

Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach

Arxiv

0+阅读 · 2021年6月7日

Layered gradient accumulation and modular pipeline parallelism: fast and efficient training of large language models

Arxiv

0+阅读 · 2021年6月4日

MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution

Arxiv

0+阅读 · 2021年6月4日

Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations

Arxiv

0+阅读 · 2021年6月4日

CT-Net: Channel Tensorization Network for Video Classification

Arxiv

0+阅读 · 2021年6月3日

L^2-GCN: Layer-Wise and Learned Efficient Training of Graph Convolutional Networks

L^2-GCN: Layer-Wise and Learned Efficient Training of Graph Convolutional Networks

Arxiv

16+阅读 · 2020年3月30日

Auto-GNN: Neural Architecture Search of Graph Neural Networks

Auto-GNN: Neural Architecture Search of Graph Neural Networks

Arxiv

7+阅读 · 2019年9月10日

Accelerated Methods for Deep Reinforcement Learning

Accelerated Methods for Deep Reinforcement Learning

Arxiv

6+阅读 · 2019年1月10日

Hyperbolic Attention Networks

Arxiv

9+阅读 · 2018年5月24日

微信扫码咨询专知VIP会员