随机 K- FACs: 使用随机数字线性线性代数加速 K- FACs (Randomized K-FACs: Speeding up K-FAC with Randomized Numerical Linear Algebra) - 专知论文

会员服务 ·

0

分解的 · 线性的 · PAR · 特征值分解 · 易处理的 ·

2022 年 6 月 30 日

Randomized K-FACs: Speeding up K-FAC with Randomized Numerical Linear Algebra

翻译：随机 K- FACs: 使用随机数字线性线性代数加速 K- FACs

Constantin Octavian Puiu

from arxiv, First version: 12 pages, 2 figures, 1 table

K-FAC is a successful tractable implementation of Natural Gradient for Deep Learning, which nevertheless suffers from the requirement to compute the inverse of the Kronecker factors (through an eigen-decomposition). This can be very time-consuming (or even prohibitive) when these factors are large. In this paper, we theoretically show that, owing to the exponential-average construction paradigm of the Kronecker factors that is typically used, their eigen-spectrum must decay. We show numerically that in practice this decay is very rapid, leading to the idea that we could save substantial computation by only focusing on the first few eigen-modes when inverting the Kronecker-factors. Randomized Numerical Linear Algebra provides us with the necessary tools to do so. Numerical results show we obtain $\approx2.5\times$ reduction in per-epoch time and $\approx3.3\times$ reduction in time to target accuracy. We compare our proposed K-FAC sped-up versions with a more computationally efficient NG implementation, SENG, and observe we perform on par with it.

翻译：K- FAC 是一个成功的“ 深层学习自然梯度” 的成功执行, 但它仍然受制于对克伦克尔因素进行反向计算的要求( 通过 eigen 分解) 。当这些因素巨大时, 这可能非常耗时( 甚至令人望而却步 ) 。在本文中, 我们理论上显示, 由于通常使用的克伦克尔因素的指数平均构建模式, 他们的脑分光必须衰减。我们从数字上显示, 实际上这种衰减非常迅速, 导致一种想法, 即当Kronecker- factors 反转时, 我们只关注最初的几种机率模型, 就可以节省大量计算。随机化的Numical Algebra 为我们提供了这样做的必要工具。数值结果显示, 由于通常使用的Kronecker因素的指数平均构建模式, 他们的脑分光谱必须衰减1美元, 和 $\ approx3.3 y times 在时间上降低目标精确度。我们比较了我们提议的K- FAC 加速计算方法的精度, 与更具有计算效率的GNG 。

0

相关内容

分解的

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

新的小分子化合物WJ460通过靶向Myoferlin抑制乳腺癌转移和复发的分子机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

靶向微管蛋白秋水仙碱位点的白藜芦醇-Combrestatin A-4类抑制剂的设计、合成及活性研究

国家自然科学基金

0+阅读 · 2013年12月31日

Fitzhugh-Nagumo方程和抛物型方程组的时间最优控制问题

国家自然科学基金

0+阅读 · 2012年12月31日

原位合成SiC纳米带增韧ZrB2-SiC高温防氧化涂层研究

国家自然科学基金

0+阅读 · 2012年12月31日

某些偏微分方程解的零点集结构研究

国家自然科学基金

0+阅读 · 2012年12月31日

金属与有机小分子共催化合成几类环状化合物

国家自然科学基金

0+阅读 · 2009年12月31日

水稻OsCAS（Calcium-sensing Receptor）基因的功能分析

国家自然科学基金

0+阅读 · 2009年12月31日

可压和不可压Navier-Stokes方程的一些问题

国家自然科学基金

0+阅读 · 2009年12月31日

可压Navier-Stokes方程及相关流体动力学方程研究

国家自然科学基金

0+阅读 · 2008年12月31日

Instability and Local Minima in GAN Training with Kernel Discriminators

Arxiv

0+阅读 · 2022年8月21日

On augmented finite element formulation for the Navier--Stokes equations with vorticity and variable viscosity

Arxiv

0+阅读 · 2022年8月19日

Linear regression and its inference on noisy network-linked data

Arxiv

0+阅读 · 2022年8月19日

Exponential Speedup Over Locality in MPC with Optimal Memory

Arxiv

0+阅读 · 2022年8月19日

Imaginary Zeroth-Order Optimization

Arxiv

0+阅读 · 2022年8月19日

Exploration, Path Planning with Obstacle and Collision Avoidance in a Dynamic Environment

Arxiv

0+阅读 · 2022年8月19日

A Survey on Unsupervised Visual Industrial Anomaly Detection Algorithms

Arxiv

0+阅读 · 2022年8月19日

On the Estimation of Peer Effects for Sampled Networks

Arxiv

0+阅读 · 2022年8月19日

Balancing Consumer and Business Value of Recommender Systems: A Simulation-based Analysis

Arxiv

0+阅读 · 2022年8月18日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

VIP会员

文章信息

相关主题

特征值分解

相关VIP内容

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【NTU博士论文】利用强化学习与生成模型推进可靠且可泛化的决策

美海军研发“增强侦察与态势评估系统（ARES）”应用程序以优化作战规划（附研究论文）

【NeurIPS2025】DNA-DetectLLM：基于 DNA 启发的“突变-修复”范式揭示 AI 生成文本

面向深度研究系统的强化学习基础：综述

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Instability and Local Minima in GAN Training with Kernel Discriminators

Arxiv

0+阅读 · 2022年8月21日

On augmented finite element formulation for the Navier--Stokes equations with vorticity and variable viscosity

Arxiv

0+阅读 · 2022年8月19日

Linear regression and its inference on noisy network-linked data

Arxiv

0+阅读 · 2022年8月19日

Exponential Speedup Over Locality in MPC with Optimal Memory

Arxiv

0+阅读 · 2022年8月19日

Imaginary Zeroth-Order Optimization

Arxiv

0+阅读 · 2022年8月19日

Exploration, Path Planning with Obstacle and Collision Avoidance in a Dynamic Environment

Arxiv

0+阅读 · 2022年8月19日

A Survey on Unsupervised Visual Industrial Anomaly Detection Algorithms

Arxiv

0+阅读 · 2022年8月19日

On the Estimation of Peer Effects for Sampled Networks

Arxiv

0+阅读 · 2022年8月19日

Balancing Consumer and Business Value of Recommender Systems: A Simulation-based Analysis

Arxiv

0+阅读 · 2022年8月18日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

相关基金

新的小分子化合物WJ460通过靶向Myoferlin抑制乳腺癌转移和复发的分子机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

靶向微管蛋白秋水仙碱位点的白藜芦醇-Combrestatin A-4类抑制剂的设计、合成及活性研究

国家自然科学基金

0+阅读 · 2013年12月31日

Fitzhugh-Nagumo方程和抛物型方程组的时间最优控制问题

国家自然科学基金

0+阅读 · 2012年12月31日

原位合成SiC纳米带增韧ZrB2-SiC高温防氧化涂层研究

国家自然科学基金

0+阅读 · 2012年12月31日

某些偏微分方程解的零点集结构研究

国家自然科学基金

0+阅读 · 2012年12月31日

金属与有机小分子共催化合成几类环状化合物

国家自然科学基金

0+阅读 · 2009年12月31日

水稻OsCAS（Calcium-sensing Receptor）基因的功能分析

国家自然科学基金

0+阅读 · 2009年12月31日

可压和不可压Navier-Stokes方程的一些问题

国家自然科学基金

0+阅读 · 2009年12月31日

可压Navier-Stokes方程及相关流体动力学方程研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员