ISAAC Newton: Input-based Approximate Curvature for Newton's Method - 专知论文

会员服务 ·

0

曲率 · ISAAC · 近似 · INFORMS · Batch Size ·

2023 年 5 月 1 日

ISAAC Newton: Input-based Approximate Curvature for Newton's Method

翻译：暂无翻译

Felix Petersen,Tobias Sutter,Christian Borgelt,Dongsung Huh,Hilde Kuehne,Yuekai Sun,Oliver Deussen

from arxiv, Published at ICLR 2023, Code @ https://github.com/Felix-Petersen/isaac, Video @ https://youtu.be/7RKRX-MdwqM

We present ISAAC (Input-baSed ApproximAte Curvature), a novel method that conditions the gradient using selected second-order information and has an asymptotically vanishing computational overhead, assuming a batch size smaller than the number of neurons. We show that it is possible to compute a good conditioner based on only the input to a respective layer without a substantial computational overhead. The proposed method allows effective training even in small-batch stochastic regimes, which makes it competitive to first-order as well as second-order methods.

翻译：暂无翻译

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

最优控制问题H1-Galerkin混合有限元方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

还原敏感触发式纳米Pickering乳递药系统的构建及靶向肝癌的研究

国家自然科学基金

0+阅读 · 2012年12月31日

喷雾热解耦合流化床还原技术合成空心铜基复合纳米材料用于Rochow反应的研究

国家自然科学基金

0+阅读 · 2012年12月31日

能量级联定向纳米有机复合薄膜的制备及其光伏性能的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Provably Efficient Bayesian Optimization with Unbiased Gaussian Process Hyperparameter Estimation

Arxiv

0+阅读 · 2023年6月12日

Convergence of Momentum-Based Heavy Ball Method with Batch Updating and/or Approximate Gradients

Arxiv

0+阅读 · 2023年6月10日

Causal Effect Estimation from Observational and Interventional Data Through Matrix Weighted Linear Estimators

Arxiv

0+阅读 · 2023年6月9日

Newton-based alternating methods for the ground state of a class of multi-component Bose-Einstein condensates

Arxiv

0+阅读 · 2023年6月9日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【MIT博士论文】弱监督学习：理论、方法与应用

Andrej Karpathy：2025 年 LLM 年度回顾（2025 LLM Year in Review）

锚定情报：合成欺骗时代的地面真相

NeurIPS 2025 | NMKE：基于神经元归因与动态稀疏掩码的终身知识编辑

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

相关论文

Provably Efficient Bayesian Optimization with Unbiased Gaussian Process Hyperparameter Estimation

Arxiv

0+阅读 · 2023年6月12日

Convergence of Momentum-Based Heavy Ball Method with Batch Updating and/or Approximate Gradients

Arxiv

0+阅读 · 2023年6月10日

Causal Effect Estimation from Observational and Interventional Data Through Matrix Weighted Linear Estimators

Arxiv

0+阅读 · 2023年6月9日

Newton-based alternating methods for the ground state of a class of multi-component Bose-Einstein condensates

Arxiv

0+阅读 · 2023年6月9日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

相关基金

最优控制问题H1-Galerkin混合有限元方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

还原敏感触发式纳米Pickering乳递药系统的构建及靶向肝癌的研究

国家自然科学基金

0+阅读 · 2012年12月31日

喷雾热解耦合流化床还原技术合成空心铜基复合纳米材料用于Rochow反应的研究

国家自然科学基金

0+阅读 · 2012年12月31日

能量级联定向纳米有机复合薄膜的制备及其光伏性能的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员