Explicit Regularization in Overparametrized Models via Noise Injection - 专知 (Zhuanzhi) Papers


Regularization term · Noise · Variance · MoDELS · Performer

June 9, 2022

Explicit Regularization in Overparametrized Models via Noise Injection


Antonio Orvieto, Anant Raj, Hans Kersting, Francis Bach

From arXiv, 32 pages

Injecting noise within gradient descent has several desirable features. In this paper, we explore noise injection before computing a gradient step, which is known to have smoothing and regularizing properties. We show that small perturbations induce explicit regularization for simple finite-dimensional models based on the ℓ1-norm, group ℓ1-norms, or nuclear norms. When applied to overparametrized neural networks with large widths, we show that the same perturbations do not work because of a variance explosion resulting from overparametrization. However, we also show that independent layer-wise perturbations make it possible to avoid the exploding variance term, and explicit regularizers can then be obtained. We empirically show that the small perturbations lead to better generalization performance than vanilla (stochastic) gradient descent training, with minor adjustments to the training procedure.
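The idea of injecting noise before computing a gradient step can be sketched in a few lines. The snippet below is only an illustrative sketch, not the authors' implementation: the function name `perturbed_gd`, the toy objectives, and all hyperparameter values are assumptions made for this example. It evaluates the gradient at a randomly perturbed point, `w + sigma * eps`, and then checks by Monte Carlo how such a perturbation adds an explicit penalty to the expected loss of a toy scalar product model `u * v`.

```python
import numpy as np

def perturbed_gd(grad_fn, w0, lr=0.1, sigma=0.05, steps=500, seed=0):
    """Gradient descent with the gradient evaluated at a noisy point.

    Update rule (assumed form): w <- w - lr * grad_fn(w + sigma * eps),
    with eps ~ N(0, I) drawn afresh at every step.
    """
    rng = np.random.default_rng(seed)
    w = np.array(w0, dtype=float)
    for _ in range(steps):
        eps = rng.standard_normal(w.shape)      # fresh Gaussian noise
        w = w - lr * grad_fn(w + sigma * eps)   # gradient at the perturbed point
    return w

# Hypothetical toy objective: L(w) = 0.5 * ||w - target||^2, gradient w - target.
# For a quadratic like this the noise averages out, so no regularization appears.
target = np.array([1.0, -2.0])
grad_fn = lambda w: w - target

w_plain = perturbed_gd(grad_fn, np.zeros(2), sigma=0.0)   # ordinary gradient descent
w_noisy = perturbed_gd(grad_fn, np.zeros(2), sigma=0.05)  # noise injected before each step

# Toy product model (an assumption for illustration): loss (u * v - y)^2.
# Perturbing u and v independently gives, in expectation,
#   (u*v - y)^2 + sigma^2 * (u^2 + v^2) + sigma^4,
# i.e. the original loss plus an explicit penalty on the parameters.
rng = np.random.default_rng(1)
u, v, y, sigma = 1.0, 2.0, 1.0, 0.1
e1, e2 = rng.standard_normal((2, 200_000))
mc = np.mean(((u + sigma * e1) * (v + sigma * e2) - y) ** 2)    # Monte Carlo estimate
closed = (u * v - y) ** 2 + sigma**2 * (u**2 + v**2) + sigma**4  # closed form
print(w_plain, w_noisy, mc, closed)
```

In the toy product model, minimizing `u**2 + v**2` subject to a fixed product `u * v = w` gives `2 * abs(w)`, which hints at why penalties of the ℓ1 type can emerge from small parameter perturbations. The layer-wise scheme mentioned in the abstract is not reproduced here.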


