深神经网络培训损失水平层的数值探索 (Numerical Exploration of Training Loss Level-Sets in Deep Neural Networks) - 专知论文

会员服务 ·

0

参数空间 · 正则化项 · Neural Networks · 损失 · 可约的 ·

2021 年 4 月 24 日

Numerical Exploration of Training Loss Level-Sets in Deep Neural Networks

翻译：深神经网络培训损失水平层的数值探索

Naveed Tahir,Garrett E. Katz

from arxiv, Code maintained at https://github.com/vanishinggrad/levelsets

We present a computational method for empirically characterizing the training loss level-sets of deep neural networks. Our method numerically constructs a path in parameter space that is constrained to a set with a fixed near-zero training loss. By measuring regularization functions and test loss at different points within this path, we examine how different points in the parameter space with the same fixed training loss compare in terms of generalization ability. We also compare this method for finding regularized points with the more typical method, that uses objective functions which are weighted sums of training loss and regularization terms. We apply dimensionality reduction to the traversed paths in order to visualize the loss level sets in a well-regularized region of parameter space. Our results provide new information about the loss landscape of deep neural networks, as well as a new strategy for reducing test loss.

翻译：我们提出一种计算方法,对深神经网络的培训损失水平进行经验化定性。我们的方法在数字上构建了参数空间的路径,该路径受固定的近零培训损失的限制。通过测量正常化功能和测试路径内不同点的损失,我们研究了参数空间的不同点与相同的固定培训损失在一般化能力方面的比较。我们还比较了这一查找正常化点的方法与更典型的方法,该方法使用客观功能,即培训损失的加权总和和和正规化条件。我们将维度降低到跨轨路径,以便在正常化的参数空间区域对损失水平进行可视化。我们的结果提供了关于深神经网络损失情况的新信息,以及减少测试损失的新战略。

0

相关内容

参数空间

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

专知会员服务

74+阅读 · 2020年7月6日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【斯坦福】凸优化圣经- Convex Optimization （附730pdf下载）

【斯坦福】凸优化圣经- Convex Optimization （附730pdf下载）

专知会员服务

229+阅读 · 2020年6月5日

简明《神经网络数学》手册，16页pdf带你入门，Mathematics of Neural Networks

简明《神经网络数学》手册，16页pdf带你入门，Mathematics of Neural Networks

专知会员服务

68+阅读 · 2020年5月9日

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

专知会员服务

35+阅读 · 2020年4月15日

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

专知会员服务

53+阅读 · 2020年4月8日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

专知会员服务

14+阅读 · 2020年1月1日

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

专知会员服务

66+阅读 · 2019年12月20日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

revelation of MONet

revelation of MONet

CreateAMind

5+阅读 · 2019年6月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【深度】Deep Visualization:可视化并理解CNN

【深度】Deep Visualization:可视化并理解CNN

专知

12+阅读 · 2017年9月30日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Simultaneous Training of Partially Masked Neural Networks

Arxiv

0+阅读 · 2021年6月16日

Smoothness Analysis of Adversarial Training

Arxiv

0+阅读 · 2021年6月15日

Correlated Weights in Infinite Limits of Deep Convolutional Neural Networks

Arxiv

0+阅读 · 2021年6月13日

Globally-Robust Neural Networks

Arxiv

0+阅读 · 2021年6月11日

Infinite-dimensional Folded-in-time Deep Neural Networks

Arxiv

0+阅读 · 2021年6月11日

The Loss Surfaces of Neural Networks with General Activation Functions

Arxiv

0+阅读 · 2021年6月8日

Householder-Absolute Neural Layers For High Variability and Deep Trainability

Arxiv

0+阅读 · 2021年6月8日

Predicting Quantum Potentials by Deep Neural Network and Metropolis Sampling

Arxiv

0+阅读 · 2021年6月6日

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Arxiv

9+阅读 · 2021年2月8日

Cloze-driven Pretraining of Self-attention Networks

Arxiv

6+阅读 · 2019年3月19日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

专知会员服务

74+阅读 · 2020年7月6日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【斯坦福】凸优化圣经- Convex Optimization （附730pdf下载）

【斯坦福】凸优化圣经- Convex Optimization （附730pdf下载）

专知会员服务

229+阅读 · 2020年6月5日

简明《神经网络数学》手册，16页pdf带你入门，Mathematics of Neural Networks

简明《神经网络数学》手册，16页pdf带你入门，Mathematics of Neural Networks

专知会员服务

68+阅读 · 2020年5月9日

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

专知会员服务

35+阅读 · 2020年4月15日

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

专知会员服务

53+阅读 · 2020年4月8日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

专知会员服务

14+阅读 · 2020年1月1日

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

专知会员服务

66+阅读 · 2019年12月20日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

卫星导航技术发展综述

《美军"僚机"联合能力技术演示项目：有人-无人火炮作战》41页报告

美军条令《火力指挥》116页

可解释的人工智能在生物医学图像分析中的应用综述

相关资讯

revelation of MONet

revelation of MONet

CreateAMind

5+阅读 · 2019年6月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【深度】Deep Visualization:可视化并理解CNN

【深度】Deep Visualization:可视化并理解CNN

专知

12+阅读 · 2017年9月30日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Simultaneous Training of Partially Masked Neural Networks

Arxiv

0+阅读 · 2021年6月16日

Smoothness Analysis of Adversarial Training

Arxiv

0+阅读 · 2021年6月15日

Correlated Weights in Infinite Limits of Deep Convolutional Neural Networks

Arxiv

0+阅读 · 2021年6月13日

Globally-Robust Neural Networks

Arxiv

0+阅读 · 2021年6月11日

Infinite-dimensional Folded-in-time Deep Neural Networks

Arxiv

0+阅读 · 2021年6月11日

The Loss Surfaces of Neural Networks with General Activation Functions

Arxiv

0+阅读 · 2021年6月8日

Householder-Absolute Neural Layers For High Variability and Deep Trainability

Arxiv

0+阅读 · 2021年6月8日

Predicting Quantum Potentials by Deep Neural Network and Metropolis Sampling

Arxiv

0+阅读 · 2021年6月6日

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Arxiv

9+阅读 · 2021年2月8日

Cloze-driven Pretraining of Self-attention Networks

Arxiv

6+阅读 · 2019年3月19日

微信扫码咨询专知VIP会员