神经网络规范化干扰神经网络目标值 (Disturbing Target Values for Neural Network Regularization) - 专知论文

会员服务 ·

0

正则化项 · 置信度 · 暂退法 · 标注 · Neural Networks ·

2021 年 10 月 11 日

Disturbing Target Values for Neural Network Regularization

翻译：神经网络规范化干扰神经网络目标值

Yongho Kim,Hanna Lukashonak,Paweena Tarepakdee,Klavdia Zavalich,Mofassir ul Islam Arif

from arxiv, 10 pages,2 figures, 6 tables

Diverse regularization techniques have been developed such as L2 regularization, Dropout, DisturbLabel (DL) to prevent overfitting. DL, a newcomer on the scene, regularizes the loss layer by flipping a small share of the target labels at random and training the neural network on this distorted data so as to not learn the training data. It is observed that high confidence labels during training cause the overfitting problem and DL selects disturb labels at random regardless of the confidence of labels. To solve this shortcoming of DL, we propose Directional DisturbLabel (DDL) a novel regularization technique that makes use of the class probabilities to infer the confident labels and using these labels to regularize the model. This active regularization makes use of the model behavior during training to regularize it in a more directed manner. To address regression problems, we also propose DisturbValue (DV), and DisturbError (DE). DE uses only predefined confident labels to disturb target values. DV injects noise into a portion of target values at random similar to DL. In this paper, 6 and 8 datasets are used to validate the robustness of our methods in classification and regression tasks respectively. Finally, we demonstrate that our methods are either comparable to or outperform DisturbLabel, L2 regularization, and Dropout. Also, we achieve the best performance in more than half the datasets by combining our methods with either L2 regularization or Dropout.

翻译：已经开发了多种正规化技术, 如 L2 正规化、辍学、干扰拉贝尔 (DL) 等, 以防止过度配置。 DL, 现场的新来者 DL, 通过随机翻转一小部分目标标签来规范损失层, 并训练神经网络使用这种扭曲的数据, 从而避免学习培训数据。发现培训期间的高度信任标签导致过度配置问题, DL 随机选择标签。为了解决 DL 的缺陷, 我们建议 DL 退出 DDL (DDL) 是一种新型的正规化技术, 利用类的退化概率来推断信心标签, 并使用这些标签来规范模式。此项积极的正规化在培训期间使用模型行为来更直接地规范数据。为了解决回归问题, 我们还提议 DuturbValue (DVV) 和 DusturbError (DE) 使用比我们定义的可靠标签来干扰目标值。 DV 将噪音注入目标值的一部分, 与 DL 相似的随机性定义值相似。。在本文中, 使用和 LBregil 的半格式的校验校验中, 和 L 数据分别使用。

0

相关内容

正则化项

【NeurIPS2021】利用判别掩蔽(DAM)学习神经网络的紧致表示

专知会员服务

16+阅读 · 2021年10月11日

【CVPR2021】端到端的全卷积目标检测器

专知会员服务

30+阅读 · 2021年4月5日

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

专知会员服务

26+阅读 · 2020年3月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

专知会员服务

28+阅读 · 2020年3月11日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

【WWW2020-华为诺亚方舟论文】元学习推荐系统MetaSelector

【WWW2020-华为诺亚方舟论文】元学习推荐系统MetaSelector

专知会员服务

56+阅读 · 2020年2月10日

一网打尽！100+深度学习模型TensorFlow与Pytorch代码实现集合

一网打尽！100+深度学习模型TensorFlow与Pytorch代码实现集合

专知会员服务

142+阅读 · 2020年1月3日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

一网打尽！深度学习100+经典模型TensorFlow与Pytorch代码实现大集合

一网打尽！深度学习100+经典模型TensorFlow与Pytorch代码实现大集合

专知

32+阅读 · 2020年1月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Soft Calibration Objectives for Neural Networks

Arxiv

0+阅读 · 2021年12月7日

Deep Stable Learning for Out-Of-Distribution Generalization

Arxiv

12+阅读 · 2021年4月16日

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

Arxiv

23+阅读 · 2021年3月3日

Semi-supervised Medical Image Segmentation through Dual-task Consistency

Arxiv

14+阅读 · 2020年9月9日

On Attribution of Recurrent Neural Network Predictions via Additive Decomposition

Arxiv

3+阅读 · 2019年3月27日

Boundary-weighted Domain Adaptive Neural Network for Prostate MR Image Segmentation

Arxiv

9+阅读 · 2019年2月21日

Psi-Net: Shape and boundary aware joint multi-task deep network for medical image segmentation

Psi-Net: Shape and boundary aware joint multi-task deep network for medical image segmentation

Arxiv

8+阅读 · 2019年2月11日

CHIP: Channel-wise Disentangled Interpretation of Deep Convolutional Neural Networks

CHIP: Channel-wise Disentangled Interpretation of Deep Convolutional Neural Networks

Arxiv

5+阅读 · 2019年2月7日

Generative Dual Adversarial Network for Generalized Zero-shot Learning

Arxiv

7+阅读 · 2018年11月12日

Adversarial Reprogramming of Neural Networks

Adversarial Reprogramming of Neural Networks

Arxiv

3+阅读 · 2018年6月28日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

【NeurIPS2021】利用判别掩蔽(DAM)学习神经网络的紧致表示

专知会员服务

16+阅读 · 2021年10月11日

【CVPR2021】端到端的全卷积目标检测器

专知会员服务

30+阅读 · 2021年4月5日

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

专知会员服务

26+阅读 · 2020年3月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

【MIT】对抗鲁棒性的流形正则化，Manifold Regularization for Adversarial Robustness

专知会员服务

28+阅读 · 2020年3月11日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

【WWW2020-华为诺亚方舟论文】元学习推荐系统MetaSelector

【WWW2020-华为诺亚方舟论文】元学习推荐系统MetaSelector

专知会员服务

56+阅读 · 2020年2月10日

一网打尽！100+深度学习模型TensorFlow与Pytorch代码实现集合

一网打尽！100+深度学习模型TensorFlow与Pytorch代码实现集合

专知会员服务

142+阅读 · 2020年1月3日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《使用量化测量将传感器节点关联到融合中心的算法设计》171页

军事前沿模型

提升军事训练能力的最佳人工智能模拟工具

《社交媒体信息作战》最新48页技术报告

相关资讯

一网打尽！深度学习100+经典模型TensorFlow与Pytorch代码实现大集合

一网打尽！深度学习100+经典模型TensorFlow与Pytorch代码实现大集合

专知

32+阅读 · 2020年1月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Soft Calibration Objectives for Neural Networks

Arxiv

0+阅读 · 2021年12月7日

Deep Stable Learning for Out-Of-Distribution Generalization

Arxiv

12+阅读 · 2021年4月16日

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning

Arxiv

23+阅读 · 2021年3月3日

Semi-supervised Medical Image Segmentation through Dual-task Consistency

Arxiv

14+阅读 · 2020年9月9日

On Attribution of Recurrent Neural Network Predictions via Additive Decomposition

Arxiv

3+阅读 · 2019年3月27日

Boundary-weighted Domain Adaptive Neural Network for Prostate MR Image Segmentation

Arxiv

9+阅读 · 2019年2月21日

Psi-Net: Shape and boundary aware joint multi-task deep network for medical image segmentation

Psi-Net: Shape and boundary aware joint multi-task deep network for medical image segmentation

Arxiv

8+阅读 · 2019年2月11日

CHIP: Channel-wise Disentangled Interpretation of Deep Convolutional Neural Networks

CHIP: Channel-wise Disentangled Interpretation of Deep Convolutional Neural Networks

Arxiv

5+阅读 · 2019年2月7日

Generative Dual Adversarial Network for Generalized Zero-shot Learning

Arxiv

7+阅读 · 2018年11月12日

Adversarial Reprogramming of Neural Networks

Adversarial Reprogramming of Neural Networks

Arxiv

3+阅读 · 2018年6月28日

微信扫码咨询专知VIP会员