PAC-Bayes Information Bottleneck - 专知 (Zhuanzhi) Papers


October 4, 2021

PAC-Bayes Information Bottleneck


Zifeng Wang, Shao-Lun Huang, Ercan E. Kuruoglu, Jimeng Sun, Xi Chen, Yefeng Zheng

The information bottleneck (IB) describes a trade-off between the accuracy and conciseness of encoded representations. IB has succeeded in explaining the objective and behavior of neural networks (NNs) as well as in learning better representations. However, there are still criticisms of the universality of IB, e.g., the phase transition usually fades away, representation compression is not causally related to generalization, and IB is trivial in deterministic cases. In this work, we build a new IB based on the trade-off between the accuracy and complexity of the learned weights of NNs. We argue that this new IB represents a more solid connection to the objective of NNs, since the information stored in weights (IIW) bounds their PAC-Bayes generalization capability; hence we name it PAC-Bayes IB (PIB). With IIW, we can identify the phase transition phenomenon in general cases and solidify the causality between compression and generalization. We then derive a tractable solution of PIB and design a stochastic inference algorithm based on Markov chain Monte Carlo sampling. We empirically verify our claims through extensive experiments. We also substantiate the superiority of the proposed algorithm for training NNs.
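The two trade-offs contrasted in the abstract can be sketched in symbols. This is an illustrative reading, not the paper's exact formulation. The notation is assumed here and does not appear on this page: T is the representation, S the training sample of size n, w the weights, β the trade-off coefficient, and σ a sub-Gaussian constant of the loss. The first line is the classical IB Lagrangian. The second is a schematic weight-based analogue, in which I(w;S) plays the role of the information stored in weights. The third is a known information-theoretic generalization bound (Xu and Raginsky, 2017), included only to show how such a quantity can control the generalization gap. The paper's own PAC-Bayes bound may differ in form.

```latex
% Classical IB: compress X into T while preserving information about Y.
\min_{p(t \mid x)} \; I(X;T) - \beta\, I(T;Y)

% Schematic weight-based trade-off (assumed form): fit the sample S
% while limiting the information the weights w store about S.
\min_{p(w \mid S)} \; \mathbb{E}_{p(w \mid S)}\big[\hat{L}_S(w)\big] + \beta\, I(w;S)

% Known bound for a \sigma-sub-Gaussian loss (Xu and Raginsky, 2017):
\Big|\mathbb{E}\big[L(w) - \hat{L}_S(w)\big]\Big| \le \sqrt{\frac{2\sigma^2}{n}\, I(w;S)}
```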

