The central objective function of a variational autoencoder (VAE) is its variational lower bound (the ELBO). Here we show that for standard (i.e., Gaussian) VAEs the ELBO converges to a value given by the sum of three entropies: the (negative) entropy of the prior distribution, the expected (negative) entropy of the observable distribution, and the average entropy of the variational distributions (the latter is already part of the ELBO). Our derived analytical results are exact and apply to small as well as to intricate deep networks for encoder and decoder. Furthermore, they apply to finitely as well as infinitely many data points and to any stationary point (including local maxima and saddle points). The result implies that, for standard VAEs, the ELBO can often be computed in closed form at stationary points, while the original ELBO requires numerical approximation of integrals. As our main contribution, we provide the proof that the ELBO of VAEs is, at stationary points, equal to a sum of entropies. Numerical experiments then show that the obtained analytical results are sufficiently precise in the vicinities of stationary points that are reached in practice. Furthermore, we discuss how the novel entropy form of the ELBO can be used to analyze and understand learning behavior. More generally, we believe that our contributions can be useful for future theoretical and practical studies of VAE learning, as they provide novel information about the points in parameter space that VAE optimization converges to.
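To make the entropy-sum form concrete, the identity at stationary points can be written as follows (a sketch under standard Gaussian-VAE assumptions; the symbols \(\Phi\) and \(\theta\) for encoder and decoder parameters, \(x^{(n)}\) for the \(N\) data points, and \(z\) for the latents are illustrative notation, not fixed by the abstract):

\[
\mathcal{L}(\Phi,\theta)
= \frac{1}{N}\sum_{n=1}^{N} \mathcal{H}\!\left[q_\Phi\!\left(z \mid x^{(n)}\right)\right]
- \mathcal{H}\!\left[p_\theta(z)\right]
- \frac{1}{N}\sum_{n=1}^{N} \mathbb{E}_{q_\Phi(z \mid x^{(n)})}\!\left[\mathcal{H}\!\left[p_\theta(x \mid z)\right]\right].
\]

For a standard Gaussian VAE each term is available in closed form: for instance, a prior \(p_\theta(z) = \mathcal{N}(0, I_H)\) has entropy \(\frac{H}{2}\log(2\pi e)\), and a decoder \(p_\theta(x \mid z) = \mathcal{N}(\mu_\theta(z), \sigma^2 I_D)\) has entropy \(\frac{D}{2}\log(2\pi e \sigma^2)\) independently of \(z\), which is what makes the closed-form evaluation at stationary points possible.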