Discocrete 后端变量模型的适应性周期性扰动- 基于梯度的梯度估计 (Adaptive Perturbation-Based Gradient Estimation for Discrete Latent Variable Models) - 专知论文

会员服务 ·

0

估计/估计量 · 离散化 · 潜变量/隐变量 · 有限差分 · Learning ·

2023 年 2 月 5 日

Adaptive Perturbation-Based Gradient Estimation for Discrete Latent Variable Models

翻译：Discocrete 后端变量模型的适应性周期性扰动- 基于梯度的梯度估计

Pasquale Minervini,Luca Franceschi,Mathias Niepert

from arxiv, Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023)

The integration of discrete algorithmic components in deep learning architectures has numerous applications. Recently, Implicit Maximum Likelihood Estimation (IMLE, Niepert, Minervini, and Franceschi 2021), a class of gradient estimators for discrete exponential family distributions, was proposed by combining implicit differentiation through perturbation with the path-wise gradient estimator. However, due to the finite difference approximation of the gradients, it is especially sensitive to the choice of the finite difference step size, which needs to be specified by the user. In this work, we present Adaptive IMLE (AIMLE), the first adaptive gradient estimator for complex discrete distributions: it adaptively identifies the target distribution for IMLE by trading off the density of gradient information with the degree of bias in the gradient estimates. We empirically evaluate our estimator on synthetic examples, as well as on Learning to Explain, Discrete Variational Auto-Encoders, and Neural Relational Inference tasks. In our experiments, we show that our adaptive gradient estimator can produce faithful estimates while requiring orders of magnitude fewer samples than other gradient estimators.

翻译：将离散的算法组成部分整合到深层学习结构中有许多应用。最近,通过将离散指数型家庭分布的梯度估计器(IMLE、Niepert、Minervini和Franceschi 2021)这一类梯度估计器,提出了将隐含的差别与路径偏差梯度估计器相结合的建议。然而,由于梯度的有限差差近近值,它特别敏感于用户需要指定的有限差异一步大小的选择。在这项工作中,我们提出了适应性IMLE(AIMLE),这是用于复杂离散分布的第一个适应性梯度估计器:它通过将梯度信息的密度与梯度估计的偏差程度进行交换,从而适应性地确定了IMLE的目标分布。我们用经验来评估我们关于合成示例的估算器以及学习解释、差异性自动电算器和神经关系推算的估算器,我们在实验中显示,我们的适应性梯度测度定值比其他梯度定值要低的测算器能够产生准确的测算。

0

相关内容

估计/估计量

估计/估计量

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

专知会员服务

40+阅读 · 2022年10月10日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

77+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

罗巴代数的表示和罗巴代数在operad中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

数据分析中的大规模矩阵优化模型求解算法研究

国家自然科学基金

2+阅读 · 2013年12月31日

知识密集型服务外包中的知识共享激励与知识资产争端协调机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于Nb超导隧道结的太赫兹单光子探测器的研究

国家自然科学基金

0+阅读 · 2013年12月31日

15-kDa硒蛋白在内质网应激（ERS）和阿尔茨海默病(AD)中的功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

流化超细颗粒相间作用与颗粒动理学的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于Keap1-Nrf2-ARE信号通路的活性先导化合物的发现及作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

生物过滤脱除烟气中NOx的中温好氧反硝化分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

碳纳米管负载的双金属纳米粒子复合材料的制备及应用

国家自然科学基金

0+阅读 · 2012年12月31日

Mather理论与Hamilton系统的不稳定性

国家自然科学基金

0+阅读 · 2008年12月31日

Two-step estimation of latent trait models

Arxiv

0+阅读 · 2023年3月28日

GAS: A Gaussian Mixture Distribution-Based Adaptive Sampling Method for PINNs

Arxiv

0+阅读 · 2023年3月28日

Expert Kaplan--Meier estimation

Arxiv

0+阅读 · 2023年3月27日

FAStEN: an efficient adaptive method for feature selection and estimation in high-dimensional functional regressions

Arxiv

0+阅读 · 2023年3月26日

SPEC: Summary Preference Decomposition for Low-Resource Abstractive Summarization

Arxiv

0+阅读 · 2023年3月24日

PPG-based Heart Rate Estimation with Efficient Sensor Sampling and Learning Models

Arxiv

0+阅读 · 2023年3月23日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Arxiv

19+阅读 · 2021年4月19日

Controllable Multi-Interest Framework for Recommendation

Arxiv

18+阅读 · 2020年8月3日

Domain Adaptive Faster R-CNN for Object Detection in the Wild

Arxiv

10+阅读 · 2018年3月8日

VIP会员

文章信息

相关主题

估计/估计量

潜变量/隐变量

相关VIP内容

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

专知会员服务

40+阅读 · 2022年10月10日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

77+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICML2025】用于持续多模态指令微调的动态课程化LoRA专家混合机制

生成模型中持续学习的综合综述

【斯坦福博士论文】通过以人为本的自然语言界面拓展 AI 的可及性

【新书】《LangChain生成式AI实战：使用 Python 与 LangGraph 构建大语言模型应用与高级智能体》

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Two-step estimation of latent trait models

Arxiv

0+阅读 · 2023年3月28日

GAS: A Gaussian Mixture Distribution-Based Adaptive Sampling Method for PINNs

Arxiv

0+阅读 · 2023年3月28日

Expert Kaplan--Meier estimation

Arxiv

0+阅读 · 2023年3月27日

FAStEN: an efficient adaptive method for feature selection and estimation in high-dimensional functional regressions

Arxiv

0+阅读 · 2023年3月26日

SPEC: Summary Preference Decomposition for Low-Resource Abstractive Summarization

Arxiv

0+阅读 · 2023年3月24日

PPG-based Heart Rate Estimation with Efficient Sensor Sampling and Learning Models

Arxiv

0+阅读 · 2023年3月23日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Arxiv

19+阅读 · 2021年4月19日

Controllable Multi-Interest Framework for Recommendation

Arxiv

18+阅读 · 2020年8月3日

Domain Adaptive Faster R-CNN for Object Detection in the Wild

Arxiv

10+阅读 · 2018年3月8日

相关基金

罗巴代数的表示和罗巴代数在operad中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

数据分析中的大规模矩阵优化模型求解算法研究

国家自然科学基金

2+阅读 · 2013年12月31日

知识密集型服务外包中的知识共享激励与知识资产争端协调机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于Nb超导隧道结的太赫兹单光子探测器的研究

国家自然科学基金

0+阅读 · 2013年12月31日

15-kDa硒蛋白在内质网应激（ERS）和阿尔茨海默病(AD)中的功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

流化超细颗粒相间作用与颗粒动理学的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于Keap1-Nrf2-ARE信号通路的活性先导化合物的发现及作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

生物过滤脱除烟气中NOx的中温好氧反硝化分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

碳纳米管负载的双金属纳米粒子复合材料的制备及应用

国家自然科学基金

0+阅读 · 2012年12月31日

Mather理论与Hamilton系统的不稳定性

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员