Oracle Lower Bounds for Stochastic Gradient Sampling Algorithms - 专知 (Zhuanzhi) Papers


INFORMS · Bayes risk · Sample · Markov chain Monte Carlo · Oracle

July 3, 2021

Oracle Lower Bounds for Stochastic Gradient Sampling Algorithms


Niladri S. Chatterji, Peter L. Bartlett, Philip M. Long

from arxiv, 21 pages; accepted for publication at Bernoulli

We consider the problem of sampling from a strongly log-concave density in $\mathbb{R}^d$, and prove an information theoretic lower bound on the number of stochastic gradient queries of the log density needed. Several popular sampling algorithms (including many Markov chain Monte Carlo methods) operate by using stochastic gradients of the log density to generate a sample; our results establish an information theoretic limit for all these algorithms. We show that for every algorithm, there exists a well-conditioned strongly log-concave target density for which the distribution of points generated by the algorithm would be at least $\varepsilon$ away from the target in total variation distance if the number of gradient queries is less than $\Omega(\sigma^2 d/\varepsilon^2)$, where $\sigma^2 d$ is the variance of the stochastic gradient. Our lower bound follows by combining the ideas of Le Cam deficiency routinely used in the comparison of statistical experiments along with standard information theoretic tools used in lower bounding Bayes risk functions. To the best of our knowledge our results provide the first nontrivial dimension-dependent lower bound for this problem.
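To make the query model concrete, here is a minimal sketch of a stochastic gradient oracle and one sampler that uses it. Everything in it is an illustrative assumption rather than the paper's construction: the target is a standard Gaussian (potential $f(x) = \|x\|^2/2$, the negative log density up to a constant), the oracle adds independent Gaussian noise of variance $\sigma^2$ per coordinate so the total noise variance is $\sigma^2 d$, and the sampler is plain stochastic gradient Langevin dynamics. The names `NoisyGradientOracle`, `sgld_sample`, and `query_lower_bound_scale` are hypothetical. The last helper only evaluates the scaling $\sigma^2 d/\varepsilon^2$ from the abstract, with the unspecified constant omitted.

```python
import math
import random


class NoisyGradientOracle:
    """Stochastic gradient oracle for the potential f(x) = ||x||^2 / 2.

    The target density is proportional to exp(-f), a standard Gaussian,
    chosen here only as a simple strongly log-concave example. Each query
    returns the exact gradient x plus independent N(0, sigma^2) noise per
    coordinate, so the total noise variance is sigma^2 * d.
    """

    def __init__(self, sigma, rng):
        self.sigma = sigma
        self.rng = rng
        self.num_queries = 0  # number of oracle calls made so far

    def query(self, x):
        self.num_queries += 1
        return [xi + self.rng.gauss(0.0, self.sigma) for xi in x]


def sgld_sample(oracle, dim, num_steps, step_size, rng):
    """Run stochastic gradient Langevin dynamics; return the final iterate."""
    x = [0.0] * dim
    noise_scale = math.sqrt(2.0 * step_size)
    for _ in range(num_steps):
        grad = oracle.query(x)  # exactly one oracle query per step
        x = [xi - step_size * gi + noise_scale * rng.gauss(0.0, 1.0)
             for xi, gi in zip(x, grad)]
    return x


def query_lower_bound_scale(sigma, dim, eps):
    """Return sigma^2 * d / eps^2, the scaling of the bound (constant omitted)."""
    return sigma ** 2 * dim / eps ** 2


rng = random.Random(0)
oracle = NoisyGradientOracle(sigma=1.0, rng=rng)
sample = sgld_sample(oracle, dim=5, num_steps=200, step_size=0.05, rng=rng)
print(len(sample), oracle.num_queries)
print(query_lower_bound_scale(sigma=1.0, dim=5, eps=0.1))
```

The counter `num_queries` is the quantity the lower bound constrains: the result in the abstract says that, for a suitably chosen hard target, any algorithm in this query model needs on the order of $\sigma^2 d/\varepsilon^2$ such calls to get within $\varepsilon$ in total variation distance.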



