Optimal Dynamic Regret in Exp-Concave Online Learning - 专知 (Zhuanzhi) Papers

April 23, 2021

Optimal Dynamic Regret in Exp-Concave Online Learning

Dheeraj Baby, Yu-Xiang Wang

We consider the problem of Zinkevich (2003)-style dynamic regret minimization in online learning with exp-concave losses. We show that whenever improper learning is allowed, a Strongly Adaptive online learner achieves a dynamic regret of $\tilde O(d^{3.5}n^{1/3}C_n^{2/3} \vee d\log n)$, where $C_n$ is the total variation (a.k.a. path length) of an arbitrary sequence of comparators that may not be known to the learner ahead of time. Achieving this rate was highly nontrivial even for squared losses in 1D, where the best known upper bound was $O(\sqrt{nC_n} \vee \log n)$ (Yuan and Lamperski, 2019). Our new proof techniques make elegant use of the intricate structures of the primal and dual variables imposed by the KKT conditions and could be of independent interest. Finally, we apply our results to the classical statistical problem of locally adaptive non-parametric regression (Mammen, 1991; Donoho and Johnstone, 1998) and obtain a stronger and more flexible algorithm that does not require any statistical assumptions or any hyperparameter tuning.
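The quantities named in the abstract can be made concrete with a small sketch. It is not the authors' algorithm: it only computes the path length $C_n$ of a comparator sequence, the dynamic regret of a given prediction sequence under squared loss, and the two rate expressions with constants and logarithmic factors dropped. The 1D setting and the function names `path_length`, `dynamic_regret`, `rate_new`, and `rate_prior_1d` are assumptions made for illustration.

```python
import math


def path_length(comparators):
    """Total variation C_n = sum_t |w_t - w_{t-1}| of a 1D comparator sequence."""
    return sum(abs(b - a) for a, b in zip(comparators, comparators[1:]))


def dynamic_regret(predictions, comparators, targets):
    """Learner's cumulative squared loss minus that of the comparator sequence."""
    learner = sum((p - y) ** 2 for p, y in zip(predictions, targets))
    reference = sum((w - y) ** 2 for w, y in zip(comparators, targets))
    return learner - reference


def rate_new(n, c_n, d=1):
    """d^3.5 * n^(1/3) * C_n^(2/3) v d*log(n); constants and log factors omitted."""
    return max(d ** 3.5 * n ** (1 / 3) * c_n ** (2 / 3), d * math.log(n))


def rate_prior_1d(n, c_n):
    """sqrt(n * C_n) v log(n), the earlier 1D bound; constants omitted."""
    return max(math.sqrt(n * c_n), math.log(n))


# Hypothetical example: a comparator that jumps once from 0 to 1 over n = 1000 rounds.
n = 1000
comparators = [0.0] * 500 + [1.0] * 500
c_n = path_length(comparators)

print("C_n =", c_n)
print("new rate   ~", rate_new(n, c_n))
print("prior rate ~", rate_prior_1d(n, c_n))
```

For this single-jump sequence the $n^{1/3}C_n^{2/3}$ expression is smaller than $\sqrt{nC_n}$, which matches the direction of the improvement the abstract claims. The numbers are only orders of growth, not actual regret values.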
