形状制约对强盗问题的影响 (The Influence of Shape Constraints on the Thresholding Bandit Problem) - 专知论文

会员服务 ·

0

赌博机/老虎机 · 阈值 · CASE · 塑造 · 约束 ·

2021 年 1 月 12 日

The Influence of Shape Constraints on the Thresholding Bandit Problem

翻译：形状制约对强盗问题的影响

James Cheshire,Pierre Menard,Alexandra Carpentier

We investigate the stochastic Thresholding Bandit problem (TBP) under several shape constraints. On top of (i) the vanilla, unstructured TBP, we consider the case where (ii) the sequence of arm's means $(\mu_k)_k$ is monotonically increasing MTBP, (iii) the case where $(\mu_k)_k$ is unimodal UTBP and (iv) the case where $(\mu_k)_k$ is concave CTBP. In the TBP problem the aim is to output, at the end of the sequential game, the set of arms whose means are above a given threshold. The regret is the highest gap between a misclassified arm and the threshold. In the fixed budget setting, we provide problem independent minimax rates for the expected regret in all settings, as well as associated algorithms. We prove that the minimax rates for the regret are (i) $\sqrt{\log(K)K/T}$ for TBP, (ii) $\sqrt{\log(K)/T}$ for MTBP, (iii) $\sqrt{K/T}$ for UTBP and (iv) $\sqrt{\log\log K/T}$ for CTBP, where $K$ is the number of arms and $T$ is the budget. These rates demonstrate that the dependence on $K$ of the minimax regret varies significantly depending on the shape constraint. This highlights the fact that the shape constraints modify fundamentally the nature of the TBP.

翻译：我们调查了几个形状限制下的盗匪问题(TBP)。在(一) 香草、无结构的TBP中,我们考虑的是:(二) 手臂的顺序意味着$(mu_k)_k美元,这是单质增加的MTBP,(三) 美元(mu_k)_k美元是单式UTBP,(四) 美元(mu_k)_k美元是同质的CTBP。在TBP问题中,目标是输出,在连续游戏结束时,一套手段超过给定阈值的武器。遗憾是错分类的手臂和阈值之间的最大差距。在固定的预算设置中,我们为所有环境中的预期遗憾提供独立的微缩税率,以及相关的算法。我们证明,对于TBP, 美元(k) 美元(K) K/TBP 的最小税率是(K) 美元(K) 和美元(KBBP) 美元(K) 和美元(K) 美元(K/T) 的硬度(K) 美元(K) 预算的缩数(K) 美元(K) 和美元/ 美元/美元) 美元(KBBPT) 的硬值是。

0

相关内容

赌博机/老虎机

赌博机/老虎机

Python编程基础，121页ppt

Python编程基础，121页ppt

专知会员服务

49+阅读 · 2021年1月1日

5G边缘计算的价值机遇

5G边缘计算的价值机遇

专知会员服务

67+阅读 · 2020年8月17日

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

专知会员服务

77+阅读 · 2020年6月28日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【2020新书】Python Pro专业实践原则，Practices of the Python Pro，250页pdf

【2020新书】Python Pro专业实践原则，Practices of the Python Pro，250页pdf

专知会员服务

153+阅读 · 2020年1月25日

【Python Tricks新书】The book: A Buffet of Awesome Python Features，299页pdf

【Python Tricks新书】The book: A Buffet of Awesome Python Features，299页pdf

专知会员服务

45+阅读 · 2020年1月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

专知

11+阅读 · 2018年3月29日

已删除

将门创投

5+阅读 · 2017年11月22日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

The pointwise stabilities of piecewise linear finite element method on non-obtuse tetrahedral meshes of nonconvex polyhedra

Arxiv

0+阅读 · 2021年3月9日

On the Oracle Complexity of Higher-Order Smooth Non-Convex Finite-Sum Optimization

Arxiv

0+阅读 · 2021年3月8日

Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems

Arxiv

0+阅读 · 2021年3月8日

A Lower Bound for the Sample Complexity of Inverse Reinforcement Learning

Arxiv

0+阅读 · 2021年3月7日

A Tractable Online Learning Algorithm for the Multinomial Logit Contextual Bandit

Arxiv

0+阅读 · 2021年3月7日

Parameterized Complexity of Stable Roommates with Ties and Incomplete Lists Through the Lens of Graph Parameters

Arxiv

0+阅读 · 2021年3月5日

A parabolic local problem with exponential decay of the resonance error for numerical homogenization

Arxiv

0+阅读 · 2021年3月4日

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Arxiv

17+阅读 · 2019年9月9日

Classification with Fairness Constraints: A Meta-Algorithm with Provable Guarantees

Classification with Fairness Constraints: A Meta-Algorithm with Provable Guarantees

Arxiv

3+阅读 · 2018年8月2日

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Arxiv

7+阅读 · 2018年6月12日

VIP会员

文章信息

相关主题

赌博机/老虎机

相关VIP内容

Python编程基础，121页ppt

Python编程基础，121页ppt

专知会员服务

49+阅读 · 2021年1月1日

5G边缘计算的价值机遇

5G边缘计算的价值机遇

专知会员服务

67+阅读 · 2020年8月17日

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

【ICML2020】拉普拉斯正则化小样本学习，Laplacian Regularized Few-Shot Learning

专知会员服务

77+阅读 · 2020年6月28日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【2020新书】Python Pro专业实践原则，Practices of the Python Pro，250页pdf

【2020新书】Python Pro专业实践原则，Practices of the Python Pro，250页pdf

专知会员服务

153+阅读 · 2020年1月25日

【Python Tricks新书】The book: A Buffet of Awesome Python Features，299页pdf

【Python Tricks新书】The book: A Buffet of Awesome Python Features，299页pdf

专知会员服务

45+阅读 · 2020年1月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

GPT-5如何对齐？从硬性拒绝到安全完成：走向以输出为中心的安全训练

【伯克利博士论文】超越人类监督的视觉智能

【ICCV2025】SO(3) 上连续非保守动力系统的预测

2025年中国数据要素行业发展研究报告

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Call for Participation: Shared Tasks in NLPCC 2019

Call for Participation: Shared Tasks in NLPCC 2019

中国计算机学会

5+阅读 · 2019年3月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

专知

11+阅读 · 2018年3月29日

已删除

将门创投

5+阅读 · 2017年11月22日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

The pointwise stabilities of piecewise linear finite element method on non-obtuse tetrahedral meshes of nonconvex polyhedra

Arxiv

0+阅读 · 2021年3月9日

On the Oracle Complexity of Higher-Order Smooth Non-Convex Finite-Sum Optimization

Arxiv

0+阅读 · 2021年3月8日

Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems

Arxiv

0+阅读 · 2021年3月8日

A Lower Bound for the Sample Complexity of Inverse Reinforcement Learning

Arxiv

0+阅读 · 2021年3月7日

A Tractable Online Learning Algorithm for the Multinomial Logit Contextual Bandit

Arxiv

0+阅读 · 2021年3月7日

Parameterized Complexity of Stable Roommates with Ties and Incomplete Lists Through the Lens of Graph Parameters

Arxiv

0+阅读 · 2021年3月5日

A parabolic local problem with exponential decay of the resonance error for numerical homogenization

Arxiv

0+阅读 · 2021年3月4日

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Arxiv

17+阅读 · 2019年9月9日

Classification with Fairness Constraints: A Meta-Algorithm with Provable Guarantees

Classification with Fairness Constraints: A Meta-Algorithm with Provable Guarantees

Arxiv

3+阅读 · 2018年8月2日

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Arxiv

7+阅读 · 2018年6月12日

微信扫码咨询专知VIP会员