Contextual bandits are a widely used technique in applications such as personalization, recommendation systems, mobile health, and causal marketing. As a dynamic approach, they can be more efficient than standard A/B testing at minimizing regret. We propose an end-to-end automated meta-learning pipeline to approximate the optimal Q-function for contextual bandit problems. Our model performs substantially better than random exploration: it is more regret-efficient and converges with a limited number of samples, while remaining general and easy to use thanks to the meta-learning approach. We use a linearly annealed ε-greedy exploration policy to define the exploration-versus-exploitation schedule. We test the system on a synthetic environment to characterize it fully, and we evaluate it on several open-source datasets to benchmark against prior work. Our model outperforms or performs comparably to other models while requiring neither tuning nor feature engineering.
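For concreteness, a minimal sketch of the linearly annealed ε-greedy exploration policy mentioned above follows. The schedule parameters (eps_start, eps_end, anneal_steps) and the per-arm Q-value estimates are illustrative assumptions, not values taken from this work.

```python
import numpy as np

def epsilon_schedule(step, eps_start=1.0, eps_end=0.05, anneal_steps=10_000):
    """Linearly anneal epsilon from eps_start to eps_end over anneal_steps."""
    frac = min(step / anneal_steps, 1.0)
    return eps_start + frac * (eps_end - eps_start)

def select_action(q_values, step, rng):
    """Epsilon-greedy choice over per-arm Q estimates for the current context."""
    eps = epsilon_schedule(step)
    if rng.random() < eps:
        return int(rng.integers(len(q_values)))  # explore: uniformly random arm
    return int(np.argmax(q_values))              # exploit: greedy arm

# Usage: q_values would come from the learned Q-function evaluated on a context.
rng = np.random.default_rng(0)
q_values = np.array([0.2, 0.8, 0.5])  # hypothetical Q(context, arm) estimates
action = select_action(q_values, step=500, rng=rng)
```

Early in training epsilon stays near eps_start, favoring exploration; as the step count grows it decays linearly toward eps_end, shifting the policy toward exploiting the learned Q-function.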