The price of unfairness in linear bandits with biased feedback - 专知 (Zhuanzhi) Papers

Bandits · Biased · Linear · Linear combination · Unbiased

June 3, 2022

The price of unfairness in linear bandits with biased feedback


Solenne Gaucher,Alexandra Carpentier,Christophe Giraud

In this paper, we study the problem of fair sequential decision making with biased linear bandit feedback. At each round, a player selects an action described by a covariate and by a sensitive attribute. The perceived reward is a linear combination of the covariates of the chosen action, but the player only observes a biased evaluation of this reward, which depends on the sensitive attribute. To characterize the difficulty of this problem, we design a phased elimination algorithm that corrects the unfair evaluations, and establish upper bounds on its regret. We show that the worst-case regret is at most of order $\mathcal{O}(\kappa_*^{1/3}\log(T)^{1/3}T^{2/3})$, where $\kappa_*$ is an explicit geometrical constant characterizing the difficulty of bias estimation. We prove lower bounds on the worst-case regret for some sets of actions, showing that this rate is tight up to a possible sub-logarithmic factor. We also derive gap-dependent upper bounds on the regret, and matching lower bounds for some problem instances. Interestingly, these results reveal a transition between a regime where the problem is as difficult as its unbiased counterpart, and a regime where it can be much harder.
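The feedback model described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm. It assumes, beyond what the abstract states, that the bias is additive and equal to `z * omega`, where `z` in {-1, +1} encodes the sensitive attribute. The names `gamma`, `omega`, `true_reward`, `biased_observation`, and `worst_case_rate` are hypothetical and chosen only for illustration. The last function evaluates the order of the quoted worst-case bound with constants omitted.

```python
import math


def true_reward(x, gamma):
    """Unbiased linear reward <x, gamma> for a covariate vector x."""
    return sum(xi * gi for xi, gi in zip(x, gamma))


def biased_observation(x, z, gamma, omega, noise=0.0):
    """Evaluation seen by the player: true reward plus a group-dependent shift.

    Assumption (not stated in the abstract): the bias is additive and equals
    z * omega, with z in {-1, +1} encoding the sensitive attribute.
    """
    return true_reward(x, gamma) + z * omega + noise


def worst_case_rate(kappa, T):
    """Order of the worst-case regret bound from the abstract, constants omitted."""
    return kappa ** (1 / 3) * math.log(T) ** (1 / 3) * T ** (2 / 3)


gamma = [1.0, -0.5]  # hypothetical reward parameter
omega = 0.3          # hypothetical bias magnitude
x = [2.0, 1.0]       # hypothetical covariate vector

# Noise-free evaluations of the same covariates in the two groups.
y_plus = biased_observation(x, +1, gamma, omega)
y_minus = biased_observation(x, -1, gamma, omega)

# Under the additive assumption, half the gap between the two evaluations
# recovers the bias, and subtracting it recovers the true reward.
omega_hat = (y_plus - y_minus) / 2
corrected = y_plus - omega_hat
```

In this toy setting the bias is identified from a single pair of actions that share covariates but differ in the sensitive attribute. When no such pair exists, estimating the bias is harder, which is the kind of difficulty the constant $\kappa_*$ is said to capture.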

