根据政策约束对最佳动态待遇分配规则进行估计 (Estimation of Optimal Dynamic Treatment Assignment Rules under Policy Constraint) - 专知论文

会员服务 ·

0

估计/估计量 · 优化器 · 反向归纳 · 约束 · 统计量 ·

2021 年 6 月 9 日

Estimation of Optimal Dynamic Treatment Assignment Rules under Policy Constraint

翻译：根据政策约束对最佳动态待遇分配规则进行估计

Shosei Sakaguchi

This paper studies statistical decisions for dynamic treatment assignment problems. Many policies involve dynamics in their treatment assignments where treatments are sequentially assigned to individuals across multiple stages and the effect of treatment at each stage is usually heterogeneous with respect to the prior treatments, past outcomes, and observed covariates. We consider estimating an optimal dynamic treatment rule that guides the optimal treatment assignment for each individual at each stage based on the individual's history. This paper proposes an empirical welfare maximization approach in a dynamic framework. The approach estimates the optimal dynamic treatment rule from panel data taken from an experimental or quasi-experimental study. The paper proposes two estimation methods: one solves the treatment assignment problem at each stage through backward induction, and the other solves the whole dynamic treatment assignment problem simultaneously across all stages. We derive finite-sample upper bounds on the worst-case average welfare-regrets for the proposed methods and show $n^{-1/2}$-minimax convergence rates. We also modify the simultaneous estimation method to incorporate intertemporal budget/capacity constraints.

翻译：本文研究动态治疗分配问题的统计决定。许多政策涉及治疗任务动态,即治疗按顺序分配给不同阶段的个人,每个阶段的治疗效果通常与先前的治疗、过去的结果和观察到的共差情况不同。我们考虑估计一个最佳的动态治疗规则,根据个人的历史指导每个阶段对每个人的最佳治疗分配。本文件提议在动态框架内采用实证福利最大化办法。本方法从实验或准实验研究的小组数据中估计最佳动态治疗规则。本文提出两种估算方法:一种是通过后向感应解决每个阶段的治疗分配问题,另一种是通过所有阶段同时解决整个动态治疗分配问题。我们从最坏情况中得出拟议方法的福利平均平均比例的有限抽样,并显示 $ ⁇ -1/2 $-minimmax 的趋同率。我们还从实验或准实验研究中得出的小组数据中估算了最佳动态治疗规则。我们还修改了同时估算方法,以纳入时际预算/能力限制。

0

相关内容

估计/估计量

估计/估计量

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

【贝叶斯规则因果推理】《Causal Inference with Bayes Rule》by Finn Lattimore, David Rohde

【贝叶斯规则因果推理】《Causal Inference with Bayes Rule》by Finn Lattimore, David Rohde

专知会员服务

47+阅读 · 2019年12月13日

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

专知会员服务

6+阅读 · 2019年12月1日

【变分推断课件】Lectures on Variational Inference：Statistical Analysis of Variational Approximations（附带pdf）

【变分推断课件】Lectures on Variational Inference：Statistical Analysis of Variational Approximations（附带pdf）

专知会员服务

16+阅读 · 2019年11月30日

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

专知会员服务

35+阅读 · 2019年11月30日

【CoRL2019最佳论文】模仿学习，A Divergence Minimization Perspective on Imitation Learning Methods

【CoRL2019最佳论文】模仿学习，A Divergence Minimization Perspective on Imitation Learning Methods

专知会员服务

24+阅读 · 2019年11月11日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

专知

19+阅读 · 2018年6月26日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Analysis of an Incomplete Binary Outcome Dichotomized From an Underlying Continuous Variable in Clinical Trials

Arxiv

0+阅读 · 2021年8月4日

Optimizing transmission conditions for multiple subdomains in the Magnetotelluric Approximation of Maxwell's equations

Arxiv

0+阅读 · 2021年8月4日

Doubly Robust Estimation with Machine Learning Predictions

Arxiv

0+阅读 · 2021年8月3日

Adaptive estimation for small diffusion processes based on sampled data

Arxiv

0+阅读 · 2021年8月3日

Optimal Covariate Balancing Conditions in Propensity Score Estimation

Arxiv

0+阅读 · 2021年8月3日

A new blocks estimator for the extremal index

Arxiv

0+阅读 · 2021年8月2日

Estimation and visualization of treatment effects for multiple outcomes

Arxiv

0+阅读 · 2021年7月31日

To adjust or not to adjust? Estimating the average treatment effect in randomized experiments with missing covariates

Arxiv

0+阅读 · 2021年7月31日

Design and Analysis of Bipartite Experiments under a Linear Exposure-Response Model

Arxiv

0+阅读 · 2021年7月30日

Semiparametric Estimation of Long-Term Treatment Effects

Arxiv

0+阅读 · 2021年7月30日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

【贝叶斯规则因果推理】《Causal Inference with Bayes Rule》by Finn Lattimore, David Rohde

【贝叶斯规则因果推理】《Causal Inference with Bayes Rule》by Finn Lattimore, David Rohde

专知会员服务

47+阅读 · 2019年12月13日

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

专知会员服务

6+阅读 · 2019年12月1日

【变分推断课件】Lectures on Variational Inference：Statistical Analysis of Variational Approximations（附带pdf）

【变分推断课件】Lectures on Variational Inference：Statistical Analysis of Variational Approximations（附带pdf）

专知会员服务

16+阅读 · 2019年11月30日

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

【变分推断课件】Lectures on Variational Inference： Approximate Bayesian Inference in Machine Learning（附带pdf）

专知会员服务

35+阅读 · 2019年11月30日

【CoRL2019最佳论文】模仿学习，A Divergence Minimization Perspective on Imitation Learning Methods

【CoRL2019最佳论文】模仿学习，A Divergence Minimization Perspective on Imitation Learning Methods

专知会员服务

24+阅读 · 2019年11月11日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

2025全球人工智能展望报告：通向AGI之路，76页ppt

LLM/智能体作为数据分析师：综述

【NTU博士论文】多模态神经三维资产合成

【NeurIPS2025】一种基于结构信息原理的离线分层扩散框架

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

【论文推荐】最新六篇主题模型相关论文—领域特定知识库、神经变分推断、动态和静态主题模型

专知

19+阅读 · 2018年6月26日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Analysis of an Incomplete Binary Outcome Dichotomized From an Underlying Continuous Variable in Clinical Trials

Arxiv

0+阅读 · 2021年8月4日

Optimizing transmission conditions for multiple subdomains in the Magnetotelluric Approximation of Maxwell's equations

Arxiv

0+阅读 · 2021年8月4日

Doubly Robust Estimation with Machine Learning Predictions

Arxiv

0+阅读 · 2021年8月3日

Adaptive estimation for small diffusion processes based on sampled data

Arxiv

0+阅读 · 2021年8月3日

Optimal Covariate Balancing Conditions in Propensity Score Estimation

Arxiv

0+阅读 · 2021年8月3日

A new blocks estimator for the extremal index

Arxiv

0+阅读 · 2021年8月2日

Estimation and visualization of treatment effects for multiple outcomes

Arxiv

0+阅读 · 2021年7月31日

To adjust or not to adjust? Estimating the average treatment effect in randomized experiments with missing covariates

Arxiv

0+阅读 · 2021年7月31日

Design and Analysis of Bipartite Experiments under a Linear Exposure-Response Model

Arxiv

0+阅读 · 2021年7月30日

Semiparametric Estimation of Long-Term Treatment Effects

Arxiv

0+阅读 · 2021年7月30日

微信扫码咨询专知VIP会员