对集群随机调整实验的模型辅助分析 (Model-assisted analyses of cluster-randomized experiments) - 专知论文

会员服务 ·

0

簇 · 估计/估计量 · Weight · 稳健性 · MoDELS ·

2021 年 8 月 5 日

Model-assisted analyses of cluster-randomized experiments

翻译：对集群随机调整实验的模型辅助分析

Fangzhou Su,Peng Ding

Cluster-randomized experiments are widely used due to their logistical convenience and policy relevance. To analyze them properly, we must address the fact that the treatment is assigned at the cluster level instead of the individual level. Standard analytic strategies are regressions based on individual data, cluster averages, and cluster totals, which differ when the cluster sizes vary. These methods are often motivated by models with strong and unverifiable assumptions, and the choice among them can be subjective. Without any outcome modeling assumption, we evaluate these regression estimators and the associated robust standard errors from a design-based perspective where only the treatment assignment itself is random and controlled by the experimenter. We demonstrate that regression based on cluster averages targets a weighted average treatment effect, regression based on individual data is suboptimal in terms of efficiency, and regression based on cluster totals is consistent and more efficient with a large number of clusters. We highlight the critical role of covariates in improving estimation efficiency, and illustrate the efficiency gain via both simulation studies and data analysis. Moreover, we show that the robust standard errors are convenient approximations to the true asymptotic standard errors under the design-based perspective. Our theory holds even when the outcome models are misspecified, so it is model-assisted rather than model-based. We also extend the theory to a wider class of weighted average treatment effects.

翻译：为了正确分析它们,我们必须从设计角度来评估这些回归估计器及其相关的强势标准错误,因为只有治疗任务本身是随机的,由实验者控制。我们证明,基于组平均数的回归是加权平均处理效果,基于单个数据的回归在效率方面不尽相同,而基于组数的回归则与大量组数一致,效率更高。我们强调,在提高估算效率方面,各种差异的关键作用至关重要,并且通过模拟研究和数据分析来说明效率的提高。此外,我们表明,强势的标准错误甚至可以与真实相近,在基于模型和数据分析的模型中,我们发现,强势的标准错误比基于模型的模型和加权结果的理论更宽泛。

0

相关内容

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

已删除

将门创投

4+阅读 · 2018年1月19日

Stratification Trees for Adaptive Randomization in Randomized Controlled Trials

Arxiv

0+阅读 · 2021年10月4日

Boosted nonparametric hazards with time-dependent covariates

Arxiv

0+阅读 · 2021年10月4日

Confidence Intervals for Seroprevalence

Arxiv

0+阅读 · 2021年10月4日

Clarifying Selection Bias in Cluster Randomized Trials: Estimands and Estimation

Clarifying Selection Bias in Cluster Randomized Trials: Estimands and Estimation

Arxiv

0+阅读 · 2021年10月4日

On the Fairness of Randomized Trials for Recommendation with Heterogeneous Demographics and Beyond

Arxiv

0+阅读 · 2021年10月4日

Expected Validation Performance and Estimation of a Random Variable's Maximum

Arxiv

0+阅读 · 2021年10月1日

Self-Validated Ensemble Models for Design of Experiments

Arxiv

1+阅读 · 2021年10月1日

Unbiased Experiments in Congested Networks

Arxiv

0+阅读 · 2021年9月30日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

18+阅读 · 2019年10月30日

Efficient Parameter-free Clustering Using First Neighbor Relations

Efficient Parameter-free Clustering Using First Neighbor Relations

Arxiv

7+阅读 · 2019年2月28日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

数据要素发展报告(2025年)：附下载

人工智能代理提升战时舰船战备水平

【NeurIPS2025教程】大语言模型规划

NeurIPS 2025 教程：深度学习训练不稳定性的理论洞见

相关资讯

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

已删除

将门创投

4+阅读 · 2018年1月19日

相关论文

Stratification Trees for Adaptive Randomization in Randomized Controlled Trials

Arxiv

0+阅读 · 2021年10月4日

Boosted nonparametric hazards with time-dependent covariates

Arxiv

0+阅读 · 2021年10月4日

Confidence Intervals for Seroprevalence

Arxiv

0+阅读 · 2021年10月4日

Clarifying Selection Bias in Cluster Randomized Trials: Estimands and Estimation

Clarifying Selection Bias in Cluster Randomized Trials: Estimands and Estimation

Arxiv

0+阅读 · 2021年10月4日

On the Fairness of Randomized Trials for Recommendation with Heterogeneous Demographics and Beyond

Arxiv

0+阅读 · 2021年10月4日

Expected Validation Performance and Estimation of a Random Variable's Maximum

Arxiv

0+阅读 · 2021年10月1日

Self-Validated Ensemble Models for Design of Experiments

Arxiv

1+阅读 · 2021年10月1日

Unbiased Experiments in Congested Networks

Arxiv

0+阅读 · 2021年9月30日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

18+阅读 · 2019年10月30日

Efficient Parameter-free Clustering Using First Neighbor Relations

Efficient Parameter-free Clustering Using First Neighbor Relations

Arxiv

7+阅读 · 2019年2月28日

微信扫码咨询专知VIP会员