采取强有力的方法,量化不确定性,以匹配因果推断问题 (A robust approach to quantifying uncertainty in matching problems of causal inference)

Unquantified sources of uncertainty in observational causal analyses can break the integrity of the results. One would never want another analyst to repeat a calculation with the same dataset, using a seemingly identical procedure, only to find a different conclusion. However, as we show in this work, there is a typical source of uncertainty that is essentially never considered in observational causal studies: the choice of match assignment for matched groups, that is, which unit is matched to which other unit before a hypothesis test is conducted. The choice of match assignment is anything but innocuous, and can have a surprisingly large influence on the causal conclusions. Given that a vast number of causal inference studies test hypotheses on treatment effects after treatment cases are matched with similar control cases, we should find a way to quantify how much this extra source of uncertainty impacts results. What we would really like to be able to report is that \emph{no matter} which match assignment is made, as long as the match is sufficiently good, then the hypothesis test result still holds. In this paper, we provide methodology based on discrete optimization to create robust tests that explicitly account for this possibility. We formulate robust tests for binary and continuous data based on common test statistics as integer linear programs solvable with common methodologies. We study the finite-sample behavior of our test statistic in the discrete-data case. We apply our methods to simulated and real-world datasets and show that they can produce useful results in practical applied settings.

翻译：观察因果分析中不确定的未知来源会破坏结果的完整性。人们永远不会希望另一位分析师重复使用同一数据集进行计算, 使用看起来相同的程序, 只找到不同的结论。但是, 正如我们在工作中显示的那样, 存在一个典型的不确定性来源, 基本上在观察因果研究中从未考虑过: 匹配组的匹配任务的选择, 也就是说, 在进行假设测试之前, 哪个单位与哪个单位相匹配。匹配任务的选择是完全的, 但是没有实际的, 并且可能对因果关系结论产生惊人的巨大影响。鉴于大量因果推断研究测试治疗个案后治疗效果的假设与类似控制案例相匹配, 我们应该找到一种方法来量化这种额外的不确定性来源对结果的影响。我们真正想要报告的是, 匹配组组的匹配任务, 只要匹配足够好, 那么假设测试结果仍然有效。在本文中, 我们提供基于离散优化的方法, 来创建稳健的测试, 从而明确解释这种可能性。我们制定稳健的可靠数据测试方法, 以共同的硬性数据测试方法来测试我们的共同的硬性标准。