The abundance of fine-grained spatio-temporal data, such as traffic sensor networks, offers vast opportunities for scientific discovery. However, inferring causal relationships from such observational data remains challenging, particularly due to unobserved confounders that are specific to units (e.g., geographical locations) yet influence outcomes over time. Most existing methods for spatio-temporal causal inference assume that all confounders are observed, an assumption that is often violated in practice. In this paper, we introduce Spatio-Temporal Hierarchical Causal Models (ST-HCMs), a novel graphical framework that extends hierarchical causal modeling to the spatio-temporal domain. At the core of our approach is the Spatio-Temporal Collapse Theorem, which shows that a complex ST-HCM converges to a simpler flat causal model as the amount of subunit data increases. This theoretical result enables a general procedure for causal identification, allowing ST-HCMs to recover causal effects even in the presence of unobserved, time-invariant unit-level confounders, a scenario where standard non-hierarchical models fail. We validate the effectiveness of our framework on both synthetic and real-world datasets, demonstrating its potential for robust causal inference in complex dynamic systems.
翻译:细粒度时空数据(如交通传感器网络)的丰富性为科学发现提供了广阔机遇。然而,从此类观测数据中推断因果关系仍具挑战性,特别是由于存在针对特定单元(如地理位置)且随时间影响结果的未观测混杂因子。现有大多数时空因果推断方法假设所有混杂因子均被观测,这一假设在实践中常被违背。本文提出时空分层因果模型(ST-HCMs),这是一种将分层因果建模扩展至时空领域的新型图模型框架。我们方法的核心是时空坍缩定理,该定理证明随着子单元数据量的增加,复杂的ST-HCM会收敛于更简单的扁平因果模型。这一理论结果启发了通用的因果识别流程,使得ST-HCM即使在存在未观测的、时不变单元级混杂因子的情况下仍能恢复因果效应——而标准的非分层模型在此场景下会失效。我们在合成数据集和真实数据集上验证了该框架的有效性,展示了其在复杂动态系统中实现稳健因果推断的潜力。