Causal discovery aims to uncover causal structure among a set of variables. Score-based approaches mainly focus on searching for the best Directed Acyclic Graph (DAG) based on a predefined score function. However, most of them are not applicable on a large scale due to the limited searchability. Inspired by the active learning in generative flow networks, we propose a novel approach to learning a DAG from observational data called GFlowCausal. It converts the graph search problem to a generation problem, in which direct edges are added gradually. GFlowCausal aims to learn the best policy to generate high-reward DAGs by sequential actions with probabilities proportional to predefined rewards. We propose a plug-and-play module based on transitive closure to ensure efficient sampling. Theoretical analysis shows that this module could guarantee acyclicity properties effectively and the consistency between final states and fully-connected graphs. We conduct extensive experiments on both synthetic and real datasets, and results show the proposed approach to be superior and also performs well in a large-scale setting.
翻译:以分数为基础的方法主要侧重于根据预先界定的评分函数寻找最佳定向循环图(DAG),然而,由于搜索能力有限,其中大多数不大规模适用。在基因流动网络的积极学习的启发下,我们提出一种新的方法,从称为GFlowCausal的观测数据中学习DAG。它将图形搜索问题转换成一代问题,逐渐增加直接边缘。GFlowCausal的目的是学习最佳政策,通过相继行动产生高回报的DAG(DAG),其概率与预先界定的奖励成比例。我们提议了一个基于过渡封闭的插座和游戏模块,以确保有效的抽样。理论分析表明,这一模块可以有效保证周期特性,保证最终状态和完全相连的图表之间的一致性。我们在合成和真实数据集方面进行广泛的实验,结果显示,拟议的方法在大规模环境下是优异的,而且表现也很好。</s>