Graph neural networks (GNNs) have achieved tremendous success in graph mining. However, the inability of GNNs to model substructures in graphs remains a significant drawback. Specifically, message-passing GNNs (MPGNNs), as the prevailing type of GNNs, have been theoretically shown unable to distinguish, detect or count many graph substructures. While efforts have been paid to complement the inability, existing works either rely on pre-defined substructure sets, thus being less flexible, or are lacking in theoretical insights. In this paper, we propose GSKN, a GNN model with a theoretically stronger ability to distinguish graph structures. Specifically, we design GSKN based on anonymous walks (AWs), flexible substructure units, and derive it upon feature mappings of graph kernels (GKs). We theoretically show that GSKN provably extends the 1-WL test, and hence the maximally powerful MPGNNs from both graph-level and node-level viewpoints. Correspondingly, various experiments are leveraged to evaluate GSKN, where GSKN outperforms a wide range of baselines, endorsing the analysis.
翻译:在图形采矿中,GNN无法在图形中建模子结构,这仍然是一个很大的缺点。具体地说,GNN(MPGNN)作为通用的GNN(MPGNN)类型,在理论上显示无法辨别、检测或计算许多图形子结构。虽然已经付出了努力来补充这一能力,但现有的工程要么依赖于预先定义的子结构,因此不那么灵活,或者缺乏理论见解。在本文中,我们提议GNN(GNNN)模式,在理论上更有能力区分图形结构。具体地说,我们设计以匿名行走(AW)为基础的GNN(MPGNN)模式,通过图形内核(GKs)的地貌图图图绘制来得出。我们理论上表明,GSKN(GSKN)可以合理地扩展1-WL测试,因此,从图形层层和节界的观点来看,具有最大功率的MPGGNNN(MPGN)。相应的是利用各种实验来评估GSKN(GKN),在其中,GSKN(GKN)超越了各种基线,以进行广泛的分析。