Link prediction, a foundational task in complex network analysis, has extensive applications in critical scenarios such as social recommendation, drug target discovery, and knowledge graph completion. However, existing evaluations of algorithmic often rely on experiments conducted on a limited number of networks, assuming consistent performance rankings across domains. Despite the significant disparities in generative mechanisms and semantic contexts, previous studies often improperly highlight ``universally optimal" algorithms based solely on naive average over networks across domains. This paper systematically evaluates 12 mainstream link prediction algorithms across 740 real-world networks spanning seven domains. We present substantial empirical evidence elucidating the performance of algorithms in specific domains. This findings reveal a notably low degree of consistency in inter-domain algorithm rankings, a phenomenon that stands in stark contrast to the high degree of consistency observed within individual domains. Principal Component Analysis shows that response vectors formed by the rankings of the 12 algorithms cluster distinctly by domain in low-dimensional space, thus confirming domain attributes as a pivotal factor affecting algorithm performance. We propose a metric called Winner Score that could identify the superior algorithm in each domain: Non-Negative Matrix Factorization for social networks, Neighborhood Overlap-aware Graph Neural Networks for economics, Graph Convolutional Networks for chemistry, and L3-based Resource Allocation for biology. However, these domain-specific top-performing algorithms tend to exhibit suboptimal performance in other domains. This finding underscores the importance of aligning an algorithm's mechanism with the network structure.
翻译:链接预测作为复杂网络分析的一项基础任务,在社交推荐、药物靶点发现和知识图谱补全等关键场景中具有广泛应用。然而,现有算法评估通常基于有限数量网络上的实验,并假设算法在不同领域间具有一致的性能排序。尽管网络在生成机制和语义背景上存在显著差异,先前研究往往仅通过对跨领域网络的简单平均,不当强调“普遍最优”算法。本文系统评估了12种主流链接预测算法在横跨七个领域的740个真实网络上的表现。我们提供了充分的实证证据,阐明了算法在特定领域中的性能。研究结果揭示了领域间算法排序的一致性程度极低,这一现象与单个领域内部观察到的高度一致性形成鲜明对比。主成分分析表明,由12种算法排序构成的响应向量在低维空间中按领域明显聚类,从而证实领域属性是影响算法性能的关键因素。我们提出了一种称为优胜分数的度量方法,可用于识别各领域中的优势算法:社交网络中的非负矩阵分解、经济领域中的邻域重叠感知图神经网络、化学领域中的图卷积网络,以及生物领域中的基于L3的资源分配算法。然而,这些领域内表现最优的算法在其他领域往往表现欠佳。这一发现强调了算法机制与网络结构相匹配的重要性。