基于匹配-过滤与交集约束的高效超图模式匹配 (Efficient Hypergraph Pattern Matching via Match-and-Filter and Intersection Constraint)

A hypergraph is a generalization of a graph, in which a hyperedge can connect multiple vertices, modeling complex relationships involving multiple vertices simultaneously. Hypergraph pattern matching, which is to find all isomorphic embeddings of a query hypergraph in a data hypergraph, is one of the fundamental problems. In this paper, we present a novel algorithm for hypergraph pattern matching by introducing (1) the intersection constraint, a necessary and sufficient condition for valid embeddings, which significantly speeds up the verification process, (2) the candidate hyperedge space, a data structure that stores potential mappings between hyperedges in the query hypergraph and the data hypergraph, and (3) the Match-and-Filter framework, which interleaves matching and filtering operations to maintain only compatible candidates in the candidate hyperedge space during backtracking. Experimental results on real-world datasets demonstrate that our algorithm significantly outperforms the state-of-the-art algorithms, by up to orders of magnitude in terms of query processing time.

翻译：超图是图的一种推广，其中一条超边可以连接多个顶点，从而能够同时对涉及多个顶点的复杂关系进行建模。超图模式匹配旨在数据超图中找出查询超图的所有同构嵌入，是该领域的基础问题之一。本文提出了一种新颖的超图模式匹配算法，其核心贡献包括：（1）引入交集约束，作为有效嵌入的充分必要条件，从而显著加速验证过程；（2）提出候选超边空间这一数据结构，用于存储查询超图与数据超图之间超边的潜在映射关系；（3）设计匹配-过滤框架，通过在回溯过程中交替执行匹配与过滤操作，确保候选超边空间中仅保留相容的候选映射。在真实数据集上的实验结果表明，本算法在查询处理时间上显著优于现有最优算法，性能提升可达数个数量级。