A hypergraph is a generalization of a graph, in which a hyperedge can connect multiple vertices, modeling complex relationships involving multiple vertices simultaneously. Hypergraph pattern matching, which is to find all isomorphic embeddings of a query hypergraph in a data hypergraph, is one of the fundamental problems. In this paper, we present a novel algorithm for hypergraph pattern matching by introducing (1) the intersection constraint, a necessary and sufficient condition for valid embeddings, which significantly speeds up the verification process, (2) the candidate hyperedge space, a data structure that stores potential mappings between hyperedges in the query hypergraph and the data hypergraph, and (3) the Match-and-Filter framework, which interleaves matching and filtering operations to maintain only compatible candidates in the candidate hyperedge space during backtracking. Experimental results on real-world datasets demonstrate that our algorithm significantly outperforms the state-of-the-art algorithms, by up to orders of magnitude in terms of query processing time.
翻译:超图是图的一种推广,其中一条超边可以连接多个顶点,从而能够同时对涉及多个顶点的复杂关系进行建模。超图模式匹配旨在数据超图中找出查询超图的所有同构嵌入,是该领域的基础问题之一。本文提出了一种新颖的超图模式匹配算法,其核心贡献包括:(1)引入交集约束,作为有效嵌入的充分必要条件,从而显著加速验证过程;(2)提出候选超边空间这一数据结构,用于存储查询超图与数据超图之间超边的潜在映射关系;(3)设计匹配-过滤框架,通过在回溯过程中交替执行匹配与过滤操作,确保候选超边空间中仅保留相容的候选映射。在真实数据集上的实验结果表明,本算法在查询处理时间上显著优于现有最优算法,性能提升可达数个数量级。