Temporal Knowledge Graphs have emerged as a powerful way of not only modeling static relationships between entities but also the dynamics of how relations evolve over time. As these informational structures can be used to store information from a real-world setting, such as a news flow, predicting future graph components to a certain extent equates predicting real-world events. Most of the research in this field focuses on embedding-based methods, often leveraging convolutional neural net architectures. These solutions act as black boxes, limiting insight. In this paper, we explore an extension to an established rule-based framework, TLogic, that yields a high accuracy in combination with explainable predictions. This offers transparency and allows the end-user to critically evaluate the rules applied at the end of the prediction stage. The new rule format incorporates entity category as a key component with the purpose of limiting rule application only to relevant entities. When categories are unknown for building the graph, we propose a data-driven method to generate them with an LLM-based approach. Additionally, we investigate the choice of aggregation method for scores of retrieved entities when performing category prediction.
翻译:时序知识图谱已成为一种强大的方法,不仅能建模实体间的静态关系,还能捕捉关系随时间演化的动态特性。由于这类信息结构可用于存储现实世界场景(如新闻流)中的信息,预测未来图谱组件在一定程度上等同于预测现实世界事件。该领域的大多数研究集中于基于嵌入的方法,常利用卷积神经网络架构。这些解决方案如同黑箱,限制了可解释性。本文探索了对已有规则驱动框架TLogic的扩展,该框架在保持高精度的同时提供可解释的预测。这增强了透明度,使终端用户能在预测阶段对应用的规则进行批判性评估。新规则格式将实体类别作为关键组成部分,旨在将规则应用限定于相关实体。当构建图谱时类别信息未知,我们提出一种基于大语言模型的数据驱动方法来自动生成类别。此外,我们研究了在执行类别预测时,对检索实体得分的聚合方法选择问题。