Inductive link prediction is emerging as a key paradigm for real-world knowledge graphs (KGs), where new entities appear frequently and models must generalize to them without retraining. Predicting links in this setting requires handling previously unseen entities by leveraging generalizable node features such as subgraph structure, type annotations, and ontological constraints. However, explicit type information is frequently missing or incomplete, and even when available it tends to be coarse-grained, sparse, and error-prone due to human annotation. In this work, we explore the potential of pre-trained language models (PLMs) to enrich node representations with implicit type signals. We introduce TyleR, a Type-less yet type-awaRe approach to subgraph-based inductive link prediction that leverages PLMs for semantic enrichment. Experiments on standard benchmarks show that TyleR outperforms state-of-the-art baselines in scenarios with scarce type annotations and sparse graph connectivity. To ensure reproducibility, we share our code at https://github.com/sisinflab/tyler.
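To make the core idea concrete, the sketch below shows one plausible way to derive implicit type signals from entity text with a PLM and fuse them with structural node features. This is an illustration, not the TyleR implementation: the PLM checkpoint, the entity descriptions, the structural feature dimension, and the fusion-by-concatenation step are all assumptions for the example.

```python
# Minimal sketch (assumptions, not the authors' code): embed entity
# descriptions with a pre-trained language model so that the [CLS]
# vector acts as an implicit type signal, then concatenate it with
# structural features of the kind used by subgraph-based methods.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")  # assumed checkpoint
plm = AutoModel.from_pretrained("bert-base-uncased")

def semantic_features(descriptions):
    """Encode entity descriptions; return one [CLS] embedding per entity."""
    batch = tokenizer(descriptions, padding=True, truncation=True,
                      return_tensors="pt")
    with torch.no_grad():
        out = plm(**batch)
    return out.last_hidden_state[:, 0]  # shape: (num_entities, 768)

# Hypothetical structural features, e.g. node labels computed on an
# enclosing subgraph; random placeholders here for illustration.
num_entities, struct_dim = 4, 32
structural = torch.randn(num_entities, struct_dim)

descriptions = ["a city in Italy", "a football club",
                "a chemical compound", "a person"]

# Type-aware node representations: semantic view + structural view.
nodes = torch.cat([semantic_features(descriptions), structural], dim=-1)
print(nodes.shape)  # torch.Size([4, 800])
```

Because the semantic view comes from entity text rather than an explicit type vocabulary, such representations remain available for entities unseen at training time, which is what makes the approach "type-less yet type-aware" in the inductive setting.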