Entity Linking (EL) has traditionally relied on large annotated datasets and extensive model fine-tuning. While recent few-shot methods leverage large language models (LLMs) through prompting to reduce training requirements, they often suffer from inefficiencies due to expensive LLM-based reasoning. ARTER (Adaptive Routing and Targeted Entity Reasoning) is a structured pipeline that achieves high performance without deep fine-tuning by strategically combining candidate generation, context-based scoring, adaptive routing, and selective reasoning. ARTER computes a small set of complementary signals (both embedding- and LLM-based) over the retrieved candidates to categorize contextual mentions into easy and hard cases, which are then handled by a low-computational-cost entity linker (e.g., ReFinED) and by more expensive targeted LLM-based reasoning, respectively. On standard benchmarks, ARTER outperforms ReFinED by up to +4.47%, with an average gain of +2.53% on 5 out of 6 datasets, and performs comparably to pipelines that apply LLM-based reasoning to all mentions while using roughly half as many LLM tokens.
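The easy/hard routing described above can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the signal names, the single confidence threshold, and the function names are all assumptions introduced here for clarity.

```python
# Illustrative sketch of adaptive routing between a cheap entity linker and
# targeted LLM-based reasoning. Signal names and the threshold are hypothetical.

def route_mention(signals, threshold=0.8):
    """Label a mention 'easy' or 'hard' from complementary candidate signals.

    signals: dict of per-mention confidence scores in [0, 1], e.g. a retrieval
    margin between the top two candidates and an embedding-similarity score.
    """
    # A mention is "easy" only when every signal is confidently high.
    if all(score >= threshold for score in signals.values()):
        return "easy"  # handled by the low-cost linker (e.g. ReFinED)
    return "hard"      # escalated to targeted LLM-based reasoning


def link_entities(mentions, cheap_linker, llm_reasoner, threshold=0.8):
    """Dispatch each mention to the appropriate resolver based on routing."""
    results = {}
    for mention, signals in mentions.items():
        if route_mention(signals, threshold) == "easy":
            results[mention] = cheap_linker(mention)
        else:
            results[mention] = llm_reasoner(mention)
    return results
```

Routing only low-confidence mentions to the LLM is what yields the token savings reported in the abstract: the cheap linker absorbs the easy majority, and the LLM is invoked selectively.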