Entity Set Co-Expansion in StackOverflow - 专知 (Zhuanzhi) Papers


December 5, 2022

Entity Set Co-Expansion in StackOverflow


Yu Zhang, Yunyi Zhang, Yucheng Jiang, Martin Michalski, Yu Deng, Lucian Popa, ChengXiang Zhai, Jiawei Han

From arXiv; 4 pages; accepted to IEEE BigData 2022.

Given a few seed entities of a certain type (e.g., Software or Programming Language), entity set expansion aims to discover an extensive set of entities that share the same type as the seeds. Entity set expansion in software-related domains such as StackOverflow can benefit many downstream tasks (e.g., software knowledge graph construction) and facilitate better IT operations and service management. Meanwhile, existing approaches are less concerned with two problems: (1) How to deal with multiple types of seed entities simultaneously? (2) How to leverage the power of pre-trained language models (PLMs)? Being aware of these two problems, in this paper, we study the entity set co-expansion task in StackOverflow, which extracts Library, OS, Application, and Language entities from StackOverflow question-answer threads. During the co-expansion process, we use PLMs to derive embeddings of candidate entities for calculating similarities between entities. Experimental results show that our proposed SECoExpan framework outperforms previous approaches significantly.
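
To make the idea of co-expansion with embedding similarity concrete, here is a minimal sketch. It is not the SECoExpan framework itself, whose details the abstract does not give. It assumes each candidate entity already has an embedding vector (in the paper these come from a PLM; here they are small hand-written toy vectors), and it assumes a simple rule: in each round, every unassigned candidate is scored against each type by its mean cosine similarity to that type's current members, and each type admits only its top-scoring candidates. The function name `co_expand`, the parameters `rounds` and `top_k`, and all entity vectors are hypothetical.

```python
import numpy as np


def cosine(a, b):
    """Cosine similarity between two 1-D vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def co_expand(embeddings, seeds, rounds=2, top_k=1):
    """Toy co-expansion: grow several typed seed sets at the same time.

    embeddings: dict mapping entity -> 1-D numpy vector
                (hypothetical stand-ins for PLM-derived embeddings)
    seeds:      dict mapping type name -> list of seed entities
    """
    expanded = {t: list(members) for t, members in seeds.items()}
    assigned = {e for members in seeds.values() for e in members}

    for _ in range(rounds):
        # For each unassigned candidate, find the type it is closest to.
        best = {}
        for cand, vec in embeddings.items():
            if cand in assigned:
                continue
            scores = {
                t: np.mean([cosine(vec, embeddings[m]) for m in members])
                for t, members in expanded.items()
            }
            t_best = max(scores, key=scores.get)
            best.setdefault(t_best, []).append((scores[t_best], cand))
        if not best:
            break
        # Each type admits only its top_k highest-scoring candidates.
        for t, scored in best.items():
            for _, cand in sorted(scored, reverse=True)[:top_k]:
                expanded[t].append(cand)
                assigned.add(cand)
    return expanded


# Toy 4-D embeddings: one axis per type, plus a little noise (assumed data).
embeddings = {
    "python": np.array([1.0, 0.0, 0.0, 0.0]),
    "java": np.array([0.9, 0.1, 0.0, 0.0]),
    "rust": np.array([0.8, 0.0, 0.1, 0.1]),
    "linux": np.array([0.0, 1.0, 0.0, 0.0]),
    "windows": np.array([0.1, 0.9, 0.0, 0.0]),
    "macos": np.array([0.0, 0.8, 0.2, 0.0]),
    "numpy": np.array([0.0, 0.0, 1.0, 0.0]),
    "pandas": np.array([0.1, 0.0, 0.9, 0.0]),
    "excel": np.array([0.0, 0.0, 0.0, 1.0]),
    "photoshop": np.array([0.0, 0.1, 0.0, 0.9]),
}
seeds = {
    "Language": ["python"],
    "OS": ["linux"],
    "Library": ["numpy"],
    "Application": ["excel"],
}

result = co_expand(embeddings, seeds, rounds=2, top_k=1)
for entity_type, members in result.items():
    print(entity_type, members)
```

Because all types are scored in the same pass, a candidate can join at most one set, which is one simple way to handle multiple seed types simultaneously; a real system would replace the toy vectors with contextual embeddings derived from StackOverflow text.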

