Massively Multilingual Lexical Specialization of Multilingual Transformers

May 29, 2023

Tommaso Green, Simone Paolo Ponzetto, Goran Glavaš

From arXiv; accepted at ACL 2023

While pretrained language models (PLMs) primarily serve as general-purpose text encoders that can be fine-tuned for a wide variety of downstream tasks, recent work has shown that they can also be rewired to produce high-quality word representations (i.e., static word embeddings) and yield good performance in type-level lexical tasks. While existing work primarily focused on the lexical specialization of monolingual PLMs with immense quantities of monolingual constraints, in this work we expose massively multilingual transformers (MMTs, e.g., mBERT or XLM-R) to multilingual lexical knowledge at scale, leveraging BabelNet as the readily available rich source of multilingual and cross-lingual type-level lexical knowledge. Concretely, we use BabelNet's multilingual synsets to create synonym pairs (or synonym-gloss pairs) across 50 languages and then subject the MMTs (mBERT and XLM-R) to a lexical specialization procedure guided by a contrastive objective. We show that such massively multilingual lexical specialization brings substantial gains in two standard cross-lingual lexical tasks, bilingual lexicon induction and cross-lingual word similarity, as well as in cross-lingual sentence retrieval. Crucially, we observe gains for languages unseen in specialization, indicating that multilingual lexical specialization enables generalization to languages with no lexical constraints. In a series of subsequent controlled experiments, we show that the number of specialization constraints plays a much greater role than the set of languages from which they originate.
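
The two ingredients named in the abstract, synonym pairs drawn from multilingual synsets and a contrastive objective, can be illustrated with a small sketch. Everything here is an assumption made for illustration: the toy `SYNSETS` data, the helper names `synonym_pairs` and `info_nce`, and the choice of an InfoNCE-style loss with in-batch negatives, which is one common contrastive objective and not necessarily the one the authors use. The vectors stand in for word encodings that a model such as mBERT or XLM-R would produce.

```python
import itertools
import math

# Hypothetical toy synsets: each maps a language code to its lemmas.
# Real BabelNet synsets are far larger; this data is made up for illustration.
SYNSETS = [
    {"en": ["dog"], "de": ["Hund"], "it": ["cane"]},
    {"en": ["house", "home"], "de": ["Haus"]},
]


def synonym_pairs(synset):
    """Return every unordered pair of distinct (language, lemma) entries in one synset."""
    entries = [(lang, lemma) for lang, lemmas in sorted(synset.items()) for lemma in lemmas]
    return list(itertools.combinations(entries, 2))


def cosine(u, v):
    """Cosine similarity of two non-zero vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def info_nce(anchors, positives, temperature=0.1):
    """Mean InfoNCE loss with in-batch negatives.

    anchors[i] should be closest to positives[i]; every other positives[j]
    in the batch acts as a negative for anchors[i].
    """
    losses = []
    for i, anchor in enumerate(anchors):
        logits = [cosine(anchor, p) / temperature for p in positives]
        top = max(logits)  # subtract the max for a numerically stable log-sum-exp
        log_denominator = top + math.log(sum(math.exp(x - top) for x in logits))
        losses.append(log_denominator - logits[i])
    return sum(losses) / len(losses)


# Collect pairs over all synsets; these would serve as the lexical constraints.
pairs = [pair for synset in SYNSETS for pair in synonym_pairs(synset)]

# Stand-in encodings: the aligned batch pairs each anchor with its own synonym,
# the shuffled batch pairs it with the wrong one.
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned_loss = info_nce(anchors, [[1.0, 0.0], [0.0, 1.0]])
shuffled_loss = info_nce(anchors, [[0.0, 1.0], [1.0, 0.0]])
```

In this sketch the loss is small when each anchor is nearest to its own synonym and large when the batch is shuffled, which is the signal a specialization procedure would use to pull synonyms together in the encoder's space. Note that `synonym_pairs` also yields same-language pairs such as `house` and `home`; whether to keep them is a design choice this sketch leaves open.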
