Better Datastore, Better Translation: Generating Datastores from Pre-Trained Models for Nearest Neural Machine Translation - 专知论文

December 17, 2022

Better Datastore, Better Translation: Generating Datastores from Pre-Trained Models for Nearest Neural Machine Translation

Jiahuan Li,Shanbo Cheng,Zewei Sun,Mingxuan Wang,Shujian Huang

Nearest Neighbor Machine Translation (kNN-MT) is a simple and effective method of augmenting neural machine translation (NMT) with a token-level nearest neighbor retrieval mechanism. The effectiveness of kNN-MT depends directly on the quality of the retrieved neighbors. However, the original kNN-MT builds its datastores from the representations of NMT models, which results in poor retrieval accuracy when the NMT models are not good enough and leads to sub-optimal translation performance. In this paper, we propose PRED, a framework that leverages Pre-trained models for Datastores in kNN-MT. Better representations from pre-trained models allow us to build datastores of better quality. We also design a novel contrastive alignment objective to mitigate the representation gap between the NMT model and pre-trained models, enabling the NMT model to retrieve from better datastores. We conduct extensive experiments on both bilingual and multilingual translation benchmarks, including WMT17 English $\leftrightarrow$ Chinese, WMT14 English $\leftrightarrow$ German, IWSLT14 German $\leftrightarrow$ English, and IWSLT14 multilingual datasets. Empirical results demonstrate the effectiveness of PRED.
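To make the token-level retrieval mechanism mentioned in the abstract concrete, the following is a minimal sketch of the generic kNN-MT decoding step as it is commonly described: look up the k nearest datastore keys for the current decoder representation, turn their distances into a distribution over target tokens, and interpolate that distribution with the NMT model's own prediction. It is not the PRED implementation. All names (`knn_distribution`, `interpolate`, `temperature`, `lam`), the toy two-dimensional keys, and the brute-force search are assumptions made for illustration; practical systems typically use an approximate nearest neighbor index instead.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]
# Hypothetical datastore layout: (key representation, target token id) pairs.
Datastore = List[Tuple[Vector, int]]


def squared_l2(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def knn_distribution(query: Vector, datastore: Datastore, k: int,
                     temperature: float, vocab_size: int) -> List[float]:
    """Distribution over target tokens built from the k nearest keys."""
    # Brute-force search over all keys; fine for a toy example only.
    neighbors = sorted(
        (squared_l2(query, key), token) for key, token in datastore
    )[:k]
    # Softmax over negative distances; shifting by the smallest distance
    # keeps the exponentials numerically stable.
    d_min = neighbors[0][0]
    weights = [math.exp(-(d - d_min) / temperature) for d, _ in neighbors]
    total = sum(weights)
    probs = [0.0] * vocab_size
    for weight, (_, token) in zip(weights, neighbors):
        probs[token] += weight / total  # neighbors sharing a token add up
    return probs


def interpolate(p_nmt: Sequence[float], p_knn: Sequence[float],
                lam: float) -> List[float]:
    """Mix the retrieval distribution with the NMT distribution."""
    return [lam * pk + (1.0 - lam) * pn for pn, pk in zip(p_nmt, p_knn)]


# Toy example: 4 datastore entries, a vocabulary of 3 token ids.
toy_datastore: Datastore = [
    ([0.0, 0.0], 0),
    ([1.0, 0.0], 1),
    ([0.0, 1.0], 1),
    ([5.0, 5.0], 2),
]
p_knn = knn_distribution([0.9, 0.1], toy_datastore, k=2,
                         temperature=1.0, vocab_size=3)
p_final = interpolate([0.6, 0.3, 0.1], p_knn, lam=0.5)
print(p_knn, p_final)
```

In this sketch the quality of `p_knn` depends entirely on how well the keys separate target tokens, which is the dependence on representation quality that the abstract points to as the motivation for building datastores from pre-trained models.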
