不受监督的神经机器翻译与重量共享 (Unsupervised Neural Machine Translation with Weight Sharing)

Unsupervised neural machine translation (NMT) is a recently proposed approach for machine translation which aims to train the model without using any labeled data. The models proposed for unsupervised NMT often use only one shared encoder to map the pairs of sentences from different languages to a shared-latent space, which is weak in keeping the unique and internal characteristics of each language, such as the style, terminology, and sentence structure. To address this issue, we introduce an extension by utilizing two independent encoders but sharing some partial weights which are responsible for extracting high-level representations of the input sentences. Besides, two different generative adversarial networks (GANs), namely the local GAN and global GAN, are proposed to enhance the cross-language translation. With this new approach, we achieve significant improvements on English-German, English-French and Chinese-to-English translation tasks.

翻译：无人监督的神经机器翻译(NMT)是最近提出的一种机器翻译方法,目的是在不使用任何标签数据的情况下对模型进行培训,为无人监督的NMT提出的模型通常只使用一个共用编码器来绘制从不同语言到共享空间的一对判决,这在保持每种语言的独特和内部特征方面是软弱的,例如风格、术语和句子结构。为了解决这一问题,我们采用了一种扩展,使用了两个独立的编码器,但分享了部分重量,负责提取输入句的高层表述。此外,还提出了两个不同的基因化对抗网络(GANs),即本地GAN和全球GAN,以加强跨语言翻译。有了这种新方法,我们在英语-德语、英语-法语和中文-英语-英语翻译任务上取得了重大改进。

相关内容

Machine Translation

关注 209

机器翻译（Machine Translation）涵盖计算语言学和语言工程的所有分支，包含多语言方面。特色论文涵盖理论，描述或计算方面的任何下列主题:双语和多语语料库的编写和使用，计算机辅助语言教学，非罗马字符集的计算含义，连接主义翻译方法，对比语言学等。官网地址：http://dblp.uni-trier.de/db/journals/mt/

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【Google】无监督机器翻译，Unsupervised Machine Translation

专知会员服务

36+阅读 · 2020年3月3日