Simultaneous machine translation consists in starting output generation before the entire input sequence is available. Wait-k decoders offer a simple but efficient approach to this problem. They first read k source tokens, after which they alternate between producing a target token and reading another source token. We investigate the behavior of wait-k decoding in low-resource settings for spoken corpora using IWSLT datasets. We improve the training of these models using unidirectional encoders and training across multiple values of k. Experiments with Transformer and 2D-convolutional architectures show that our wait-k models generalize well across a wide range of latency levels. We also show that the 2D-convolutional architecture is competitive with Transformers for simultaneous translation of spoken language.
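To make the wait-k schedule concrete, here is a minimal sketch in Python. It is not the authors' implementation; the `emit_next` callback is a hypothetical stand-in for the translation model, and the helper names are assumptions for illustration only.

```python
# Minimal sketch of the wait-k decoding schedule (hypothetical helper names,
# not the paper's implementation). The policy first reads k source tokens,
# then alternates between emitting one target token and reading one more
# source token until the source is exhausted, after which it decodes freely.

from typing import Callable, List

def wait_k_decode(
    source: List[str],
    k: int,
    emit_next: Callable[[List[str], List[str]], str],
    eos: str = "</s>",
    max_len: int = 200,
) -> List[str]:
    """Simulate wait-k simultaneous decoding over a token stream.

    `emit_next(src_prefix, tgt_prefix)` stands in for the model: given the
    source tokens read so far and the target generated so far, it returns
    the next target token (an assumption made for this sketch).
    """
    read = min(k, len(source))          # initial wait: read k source tokens
    target: List[str] = []
    while len(target) < max_len:
        token = emit_next(source[:read], target)
        target.append(token)
        if token == eos:
            break
        if read < len(source):          # alternate: read one more source token
            read += 1
    return target
```

Under this schedule the output lags the input by roughly k tokens while the source is still streaming, which is the latency lever that training across multiple values of k exploits.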