Learning Transductions and Alignments with RNN Seq2seq Models - 专知 (Zhuanzhi) Papers


seq2seq · RNN · Models · Learning · Generalization theory

March 13, 2023

Learning Transductions and Alignments with RNN Seq2seq Models

Zhengxiang Wang

From arXiv; 24 pages, 9 figures, 7 tables

The paper studies the capabilities of Recurrent-Neural-Network sequence to sequence (RNN seq2seq) models in learning four string-to-string transduction tasks: identity, reversal, total reduplication, and input-specified reduplication. These transductions are traditionally well studied under finite state transducers and attributed with varying complexity. We find that RNN seq2seq models are only able to approximate a mapping that fits the training or in-distribution data. Attention helps significantly, but does not solve the out-of-distribution generalization limitation. Task complexity and RNN variants also play a role in the results. Our results are best understood in terms of the complexity hierarchy of formal languages as opposed to that of string transductions.
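
The four transductions named in the abstract can be written as plain string functions, which makes the target mappings concrete. This is only an illustrative sketch, not the paper's data-generation code. The abstract does not spell out how input-specified reduplication is encoded, so the version below assumes a hypothetical marker symbol `@` appended to the input, where each marker requests one extra copy of the stem.

```python
def identity(w: str) -> str:
    """Map a string to itself: w -> w."""
    return w


def reversal(w: str) -> str:
    """Map a string to its mirror image: w -> reverse(w)."""
    return w[::-1]


def total_reduplication(w: str) -> str:
    """Copy the whole string once: w -> ww."""
    return w + w


def input_specified_reduplication(s: str, marker: str = "@") -> str:
    """Copy the stem once per trailing marker.

    ASSUMPTION: the '@' encoding is hypothetical and chosen for
    illustration; the input is the stem followed by n markers and
    the output is the stem repeated n + 1 times.
    """
    stem = s.rstrip(marker)
    n = len(s) - len(stem)  # number of trailing markers
    return stem * (n + 1)


if __name__ == "__main__":
    for name, fn, arg in [
        ("identity", identity, "abc"),
        ("reversal", reversal, "abc"),
        ("total reduplication", total_reduplication, "abc"),
        ("input-specified reduplication", input_specified_reduplication, "ab@@"),
    ]:
        print(f"{name}: {arg!r} -> {fn(arg)!r}")
```

Out-of-distribution generalization, in this framing, would mean producing the correct output for strings longer or shorter than any seen during training.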

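Related content

seq2seq

seq2seq is a network with an Encoder–Decoder structure: its input is a sequence and its output is also a sequence. The Encoder turns a variable-length input sequence into a fixed-length vector representation, and the Decoder turns that fixed-length vector into a variable-length target sequence.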
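
The encoder-decoder flow described above can be sketched with a tiny, untrained recurrent network in pure Python. Everything here is an assumption made for illustration: the class name `TinySeq2Seq`, the random weights, the Elman-style tanh update, greedy decoding, and the use of token `0` as a start/stop boundary symbol. The encoder and decoder also share recurrent weights for brevity, which real models usually do not. The point is only the shape of the computation: any input length is compressed into one fixed-length vector, from which an output of variable length is unrolled.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Random weight matrix as a list of rows."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(m, v):
    """Matrix-vector product."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def one_hot(i, size):
    return [1.0 if j == i else 0.0 for j in range(size)]


class TinySeq2Seq:
    """Untrained encoder-decoder with random weights (illustration only)."""

    def __init__(self, vocab_size, hidden_size, seed=0):
        rng = random.Random(seed)
        self.vocab_size = vocab_size
        self.hidden_size = hidden_size
        self.w_in = make_matrix(hidden_size, vocab_size, rng)    # token -> hidden
        self.w_hh = make_matrix(hidden_size, hidden_size, rng)   # hidden -> hidden
        self.w_out = make_matrix(vocab_size, hidden_size, rng)   # hidden -> scores

    def _step(self, token, h):
        """One recurrent update: h' = tanh(W_in x + W_hh h)."""
        x = matvec(self.w_in, one_hot(token, self.vocab_size))
        r = matvec(self.w_hh, h)
        return [math.tanh(a + b) for a, b in zip(x, r)]

    def encode(self, tokens):
        """Read a variable-length sequence into one fixed-length vector."""
        h = [0.0] * self.hidden_size
        for t in tokens:
            h = self._step(t, h)
        return h

    def decode(self, context, boundary=0, max_len=10):
        """Greedily unroll tokens from the context until the boundary token."""
        h, token, out = context, boundary, []
        for _ in range(max_len):
            h = self._step(token, h)
            scores = matvec(self.w_out, h)
            token = max(range(self.vocab_size), key=scores.__getitem__)
            if token == boundary:
                break
            out.append(token)
        return out


if __name__ == "__main__":
    model = TinySeq2Seq(vocab_size=5, hidden_size=8)
    short_ctx = model.encode([1, 2])
    long_ctx = model.encode([1, 2, 3, 4, 3])
    print(len(short_ctx), len(long_ctx))  # both equal hidden_size
    print(model.decode(long_ctx))
```

Because the weights are random, the decoded tokens are meaningless; training would adjust the three matrices so that the decoder's output matches the target sequence.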

