基于语音的基本语法：非监督深度神经网络中的自发连接 (Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks) - 专知论文

会员服务 ·

0

监督 · 深度神经网络 · 神经网络 · 计算模型 · 词嵌入 ·

2023 年 5 月 2 日

Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks

翻译：基于语音的基本语法：非监督深度神经网络中的自发连接

Gašper Beguš,Thomas Lu,Zili Wang

Computational models of syntax are predominantly text-based. Here we propose that basic syntax can be modeled directly from raw speech in a fully unsupervised way. We focus on one of the most ubiquitous and basic properties of syntax -- concatenation. We introduce spontaneous concatenation: a phenomenon where convolutional neural networks (CNNs) trained on acoustic recordings of individual words start generating outputs with two or even three words concatenated without ever accessing data with multiple words in the input. Additionally, networks trained on two words learn to embed words into novel unobserved word combinations. To our knowledge, this is a previously unreported property of CNNs trained on raw speech in the Generative Adversarial Network setting and has implications both for our understanding of how these architectures learn as well as for modeling syntax and its evolution from raw acoustic inputs.

翻译：摘要：目前，语法的计算模型主要是基于文本的。本文提出可以在完全无监督的情况下直接从原始语音中建模基本语法的观点。我们关注了语法最普遍和基本的特性之一——连接。我们介绍了自发连接：一种卷积神经网络(CNNs)的现象，在该网络中，从个体单词的声音记录训练的网络开始生成包含两个甚至三个单词的输出，并且从未访问过多个单词的输入数据。此外，在两个单词上训练的网络学习将单词嵌入到新的未见过的词组合中。据我们所知，这是在生成式对抗网络(GAN)设置下训练的语音原始数据的CNNs中以前未报告过的属性，并且它对我们了解这些体系结构的学习方式以及对语法及其从原始声学输入演化的建模具有影响。

0

相关内容

对比学习简述

专知会员服务

88+阅读 · 2021年6月29日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

43+阅读 · 2020年11月2日

NLP必读经典文献100篇

专知会员服务

123+阅读 · 2020年9月8日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

76+阅读 · 2020年7月26日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

103+阅读 · 2020年5月1日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

90+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

64+阅读 · 2019年10月9日

ICLR'23截稿, 图神经网络依然火热 (附42 篇好文整理)

ICLR'23截稿, 图神经网络依然火热 (附42 篇好文整理)

图与推荐

1+阅读 · 2022年10月5日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

26+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

【论文推荐】最新6篇生成式对抗网络（GAN）相关论文—半监督对抗学习、行人再识别、代表性特征、高分辨率深度卷积、自监督、超分辨

【论文推荐】最新6篇生成式对抗网络（GAN）相关论文—半监督对抗学习、行人再识别、代表性特征、高分辨率深度卷积、自监督、超分辨

专知

10+阅读 · 2018年2月1日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

19+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

SPARC通过磷酸化p38-MAPK信号通路调控角膜缘干细胞生物学特性的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

汉越双语事件语料库构建及舆情观点挖掘方法研究

国家自然科学基金

2+阅读 · 2014年12月31日

锂离子电池电极材料性能调控的界面效应研究

国家自然科学基金

0+阅读 · 2014年12月31日

高容量锂离子电池负极集流体泡沫铜的环境疲劳行为、损伤机理及寿命模型

国家自然科学基金

0+阅读 · 2014年12月31日

基于深度神经网络的噪声鲁棒性语音识别方法研究

国家自然科学基金

3+阅读 · 2013年12月31日

动态面孔语音情绪的整合加工及神经生理机制

国家自然科学基金

0+阅读 · 2013年12月31日

自修复型导电胶的制备和自修复效率表征研究

国家自然科学基金

0+阅读 · 2013年12月31日

藏语语音合成关键技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于二维随机映射和一范数优化的有监督图像分类研究

国家自然科学基金

3+阅读 · 2011年12月31日

嵌段共聚物多级自组装的多尺度模拟

国家自然科学基金

0+阅读 · 2009年12月31日

Unsupervised Learning of Style-Aware Facial Animation from Real Acting Performances

Arxiv

0+阅读 · 2023年6月16日

Learning Transductions and Alignments with RNN Seq2seq Models

Arxiv

0+阅读 · 2023年6月15日

Emotional Speech-Driven Animation with Content-Emotion Disentanglement

Arxiv

0+阅读 · 2023年6月15日

Diffusion Models in Vision: A Survey

Arxiv

29+阅读 · 2022年9月10日

Graph Structure Learning with Variational Information Bottleneck

Arxiv

11+阅读 · 2021年12月16日

Similarity and Matching of Neural Network Representations

Arxiv

10+阅读 · 2021年10月27日

Time-Series Event Prediction with Evolutionary State Graph

Arxiv

14+阅读 · 2020年11月25日

Spectral Clustering with Graph Neural Networks for Graph Pooling

Arxiv

25+阅读 · 2020年6月3日

Dynamic Graph Representation Learning via Self-Attention Networks

Arxiv

52+阅读 · 2019年6月15日

Simplifying Graph Convolutional Networks

Simplifying Graph Convolutional Networks

Arxiv

12+阅读 · 2019年2月19日

VIP会员

文章信息

相关主题

深度神经网络

相关VIP内容

对比学习简述

专知会员服务

88+阅读 · 2021年6月29日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

43+阅读 · 2020年11月2日

NLP必读经典文献100篇

专知会员服务

123+阅读 · 2020年9月8日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

76+阅读 · 2020年7月26日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

103+阅读 · 2020年5月1日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

90+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

64+阅读 · 2019年10月9日

热门VIP内容

相关资讯

ICLR'23截稿, 图神经网络依然火热 (附42 篇好文整理)

ICLR'23截稿, 图神经网络依然火热 (附42 篇好文整理)

图与推荐

1+阅读 · 2022年10月5日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

17+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

26+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

【论文推荐】最新6篇生成式对抗网络（GAN）相关论文—半监督对抗学习、行人再识别、代表性特征、高分辨率深度卷积、自监督、超分辨

【论文推荐】最新6篇生成式对抗网络（GAN）相关论文—半监督对抗学习、行人再识别、代表性特征、高分辨率深度卷积、自监督、超分辨

专知

10+阅读 · 2018年2月1日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

19+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

相关论文

Unsupervised Learning of Style-Aware Facial Animation from Real Acting Performances

Arxiv

0+阅读 · 2023年6月16日

Learning Transductions and Alignments with RNN Seq2seq Models

Arxiv

0+阅读 · 2023年6月15日

Emotional Speech-Driven Animation with Content-Emotion Disentanglement

Arxiv

0+阅读 · 2023年6月15日

Diffusion Models in Vision: A Survey

Arxiv

29+阅读 · 2022年9月10日

Graph Structure Learning with Variational Information Bottleneck

Arxiv

11+阅读 · 2021年12月16日

Similarity and Matching of Neural Network Representations

Arxiv

10+阅读 · 2021年10月27日

Time-Series Event Prediction with Evolutionary State Graph

Arxiv

14+阅读 · 2020年11月25日

Spectral Clustering with Graph Neural Networks for Graph Pooling

Arxiv

25+阅读 · 2020年6月3日

Dynamic Graph Representation Learning via Self-Attention Networks

Arxiv

52+阅读 · 2019年6月15日

Simplifying Graph Convolutional Networks

Simplifying Graph Convolutional Networks

Arxiv

12+阅读 · 2019年2月19日

相关基金

SPARC通过磷酸化p38-MAPK信号通路调控角膜缘干细胞生物学特性的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

汉越双语事件语料库构建及舆情观点挖掘方法研究

国家自然科学基金

2+阅读 · 2014年12月31日

锂离子电池电极材料性能调控的界面效应研究

国家自然科学基金

0+阅读 · 2014年12月31日

高容量锂离子电池负极集流体泡沫铜的环境疲劳行为、损伤机理及寿命模型

国家自然科学基金

0+阅读 · 2014年12月31日

基于深度神经网络的噪声鲁棒性语音识别方法研究

国家自然科学基金

3+阅读 · 2013年12月31日

动态面孔语音情绪的整合加工及神经生理机制

国家自然科学基金

0+阅读 · 2013年12月31日

自修复型导电胶的制备和自修复效率表征研究

国家自然科学基金

0+阅读 · 2013年12月31日

藏语语音合成关键技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于二维随机映射和一范数优化的有监督图像分类研究

国家自然科学基金

3+阅读 · 2011年12月31日

嵌段共聚物多级自组装的多尺度模拟

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员