DDKtor: 自动地心动反应器分析 (DDKtor: Automatic Diadochokinetic Speech Analysis) - 专知论文

会员服务 ·

0

Analysis · MoDELS · 长短期记忆网络 · 讲稿 · 层 ·

2022 年 6 月 29 日

DDKtor: Automatic Diadochokinetic Speech Analysis

翻译：DDKtor: 自动地心动反应器分析

Yael Segal,Kasia Hitczenko,Matthew Goldrick,Adam Buchwald,Angela Roberts,Joseph Keshet

from arxiv, Accepted to Interspeech 2022

Diadochokinetic speech tasks (DDK), in which participants repeatedly produce syllables, are commonly used as part of the assessment of speech motor impairments. These studies rely on manual analyses that are time-intensive, subjective, and provide only a coarse-grained picture of speech. This paper presents two deep neural network models that automatically segment consonants and vowels from unannotated, untranscribed speech. Both models work on the raw waveform and use convolutional layers for feature extraction. The first model is based on an LSTM classifier followed by fully connected layers, while the second model adds more convolutional layers followed by fully connected layers. These segmentations predicted by the models are used to obtain measures of speech rate and sound duration. Results on a young healthy individuals dataset show that our LSTM model outperforms the current state-of-the-art systems and performs comparably to trained human annotators. Moreover, the LSTM model also presents comparable results to trained human annotators when evaluated on unseen older individuals with Parkinson's Disease dataset.

翻译：参与者反复制作音调的DDK(DDK), 参与者反复制作音调, 通常作为语言运动障碍评估的一部分使用。这些研究依靠的是时间密集、主观的人工分析,仅提供粗略的语音图象。本文展示了两种深神经网络模型,这些模型自动分离出未加注注解、未经调试的语音和元音。两种模型在原始波形上工作,并使用变动层进行特征提取。第一个模型以LSTM分类器为基础,然后是完全相连的层, 而第二个模型则增加了更多的卷动层,然后是完全相连的层。这些模型预测的分层被用来测量语音速率和声音持续时间。一个年轻健康的个体数据集的结果显示,我们的LSTM模型超越了当前的最新系统,并且可以比较到培训人类警告器。此外, LSTM模型还展示了在用Parkinson疾病数据集对看不见的老年人进行评估时, 培训的人类警告员的类似结果。

0

相关内容

Analysis

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

具有输入时滞的柔性结构系统时滞辨识及自适应控制研究

国家自然科学基金

0+阅读 · 2014年12月31日

创伤后应激障碍与催产素及其受体通路基因多态性的关联性研究

国家自然科学基金

0+阅读 · 2014年12月31日

氧化应激在Leydig细胞老化易感性的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

非亏格1中心的平面二次系统的极限环分支问题

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

Partial Spread Bent函数与Bent-Negabent函数的构造及密码学性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

从内质网应激介导的CHOP凋亡途径探讨BPD发生机制

国家自然科学基金

0+阅读 · 2012年12月31日

Chemerin通过P38MAPK途径介导糖尿病肾病及硫辛酸干预研究

国家自然科学基金

0+阅读 · 2012年12月31日

内质网应激及UPR信号通路在支气管肺发育不良中的作用及IGF-1的干预

国家自然科学基金

0+阅读 · 2011年12月31日

微腔中多重EIT固态系统的量子非线性和量子关联效应研究

国家自然科学基金

0+阅读 · 2011年12月31日

Hierarchical Compositional Representations for Few-shot Action Recognition

Arxiv

0+阅读 · 2022年8月19日

Deep Learning for Choice Modeling

Deep Learning for Choice Modeling

Arxiv

0+阅读 · 2022年8月19日

Representation Learning for the Automatic Indexing of Sound Effects Libraries

Arxiv

0+阅读 · 2022年8月18日

Playing for 3D Human Recovery

Arxiv

0+阅读 · 2022年8月18日

Automatic laser steering for middle ear surgery

Automatic laser steering for middle ear surgery

Arxiv

0+阅读 · 2022年8月18日

Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition

Arxiv

0+阅读 · 2022年8月17日

ParaColorizer: Realistic Image Colorization using Parallel Generative Networks

Arxiv

0+阅读 · 2022年8月17日

Meta Learning for End-to-End Low-Resource Speech Recognition

Meta Learning for End-to-End Low-Resource Speech Recognition

Arxiv

20+阅读 · 2019年10月26日

Aspect Based Sentiment Analysis with Gated Convolutional Networks

Arxiv

12+阅读 · 2018年5月18日

Deep Active Learning for Named Entity Recognition

Arxiv

15+阅读 · 2018年2月4日

VIP会员

文章信息

相关主题

长短期记忆网络

相关VIP内容

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【ACL2025教程】大语言模型的护栏与安全性：对其应用的安全、可靠与可控引导

《实现协同自主：从人机协作到多智能体系统》最新190页

【ICML2025】SToFM：一种用于空间转录组学的多尺度基础模型

通信网络智能体白皮书V1.0，61页pdf

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Hierarchical Compositional Representations for Few-shot Action Recognition

Arxiv

0+阅读 · 2022年8月19日

Deep Learning for Choice Modeling

Deep Learning for Choice Modeling

Arxiv

0+阅读 · 2022年8月19日

Representation Learning for the Automatic Indexing of Sound Effects Libraries

Arxiv

0+阅读 · 2022年8月18日

Playing for 3D Human Recovery

Arxiv

0+阅读 · 2022年8月18日

Automatic laser steering for middle ear surgery

Automatic laser steering for middle ear surgery

Arxiv

0+阅读 · 2022年8月18日

Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition

Arxiv

0+阅读 · 2022年8月17日

ParaColorizer: Realistic Image Colorization using Parallel Generative Networks

Arxiv

0+阅读 · 2022年8月17日

Meta Learning for End-to-End Low-Resource Speech Recognition

Meta Learning for End-to-End Low-Resource Speech Recognition

Arxiv

20+阅读 · 2019年10月26日

Aspect Based Sentiment Analysis with Gated Convolutional Networks

Arxiv

12+阅读 · 2018年5月18日

Deep Active Learning for Named Entity Recognition

Arxiv

15+阅读 · 2018年2月4日

相关基金

具有输入时滞的柔性结构系统时滞辨识及自适应控制研究

国家自然科学基金

0+阅读 · 2014年12月31日

创伤后应激障碍与催产素及其受体通路基因多态性的关联性研究

国家自然科学基金

0+阅读 · 2014年12月31日

氧化应激在Leydig细胞老化易感性的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

非亏格1中心的平面二次系统的极限环分支问题

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

Partial Spread Bent函数与Bent-Negabent函数的构造及密码学性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

从内质网应激介导的CHOP凋亡途径探讨BPD发生机制

国家自然科学基金

0+阅读 · 2012年12月31日

Chemerin通过P38MAPK途径介导糖尿病肾病及硫辛酸干预研究

国家自然科学基金

0+阅读 · 2012年12月31日

内质网应激及UPR信号通路在支气管肺发育不良中的作用及IGF-1的干预

国家自然科学基金

0+阅读 · 2011年12月31日

微腔中多重EIT固态系统的量子非线性和量子关联效应研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员