比较对遥感收集的语音数据进行声学分析 (Comparing acoustic analyses of speech data collected remotely) - 专知论文

会员服务 ·

0

Performer · Better · Awesome (软件) · Next · COVID-19 ·

2021 年 3 月 1 日

Comparing acoustic analyses of speech data collected remotely

翻译：比较对遥感收集的语音数据进行声学分析

Cong Zhang,Kathleen Jepson,Georg Lohfink,Amalia Arvaniti

from arxiv, 20 pages, 3 figures

Face-to-face speech data collection has been next to impossible globally due to COVID-19 restrictions. To address this problem, simultaneous recordings of three repetitions of the cardinal vowels were made using a Zoom H6 Handy Recorder with external microphone (henceforth H6) and compared with two alternatives accessible to potential participants at home: the Zoom meeting application (henceforth Zoom) and two lossless mobile phone applications (Awesome Voice Recorder, and Recorder; henceforth Phone). F0 was tracked accurately by all devices; however, for formant analysis (F1, F2, F3) Phone performed better than Zoom, i.e. more similarly to H6. Zoom recordings also exhibited unexpected drops in intensity. The results suggest that lossless format phone recordings present a viable option for at least some phonetic studies.

翻译：由于COVID-19的限制,收集面对面的语音数据在全球几乎是不可能的。为了解决这个问题,同时用外麦克风(此后为H6)的Zoom H6手动录音机录制了三次重写基调元音的录音,与国内潜在与会者可以使用的两种替代办法相比:缩放会议应用程序(此后为Zoom)和两个无损移动电话应用程序(Aweome语音录音机和录音机;此后为Phone)。所有设备都准确地跟踪了F0;但是,对于形成分析(F1、F2、F3),电话的制作效果比Zoom要好,即更接近H6。缩放录音的强度也出乎意料地下降。结果显示,无损式电话录音至少为一些语音研究提供了一个可行的选项。

0

相关内容

Performer

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

CCF推荐 | 国际会议信息6条

CCF推荐 | 国际会议信息6条

Call4Papers

9+阅读 · 2019年8月13日

CCF推荐 | 国际会议信息8条

CCF推荐 | 国际会议信息8条

Call4Papers

9+阅读 · 2019年5月23日

已删除

将门创投

5+阅读 · 2018年7月25日

Nonlinear Spatial Filtering in Multichannel Speech Enhancement

Nonlinear Spatial Filtering in Multichannel Speech Enhancement

Arxiv

0+阅读 · 2021年4月22日

Compression with the tudocomp Framework

Arxiv

0+阅读 · 2021年4月22日

Accented Speech Recognition: A Survey

Arxiv

0+阅读 · 2021年4月21日

A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment

Arxiv

3+阅读 · 2018年9月4日

3D Reconstruction in Canonical Co-ordinate Space from Arbitrarily Oriented 2D Images

Arxiv

4+阅读 · 2018年1月23日

VIP会员

文章信息

相关主题

Awesome (软件)

相关VIP内容

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

因果强化学习的统一框架：综述、分类体系、算法与应用

《无人机系统 - 反无人机系统：测试方法》364页

【MIT博士论文】语言模型的推理时学习算法

美军低成本无人作战攻击系统（LUCAS）：扩大无人机战争规模

相关资讯

CCF推荐 | 国际会议信息6条

CCF推荐 | 国际会议信息6条

Call4Papers

9+阅读 · 2019年8月13日

CCF推荐 | 国际会议信息8条

CCF推荐 | 国际会议信息8条

Call4Papers

9+阅读 · 2019年5月23日

已删除

将门创投

5+阅读 · 2018年7月25日

相关论文

Nonlinear Spatial Filtering in Multichannel Speech Enhancement

Nonlinear Spatial Filtering in Multichannel Speech Enhancement

Arxiv

0+阅读 · 2021年4月22日

Compression with the tudocomp Framework

Arxiv

0+阅读 · 2021年4月22日

Accented Speech Recognition: A Survey

Arxiv

0+阅读 · 2021年4月21日

A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment

Arxiv

3+阅读 · 2018年9月4日

3D Reconstruction in Canonical Co-ordinate Space from Arbitrarily Oriented 2D Images

Arxiv

4+阅读 · 2018年1月23日

微信扫码咨询专知VIP会员