Hi,KIA:语音情感识别数据集,用于唤醒的言词 (Hi,KIA: A Speech Emotion Recognition Dataset for Wake-Up Words) - 专知论文

会员服务 ·

0

INFORMS · 数据集 · 分类模型 · 标注 · MoDELS ·

2022 年 11 月 7 日

Hi,KIA: A Speech Emotion Recognition Dataset for Wake-Up Words

翻译：Hi,KIA:语音情感识别数据集,用于唤醒的言词

Taesu Kim,SeungHeon Doh,Gyunpyo Lee,Hyungseok Jeon,Juhan Nam,Hyeon-Jeong Suk

from arxiv, Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2022

Wake-up words (WUW) is a short sentence used to activate a speech recognition system to receive the user's speech input. WUW utterances include not only the lexical information for waking up the system but also non-lexical information such as speaker identity or emotion. In particular, recognizing the user's emotional state may elaborate the voice communication. However, there is few dataset where the emotional state of the WUW utterances is labeled. In this paper, we introduce Hi, KIA, a new WUW dataset which consists of 488 Korean accent emotional utterances collected from four male and four female speakers and each of utterances is labeled with four emotional states including anger, happy, sad, or neutral. We present the step-by-step procedure to build the dataset, covering scenario selection, post-processing, and human validation for label agreement. Also, we provide two classification models for WUW speech emotion recognition using the dataset. One is based on traditional hand-craft features and the other is a transfer-learning approach using a pre-trained neural network. These classification models could be used as benchmarks in further research.

翻译：觉醒单词(WUW)是用于启动语音识别系统的短句,以接收用户的语音输入。 WUW的语句不仅包括唤醒系统所需的词汇信息,而且还包括声音身份或情绪等非历史信息。特别是,承认用户的情绪状态可能会详细描述语音通信。然而,在WUW的语句的情感状态贴上标签的数据集中,很少有。我们在此文件中,我们引入了Hi、KIA,一个新的W数据集,由4位男性和4位女性发言者收集的488韩国口音情感发音组成,每个语句都有四个情感状态的标签,包括愤怒、快乐、悲伤或中性。我们介绍了建立数据集的逐步程序,涵盖情景选择、后处理和标签协议的人类验证。我们还提供了两个WUW语音情绪使用数据集识别分类模型。一个是基于传统的手工艺特征,另一个是基于使用预先培训的神经网络的转移学习方法。这些分类模型可以用作进一步研究的基准。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

【ICDAR2019教程】模式识别和文档图像中基于图的方法，Graph-based Methods in Pattern Recognition and Document Image Analysis

【ICDAR2019教程】模式识别和文档图像中基于图的方法，Graph-based Methods in Pattern Recognition and Document Image Analysis

专知会员服务

30+阅读 · 2019年9月20日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

滑弧放电合成纳米晶TiO2光催化剂及其等离子体特性研究

国家自然科学基金

0+阅读 · 2014年12月31日

新型c-Met肺癌靶向分子探针研制及纳米药物治疗评价

国家自然科学基金

0+阅读 · 2013年12月31日

99mTc标记树状大分子包裹金纳米颗粒偶联Duramycin对肿瘤化疗诱导细胞凋亡的分子影像学研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于介孔石墨相氮化碳材料的多维传感器研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向DDS的自驱动Pt纳米机器人运动控制机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

纳米铁炭微电解体系去除地下水中硝酸盐的机理及应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

纳米金属-氧化物-硅复合薄膜波导的光致电导特性和应用研究

国家自然科学基金

0+阅读 · 2011年12月31日

非Pt过渡金属硫化物纳米空球催化氧还原的电化学和拉曼光谱研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于99mTc示踪和SPECT成像技术的纳米树状分子体内肿瘤靶向载药行为研究

国家自然科学基金

0+阅读 · 2009年12月31日

A First Look at Dataset Bias in License Plate Recognition

Arxiv

0+阅读 · 2022年12月30日

Learned Hierarchical B-frame Coding with Adaptive Feature Modulation for YUV 4:2:0 Content

Arxiv

0+阅读 · 2022年12月29日

Speech Synthesis with Mixed Emotions

Arxiv

0+阅读 · 2022年12月28日

Feature Selection Approaches for Optimising Music Emotion Recognition Methods

Arxiv

0+阅读 · 2022年12月27日

FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling

Arxiv

0+阅读 · 2022年12月27日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

Hybrid Curriculum Learning for Emotion Recognition in Conversation

Arxiv

14+阅读 · 2021年12月22日

Affective Image Content Analysis: Two Decades Review and New Perspectives

Arxiv

16+阅读 · 2021年6月30日

Meta Learning for End-to-End Low-Resource Speech Recognition

Meta Learning for End-to-End Low-Resource Speech Recognition

Arxiv

20+阅读 · 2019年10月26日

A Survey on Deep Learning for Named Entity Recognition

A Survey on Deep Learning for Named Entity Recognition

Arxiv

73+阅读 · 2018年12月22日

VIP会员

文章信息

相关主题

相关VIP内容

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

【ICDAR2019教程】模式识别和文档图像中基于图的方法，Graph-based Methods in Pattern Recognition and Document Image Analysis

【ICDAR2019教程】模式识别和文档图像中基于图的方法，Graph-based Methods in Pattern Recognition and Document Image Analysis

专知会员服务

30+阅读 · 2019年9月20日

热门VIP内容

开通专知VIP会员享更多权益服务

全球AI工具市场发展现状与趋势分析2025

自动驾驶地图：全流程综述与前沿进展

协同智能体：多智能体人工智能系统如何变革军事训练及其他领域

【NeurIPS2025】TITAN：一种面向轨迹感知的大规模 VQE 自适应参数冻结技术

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

A First Look at Dataset Bias in License Plate Recognition

Arxiv

0+阅读 · 2022年12月30日

Learned Hierarchical B-frame Coding with Adaptive Feature Modulation for YUV 4:2:0 Content

Arxiv

0+阅读 · 2022年12月29日

Speech Synthesis with Mixed Emotions

Arxiv

0+阅读 · 2022年12月28日

Feature Selection Approaches for Optimising Music Emotion Recognition Methods

Arxiv

0+阅读 · 2022年12月27日

FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling

Arxiv

0+阅读 · 2022年12月27日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

Hybrid Curriculum Learning for Emotion Recognition in Conversation

Arxiv

14+阅读 · 2021年12月22日

Affective Image Content Analysis: Two Decades Review and New Perspectives

Arxiv

16+阅读 · 2021年6月30日

Meta Learning for End-to-End Low-Resource Speech Recognition

Meta Learning for End-to-End Low-Resource Speech Recognition

Arxiv

20+阅读 · 2019年10月26日

A Survey on Deep Learning for Named Entity Recognition

A Survey on Deep Learning for Named Entity Recognition

Arxiv

73+阅读 · 2018年12月22日

相关基金

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

滑弧放电合成纳米晶TiO2光催化剂及其等离子体特性研究

国家自然科学基金

0+阅读 · 2014年12月31日

新型c-Met肺癌靶向分子探针研制及纳米药物治疗评价

国家自然科学基金

0+阅读 · 2013年12月31日

99mTc标记树状大分子包裹金纳米颗粒偶联Duramycin对肿瘤化疗诱导细胞凋亡的分子影像学研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于介孔石墨相氮化碳材料的多维传感器研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向DDS的自驱动Pt纳米机器人运动控制机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

纳米铁炭微电解体系去除地下水中硝酸盐的机理及应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

纳米金属-氧化物-硅复合薄膜波导的光致电导特性和应用研究

国家自然科学基金

0+阅读 · 2011年12月31日

非Pt过渡金属硫化物纳米空球催化氧还原的电化学和拉曼光谱研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于99mTc示踪和SPECT成像技术的纳米树状分子体内肿瘤靶向载药行为研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员