语音演示攻击探测和最近进展介绍 (Introduction to Voice Presentation Attack Detection and Recent Advances)

Over the past few years significant progress has been made in the field of presentation attack detection (PAD) for automatic speaker recognition (ASV). This includes the development of new speech corpora, standard evaluation protocols and advancements in front-end feature extraction and back-end classifiers. The use of standard databases and evaluation protocols has enabled for the first time the meaningful benchmarking of different PAD solutions. This chapter summarises the progress, with a focus on studies completed in the last three years. The article presents a summary of findings and lessons learned from two ASVspoof challenges, the first community-led benchmarking efforts. These show that ASV PAD remains an unsolved problem and that further attention is required to develop generalised PAD solutions which have potential to detect diverse and previously unseen spoofing attacks.

翻译：过去几年来,在自动识别扬声器的演示攻击探测(PAD)领域取得了显著进展,包括开发了新的语音组合、标准评价规程和前端特征提取和后端分类器的进步;使用标准数据库和评价规程首次使不同的PAD解决方案有了有意义的基准;本章总结了进展情况,重点是过去三年完成的研究;文章总结了从两个ASVspoof挑战中得出的调查结果和经验教训,这是社区牵头的第一个基准工作;这些都表明,ASVPAD仍然是一个尚未解决的问题,需要进一步注意制定通用的PAD解决方案,这些解决方案有可能发现各种先前看不见的攻击。

相关内容

声纹识别

关注 443

说话人识别（Speaker Recognition），或者称为声纹识别（Voiceprint Recognition, VPR），是根据语音中所包含的说话人个性信息，利用计算机以及现在的信息识别技术，自动鉴别说话人身份的一种生物特征识别技术。说话人识别研究的目的就是从语音中提取具有说话人表征性的特征，建立有效的模型和系统，实现自动精准的说话人鉴别。

【深度学习社区检测】Deep Learning for Community Detection: Progress, Challenges and Opportunities

专知会员服务

27+阅读 · 2020年6月13日

【深度伪造综述论文】The Creation and Detection of Deepfakes: A Survey

专知会员服务

54+阅读 · 2020年4月26日

【NUS】神经问题生成的最近进展（Recent Advances in Neural Question Generation）

专知会员服务

15+阅读 · 2019年12月22日