根据EEEG的光谱空间空间特征探测低长听觉空间注意力 (Low-latency auditory spatial attention detection based on spectro-spatial features from EEG)

Detecting auditory attention based on brain signals enables many everyday applications, and serves as part of the solution to the cocktail party effect in speech processing. Several studies leverage the correlation between brain signals and auditory stimuli to detect the auditory attention of listeners. Recently, studies show that the alpha band (8-13 Hz) EEG signals enable the localization of auditory stimuli. We believe that it is possible to detect auditory spatial attention without the need of auditory stimuli as references. In this work, we use alpha power signals for automatic auditory spatial attention detection. To the best of our knowledge, this is the first attempt to detect spatial attention based on alpha power neural signals. We propose a spectro-spatial feature extraction technique to detect the auditory spatial attention (left/right) based on the topographic specificity of alpha power. Experiments show that the proposed neural approach achieves 81.7% and 94.6% accuracy for 1-second and 10-second decision windows, respectively. Our comparative results show that this neural approach outperforms other competitive models by a large margin in all test cases.

翻译：根据大脑信号检测听觉的注意,可以进行许多日常应用,并成为语音处理中鸡尾酒效应解决方案的一部分。一些研究利用大脑信号和听觉刺激的关联性来检测听众的听觉注意。最近,研究显示,阿尔法波段(8-13赫兹) EEG信号可以使听觉刺激具有地方性。我们认为,在不需要以听觉刺激作为参考的情况下,可以探测听觉空间注意。在这项工作中,我们使用阿尔法功率信号来自动检测空间注意。根据我们的知识,这是首次尝试检测以阿尔法电线信号为基础的空间注意。我们建议了光谱空间特征提取技术,以根据阿尔法力的地形特性探测听觉空间注意(左/右)。实验显示,拟议的神经方法在1秒和10秒决定窗口中分别达到81.7%和94.6%的准确度。我们的比较结果显示,这种神经方法在所有测试案例中都大大超过其他竞争性模型。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

商业数据分析，39页ppt

专知会员服务

165+阅读 · 2020年6月2日

【ACL2020-亚马逊】Transformers多分辨率和多模态语音识别，Multiresolution and Multimodal Speech Recognition with Transformers

专知会员服务

15+阅读 · 2020年5月5日

【ICLR-2020】网络反卷积，NETWORK DECONVOLUTION

专知会员服务

39+阅读 · 2020年2月21日