Robust and accurate decoding of gesture from non-invasive surface electromyography (sEMG) is important for various applications including spatial computing, healthcare, and entertainment, and has been actively pursued by researchers and industry. Majority of sEMG-based gesture decoding algorithms employ deep neural networks that are designed for Euclidean data, and may not be suitable for analyzing multi-dimensional, non-stationary time-series with long-range dependencies such as sEMG. State-of-the-art sEMG-based decoding methods also demonstrate high variability across subjects and sessions, requiring re-calibration and adaptive fine-tuning to boost performance. To address these shortcomings, this work proposes a geometric deep learning model that learns on symmetric positive definite (SPD) manifolds and leverages unsupervised domain adaptation to desensitize the model to subjects and sessions. The model captures the features in time and across sensors with multiple kernels, projects the features onto SPD manifold, learns on manifolds and projects back to Euclidean space for classification. It uses a domain-specific batch normalization layer to address variability between sessions, alleviating the need for re-calibration or fine-tuning. Experiments with publicly available benchmark gesture decoding datasets (Ninapro DB6, Flexwear-HD) demonstrate the superior generalizability of the model compared to Euclidean and other SPD-based models in the inter-session scenario, with up to 8.83 and 4.63 points improvement in accuracy, respectively. Detailed analyses reveal that the model extracts muscle-specific information for different tasks and ablation studies highlight the importance of modules introduced in the work. The proposed method pushes the state-of-the-art in sEMG-based gesture recognition and opens new research avenues for manifold-based learning for muscle signals.
翻译:基于非侵入式表面肌电信号(sEMG)实现鲁棒且准确的手势解码,对于空间计算、医疗保健和娱乐等多种应用至关重要,并已受到学术界与工业界的积极关注。当前多数基于sEMG的手势解码算法采用为欧几里得数据设计的深度神经网络,可能不适用于分析具有长程依赖关系的多维非平稳时间序列(如sEMG信号)。最先进的基于sEMG的解码方法在不同受试者与实验会话间也表现出高度变异性,常需重新校准与自适应微调以提升性能。为克服这些局限,本研究提出一种几何深度学习模型,该模型在对称正定(SPD)流形上进行学习,并利用无监督域适应技术降低模型对受试者与会话的敏感性。该模型通过多核机制捕获时间维度及传感器间的特征,将特征投影至SPD流形,在流形空间学习后再投影回欧几里得空间进行分类。模型采用特定域的批归一化层处理会话间变异性,从而减少重新校准或微调的需求。在公开基准手势解码数据集(Ninapro DB6、Flexwear-HD)上的实验表明,在跨会话场景下,相比欧几里得模型及其他基于SPD的模型,本模型展现出更优的泛化能力,准确率分别最高提升8.83与4.63个百分点。深入分析显示,该模型能针对不同任务提取肌肉特异性信息,消融实验则验证了本文所引入模块的重要性。所提方法推动了基于sEMG的手势识别技术的前沿发展,并为肌电信号的流形学习开辟了新的研究路径。