Recognizing an activity with a single reference sample using metric learning approaches is a promising research field. The majority of few-shot methods focus on object recognition or face identification. We propose a metric learning approach that reduces the action recognition problem to a nearest neighbor search in embedding space. We encode signals as images, extract features using a deep residual CNN, and learn a feature embedding with a triplet loss. The resulting encoder maps features into an embedding space in which smaller distances encode similar actions and larger distances encode different actions. Our approach is based on a signal-level formulation and remains flexible across a variety of modalities. It outperforms the baseline on the large-scale NTU RGB+D 120 dataset for the One-Shot action recognition protocol by 5.6%. With just 60% of the training data, our approach still outperforms the baseline by 3.7%; with 40% of the training data, it performs comparably to the second-best follow-up approach. Further, we show that our approach generalizes well in experiments on the UTD-MHAD dataset for inertial, skeleton, and fused data, and on the Simitate dataset for motion capture data. Our inter-joint and inter-sensor experiments additionally suggest good capabilities on previously unseen setups.
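The core idea above, learning an embedding with a triplet loss and then classifying a query by nearest-neighbor search against single reference samples, can be sketched as follows. This is a minimal illustration assuming precomputed embedding vectors; the function names, toy 2-D embeddings, and class labels are hypothetical and not taken from the paper.

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.2):
    """Standard triplet loss: pull the anchor toward the positive sample
    and push it away from the negative sample by at least `margin`."""
    d_pos = np.linalg.norm(anchor - positive)
    d_neg = np.linalg.norm(anchor - negative)
    return max(0.0, d_pos - d_neg + margin)

def one_shot_classify(query_emb, reference_embs, reference_labels):
    """One-shot classification: assign the label of the nearest
    reference embedding (one reference sample per action class)."""
    dists = np.linalg.norm(reference_embs - query_emb, axis=1)
    return reference_labels[int(np.argmin(dists))]

# Toy example: one reference embedding per action class (illustrative only).
refs = np.array([[0.0, 1.0], [1.0, 0.0]])
labels = ["wave", "kick"]
query = np.array([0.1, 0.9])
print(one_shot_classify(query, refs, labels))  # → wave
```

In training, the triplet loss shapes the embedding space so that this simple nearest-neighbor rule becomes an effective classifier, which is what allows recognition from a single reference sample per class.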