自监督辅助损失在基于音乐相似度检索和自动标记中的度量学习中的应用 (Self-supervised Auxiliary Loss for Metric Learning in Music Similarity-based Retrieval and Auto-tagging) - 专知论文

会员服务 ·

0

监督 · 相似度 · 度量学习 · 音乐 · 度量 ·

2023 年 4 月 15 日

Self-supervised Auxiliary Loss for Metric Learning in Music Similarity-based Retrieval and Auto-tagging

翻译：自监督辅助损失在基于音乐相似度检索和自动标记中的度量学习中的应用

Taketo Akama,Hiroaki Kitano,Katsuhiro Takematsu,Yasushi Miyajima,Natalia Polouliakh

from arxiv, 11 pages

In the realm of music information retrieval, similarity-based retrieval and auto-tagging serve as essential components. Given the limitations and non-scalability of human supervision signals, it becomes crucial for models to learn from alternative sources to enhance their performance. Self-supervised learning, which exclusively relies on learning signals derived from music audio data, has demonstrated its efficacy in the context of auto-tagging. In this study, we propose a model that builds on the self-supervised learning approach to address the similarity-based retrieval challenge by introducing our method of metric learning with a self-supervised auxiliary loss. Furthermore, diverging from conventional self-supervised learning methodologies, we discovered the advantages of concurrently training the model with both self-supervision and supervision signals, without freezing pre-trained models. We also found that refraining from employing augmentation during the fine-tuning phase yields better results. Our experimental results confirm that the proposed methodology enhances retrieval and tagging performance metrics in two distinct scenarios: one where human-annotated tags are consistently available for all music tracks, and another where such tags are accessible only for a subset of tracks.

翻译：在音乐信息检索领域，基于相似度的检索和自动标记是必不可少的组成部分。鉴于人工监督信号的限制性和不可扩展性，让模型从替代来源中学习来提升其性能变得至关重要。自监督学习专门依赖于从音乐音频数据中派生的学习信号，在自动标记的背景下已经证明了其有效性。在本研究中，我们提出了一种模型，建立在自监督学习方法的基础上，通过引入自监督辅助损失的度量学习方法来解决基于相似度的检索挑战。此外，我们发现与传统的自监督学习方法不同，同时训练模型使用自监督和监督信号，而不是冻结预训练模型，有益于提高性能。我们还发现，在微调阶段避免使用数据增强可以获得更好的结果。我们的实验结果证实了所提出的方法在两种不同场景下均可提高检索和标记性能指标：一种场景是所有音乐曲目都有人工标注的标记，另一种场景是仅对一部分曲目有这些标记。

0

相关内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

专知会员服务

15+阅读 · 2022年3月12日

【ICCV2021】参数化对比学习

专知会员服务

33+阅读 · 2021年7月27日

【AAAI2021】学习场景图之间的相似度实现图像到图像的检索

【AAAI2021】学习场景图之间的相似度实现图像到图像的检索

专知会员服务

38+阅读 · 2021年1月3日

【ACL2020-Google】BLEURT:一种基于迁移学习的自然语言生成度量

【ACL2020-Google】BLEURT:一种基于迁移学习的自然语言生成度量

专知会员服务

20+阅读 · 2020年5月12日

【ACL2020-Google】学习鲁棒度量的文本生成，BLEURT: Learning Robust Metrics for Text Generation

【ACL2020-Google】学习鲁棒度量的文本生成，BLEURT: Learning Robust Metrics for Text Generation

专知会员服务

17+阅读 · 2020年4月10日

【CVPR2020-英伟达】从图像集合中学习自监督视点，Self-Supervised Viewpoint Learning From Image Collections

【CVPR2020-英伟达】从图像集合中学习自监督视点，Self-Supervised Viewpoint Learning From Image Collections

专知会员服务

24+阅读 · 2020年4月4日

【CVPR2020-哈工大-京东】自监督结构建模的目标识别，Self-supervised Structure Modeling

【CVPR2020-哈工大-京东】自监督结构建模的目标识别，Self-supervised Structure Modeling

专知会员服务

43+阅读 · 2020年4月1日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

专知

20+阅读 · 2018年4月7日

【泡泡一分钟】使用深度神经网络提取局部特征的大规模图像检索算法(ICCV-2)

【泡泡一分钟】使用深度神经网络提取局部特征的大规模图像检索算法(ICCV-2)

泡泡机器人SLAM

16+阅读 · 2018年2月10日

【论文推荐】最新5篇信息抽取（IE）相关论文—开放信息抽取、不完整信息、主动学习、越南语、依存分析

【论文推荐】最新5篇信息抽取（IE）相关论文—开放信息抽取、不完整信息、主动学习、越南语、依存分析

专知

12+阅读 · 2018年2月2日

非约束环境下的人脸图像预处理计算模型与方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于深度学习的音乐特征学习与分类

国家自然科学基金

7+阅读 · 2014年12月31日

听力损伤评价方法及计算模型

国家自然科学基金

0+阅读 · 2014年12月31日

基于局部不变特征和混合多示例学习的图像检索研究

国家自然科学基金

1+阅读 · 2013年12月31日

冗余字典下的压缩感知理论及应用研究

国家自然科学基金

1+阅读 · 2013年12月31日

运用排序和相似度学习进行基于区域的图像检索研究

国家自然科学基金

0+阅读 · 2012年12月31日

对基于随机比特序列运算的电路的自动综合算法的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于社会媒体信息挖掘的图像标注技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

利用MAGIC群体解析陆地棉重要经济性状的遗传基础

国家自然科学基金

0+阅读 · 2012年12月31日

基于多样化特征表达的生物文献自动分类研究

国家自然科学基金

0+阅读 · 2009年12月31日

Supervised Metric Learning to Rank for Retrieval via Contextual Similarity Optimization

Arxiv

0+阅读 · 2023年6月2日

Scaling Up Semi-supervised Learning with Unconstrained Unlabelled Data

Arxiv

0+阅读 · 2023年6月2日

Class Anchor Margin Loss for Content-Based Image Retrieval

Arxiv

0+阅读 · 2023年6月1日

Large Language Models Are Not Abstract Reasoners

Arxiv

0+阅读 · 2023年5月31日

A Cookbook of Self-Supervised Learning

Arxiv

15+阅读 · 2023年4月24日

Graph Self-Supervised Learning: A Survey

Arxiv

15+阅读 · 2021年8月5日

A Graph-based Relevance Matching Model for Ad-hoc Retrieval

Arxiv

11+阅读 · 2021年1月28日

Deep Image Retrieval: A Survey

Arxiv

16+阅读 · 2021年1月27日

A survey on deep hashing for image retrieval

A survey on deep hashing for image retrieval

Arxiv

15+阅读 · 2020年6月10日

Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks

Arxiv

14+阅读 · 2019年8月8日

VIP会员

文章信息

相关主题

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

专知会员服务

15+阅读 · 2022年3月12日

【ICCV2021】参数化对比学习

专知会员服务

33+阅读 · 2021年7月27日

【AAAI2021】学习场景图之间的相似度实现图像到图像的检索

【AAAI2021】学习场景图之间的相似度实现图像到图像的检索

专知会员服务

38+阅读 · 2021年1月3日

【ACL2020-Google】BLEURT:一种基于迁移学习的自然语言生成度量

【ACL2020-Google】BLEURT:一种基于迁移学习的自然语言生成度量

专知会员服务

20+阅读 · 2020年5月12日

【ACL2020-Google】学习鲁棒度量的文本生成，BLEURT: Learning Robust Metrics for Text Generation

【ACL2020-Google】学习鲁棒度量的文本生成，BLEURT: Learning Robust Metrics for Text Generation

专知会员服务

17+阅读 · 2020年4月10日

【CVPR2020-英伟达】从图像集合中学习自监督视点，Self-Supervised Viewpoint Learning From Image Collections

【CVPR2020-英伟达】从图像集合中学习自监督视点，Self-Supervised Viewpoint Learning From Image Collections

专知会员服务

24+阅读 · 2020年4月4日

【CVPR2020-哈工大-京东】自监督结构建模的目标识别，Self-supervised Structure Modeling

【CVPR2020-哈工大-京东】自监督结构建模的目标识别，Self-supervised Structure Modeling

专知会员服务

43+阅读 · 2020年4月1日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型中的检索与结构化增强生成综述

《实现多层防御多轮交战机制的扩展型随机齐射模型》2025年最新83页

【CMU博士论文】交互驱动的人体动作估计与生成

如何避免生成式人工智能在作战中失控失效

相关资讯

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

专知

20+阅读 · 2018年4月7日

【泡泡一分钟】使用深度神经网络提取局部特征的大规模图像检索算法(ICCV-2)

【泡泡一分钟】使用深度神经网络提取局部特征的大规模图像检索算法(ICCV-2)

泡泡机器人SLAM

16+阅读 · 2018年2月10日

【论文推荐】最新5篇信息抽取（IE）相关论文—开放信息抽取、不完整信息、主动学习、越南语、依存分析

【论文推荐】最新5篇信息抽取（IE）相关论文—开放信息抽取、不完整信息、主动学习、越南语、依存分析

专知

12+阅读 · 2018年2月2日

相关论文

Supervised Metric Learning to Rank for Retrieval via Contextual Similarity Optimization

Arxiv

0+阅读 · 2023年6月2日

Scaling Up Semi-supervised Learning with Unconstrained Unlabelled Data

Arxiv

0+阅读 · 2023年6月2日

Class Anchor Margin Loss for Content-Based Image Retrieval

Arxiv

0+阅读 · 2023年6月1日

Large Language Models Are Not Abstract Reasoners

Arxiv

0+阅读 · 2023年5月31日

A Cookbook of Self-Supervised Learning

Arxiv

15+阅读 · 2023年4月24日

Graph Self-Supervised Learning: A Survey

Arxiv

15+阅读 · 2021年8月5日

A Graph-based Relevance Matching Model for Ad-hoc Retrieval

Arxiv

11+阅读 · 2021年1月28日

Deep Image Retrieval: A Survey

Arxiv

16+阅读 · 2021年1月27日

A survey on deep hashing for image retrieval

A survey on deep hashing for image retrieval

Arxiv

15+阅读 · 2020年6月10日

Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks

Arxiv

14+阅读 · 2019年8月8日

相关基金

非约束环境下的人脸图像预处理计算模型与方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于深度学习的音乐特征学习与分类

国家自然科学基金

7+阅读 · 2014年12月31日

听力损伤评价方法及计算模型

国家自然科学基金

0+阅读 · 2014年12月31日

基于局部不变特征和混合多示例学习的图像检索研究

国家自然科学基金

1+阅读 · 2013年12月31日

冗余字典下的压缩感知理论及应用研究

国家自然科学基金

1+阅读 · 2013年12月31日

运用排序和相似度学习进行基于区域的图像检索研究

国家自然科学基金

0+阅读 · 2012年12月31日

对基于随机比特序列运算的电路的自动综合算法的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于社会媒体信息挖掘的图像标注技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

利用MAGIC群体解析陆地棉重要经济性状的遗传基础

国家自然科学基金

0+阅读 · 2012年12月31日

基于多样化特征表达的生物文献自动分类研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员