匹配到胜:分析在语音和音频方面高效自我监督学习的序列长度 (Match to Win: Analysing Sequences Lengths for Efficient Self-supervised Learning in Speech and Audio) - 专知论文

会员服务 ·

0

可约的 · SSL · Learning · MoDELS · Performer ·

2022 年 11 月 22 日

Match to Win: Analysing Sequences Lengths for Efficient Self-supervised Learning in Speech and Audio

翻译：匹配到胜:分析在语音和音频方面高效自我监督学习的序列长度

Yan Gao,Javier Fernandez-Marques,Titouan Parcollet,Pedro P. B. de Gusmao,Nicholas D. Lane

Self-supervised learning (SSL) has proven vital in speech and audio-related applications. The paradigm trains a general model on unlabeled data that can later be used to solve specific downstream tasks. This type of model is costly to train as it requires manipulating long input sequences that can only be handled by powerful centralised servers. Surprisingly, despite many attempts to increase training efficiency through model compression, the effects of truncating input sequence lengths to reduce computation have not been studied. In this paper, we provide the first empirical study of SSL pre-training for different specified sequence lengths and link this to various downstream tasks. We find that training on short sequences can dramatically reduce resource costs while retaining a satisfactory performance for all tasks. This simple one-line change would promote the migration of SSL training from data centres to user-end edge devices for more realistic and personalised applications.

翻译：自我监督的学习(SSL)已证明在语言和音频相关应用中至关重要。范式在未贴标签的数据上培养一个通用模型, 供日后用于解决具体的下游任务。这种模式在培训方面成本很高, 因为它需要操纵只能由强大的中央化服务器处理的长输入序列。令人惊讶的是, 尽管多次尝试通过模型压缩来提高培训效率, 但没有研究缩短输入序列长度以减少计算的效果。在本文中, 我们首次对用于不同特定序列长度的 SSL 预培训进行了经验性研究, 并将之与各种下游任务联系起来。我们发现, 短序列培训可以大幅降低资源成本, 同时保留所有任务令人满意的性能。这一简单的一行变化将促进将 SSL 培训从数据中心转移到用户端边缘设备, 以获得更现实和个性化的应用。

0

相关内容

可约的

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

126+阅读 · 2022年4月21日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

161+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

ARB抑制miR-193a表达促进早期糖尿病肾病壁层上皮细胞-足细胞转分化研究

国家自然科学基金

0+阅读 · 2015年12月31日

大型风电机组实时可靠性评估与预防维护策略研究

国家自然科学基金

0+阅读 · 2014年12月31日

蛙皮素样肽t-BBN介导的核壳型金磁性纳米粒对乳腺癌的CT和MRI靶向成像研究

国家自然科学基金

0+阅读 · 2013年12月31日

多功能诊疗分子探针多模态显像与治疗乳腺癌

国家自然科学基金

0+阅读 · 2013年12月31日

自噬在糖尿病肾病中的作用及姜黄素的干预研究

国家自然科学基金

0+阅读 · 2013年12月31日

MiR-217 microRNA的表达调节及其对胰腺导管腺癌生长影响的机制和其与患者预后关系的研究

国家自然科学基金

0+阅读 · 2012年12月31日

VEGFR2靶向超声造影定量分析评价小鼠胰腺癌的抗肿瘤治疗效果的实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于可活化胶原蛋白结合肽的肿瘤MMP-14酶活性核素显像研究

国家自然科学基金

0+阅读 · 2012年12月31日

原发性肝细胞癌微灌注及弹性模量状态与复发机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

A Benchmark Generator for Combinatorial Testing

Arxiv

0+阅读 · 2023年1月24日

SMART: Self-supervised Multi-task pretrAining with contRol Transformers

Arxiv

0+阅读 · 2023年1月24日

RF+clust for Leave-One-Problem-Out Performance Prediction

RF+clust for Leave-One-Problem-Out Performance Prediction

Arxiv

0+阅读 · 2023年1月23日

Optimized learned entropy coding parameters for practical neural-based image and video compression

Arxiv

0+阅读 · 2023年1月20日

Enabling Deep Learning on Edge Devices

Arxiv

19+阅读 · 2022年10月6日

Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation

Arxiv

11+阅读 · 2021年12月16日

A Survey of Quantization Methods for Efficient Neural Network Inference

Arxiv

22+阅读 · 2021年6月21日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

12+阅读 · 2020年6月23日

Evolving Losses for Unsupervised Video Representation Learning

Arxiv

23+阅读 · 2020年2月26日

VIP会员

文章信息

相关主题

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

126+阅读 · 2022年4月21日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

161+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《基于图神经网络、深度强化学习与概率主题建模的战略对手建模》

【CIKM2025教程】用于连续时间分析的神经微分方程

数字孪生行业报告

【CIKM2025教程】语言模型的公平性：一篇教程，170页ppt

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

A Benchmark Generator for Combinatorial Testing

Arxiv

0+阅读 · 2023年1月24日

SMART: Self-supervised Multi-task pretrAining with contRol Transformers

Arxiv

0+阅读 · 2023年1月24日

RF+clust for Leave-One-Problem-Out Performance Prediction

RF+clust for Leave-One-Problem-Out Performance Prediction

Arxiv

0+阅读 · 2023年1月23日

Optimized learned entropy coding parameters for practical neural-based image and video compression

Arxiv

0+阅读 · 2023年1月20日

Enabling Deep Learning on Edge Devices

Arxiv

19+阅读 · 2022年10月6日

Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation

Arxiv

11+阅读 · 2021年12月16日

A Survey of Quantization Methods for Efficient Neural Network Inference

Arxiv

22+阅读 · 2021年6月21日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

12+阅读 · 2020年6月23日

Evolving Losses for Unsupervised Video Representation Learning

Arxiv

23+阅读 · 2020年2月26日

相关基金

ARB抑制miR-193a表达促进早期糖尿病肾病壁层上皮细胞-足细胞转分化研究

国家自然科学基金

0+阅读 · 2015年12月31日

大型风电机组实时可靠性评估与预防维护策略研究

国家自然科学基金

0+阅读 · 2014年12月31日

蛙皮素样肽t-BBN介导的核壳型金磁性纳米粒对乳腺癌的CT和MRI靶向成像研究

国家自然科学基金

0+阅读 · 2013年12月31日

多功能诊疗分子探针多模态显像与治疗乳腺癌

国家自然科学基金

0+阅读 · 2013年12月31日

自噬在糖尿病肾病中的作用及姜黄素的干预研究

国家自然科学基金

0+阅读 · 2013年12月31日

MiR-217 microRNA的表达调节及其对胰腺导管腺癌生长影响的机制和其与患者预后关系的研究

国家自然科学基金

0+阅读 · 2012年12月31日

VEGFR2靶向超声造影定量分析评价小鼠胰腺癌的抗肿瘤治疗效果的实验研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于可活化胶原蛋白结合肽的肿瘤MMP-14酶活性核素显像研究

国家自然科学基金

0+阅读 · 2012年12月31日

原发性肝细胞癌微灌注及弹性模量状态与复发机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员