Rare Word Recognition and Translation Without Fine-Tuning via Task Vector in Speech Models

Ruihao Jing, Cheng Gong, Yu Jiang, Boyu Zhu, Shansong Liu, Chi Zhang, Xiao-Lei Zhang, Xuelong Li

Rare words remain a critical bottleneck for speech-to-text systems. While direct fine-tuning improves recognition of target words, it often incurs high cost, catastrophic forgetting, and limited scalability. To address these challenges, we propose a training-free paradigm based on task vectors for rare word recognition and translation. By defining task vectors as parameter differences and introducing word-level task vector arithmetic, our approach enables flexible composition of rare-word capabilities, greatly enhancing scalability and reusability. Extensive experiments across multiple domains show that the proposed method matches or surpasses fine-tuned models on target words, improves general performance by about 5 BLEU, and mitigates catastrophic forgetting.
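
The abstract defines a task vector as a parameter difference and composes word-level vectors by arithmetic. A minimal sketch of that idea follows, under several assumptions not stated in the abstract: each vector is taken as the difference between a base checkpoint and a checkpoint adapted to one rare word, vectors are combined by a scaled sum, and plain Python lists stand in for model tensors. The names `task_vector`, `compose`, and `scale` are hypothetical and are not taken from the paper.

```python
from typing import Dict, List

# Toy stand-in for a model: parameter name -> flattened weights.
Params = Dict[str, List[float]]


def task_vector(base: Params, adapted: Params) -> Params:
    """Task vector = adapted parameters minus base parameters."""
    return {k: [a - b for a, b in zip(adapted[k], base[k])] for k in base}


def compose(base: Params, vectors: List[Params], scale: float = 1.0) -> Params:
    """Add the scaled sum of task vectors to the base parameters.

    `scale` is an assumed knob; the abstract does not say how vectors are weighted.
    """
    merged = {k: list(v) for k, v in base.items()}
    for vec in vectors:
        for k, delta in vec.items():
            merged[k] = [m + scale * d for m, d in zip(merged[k], delta)]
    return merged


base = {"w": [1.0, 2.0, 3.0]}
adapted_word_a = {"w": [1.5, 2.0, 3.0]}  # hypothetical checkpoint for rare word A
adapted_word_b = {"w": [1.0, 2.0, 2.0]}  # hypothetical checkpoint for rare word B

tv_a = task_vector(base, adapted_word_a)
tv_b = task_vector(base, adapted_word_b)

# Compose both rare-word capabilities without any further training.
merged = compose(base, [tv_a, tv_b])
print(merged)  # {'w': [1.5, 2.0, 2.0]}
```

Because each vector is stored separately, a different set of rare words can be supported by summing a different subset of vectors, which is one plausible reading of the reusability the abstract claims.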

