多任务、多任务、多愿景-语言快速调试 (Multitask Vision-Language Prompt Tuning) - 专知论文

会员服务 ·

0

Prompt · tuning · Learning · 知识 (knowledge) · 向量化 ·

2022 年 12 月 5 日

Multitask Vision-Language Prompt Tuning

翻译：多任务、多任务、多愿景-语言快速调试

Sheng Shen,Shijia Yang,Tianjun Zhang,Bohan Zhai,Joseph E. Gonzalez,Kurt Keutzer,Trevor Darrell

from arxiv, Preprint

Prompt Tuning, conditioning on task-specific learned prompt vectors, has emerged as a data-efficient and parameter-efficient method for adapting large pretrained vision-language models to multiple downstream tasks. However, existing approaches usually consider learning prompt vectors for each task independently from scratch, thereby failing to exploit the rich shareable knowledge across different vision-language tasks. In this paper, we propose multitask vision-language prompt tuning (MVLPT), which incorporates cross-task knowledge into prompt tuning for vision-language models. Specifically, (i) we demonstrate the effectiveness of learning a single transferable prompt from multiple source tasks to initialize the prompt for each target task; (ii) we show many target tasks can benefit each other from sharing prompt vectors and thus can be jointly learned via multitask prompt tuning. We benchmark the proposed MVLPT using three representative prompt tuning methods, namely text prompt tuning, visual prompt tuning, and the unified vision-language prompt tuning. Results in 20 vision tasks demonstrate that the proposed approach outperforms all single-task baseline prompt tuning methods, setting the new state-of-the-art on the few-shot ELEVATER benchmarks and cross-task generalization benchmarks. To understand where the cross-task knowledge is most effective, we also conduct a large-scale study on task transferability with 20 vision tasks in 400 combinations for each prompt tuning method. It shows that the most performant MVLPT for each prompt tuning method prefers different task combinations and many tasks can benefit each other, depending on their visual similarity and label similarity. Code is available at https://github.com/sIncerass/MVLPT.

翻译：以特定任务所学的快速矢量为条件的快速调控,现已成为一种数据高效和参数高效的方法,使大型预先训练的视觉语言模型适应多个下游任务。然而,现有方法通常考虑为每项任务从零开始独立学习快速矢量,从而无法利用不同视觉语言任务的丰富共享知识。在本文件中,我们提议多任务视觉语言快速调控(MVLPT),将跨任务知识纳入对视觉语言模型的快速调控。具体地说,(一) 我们展示从多个来源任务中学习单一可转移的快速度,以启动每项目标任务的快速度;(二) 我们显示许多目标任务可以相互受益于共享快速矢量,从而可以通过多任务快速调控来共同学习。我们用三种具有代表性的快速调校准方法对拟议 MVLPT(MLPT)进行测试,即文本调控调、视觉快速调控、20项结果显示所有单一任务都优度基线快速调方法,为每项目标任务设定新的州级级调算,并在每个通用任务中显示最快速的全任务基准。

0

相关内容

Prompt

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

专知会员服务

29+阅读 · 2022年3月12日

NLP必读经典文献100篇

专知会员服务

123+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

123+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日

近期必读的6篇 NeurIPS 2019 的零样本学习(Zero-Shot Learning)论文

近期必读的6篇 NeurIPS 2019 的零样本学习(Zero-Shot Learning)论文

专知会员服务

56+阅读 · 2019年12月24日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

53+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

77+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

64+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

39+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

26+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

19+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

新型高性能、低成本Ti基固溶体氧化物电极材料开发与反应机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

半导体晶体基人工光合成体系的研究

国家自然科学基金

0+阅读 · 2014年12月31日

高压下新型碱土金属碳化物的结构与性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

分级孔结构的普鲁士蓝类配合物纳米晶体的合成及在钠离子电池中的应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

掺杂二氧化锰纳米材料合成及热电效应研究

国家自然科学基金

0+阅读 · 2012年12月31日

氧化锌二维晶体的制备及光电特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

TM (Zr, V, Ti)-Si系相关相图与新型轻质高温结构材料探索

国家自然科学基金

0+阅读 · 2012年12月31日

银掺杂氧化锌纳米结构的制备及高压研究

国家自然科学基金

0+阅读 · 2012年12月31日

动力学可控合成Fe@Au纳米核壳结构的研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向不确定性的Web2.0用户创作内容管理研究

国家自然科学基金

0+阅读 · 2011年12月31日

Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery

Arxiv

0+阅读 · 2023年2月7日

Medical Image Understanding with Pretrained Vision Language Models: A Comprehensive Study

Arxiv

0+阅读 · 2023年2月7日

Exploring the Benefits of Training Expert Language Models over Instruction Tuning

Arxiv

0+阅读 · 2023年2月7日

MetaPrompting: Learning to Learn Better Prompts

Arxiv

0+阅读 · 2023年2月3日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Conditional Prompt Learning for Vision-Language Models

Conditional Prompt Learning for Vision-Language Models

Arxiv

13+阅读 · 2022年3月10日

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Arxiv

29+阅读 · 2021年7月28日

SiT: Self-supervised vIsion Transformer

Arxiv

19+阅读 · 2021年4月8日

Transfer Adaptation Learning: A Decade Survey

Transfer Adaptation Learning: A Decade Survey

Arxiv

37+阅读 · 2019年3月12日

Weakly Supervised One-Shot Detection with Attention Siamese Networks

Arxiv

14+阅读 · 2018年1月12日

VIP会员

文章信息

相关主题

知识 (knowledge)

相关VIP内容

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

【CVPR 2022】视觉提示调整（VPT），Vision Prompt Tuning

专知会员服务

29+阅读 · 2022年3月12日

NLP必读经典文献100篇

专知会员服务

123+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

123+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日

近期必读的6篇 NeurIPS 2019 的零样本学习(Zero-Shot Learning)论文

近期必读的6篇 NeurIPS 2019 的零样本学习(Zero-Shot Learning)论文

专知会员服务

56+阅读 · 2019年12月24日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

53+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

77+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

64+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

39+阅读 · 2019年10月9日

热门VIP内容

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

26+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

19+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

相关论文

Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery

Arxiv

0+阅读 · 2023年2月7日

Medical Image Understanding with Pretrained Vision Language Models: A Comprehensive Study

Arxiv

0+阅读 · 2023年2月7日

Exploring the Benefits of Training Expert Language Models over Instruction Tuning

Arxiv

0+阅读 · 2023年2月7日

MetaPrompting: Learning to Learn Better Prompts

Arxiv

0+阅读 · 2023年2月3日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Conditional Prompt Learning for Vision-Language Models

Conditional Prompt Learning for Vision-Language Models

Arxiv

13+阅读 · 2022年3月10日

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Arxiv

29+阅读 · 2021年7月28日

SiT: Self-supervised vIsion Transformer

Arxiv

19+阅读 · 2021年4月8日

Transfer Adaptation Learning: A Decade Survey

Transfer Adaptation Learning: A Decade Survey

Arxiv

37+阅读 · 2019年3月12日

Weakly Supervised One-Shot Detection with Attention Siamese Networks

Arxiv

14+阅读 · 2018年1月12日

相关基金

新型高性能、低成本Ti基固溶体氧化物电极材料开发与反应机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

半导体晶体基人工光合成体系的研究

国家自然科学基金

0+阅读 · 2014年12月31日

高压下新型碱土金属碳化物的结构与性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

分级孔结构的普鲁士蓝类配合物纳米晶体的合成及在钠离子电池中的应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

掺杂二氧化锰纳米材料合成及热电效应研究

国家自然科学基金

0+阅读 · 2012年12月31日

氧化锌二维晶体的制备及光电特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

TM (Zr, V, Ti)-Si系相关相图与新型轻质高温结构材料探索

国家自然科学基金

0+阅读 · 2012年12月31日

银掺杂氧化锌纳米结构的制备及高压研究

国家自然科学基金

0+阅读 · 2012年12月31日

动力学可控合成Fe@Au纳米核壳结构的研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向不确定性的Web2.0用户创作内容管理研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员