面向即时少数热量语言学生的反向学习 (Contrastive Learning for Prompt-Based Few-Shot Language Learners) - 专知论文

会员服务 ·

0

contrastive · 对比学习 · 小样本学习 · 学成 · Prompt ·

2022 年 5 月 3 日

Contrastive Learning for Prompt-Based Few-Shot Language Learners

翻译：面向即时少数热量语言学生的反向学习

Yiren Jian,Chongyang Gao,Soroush Vosoughi

from arxiv, accepted to NAACL 2022

The impressive performance of GPT-3 using natural language prompts and in-context learning has inspired work on better fine-tuning of moderately-sized models under this paradigm. Following this line of work, we present a contrastive learning framework that clusters inputs from the same class for better generality of models trained with only limited examples. Specifically, we propose a supervised contrastive framework that clusters inputs from the same class under different augmented "views" and repel the ones from different classes. We create different "views" of an example by appending it with different language prompts and contextual demonstrations. Combining a contrastive loss with the standard masked language modeling (MLM) loss in prompt-based few-shot learners, the experimental results show that our method can improve over the state-of-the-art methods in a diverse set of 15 language tasks. Our framework makes minimal assumptions on the task or the base model, and can be applied to many recent methods with little modification. The code will be made available at: https://github.com/yiren-jian/LM-SupCon.

翻译：GPT-3使用自然语言的提示和内文学习的令人印象深刻的成绩激励了在这种范式下更好地微调中等规模模型的工作。根据这一类工作,我们提出了一个对比式学习框架,将同一类的投入集中起来,以便更普遍地使用仅以有限实例培训的模型。具体地说,我们提出了一个监督式对比性框架,将同一类的投入集中到不同的扩大“视图”之下,并击退不同类别的投入。我们用不同的语言提示和背景演示来创建不同的“视图”。在快速点播的少数学习者中,将一个对比性损失与标准隐蔽语言模型损失结合起来,实验结果显示,我们的方法可以改进15种语言任务中最先进的方法。我们的框架对任务或基本模型提出了最低限度的假设,并且可以稍加修改后应用于许多近期的方法。代码将在以下网址上公布:https://github.com/yren-jian/LM-SupCon。

0

相关内容

contrastive

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

相变材料应变工程与锗多栅晶体管的优化集成方案

国家自然科学基金

0+阅读 · 2015年12月31日

纳米双金属复合氧化物催化臭氧的效能及机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

用于高分子太阳能电池的高功函氧化石墨烯阳极界面材料

国家自然科学基金

0+阅读 · 2013年12月31日

差异表达基因microRNA结合位点遗传变异与中国人乳腺癌发病机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

磁分离/自清洁复合型单分散微球SERS基底对农药残留的高效检测

国家自然科学基金

0+阅读 · 2012年12月31日

Hippo通路在急性肾损伤发病中的作用及其机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于偏好信息学习引导的混合性能指标智能优化决策模型与方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

低表面能亚微米通道内流动阻力与传热机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

NMDAR基因多态性与微波辐射致学习记忆障碍的关联性研究

国家自然科学基金

0+阅读 · 2009年12月31日

Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning

Arxiv

5+阅读 · 2022年6月21日

Self-Distilled Self-Supervised Representation Learning

Arxiv

0+阅读 · 2022年6月21日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

Few-shot Learning for Multi-label Intent Detection

Arxiv

21+阅读 · 2020年10月11日

Pre-training Text Representations as Meta Learning

Arxiv

13+阅读 · 2020年4月12日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

Self-Supervised Learning For Few-Shot Image Classification

Self-Supervised Learning For Few-Shot Image Classification

Arxiv

19+阅读 · 2019年11月14日

Few-shot Learning with Meta Metric Learners

Arxiv

13+阅读 · 2019年1月26日

VIP会员

文章信息

相关主题

小样本学习

相关VIP内容

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】以人为中心的强化学习

任务规划与地形分析：现代复杂环境作战导航体系

认知优势：人工智能在国家安全决策中的核心作用

大模型赋能的具身智能：决策与具身学习综述

相关资讯

浅聊对比学习（Contrastive Learning）第一弹

浅聊对比学习（Contrastive Learning）第一弹

PaperWeekly

0+阅读 · 2022年6月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning

Arxiv

5+阅读 · 2022年6月21日

Self-Distilled Self-Supervised Representation Learning

Arxiv

0+阅读 · 2022年6月21日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

Few-shot Learning for Multi-label Intent Detection

Arxiv

21+阅读 · 2020年10月11日

Pre-training Text Representations as Meta Learning

Arxiv

13+阅读 · 2020年4月12日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

Self-Supervised Learning For Few-Shot Image Classification

Self-Supervised Learning For Few-Shot Image Classification

Arxiv

19+阅读 · 2019年11月14日

Few-shot Learning with Meta Metric Learners

Arxiv

13+阅读 · 2019年1月26日

相关基金

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

相变材料应变工程与锗多栅晶体管的优化集成方案

国家自然科学基金

0+阅读 · 2015年12月31日

纳米双金属复合氧化物催化臭氧的效能及机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

用于高分子太阳能电池的高功函氧化石墨烯阳极界面材料

国家自然科学基金

0+阅读 · 2013年12月31日

差异表达基因microRNA结合位点遗传变异与中国人乳腺癌发病机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

磁分离/自清洁复合型单分散微球SERS基底对农药残留的高效检测

国家自然科学基金

0+阅读 · 2012年12月31日

Hippo通路在急性肾损伤发病中的作用及其机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于偏好信息学习引导的混合性能指标智能优化决策模型与方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

低表面能亚微米通道内流动阻力与传热机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

NMDAR基因多态性与微波辐射致学习记忆障碍的关联性研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员