缩小基金会:建模嵌入模型和薄弱的监督 (Shoring Up the Foundations: Fusing Model Embeddings and Weak Supervision)

Foundation models offer an exciting new paradigm for constructing models with out-of-the-box embeddings and a few labeled examples. However, it is not clear how to best apply foundation models without labeled data. A potential approach is to fuse foundation models with weak supervision frameworks, which use weak label sources -- pre-trained models, heuristics, crowd-workers -- to construct pseudolabels. The challenge is building a combination that best exploits the signal available in both foundation models and weak sources. We propose Liger, a combination that uses foundation model embeddings to improve two crucial elements of existing weak supervision techniques. First, we produce finer estimates of weak source quality by partitioning the embedding space and learning per-part source accuracies. Second, we improve source coverage by extending source votes in embedding space. Despite the black-box nature of foundation models, we prove results characterizing how our approach improves performance and show that lift scales with the smoothness of label distributions in embedding space. On six benchmark NLP and video tasks, Liger outperforms vanilla weak supervision by 14.1 points, weakly-supervised kNN and adapters by 11.8 points, and kNN and adapters supervised by traditional hand labels by 7.2 points.

翻译：基础模型为构建模型提供了令人振奋的新范例,这些模型包括箱外嵌入和几个贴标签的例子。然而,尚不清楚如何在没有贴标签数据的情况下最好地应用基础模型。一种潜在的办法是将基础模型与薄弱的监管框架结合起来,这些框架使用薄弱的标签源 -- -- 预先培训的模型、疲劳主义、人群工人 -- -- 来构建假标签。挑战在于构建一个能够最好地利用基础模型和薄弱来源中现有信号的组合。我们提议了Liger,这种组合利用基础模型嵌入来改进现有薄弱监督技术的两个关键要素。首先,我们通过分割嵌入空间和学习每个部分源的精细估计来源的缺陷质量。第二,我们通过扩大嵌入空间的源投票来改进源的覆盖范围。尽管基础模型具有黑盒性质,但我们证明我们的方法如何改进了绩效,并展示了在嵌入空间中标签分布的平滑滑的提升尺度。在六个基准 NLP 和视频任务上,Liger用14.1点、薄弱的固化的标签和11点的KNNN和调整了VAN的标签。

相关内容

MoDELS

关注 30

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【教程】深度学习Keras与TensorFlow教程，Deep Learning with Keras and Tensorflow in R

专知会员服务

31+阅读 · 2022年3月9日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

47+阅读 · 2021年1月20日

迁移学习简明教程，11页ppt

专知会员服务

107+阅读 · 2020年8月4日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日