具有一个统一嵌入空间的通用多模式检索 (Universal Multi-Modality Retrieval with One Unified Embedding Space) - 专知论文

会员服务 ·

0

模态 · MoDELS · Learning · state-of-the-art · 大学 ·

2022 年 9 月 1 日

Universal Multi-Modality Retrieval with One Unified Embedding Space

翻译：具有一个统一嵌入空间的通用多模式检索

Zhenghao Liu,Chenyan Xiong,Yuanhuiyi Lv,Zhiyuan Liu,Ge Yu

from arxiv, 10 pages

This paper presents Vision-Language Universal Search (VL-UnivSearch), which builds a unified model for multi-modality retrieval. VL-UnivSearch encodes query and multi-modality sources in a universal embedding space for searching related candidates and routing modalities. To learn a tailored embedding space for multi-modality retrieval, VL-UnivSearch proposes two techniques: 1) Universal embedding optimization, which contrastively optimizes the embedding space using the modality-balanced hard negatives; 2) Image verbalization method, which bridges the modality gap between images and texts in the raw data space. VL-UnivSearch achieves the state-of-the-art on the multi-modality open-domain question answering benchmark, WebQA, and outperforms all retrieval models in each single modality task. It demonstrates that universal multi-modality search is feasible to replace the divide-and-conquer pipeline with a united model and also benefit per modality tasks. All source codes of this work will be released via Github.

翻译：本文介绍视野-语言通用搜索(VL-UniviSearch),该模型为多模式检索构建了统一的模型。 VL-UnivSearch 编码查询和多模式源,用于搜索相关候选人和路由模式的通用嵌入空间。为学习适合多模式检索的嵌入空间,VL-UnivSearch提出了两种技术:1) 通用嵌入优化,以不同方式平衡硬底片优化嵌入空间;2) 图像语言化方法,以弥合原始数据空间图像和文本之间的模式差距。VL-UnivSearch在多模式开放式问题回答基准、WebQA方面达到了最新水平,并超越了每个单一模式任务中的所有检索模式。它表明,通用的多模式搜索是可行的,可以用统一模式取代分解管道,也使每个模式的任务受益。这项工作的所有源代码将通过Githhub发布。

0

相关内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

47+阅读 · 2022年10月2日

NLP必读经典文献100篇

专知会员服务

123+阅读 · 2020年9月8日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

59+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

92+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

52+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

53+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

90+阅读 · 2019年10月10日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

0+阅读 · 2013年12月31日

Beclin 1在阿尔茨海默病样神经元损伤中的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

温阳活血利水法对TGF-β-Smads信号通路介导糖尿病心肌重构的效应机制

国家自然科学基金

0+阅读 · 2012年12月31日

附睾中microRNA与雄激素受体的相互调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于光纤拉锥波导的光子晶体微腔的制作和应用

国家自然科学基金

0+阅读 · 2012年12月31日

塑料太赫兹纤维/波导管制造技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

针刺预处理对心肌缺血再灌注损伤保护作用机制及其信号途径的研究

国家自然科学基金

0+阅读 · 2012年12月31日

高压下II-VI族量子点的超快载流子动力学

国家自然科学基金

0+阅读 · 2012年12月31日

基于热光效应的微机械非致冷光学读出红外成像阵列研究

国家自然科学基金

0+阅读 · 2011年12月31日

单量子点和纳微组合结构中激子的超快速激光光谱研究

国家自然科学基金

0+阅读 · 2008年12月31日

Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages

Arxiv

0+阅读 · 2022年10月18日

6th Place Solution to Google Universal Image Embedding

Arxiv

0+阅读 · 2022年10月17日

GenURL: A General Framework for Unsupervised Representation Learning

Arxiv

0+阅读 · 2022年10月17日

The Power of Selecting Key Blocks with Local Pre-ranking for Long Document Information Retrieval

Arxiv

0+阅读 · 2022年10月15日

A Conversationalist Approach to Information Quality in Information Interaction and Retrieval

Arxiv

0+阅读 · 2022年10月13日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine

Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine

Arxiv

16+阅读 · 2020年8月10日

Embedding-based Retrieval in Facebook Search

Arxiv

12+阅读 · 2020年6月20日

UNITER: Learning UNiversal Image-TExt Representations

UNITER: Learning UNiversal Image-TExt Representations

Arxiv

23+阅读 · 2019年9月25日

Embedding Uncertain Knowledge Graphs

Arxiv

12+阅读 · 2019年2月26日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

47+阅读 · 2022年10月2日

NLP必读经典文献100篇

专知会员服务

123+阅读 · 2020年9月8日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

59+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

161+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

92+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

52+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

53+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

77+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

90+阅读 · 2019年10月10日

热门VIP内容

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages

Arxiv

0+阅读 · 2022年10月18日

6th Place Solution to Google Universal Image Embedding

Arxiv

0+阅读 · 2022年10月17日

GenURL: A General Framework for Unsupervised Representation Learning

Arxiv

0+阅读 · 2022年10月17日

The Power of Selecting Key Blocks with Local Pre-ranking for Long Document Information Retrieval

Arxiv

0+阅读 · 2022年10月15日

A Conversationalist Approach to Information Quality in Information Interaction and Retrieval

Arxiv

0+阅读 · 2022年10月13日

PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval

Arxiv

11+阅读 · 2020年10月20日

Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine

Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine

Arxiv

16+阅读 · 2020年8月10日

Embedding-based Retrieval in Facebook Search

Arxiv

12+阅读 · 2020年6月20日

UNITER: Learning UNiversal Image-TExt Representations

UNITER: Learning UNiversal Image-TExt Representations

Arxiv

23+阅读 · 2019年9月25日

Embedding Uncertain Knowledge Graphs

Arxiv

12+阅读 · 2019年2月26日

相关基金

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

0+阅读 · 2013年12月31日

Beclin 1在阿尔茨海默病样神经元损伤中的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

温阳活血利水法对TGF-β-Smads信号通路介导糖尿病心肌重构的效应机制

国家自然科学基金

0+阅读 · 2012年12月31日

附睾中microRNA与雄激素受体的相互调控机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于光纤拉锥波导的光子晶体微腔的制作和应用

国家自然科学基金

0+阅读 · 2012年12月31日

塑料太赫兹纤维/波导管制造技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

针刺预处理对心肌缺血再灌注损伤保护作用机制及其信号途径的研究

国家自然科学基金

0+阅读 · 2012年12月31日

高压下II-VI族量子点的超快载流子动力学

国家自然科学基金

0+阅读 · 2012年12月31日

基于热光效应的微机械非致冷光学读出红外成像阵列研究

国家自然科学基金

0+阅读 · 2011年12月31日

单量子点和纳微组合结构中激子的超快速激光光谱研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员