Meta Compositional Referring Expression Segmentation - 专知 Papers


April 10, 2023

Meta Compositional Referring Expression Segmentation

Li Xu, Mark He Huang, Xindi Shang, Zehuan Yuan, Ying Sun, Jun Liu

From arXiv; accepted by CVPR 2023

Referring expression segmentation aims to segment an object described by a language expression from an image. Despite the recent progress on this task, existing models tackling this task may not be able to fully capture semantics and visual representations of individual concepts, which limits their generalization capability, especially when handling novel compositions of learned concepts. In this work, through the lens of meta learning, we propose a Meta Compositional Referring Expression Segmentation (MCRES) framework to enhance model compositional generalization performance. Specifically, to handle various levels of novel compositions, our framework first uses training data to construct a virtual training set and multiple virtual testing sets, where data samples in each virtual testing set contain a level of novel compositions w.r.t. the virtual training set. Then, following a novel meta optimization scheme to optimize the model to obtain good testing performance on the virtual testing sets after training on the virtual training set, our framework can effectively drive the model to better capture semantics and visual representations of individual concepts, and thus obtain robust generalization performance even when handling novel compositions. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our framework.
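The abstract does not give the split procedure or the exact meta objective, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' implementation. It assumes a toy one-parameter regression model in place of a segmentation network, defines a hypothetical "novelty level" as the number of concept pairs in an expression that never occur in the virtual training set, and uses a first-order approximation of the meta-gradient. All function names (`build_virtual_sets`, `meta_step`, and so on) are illustrative.

```python
from itertools import combinations

def pairs(concepts):
    """All unordered concept pairs in one expression."""
    return set(combinations(sorted(concepts), 2))

def build_virtual_sets(samples, n_train):
    """Split samples into a virtual training set and virtual testing sets.

    Assumption: a testing sample's novelty level is the number of its
    concept pairs that never occur in the virtual training set.
    """
    v_train = samples[:n_train]
    seen = set().union(*(pairs(c) for c, _, _ in v_train))
    v_tests = {}
    for sample in samples[n_train:]:
        level = len(pairs(sample[0]) - seen)
        if level > 0:  # keep only samples with at least one novel pair
            v_tests.setdefault(level, []).append(sample)
    return v_train, v_tests

def grad(w, data):
    """Gradient of the mean squared error of the toy model y_hat = w * x."""
    return sum(2 * (w * x - y) * x for _, x, y in data) / len(data)

def meta_step(w, v_train, v_tests, inner_lr=0.05, outer_lr=0.05):
    """One meta-optimization step (first-order approximation, assumed)."""
    # Virtual training: one gradient step on the virtual training set.
    train_grad = grad(w, v_train)
    w_fast = w - inner_lr * train_grad
    # Virtual testing: evaluate the updated parameter on every novelty level.
    test_grad = sum(grad(w_fast, data) for data in v_tests.values())
    # First-order shortcut: apply the test gradient at w_fast directly to w.
    return w - outer_lr * (train_grad + test_grad)

# Hypothetical samples: (concepts in the expression, input x, target y = 2x).
samples = [
    ({"red", "cup"}, 1.0, 2.0),
    ({"blue", "car"}, 2.0, 4.0),
    ({"red", "car"}, 3.0, 6.0),           # known concepts, one new pair
    ({"blue", "cup", "left"}, 1.5, 3.0),  # three new pairs
]
v_train, v_tests = build_virtual_sets(samples, n_train=2)

w = 0.0
for _ in range(200):
    w = meta_step(w, v_train, v_tests)
```

With these toy samples the testing sets fall into two novelty levels, and `w` moves toward the slope shared by all samples. A full second-order version would differentiate through the virtual training step rather than reuse the test gradient directly; the abstract does not say which variant the paper uses.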
