对我而言正确的东西,对于你而言并不正确:通过多任务学习建立相对方向的数据集 (What is Right for Me is Not Yet Right for You: A Dataset for Grounding Relative Directions via Multi-Task Learning) - 专知论文

会员服务 ·

0

有向 · 视觉问答 · 数据集 · 学成 · 可辨认的 ·

2022 年 5 月 5 日

What is Right for Me is Not Yet Right for You: A Dataset for Grounding Relative Directions via Multi-Task Learning

翻译：对我而言正确的东西,对于你而言并不正确:通过多任务学习建立相对方向的数据集

Jae Hee Lee,Matthias Kerzel,Kyra Ahrens,Cornelius Weber,Stefan Wermter

from arxiv, Accepted to IJCAI 2022

Understanding spatial relations is essential for intelligent agents to act and communicate in the physical world. Relative directions are spatial relations that describe the relative positions of target objects with regard to the intrinsic orientation of reference objects. Grounding relative directions is more difficult than grounding absolute directions because it not only requires a model to detect objects in the image and to identify spatial relation based on this information, but it also needs to recognize the orientation of objects and integrate this information into the reasoning process. We investigate the challenging problem of grounding relative directions with end-to-end neural networks. To this end, we provide GRiD-3D, a novel dataset that features relative directions and complements existing visual question answering (VQA) datasets, such as CLEVR, that involve only absolute directions. We also provide baselines for the dataset with two established end-to-end VQA models. Experimental evaluations show that answering questions on relative directions is feasible when questions in the dataset simulate the necessary subtasks for grounding relative directions. We discover that those subtasks are learned in an order that reflects the steps of an intuitive pipeline for processing relative directions.

翻译：了解空间关系对于智能剂在物理世界中采取行动和进行交流至关重要。相对方向是描述目标物体相对于参照对象内在方向的相对位置的空间关系。定位相对方向比确定绝对方向更加困难, 因为它不仅需要一种模型来检测图像中的物体, 并根据这种信息确定空间关系, 而且它也需要识别对象的方向, 并将这种信息纳入推理过程。我们调查了用端对端神经网络定位相对方向的棘手问题。为此, 我们提供了GRID-3D, 一套具有相对方向的新数据集, 并补充了现有的直观回答(如CLEVR)数据集, 仅涉及绝对方向。我们还提供了两个既定端对端VQA模型的数据集基线。实验性评估表明,当数据集中的问题模拟确定相对方向所需的子任务时, 回答相对方向问题是可行的。我们发现, 这些子任务是在反映直径管道步骤以处理相对方向的顺序中学习的。

0

相关内容

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

专知

15+阅读 · 2018年6月29日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Plücker直线摄影测量的理论与方法

国家自然科学基金

0+阅读 · 2014年12月31日

ECoG,EEG-fMRI多模态癫痫监测与病灶定位研究

国家自然科学基金

0+阅读 · 2014年12月31日

S100A6基因修饰的骨髓间充质干细胞移植治疗糖尿病性勃起功能障碍的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

重载车辆ECAS/CTIS集成系统耦合机理及主动控制研究

国家自然科学基金

0+阅读 · 2013年12月31日

星系巡天中弱引力透镜的精确测量与计算

国家自然科学基金

0+阅读 · 2012年12月31日

microRNA-101靶向调控EZH2在肝癌化疗耐药中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

磁化氢气脉冲放电的PIC/MC/DSMC模拟研究

国家自然科学基金

0+阅读 · 2012年12月31日

DEC1、DEC2对人乳腺癌细胞衰老的调控作用及其作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

非小细胞肺癌术前N分期中PET/CT多模态复合信息特征研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于AIS的船舶无线自动定位的新理论与关键技术

国家自然科学基金

0+阅读 · 2011年12月31日

Open-source objective-oriented framework for head-related transfer function

Arxiv

0+阅读 · 2022年6月24日

Cooperative Hybrid Networks with Active Relays and RISs for B5G: Applications, Challenges, and Research Directions

Cooperative Hybrid Networks with Active Relays and RISs for B5G: Applications, Challenges, and Research Directions

Arxiv

0+阅读 · 2022年6月23日

VisFIS: Visual Feature Importance Supervision with Right-for-the-Right-Reason Objectives

VisFIS: Visual Feature Importance Supervision with Right-for-the-Right-Reason Objectives

Arxiv

0+阅读 · 2022年6月22日

Supervised Learning for Coverage-Directed Test Selection in Simulation-Based Verification

Arxiv

0+阅读 · 2022年6月22日

A Study on the Evaluation of Generative Models

Arxiv

0+阅读 · 2022年6月22日

A Bidirectional Fabric-based Pneumatic Actuator for the Infant Shoulder: Design and Comparative Kinematic Analysis

Arxiv

0+阅读 · 2022年6月21日

Multi-view Contrastive Graph Clustering

Arxiv

13+阅读 · 2021年10月22日

Generative Models as a Data Source for Multiview Representation Learning

Arxiv

16+阅读 · 2021年6月9日

A Survey on Multi-Task Learning

Arxiv

31+阅读 · 2021年3月29日

Learning from Very Few Samples: A Survey

Arxiv

126+阅读 · 2020年9月6日

VIP会员

文章信息

相关主题

相关VIP内容

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【ACL2025教程】大语言模型的护栏与安全性：对其应用的安全、可靠与可控引导

《实现协同自主：从人机协作到多智能体系统》最新190页

【ICML2025】SToFM：一种用于空间转录组学的多尺度基础模型

通信网络智能体白皮书V1.0，61页pdf

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

【论文推荐】最新九篇自动问答相关论文—可解释推理网络、上下文知识图谱嵌入、注意力RNN、Multi-Cast注意力网络

专知

15+阅读 · 2018年6月29日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Open-source objective-oriented framework for head-related transfer function

Arxiv

0+阅读 · 2022年6月24日

Cooperative Hybrid Networks with Active Relays and RISs for B5G: Applications, Challenges, and Research Directions

Cooperative Hybrid Networks with Active Relays and RISs for B5G: Applications, Challenges, and Research Directions

Arxiv

0+阅读 · 2022年6月23日

VisFIS: Visual Feature Importance Supervision with Right-for-the-Right-Reason Objectives

VisFIS: Visual Feature Importance Supervision with Right-for-the-Right-Reason Objectives

Arxiv

0+阅读 · 2022年6月22日

Supervised Learning for Coverage-Directed Test Selection in Simulation-Based Verification

Arxiv

0+阅读 · 2022年6月22日

A Study on the Evaluation of Generative Models

Arxiv

0+阅读 · 2022年6月22日

A Bidirectional Fabric-based Pneumatic Actuator for the Infant Shoulder: Design and Comparative Kinematic Analysis

Arxiv

0+阅读 · 2022年6月21日

Multi-view Contrastive Graph Clustering

Arxiv

13+阅读 · 2021年10月22日

Generative Models as a Data Source for Multiview Representation Learning

Arxiv

16+阅读 · 2021年6月9日

A Survey on Multi-Task Learning

Arxiv

31+阅读 · 2021年3月29日

Learning from Very Few Samples: A Survey

Arxiv

126+阅读 · 2020年9月6日

相关基金

Plücker直线摄影测量的理论与方法

国家自然科学基金

0+阅读 · 2014年12月31日

ECoG,EEG-fMRI多模态癫痫监测与病灶定位研究

国家自然科学基金

0+阅读 · 2014年12月31日

S100A6基因修饰的骨髓间充质干细胞移植治疗糖尿病性勃起功能障碍的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

重载车辆ECAS/CTIS集成系统耦合机理及主动控制研究

国家自然科学基金

0+阅读 · 2013年12月31日

星系巡天中弱引力透镜的精确测量与计算

国家自然科学基金

0+阅读 · 2012年12月31日

microRNA-101靶向调控EZH2在肝癌化疗耐药中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

磁化氢气脉冲放电的PIC/MC/DSMC模拟研究

国家自然科学基金

0+阅读 · 2012年12月31日

DEC1、DEC2对人乳腺癌细胞衰老的调控作用及其作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

非小细胞肺癌术前N分期中PET/CT多模态复合信息特征研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于AIS的船舶无线自动定位的新理论与关键技术

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员