对VQ3D来说,为以自我为中心视频拍摄的摄像头估计更多摄像头是关键 (Estimating more camera poses for ego-centric videos is essential for VQ3D) - 专知论文

会员服务 ·

0

估计/估计量 · 3D · Performer · Better · 向量化 ·

2022 年 11 月 18 日

Estimating more camera poses for ego-centric videos is essential for VQ3D

翻译：对VQ3D来说,为以自我为中心视频拍摄的摄像头估计更多摄像头是关键

Jinjie Mai,Chen Zhao,Abdullah Hamdi,Silvio Giancola,Bernard Ghanem

from arxiv, Second International Ego4D Workshop at ECCV 2022

Visual queries 3D localization (VQ3D) is a task in the Ego4D Episodic Memory Benchmark. Given an egocentric video, the goal is to answer queries of the form "Where did I last see object X?", where the query object X is specified as a static image, and the answer should be a 3D displacement vector pointing to object X. However, current techniques use naive ways to estimate the camera poses of video frames, resulting in a low query with pose (QwP) ratio, thus a poor overall success rate. We design a new pipeline for the challenging egocentric video camera pose estimation problem in our work. Moreover, we revisit the current VQ3D framework and optimize it in terms of performance and efficiency. As a result, we get the top-1 overall success rate of 25.8% on VQ3D leaderboard, which is two times better than the 8.7% reported by the baseline.

翻译：视觉查询 3D 本地化 (VQ3D) 是 Ego4D Episodic Memory 基准( VQ3D) 中的一项任务。在以自我为中心的视频中, 目标是回答“ 我最后一次看到对象 X 在哪里? ” 的答题, 查询对象 X 是静态图像, 答案应该是指向对象 X 的 3D 迁移矢量。然而, 当前技术使用天真的方法来估计摄像头的摄像头配置, 从而导致低质质( QwP) 比例的询问, 从而导致总体成功率低下。我们设计了一条新的管道, 用于挑战性的以自我为中心的视频摄像头的管道, 给我们的工作带来了估算问题。此外, 我们重新审视了当前的 VQ3D 框架, 并在性能和效率方面优化了它。结果, 我们在 VQ3D 头板上获得了25.8% 的总成功率比基准报告的8.7%高2倍。

0

相关内容

估计/估计量

估计/估计量

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

VALSE 论文速览第45期：Neural Body：用带有隐式结构编码的INR生成动态人体新视角

VALSE 论文速览第45期：Neural Body：用带有隐式结构编码的INR生成动态人体新视角

VALSE

0+阅读 · 2022年1月28日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新6篇生成式对抗网络（GAN）相关论文—半监督对抗学习、行人再识别、代表性特征、高分辨率深度卷积、自监督、超分辨

【论文推荐】最新6篇生成式对抗网络（GAN）相关论文—半监督对抗学习、行人再识别、代表性特征、高分辨率深度卷积、自监督、超分辨

专知

10+阅读 · 2018年2月1日

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

2-十三烷酮诱导的棉铃虫HaTrf基因调控细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2016年12月31日

套子代数的Hochschild上同调及套的分类

国家自然科学基金

3+阅读 · 2014年12月31日

mTOR功能性单倍体通过ERS-IRE1/α-JNK通路调控乳腺癌细胞药物敏感性的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

microRNA调节肿瘤抑制因子Caliban应答DNA损伤的机制

国家自然科学基金

1+阅读 · 2012年12月31日

西沙群岛四种海绵新颖结构抗肿瘤活性成分的发现研究

国家自然科学基金

0+阅读 · 2012年12月31日

lnc-Oct4结合miR-145上调Oct4促进膀胱癌演进的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

水稻OsMYB2P-1基因功能的研究

国家自然科学基金

0+阅读 · 2011年12月31日

GPU加速的视频抽象化和卡通化

国家自然科学基金

0+阅读 · 2009年12月31日

白念珠菌凋亡相关新基因IPF4847的功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

Code-Verification Techniques for the Method-of-Moments Implementation of the Magnetic-Field Integral Equation

Arxiv

0+阅读 · 2023年1月19日

Improving Food Detection For Images From a Wearable Egocentric Camera

Arxiv

0+阅读 · 2023年1月19日

OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models

Arxiv

0+阅读 · 2023年1月18日

Real-Time Viewport-Aware Optical Flow Estimation in 360-degree Videos for Visually-Induced Motion Sickness Mitigation

Arxiv

0+阅读 · 2023年1月18日

HSTFormer: Hierarchical Spatial-Temporal Transformers for 3D Human Pose Estimation

Arxiv

0+阅读 · 2023年1月18日

Mixed Attention with Deep Supervision for Delineation of COVID Infection in Lung CT

Arxiv

0+阅读 · 2023年1月17日

YeLan: Event Camera-Based 3D Human Pose Estimation for Technology-Mediated Dancing in Challenging Environments with Comprehensive Motion-to-Event Simulator

Arxiv

0+阅读 · 2023年1月17日

Disambiguation of One-Shot Visual Classification Tasks: A Simplex-Based Approach

Arxiv

0+阅读 · 2023年1月16日

Causal mediation analysis: From simple to more robust strategies for estimation of marginal natural (in)direct effects

Arxiv

0+阅读 · 2023年1月14日

Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling

Arxiv

10+阅读 · 2021年2月11日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【斯坦福博士论文】计算受限的持续学习：基础与算法

生成式人工智能时代的多目标推荐：最新进展与未来展望综述

AI大模型技术在电力系统中的应用及发展趋势

【ICML2025】SparseLoRA：利用上下文稀疏性加速大语言模型微调

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

VALSE 论文速览第45期：Neural Body：用带有隐式结构编码的INR生成动态人体新视角

VALSE 论文速览第45期：Neural Body：用带有隐式结构编码的INR生成动态人体新视角

VALSE

0+阅读 · 2022年1月28日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新6篇生成式对抗网络（GAN）相关论文—半监督对抗学习、行人再识别、代表性特征、高分辨率深度卷积、自监督、超分辨

【论文推荐】最新6篇生成式对抗网络（GAN）相关论文—半监督对抗学习、行人再识别、代表性特征、高分辨率深度卷积、自监督、超分辨

专知

10+阅读 · 2018年2月1日

相关论文

Code-Verification Techniques for the Method-of-Moments Implementation of the Magnetic-Field Integral Equation

Arxiv

0+阅读 · 2023年1月19日

Improving Food Detection For Images From a Wearable Egocentric Camera

Arxiv

0+阅读 · 2023年1月19日

OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models

Arxiv

0+阅读 · 2023年1月18日

Real-Time Viewport-Aware Optical Flow Estimation in 360-degree Videos for Visually-Induced Motion Sickness Mitigation

Arxiv

0+阅读 · 2023年1月18日

HSTFormer: Hierarchical Spatial-Temporal Transformers for 3D Human Pose Estimation

Arxiv

0+阅读 · 2023年1月18日

Mixed Attention with Deep Supervision for Delineation of COVID Infection in Lung CT

Arxiv

0+阅读 · 2023年1月17日

YeLan: Event Camera-Based 3D Human Pose Estimation for Technology-Mediated Dancing in Challenging Environments with Comprehensive Motion-to-Event Simulator

Arxiv

0+阅读 · 2023年1月17日

Disambiguation of One-Shot Visual Classification Tasks: A Simplex-Based Approach

Arxiv

0+阅读 · 2023年1月16日

Causal mediation analysis: From simple to more robust strategies for estimation of marginal natural (in)direct effects

Arxiv

0+阅读 · 2023年1月14日

Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling

Arxiv

10+阅读 · 2021年2月11日

相关基金

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

2-十三烷酮诱导的棉铃虫HaTrf基因调控细胞凋亡的分子机制

国家自然科学基金

0+阅读 · 2016年12月31日

套子代数的Hochschild上同调及套的分类

国家自然科学基金

3+阅读 · 2014年12月31日

mTOR功能性单倍体通过ERS-IRE1/α-JNK通路调控乳腺癌细胞药物敏感性的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

microRNA调节肿瘤抑制因子Caliban应答DNA损伤的机制

国家自然科学基金

1+阅读 · 2012年12月31日

西沙群岛四种海绵新颖结构抗肿瘤活性成分的发现研究

国家自然科学基金

0+阅读 · 2012年12月31日

lnc-Oct4结合miR-145上调Oct4促进膀胱癌演进的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

水稻OsMYB2P-1基因功能的研究

国家自然科学基金

0+阅读 · 2011年12月31日

GPU加速的视频抽象化和卡通化

国家自然科学基金

0+阅读 · 2009年12月31日

白念珠菌凋亡相关新基因IPF4847的功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员