HUM3DIL: 半监督的多式3D 人类自主驾驶估计 (HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving) - 专知论文

会员服务 ·

0

LIDAR · 估计/估计量 · 3D · INFORMS · state-of-the-art ·

2022 年 12 月 15 日

HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving

翻译：HUM3DIL: 半监督的多式3D 人类自主驾驶估计

Andrei Zanfir,Mihai Zanfir,Alexander Gorban,Jingwei Ji,Yin Zhou,Dragomir Anguelov,Cristian Sminchisescu

from arxiv, Published at the 6th Conference on Robot Learning (CoRL 2022), Auckland, New Zealand

Autonomous driving is an exciting new industry, posing important research questions. Within the perception module, 3D human pose estimation is an emerging technology, which can enable the autonomous vehicle to perceive and understand the subtle and complex behaviors of pedestrians. While hardware systems and sensors have dramatically improved over the decades -- with cars potentially boasting complex LiDAR and vision systems and with a growing expansion of the available body of dedicated datasets for this newly available information -- not much work has been done to harness these novel signals for the core problem of 3D human pose estimation. Our method, which we coin HUM3DIL (HUMan 3D from Images and LiDAR), efficiently makes use of these complementary signals, in a semi-supervised fashion and outperforms existing methods with a large margin. It is a fast and compact model for onboard deployment. Specifically, we embed LiDAR points into pixel-aligned multi-modal features, which we pass through a sequence of Transformer refinement stages. Quantitative experiments on the Waymo Open Dataset support these claims, where we achieve state-of-the-art results on the task of 3D pose estimation.

翻译：自主驾驶是一个令人兴奋的新产业,它提出了重要的研究问题。在感知模块中,3D人构成估计是一种新兴技术,它可以使自主载体能够感知和理解行人微妙和复杂的行为。虽然硬件系统和传感器在过去几十年里已经大为改善 -- -- 汽车可能夸大复杂的激光雷达和视觉系统,而且汽车可能为这种新获得的信息而拥有的专用数据集越来越多 -- -- 在利用这些新的3D人构成估计核心问题的新信号方面没有做多少工作。我们用HUM3DIL(来自图像和激光雷达的HUMan 3D)这个方法,有效地使用这些辅助信号,以半受监督的方式,并且大大超越了现有方法。这是一个在机上部署的快速和紧凑模式。具体地说,我们将激光雷达点嵌入像素调整的多模式特征中,我们通过一个变异器改进阶段的顺序通过。Waymo Open数据集的量化实验支持了这些主张,我们在3D估计任务中取得了最先进的结果。

0

相关内容

LIDAR

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

专知会员服务

18+阅读 · 2022年3月19日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

161+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

地西他滨诱导人调节性gammadeltaT细胞Foxp3基因表达上调的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于极化中子反射技术的CoFe/IrMn双层膜的自旋结构研究

国家自然科学基金

0+阅读 · 2015年12月31日

"β-hCG-ERK1/2-MMP-2"信号通路在卵巢癌侵袭、转移中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

鱼类特有基因Gig1和Gig2在干扰素抗病毒反应中的功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

纤维结构不良中破骨细胞过度激活的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

ING3：原发性肝癌的诊断与治疗新靶点

国家自然科学基金

0+阅读 · 2012年12月31日

DBFC燃料电池Au基合金催化剂结构设计、性能表征及电催化特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于PMN-PT单晶的层状结构中弹性波传播特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

CXCR7/SDF-1/ITAC信号调控前列腺癌细胞迁徙、侵袭及增殖的作用机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

PMN-PT单晶的高频压电性能及机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation for autonomous vehicles

Arxiv

0+阅读 · 2023年2月16日

A Review of Uncertainty Estimation and its Application in Medical Imaging

Arxiv

1+阅读 · 2023年2月16日

Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation

Arxiv

0+阅读 · 2023年2月15日

Semi-Supervised Visual Tracking of Marine Animals using Autonomous Underwater Vehicles

Arxiv

0+阅读 · 2023年2月14日

An Image Processing Pipeline for Autonomous Deep-Space Optical Navigation

Arxiv

0+阅读 · 2023年2月14日

Surround-View Vision-based 3D Detection for Autonomous Driving: A Survey

Arxiv

0+阅读 · 2023年2月13日

$Explicit3D: Graph Network with Spatial Inference \\for Single Image 3D Object Detection$

Explicit3D: Graph Network with Spatial Inference \\for Single Image 3D Object Detection

Arxiv

0+阅读 · 2023年2月13日

3D Object Detection for Autonomous Driving: A Survey

Arxiv

12+阅读 · 2021年6月21日

Deep Learning-Based Human Pose Estimation: A Survey

Arxiv

27+阅读 · 2020年12月24日

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

Arxiv

10+阅读 · 2020年3月20日

VIP会员

文章信息

相关主题

估计/估计量

state-of-the-art

相关VIP内容

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

专知会员服务

18+阅读 · 2022年3月19日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

161+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《国防领域的人工智能：国防工业基础未来路线图——通过人工智能战略整合、确保安全与开创国防创新》2025最新31页报告

《美海军陆战队训练与教育司令部战役计划2025》最新报告

生成式人工智能的军事应用及路径探讨

《生成式人工智能的军事安全应用：弹性可信部署框架》北约最新51页slides

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation for autonomous vehicles

Arxiv

0+阅读 · 2023年2月16日

A Review of Uncertainty Estimation and its Application in Medical Imaging

Arxiv

1+阅读 · 2023年2月16日

Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation

Arxiv

0+阅读 · 2023年2月15日

Semi-Supervised Visual Tracking of Marine Animals using Autonomous Underwater Vehicles

Arxiv

0+阅读 · 2023年2月14日

An Image Processing Pipeline for Autonomous Deep-Space Optical Navigation

Arxiv

0+阅读 · 2023年2月14日

Surround-View Vision-based 3D Detection for Autonomous Driving: A Survey

Arxiv

0+阅读 · 2023年2月13日

$Explicit3D: Graph Network with Spatial Inference \\for Single Image 3D Object Detection$

Explicit3D: Graph Network with Spatial Inference \\for Single Image 3D Object Detection

Arxiv

0+阅读 · 2023年2月13日

3D Object Detection for Autonomous Driving: A Survey

Arxiv

12+阅读 · 2021年6月21日

Deep Learning-Based Human Pose Estimation: A Survey

Arxiv

27+阅读 · 2020年12月24日

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

Arxiv

10+阅读 · 2020年3月20日

相关基金

地西他滨诱导人调节性gammadeltaT细胞Foxp3基因表达上调的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于极化中子反射技术的CoFe/IrMn双层膜的自旋结构研究

国家自然科学基金

0+阅读 · 2015年12月31日

"β-hCG-ERK1/2-MMP-2"信号通路在卵巢癌侵袭、转移中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

鱼类特有基因Gig1和Gig2在干扰素抗病毒反应中的功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

纤维结构不良中破骨细胞过度激活的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

ING3：原发性肝癌的诊断与治疗新靶点

国家自然科学基金

0+阅读 · 2012年12月31日

DBFC燃料电池Au基合金催化剂结构设计、性能表征及电催化特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于PMN-PT单晶的层状结构中弹性波传播特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

CXCR7/SDF-1/ITAC信号调控前列腺癌细胞迁徙、侵袭及增殖的作用机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

PMN-PT单晶的高频压电性能及机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员