3D Human Pose Estimation via Intuitive Physics


3D · Human pose · Human pose estimation · Pose estimation · Inference

March 31, 2023


Shashank Tripathi,Lea Müller,Chun-Hao P. Huang,Omid Taheri,Michael J. Black,Dimitrios Tzionas

From arXiv. Accepted at CVPR 2023. Project page: https://ipman.is.tue.mpg.de

Estimating 3D humans from images often produces implausible bodies that lean, float, or penetrate the floor. Existing methods ignore the fact that bodies are typically supported by the scene. Physics engines can be used to enforce physical plausibility, but they are not differentiable, rely on unrealistic proxy bodies, and are difficult to integrate into existing optimization and learning frameworks. In contrast, we exploit novel intuitive-physics (IP) terms that can be inferred from a 3D SMPL body interacting with the scene. Inspired by biomechanics, we infer the pressure heatmap on the body, the Center of Pressure (CoP) from the heatmap, and the SMPL body's Center of Mass (CoM). With these, we develop IPMAN to estimate a 3D body from a color image in a "stable" configuration by encouraging plausible floor contact and overlapping CoP and CoM. Our IP terms are intuitive, easy to implement, fast to compute, differentiable, and can be integrated into existing optimization and regression methods. We evaluate IPMAN on standard datasets and MoYo, a new dataset with synchronized multi-view images, ground-truth 3D bodies with complex poses, body-floor contact, CoM, and pressure. IPMAN produces more plausible results than the state of the art, improving accuracy for static poses without hurting dynamic ones. Code and data are available for research at https://ipman.is.tue.mpg.de.
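
The CoP/CoM idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a floor at the plane z = 0 with z pointing up, treats every vertex as having equal mass, and uses a sigmoid of vertex height as a stand-in for the pressure heatmap. All function names and the parameters `contact_eps` and `sharpness` are hypothetical.

```python
import numpy as np

def pressure_weights(vertices, contact_eps=0.01, sharpness=100.0):
    """Soft per-vertex pressure proxy (assumption): close to 1 for vertices
    at or below the floor, decaying smoothly with height above it."""
    heights = vertices[:, 2]  # height above the assumed floor plane z = 0
    arg = np.clip(sharpness * (heights - contact_eps), -50.0, 50.0)
    return 1.0 / (1.0 + np.exp(arg))

def center_of_pressure(vertices, weights, eps=1e-12):
    """Pressure-weighted average of vertex positions."""
    return (weights[:, None] * vertices).sum(axis=0) / (weights.sum() + eps)

def center_of_mass(vertices):
    """Uniform-mass simplification (assumption): plain mean of the vertices."""
    return vertices.mean(axis=0)

def stability_loss(vertices):
    """Horizontal distance between CoP and CoM, both projected onto the floor."""
    cop = center_of_pressure(vertices, pressure_weights(vertices))
    com = center_of_mass(vertices)
    return float(np.linalg.norm(cop[:2] - com[:2]))

# Toy "body": four contact points on the floor and four points 1.7 m above them.
feet = np.array([[x, y, 0.0] for x in (-0.1, 0.1) for y in (-0.1, 0.1)])
head = feet + np.array([0.0, 0.0, 1.7])
upright = np.vstack([feet, head])
# Leaning variant: the upper points are shifted 0.5 m along x.
leaning = np.vstack([feet, head + np.array([0.5, 0.0, 0.0])])

print("upright loss:", stability_loss(upright))
print("leaning loss:", stability_loss(leaning))
```

In this toy setup the upright configuration gives a loss near zero, while the leaning one is penalized because its CoM projects away from the support region. Every operation is smooth, so the same terms could be written in an automatic-differentiation framework and added to an optimization or regression objective.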


