基础设施感知中无需标定的BEV表示 (Calibration-free BEV Representation for Infrastructure Perception) - 专知论文

会员服务 ·

0

CBR · 表示 · 相似性匹配 · 相机参数 · 交通场景 ·

2023 年 4 月 14 日

Calibration-free BEV Representation for Infrastructure Perception

翻译：基础设施感知中无需标定的BEV表示

Siqi Fan,Zhe Wang,Xiaoliang Huo,Yan Wang,Jingjing Liu

Effective BEV object detection on infrastructure can greatly improve traffic scenes understanding and vehicle-toinfrastructure (V2I) cooperative perception. However, cameras installed on infrastructure have various postures, and previous BEV detection methods rely on accurate calibration, which is difficult for practical applications due to inevitable natural factors (e.g., wind and snow). In this paper, we propose a Calibration-free BEV Representation (CBR) network, which achieves 3D detection based on BEV representation without calibration parameters and additional depth supervision. Specifically, we utilize two multi-layer perceptrons for decoupling the features from perspective view to front view and birdeye view under boxes-induced foreground supervision. Then, a cross-view feature fusion module matches features from orthogonal views according to similarity and conducts BEV feature enhancement with front view features. Experimental results on DAIR-V2X demonstrate that CBR achieves acceptable performance without any camera parameters and is naturally not affected by calibration noises. We hope CBR can serve as a baseline for future research addressing practical challenges of infrastructure perception.

翻译：摘要：在基础设施上有效地进行BEV目标检测可以大大改善交通场景的理解和车辆对基础设施（V2I）的协同感知。然而，基础设施上安装的相机姿态各异，以前的BEV检测方法依赖于准确的标定，由于不可避免的自然因素（如风和雪）在实际应用中难以实现标定。本文提出了一种 Calibration-free BEV Representation (CBR) 网络，它基于 BEV 表示实现了3D检测，无需标定参数和额外的深度监督。具体而言，我们利用两个多层感知器将特征从透视视图分解为前视图和鸟瞰图，并采用基于框的前景监督。然后，通过交叉视图特征融合模块，根据相似性匹配正交视图中的特征，并利用前视图特征进行BEV特征增强。DAIR-V2X上的实验结果表明，CBR在不需要任何相机参数的情况下实现了可接受的性能，并且自然不受标定噪声的影响。我们希望CBR成为未来研究基础设施感知实际挑战的基础。

0

相关内容

CBR

【视觉和语言导航:任务、方法和未来方向的综述】Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

【视觉和语言导航:任务、方法和未来方向的综述】Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

专知会员服务

36+阅读 · 2022年3月25日

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

专知会员服务

29+阅读 · 2022年3月2日

AAAI2021 | 图神经网络的异质图结构学习，Heterogeneous Graph Structure Learning for Graph Neural Networks

专知会员服务

92+阅读 · 2021年1月20日

【CVPR2020】语义增强的场景文本识别的编码-解码器框架，SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition

【CVPR2020】语义增强的场景文本识别的编码-解码器框架，SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition

专知会员服务

25+阅读 · 2020年5月22日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

【CVPR2020-Uber】物理上可实现的对抗性的例子，用于激光雷达的目标检测，Physically Realizable Adversarial Examples for LiDAR Object Detection

【CVPR2020-Uber】物理上可实现的对抗性的例子，用于激光雷达的目标检测，Physically Realizable Adversarial Examples for LiDAR Object Detection

专知会员服务

22+阅读 · 2020年4月16日

【CVPR2020】通过潦草注释的弱监督显著目标检测，Weakly-Supervised Salient Object Detection via Scribble Annotations

【CVPR2020】通过潦草注释的弱监督显著目标检测，Weakly-Supervised Salient Object Detection via Scribble Annotations

专知会员服务

39+阅读 · 2020年3月19日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ICRA 2019 论文速览 | 基于Deep Learning 的SLAM

ICRA 2019 论文速览 | 基于Deep Learning 的SLAM

计算机视觉life

41+阅读 · 2019年7月22日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

LibRec 精选：推荐系统的常用数据集

LibRec 精选：推荐系统的常用数据集

LibRec智能推荐

17+阅读 · 2019年2月15日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

泡泡机器人SLAM

11+阅读 · 2019年1月4日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

深度学习医学图像分析文献集

深度学习医学图像分析文献集

机器学习研究会

19+阅读 · 2017年10月13日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

基于视觉上下文与文字显著性的复杂自然场景中文字检测研究

国家自然科学基金

1+阅读 · 2015年12月31日

自适应两阶段非线性容积Kalman滤波融合方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于Wyner-Ziv分布式编码的无线视频通信端到端失真度估算

国家自然科学基金

0+阅读 · 2014年12月31日

面向城市空气质量细粒度感知的移动传感数据重构方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

高精度实时水汽Raman激光雷达自标定方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

惯性与高阶特征辅助的图像动态环境感知方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于阴影恢复技术的SAR三维重建与目标检测方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于地面干涉雷达监测数据的微变形特征识别方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

ERG介导组蛋白修饰调控CRMP4失活启动前列腺癌转移的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于表达残差稀疏性的遮挡人脸识别方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cross Modal Data Discovery over Structured and Unstructured Data Lakes

Arxiv

0+阅读 · 2023年6月1日

A Quality Index Metric and Method for Online Self-Assessment of Autonomous Vehicles Sensory Perception

Arxiv

0+阅读 · 2023年6月1日

Learning Explicit Contact for Implicit Reconstruction of Hand-held Objects from Monocular Images

Arxiv

0+阅读 · 2023年5月31日

Template-free Articulated Neural Point Clouds for Reposable View Synthesis

Arxiv

0+阅读 · 2023年5月30日

Robust Multimodal Failure Detection for Microservice Systems

Arxiv

0+阅读 · 2023年5月30日

Occ-BEV: Multi-Camera Unified Pre-training via 3D Scene Reconstruction

Arxiv

0+阅读 · 2023年5月30日

Global high-order numerical schemes for the time evolution of the general relativistic radiation magneto-hydrodynamics equations

Arxiv

0+阅读 · 2023年5月29日

Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications

Arxiv

20+阅读 · 2023年2月1日

Scene Graph Generation: A Comprehensive Survey

Arxiv

26+阅读 · 2022年1月3日

Deep Graph Structure Learning for Robust Representations: A Survey

Arxiv

21+阅读 · 2021年3月4日

VIP会员

文章信息

相关主题

相似性匹配

相关VIP内容

【视觉和语言导航:任务、方法和未来方向的综述】Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

【视觉和语言导航:任务、方法和未来方向的综述】Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

专知会员服务

36+阅读 · 2022年3月25日

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

【新加破南洋理工】点云的无监督表示学习综述，Unsupervised Representation Learning for Point Clouds: A Survey

专知会员服务

29+阅读 · 2022年3月2日

AAAI2021 | 图神经网络的异质图结构学习，Heterogeneous Graph Structure Learning for Graph Neural Networks

专知会员服务

92+阅读 · 2021年1月20日

【CVPR2020】语义增强的场景文本识别的编码-解码器框架，SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition

【CVPR2020】语义增强的场景文本识别的编码-解码器框架，SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition

专知会员服务

25+阅读 · 2020年5月22日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

【CVPR2020-Uber】物理上可实现的对抗性的例子，用于激光雷达的目标检测，Physically Realizable Adversarial Examples for LiDAR Object Detection

【CVPR2020-Uber】物理上可实现的对抗性的例子，用于激光雷达的目标检测，Physically Realizable Adversarial Examples for LiDAR Object Detection

专知会员服务

22+阅读 · 2020年4月16日

【CVPR2020】通过潦草注释的弱监督显著目标检测，Weakly-Supervised Salient Object Detection via Scribble Annotations

【CVPR2020】通过潦草注释的弱监督显著目标检测，Weakly-Supervised Salient Object Detection via Scribble Annotations

专知会员服务

39+阅读 · 2020年3月19日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型中的检索与结构化增强生成综述

《实现多层防御多轮交战机制的扩展型随机齐射模型》2025年最新83页

【CMU博士论文】交互驱动的人体动作估计与生成

如何避免生成式人工智能在作战中失控失效

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ICRA 2019 论文速览 | 基于Deep Learning 的SLAM

ICRA 2019 论文速览 | 基于Deep Learning 的SLAM

计算机视觉life

41+阅读 · 2019年7月22日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

LibRec 精选：推荐系统的常用数据集

LibRec 精选：推荐系统的常用数据集

LibRec智能推荐

17+阅读 · 2019年2月15日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

泡泡机器人SLAM

11+阅读 · 2019年1月4日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

深度学习医学图像分析文献集

深度学习医学图像分析文献集

机器学习研究会

19+阅读 · 2017年10月13日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

相关论文

Cross Modal Data Discovery over Structured and Unstructured Data Lakes

Arxiv

0+阅读 · 2023年6月1日

A Quality Index Metric and Method for Online Self-Assessment of Autonomous Vehicles Sensory Perception

Arxiv

0+阅读 · 2023年6月1日

Learning Explicit Contact for Implicit Reconstruction of Hand-held Objects from Monocular Images

Arxiv

0+阅读 · 2023年5月31日

Template-free Articulated Neural Point Clouds for Reposable View Synthesis

Arxiv

0+阅读 · 2023年5月30日

Robust Multimodal Failure Detection for Microservice Systems

Arxiv

0+阅读 · 2023年5月30日

Occ-BEV: Multi-Camera Unified Pre-training via 3D Scene Reconstruction

Arxiv

0+阅读 · 2023年5月30日

Global high-order numerical schemes for the time evolution of the general relativistic radiation magneto-hydrodynamics equations

Arxiv

0+阅读 · 2023年5月29日

Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications

Arxiv

20+阅读 · 2023年2月1日

Scene Graph Generation: A Comprehensive Survey

Arxiv

26+阅读 · 2022年1月3日

Deep Graph Structure Learning for Robust Representations: A Survey

Arxiv

21+阅读 · 2021年3月4日

相关基金

基于视觉上下文与文字显著性的复杂自然场景中文字检测研究

国家自然科学基金

1+阅读 · 2015年12月31日

自适应两阶段非线性容积Kalman滤波融合方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于Wyner-Ziv分布式编码的无线视频通信端到端失真度估算

国家自然科学基金

0+阅读 · 2014年12月31日

面向城市空气质量细粒度感知的移动传感数据重构方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

高精度实时水汽Raman激光雷达自标定方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

惯性与高阶特征辅助的图像动态环境感知方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于阴影恢复技术的SAR三维重建与目标检测方法研究

国家自然科学基金

1+阅读 · 2013年12月31日

基于地面干涉雷达监测数据的微变形特征识别方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

ERG介导组蛋白修饰调控CRMP4失活启动前列腺癌转移的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于表达残差稀疏性的遮挡人脸识别方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员