LaRa: 用于多卡梅拉鸟眼观察语义分割的中继和射线 (LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation) - 专知论文

会员服务 ·

0

潜在 · Projection · Attention · INFORMS · 编码器-解码器（模型） ·

2022 年 6 月 27 日

LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation

翻译：LaRa: 用于多卡梅拉鸟眼观察语义分割的中继和射线

Florent Bartoccioni,Éloi Zablocki,Andrei Bursuc,Patrick Pérez,Matthieu Cord,Karteek Alahari

Recent works in autonomous driving have widely adopted the bird's-eye-view (BEV) semantic map as an intermediate representation of the world. Online prediction of these BEV maps involves non-trivial operations such as multi-camera data extraction as well as fusion and projection into a common top-view grid. This is usually done with error-prone geometric operations (e.g., homography or back-projection from monocular depth estimation) or expensive direct dense mapping between image pixels and pixels in BEV (e.g., with MLP or attention). In this work, we present 'LaRa', an efficient encoder-decoder, transformer-based model for vehicle semantic segmentation from multiple cameras. Our approach uses a system of cross-attention to aggregate information over multiple sensors into a compact, yet rich, collection of latent representations. These latent representations, after being processed by a series of self-attention blocks, are then reprojected with a second cross-attention in the BEV space. We demonstrate that our model outperforms on nuScenes the best previous works using transformers.

翻译：最近自主驾驶工程已广泛采用鸟眼视语义图(BEV)作为世界的中间表示。这些BEV地图的在线预测涉及非三角操作,如多相机数据提取以及聚合和投射到共同的顶视图网格中。这通常与易出错的几何操作(如单层深度估计的同影或反射)或BEV图像像素和像素(如MLP或注意)之间的高密度直接测绘有关(如MLP或注意)有关。在这项工作中,我们展示了“LaRa”,一种高效的解码器、基于变异器的模型,用于从多个相机中提取车辆的语义分解。我们的方法是使用一种交叉注意系统,将多个传感器的信息汇总成一个紧凑但丰富的潜伏图。这些潜伏图在经过一系列自留区处理后,又用BEV空间的第二次交叉保护进行重新预测。我们用变压器展示了我们模型在以前最佳的变压器上的外形模型。

0

相关内容

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

30+阅读 · 2021年6月12日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

76+阅读 · 2020年7月26日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

158+阅读 · 2020年1月16日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

18+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

32+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

54+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

79+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

64+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

39+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

26+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

基于Amalgam空间的Hardy空间实变理论及其应用

国家自然科学基金

1+阅读 · 2017年12月31日

大脑后顶叶皮层内的空间编码和多感觉整合

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

MICROMEGAS探测器用于低剂量X射线成像的研究

国家自然科学基金

0+阅读 · 2013年12月31日

偏微分方程的正则性

国家自然科学基金

3+阅读 · 2012年12月31日

复几何中的对称性及其在数学物理中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

函数空间与度量测度空间上的分析

国家自然科学基金

0+阅读 · 2012年12月31日

中国汉族人群先天性脊柱侧凸TBX6候选基因多态性与相关蛋白质学研究

国家自然科学基金

0+阅读 · 2011年12月31日

Navier-Stokes方程解的适定性和粘性消失问题

国家自然科学基金

0+阅读 · 2011年12月31日

相关于算子的Orlicz-型函数空间的实变理论

国家自然科学基金

0+阅读 · 2011年12月31日

Multi-View Correlation Consistency for Semi-Supervised Semantic Segmentation

Multi-View Correlation Consistency for Semi-Supervised Semantic Segmentation

Arxiv

0+阅读 · 2022年8月17日

Efficient dynamic point cloud coding using Slice-Wise Segmentation

Arxiv

0+阅读 · 2022年8月17日

InterTrack: Interaction Transformer for 3D Multi-Object Tracking

Arxiv

0+阅读 · 2022年8月17日

EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices

Arxiv

0+阅读 · 2022年8月16日

Multiview Detection with Cardboard Human Modeling

Arxiv

0+阅读 · 2022年8月16日

An Efficient Multi-Scale Fusion Network for 3D Organ at Risk (OAR) Segmentation

Arxiv

0+阅读 · 2022年8月15日

DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning

Arxiv

0+阅读 · 2022年8月15日

Cross-Modal Object Tracking: Modality-Aware Representations and A Unified Benchmark

Arxiv

14+阅读 · 2021年11月11日

Evolving Losses for Unsupervised Video Representation Learning

Arxiv

23+阅读 · 2020年2月26日

mvn2vec: Preservation and Collaboration in Multi-View Network Embedding

Arxiv

10+阅读 · 2018年1月19日

VIP会员

文章信息

相关主题

编码器-解码器（模型）

相关VIP内容

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

30+阅读 · 2021年6月12日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

76+阅读 · 2020年7月26日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

158+阅读 · 2020年1月16日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

18+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

45+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

32+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

54+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

79+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

64+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

39+阅读 · 2019年10月9日

热门VIP内容

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

【泡泡汇总】CVPR2019 SLAM Paperlist

【泡泡汇总】CVPR2019 SLAM Paperlist

泡泡机器人SLAM

14+阅读 · 2019年6月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

26+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【推荐】深度学习目标检测全面综述

【推荐】深度学习目标检测全面综述

机器学习研究会

21+阅读 · 2017年9月13日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

相关论文

Multi-View Correlation Consistency for Semi-Supervised Semantic Segmentation

Multi-View Correlation Consistency for Semi-Supervised Semantic Segmentation

Arxiv

0+阅读 · 2022年8月17日

Efficient dynamic point cloud coding using Slice-Wise Segmentation

Arxiv

0+阅读 · 2022年8月17日

InterTrack: Interaction Transformer for 3D Multi-Object Tracking

Arxiv

0+阅读 · 2022年8月17日

EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices

Arxiv

0+阅读 · 2022年8月16日

Multiview Detection with Cardboard Human Modeling

Arxiv

0+阅读 · 2022年8月16日

An Efficient Multi-Scale Fusion Network for 3D Organ at Risk (OAR) Segmentation

Arxiv

0+阅读 · 2022年8月15日

DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning

Arxiv

0+阅读 · 2022年8月15日

Cross-Modal Object Tracking: Modality-Aware Representations and A Unified Benchmark

Arxiv

14+阅读 · 2021年11月11日

Evolving Losses for Unsupervised Video Representation Learning

Arxiv

23+阅读 · 2020年2月26日

mvn2vec: Preservation and Collaboration in Multi-View Network Embedding

Arxiv

10+阅读 · 2018年1月19日

相关基金

基于Amalgam空间的Hardy空间实变理论及其应用

国家自然科学基金

1+阅读 · 2017年12月31日

大脑后顶叶皮层内的空间编码和多感觉整合

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

MICROMEGAS探测器用于低剂量X射线成像的研究

国家自然科学基金

0+阅读 · 2013年12月31日

偏微分方程的正则性

国家自然科学基金

3+阅读 · 2012年12月31日

复几何中的对称性及其在数学物理中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

函数空间与度量测度空间上的分析

国家自然科学基金

0+阅读 · 2012年12月31日

中国汉族人群先天性脊柱侧凸TBX6候选基因多态性与相关蛋白质学研究

国家自然科学基金

0+阅读 · 2011年12月31日

Navier-Stokes方程解的适定性和粘性消失问题

国家自然科学基金

0+阅读 · 2011年12月31日

相关于算子的Orlicz-型函数空间的实变理论

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员