Retrieval-based place recognition is an efficient and effective solution for re-localization within a pre-built map, or global data association for Simultaneous Localization and Mapping (SLAM). The accuracy of such an approach is heavily dependent on the quality of the extracted scene-level representation. While end-to-end solutions - which learn a global descriptor from input point clouds - have demonstrated promising results, such approaches are limited in their ability to enforce desirable properties at the local feature level. In this paper, we introduce a local consistency loss to guide the network towards learning local features which are consistent across revisits, hence leading to more repeatable global descriptors resulting in an overall improvement in 3D place recognition performance. We formulate our approach in an end-to-end trainable architecture called LoGG3D-Net. Experiments on two large-scale public benchmarks (KITTI and MulRan) show that our method achieves mean $F1_{max}$ scores of $0.939$ and $0.968$ on KITTI and MulRan respectively, achieving state-of-the-art performance while operating in near real-time. The open-source implementation is available at: https://github.com/csiro-robotics/LoGG3D-Net.
翻译:以检索为基础的地点识别是在预先建成的地图或全球同步本地化和绘图数据协会(SLAM)内重新定位的一个高效而有效的解决方案。这种方法的准确性在很大程度上取决于提取的现场代表质量。虽然端到端解决方案(从输入点云中学习全球描述器)已经显示出有希望的结果,但这类方法在当地地物层面实施适当属性的能力有限。在本文件中,我们引入了地方一致性损失,以指导网络学习本地特征,这些特征在重新审视中是一致的,从而导致更可重复的全球描述器导致3D地点识别性能的全面改进。我们以端到端的可培训架构制定我们的方法,称为LoGG3D-Net。对两个大型公共基准(KITTI和Mulran)的实验表明,我们的方法在KITTI和MulRan上分别达到0.939美元和0.968美元,在接近实时运行时实现了状态-D-版/MulRan。