Place recognition is a crucial task in autonomous driving, allowing vehicles to determine their position using sensor data. While most existing methods rely on contrastive learning, we explore an alternative approach by framing place recognition as a multi-class classification problem. Our method assigns discrete location labels to LiDAR scans and trains an encoder-decoder model to classify each scan's position directly. We evaluate this approach on the NuScenes dataset and show that it achieves competitive performance compared to contrastive learning-based methods while offering advantages in training efficiency and stability.
翻译:场景识别是自动驾驶中的关键任务,使车辆能够利用传感器数据确定自身位置。尽管现有方法大多依赖对比学习,本文探索了一种替代方案:将场景识别构建为多类别分类问题。我们的方法为LiDAR扫描数据分配离散位置标签,并训练编码器-解码器模型直接对每个扫描位置进行分类。我们在NuScenes数据集上评估该方法,结果表明其性能与基于对比学习的方法相当,同时在训练效率和稳定性方面具有优势。