Real-time 6D object pose estimation is essential for many real-world applications, such as robotic grasping and augmented reality. To achieve an accurate object pose estimation from RGB images in real-time, we propose an effective and lightweight model, namely High-Resolution 6D Pose Estimation Network (HRPose). We adopt the efficient and small HRNetV2-W18 as a feature extractor to reduce computational burdens while generating accurate 6D poses. With only 33\% of the model size and lower computational costs, our HRPose achieves comparable performance compared with state-of-the-art models. Moreover, by transferring knowledge from a large model to our proposed HRPose through output and feature-similarity distillations, the performance of our HRPose is improved in effectiveness and efficiency. Numerical experiments on the widely-used benchmark LINEMOD demonstrate the superiority of our proposed HRPose against state-of-the-art methods.
翻译:实时 6D 对象构成估计对于许多现实应用至关重要,例如机器人捕捉和扩大现实。为了实现一个精确的天体代表实时RGB图像的准确估计,我们提议了一个有效和轻量级模型,即高分辨率 6D 粒子估计网络(HRPose ) 。我们采用了高效和小型的 HRNetV2-W18 功能提取器,以减少计算负担,同时生成准确的 6D 构成。由于模型大小和计算成本只有33 ⁇,我们的HRPose与最新模型相比,取得了可比的性能。此外,通过输出和特征相似的蒸馏将知识从一个大模型转移到我们提议的HRPose,我们HRPose的性能提高了效力和效率。关于广泛使用的基准LINEMOD的量化实验显示了我们提议的HRPose相对于最新方法的优势。