The loss function for bounding box regression (BBR) is essential to object detection. Its good definition will bring significant performance improvement to the model. Most existing works assume that the examples in the training data are high-quality and focus on strengthening the fitting ability of BBR loss. If we blindly strengthen BBR on low-quality examples, it will jeopardize localization performance. Focal-EIoU v1 was proposed to solve this problem, but due to its static focusing mechanism (FM), the potential of non-monotonic FM was not fully exploited. Based on this idea, we propose an IoU-based loss with a dynamic non-monotonic FM named Wise-IoU (WIoU). When WIoU is applied to the state-of-the-art real-time detector YOLOv7, the AP-75 on the MS-COCO dataset is improved from 53.03% to 54.50%.
翻译:捆绑框回归(BBR) 的损失功能对于物体检测至关重要。 良好的定义将给模型带来显著的性能改进。 多数现有工作假设培训数据中的例子质量高,侧重于加强BBR损失的适当能力。 如果我们盲目地加强BBR低质量实例, 将危及本地化性能。 提出Coint- EIoU v1 来解决这个问题, 但由于它的静态聚焦机制(FM), 非调频的潜力没有被充分利用。 基于这个想法, 我们提议以动态的非monotonoc调频(Wise- IoU)为基基于IoU的损失。 当WIOU被应用到最先进的实时探测器YOLOv7时, MS-CO数据集的AP-75从53.03%提高到54.50%。