空中图像中的物体探测:大型基准和挑战 (Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges)

In the past decade, object detection has achieved significant progress in natural images but not in aerial images, due to the massive variations in the scale and orientation of objects caused by the bird's-eye view of aerial images. More importantly, the lack of large-scale benchmarks becomes a major obstacle to the development of object detection in aerial images (ODAI). In this paper, we present a large-scale Dataset of Object deTection in Aerial images (DOTA) and comprehensive baselines for ODAI. The proposed DOTA dataset contains 1,793,658 object instances of 18 categories of oriented-bounding-box annotations collected from 11,268 aerial images. Based on this large-scale and well-annotated dataset, we build baselines covering 10 state-of-the-art algorithms with over 70 configurations, where the speed and accuracy performances of each model have been evaluated. Furthermore, we provide a uniform code library for ODAI and build a website for testing and evaluating different algorithms. Previous challenges run on DOTA have attracted more than 1300 teams worldwide. We believe that the expanded large-scale DOTA dataset, the extensive baselines, the code library and the challenges can facilitate the designs of robust algorithms and reproducible research on the problem of object detection in aerial images.

翻译：过去十年来,天体探测在自然图像方面取得了显著进展,但在航空图像方面没有取得显著进展。由于鸟类对空中图像视视视视造成天体天体规模和方向的巨大变化,天体探测在自然图像上取得了显著进展,但由于天体图像视距造成天体物体天体观测规模和方向的巨大变化,更为重要的是,缺乏大规模基准成为空中图像天体探测的一大障碍。在本文件中,我们提供了空中图像天体探测物体大规模数据集(DOTA)和官方发展援助综合基线。拟议的DOTA数据集包含从11 268空中图像收集的18类定向框说明的18个物体实例。基于这一大规模和有良好说明的数据集,我们建立了涵盖10种最先进的算法的基线,其中每个模型的速度和准确性能都得到了评估。此外,我们为DODIAI提供了一个统一的代码库,并建立了一个测试和评价不同算法的网站。DOTA以往的挑战吸引了1 300多个团队。我们认为,扩大的DATA数据设置、广泛的基线、稳健的代码图书馆以及图像的重新设计中的各种挑战可以促进。