The development of supervised deep learning-based methods for multi-label scene classification (MLC) is one of the prominent research directions in remote sensing (RS). However, collecting annotations for large RS image archives is time-consuming and costly. To address this issue, several data augmentation methods have been introduced in RS. Among others, the CutMix data augmentation technique, which combines parts of two existing training images to generate an augmented image, stands out as a particularly effective approach. However, the direct application of CutMix in RS MLC can lead to the erasure or addition of class labels (i.e., label noise) in the augmented (i.e., combined) training image. To address this problem, we introduce a label propagation (LP) strategy that allows the effective application of CutMix in the context of MLC problems in RS without being affected by label noise. To this end, our proposed LP strategy exploits pixel-level class positional information to update the multi-label of the augmented training image. We propose to access such class positional information from reference maps (e.g., thematic products) associated with each training image or from class explanation masks provided by an explanation method if no reference maps are available. Similarly to pairing two training images, our LP strategy carries out a pairing operation on the associated pixel-level class positional information to derive the updated multi-label for the augmented image. Experimental results show the effectiveness of our LP strategy in general (e.g., an improvement of 2% to 4% mAP macro compared to standard CutMix) and its robustness in the case of various simulated and real scenarios with noisy class positional information in particular. Code is available at https://git.tu-berlin.de/rsim/cutmix_lp.
翻译:基于监督深度学习的多标签场景分类(MLC)方法是遥感(RS)领域的重要研究方向之一。然而,为大规模遥感图像档案收集标注耗时且成本高昂。为解决此问题,遥感领域已引入多种数据增强方法。其中,CutMix数据增强技术通过组合两幅现有训练图像的部分区域生成增强图像,成为一种尤为有效的方法。然而,在遥感多标签分类中直接应用CutMix可能导致增强(即组合)训练图像中出现类别标签的擦除或新增(即标签噪声)。针对该问题,本文提出一种标签传播(LP)策略,使得CutMix能够有效应用于遥感多标签分类问题而不受标签噪声影响。为此,所提出的LP策略利用像素级类别位置信息来更新增强训练图像的多标签。我们建议通过两种方式获取此类位置信息:若存在参考图(如专题产品),则从每幅训练图像关联的参考图中提取;若无参考图,则通过解释方法提供的类别解释掩码获取。与配对两幅训练图像类似,我们的LP策略对关联的像素级类别位置信息执行配对操作,从而推导出增强图像的更新多标签。实验结果表明:1)LP策略整体效果显著(较标准CutMix在mAP宏平均指标上提升2%至4%);2)在具有噪声类别位置信息的各类模拟及真实场景中表现出强鲁棒性。代码发布于https://git.tu-berlin.de/rsim/cutmix_lp。