Data augmentation is a very practical technique that can be used to improve the generalization ability of neural networks and prevent overfitting. Recently, mixed sample data augmentation has received a lot of attention and achieved great success. In order to enhance the performance of mixed sample data augmentation, a series of recent works are devoted to obtaining and analyzing the salient regions of the image, and using the saliency area to guide the image mixing. However, obtaining the salient information of an image requires a lot of extra calculations. Different from improving performance through saliency analysis, our proposed method RandomMix mainly increases the diversity of the mixed sample to enhance the generalization ability and performance of neural networks. Moreover, RandomMix can improve the robustness of the model, does not require too much additional calculation, and is easy to insert into the training pipeline. Finally, experiments on the CIFAR-10/100, Tiny-ImageNet, ImageNet, and Google Speech Commands datasets demonstrate that RandomMix achieves better performance than other state-of-the-art mixed sample data augmentation methods.
翻译:增强数据是一种非常实用的技术,可用于提高神经网络的普及能力并防止过度适应。最近,混合样本数据增强受到了很多关注,并取得了巨大成功。为了提高混合样本数据增强的性能,最近开展了一系列工作,专门获取和分析图像的突出区域,并利用突出区域来指导图像混合。然而,获得图像的突出信息需要大量额外的计算。与通过显著分析改进性能不同的是,我们提议的随机混合方法主要增加了混合样本的多样性,以提高神经网络的普及能力和性能。此外,随机混合可以提高模型的稳健性,不需要太多额外的计算,而且很容易插入到培训管道中。最后,关于CIFAR-10-100、Tinyy-ImagiNet、图像网络和谷歌语音指令的实验显示, RomMix的性能优于其他最先进的混合样本数据增强方法。