地方革命导致对高频反对立案例的隐含偏见 (Local Convolutions Cause an Implicit Bias towards High Frequency Adversarial Examples)

Despite great efforts, neural networks are still prone to adversarial attacks. Recent work has shown that adversarial perturbations typically contain high-frequency features, but the root cause of this phenomenon remains unknown. Inspired by the theoretical work in linear full-width convolutional models (Gunasekar et al, 2018), we hypothesize that the nonlinear local (i.e. bounded-width) convolutional models used in practice are implicitly biased to learn high frequency features, and that this is the root cause of high frequency adversarial examples. To test this hypothesis, we analyzed the impact of different choices of linear and nonlinear architectures on the implicit bias of the learned features and the adversarial perturbations, in both spatial and frequency domains. We find that the high-frequency adversarial perturbations are critically dependent on the convolution operation in two ways: (i) the translation invariance of the convolution induces an implicit bias towards sparsity in the frequency domain; and (ii) the spatially-limited nature of local convolutions induces an implicit bias towards high frequency features. The explanation for the latter involves the Fourier Uncertainty Principle: a spatially-limited (local in the space domain) filter cannot also be frequency-limited (local in the frequency domain). Furthermore, using larger convolution kernel sizes or avoiding convolutions altogether (e.g. by using Visual Transformers architecture) significantly reduces this high frequency bias, but not the overall susceptibility to attacks. Looking forward, our work strongly suggests that understanding and controlling the implicit bias of architectures will be essential for achieving adversarial robustness.

翻译：尽管做出了巨大的努力,神经网络仍然容易发生对抗性攻击。最近的工作表明,对抗性扰动通常含有高频率特征,但这一现象的根源仍然未知。受全宽线性共变模型(Gunasekar等人,2018年)理论工作启发,我们假设,在实践中使用的非线性(即捆绑-宽度)共变模式暗含偏见,以学习高频率特征,这是高频率对抗性辩论性例子的根源。为检验这一假设,我们分析了线性和非线性结构的不同选择在空间和频度范围内对所学特征的隐性偏差和非线性结构的影响。我们发现,高频率的对立性对立性在空间和频度范围内对立性模型(Gunasekar等人,2018年)的理论工作,从两方面看,非线性对立性地方(即被捆绑起来的)变异性在频率域内产生隐性偏差,(由于频率范围的变异性, 和(二) 地方变形在空间变异性性质上不会引起对高频域域内基本理解的隐性偏差。(在空间范围解释后,在空间范围上也无法理解。