It has been shown that perturbing the input during training implicitly regularises the gradient of the learnt function, leading to smoother models and enhancing generalisation. However, previous research mostly considered the addition of ambient noise in the input space, without considering the underlying structure of the data. In this work, we propose several strategies of adding geometry-aware input noise that accounts for the lower dimensional manifold the input space inhabits. We start by projecting ambient Gaussian noise onto the tangent space of the manifold. In a second step, the noise sample is mapped on the manifold via the associated geodesic curve. We also consider Brownian motion noise, which moves in random steps along the manifold. We show that geometry-aware noise leads to improved generalisation and robustness to hyperparameter selection on highly curved manifolds, while performing at least as well as training without noise on simpler manifolds. Our proposed framework extends to data manifolds approximated by generative models and we observe similar trends on the MNIST digits dataset.
翻译:已有研究表明,在训练过程中对输入进行扰动可以隐式正则化学习函数的梯度,从而获得更平滑的模型并提升泛化能力。然而,先前研究主要考虑在输入空间中添加环境噪声,而未考虑数据的底层结构。本文提出多种几何感知输入噪声注入策略,这些策略考虑了输入空间所处的低维流形结构。我们首先将环境高斯噪声投影到流形的切空间上;第二步,通过相应的测地线将噪声样本映射到流形上。此外,我们还考虑了沿流形随机步进的布朗运动噪声。实验表明,在高曲率流形上,几何感知噪声能提升泛化能力并增强超参数选择的鲁棒性;而在较简单的流形上,其性能至少与无噪声训练相当。我们提出的框架可扩展至由生成模型近似的数据流形,并在MNIST手写数字数据集上观察到类似的趋势。