GANs的梯度爆炸缓解:假的能真实 (Alleviation of Gradient Exploding in GANs: Fake Can Be Real)

In order to alleviate the notorious mode collapse phenomenon in generative adversarial networks (GANs), we propose a novel training method of GANs in which certain fake samples are considered as real ones during the training process. This strategy can reduce the gradient value that generator receives in the region where gradient exploding happens. We show the process of an unbalanced generation and a vicious circle issue resulted from gradient exploding in practical training, which explains the instability of GANs. We also theoretically prove that gradient exploding can be alleviated by penalizing the difference between discriminator outputs and fake-as-real consideration for very close real and fake samples. Accordingly, Fake-As-Real GAN (FARGAN) is proposed with a more stable training process and a more faithful generated distribution. Experiments on different datasets verify our theoretical analysis.

翻译：为了缓解基因对抗网络中臭名昭著的模式崩溃现象,我们提议对基因对抗网络采用一种新的培训方法,在培训过程中将某些假样品视为真实样品;这一战略可以降低梯度爆炸发生地区生成器的梯度值;我们展示了不平衡的一代过程,以及实际培训中的梯度爆炸造成的恶性循环问题,这解释了基因对抗网络的不稳定性;我们还从理论上证明,通过惩罚歧视产物与假冒真实考虑之间的差别,可以减轻梯度爆炸。因此,提议采用更稳定的培训过程和更加忠实的分布法,对假冒的真真真假样品进行惩罚。关于不同数据集的实验证实了我们的理论分析。

相关内容

梯度爆炸

关注 0

误差梯度是神经网络训练过程中计算的方向和数量，用于以正确的方向和合适的量更新网络权重。在深层网络或循环神经网络中，误差梯度可在更新中累积，变成非常大的梯度，然后导致网络权重的大幅更新，并因此使网络变得不稳定。在极端情况下，权重的值变得非常大，以至于溢出，导致NaN值。网络层之间的梯度（值大于 1.0）重复相乘导致的指数级增长会产生梯度爆炸。

【论文】结构GANs，Structured GANs，

专知会员服务

15+阅读 · 2020年1月16日

GANs最新综述论文: 生成式对抗网络及其变种如何有用

专知会员服务

72+阅读 · 2019年10月19日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日