Deep learning researchers have a keen interest in proposing novel activation functions that can boost network performance. A good choice of activation function can significantly improve network performance. A handcrafted activation is the most common choice in neural network models. ReLU is the most common choice in the deep learning community due to its simplicity, though it has some serious drawbacks. In this paper, we propose a novel activation function based on approximating known activation functions such as Leaky ReLU, and we call this function the Smooth Maximum Unit (SMU). Replacing ReLU with SMU, we obtain a 6.22% improvement on the CIFAR100 dataset with the ShuffleNet V2 model.
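The abstract does not state the functional form of SMU. As a rough illustration of the underlying idea of smoothly approximating the maximum that defines Leaky ReLU (max(x, alpha*x)), here is a minimal PyTorch sketch. The erf-based smoothing, the parameter names `alpha` and `mu`, and the choice to make `mu` trainable are assumptions for illustration, not details taken from the paper.

```python
import torch
import torch.nn as nn


class SmoothMaximumUnit(nn.Module):
    """Sketch of a smooth-maximum activation (assumed formulation).

    Uses the smooth approximation
        max(a, b) ~ ((a + b) + (a - b) * erf(mu * (a - b))) / 2
    applied to Leaky ReLU, i.e. max(x, alpha * x). The exact form used in the
    paper may differ; this is only an illustrative sketch.
    """

    def __init__(self, alpha: float = 0.25, mu: float = 1.0, trainable_mu: bool = True):
        super().__init__()
        self.alpha = alpha
        mu_tensor = torch.tensor(float(mu))
        # Treating the smoothing parameter mu as trainable is an assumption.
        self.mu = nn.Parameter(mu_tensor) if trainable_mu else mu_tensor

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Smooth approximation of max(x, alpha * x):
        # ((1 + alpha) * x + (1 - alpha) * x * erf(mu * (1 - alpha) * x)) / 2
        return 0.5 * ((1 + self.alpha) * x
                      + (1 - self.alpha) * x * torch.erf(self.mu * (1 - self.alpha) * x))
```

In use, such an activation would simply replace `nn.ReLU()` wherever it appears in a model definition, e.g. `nn.Sequential(nn.Conv2d(3, 16, 3), SmoothMaximumUnit(), ...)`.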