能否计算出稳定和准确的神经网络? -- -- 关于深层学习的障碍和Smal的第18个问题。 (Can stable and accurate neural networks be computed? -- On the barriers of deep learning and Smale's 18th problem)

Deep learning (DL) has had unprecedented success and is now entering scientific computing with full force. However, DL suffers from a universal phenomenon: instability, despite universal approximating properties that often guarantee the existence of stable neural networks (NNs). We show the following paradox. There are basic well-conditioned problems in scientific computing where one can prove the existence of NNs with great approximation qualities, however, there does not exist any algorithm, even randomised, that can train (or compute) such a NN. Indeed, for any positive integers $K > 2$ and $L$, there are cases where simultaneously: (a) no randomised algorithm can compute a NN correct to $K$ digits with probability greater than $1/2$, (b) there exists a deterministic algorithm that computes a NN with $K-1$ correct digits, but any such (even randomised) algorithm needs arbitrarily many training data, (c) there exists a deterministic algorithm that computes a NN with $K-2$ correct digits using no more than $L$ training samples. These results provide basic foundations for Smale's 18th problem and imply a potentially vast, and crucial, classification theory describing conditions under which (stable) NNs with a given accuracy can be computed by an algorithm. We begin this theory by initiating a unified theory for compressed sensing and DL, leading to sufficient conditions for the existence of algorithms that compute stable NNs in inverse problems. We introduce Fast Iterative REstarted NETworks (FIRENETs), which we prove and numerically verify are stable. Moreover, we prove that only $\mathcal{O}(|\log(\epsilon)|)$ layers are needed for an $\epsilon$ accurate solution to the inverse problem (exponential convergence), and that the inner dimensions in the layers do not exceed the dimension of the inverse problem. Thus, FIRENETs are computationally very efficient.

翻译：深层学习( DL) 取得了前所未有的成功, 并且正在完全进入科学计算。然而, DL 存在普遍现象: 不稳定性, 尽管普遍接近性能常常保证稳定的神经网络的存在。我们展示了以下悖论。在科学计算中存在一些基本的、条件良好的问题, 从而可以证明存在具有巨大近似质量的NN( DL), 然而, 不存在任何能够( 或计算) 这样的 NNN 培训的算法, 甚至随机化算法。事实上, 对于任何正数整数 $K > 2美元和 $ $ 美元, 确实存在一些同时发生的现象:(a) 没有一种随机化的算法能够将NNR的正确度折成$ 数字, 概率大于1/2美元, (b) 存在一种确定性算法, 但是任何这样的( 甚至随机化) 算法都需要任意的很多培训数据, (c) 存在一种确定性算法, 用美元- 2 正确的位数计算方法来计算, 使用不超过 $ $ 美元培训的。这些结果可以证明一个稳定的Onal ral deal dalational 的准确性。

相关内容

Neural Networks

关注 1651

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

专知会员服务

39+阅读 · 2020年11月3日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【CMU课程：深度学习导论(Spring 2020)】“11-785 Introduction to Deep Learning | Carnegie Mellon University | Spring 2020” by Bhiksha Raj

专知会员服务

29+阅读 · 2020年2月3日

【微软&CMU】后向特征校正，深度学习如何深度学习？Backward Feature Correction: How Deep Learning Performs Deep Learning

专知会员服务

13+阅读 · 2020年1月18日

【CCL 2019】人机对话--开放领域对话系统的最新进展（Recent Advances towards Dialogue Systems in Open Domain），北京大学助理教授严睿