Deep learning (DL) has had unprecedented success and is now entering scientific computing with full force. However, current DL methods typically suffer from instability, even when universal approximation properties guarantee the existence of stable neural networks (NNs). We address this paradox by demonstrating basic well-conditioned problems in scientific computing where one can prove the existence of NNs with great approximation qualities; however, there does not exist any algorithm, even a randomised one, that can train (or compute) such a NN. For any positive integers $K > 2$ and $L$, there are cases where simultaneously: (a) no randomised training algorithm can compute a NN correct to $K$ digits with probability greater than $1/2$; (b) there exists a deterministic training algorithm that computes a NN with $K-1$ correct digits, but any such (even randomised) algorithm needs arbitrarily many training data; (c) there exists a deterministic training algorithm that computes a NN with $K-2$ correct digits using no more than $L$ training samples. These results imply a classification theory describing conditions under which (stable) NNs with a given accuracy can be computed by an algorithm. We begin this theory by establishing sufficient conditions for the existence of algorithms that compute stable NNs in inverse problems. We introduce Fast Iterative REstarted NETworks (FIRENETs), which we both prove and numerically verify are stable. Moreover, we prove that only $\mathcal{O}(|\log(\epsilon)|)$ layers are needed for an $\epsilon$-accurate solution to the inverse problem.
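As a rough illustrative sketch of where the layer count comes from (the contraction factor $\nu$ below is an expository assumption, not part of the statement above): if each FIRENET layer contracts the reconstruction error by a fixed factor $\nu \in (0,1)$, then $n$ layers reduce the error to at most $\nu^{n}$ times its initial size, so $\epsilon$-accuracy is reached once
% heuristic only: $\nu$ and the geometric-contraction assumption are illustrative, not taken from the abstract
\[
  \nu^{\,n} \le \epsilon
  \quad\Longleftrightarrow\quad
  n \;\ge\; \frac{\log \epsilon}{\log \nu}
  \;=\; \frac{|\log \epsilon|}{|\log \nu|}
  \;=\; \mathcal{O}\bigl(|\log(\epsilon)|\bigr).
\]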