在神经网络和内核机器中进行特色学习,这些神经网络和内核机器可反复学习特征 (Feature learning in neural networks and kernel machines that recursively learn features)

Neural networks have achieved impressive results on many technological and scientific tasks. Yet, their empirical successes have outpaced our fundamental understanding of their structure and function. By identifying mechanisms driving the successes of neural networks, we can provide principled approaches for improving neural network performance and develop simple and effective alternatives. In this work, we isolate the key mechanism driving feature learning in fully connected neural networks by connecting neural feature learning to the average gradient outer product. We subsequently leverage this mechanism to design \textit{Recursive Feature Machines} (RFMs), which are kernel machines that learn features. We show that RFMs (1) accurately capture features learned by deep fully connected neural networks, (2) close the gap between kernel machines and fully connected networks, and (3) surpass a broad spectrum of models including neural networks on tabular data. Furthermore, we demonstrate that RFMs shed light on recently observed deep learning phenomena such as grokking, lottery tickets, simplicity biases, and spurious features. We provide a Python implementation to make our method broadly accessible [\href{https://github.com/aradha/recursive_feature_machines}{GitHub}].

翻译：在许多技术和科学任务上,神经网络取得了令人印象深刻的成果。然而,它们的实证成功超过了我们对自身结构和功能的基本理解。通过确定推动神经网络成功的机制,我们可以提供改善神经网络性能和开发简单而有效的替代方法的原则性方法。在这项工作中,我们通过将神经特征学习与平均梯度外产品连接起来,将完全连接的神经网络学习的关键机制特征分离出来。我们随后利用这一机制设计了具有内核特征的机器。我们表明,RFM(1) 准确地捕捉了完全连接的神经网络所学到的特征,(2) 缩小了内核机器和完全连接的网络之间的差距,(3) 超越了广泛的模型范围,包括表格数据的神经网络。此外,我们证明,RFM为最近观察到的深层学习现象,如石化、彩票、简单偏差和假性特征等提供了亮光。我们提供了一种Python实施方法,使我们的方法能够广泛获得[hrefes://github.com/aradha/recuritrus_strain_strain_matural_matystrat_stratriat_

相关内容

Neural Networks

关注 1651

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

专知会员服务

39+阅读 · 2020年11月3日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别