深度學習: 神經注意力森林：改進的基於Transformer的森林方法 (Neural Attention Forests: Transformer-Based Forest Improvement) - 专知论文

会员服务 ·

0

IBM Watson · Transformer · 端到端 · 变换 · 表示 ·

2023 年 4 月 12 日

Neural Attention Forests: Transformer-Based Forest Improvement

翻译：深度學習: 神經注意力森林：改進的基於Transformer的森林方法

Andrei V. Konstantinov,Lev V. Utkin,Alexey A. Lukashin,Vladimir A. Muliukha

from arxiv, Submitted for the 7th International Scientific Conference "Intelligent Information Technologies for Industry" in St. Petersburg

A new approach called NAF (the Neural Attention Forest) for solving regression and classification tasks under tabular training data is proposed. The main idea behind the proposed NAF model is to introduce the attention mechanism into the random forest by assigning attention weights calculated by neural networks of a specific form to data in leaves of decision trees and to the random forest itself in the framework of the Nadaraya-Watson kernel regression. In contrast to the available models like the attention-based random forest, the attention weights and the Nadaraya-Watson regression are represented in the form of neural networks whose weights can be regarded as trainable parameters. The first part of neural networks with shared weights is trained for all trees and computes attention weights of data in leaves. The second part aggregates outputs of the tree networks and aims to minimize the difference between the random forest prediction and the truth target value from a training set. The neural network is trained in an end-to-end manner. The combination of the random forest and neural networks implementing the attention mechanism forms a transformer for enhancing the forest predictions. Numerical experiments with real datasets illustrate the proposed method. The code implementing the approach is publicly available.

翻译：本論文提出了一種名為NAF（神經注意力森林）的新方法，用於處理表格訓練數據下的回歸和分類任務。所提出的NAF模型的主要思想是通過將一個特定形式的神經網絡計算的注意力權重分配給決策樹的葉子和隨機森林本身，在Nadaraya-Watson核回歸的框架中引入注意機制。與可用模型（如基於注意力的隨機森林）不同，注意力權重和Nadaraya-Watson回歸以可訓練的參數形式表示為神經網絡的權重。對於所有樹木訓練的共享權重的第一部分計算葉子數據的注意力權重，第二部分聚合樹網絡的輸出，旨在最小化隨機森林預測和訓練集真實目標值之間的差異。神經網絡以端到端的方式訓練。隨機森林和實現注意機制的神經網絡的組合形成了一個Transformer，用於增強森林的預測能力。使用實際數據集進行的數值實驗說明了所提出的方法。實現該方法的代碼公開可用。

0

相关内容

IBM Watson

IBM 开发的继深蓝之后的新一代大型计算机。 Watson得名于IBM创始人Thomas J. Watson，是当下人工智能的最高端应用。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

72+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

50+阅读 · 2020年12月14日

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

专知会员服务

32+阅读 · 2020年4月24日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

238+阅读 · 2020年4月19日

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

专知会员服务

25+阅读 · 2020年3月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

92+阅读 · 2020年3月12日

【ICLR2020-】基于记忆的图网络，MEMORY-BASED GRAPH NETWORKS

【ICLR2020-】基于记忆的图网络，MEMORY-BASED GRAPH NETWORKS

专知会员服务

108+阅读 · 2020年2月22日

Transformer文本分类代码

Transformer文本分类代码

专知会员服务

116+阅读 · 2020年2月3日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

31+阅读 · 2019年10月17日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

39+阅读 · 2019年10月9日

ICLR'23截稿, 图神经网络依然火热 (附42 篇好文整理)

ICLR'23截稿, 图神经网络依然火热 (附42 篇好文整理)

图与推荐

1+阅读 · 2022年10月5日

深度学习注意力机制-Attention in Deep learning-附101页PPT

深度学习注意力机制-Attention in Deep learning-附101页PPT

专知

137+阅读 · 2019年9月23日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

26+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

14+阅读 · 2019年4月13日

从Seq2seq到Attention模型到Self Attention（二）

从Seq2seq到Attention模型到Self Attention（二）

量化投资与机器学习

22+阅读 · 2018年10月9日

【论文推荐】最新六篇序列推荐相关论文—卷积序列嵌入学习、用户记忆网络、上下文GRU、迁移学习

【论文推荐】最新六篇序列推荐相关论文—卷积序列嵌入学习、用户记忆网络、上下文GRU、迁移学习

专知

10+阅读 · 2018年4月8日

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

专知

19+阅读 · 2018年4月7日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

Fe-N 磁性纳米颗粒的各向异性调控及其高频性能的研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于脉冲系统方法的事件触发网络同步问题研究

国家自然科学基金

1+阅读 · 2015年12月31日

结合前馈和反馈机制的自然场景文本识别技术

国家自然科学基金

0+阅读 · 2014年12月31日

Resveratrol联合MSCs移植对阿尔茨海默鼠的干预效果及Sirt1分子信号的介导作用

国家自然科学基金

0+阅读 · 2014年12月31日

品种资源群体抗性性状QTL互作检测新方法及其应用

国家自然科学基金

0+阅读 · 2013年12月31日

基于超稀疏结构学习的压缩感知重建研究

国家自然科学基金

5+阅读 · 2013年12月31日

Trop2对CBSCs移植修复梗死心肌的影响及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

利用全基因组连锁和关联分析技术定位小麦条锈病成株抗性QTL

国家自然科学基金

0+阅读 · 2012年12月31日

非对易空间和非对易相空间中的量子物理

国家自然科学基金

0+阅读 · 2009年12月31日

基于多源观测数据的三维云融合分析算法研究

国家自然科学基金

2+阅读 · 2009年12月31日

On the Impact of Operators and Populations within Evolutionary Algorithms for the Dynamic Weighted Traveling Salesperson Problem

Arxiv

0+阅读 · 2023年5月30日

Prediction Error-based Classification for Class-Incremental Learning

Arxiv

0+阅读 · 2023年5月30日

HySST: A Stable Sparse Rapidly-Exploring Random Trees Optimal Motion Planning Algorithm for Hybrid Dynamical Systems

Arxiv

0+阅读 · 2023年5月29日

Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms

Arxiv

1+阅读 · 2023年5月27日

Understanding Sparse Feature Updates in Deep Networks using Iterative Linearisation

Arxiv

0+阅读 · 2023年5月26日

Stability of implicit neural networks for long-term forecasting in dynamical systems

Arxiv

0+阅读 · 2023年5月26日

Laplace-Approximated Neural Additive Models: Improving Interpretability with Bayesian Inference

Arxiv

0+阅读 · 2023年5月26日

Lightweight Parameter Pruning for Energy-Efficient Deep Learning: A Binarized Gating Module Approach

Arxiv

0+阅读 · 2023年5月26日

Bayesian inference with finitely wide neural networks

Arxiv

0+阅读 · 2023年5月25日

Active Bayesian Causal Inference

Arxiv

14+阅读 · 2022年10月15日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

72+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

50+阅读 · 2020年12月14日

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

专知会员服务

32+阅读 · 2020年4月24日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

238+阅读 · 2020年4月19日

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

专知会员服务

25+阅读 · 2020年3月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

92+阅读 · 2020年3月12日

【ICLR2020-】基于记忆的图网络，MEMORY-BASED GRAPH NETWORKS

【ICLR2020-】基于记忆的图网络，MEMORY-BASED GRAPH NETWORKS

专知会员服务

108+阅读 · 2020年2月22日

Transformer文本分类代码

Transformer文本分类代码

专知会员服务

116+阅读 · 2020年2月3日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

31+阅读 · 2019年10月17日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

39+阅读 · 2019年10月9日

热门VIP内容

相关资讯

ICLR'23截稿, 图神经网络依然火热 (附42 篇好文整理)

ICLR'23截稿, 图神经网络依然火热 (附42 篇好文整理)

图与推荐

1+阅读 · 2022年10月5日

深度学习注意力机制-Attention in Deep learning-附101页PPT

深度学习注意力机制-Attention in Deep learning-附101页PPT

专知

137+阅读 · 2019年9月23日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

23+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

26+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

14+阅读 · 2019年4月13日

从Seq2seq到Attention模型到Self Attention（二）

从Seq2seq到Attention模型到Self Attention（二）

量化投资与机器学习

22+阅读 · 2018年10月9日

【论文推荐】最新六篇序列推荐相关论文—卷积序列嵌入学习、用户记忆网络、上下文GRU、迁移学习

【论文推荐】最新六篇序列推荐相关论文—卷积序列嵌入学习、用户记忆网络、上下文GRU、迁移学习

专知

10+阅读 · 2018年4月8日

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

【论文推荐】最新六篇对抗自编码器相关论文—多尺度网络节点表示、生成对抗自编码、逆映射、Wasserstein、条件对抗、去噪

专知

19+阅读 · 2018年4月7日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

On the Impact of Operators and Populations within Evolutionary Algorithms for the Dynamic Weighted Traveling Salesperson Problem

Arxiv

0+阅读 · 2023年5月30日

Prediction Error-based Classification for Class-Incremental Learning

Arxiv

0+阅读 · 2023年5月30日

HySST: A Stable Sparse Rapidly-Exploring Random Trees Optimal Motion Planning Algorithm for Hybrid Dynamical Systems

Arxiv

0+阅读 · 2023年5月29日

Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms

Arxiv

1+阅读 · 2023年5月27日

Understanding Sparse Feature Updates in Deep Networks using Iterative Linearisation

Arxiv

0+阅读 · 2023年5月26日

Stability of implicit neural networks for long-term forecasting in dynamical systems

Arxiv

0+阅读 · 2023年5月26日

Laplace-Approximated Neural Additive Models: Improving Interpretability with Bayesian Inference

Arxiv

0+阅读 · 2023年5月26日

Lightweight Parameter Pruning for Energy-Efficient Deep Learning: A Binarized Gating Module Approach

Arxiv

0+阅读 · 2023年5月26日

Bayesian inference with finitely wide neural networks

Arxiv

0+阅读 · 2023年5月25日

Active Bayesian Causal Inference

Arxiv

14+阅读 · 2022年10月15日

相关基金

Fe-N 磁性纳米颗粒的各向异性调控及其高频性能的研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于脉冲系统方法的事件触发网络同步问题研究

国家自然科学基金

1+阅读 · 2015年12月31日

结合前馈和反馈机制的自然场景文本识别技术

国家自然科学基金

0+阅读 · 2014年12月31日

Resveratrol联合MSCs移植对阿尔茨海默鼠的干预效果及Sirt1分子信号的介导作用

国家自然科学基金

0+阅读 · 2014年12月31日

品种资源群体抗性性状QTL互作检测新方法及其应用

国家自然科学基金

0+阅读 · 2013年12月31日

基于超稀疏结构学习的压缩感知重建研究

国家自然科学基金

5+阅读 · 2013年12月31日

Trop2对CBSCs移植修复梗死心肌的影响及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

利用全基因组连锁和关联分析技术定位小麦条锈病成株抗性QTL

国家自然科学基金

0+阅读 · 2012年12月31日

非对易空间和非对易相空间中的量子物理

国家自然科学基金

0+阅读 · 2009年12月31日

基于多源观测数据的三维云融合分析算法研究

国家自然科学基金

2+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员