超越注意力可视化的变换器解释性 (Transformer Interpretability Beyond Attention Visualization)

Self-attention techniques, and specifically Transformers, are dominating the field of text processing and are becoming increasingly popular in computer vision classification tasks. In order to visualize the parts of the image that led to a certain classification, existing methods either rely on the obtained attention maps or employ heuristic propagation along the attention graph. In this work, we propose a novel way to compute relevancy for Transformer networks. The method assigns local relevance based on the Deep Taylor Decomposition principle and then propagates these relevancy scores through the layers. This propagation involves attention layers and skip connections, which challenge existing methods. Our solution is based on a specific formulation that is shown to maintain the total relevancy across layers. We benchmark our method on very recent visual Transformer networks, as well as on a text classification problem, and demonstrate a clear advantage over the existing explainability methods.

翻译：自留技术,特别是变异器,正在主导文本处理领域,在计算机视觉分类任务中日益流行。为了直观图像中导致某种分类的部分,现有方法要么依靠获得的注意图,要么在关注图上进行超常传播。在这项工作中,我们提出了计算变异器网络相关性的新办法。这种方法根据深泰勒分解原则指定了地方相关性,然后通过层传播这些相关性的分数。这种传播涉及注意层和跳过连接,对现行方法提出了挑战。我们的解决办法基于一种具体的公式,显示它能够保持各层之间的完全相关性。我们把我们的方法以最近的视觉变异器网络以及文本分类问题作为基准,并展示了相对于现有解释方法的明显优势。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

注意力机制综述

专知会员服务

198+阅读 · 2021年1月26日

最新《Transformers模型》教程，64页ppt

专知会员服务

287+阅读 · 2020年11月26日

自然语言处理中的注意力机制，Attention in Natural Language Processing

专知会员服务

133+阅读 · 2020年5月30日

【上海交通大学-张拳石】可解释CNN，Interpretable CNNs for Object Classification

专知会员服务

44+阅读 · 2020年3月13日