The elephant in the interpretability room: Why use attention as explanation when we have saliency methods?

There is a recent surge of interest in using attention as explanation of model predictions, with mixed evidence on whether attention can be used as such. While attention conveniently gives us one weight per input token and is easily extracted, it is often unclear toward what goal it is used as explanation. We find that often that goal, whether explicitly stated or not, is to find out what input tokens are the most relevant to a prediction, and that the implied user for the explanation is a model developer. For this goal and user, we argue that input saliency methods are better suited, and that there are no compelling reasons to use attention, despite the coincidence that it provides a weight for each input. With this position paper, we hope to shift some of the recent focus on attention to saliency methods, and for authors to clearly state the goal and user for their explanations.
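To make the contrast in the abstract concrete, the following is a minimal sketch, not the authors' setup. It assumes a hypothetical toy model: a single softmax attention layer over token embeddings, followed by a linear readout to a scalar score. It shows that attention yields one non-negative weight per token, whereas gradient-times-input, one common input saliency method, yields a signed per-token relevance that also accounts for how each token changes the output. The names `forward`, `gradient_times_input`, `w_a`, and `w_o` are illustrative assumptions.

```python
import numpy as np

def softmax(scores):
    # Subtract the max for numerical stability.
    exp = np.exp(scores - np.max(scores))
    return exp / exp.sum()

def forward(E, w_a, w_o):
    """Toy model (an assumption for illustration): attention-weighted sum
    of token embeddings E, read out linearly to a scalar score y."""
    alpha = softmax(E @ w_a)      # one attention weight per token
    context = alpha @ E           # weighted average of the embeddings
    return float(context @ w_o), alpha

def gradient_times_input(E, w_a, w_o):
    """Gradient-times-input saliency, one signed score per token.
    The gradient is derived by hand for this toy model:
    dy/de_i = alpha_i * w_o + alpha_i * (v_i - y) * w_a, with v_i = w_o . e_i."""
    y, alpha = forward(E, w_a, w_o)
    v = E @ w_o
    grads = alpha[:, None] * w_o[None, :] + (alpha * (v - y))[:, None] * w_a[None, :]
    return (grads * E).sum(axis=1), grads

# Random toy inputs: 4 tokens with 3-dimensional embeddings.
rng = np.random.default_rng(0)
E = rng.normal(size=(4, 3))
w_a = rng.normal(size=3)
w_o = rng.normal(size=3)

y, alpha = forward(E, w_a, w_o)
saliency, grads = gradient_times_input(E, w_a, w_o)

print("attention weights:", np.round(alpha, 3))
print("grad x input     :", np.round(saliency, 3))
```

In this sketch the attention weights always sum to one and cannot be negative, so they cannot by themselves indicate that a token pushed the score down; the saliency scores can. Whether the two rankings agree depends on the weights and inputs.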


Related content

Attention mechanism


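The attention mechanism was first proposed in computer vision, but it became widely popular with the Google DeepMind paper "Recurrent Models of Visual Attention" [14], which used attention in an RNN model for image classification. Subsequently, Bahdanau et al., in "Neural Machine Translation by Jointly Learning to Align and Translate" [1], used an attention-like mechanism to perform translation and alignment jointly in machine translation; their work is generally regarded as the first to apply attention to NLP. Attention-based extensions of RNN models then began to be applied to a wide range of NLP tasks. More recently, how to use attention in CNNs has also become an active research topic. The figure below shows the general trend of attention research.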

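[Long read] A comprehensive discussion of attention-mechanism interpretability

Zhuanzhi (专知) member service · November 17, 2020

Causal Graphs, 52-page slide deck

Zhuanzhi (专知) member service · April 19, 2020

"Interpretable Machine Learning" (interpretable-ml), 238-page PDF

Zhuanzhi (专知) member service · February 24, 2020

Bayesian deep learning: A model-based interpretable approach

Zhuanzhi (专知) member service · January 1, 2020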