Feature importance is commonly used to explain machine predictions. While feature importance can be derived from a machine learning model with a variety of methods, the consistency of feature importance across different methods remains understudied. In this work, we systematically compare feature importance from built-in mechanisms in a model, such as attention values, and from post-hoc methods that approximate model behavior, such as LIME. Using text classification as a testbed, we find that 1) regardless of the method used, important features from traditional models such as SVM and XGBoost are more similar to each other than to those from deep learning models; 2) post-hoc methods tend to generate more similar important features for two models than built-in methods do. We further demonstrate how such similarity varies across instances. Notably, important features do not always resemble each other more closely when two models agree on the predicted label than when they disagree.
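One natural way to quantify the consistency of important features across two models or methods is the overlap between their top-k feature sets (e.g., Jaccard similarity). A minimal sketch of this comparison follows; the importance scores here are randomly generated stand-ins for real model outputs (e.g., SVM coefficient magnitudes or attention-derived scores), and the variable names are purely illustrative:

```python
import numpy as np

def topk_jaccard(imp_a, imp_b, k=10):
    """Jaccard similarity between the top-k most important feature indices
    of two importance vectors defined over the same feature vocabulary."""
    top_a = set(np.argsort(-np.abs(imp_a))[:k])
    top_b = set(np.argsort(-np.abs(imp_b))[:k])
    return len(top_a & top_b) / len(top_a | top_b)

# Hypothetical importance scores from two models over a shared 100-word vocabulary.
rng = np.random.default_rng(0)
imp_svm = rng.normal(size=100)   # stand-in for, e.g., SVM coefficient magnitudes
imp_deep = rng.normal(size=100)  # stand-in for, e.g., attention-derived scores

print(round(topk_jaccard(imp_svm, imp_deep, k=10), 3))
```

A score of 1.0 means the two methods pick out the same top-k features; 0.0 means no overlap. Aggregating this measure over many instances is one way to compare built-in and post-hoc explanations across model pairs.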