Feature importance methods promise to provide a ranking of features according to importance for a given classification task. A wide range of methods exist but their rankings often disagree and they are inherently difficult to evaluate due to a lack of ground truth beyond synthetic datasets. In this work, we put feature importance methods to the test on real-world data in the domain of cardiology, where we try to distinguish three specific pathologies from healthy subjects based on ECG features comparing to features used in cardiologists' decision rules as ground truth. Some methods generally performed well and others performed poorly, while some methods did well on some but not all of the problems considered.
翻译:特征重要性方法承诺根据给定分类任务对特征进行重要性排名。虽然存在各种各样的方法,但它们的排名经常存在异议,由于缺乏除合成数据集之外的真实数据而难以进行评估。在这项工作中,我们对心脏病领域的真实数据进行了特征重要性方法测试,在此领域中,我们基于ECG特征将三种特定病理与健康受试者区分开来,并将心脏病专家的决策规则中使用的特征作为基本事实进行比较。一些方法表现良好,另一些方法表现不佳,而有些方法在某些问题上表现良好,但在其他问题上表现不佳。