使用智多调持续方法逐步选择地物的更有力和一般选择性推论 (More Powerful and General Selective Inference for Stepwise Feature Selection using the Homotopy Continuation Approach)

Conditional selective inference (SI) has been actively studied as a new statistical inference framework for data-driven hypotheses. The basic idea of conditional SI is to make inferences conditional on the selection event characterized by a set of linear and/or quadratic inequalities. Conditional SI has been mainly studied in the context of feature selection such as stepwise feature selection (SFS). The main limitation of the existing conditional SI methods is the loss of power due to over-conditioning, which is required for computational tractability. In this study, we develop a more powerful and general conditional SI method for SFS using the homotopy method which enables us to overcome this limitation. The homotopy-based SI is especially effective for more complicated feature selection algorithms. As an example, we develop a conditional SI method for forward-backward SFS with AIC-based stopping criteria and show that it is not adversely affected by the increased complexity of the algorithm. We conduct several experiments to demonstrate the effectiveness and efficiency of the proposed method.

翻译：作为数据驱动假设的新的统计推论框架,对有条件的选择性推断进行了积极研究,这是数据驱动假设的新统计推论框架。有条件的SI的基本想法是,以一系列线性和/或二次不平等为特点的选择活动为条件推论;有条件的SI主要在特征选择方面进行了研究,如分级特征选择(SFS),现有有条件的SI方法的主要局限性是,由于超调而丧失了权力,而这是计算可移动性的必要条件。在本研究中,我们用同质调法为SFS开发了一种更强大和一般的有条件的SI方法,使我们能够克服这一限制。基于同质调制的SI对于更复杂的特征选择算法特别有效。举例来说,我们开发了一种条件性SI方法,即前向后向SFSFS,采用基于AC的停止标准,并表明它不会受到算法日益复杂的不利影响。我们进行了几次实验,以证明拟议方法的有效性和效率。

相关内容

特征选择

关注 5931

特征选择( Feature Selection )也称特征子集选择( Feature Subset Selection , FSS )，或属性选择( Attribute Selection )。是指从已有的M个特征(Feature)中选择N个特征使得系统的特定指标最优化，是从原始特征中选择出一些最有效特征以降低数据集维度的过程,是提高学习算法性能的一个重要手段,也是模式识别中关键的数据预处理步骤。对于一个学习算法来说,好的学习样本是训练模型的关键。

【CVPR2021】动态度量学习

专知会员服务

40+阅读 · 2021年3月30日

【KDD2020】图神经网络的无冗余计算

专知会员服务

38+阅读 · 2020年11月24日