Inferring feature importance with uncertainties in high-dimensional data - 专知 Papers

Estimation/estimator · Shapley value · Inference · Data generating process · MoDELS

2 September 2021

Inferring feature importance with uncertainties in high-dimensional data

Pål Vegard Johnsen, Inga Strümke, Signe Riemer-Sørensen, Andrew Thomas DeWan, Mette Langaas

Estimating feature importance is a significant aspect of explaining data-based models. Besides explaining the model itself, an equally relevant question is which features are important in the underlying data generating process. We present a Shapley-value-based framework for inferring the importance of individual features, including uncertainty in the estimator. We build upon the recently published feature importance measure SAGE (Shapley additive global importance) and introduce sub-SAGE, which can be estimated without resampling for tree-based models. We argue that the uncertainties can be estimated from bootstrapping and demonstrate the approach for tree ensemble methods. The framework is exemplified on synthetic data as well as high-dimensional genomics data.
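The general idea in the abstract (a Shapley-value-based global importance with bootstrap uncertainty) can be illustrated with a small sketch. This is not the authors' sub-SAGE estimator and does not use a tree model: it assumes a hypothetical fixed linear model (`WEIGHTS`), synthetic Gaussian data, a SAGE-like value function defined as the reduction in mean squared error when a feature subset is known (unknown features are averaged out over a background sample), exact Shapley values over three features, and a percentile bootstrap over the evaluation data only. All function names are made up for the illustration.

```python
import itertools
import math
import random
import statistics

WEIGHTS = [2.0, 1.0, 0.0]  # hypothetical model; feature 2 is irrelevant by construction


def model(x):
    # Stand-in for a fitted model (assumption: fixed, not retrained per bootstrap draw).
    return sum(w * v for w, v in zip(WEIGHTS, x))


def make_data(n, rng):
    # Synthetic data: independent standard-normal features, small additive noise.
    X = [[rng.gauss(0.0, 1.0) for _ in WEIGHTS] for _ in range(n)]
    y = [model(x) + rng.gauss(0.0, 0.1) for x in X]
    return X, y


def loss_given(subset, X, y, background):
    # Mean squared error when only features in `subset` are known; the others
    # are replaced by background rows and the predictions are averaged.
    total = 0.0
    for x, target in zip(X, y):
        preds = [model([x[j] if j in subset else b[j] for j in range(len(x))])
                 for b in background]
        total += (target - statistics.fmean(preds)) ** 2
    return total / len(X)


def shapley_importance(X, y, background):
    # Exact Shapley values of the game v(S) = loss(empty set) - loss(S).
    m = len(X[0])
    features = range(m)
    loss = {frozenset(s): loss_given(frozenset(s), X, y, background)
            for r in range(m + 1)
            for s in itertools.combinations(features, r)}
    phi = []
    for k in features:
        others = [j for j in features if j != k]
        total = 0.0
        for r in range(m):
            weight = (math.factorial(r) * math.factorial(m - r - 1)
                      / math.factorial(m))
            for s in itertools.combinations(others, r):
                s = frozenset(s)
                total += weight * (loss[s] - loss[s | {k}])
        phi.append(total)
    return phi


def bootstrap_intervals(X, y, background, n_boot, rng, alpha=0.05):
    # Percentile bootstrap: resample rows with replacement, recompute importances.
    n = len(X)
    draws = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        draws.append(shapley_importance([X[i] for i in idx],
                                        [y[i] for i in idx], background))
    intervals = []
    for k in range(len(X[0])):
        vals = sorted(d[k] for d in draws)
        lo = vals[int((alpha / 2) * n_boot)]
        hi = vals[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
        intervals.append((lo, hi))
    return intervals


rng = random.Random(0)
X, y = make_data(60, rng)
background, _ = make_data(15, rng)
phi = shapley_importance(X, y, background)
intervals = bootstrap_intervals(X, y, background, n_boot=20, rng=rng)
for k, (p, (lo, hi)) in enumerate(zip(phi, intervals)):
    print(f"feature {k}: importance {p:.3f}, 95% interval [{lo:.3f}, {hi:.3f}]")
```

In this toy setup the importances sum to the total loss reduction from knowing all features, and the feature the model ignores receives zero importance. Enumerating all subsets is only feasible for a handful of features; for high-dimensional data some approximation or model-specific shortcut is needed, which is the kind of problem the abstract says sub-SAGE addresses for tree-based models.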
