Solar: a least-angle regression for stable variable selection in high-dimensional spaces - Zhuanzhi Papers

Subsampling · Specialization · Bootstrap · Reducible · Hold-out

April 26, 2021

Solar: a least-angle regression for stable variable selection in high-dimensional spaces

Ning Xu, Timothy C. G. Fisher, Jian Hong

We propose a new algorithm for variable selection in high-dimensional data, called subsample-ordered least-angle regression (solar). Solar relies on the average $L_0$ solution path computed across subsamples and alleviates several known high-dimensional issues with lasso and least-angle regression. We illustrate in simulations that, with the same computation load, solar yields substantial improvements over lasso in terms of sparsity (a 37-64% reduction in the average number of selected variables), stability, and accuracy of variable selection. Moreover, solar supplemented with the hold-out average (an adaptation of classical post-OLS tests) successfully purges almost all of the redundant variables while retaining all of the informative variables. Using simulations and real-world data, we also illustrate numerically that sparse solar variable selection is robust to complicated dependence structures and harsh settings of the irrepresentable condition. In addition, replacing lasso with solar in an ensemble system (e.g., the bootstrap ensemble) significantly reduces the computation load of the ensemble (at least 96% fewer subsample repetitions) and improves selection sparsity. We provide a Python parallel computing package for solar (solarpy) in the supplementary file and at https://github.com/isaac2math/solar.
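The idea of an average solution path across subsamples can be illustrated with a minimal sketch. This is not the authors' solarpy implementation. The function name `average_l0_path`, the scoring rule `(p - rank) / p`, the subsample fraction, and the synthetic data are all assumptions made for illustration. The only library call used is scikit-learn's `lars_path`, which in `"lar"` mode returns the active variables in their order of entry.

```python
import numpy as np
from sklearn.linear_model import lars_path


def average_l0_path(X, y, n_subsamples=10, subsample_frac=0.9, seed=0):
    """Average, over random subsamples, a score for how early each variable
    enters the least-angle regression path (1 = first, 0 = never entered).

    Hypothetical helper for illustration; the scoring rule is an assumption.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    m = int(subsample_frac * n)
    q_sum = np.zeros(p)
    for _ in range(n_subsamples):
        # Draw a subsample without replacement and center it.
        idx = rng.choice(n, size=m, replace=False)
        Xs = X[idx] - X[idx].mean(axis=0)
        ys = y[idx] - y[idx].mean()
        # `active` lists variable indices in their order of entry.
        _, active, _ = lars_path(Xs, ys, method="lar")
        q = np.zeros(p)
        for rank, j in enumerate(active):
            q[j] = (p - rank) / p  # earlier entry -> larger score
        q_sum += q
    return q_sum / n_subsamples


# Synthetic example (assumed setup): 3 informative variables out of 50.
rng = np.random.default_rng(42)
X = rng.standard_normal((100, 50))
y = 5.0 * X[:, 0] + 5.0 * X[:, 1] + 5.0 * X[:, 2] + rng.standard_normal(100)

q = average_l0_path(X, y)
ranking = np.argsort(q)[::-1]  # variables ordered by average score
print("highest-ranked variables:", ranking[:5])
```

A selection rule would then keep the variables whose average score exceeds some threshold. How that threshold is tuned (for example, on validation data) is left open here, since the abstract does not specify it.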

