Density Ratio Estimation and Neyman Pearson Classification with Missing Data


21 February 2023


Josh Givens,Song Liu,Henry W J Reeve

From arXiv. 40 pages, 11 figures. To be published in the proceedings of AISTATS 2023.

Density Ratio Estimation (DRE) is an important machine learning technique with many downstream applications. We consider the challenge of DRE when data are missing not at random (MNAR). In this setting, we show that standard DRE methods give biased results, while our proposal (M-KLIEP), an adaptation of the popular DRE procedure KLIEP, restores consistency. Moreover, we provide finite-sample estimation error bounds for M-KLIEP, which demonstrate minimax optimality with respect to both sample size and worst-case missingness. We then adapt an important downstream application of DRE, Neyman-Pearson (NP) classification, to this MNAR setting. Our procedure both controls Type I error and achieves high power, with high probability. Finally, we demonstrate promising empirical performance on both synthetic data and real-world data with simulated missingness.
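For readers unfamiliar with KLIEP, the following is a minimal sketch of the standard procedure on fully observed data, not the M-KLIEP adaptation described in the abstract. It assumes a log-linear ratio model r(x) = exp(theta . f(x)) / N(theta), where N(theta) is the average of exp(theta . f(x)) over the denominator sample, and it fits theta by plain gradient ascent. The function name `fit_kliep`, the feature map f(x) = (x, x^2), and the step-size settings are illustrative assumptions and do not come from the paper.

```python
import numpy as np


def fit_kliep(x_num, x_den, n_steps=4000, lr=0.05):
    """Standard KLIEP with a log-linear model (illustrative sketch only).

    Maximises mean_num[theta . f(x)] - log mean_den[exp(theta . f(x))],
    which is concave in theta.
    """
    def features(x):
        # Hypothetical feature map f(x) = (x, x^2) for one-dimensional inputs.
        x = np.asarray(x, dtype=float).reshape(-1, 1)
        return np.hstack([x, x ** 2])

    f_num, f_den = features(x_num), features(x_den)
    theta = np.zeros(f_num.shape[1])
    for _ in range(n_steps):
        logits = f_den @ theta
        w = np.exp(logits - logits.max())  # subtract the max for stability
        w /= w.sum()                       # self-normalised weights on x_den
        # Gradient: numerator feature mean minus weighted denominator mean.
        theta += lr * (f_num.mean(axis=0) - w @ f_den)

    def ratio(x):
        logits = f_den @ theta
        # log of the normaliser N(theta), computed stably.
        log_norm = logits.max() + np.log(np.mean(np.exp(logits - logits.max())))
        return np.exp(features(x) @ theta - log_norm)

    return theta, ratio


# Usage: numerator N(1, 1) against denominator N(0, 1); the true ratio is
# exp(x - 0.5), so theta should land near (1, 0) up to sampling noise.
rng = np.random.default_rng(0)
theta, ratio = fit_kliep(rng.normal(1.0, 1.0, 5000), rng.normal(0.0, 1.0, 5000))
```

By construction the fitted ratio averages to one over the denominator sample. The sketch treats every observation as fully observed, which is the setting in which the abstract says standard DRE methods are appropriate; with MNAR data the abstract reports that such methods are biased.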

