附带结果和暴露错误分类的二次使用数据最佳多用途最佳多用途验证 (Optimal Multi-Wave Validation of Secondary Use Data with Outcome and Exposure Misclassification) - 专知论文

会员服务 ·

0

优化器 · 估计/估计量 · Extensibility · 极大似然 · INFORMS ·

2021 年 8 月 30 日

Optimal Multi-Wave Validation of Secondary Use Data with Outcome and Exposure Misclassification

翻译：附带结果和暴露错误分类的二次使用数据最佳多用途最佳多用途验证

Sarah C. Lotspeich,Gustavo G. C. Amorim,Pamela A. Shaw,Ran Tao,Bryan E. Shepherd

from arxiv, 15 pages, 3 tables, 3 figures, supplemental materials can be found in ancillary file supplement.pdf

The growing availability of observational databases like electronic health records (EHR) provides unprecedented opportunities for secondary use of such data in biomedical research. However, these data can be error-prone and need to be validated before use. It is usually unrealistic to validate the whole database due to resource constraints. A cost-effective alternative is to implement a two-phase design that validates a subset of patient records that are enriched for information about the research question of interest. Herein, we consider odds ratio estimation under differential outcome and exposure misclassification. We propose optimal designs that minimize the variance of the maximum likelihood odds ratio estimator. We develop a novel adaptive grid search algorithm that can locate the optimal design in a computationally feasible and numerically accurate manner. Because the optimal design requires specification of unknown parameters at the outset and thus is unattainable without prior information, we introduce a multi-wave sampling strategy to approximate it in practice. We demonstrate the efficiency gains of the proposed designs over existing ones through extensive simulations and two large observational studies. We provide an R package and Shiny app to facilitate the use of the optimal designs.

翻译：越来越多的观察数据库,如电子健康记录(EHR),为生物医学研究中二次使用这类数据提供了前所未有的机会。然而,这些数据可能容易出错,在使用前需要验证,由于资源有限,验证整个数据库通常不切实际。一个成本效益高的替代办法是实施一个两阶段设计,验证一组病人记录,这些记录丰富了有关研究问题的资料。在这里,我们考虑在差别结果和暴露分类错误下对差异比率进行估计。我们提出最佳设计,尽量减少最大可能性概率估计值的差异。我们开发了一种新的适应性电网搜索算法,能够以计算可行和数字准确的方式找到最佳设计。由于最佳设计首先需要说明未知参数,因此在没有事先资料的情况下是无法实现的,我们采用了多波取样战略,以便在实践中加以估计。我们通过广泛的模拟和两次大型观测研究,展示了拟议设计对现有设计的效率收益。我们提供了一套R包和Shiny App,以便利最佳设计的使用。

0

相关内容

优化器

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

专知会员服务

92+阅读 · 2020年5月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

计算机 | USENIX Security 2020等国际会议信息5条

计算机 | USENIX Security 2020等国际会议信息5条

Call4Papers

7+阅读 · 2019年4月25日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

carla 体验效果及代码

carla 体验效果及代码

CreateAMind

7+阅读 · 2018年2月3日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Bayesian non-parametric ordinal regression under a monotonicity constraint

Arxiv

0+阅读 · 2021年10月21日

On Multiply Robust Mendelian Randomization (MR$^2$) With Many Invalid Genetic Instruments

Arxiv

0+阅读 · 2021年10月20日

Discrete Teissier distribution: properties, estimation and application

Arxiv

0+阅读 · 2021年10月20日

Addressing Positivity Violations in Causal Effect Estimation using Gaussian Process Priors

Arxiv

0+阅读 · 2021年10月19日

Maximal Spaces for Approximation Rates in $\ell^1$-regularization

Arxiv

0+阅读 · 2021年10月18日

Approximate Sampling and Counting of Graphs with Near-Regular Degree Intervals

Arxiv

0+阅读 · 2021年10月18日

Multioutput Gaussian Processes with Functional Data: A Study on Coastal Flood Hazard Assessment

Arxiv

0+阅读 · 2021年10月17日

Building Degradation Index with Variable Selection for Multivariate Sensory Data

Arxiv

0+阅读 · 2021年10月17日

Generalized regression operator estimation for continuous time functional data processes with missing at random response

Arxiv

0+阅读 · 2021年10月15日

Signal Processing and Piecewise Convex Estimation

Arxiv

4+阅读 · 2018年3月14日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

专知会员服务

92+阅读 · 2020年5月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【伯克利博士论文】通过真实世界实践赋能机器人自主性

军用无人机集群技术尚未成熟——但潜力可期

人工智能安全治理白皮书（2025）

AgentOps综述：分类、挑战与未来方向

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

计算机 | USENIX Security 2020等国际会议信息5条

计算机 | USENIX Security 2020等国际会议信息5条

Call4Papers

7+阅读 · 2019年4月25日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

carla 体验效果及代码

carla 体验效果及代码

CreateAMind

7+阅读 · 2018年2月3日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Bayesian non-parametric ordinal regression under a monotonicity constraint

Arxiv

0+阅读 · 2021年10月21日

On Multiply Robust Mendelian Randomization (MR$^2$) With Many Invalid Genetic Instruments

Arxiv

0+阅读 · 2021年10月20日

Discrete Teissier distribution: properties, estimation and application

Arxiv

0+阅读 · 2021年10月20日

Addressing Positivity Violations in Causal Effect Estimation using Gaussian Process Priors

Arxiv

0+阅读 · 2021年10月19日

Maximal Spaces for Approximation Rates in $\ell^1$-regularization

Arxiv

0+阅读 · 2021年10月18日

Approximate Sampling and Counting of Graphs with Near-Regular Degree Intervals

Arxiv

0+阅读 · 2021年10月18日

Multioutput Gaussian Processes with Functional Data: A Study on Coastal Flood Hazard Assessment

Arxiv

0+阅读 · 2021年10月17日

Building Degradation Index with Variable Selection for Multivariate Sensory Data

Arxiv

0+阅读 · 2021年10月17日

Generalized regression operator estimation for continuous time functional data processes with missing at random response

Arxiv

0+阅读 · 2021年10月15日

Signal Processing and Piecewise Convex Estimation

Arxiv

4+阅读 · 2018年3月14日

微信扫码咨询专知VIP会员