Survival is a key metric for evaluating standards of care for people living with HIV. In resource-limited settings, high rates of loss to follow-up (LTFU) often result in underestimation of mortality when only observed deaths are considered. Resampling, which tracks a subset of LTFU patients to ascertain their outcomes, mitigates bias and improves survival estimates. However, common estimators for survival in resampling designs, such as weighted Kaplan-Meier (KM), fail to leverage covariate information collected during repeated clinic visits, even though this information is highly predictive of survival. We propose a Targeted Maximum Likelihood Estimator (TMLE) for survival in resampling designs, which addresses these limitations by leveraging baseline and longitudinal covariates to achieve greater efficiency. Our TMLE is a plug-in estimator and is robust to misspecification of the initial model for the conditional hazard of death, guaranteeing consistency of our estimator due to known resampling probabilities. We present: (1) a fully efficient TMLE for data from resampling studies with fixed follow-up time for all participants and (2) an inverse probability of censoring weighted (IPCW) TMLE that accounts for varied follow-up times by stratifying on patients with sufficient follow-up to evaluate survival. This IPCW-TMLE can be made highly efficient through nonparametric or targeted estimation of the follow-up censoring mechanism. In simulations, our TMLE reduced variance by up to 55% compared with the commonly used weighted KM estimator while preserving nominal confidence interval coverage. These findings demonstrate the potential of our TMLE to improve survival estimation in resampling designs, offering a robust and resource-efficient framework for HIV research. Keywords: Resampling designs, Survival analysis, Targeted Maximum Likelihood Estimation, Inverse probability weighting
翻译:生存率是评估HIV感染者护理标准的关键指标。在资源有限的环境中,高失访率常导致仅基于观测死亡数据时死亡率被低估。重抽样设计通过追踪失访患者子集以确定其结局,可有效减少偏倚并提升生存估计的准确性。然而,重抽样设计中常用的生存估计方法(如加权Kaplan-Meier法)未能充分利用重复临床访视中收集的协变量信息,尽管这些信息对生存具有高度预测性。本文提出一种适用于重抽样设计的靶向最大似然估计器,通过整合基线及纵向协变量以提升估计效率。该TMLE为插件式估计器,对死亡条件风险的初始模型设定偏误具有稳健性,且基于已知重抽样概率保证估计的一致性。我们提出:(1)针对固定随访期重抽样研究的全效率TMLE;(2)通过逆概率删失加权TMLE处理变长随访时间,该方法通过对具有充分随访数据的患者分层来评估生存。IPCW-TMLE可通过随访删失机制的非参数或靶向估计实现高效性。模拟研究表明,与传统加权KM估计器相比,本方法在保持置信区间名义覆盖水平的同时,方差降低最高达55%。这些发现证明TMLE在提升重抽样设计生存估计方面的潜力,为HIV研究提供了稳健且资源高效的框架。关键词:重抽样设计,生存分析,靶向最大似然估计,逆概率加权