Survival analysis is a critical tool for modeling time-to-event data. Recent deep learning-based models have reduced various modeling assumptions including proportional hazard and linearity. However, a persistent challenge remains in incorporating longitudinal covariates, with prior work largely focusing on cross-sectional features, and in assessing calibration of these models, with research primarily focusing on discrimination during evaluation. We introduce TraCeR, a transformer-based survival analysis framework for incorporating longitudinal covariates. Based on a factorized self-attention architecture, TraCeR estimates the hazard function from a sequence of measurements, naturally capturing temporal covariate interactions without assumptions about the underlying data-generating process. The framework is inherently designed to handle censored data and competing events. Experiments on multiple real-world datasets demonstrate that TraCeR achieves substantial and statistically significant performance improvements over state-of-the-art methods. Furthermore, our evaluation extends beyond discrimination metrics and assesses model calibration, addressing a key oversight in literature.
翻译:生存分析是建模时间-事件数据的关键工具。近年来基于深度学习的模型已减少了对比例风险和线性性等多种建模假设的依赖。然而,在纳入纵向协变量方面仍存在持续挑战——先前研究主要关注横截面特征,同时这些模型的校准评估也存在不足——现有评估研究多集中于区分度指标。本文提出TraCeR,一种基于Transformer的生存分析框架,专门用于整合纵向协变量。该框架基于因子化自注意力架构,能够从测量序列中估计风险函数,自然捕获时序协变量交互作用,且无需对底层数据生成过程进行假设。该框架本质设计可处理删失数据和竞争事件。在多个真实世界数据集上的实验表明,TraCeR相较于现有最优方法实现了显著且具有统计学意义的性能提升。此外,我们的评估超越了传统区分度指标,首次系统评估了模型校准性能,弥补了现有文献的关键空白。