Cluster-randomized trials (CRTs) are a well-established class of designs for evaluating community-based interventions. An essential task in planning these trials is determining the number of clusters and cluster sizes needed to achieve sufficient statistical power for detecting a clinically relevant effect size. While methods for evaluating the average treatment effect (ATE) for the entire study population are well-established, sample size methods for testing heterogeneity of treatment effects (HTEs), i.e., treatment-covariate interaction or difference in subpopulation-specific treatment effects, in CRTs have only recently been developed. For pre-specified analyses of HTEs in CRTs, effect-modifying covariates should, ideally, be accompanied by sample size or power calculations to ensure the trial has adequate power for the planned analyses. Power analysis for testing HTEs is more complex than for ATEs due to the additional design parameters that must be specified. Power and sample size formulas for testing HTEs via linear mixed effects (LME) models have been separately derived for different cluster-randomized designs, including single and multi-period parallel designs, crossover designs, and stepped-wedge designs, and for continuous and binary outcomes. This tutorial provides a consolidated reference guide for these methods and enhances their accessibility through an online R Shiny calculator. We further discuss key considerations for conducting sample size and power calculations to test pre-specified HTE hypotheses in CRTs, highlighting the importance of specifying advanced estimates of intracluster correlation coefficients for both outcomes and covariates, and their implications for power. The sample size methodology and calculator functionality are demonstrated through a real CRT example.
翻译:整群随机试验(CRTs)是评估社区干预措施的一类成熟设计方法。规划此类试验时,一项关键任务是确定所需的群组数量和群组规模,以确保能够以足够的统计功效检测出具有临床意义的效果量。尽管评估整个研究人群的平均处理效应(ATE)的方法已较为成熟,但用于检验整群随机试验中处理效应异质性(HTEs)——即处理-协变量交互作用或亚群特异性处理效应差异——的样本量方法直到最近才得以发展。对于整群随机试验中预先设定的HTE分析,理想情况下,效应修饰协变量应辅以样本量或功效计算,以确保试验对计划的分析具备足够的检验效能。由于需要指定额外的设计参数,检验HTE的功效分析比ATE更为复杂。针对不同整群随机设计(包括单周期与多周期平行设计、交叉设计和阶梯楔形设计)以及连续型和二分类结局,已分别推导出通过线性混合效应(LME)模型检验HTE的功效与样本量公式。本教程整合了这些方法作为参考指南,并通过在线R Shiny计算器提升了其可及性。我们进一步讨论了在整群随机试验中为检验预先设定的HTE假设进行样本量与功效计算时的关键考量,重点强调了为结局和协变量预先设定群内相关系数估计值的重要性及其对功效的影响。最后,通过一个真实的整群随机试验案例展示了样本量方法及计算器的功能。