论文标题
有条件的力量和朋友:(联合国)计划的,未盲的样本量重新计算的原因和方式
Conditional Power and Friends: The Why and How of (Un)planned, Unblinded Sample Size Recalculations in Confirmatory Trials
论文作者
论文摘要
在试验期间,将试验的最终样本量调整为证据是解决计划不确定性的一种自然方法。具有自适应样本量的设计需要说明其可选的停止,以确保严格的I型错误率控制。文献中提出了各种不同的方法来维持I型误差率控制。这使得临时分析是为了在调节环境中可行的样本量重新计算。由于通常基于试验的能力来确定样本量,因此临时分析提出了一个问题,即应如何根据应计信息确定最终样本量。有条件的力量是在这种情况下经常提出的概念。由于它取决于未知的效应大小,因此我们采用严格的估计透视图,并比较假定的条件功率,观察到的条件功率以及其性质作为未知条件功率的估计量的预测能力。然后,我们证明使用方法学进行外临时分析的方法进行预计划是无效的,并且自然会导致最佳的两阶段设计的概念。我们得出的结论是,计划外的设计适应性仅应作为对试验外部新证据的反应,违反最初选择的设计或客观标准中事后变化的操作需求。最后,我们表明,通常讨论的样本量重新计算规则可能会导致矛盾的结果,并提出了两种对新出现的试验 - 外部证据做出反应的替代方法。
Adapting the final sample size of a trial to the evidence accruing during the trial is a natural way to address planning uncertainty. Designs with adaptive sample size need to account for their optional stopping to guarantee strict type-I error-rate control. A variety of different methods to maintain type-I error-rate control after unplanned changes of the initial sample size have been proposed in the literature. This makes interim analyses for the purpose of sample size recalculation feasible in a regulatory context. Since the sample size is usually determined via an argument based on the power of the trial, an interim analysis raises the question of how the final sample size should be determined conditional on the accrued information. Conditional power is a concept often put forward in this context. Since it depends on the unknown effect size, we take a strict estimation perspective and compare assumed conditional power, observed conditional power, and predictive power with respect to their properties as estimators of the unknown conditional power. We then demonstrate that pre-planning an interim analysis using methodology for unplanned interim analyses is ineffective and naturally leads to the concept of optimal two-stage designs. We conclude that unplanned design adaptations should only be conducted as reaction to trial-external new evidence, operational needs to violate the originally chosen design, or post hoc changes in the objective criterion. Finally, we show that commonly discussed sample size recalculation rules can lead to paradoxical outcomes and propose two alternative ways of reacting to newly emerging trial-external evidence.