论文标题
汇总数据荟萃分析的仿真模型:汇总效应大小和出版偏见的评估
Simulation Models for Aggregated Data Meta-Analysis: Evaluation of Pooling Effect Sizes and Publication Biases
论文作者
论文摘要
模拟研究通常用于评估新开发的荟萃分析方法的性能。对于针对汇总数据荟萃分析而开发的方法,研究人员通常会直接模拟汇总数据,而不是模拟单个参与者数据,从中计算汇总数据。显然,汇总数据统计数据的分布特性可以从基本数据的分布假设中得出,但通常不会在出版物中明确说明。本文提供了汇总的数据统计数据的分布,这些分布是从异质的混合效应模型中得出的连续单个数据。结果,我们提供了一个直接模拟汇总数据统计信息的过程。我们还通过描述其理论差异并进行三种荟萃分析方法的模拟研究,将分布发现与文献中使用的其他汇总数据的模拟方法进行了比较:Dersimonian和Laird的汇总估算值以及调整式偏见的修剪和填充和PEESE方法。我们证明,汇总数据的仿真模型的选择可能会对荟萃分析方法的性能(结论)产生相关影响。我们建议使用多个汇总数据模拟模型来研究新方法,以确定敏感性或以其他方式使单个参与者数据模型显式,这将导致模拟中使用的汇总数据统计信息的分布选择。
Simulation studies are commonly used to evaluate the performance of newly developed meta-analysis methods. For methodology that is developed for an aggregated data meta-analysis, researchers often resort to simulation of the aggregated data directly, instead of simulating individual participant data from which the aggregated data would be calculated in reality. Clearly, distributional characteristics of the aggregated data statistics may be derived from distributional assumptions of the underlying individual data, but they are often not made explicit in publications. This paper provides the distribution of the aggregated data statistics that were derived from a heteroscedastic mixed effects model for continuous individual data. As a result, we provide a procedure for directly simulating the aggregated data statistics. We also compare our distributional findings with other simulation approaches of aggregated data used in literature by describing their theoretical differences and by conducting a simulation study for three meta-analysis methods: DerSimonian and Laird's pooled estimate and the Trim & Fill and PET-PEESE method for adjustment of publication bias. We demonstrate that the choices of simulation model for aggregated data may have a relevant impact on (the conclusions of) the performance of the meta-analysis method. We recommend the use of multiple aggregated data simulation models for investigation of new methodology to determine sensitivity or otherwise make the individual participant data model explicit that would lead to the distributional choices of the aggregated data statistics used in the simulation.