论文标题
微生物组数据的贝叶斯模型,用于同时鉴定协变量关联和表型结果的预测
A Bayesian model of microbiome data for simultaneous identification of covariate associations and prediction of phenotypic outcomes
论文作者
论文摘要
有关人类微生物组研究的主要研究问题之一是设计干预措施的可行性,该干预措施调节微生物组的组成以促进健康和治愈疾病。这需要广泛了解微生物组的调节因子,例如饮食摄入,以及微生物组成与表型结果之间的关系,例如体重指数(BMI)。以前的努力已经采用了两步方法来分别建模这些数据,从而产生对结果的有偏见的解释。在这里,我们提出了一个贝叶斯联合模型,该模型同时识别与微生物组成数据相关的临床协变量,并使用组成数据中包含的信息预测表型反应。使用尖峰和斜线先验,我们的方法可以处理高维成分和临床数据。此外,我们通过微生物样品中通常发现的平衡和过度分散来适应数据的组成结构。我们应用模型来了解饮食摄入,微生物样品和BMI之间的关系。在此分析中,我们发现微生物类群与饮食因素之间的许多关联可能导致微生物组,这通常对慢性疾病(例如肥胖症)的发展更为热情。此外,我们在模拟数据上证明了我们的方法如何优于两步方法并提出灵敏度分析。
One of the major research questions regarding human microbiome studies is the feasibility of designing interventions that modulate the composition of the microbiome to promote health and cure disease. This requires extensive understanding of the modulating factors of the microbiome, such as dietary intake, as well as the relation between microbial composition and phenotypic outcomes, such as body mass index (BMI). Previous efforts have modeled these data separately, employing two-step approaches that can produce biased interpretations of the results. Here, we propose a Bayesian joint model that simultaneously identifies clinical covariates associated with microbial composition data and predicts a phenotypic response using information contained in the compositional data. Using spike-and-slab priors, our approach can handle high-dimensional compositional as well as clinical data. Additionally, we accommodate the compositional structure of the data via balances and overdispersion typically found in microbial samples. We apply our model to understand the relations between dietary intake, microbial samples, and BMI. In this analysis, we find numerous associations between microbial taxa and dietary factors that may lead to a microbiome that is generally more hospitable to the development of chronic diseases, such as obesity. Additionally, we demonstrate on simulated data how our method outperforms two-step approaches and also present a sensitivity analysis.