论文标题

贝叶斯半参数协变量多元密度反卷积

Bayesian Semiparametric Covariate Informed Multivariate Density Deconvolution

论文作者

Sarkar, Abhra

论文摘要

估计不同饮食成分的长期平均摄入量的边际和关节密度是营养流行病学的重要问题。由于无法直接测量这些变量,因此通常以24小时召回摄入量的形式收集数据。然后,从观察到的但误差污染的召回中估算潜在长期平均摄入量的密度的问题成为了密度多元反卷积的问题。潜在的密度可能会随着受试者的人口统计特征(例如性别,种族,年龄等)而有所不同。但是,在存在相关的精确测量的协变量的情况下,密度去卷积的问题也从未考虑过,即使在单变量环境中也没有被考虑。我们提出了一种灵活的贝叶斯半参数方法,以协变知情的多元反卷积。我们提出的方法以最新进展和条件张量分解技术的进步为基础,不仅允许关节和边际密度随相关的预测变量而灵活地变化,而且还可以自动选择最有影响力的预测指标。重要的是,该方法还允许感兴趣的密度和测量误差的密度随着潜在的不同预测变量而变化。我们设计了马尔可夫链蒙特卡洛算法,以实现有效的后部推断,适当地适应分析的各个方面的不确定性。通过模拟实验说明了所提出方法的经验疗效。在估计不同饮食成分的协变量调整后的长期摄入量时,它在上述营养流行病学应用中证明了其实用性。补充材料包括实质性的其他详细信息,R型也可以在线获得。

Estimating the marginal and joint densities of the long-term average intakes of different dietary components is an important problem in nutritional epidemiology. Since these variables cannot be directly measured, data are usually collected in the form of 24-hour recalls of the intakes. The problem of estimating the density of the latent long-term average intakes from their observed but error contaminated recalls then becomes a problem of multivariate deconvolution of densities. The underlying densities could potentially vary with the subjects' demographic characteristics such as sex, ethnicity, age, etc. The problem of density deconvolution in the presence of associated precisely measured covariates has, however, never been considered before, not even in the univariate setting. We present a flexible Bayesian semiparametric approach to covariate informed multivariate deconvolution. Building on recent advances in copula deconvolution and conditional tensor factorization techniques, our proposed method not only allows the joint and the marginal densities to vary flexibly with the associated predictors but also allows automatic selection of the most influential predictors. Importantly, the method also allows the density of interest and the density of the measurement errors to vary with potentially different sets of predictors. We design Markov chain Monte Carlo algorithms that enable efficient posterior inference, appropriately accommodating uncertainty in all aspects of our analysis. The empirical efficacy of the proposed method is illustrated through simulation experiments. Its practical utility is demonstrated in the afore-described nutritional epidemiology applications in estimating covariate-adjusted long term intakes of different dietary components. Supplementary materials include substantive additional details and R codes are also available online.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源