论文标题
基于耦合的融合评估一些Gibbs采样器,用于高维贝叶斯的回归和收缩先验
Coupling-based convergence assessment of some Gibbs samplers for high-dimensional Bayesian regression with shrinkage priors
论文作者
论文摘要
我们认为Markov Chain Carlo(MCMC)算法用于贝叶斯高维回归,并连续收缩先验。这些算法的一个普遍挑战是选择要执行的迭代次数。当每次迭代都很昂贵时,这是至关重要的,就像处理现代数据集时一样,例如与数千行的基因组关联研究以及最多数十万列。我们开发了量身定制的耦合技术,该技术可以通过收缩率进行高维回归的设置,从而实现了融合的实用,非反应性诊断,而无需依赖微图或长期渐近造型。通过建立正在考虑的算法的几何漂移和较小的条件,我们证明所提出的耦合具有有限的预期会议时间。我们专注于包括“马蹄铁”在内的收缩先验,我们从经验上证明了所提出的耦合的可扩展性。我们发现的一个亮点是,少于1000次迭代就足以使Gibbs采样器在100,000个协变量的回归中达到平稳性。数值结果还说明了先前对耦合计算效率的影响,并建议使用先验的先验,其中局部精度是半T分布的,其自由度大于一个。
We consider Markov chain Monte Carlo (MCMC) algorithms for Bayesian high-dimensional regression with continuous shrinkage priors. A common challenge with these algorithms is the choice of the number of iterations to perform. This is critical when each iteration is expensive, as is the case when dealing with modern data sets, such as genome-wide association studies with thousands of rows and up to hundred of thousands of columns. We develop coupling techniques tailored to the setting of high-dimensional regression with shrinkage priors, which enable practical, non-asymptotic diagnostics of convergence without relying on traceplots or long-run asymptotics. By establishing geometric drift and minorization conditions for the algorithm under consideration, we prove that the proposed couplings have finite expected meeting time. Focusing on a class of shrinkage priors which includes the 'Horseshoe', we empirically demonstrate the scalability of the proposed couplings. A highlight of our findings is that less than 1000 iterations can be enough for a Gibbs sampler to reach stationarity in a regression on 100,000 covariates. The numerical results also illustrate the impact of the prior on the computational efficiency of the coupling, and suggest the use of priors where the local precisions are Half-t distributed with degree of freedom larger than one.