合奏多品质：不确定性定量的自适应灵活分布预测

论文标题

合奏多品质：不确定性定量的自适应灵活分布预测

Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification

论文作者

Yan, Xing, Su, Yonghua, Ma, Wenxuan

论文摘要

我们提出了一种新颖，简洁且有效的方法，用于分布预测，以量化机器学习中的不确定性。它在回归任务中包含了$ \ mathbb {p} $ \ mathbb {p}（\ mathbf {y} | \ mathbf {y} | \ mathbf {y} = x）$的自适应灵活分布预测。该条件分布的概率级别分数传播了间隔$（0,1）$，这是由具有直觉和解释性设计的加性模型增强的。我们在结构完整性和$ \ mathbb {p}（\ Mathbf {y} | \ Mathbf {X} = X）$之间寻求适应性平衡，而高斯假设导致缺乏灵活性的真实数据和高度灵活的方法（例如，在没有分布结构的情况下估计量化的量化），并且可以分布良好的结构。我们提出的这种称为EMQ的合奏多品质方法完全由数据驱动，并且可以逐渐离开高斯，并发现增强中的最佳条件分布。在UCI数据集的广泛回归任务中，我们表明EMQ与许多最近的不确定性量化方法相比，EMQ达到了最先进的性能。 Visualization results further illustrate the necessity and the merits of such an ensemble model.

We propose a novel, succinct, and effective approach for distribution prediction to quantify uncertainty in machine learning. It incorporates adaptively flexible distribution prediction of $\mathbb{P}(\mathbf{y}|\mathbf{X}=x)$ in regression tasks. This conditional distribution's quantiles of probability levels spreading the interval $(0,1)$ are boosted by additive models which are designed by us with intuitions and interpretability. We seek an adaptive balance between the structural integrity and the flexibility for $\mathbb{P}(\mathbf{y}|\mathbf{X}=x)$, while Gaussian assumption results in a lack of flexibility for real data and highly flexible approaches (e.g., estimating the quantiles separately without a distribution structure) inevitably have drawbacks and may not lead to good generalization. This ensemble multi-quantiles approach called EMQ proposed by us is totally data-driven, and can gradually depart from Gaussian and discover the optimal conditional distribution in the boosting. On extensive regression tasks from UCI datasets, we show that EMQ achieves state-of-the-art performance comparing to many recent uncertainty quantification methods. Visualization results further illustrate the necessity and the merits of such an ensemble model.

下载PDF全文

下载文献需遵守相关版权规定

论文标题