对认知不确定性的过多风险分析，并应用于变异推理

论文标题

对认知不确定性的过多风险分析，并应用于变异推理

Excess risk analysis for epistemic uncertainty with application to variational inference

论文作者

Futami, Futoshi, Iwata, Tomoharu, Ueda, Naonori, Sato, Issei, Sugiyama, Masashi

论文摘要

贝叶斯深度学习起着重要的作用，尤其是其评估认知不确定性（EU）的能力。由于计算复杂性问题，在实践中使用了诸如变异推理（VI）之类的近似方法来获得后验分布及其概括能力，例如，通过Pac-Bayesian理论进行了广泛的分析。然而，尽管对欧盟进行了许多数字实验，但对欧盟的分析很少。在这项研究中，我们通过关注其多余的风险来分析近似贝叶斯推论的监督学习欧盟。首先，我们从理论上显示了概括误差与广泛使用的欧盟测量值之间的新颖关系，例如预测分布的方差和互信息，并得出它们的收敛行为。接下来，我们阐明VI的目标函数如何使欧盟正规化。通过此分析，我们为VI提出了一个新的目标函数，该功能直接基于Pac-Bayesian理论直接控制预测性能和欧盟。数值实验表明，我们的算法显着改善了现有VI方法的欧盟评估。

Bayesian deep learning plays an important role especially for its ability evaluating epistemic uncertainty (EU). Due to computational complexity issues, approximation methods such as variational inference (VI) have been used in practice to obtain posterior distributions and their generalization abilities have been analyzed extensively, for example, by PAC-Bayesian theory; however, little analysis exists on EU, although many numerical experiments have been conducted on it. In this study, we analyze the EU of supervised learning in approximate Bayesian inference by focusing on its excess risk. First, we theoretically show the novel relations between generalization error and the widely used EU measurements, such as the variance and mutual information of predictive distribution, and derive their convergence behaviors. Next, we clarify how the objective function of VI regularizes the EU. With this analysis, we propose a new objective function for VI that directly controls the prediction performance and the EU based on the PAC-Bayesian theory. Numerical experiments show that our algorithm significantly improves the EU evaluation over the existing VI methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题