分离潜在空间的语义不确定性间隔

论文标题

分离潜在空间的语义不确定性间隔

Semantic uncertainty intervals for disentangled latent spaces

论文作者

Sankaranarayanan, Swami, Angelopoulos, Anastasios N., Bates, Stephen, Romano, Yaniv, Isola, Phillip

论文摘要

计算机视觉中有意义的不确定性量化需要有关语义信息的推理 - 例如，照片中的人的头发颜色或街上汽车的位置。为此，最近在生成建模方面的突破使我们能够在分离的潜在空间中代表语义信息，但是在语义潜在变量上提供不确定性仍然具有挑战性。在这项工作中，我们提供了原则上的不确定性间隔，这些间隔可保证为任何基本生成模型包含真正的语义因素。该方法执行以下操作：（1）它使用分位数回归来输出潜在空间中每个元素的启发式不确定性间隔（2）校准了这些不确定性，以便它们包含新的，看不见的输入的潜在值。然后可以通过发电机传播这些校准间隔的终点，以为每个语义因素产生可解释的不确定性可视化。该技术可靠地传达了语义上有意义的，有原则和实例自适应的不确定性，例如图像超分辨率和图像完成。

Meaningful uncertainty quantification in computer vision requires reasoning about semantic information -- say, the hair color of the person in a photo or the location of a car on the street. To this end, recent breakthroughs in generative modeling allow us to represent semantic information in disentangled latent spaces, but providing uncertainties on the semantic latent variables has remained challenging. In this work, we provide principled uncertainty intervals that are guaranteed to contain the true semantic factors for any underlying generative model. The method does the following: (1) it uses quantile regression to output a heuristic uncertainty interval for each element in the latent space (2) calibrates these uncertainties such that they contain the true value of the latent for a new, unseen input. The endpoints of these calibrated intervals can then be propagated through the generator to produce interpretable uncertainty visualizations for each semantic factor. This technique reliably communicates semantically meaningful, principled, and instance-adaptive uncertainty in inverse problems like image super-resolution and image completion.

下载PDF全文

下载文献需遵守相关版权规定

论文标题