校准的对抗性改进，用于随机语义分割

论文标题

校准的对抗性改进，用于随机语义分割

Calibrated Adversarial Refinement for Stochastic Semantic Segmentation

论文作者

Kassapis, Elias, Dikov, Georgi, Gupta, Deepak K., Nugteren, Cedric

论文摘要

在语义分割任务中，输入图像通常可以具有多个合理的解释，从而允许多个有效的标签。为了捕捉这种歧义，最近的工作探索了可以学习预测分布的概率网络的使用。但是，这些不一定准确地代表经验分布。在这项工作中，我们提出了一种在语义图上学习校准的预测分布的策略，在语义图中，与每个预测相关的概率反映了其基础真理正确性的可能性。 To this end, we propose a novel two-stage, cascaded approach for calibrated adversarial refinement: (i) a standard segmentation network is trained with categorical cross entropy to predict a pixelwise probability distribution over semantic classes and (ii) an adversarially trained stochastic network is used to model the inter-pixel correlations to refine the output of the first network into coherent samples.重要的是，要校准改进网络并防止模式崩溃，第二阶段的样品的期望与第一个阶段的概率相匹配。我们通过在多式LIDC数据集中以及带有歧义的改良的CityScapes数据集上实现最新的结果来证明该方法的多功能性和鲁棒性。此外，我们表明核心设计可以适应其他任务，这些任务需要通过在玩具回归数据集中进行实验来学习校准的预测分布。我们在https://github.com/eliaskassapis/carsss提供了我们方法的开源实现。

In semantic segmentation tasks, input images can often have more than one plausible interpretation, thus allowing for multiple valid labels. To capture such ambiguities, recent work has explored the use of probabilistic networks that can learn a distribution over predictions. However, these do not necessarily represent the empirical distribution accurately. In this work, we present a strategy for learning a calibrated predictive distribution over semantic maps, where the probability associated with each prediction reflects its ground truth correctness likelihood. To this end, we propose a novel two-stage, cascaded approach for calibrated adversarial refinement: (i) a standard segmentation network is trained with categorical cross entropy to predict a pixelwise probability distribution over semantic classes and (ii) an adversarially trained stochastic network is used to model the inter-pixel correlations to refine the output of the first network into coherent samples. Importantly, to calibrate the refinement network and prevent mode collapse, the expectation of the samples in the second stage is matched to the probabilities predicted in the first. We demonstrate the versatility and robustness of the approach by achieving state-of-the-art results on the multigrader LIDC dataset and on a modified Cityscapes dataset with injected ambiguities. In addition, we show that the core design can be adapted to other tasks requiring learning a calibrated predictive distribution by experimenting on a toy regression dataset. We provide an open source implementation of our method at https://github.com/EliasKassapis/CARSSS.

下载PDF全文

下载文献需遵守相关版权规定

论文标题