减轻语义级别的偏移：一种半监督的域适应方法，用于语义分割

论文标题

减轻语义级别的偏移：一种半监督的域适应方法，用于语义分割

Alleviating Semantic-level Shift: A Semi-supervised Domain Adaptation Method for Semantic Segmentation

论文作者

Wang, Zhonghao, Wei, Yunchao, Feris, Rogerior, Xiong, Jinjun, Hwu, Wen-Mei, Huang, Thomas S., Shi, Humphrey

论文摘要

从合成数据到适应真实数据的学习分割可以显着减轻人类在标记像素级面罩上的努力。该任务的关键挑战是如何减轻源域和目标域之间的数据分布差异，即减少域移动。解决这个问题的常见方法是通过对抗训练将特征分布之间的特征分布之间的差异最小化。但是，直接对齐全球特征分布无法保证从局部视图（即语义级别）保持一致性，这阻止了在源域中学习的某些语义知识，无法将其应用于目标域。为了解决这个问题，我们提出了一种名为“减轻语义级别转移（ASS）”的半监督方法，该方法可以成功地促进全球和本地观点的分布一致性。具体而言，我们通过平均与像素级掩码建议的相同类别相对应的特征来利用少数来自目标域的标记数据直接从源和目标域提取语义级特征表示。然后，我们将生产的功能喂给歧视者，以进行语义水平的对手学习，该学习与从全球视图的对抗性学习合作，以更好地减轻域的转变。我们将屁股应用于从GTA5到CityScapes以及从合成到CityScapes的两个领域适应任务。广泛的实验表明：（1）通过采用少量的带有目标域的带注释的样品，可以显着胜过当前无监督的最先进；（2）通过使用来自目标域的带注释的样本来增强合成源数据，而不会遇到过度拟合到源域的普遍问题，可以通过在整个目标数据集上训练的Oracle模型超过3点。

Learning segmentation from synthetic data and adapting to real data can significantly relieve human efforts in labelling pixel-level masks. A key challenge of this task is how to alleviate the data distribution discrepancy between the source and target domains, i.e. reducing domain shift. The common approach to this problem is to minimize the discrepancy between feature distributions from different domains through adversarial training. However, directly aligning the feature distribution globally cannot guarantee consistency from a local view (i.e. semantic-level), which prevents certain semantic knowledge learned on the source domain from being applied to the target domain. To tackle this issue, we propose a semi-supervised approach named Alleviating Semantic-level Shift (ASS), which can successfully promote the distribution consistency from both global and local views. Specifically, leveraging a small number of labeled data from the target domain, we directly extract semantic-level feature representations from both the source and the target domains by averaging the features corresponding to same categories advised by pixel-level masks. We then feed the produced features to the discriminator to conduct semantic-level adversarial learning, which collaborates with the adversarial learning from the global view to better alleviate the domain shift. We apply our ASS to two domain adaptation tasks, from GTA5 to Cityscapes and from Synthia to Cityscapes. Extensive experiments demonstrate that: (1) ASS can significantly outperform the current unsupervised state-of-the-arts by employing a small number of annotated samples from the target domain; (2) ASS can beat the oracle model trained on the whole target dataset by over 3 points by augmenting the synthetic source data with annotated samples from the target domain without suffering from the prevalent problem of overfitting to the source domain.

下载PDF全文

下载文献需遵守相关版权规定

论文标题