MLRSNET：多标签高空间分辨率遥感数据集用于语义场景理解

论文标题

MLRSNET：多标签高空间分辨率遥感数据集用于语义场景理解

MLRSNet: A Multi-label High Spatial Resolution Remote Sensing Dataset for Semantic Scene Understanding

论文作者

Qi, Xiaoman, Zhu, PanPan, Wang, Yuebin, Zhang, Liqiang, Peng, Junhuan, Wu, Mengfan, Chen, Jialong, Zhao, Xudong, Zang, Ning, Mathiopoulos, P. Takis

论文摘要

为了更好地了解遥感领域中的场景图像，需要对场景图像的多标签注释。此外，为了提高深度学习模型的性能，以处理语义场景理解任务，至关重要的是在大规模注释的数据上训练它们。但是，大多数现有的数据集由单个标签注释，该标签无法很好地描述复杂的遥感图像，因为场景图像可能具有多个土地覆盖类别。几乎没有开发多标签高空间分辨率遥感数据集来训练基于多标签的任务的深度学习模型，例如场景分类和图像检索。为了解决这个问题，在本文中，我们构建了一个名为MLRSNET的多标签高空间分辨率遥感数据集，用于语义场景，从高间接的角度深入学习。它由高分辨率光学卫星或空中图像组成。 MLRSNET在46个场景类别中总共包含109,161个样本，每个图像至少具有60个预定义标签中的一个。我们设计了视觉识别任务，包括基于多标签的图像分类和图像检索，其中使用MLRSNET评估了各种深度学习方法。实验结果表明，MLRSNET是未来研究的重要基准，它补充了当前使用的数据集（例如Imagenet），该数据集填补了多标签图像研究中的空白。此外，我们将继续扩展MLRSNET。 MLRSNET和所有相关材料已在https://data.mendeley.com/datasets/7j9bv9vwsx/2和https://github.com/cugbrs/mlrsnet.git上公开提供。

To better understand scene images in the field of remote sensing, multi-label annotation of scene images is necessary. Moreover, to enhance the performance of deep learning models for dealing with semantic scene understanding tasks, it is vital to train them on large-scale annotated data. However, most existing datasets are annotated by a single label, which cannot describe the complex remote sensing images well because scene images might have multiple land cover classes. Few multi-label high spatial resolution remote sensing datasets have been developed to train deep learning models for multi-label based tasks, such as scene classification and image retrieval. To address this issue, in this paper, we construct a multi-label high spatial resolution remote sensing dataset named MLRSNet for semantic scene understanding with deep learning from the overhead perspective. It is composed of high-resolution optical satellite or aerial images. MLRSNet contains a total of 109,161 samples within 46 scene categories, and each image has at least one of 60 predefined labels. We have designed visual recognition tasks, including multi-label based image classification and image retrieval, in which a wide variety of deep learning approaches are evaluated with MLRSNet. The experimental results demonstrate that MLRSNet is a significant benchmark for future research, and it complements the current widely used datasets such as ImageNet, which fills gaps in multi-label image research. Furthermore, we will continue to expand the MLRSNet. MLRSNet and all related materials have been made publicly available at https://data.mendeley.com/datasets/7j9bv9vwsx/2 and https://github.com/cugbrs/MLRSNet.git.

下载PDF全文

下载文献需遵守相关版权规定

论文标题