小物体检测的多分辨率注意提取器

论文标题

小物体检测的多分辨率注意提取器

MultiResolution Attention Extractor for Small Object Detection

论文作者

Zhang, Fan, Jiao, Licheng, Li, Lingling, Liu, Fang, Liu, Xu

论文摘要

小物体由于低分辨率和小尺寸而难以检测。现有的小对象检测方法主要集中于数据预处理或缩小大对象和小对象之间的差异。受到人类视力“注意”机制的启发，我们利用了两种特征提取方法来挖掘小物体最有用的信息。两种方法均基于多分辨率特征提取。我们最初设计并探索了软注意方法，但我们发现其收敛速度很慢。然后，我们提出第二种方法，一种基于注意力的特征相互作用方法，称为多分辨率注意提取器（MRAE），显示出作为小物体检测中的通用特征提取器的显着改善。在香草功能提取器中的每个构建块之后，我们附加了一个小网络，以产生注意力权重，然后加权操作以获取最终的注意力图。我们基于注意力的功能提取器是可可小对象检测基准上“硬”注意对应物（普通架构）的AP的2.0倍，证明MRAE可以通过自适应学习捕获有用的位置和上下文信息。

Small objects are difficult to detect because of their low resolution and small size. The existing small object detection methods mainly focus on data preprocessing or narrowing the differences between large and small objects. Inspired by human vision "attention" mechanism, we exploit two feature extraction methods to mine the most useful information of small objects. Both methods are based on multiresolution feature extraction. We initially design and explore the soft attention method, but we find that its convergence speed is slow. Then we present the second method, an attention-based feature interaction method, called a MultiResolution Attention Extractor (MRAE), showing significant improvement as a generic feature extractor in small object detection. After each building block in the vanilla feature extractor, we append a small network to generate attention weights followed by a weighted-sum operation to get the final attention maps. Our attention-based feature extractor is 2.0 times the AP of the "hard" attention counterpart (plain architecture) on the COCO small object detection benchmark, proving that MRAE can capture useful location and contextual information through adaptive learning.

下载PDF全文

下载文献需遵守相关版权规定

论文标题