论文标题

TSAA:在拥挤的对象检测中锚定漂移的两阶段锚分配方法

TSAA: A Two-Stage Anchor Assignment Method towards Anchor Drift in Crowded Object Detection

论文作者

Xiang, Li, Miao, He, Haibo, Luo, Huiyuan, Yang, Jiajie, Xiao

论文摘要

在当前基于锚的检测器中,将直观地分配一个正锚盒,以将其重叠的对象分配。每个锚的分配标签将直接确定相应预测框的优化方向,包括框回归和类别预测的方向。但是,在我们对物体检测的实践中,结果表明,当多个对象重叠时,积极的锚并不总是会回归对象最大的对象。我们将其命名为锚漂移。锚漂移反映出,锚点匹配关系由锚和对象之间的重叠程度确定,并不总是最佳的。在过去的培训过程中,固定匹配关系与学习经验之间的冲突可能会导致模棱两可的预测,从而提高虚假阳性率。在本文中,提出了一种简单但有效的自适应两阶段锚定(TSAA)方法。它利用最终的预测框,而不是固定锚来计算与对象的重叠度,以确定每个锚点要回归的对象。预测框的参与使锚定对象分配机制自适应。对CrowdHuman和Coco的三个经典视网膜,更快的RCNN和Yolov3进行了广泛的实验,以评估TSAA的有效性。结果表明,TSAA可以显着改善检测器的性能,而无需额外的计算成本或网络结构变化。

Among current anchor-based detectors, a positive anchor box will be intuitively assigned to the object that overlaps it the most. The assigned label to each anchor will directly determine the optimization direction of the corresponding prediction box, including the direction of box regression and category prediction. In our practice of crowded object detection, however, the results show that a positive anchor does not always regress toward the object that overlaps it the most when multiple objects overlap. We name it anchor drift. The anchor drift reflects that the anchor-object matching relation, which is determined by the degree of overlap between anchors and objects, is not always optimal. Conflicts between the fixed matching relation and learned experience in the past training process may cause ambiguous predictions and thus raise the false-positive rate. In this paper, a simple but efficient adaptive two-stage anchor assignment (TSAA) method is proposed. It utilizes the final prediction boxes rather than the fixed anchors to calculate the overlap degree with objects to determine which object to regress for each anchor. The participation of the prediction box makes the anchor-object assignment mechanism adaptive. Extensive experiments are conducted on three classic detectors RetinaNet, Faster-RCNN and YOLOv3 on CrowdHuman and COCO to evaluate the effectiveness of TSAA. The results show that TSAA can significantly improve the detectors' performance without additional computational costs or network structure changes.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源