论文标题
通过识别人类对象对来定位枪支载体
Localizing Firearm Carriers by Identifying Human-Object Pairs
论文作者
论文摘要
在人群中对枪手的视觉识别是一个具有挑战性的问题,需要解决一个人与物体(枪支)的关联。我们通过定义人类对象的相互作用(和非相互作用)边界框来提出一种解决这个问题的新方法。在给定的图像中,分别检测到人类和枪支。每个检测到的人都与每个检测到的枪支配对,从而使我们可以创建一个包含对象和人类的配对边界框。对网络进行了训练,可以将这些配对的框架分类为携带已确定的枪支的人。进行了广泛的实验以评估算法的有效性,包括利用人类的全部姿势,手动点及其与枪支的关联。通过使用具有自适应平均池的多规模建议,对空间局部特征的了解是我们方法成功的关键。我们还通过在扩展数据集中添加更多图像并在人体擦拭之间添加更多图像并标记(包括枪支和枪手的边界框),扩展了以前的枪支检测数据集。实验结果($ ap_ {hold} = 78.5 $)证明了该方法的有效性。
Visual identification of gunmen in a crowd is a challenging problem, that requires resolving the association of a person with an object (firearm). We present a novel approach to address this problem, by defining human-object interaction (and non-interaction) bounding boxes. In a given image, human and firearms are separately detected. Each detected human is paired with each detected firearm, allowing us to create a paired bounding box that contains both object and the human. A network is trained to classify these paired-bounding-boxes into human carrying the identified firearm or not. Extensive experiments were performed to evaluate effectiveness of the algorithm, including exploiting full pose of the human, hand key-points, and their association with the firearm. The knowledge of spatially localized features is key to success of our method by using multi-size proposals with adaptive average pooling. We have also extended a previously firearm detection dataset, by adding more images and tagging in extended dataset the human-firearm pairs (including bounding boxes for firearms and gunmen). The experimental results ($AP_{hold} = 78.5$) demonstrate effectiveness of the proposed method.