步态循环重建和人类从遮挡序列的识别

论文标题

步态循环重建和人类从遮挡序列的识别

Gait Cycle Reconstruction and Human Identification from Occluded Sequences

论文作者

Paul, Abhishek, Jain, Manav Mukesh, Jain, Jinesh, Chattopadhyay, Pratik

论文摘要

从基于计算机视觉的技术在监视站点捕获的视频中，基于步态的人识别非常具有挑战性，因为这些步行序列通常会被遮挡损坏，并且步态的完整周期并不总是可用。在这项工作中，我们提出了一个有效的基于神经网络的模型，以在进行步态识别之前重建输入序列中的遮挡框架。具体而言，我们采用LSTM网络来预测从向前和向后方向的每个遮挡框架的嵌入，然后通过使用残留块和卷积层网络来融合两个LSTM的预测。虽然LSTM训练以最大程度地减少均方损失，但训练了融合网络，以优化地面真相和重建样品之间的像素跨透明镜损失。使用了由OU-ISIR LP和CASIA-B数据产生的合成遮挡序列以及TUM-IITKGP数据中存在的实重序列，对我们的方法进行了评估。通过一些流行的步态识别方法，通过骰子评分和基于步态的识别精度来验证了提出的重建模型的有效性。与步态识别中现有的遮挡处理方法的比较研究突出了我们所提出的闭塞重建方法的优越性。

Gait-based person identification from videos captured at surveillance sites using Computer Vision-based techniques is quite challenging since these walking sequences are usually corrupted with occlusion, and a complete cycle of gait is not always available. In this work, we propose an effective neural network-based model to reconstruct the occluded frames in an input sequence before carrying out gait recognition. Specifically, we employ LSTM networks to predict an embedding for each occluded frame both from the forward and the backward directions, and next fuse the predictions from the two LSTMs by employing a network of residual blocks and convolutional layers. While the LSTMs are trained to minimize the mean-squared loss, the fusion network is trained to optimize the pixel-wise cross-entropy loss between the ground-truth and the reconstructed samples. Evaluation of our approach has been done using synthetically occluded sequences generated from the OU-ISIR LP and CASIA-B data and real-occluded sequences present in the TUM-IITKGP data. The effectiveness of the proposed reconstruction model has been verified through the Dice score and gait-based recognition accuracy using some popular gait recognition methods. Comparative study with existing occlusion handling methods in gait recognition highlights the superiority of our proposed occlusion reconstruction approach over the others.

下载PDF全文

下载文献需遵守相关版权规定

论文标题