论文标题
Drivermhg:一个多模式数据集,用于动态识别驱动程序微手势和实时识别框架
DriverMHG: A Multi-Modal Dataset for Dynamic Recognition of Driver Micro Hand Gestures and a Real-Time Recognition Framework
论文作者
论文摘要
手势的使用为人类计算机(HCI)系统的繁琐界面设备提供了自然的替代方法。然而,对车载的场景对动态微手势手势的实时认识对于(i)由于(i)应自然进行手势,而不必分散驾驶员的注意力,(ii)微手势在很短的时间间隔内发生在空间上的限制区域内,(iii)应识别出一定的手势,并且要识别出一个启动的手势,并且要启用整个架构(iv)(IV),并且(IV)是一定的。嵌入式系统。在这项工作中,我们提出了一个HCI系统,以动态识别驾驶员微手势,这可能会对汽车领域产生至关重要的影响,尤其是对于安全相关问题。为此,我们最初收集了一个名为Driver Micro Hand手势的数据集(DRIVERMHG),该数据集由RGB,DEPTH和红外模态组成。通过提出一个基于轻巧的卷积神经网络(CNN)的体系结构,可以通过滑动窗口方法有效地在线运行,从而解决了微手势的动态识别挑战。对于CNN模型,应用了几个三维资源有效的网络,并分析了其性能。 3D-Mobilenetv2对在线识别手势进行了认可,该识别提供了具有相似计算复杂性的应用网络之间的最佳离线准确性。最终架构部署在实时操作的驱动程序模拟器上。我们将DrivermHG数据集和我们的源代码公开可用。
The use of hand gestures provides a natural alternative to cumbersome interface devices for Human-Computer Interaction (HCI) systems. However, real-time recognition of dynamic micro hand gestures from video streams is challenging for in-vehicle scenarios since (i) the gestures should be performed naturally without distracting the driver, (ii) micro hand gestures occur within very short time intervals at spatially constrained areas, (iii) the performed gesture should be recognized only once, and (iv) the entire architecture should be designed lightweight as it will be deployed to an embedded system. In this work, we propose an HCI system for dynamic recognition of driver micro hand gestures, which can have a crucial impact in automotive sector especially for safety related issues. For this purpose, we initially collected a dataset named Driver Micro Hand Gestures (DriverMHG), which consists of RGB, depth and infrared modalities. The challenges for dynamic recognition of micro hand gestures have been addressed by proposing a lightweight convolutional neural network (CNN) based architecture which operates online efficiently with a sliding window approach. For the CNN model, several 3-dimensional resource efficient networks are applied and their performances are analyzed. Online recognition of gestures has been performed with 3D-MobileNetV2, which provided the best offline accuracy among the applied networks with similar computational complexities. The final architecture is deployed on a driver simulator operating in real-time. We make DriverMHG dataset and our source code publicly available.