论文标题
针对小儿种群的深度学习方法
A Deep Learning Approach to Tongue Detection for Pediatric Population
论文作者
论文摘要
严重的残疾儿童和复杂的沟通需要在通道技术(AT)设备的使用中面临限制。常规的ATS(例如机械开关)对于非语言儿童和有限自愿运动控制的儿童不足。检测舌头手势的自动技术代表了有前途的途径。先前的研究表明,舌头检测算法对成年参与者的鲁棒性,但是需要进一步的研究才能与儿童一起使用这些方法。在这项研究中,在孩子们播放视频游戏时在自然主义环境中录制的视频中实施并评估了用于舌头识别的网络体系结构。使用级联对象检测器算法来检测参与者的面孔,并使用卷积神经网络(CNN)开发了舌姿势检测的自动分类方案。在进行评估实验中,使用成人和儿童图像对网络进行了训练。使用一对受试者的交叉验证评估网络分类精度。从五个典型发育中的儿童的视频分析获得的初步分类结果表明,预测舌头手势的准确性高达99%。此外,我们证明,仅使用儿童数据训练分类器比成人支持小儿舌姿势数据集的表现更好。
Children with severe disabilities and complex communication needs face limitations in the usage of access technology (AT) devices. Conventional ATs (e.g., mechanical switches) can be insufficient for nonverbal children and those with limited voluntary motion control. Automatic techniques for the detection of tongue gestures represent a promising pathway. Previous studies have shown the robustness of tongue detection algorithms on adult participants, but further research is needed to use these methods with children. In this study, a network architecture for tongue-out gesture recognition was implemented and evaluated on videos recorded in a naturalistic setting when children were playing a video-game. A cascade object detector algorithm was used to detect the participants' faces, and an automated classification scheme for tongue gesture detection was developed using a convolutional neural network (CNN). In evaluation experiments conducted, the network was trained using adults and children's images. The network classification accuracy was evaluated using leave-one-subject-out cross-validation. Preliminary classification results obtained from the analysis of videos of five typically developing children showed an accuracy of up to 99% in predicting tongue-out gestures. Moreover, we demonstrated that using only children data for training the classifier yielded better performance than adult's one supporting the need for pediatric tongue gesture datasets.