论文标题
验证和概括像素方面的相关性在接受面部分类训练的卷积神经网络中
Validation and generalization of pixel-wise relevance in convolutional neural networks trained for face classification
论文作者
论文摘要
在科学,治理和更广泛的社会中,对卷积神经网络的面部识别的越来越多,对方法产生了急需,可以表明这些“黑匣子”决定是如何做出的。为了对人类有可能的解释和有用,这种方法应以一种适用于输入数据中随机初始化或虚假相关性的方式来传达模型的学习分类策略。为此,我们应用了层次相关性传播(LRP)的分解像素归因方法来解决几类培训面部识别的VGG-16模型的决策。然后,我们量化了这些相关性如何随关键模型参数而变化和推广,例如预处理数据集(ImageNet或vggface),Finetuning Task(性别或身份分类)以及模型权重的随机初始化。使用基于相关的图像掩蔽,我们发现面部分类的相关图通常在随机初始化中稳定,并且可以跨越芬太的任务概括。但是,在预处理的数据集中的概括显着较少,这表明ImageNet和VGGFace训练的模型以不同的方式示例信息,即使它们实现了相当高的分类性能。跨模型的相关性图的细粒度分析揭示了概括中的不对称图,这表明了选择参数的特定益处,并建议可以找到一组潜在的重要面部图像像素,以驱动跨卷积神经网络和任务的决策。最后,我们评估了模型的决策加权与人类相似性的度量,为解释人类和机器的面部识别决策提供了新的框架。
The increased use of convolutional neural networks for face recognition in science, governance, and broader society has created an acute need for methods that can show how these 'black box' decisions are made. To be interpretable and useful to humans, such a method should convey a model's learned classification strategy in a way that is robust to random initializations or spurious correlations in input data. To this end, we applied the decompositional pixel-wise attribution method of layer-wise relevance propagation (LRP) to resolve the decisions of several classes of VGG-16 models trained for face recognition. We then quantified how these relevance measures vary with and generalize across key model parameters, such as the pretraining dataset (ImageNet or VGGFace), the finetuning task (gender or identity classification), and random initializations of model weights. Using relevance-based image masking, we find that relevance maps for face classification prove generally stable across random initializations, and can generalize across finetuning tasks. However, there is markedly less generalization across pretraining datasets, indicating that ImageNet- and VGGFace-trained models sample face information differently even as they achieve comparably high classification performance. Fine-grained analyses of relevance maps across models revealed asymmetries in generalization that point to specific benefits of choice parameters, and suggest that it may be possible to find an underlying set of important face image pixels that drive decisions across convolutional neural networks and tasks. Finally, we evaluated model decision weighting against human measures of similarity, providing a novel framework for interpreting face recognition decisions across human and machine.