Paper Title
Self-Competitive Neural Networks
Paper Authors
Paper Abstract
Deep Neural Networks (DNNs) have improved the accuracy of classification problems in many applications. One of the challenges in training a DNN is its need for a rich dataset to increase accuracy and avoid overfitting. One way to improve the generalization of DNNs is to augment the training data with newly synthesized adversarial samples. Recently, researchers have worked extensively on methods for data augmentation. In this paper, we generate adversarial samples to refine the Domains of Attraction (DoAs) of each class. In this approach, at each stage, we use the model learned from the primary data and the adversarial data generated up to that stage to manipulate the primary data so that the resulting samples look hard to the DNN. The DNN is then retrained on the augmented data, and it again generates adversarial data that are hard for itself to predict. Because the DNN tries to improve its accuracy by competing with itself (generating hard samples and then learning them), the technique is called a Self-Competitive Neural Network (SCNN). To generate such samples, we pose the problem as an optimization task in which the network weights are fixed, and we use a gradient-descent-based method to synthesize adversarial samples that lie on the boundary between their true labels and the nearest wrong labels. Our experimental results show that data augmentation using SCNNs can significantly increase the accuracy of the original network. For example, the accuracy of a CNN trained on a limited set of 1,000 MNIST training samples improves from 94.26% to 98.25%.
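The abstract describes the sample-synthesis step as an optimization over the input with the network weights held fixed, driving each sample toward the boundary between its true class and the nearest wrong class. The sketch below illustrates one way that step could look in PyTorch; it is not the paper's exact formulation. The squared-logit-margin objective, the step count, and the step size are assumptions made for illustration, and model, x, and y_true are placeholder names for a classifier returning logits, an input batch, and its integer labels.

```python
import torch

def synthesize_boundary_sample(model, x, y_true, steps=50, lr=0.05):
    """Perturb inputs toward the decision boundary between the true class
    and the nearest wrong class, with the network weights frozen.
    Illustrative sketch only; hyperparameters are assumed."""
    model.eval()
    for p in model.parameters():
        p.requires_grad_(False)  # only the input is optimized

    x_adv = x.clone().detach().requires_grad_(True)
    optimizer = torch.optim.SGD([x_adv], lr=lr)

    for _ in range(steps):
        logits = model(x_adv)

        # Nearest wrong label: the highest-scoring class other than the true one.
        masked = logits.scatter(1, y_true.view(-1, 1), float("-inf"))
        y_wrong = masked.argmax(dim=1)

        # Drive the true-class and nearest-wrong-class logits toward equality,
        # pushing x_adv onto the boundary between the two classes.
        margin = (logits.gather(1, y_true.view(-1, 1))
                  - logits.gather(1, y_wrong.view(-1, 1)))
        loss = (margin ** 2).mean()

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

    return x_adv.detach()
```

In the self-competitive loop sketched by the abstract, samples produced this way would keep their original labels, be added to the training set, and the network would be retrained on the augmented data before the next round of synthesis.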