论文标题
哈密顿的蒙特卡洛方法用于概率的对抗性攻击和学习
A Hamiltonian Monte Carlo Method for Probabilistic Adversarial Attack and Learning
论文作者
论文摘要
尽管深度卷积神经网络(CNN)在多个计算机视觉任务上表现出了出色的性能,但对逆转学习的研究表明,深层模型很容易受到对抗性示例的影响,这些示例是通过在输入图像中添加视觉上不可察觉的扰动而设计的。大多数现有的对抗攻击方法仅为输入创建一个单一的对抗示例,该示例只能瞥见对抗性示例的基本数据歧管。一个有吸引力的解决方案是探索对抗性示例的解决方案空间,并产生各种各样的解决方案,从而有可能改善现实世界系统的稳健性,并有助于防止严重的安全威胁和脆弱性。在本文中,我们提出了一种有效的方法,称为具有累积动量的汉密尔顿蒙特卡洛(HMCAM),旨在产生一系列对抗性例子。为了提高HMC的效率,我们提出了一个新的制度,以自动控制轨迹的长度,这使该算法可以沿着不同位置的搜索方向以自适应步骤大小移动。此外,我们重新审视了MCMC的观点的高度计算成本的高度计算成本,并设计了一种新的生成方法,称为对比对抗性训练(CAT),该方法处理对抗性示例的平衡分布,仅通过从少量修改标准对比度差异(CD)的少量修改中构建对对抗性示例的平衡分布(CD),并实现了贸易(CD)并实现贸易和准确的效率和准确的效率。对几个自然图像数据集和实际系统的定量和定性分析都证实了所提出算法的优越性。
Although deep convolutional neural networks (CNNs) have demonstrated remarkable performance on multiple computer vision tasks, researches on adversarial learning have shown that deep models are vulnerable to adversarial examples, which are crafted by adding visually imperceptible perturbations to the input images. Most of the existing adversarial attack methods only create a single adversarial example for the input, which just gives a glimpse of the underlying data manifold of adversarial examples. An attractive solution is to explore the solution space of the adversarial examples and generate a diverse bunch of them, which could potentially improve the robustness of real-world systems and help prevent severe security threats and vulnerabilities. In this paper, we present an effective method, called Hamiltonian Monte Carlo with Accumulated Momentum (HMCAM), aiming to generate a sequence of adversarial examples. To improve the efficiency of HMC, we propose a new regime to automatically control the length of trajectories, which allows the algorithm to move with adaptive step sizes along the search direction at different positions. Moreover, we revisit the reason for high computational cost of adversarial training under the view of MCMC and design a new generative method called Contrastive Adversarial Training (CAT), which approaches equilibrium distribution of adversarial examples with only few iterations by building from small modifications of the standard Contrastive Divergence (CD) and achieve a trade-off between efficiency and accuracy. Both quantitative and qualitative analysis on several natural image datasets and practical systems have confirmed the superiority of the proposed algorithm.