论文标题

一种统一的解释和增强对抗性转移性的方法

A Unified Approach to Interpreting and Boosting Adversarial Transferability

论文作者

Wang, Xin, Ren, Jie, Lin, Shuyun, Zhu, Xiangming, Wang, Yisen, Zhang, Quanshi

论文摘要

在本文中,我们使用对抗性扰动内部的相互作用来解释和增强对抗性转移性。我们发现并证明了对抗性转移性与对抗扰动内部的相互作用之间的负相关性。通过不同输入的不同DNN进一步验证负相关。此外,这种负相关性可以被视为了解当前可转移性方法的统一观点。为此,我们证明了一些增强可转移性的经典方法基本上是对抗扰动内部的相互作用。基于此,我们建议在攻击过程中直接惩罚相互作用,从而显着提高对抗性的转移性。

In this paper, we use the interaction inside adversarial perturbations to explain and boost the adversarial transferability. We discover and prove the negative correlation between the adversarial transferability and the interaction inside adversarial perturbations. The negative correlation is further verified through different DNNs with various inputs. Moreover, this negative correlation can be regarded as a unified perspective to understand current transferability-boosting methods. To this end, we prove that some classic methods of enhancing the transferability essentially decease interactions inside adversarial perturbations. Based on this, we propose to directly penalize interactions during the attacking process, which significantly improves the adversarial transferability.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源