论文标题

表面上:快速无替代的黑盒攻击

SurFree: a fast surrogate-free black-box attack

论文作者

Maho, Thibault, Furon, Teddy, Merrer, Erwan Le

论文摘要

机器学习分类器非常容易逃避攻击。对抗性示例是稍微修改的输入,然后被错误分类,同时感知到其原始作品。最近几年,黑匣子攻击的查询数量显着减少,以提交目标分类器,以锻造对手。这尤其涉及基于黑框的设置,攻击者可以访问最高的预测概率:查询数量从数百万到不到一千不到一千。本文介绍了Surfree,这是一种几何方法,在最难设置中的查询量大幅度降低了:基于黑匣子的基于决策的攻击(只有TOP-1标签)。我们首先强调,该设置,HSJA,Qeba和Geoda的最新攻击都执行了昂贵的梯度替代估计。 Surfree提议绕过这些,而是​​通过针对分类器决策边界的几何特性的精确指示进行仔细的试验,以仔细的试验。我们在与先前的攻击与头等公民的询问数量进行正面比较之前,我们就激励这种几何方法。我们在低查询量(几百到千)下表现出更快的失真衰减,同时在更高的查询预算中保持竞争力。

Machine learning classifiers are critically prone to evasion attacks. Adversarial examples are slightly modified inputs that are then misclassified, while remaining perceptively close to their originals. Last couple of years have witnessed a striking decrease in the amount of queries a black box attack submits to the target classifier, in order to forge adversarials. This particularly concerns the black-box score-based setup, where the attacker has access to top predicted probabilites: the amount of queries went from to millions of to less than a thousand. This paper presents SurFree, a geometrical approach that achieves a similar drastic reduction in the amount of queries in the hardest setup: black box decision-based attacks (only the top-1 label is available). We first highlight that the most recent attacks in that setup, HSJA, QEBA and GeoDA all perform costly gradient surrogate estimations. SurFree proposes to bypass these, by instead focusing on careful trials along diverse directions, guided by precise indications of geometrical properties of the classifier decision boundaries. We motivate this geometric approach before performing a head-to-head comparison with previous attacks with the amount of queries as a first class citizen. We exhibit a faster distortion decay under low query amounts (few hundreds to a thousand), while remaining competitive at higher query budgets.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源