论文标题
对腿机器人的对抗联合攻击
Adversarial joint attacks on legged robots
论文作者
论文摘要
我们解决了对经过深入强化学习训练的腿部机器人关节的对手攻击。联合攻击的脆弱性会严重影响腿部机器人的安全性和鲁棒性。在这项研究中,我们证明了对执行器的扭矩控制信号的对抗性扰动可以显着降低奖励并导致机器人的行走不稳定。为了找到对抗性扭矩扰动,我们开发了黑盒对抗攻击,在那里,对手无法访问通过深度强化学习训练的神经网络。无论是深入强化学习的建筑和算法,黑匣子攻击都可以应用于腿部机器人。我们采用三种搜索方法来进行黑框对抗攻击:随机搜索,差异进化和数值梯度下降方法。在开放式健身室环境中,在使用四倍的机器人ANT-V2和双足机器人V2的实验中,我们发现差异进化可以有效地找到三种方法中最强的扭矩扰动。此外,我们意识到,四倍的机器人ANT-V2容易受到对抗扰动的影响,而双皮亚机器人人体V2对扰动是可靠的。因此,联合攻击可用于主动诊断机器人步行不稳定。
We address adversarial attacks on the actuators at the joints of legged robots trained by deep reinforcement learning. The vulnerability to the joint attacks can significantly impact the safety and robustness of legged robots. In this study, we demonstrate that the adversarial perturbations to the torque control signals of the actuators can significantly reduce the rewards and cause walking instability in robots. To find the adversarial torque perturbations, we develop black-box adversarial attacks, where, the adversary cannot access the neural networks trained by deep reinforcement learning. The black box attack can be applied to legged robots regardless of the architecture and algorithms of deep reinforcement learning. We employ three search methods for the black-box adversarial attacks: random search, differential evolution, and numerical gradient descent methods. In experiments with the quadruped robot Ant-v2 and the bipedal robot Humanoid-v2, in OpenAI Gym environments, we find that differential evolution can efficiently find the strongest torque perturbations among the three methods. In addition, we realize that the quadruped robot Ant-v2 is vulnerable to the adversarial perturbations, whereas the bipedal robot Humanoid-v2 is robust to the perturbations. Consequently, the joint attacks can be used for proactive diagnosis of robot walking instability.