在挑战地形上学习四倍的运动

论文标题

在挑战地形上学习四倍的运动

Learning Quadrupedal Locomotion over Challenging Terrain

论文作者

Lee, Joonho, Hwangbo, Jemin, Wellhausen, Lorenz, Koltun, Vladlen, Hutter, Marco

论文摘要

四足动物可以使用我们地球上一些最具挑战性的环境，但自动驾驶机器仍然无法触及。腿部运动可以大大扩展机器人技术的操作领域。但是，用于腿部运动的常规控制器是基于明确触发运动原语和反射执行的精细状态机。这些设计的复杂性已经升级，而动物运动的一般性和鲁棒性则没有。在这里，我们为在挑战性的自然环境中提供了一个非常健壮的控制器，用于腿部运动。我们提出了一种新颖的解决方案，可以将本体感受反馈纳入运动控制中，并证明了从模拟到自然环境的显着零弹性概括。通过模拟中的增强学习来训练控制器。它基于作用于本体感受信号流的神经网络。训练有素的控制器已将两代人的Anymal机器人带到了各种自然环境，这些自然环境无法实现，这些自然环境超出了腿部运动的先前发表工作。控制器在训练期间从未遇到过的条件下保留其稳健性：诸如泥土和雪等可变形地形，诸如瓦砾之类的动态立足点以及地面障碍，例如厚厚的植被和涌入水。提出的工作为机器人技术打开了新的边界，并表明可以通过在更简单的域中训练自然环境中的根本鲁棒性。

Some of the most challenging environments on our planet are accessible to quadrupedal animals but remain out of reach for autonomous machines. Legged locomotion can dramatically expand the operational domains of robotics. However, conventional controllers for legged locomotion are based on elaborate state machines that explicitly trigger the execution of motion primitives and reflexes. These designs have escalated in complexity while falling short of the generality and robustness of animal locomotion. Here we present a radically robust controller for legged locomotion in challenging natural environments. We present a novel solution to incorporating proprioceptive feedback in locomotion control and demonstrate remarkable zero-shot generalization from simulation to natural environments. The controller is trained by reinforcement learning in simulation. It is based on a neural network that acts on a stream of proprioceptive signals. The trained controller has taken two generations of quadrupedal ANYmal robots to a variety of natural environments that are beyond the reach of prior published work in legged locomotion. The controller retains its robustness under conditions that have never been encountered during training: deformable terrain such as mud and snow, dynamic footholds such as rubble, and overground impediments such as thick vegetation and gushing water. The presented work opens new frontiers for robotics and indicates that radical robustness in natural environments can be achieved by training in much simpler domains.

下载PDF全文

下载文献需遵守相关版权规定

论文标题