论文标题
双手操作和通过SIM真实加强学习的依恋
Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement Learning
论文作者
论文摘要
机器人操作中的大多数成功都仅限于单臂机器人,这将可解决的任务的范围限制为拾取,插入和物体重新排列。相比之下,双臂机器人平台解锁了可以解决的各种各样的问题,例如洗衣折叠和执行烹饪技巧。但是,开发用于多臂机器人的控制器会因许多独特的挑战而复杂化,例如需要协调的双人行为以及机器人之间避免碰撞。鉴于这些挑战,在这项工作中,我们研究了如何使用对模拟训练的强化学习(RL)来解决双重任务,以便可以在实际的机器人平台上执行所得的政策。我们的RL方法由于使用实时(4Hz)关节控制并直接将未经过滤的观测值传递到神经网络策略而导致了重大简化。我们还广泛讨论了对模拟环境的修改,从而导致对RL政策的有效培训。除了设计控制算法外,一个关键的挑战是如何为强调双重协调的双手机器人设计公平的评估任务,同时消除了正交的复杂因素,例如高级感知。在这项工作中,我们设计了一个连接任务,其目的是使两个机器人臂拾起并连接两个具有磁性连接点的块。我们使用两个带有磁性附件的XARM6机器人和3D打印块来验证我们的方法,并发现我们的系统在拾取块时具有100%的成功率,而连接任务的成功率为65%。
Most successes in robotic manipulation have been restricted to single-arm robots, which limits the range of solvable tasks to pick-and-place, insertion, and objects rearrangement. In contrast, dual and multi arm robot platforms unlock a rich diversity of problems that can be tackled, such as laundry folding and executing cooking skills. However, developing controllers for multi-arm robots is complexified by a number of unique challenges, such as the need for coordinated bimanual behaviors, and collision avoidance amongst robots. Given these challenges, in this work we study how to solve bi-manual tasks using reinforcement learning (RL) trained in simulation, such that the resulting policies can be executed on real robotic platforms. Our RL approach results in significant simplifications due to using real-time (4Hz) joint-space control and directly passing unfiltered observations to neural networks policies. We also extensively discuss modifications to our simulated environment which lead to effective training of RL policies. In addition to designing control algorithms, a key challenge is how to design fair evaluation tasks for bi-manual robots that stress bimanual coordination, while removing orthogonal complicating factors such as high-level perception. In this work, we design a Connect Task, where the aim is for two robot arms to pick up and attach two blocks with magnetic connection points. We validate our approach with two xArm6 robots and 3D printed blocks with magnetic attachments, and find that our system has 100% success rate at picking up blocks, and 65% success rate at the Connect Task.