通过加固学习协调爬行

论文标题

通过加固学习协调爬行

Coordinated Crawling via Reinforcement Learning

论文作者

Mishra, Shruti, van Rees, Wim M., Mahadevan, L.

论文摘要

直线爬行运动是细长，柔软的动物运动的原始且常见的模式。它需要协调的收缩，以沿着摩擦与环境相互作用的身体传播。我们提出了一种简单的方法，可以理解这些协调如何通过迭代过程的分割，柔软的轨道神经力学模型中出现，该过程可能具有生物学先例和技术相关性。使用简单的增强学习算法，我们表明，最初的全部神经耦合会收敛于简单的最近邻邻神经接线，这使得爬虫可以使用局部收缩的局部收缩浪潮在定性上与D. melanogaster幼虫中观察到的局部收缩浪潮前进，并在许多生物象征解决方案中使用。最终的解决方案是我们如何在奖励中加重步态正则化，并在速度和稳健性之间取决于本体感受噪声。总体而言，我们的结果将脑体环境三合会嵌入了学习方案中，它与软机器人技术相关，同时阐明了运动的演变和发展。

Rectilinear crawling locomotion is a primitive and common mode of locomotion in slender, soft-bodied animals. It requires coordinated contractions that propagate along a body that interacts frictionally with its environment. We propose a simple approach to understand how these coordinations arise in a neuromechanical model of a segmented, soft-bodied crawler via an iterative process that might have both biological antecedents and technological relevance. Using a simple reinforcement learning algorithm, we show that an initial all-to-all neural coupling converges to a simple nearest-neighbor neural wiring that allows the crawler to move forward using a localized wave of contraction that is qualitatively similar to what is observed in D. melanogaster larvae and used in many biomimetic solutions. The resulting solution is a function of how we weight gait regularization in the reward, with a tradeoff between speed and robustness to proprioceptive noise. Overall, our results, which embed the brain-body-environment triad in a learning scheme, has relevance for soft robotics while shedding light on the evolution and development of locomotion.

下载PDF全文

下载文献需遵守相关版权规定

论文标题