论文标题
通过加固学习协调爬行
Coordinated Crawling via Reinforcement Learning
论文作者
论文摘要
直线爬行运动是细长,柔软的动物运动的原始且常见的模式。它需要协调的收缩,以沿着摩擦与环境相互作用的身体传播。我们提出了一种简单的方法,可以理解这些协调如何通过迭代过程的分割,柔软的轨道神经力学模型中出现,该过程可能具有生物学先例和技术相关性。使用简单的增强学习算法,我们表明,最初的全部神经耦合会收敛于简单的最近邻邻神经接线,这使得爬虫可以使用局部收缩的局部收缩浪潮在定性上与D. melanogaster幼虫中观察到的局部收缩浪潮前进,并在许多生物象征解决方案中使用。最终的解决方案是我们如何在奖励中加重步态正则化,并在速度和稳健性之间取决于本体感受噪声。总体而言,我们的结果将脑体环境三合会嵌入了学习方案中,它与软机器人技术相关,同时阐明了运动的演变和发展。
Rectilinear crawling locomotion is a primitive and common mode of locomotion in slender, soft-bodied animals. It requires coordinated contractions that propagate along a body that interacts frictionally with its environment. We propose a simple approach to understand how these coordinations arise in a neuromechanical model of a segmented, soft-bodied crawler via an iterative process that might have both biological antecedents and technological relevance. Using a simple reinforcement learning algorithm, we show that an initial all-to-all neural coupling converges to a simple nearest-neighbor neural wiring that allows the crawler to move forward using a localized wave of contraction that is qualitatively similar to what is observed in D. melanogaster larvae and used in many biomimetic solutions. The resulting solution is a function of how we weight gait regularization in the reward, with a tradeoff between speed and robustness to proprioceptive noise. Overall, our results, which embed the brain-body-environment triad in a learning scheme, has relevance for soft robotics while shedding light on the evolution and development of locomotion.