深度库：深度模仿的对比度无监督优先表示，以增强无人驾驶飞机的无地图导航

论文标题

深度库：深度模仿的对比度无监督优先表示，以增强无人驾驶飞机的无地图导航

Depth-CUPRL: Depth-Imaged Contrastive Unsupervised Prioritized Representations in Reinforcement Learning for Mapless Navigation of Unmanned Aerial Vehicles

论文作者

de Jesus, Junior Costa, Kich, Victor Augusto, Kolling, Alisson Henrique, Grando, Ricardo Bedin, Guerra, Rodrigo da Silva, Drews Jr, Paulo Lilles Jorge

论文摘要

强化学习（RL）通过原始像素成像和连续的控制任务在视频游戏中表现出了令人印象深刻的表现。但是，RL的性能很差，例如原始像素图像等高维观测。人们普遍认为，基于物理状态的RL策略（例如激光传感器测量值）比通过像素学习更有效的结果。这项工作提出了一种新方法，该方法从深度图估算中提取信息，以教授RL代理以执行无人飞机（UAV）的无地图导航。我们提出了深度模仿的对比度无监督的优先表示（DEPTH-CUPRL），该表示用优先的重播记忆估算图像的深度。我们使用RL和对比度学习的组合，基于图像的RL问题。从无人驾驶汽车（UAV）对结果的分析中可以得出结论，我们的深度加上方法在无地图导航能力中对决策有效，并且优于最先进的像素的方法。

Reinforcement Learning (RL) has presented an impressive performance in video games through raw pixel imaging and continuous control tasks. However, RL performs poorly with high-dimensional observations such as raw pixel images. It is generally accepted that physical state-based RL policies such as laser sensor measurements give a more sample-efficient result than learning by pixels. This work presents a new approach that extracts information from a depth map estimation to teach an RL agent to perform the mapless navigation of Unmanned Aerial Vehicle (UAV). We propose the Depth-Imaged Contrastive Unsupervised Prioritized Representations in Reinforcement Learning(Depth-CUPRL) that estimates the depth of images with a prioritized replay memory. We used a combination of RL and Contrastive Learning to lead with the problem of RL based on images. From the analysis of the results with Unmanned Aerial Vehicles (UAVs), it is possible to conclude that our Depth-CUPRL approach is effective for the decision-making and outperforms state-of-the-art pixel-based approaches in the mapless navigation capability.

下载PDF全文

下载文献需遵守相关版权规定

论文标题