论文标题
用LiDar作为相机传感器分析通用的深度学习检测和分割模型
Analyzing General-Purpose Deep-Learning Detection and Segmentation Models with Images from a Lidar as a Camera Sensor
论文作者
论文摘要
在过去的十年中,机器人感知算法从深度学习的快速进步(DL)中受益匪浅。的确,不同商业和研究平台的大量自治堆栈依赖于DL来征求情境意识,尤其是视力传感器。这项工作探讨了通用DL感知算法的潜力,特别是检测和分割神经网络,用于处理高级激光雷达传感器的图像样输出。据我们所知,与其处理三维点云数据,而是第一份专注于使用LIDAR传感器获得360 \ textDegree视野的低分辨率图像,通过编码图像像素中的深度,反射率或近红外光线来获得。我们表明,通过足够的预处理,通用的DL模型可以处理这些图像,从而在视觉传感器呈现固有局限性的环境条件下为它们的使用打开了大门。我们对各种神经网络体系结构的性能进行定性和定量分析。我们认为,使用为相机构建的DL模型,由于与基于点云的感知相比,可用性和成熟度更高,因此具有显着优势。
Over the last decade, robotic perception algorithms have significantly benefited from the rapid advances in deep learning (DL). Indeed, a significant amount of the autonomy stack of different commercial and research platforms relies on DL for situational awareness, especially vision sensors. This work explores the potential of general-purpose DL perception algorithms, specifically detection and segmentation neural networks, for processing image-like outputs of advanced lidar sensors. Rather than processing the three-dimensional point cloud data, this is, to the best of our knowledge, the first work to focus on low-resolution images with 360\textdegree field of view obtained with lidar sensors by encoding either depth, reflectivity, or near-infrared light in the image pixels. We show that with adequate preprocessing, general-purpose DL models can process these images, opening the door to their usage in environmental conditions where vision sensors present inherent limitations. We provide both a qualitative and quantitative analysis of the performance of a variety of neural network architectures. We believe that using DL models built for visual cameras offers significant advantages due to the much wider availability and maturity compared to point cloud-based perception.