论文标题
3-D点云的流对象检测
Streaming Object Detection for 3-D Point Clouds
论文作者
论文摘要
自动驾驶汽车在动态环境中运行,在这种环境中,车辆能够感知和反应的速度会影响系统的安全性和有效性。 LIDAR提供了一种突出的感官方式,可为许多现有的感知系统提供信息,包括对象检测,细分,运动估计和动作识别。基于点云数据的感知系统的延迟可以由完整的旋转扫描(例如100 ms)的时间主导。这种内置的数据捕获延迟是人造的,并且基于将点云视为相机图像,以利用相机启发的架构。但是,与摄像机传感器不同,大多数激光雷达点云数据是本地数据源,在该数据源中,基于激光束的进动,激光反射是依次记录的。在这项工作中,我们探讨了如何构建一个消除这种人工潜伏约束的对象检测器,而是在本机流数据上操作以大大减少潜伏期。这种方法具有额外的好处,即通过在获取时间进行扫描时间传播计算来减少推理硬件的峰值计算负担。我们通过一系列对传统检测元结构结构的修改,基于顺序建模的流探测系统家族。我们强调,如果不是最先进的,传统的非流传输检测系统,则该模型如何实现竞争性,同时实现了显着的延迟增长(例如,峰值潜伏期的1/15'th-1/3'rd)。我们的结果表明,在其本机流式配方中使用LiDAR数据在自动驾驶对象检测中提供了几个优点 - 我们希望这对于任何使延迟最小化对于安全有效的操作至关重要的激光雷达感知系统都会有用。
Autonomous vehicles operate in a dynamic environment, where the speed with which a vehicle can perceive and react impacts the safety and efficacy of the system. LiDAR provides a prominent sensory modality that informs many existing perceptual systems including object detection, segmentation, motion estimation, and action recognition. The latency for perceptual systems based on point cloud data can be dominated by the amount of time for a complete rotational scan (e.g. 100 ms). This built-in data capture latency is artificial, and based on treating the point cloud as a camera image in order to leverage camera-inspired architectures. However, unlike camera sensors, most LiDAR point cloud data is natively a streaming data source in which laser reflections are sequentially recorded based on the precession of the laser beam. In this work, we explore how to build an object detector that removes this artificial latency constraint, and instead operates on native streaming data in order to significantly reduce latency. This approach has the added benefit of reducing the peak computational burden on inference hardware by spreading the computation over the acquisition time for a scan. We demonstrate a family of streaming detection systems based on sequential modeling through a series of modifications to the traditional detection meta-architecture. We highlight how this model may achieve competitive if not superior predictive performance with state-of-the-art, traditional non-streaming detection systems while achieving significant latency gains (e.g. 1/15'th - 1/3'rd of peak latency). Our results show that operating on LiDAR data in its native streaming formulation offers several advantages for self driving object detection -- advantages that we hope will be useful for any LiDAR perception system where minimizing latency is critical for safe and efficient operation.