在RGB-D序列中查看对象的3D多对象跟踪

论文标题

在RGB-D序列中查看对象的3D多对象跟踪

Seeing Behind Objects for 3D Multi-Object Tracking in RGB-D Sequences

论文作者

Müller, Norman, Wong, Yu-Shiang, Mitra, Niloy J., Dai, Angela, Nießner, Matthias

论文摘要

RGB-D视频序列中的多对象跟踪是一个具有挑战性的问题，因为随着时间的推移，观点，运动和阻塞的结合结合了。我们观察到，具有完整的物体几何形状有助于其跟踪，因此建议共同推断物体的完整几何形状以及跟踪它们，以随着时间的推移而刚性地移动对象。我们的关键见解是，推断物体的完整几何形状有助于跟踪。通过幻觉看不见的物体区域，我们可以在同一实例之间获得其他对应关系，从而在外观发生强烈变化下也可以提供健壮的跟踪。从一系列RGB-D帧中，我们检测到每个帧中的对象，并学会预测其完整的对象几何形状以及密集的对应关系映射到规范空间中。这使我们能够在每个帧中的对象以及它们之间的对应关系中得出6DOF姿势，从而在RGB-D序列上提供了可靠的对象跟踪。合成和现实世界RGB-D数据的实验表明，我们在动态对象跟踪上实现了最新性能。此外，我们表明我们的对象完成极大地有助于跟踪，从而提高了$ 6.5 \％$的平均MOTA。

Multi-object tracking from RGB-D video sequences is a challenging problem due to the combination of changing viewpoints, motion, and occlusions over time. We observe that having the complete geometry of objects aids in their tracking, and thus propose to jointly infer the complete geometry of objects as well as track them, for rigidly moving objects over time. Our key insight is that inferring the complete geometry of the objects significantly helps in tracking. By hallucinating unseen regions of objects, we can obtain additional correspondences between the same instance, thus providing robust tracking even under strong change of appearance. From a sequence of RGB-D frames, we detect objects in each frame and learn to predict their complete object geometry as well as a dense correspondence mapping into a canonical space. This allows us to derive 6DoF poses for the objects in each frame, along with their correspondence between frames, providing robust object tracking across the RGB-D sequence. Experiments on both synthetic and real-world RGB-D data demonstrate that we achieve state-of-the-art performance on dynamic object tracking. Furthermore, we show that our object completion significantly helps tracking, providing an improvement of $6.5\%$ in mean MOTA.

下载PDF全文

下载文献需遵守相关版权规定

论文标题