论文标题
长期视觉图与异质GNN的稀疏
Long-term Visual Map Sparsification with Heterogeneous GNN
论文作者
论文摘要
我们解决了长期视觉定位的地图稀疏问题。对于MAP的稀疏性,通常使用的假设是构建前地图和后来的捕获的本地化查询是一致的。但是,在动态世界中可以很容易地违反这种假设。此外,随着新数据的积累,地图的大小会增长,从长远来看会导致大型数据开销。在本文中,我们旨在通过选择对未来本地化有价值的点来克服环境变化并同时减少地图规模。受图形神经网络(GNN)的最新进展的启发,我们提出了第一批建模SFM映射为异质图的工作,并通过GNN预测3D点重要性得分,这使我们能够直接利用SFM Map图中的丰富信息。提出了两个新颖的监督:1)根据培训查询选择未来本地化的宝贵点的数据拟合术语; 2)一个K覆盖术语,用于选择具有完整地图覆盖的稀疏点。该实验表明,我们的方法在稳定且可见的结构上选择了地图点,并且在本地化性能方面的表现优于基线。
We address the problem of map sparsification for long-term visual localization. For map sparsification, a commonly employed assumption is that the pre-build map and the later captured localization query are consistent. However, this assumption can be easily violated in the dynamic world. Additionally, the map size grows as new data accumulate through time, causing large data overhead in the long term. In this paper, we aim to overcome the environmental changes and reduce the map size at the same time by selecting points that are valuable to future localization. Inspired by the recent progress in Graph Neural Network(GNN), we propose the first work that models SfM maps as heterogeneous graphs and predicts 3D point importance scores with a GNN, which enables us to directly exploit the rich information in the SfM map graph. Two novel supervisions are proposed: 1) a data-fitting term for selecting valuable points to future localization based on training queries; 2) a K-Cover term for selecting sparse points with full map coverage. The experiments show that our method selected map points on stable and widely visible structures and outperformed baselines in localization performance.