增强具有对比匹配和加权焦点损失的多视图立体声

论文标题

增强具有对比匹配和加权焦点损失的多视图立体声

Enhancing Multi-view Stereo with Contrastive Matching and Weighted Focal Loss

论文作者

Ding, Yikang, Li, Zhenyang, Huang, Dihe, Li, Zhiheng, Zhang, Kai

论文摘要

基于学习的多视图立体声（MVS）方法取得了令人印象深刻的进步，并且近年来超过了传统方法。但是，它们的准确性和完整性仍在挣扎。在本文中，我们提出了一种新方法，以增强受对比度学习和功能匹配启发的现有网络的性能。首先，我们提出了一个对比匹配损失（CML），该损失将正确的匹配点视为正样样本，并将其他点视为阴性样本，并根据特征的相似性计算对比度损失。我们进一步提出了一个加权局灶性损失（WFL），以提高分类能力，从而削弱了根据预测的置信度，在不重要的区域中低符合像素对损失的贡献。在DTU，坦克和寺庙和混合物数据集上进行的广泛实验显示，我们的方法可实现最先进的性能，并且对基线网络进行了显着改善。

Learning-based multi-view stereo (MVS) methods have made impressive progress and surpassed traditional methods in recent years. However, their accuracy and completeness are still struggling. In this paper, we propose a new method to enhance the performance of existing networks inspired by contrastive learning and feature matching. First, we propose a Contrast Matching Loss (CML), which treats the correct matching points in depth-dimension as positive sample and other points as negative samples, and computes the contrastive loss based on the similarity of features. We further propose a Weighted Focal Loss (WFL) for better classification capability, which weakens the contribution of low-confidence pixels in unimportant areas to the loss according to predicted confidence. Extensive experiments performed on DTU, Tanks and Temples and BlendedMVS datasets show our method achieves state-of-the-art performance and significant improvement over baseline network.

下载PDF全文

下载文献需遵守相关版权规定

论文标题