论文标题
关于深度度量学习的背景偏见
On Background Bias in Deep Metric Learning
论文作者
论文摘要
深度度量学习训练神经网络以将输入图像映射到较低维的嵌入空间,以使相似的图像比不同的图像更接近。当用于项目检索时,使用训练有素的模型嵌入查询图像,并且存储它们各自嵌入的数据库中的最接近的项目将作为查询最相似的项目返回。尤其是在产品检索中,用户通过拍摄照片来搜索某个产品,图像背景通常并不重要,因此不应影响嵌入过程。理想情况下,检索过程始终返回拍照对象的合适项目,无论拍摄照片如何。我们发现,深度度量学习网络容易出现所谓的背景偏见,在推断期间改变图像背景时,可以导致检索性能的严重下降。我们还表明,用随机背景图像在训练过程中替换图像的背景可以减轻此问题。由于我们使用自动背景删除方法进行此背景替换,因此在推理时间保持不变时,不需要其他手动标记工作和模型更改。我们介绍了一个新的评估指标的定性和定量分析,确认接受替换背景的模型更多地参与了图像中的主要对象,从而使项目检索系统受益。
Deep Metric Learning trains a neural network to map input images to a lower-dimensional embedding space such that similar images are closer together than dissimilar images. When used for item retrieval, a query image is embedded using the trained model and the closest items from a database storing their respective embeddings are returned as the most similar items for the query. Especially in product retrieval, where a user searches for a certain product by taking a photo of it, the image background is usually not important and thus should not influence the embedding process. Ideally, the retrieval process always returns fitting items for the photographed object, regardless of the environment the photo was taken in. In this paper, we analyze the influence of the image background on Deep Metric Learning models by utilizing five common loss functions and three common datasets. We find that Deep Metric Learning networks are prone to so-called background bias, which can lead to a severe decrease in retrieval performance when changing the image background during inference. We also show that replacing the background of images during training with random background images alleviates this issue. Since we use an automatic background removal method to do this background replacement, no additional manual labeling work and model changes are required while inference time stays the same. Qualitative and quantitative analyses, for which we introduce a new evaluation metric, confirm that models trained with replaced backgrounds attend more to the main object in the image, benefitting item retrieval systems.