论文标题
混合配方检索的跨语性改编
Cross-lingual Adaptation for Recipe Retrieval with Mixup
论文作者
论文摘要
近年来,由于大规模配对数据进行培训,近年来,跨模式食谱检索引起了研究的关注。然而,如果不是不可能,获得涵盖大部分的监督学习美食的足够食谱对图。通过将知识从数据富的美食转移到数据筛选美食中,域的适应性阐明了这个实际问题。然而,现有作品假设源和目标域中的食谱主要来自同一美食,并以相同的语言书写。本文研究了图像到配件检索的无监督域的适应性,其中源和目标域中的食谱以不同的语言为单位。此外,只有食谱可用于目标域中的培训。提出了一种新颖的食谱混合方法来学习两个域之间的可转移嵌入特征。具体而言,食谱混合物会通过离散地交换源和目标食谱之间的截面来产生混合的食谱以形成中间域。为了弥合域间隙,提出了配方混合损失,以强制执行中间域以定位在配方嵌入空间中的源和目标域之间的最短大地路径。通过将食谱1M数据集用作源域(英语)和Vireo-FoodTransfer数据集作为目标域(中文),经验实验验证了在图像到组件回归的背景下,配方混合物对跨语义适应的有效性。
Cross-modal recipe retrieval has attracted research attention in recent years, thanks to the availability of large-scale paired data for training. Nevertheless, obtaining adequate recipe-image pairs covering the majority of cuisines for supervised learning is difficult if not impossible. By transferring knowledge learnt from a data-rich cuisine to a data-scarce cuisine, domain adaptation sheds light on this practical problem. Nevertheless, existing works assume recipes in source and target domains are mostly originated from the same cuisine and written in the same language. This paper studies unsupervised domain adaptation for image-to-recipe retrieval, where recipes in source and target domains are in different languages. Moreover, only recipes are available for training in the target domain. A novel recipe mixup method is proposed to learn transferable embedding features between the two domains. Specifically, recipe mixup produces mixed recipes to form an intermediate domain by discretely exchanging the section(s) between source and target recipes. To bridge the domain gap, recipe mixup loss is proposed to enforce the intermediate domain to locate in the shortest geodesic path between source and target domains in the recipe embedding space. By using Recipe 1M dataset as source domain (English) and Vireo-FoodTransfer dataset as target domain (Chinese), empirical experiments verify the effectiveness of recipe mixup for cross-lingual adaptation in the context of image-to-recipe retrieval.