论文标题
对偏射式学习的现实评估
Realistic Evaluation of Transductive Few-Shot Learning
论文作者
论文摘要
转导推断被广泛用于几次学习中,因为它利用了未标记的一些射击任务的统计数据,通常会产生比其电感对应物更好的性能。当前的几杆基准测试在推理时使用完美的类平衡任务。我们认为这种人为的规律性是不现实的,因为它假定测试样品的边际标签概率已知并固定在均匀分布上。实际上,在现实的情况下,未标记的查询集带有任意和未知的标签边缘。我们在推理时介绍和研究任意类分布的效果,从而删除了级别的平衡伪像。具体而言,我们将类的边缘概率模拟为Dirichlet分布的随机变量,该变量在单纯形中产生了原则性和现实的采样。这利用当前的几杆基准测试,通过任意类别分布来构建测试任务。我们在3个广泛使用的数据集上评估了实验性的最先进的转导方法,并令人惊讶地观察出大量的性能下降,甚至在某些情况下低于电感方法。此外,我们提出了基于$α$ diverences的相互信息损失的概括,该损失可以有效地处理类分布的变化。从经验上讲,我们表明我们的托管$α$ - 差优化优化优于几个数据集,模型和少量设置的最先进方法。我们的代码在https://github.com/oveilleux/realistic_transductive_few_shot上公开获取。
Transductive inference is widely used in few-shot learning, as it leverages the statistics of the unlabeled query set of a few-shot task, typically yielding substantially better performances than its inductive counterpart. The current few-shot benchmarks use perfectly class-balanced tasks at inference. We argue that such an artificial regularity is unrealistic, as it assumes that the marginal label probability of the testing samples is known and fixed to the uniform distribution. In fact, in realistic scenarios, the unlabeled query sets come with arbitrary and unknown label marginals. We introduce and study the effect of arbitrary class distributions within the query sets of few-shot tasks at inference, removing the class-balance artefact. Specifically, we model the marginal probabilities of the classes as Dirichlet-distributed random variables, which yields a principled and realistic sampling within the simplex. This leverages the current few-shot benchmarks, building testing tasks with arbitrary class distributions. We evaluate experimentally state-of-the-art transductive methods over 3 widely used data sets, and observe, surprisingly, substantial performance drops, even below inductive methods in some cases. Furthermore, we propose a generalization of the mutual-information loss, based on $α$-divergences, which can handle effectively class-distribution variations. Empirically, we show that our transductive $α$-divergence optimization outperforms state-of-the-art methods across several data sets, models and few-shot settings. Our code is publicly available at https://github.com/oveilleux/Realistic_Transductive_Few_Shot.