论文标题
通过知识继承有效的少量射击对象检测
Efficient Few-Shot Object Detection via Knowledge Inheritance
论文作者
论文摘要
旨在学习可以通过稀缺训练样本适应看不见的任务的通用检测器的几个射击对象检测(FSOD)最近见证了一致的改进。但是,大多数现有方法忽略了效率问题,例如,计算复杂性高和缓慢的适应速度。值得注意的是,由于嵌入AI的新兴趋势,效率已成为越来越重要的评估指标。为此,我们提出了一个没有计算增量的有效的预处理转移框架(PTF)基线,这与先前的最新方法(SOTA)方法可相当。在此基线上,我们设计了一个名为知识继承(KI)的初始化器,以可靠地初始化盒子分类器的新颖权重,从而有效地促进了知识传递过程并提高了适应速度。在KI初始化器中,我们提出了一种自适应长度重新缩放(ALR)策略,以减轻预测的新型重量与预验证的基本重量之间的矢量长度不一致。最后,我们的方法不仅在三个公共基准(即Pascal VOC,可可和LVI)上实现了SOTA结果,而且在几次转移期间,针对可可/LVIS基准测试的其他方法的适应性速度更高,效率很高。据我们所知,这是第一项考虑FSOD效率问题的工作。我们希望激励朝着强大而有效的少量射击技术开发发展趋势。这些代码可在https://github.com/ze-yang/fefticed-fsod上公开获取。
Few-shot object detection (FSOD), which aims at learning a generic detector that can adapt to unseen tasks with scarce training samples, has witnessed consistent improvement recently. However, most existing methods ignore the efficiency issues, e.g., high computational complexity and slow adaptation speed. Notably, efficiency has become an increasingly important evaluation metric for few-shot techniques due to an emerging trend toward embedded AI. To this end, we present an efficient pretrain-transfer framework (PTF) baseline with no computational increment, which achieves comparable results with previous state-of-the-art (SOTA) methods. Upon this baseline, we devise an initializer named knowledge inheritance (KI) to reliably initialize the novel weights for the box classifier, which effectively facilitates the knowledge transfer process and boosts the adaptation speed. Within the KI initializer, we propose an adaptive length re-scaling (ALR) strategy to alleviate the vector length inconsistency between the predicted novel weights and the pretrained base weights. Finally, our approach not only achieves the SOTA results across three public benchmarks, i.e., PASCAL VOC, COCO and LVIS, but also exhibits high efficiency with 1.8-100x faster adaptation speed against the other methods on COCO/LVIS benchmark during few-shot transfer. To our best knowledge, this is the first work to consider the efficiency problem in FSOD. We hope to motivate a trend toward powerful yet efficient few-shot technique development. The codes are publicly available at https://github.com/Ze-Yang/Efficient-FSOD.