论文标题
通过拒绝论证的镜头主动学习算法
Active learning algorithm through the lens of rejection arguments
论文作者
论文摘要
主动学习是机器学习的范式,旨在减少培训分类器所需的标记数据量。它的总体原理是依次选择最有用的数据点,这相当于确定输入空间区域的不确定性。主要的挑战在于建立一个在计算上有效且具有吸引力的理论属性的程序;当前大多数方法仅满足一种或另一种。在本文中,我们以一种新颖的方式将分类用于拒绝,以估计不确定的区域。我们提供一种积极的学习算法,并在经典假设下证明其理论上的好处。除了理论结果外,还对合成和非合成数据集进行了数值实验。这些实验提供了经验证据,表明在我们的主动学习算法中使用排斥论证是有益的,并且可以在各种统计情况下进行良好的性能。
Active learning is a paradigm of machine learning which aims at reducing the amount of labeled data needed to train a classifier. Its overall principle is to sequentially select the most informative data points, which amounts to determining the uncertainty of regions of the input space. The main challenge lies in building a procedure that is computationally efficient and that offers appealing theoretical properties; most of the current methods satisfy only one or the other. In this paper, we use the classification with rejection in a novel way to estimate the uncertain regions. We provide an active learning algorithm and prove its theoretical benefits under classical assumptions. In addition to the theoretical results, numerical experiments have been carried out on synthetic and non-synthetic datasets. These experiments provide empirical evidence that the use of rejection arguments in our active learning algorithm is beneficial and allows good performance in various statistical situations.