论文标题
自动开放世界可靠性评估
Automatic Open-World Reliability Assessment
论文作者
论文摘要
开放世界中的图像分类必须处理分布(OOD)图像。系统应理想地拒绝OOD图像,否则它们将在已知类别上绘制并降低可靠性。使用可以拒绝OOD输入的开放集分类器可以有所帮助。但是,开放式分类器的最佳精度取决于OOD数据的频率。因此,对于标准或开放设定的分类器,重要的是要确定世界何时改变和增加OOD输入将导致系统可靠性降低。但是,在操作过程中,由于没有标签,我们无法直接评估准确性。因此,这些分类器的可靠性评估必须由人类运营商完成,因为网络不是100%准确的,因此更为复杂,因此可以预期一些故障。为了自动化此过程,在此,我们将开放世界识别可靠性问题正式化,并提出了多个自动可靠性评估策略,以仅使用报告的分数/概率数据的分布来解决这个新问题。分布算法可以应用于具有SoftMax的经典分类器以及开放世界的极值机器(EVM),以提供自动的可靠性评估。我们表明,所有新算法都使用SoftMax的平均值明显胜过检测。
Image classification in the open-world must handle out-of-distribution (OOD) images. Systems should ideally reject OOD images, or they will map atop of known classes and reduce reliability. Using open-set classifiers that can reject OOD inputs can help. However, optimal accuracy of open-set classifiers depend on the frequency of OOD data. Thus, for either standard or open-set classifiers, it is important to be able to determine when the world changes and increasing OOD inputs will result in reduced system reliability. However, during operations, we cannot directly assess accuracy as there are no labels. Thus, the reliability assessment of these classifiers must be done by human operators, made more complex because networks are not 100% accurate, so some failures are to be expected. To automate this process, herein, we formalize the open-world recognition reliability problem and propose multiple automatic reliability assessment policies to address this new problem using only the distribution of reported scores/probability data. The distributional algorithms can be applied to both classic classifiers with SoftMax as well as the open-world Extreme Value Machine (EVM) to provide automated reliability assessment. We show that all of the new algorithms significantly outperform detection using the mean of SoftMax.