论文标题
多级标签通过损失分解和质心估计的噪声学习
Multi-class Label Noise Learning via Loss Decomposition and Centroid Estimation
论文作者
论文摘要
在实际情况下,许多大规模数据集通常包含标签不正确的标签,即嘈杂的标签,这可能会混淆模型培训并导致性能退化。为了克服这个问题,标签噪声学习(LNL)最近引起了很多关注,并且已经提出了各种方法将无偏见的风险估计器设计到无噪声数据集中,以打击此类标签噪声。其中,基于损失分解和质心估计(LDCE)的作品的趋势显示出非常有希望的表现。但是,基于LDCE的现有LNL方法仅是为二进制分类而设计的,并且它们不能直接扩展到多类情况。在本文中,我们为LDCE提出了一种新型的多级鲁棒学习方法,该方法称为“ MC-LDCE”。具体而言,我们将通常采用的损失(例如平方平方损失)函数分解为依赖标签的部分和独立的标签部分,其中只有前者受标签噪声的影响。此外,通过定义一种新的数据质心形式,我们将依赖标签依赖性部分的恢复问题转换为质心估计问题。最后,通过对观察到的嘈杂集进行批判性研究清洁数据质心的数学期望,可以估算质心,这有助于为多级学习构建无偏见的风险估计器。所提出的MC-LDCE方法是一般的,并且适用于分类模型的不同类型(即线性和非线性)。五个公共数据集的实验结果表明,在解决多级标签噪声问题方面,提出的MC-LDCE与其他代表性LNL方法的优越性。
In real-world scenarios, many large-scale datasets often contain inaccurate labels, i.e., noisy labels, which may confuse model training and lead to performance degradation. To overcome this issue, Label Noise Learning (LNL) has recently attracted much attention, and various methods have been proposed to design an unbiased risk estimator to the noise-free dataset to combat such label noise. Among them, a trend of works based on Loss Decomposition and Centroid Estimation (LDCE) has shown very promising performance. However, existing LNL methods based on LDCE are only designed for binary classification, and they are not directly extendable to multi-class situations. In this paper, we propose a novel multi-class robust learning method for LDCE, which is termed "MC-LDCE". Specifically, we decompose the commonly adopted loss (e.g., mean squared loss) function into a label-dependent part and a label-independent part, in which only the former is influenced by label noise. Further, by defining a new form of data centroid, we transform the recovery problem of a label-dependent part to a centroid estimation problem. Finally, by critically examining the mathematical expectation of clean data centroid given the observed noisy set, the centroid can be estimated which helps to build an unbiased risk estimator for multi-class learning. The proposed MC-LDCE method is general and applicable to different types (i.e., linear and nonlinear) of classification models. The experimental results on five public datasets demonstrate the superiority of the proposed MC-LDCE against other representative LNL methods in tackling multi-class label noise problem.