论文标题
伪偏平衡学习的胸部X射线分类
Pseudo Bias-Balanced Learning for Debiased Chest X-ray Classification
论文作者
论文摘要
经常报告深度学习模型,以从数据集偏见等快捷方式中学习。随着深度学习在现代医疗保健系统中起着越来越重要的作用,这是在医疗数据中与快捷方式学习以及发展公正和可信赖的模型非常需要。在本文中,我们研究了从偏见的训练数据中开发出偏见的胸部X射线诊断模型的问题,而不必确切知道偏见标签。我们从观察到偏差分布的不平衡是引起快捷方式学习的关键原因之一,并且模型比预期的功能更容易学习,而数据集偏见是模型的首选。基于这些观察结果,我们提出了一种新型算法,即伪平衡的学习,该学习首先通过广义跨熵损失捕获并预测每样本偏差标记,然后使用伪偏置标签和偏见平衡的软性软性功能来训练一个模型。我们使用各种数据集偏置情况构建了几个胸部X射线数据集,并通过广泛的实验证明了我们所提出的方法对其他最新方法进行了一致的改进。
Deep learning models were frequently reported to learn from shortcuts like dataset biases. As deep learning is playing an increasingly important role in the modern healthcare system, it is of great need to combat shortcut learning in medical data as well as develop unbiased and trustworthy models. In this paper, we study the problem of developing debiased chest X-ray diagnosis models from the biased training data without knowing exactly the bias labels. We start with the observations that the imbalance of bias distribution is one of the key reasons causing shortcut learning, and the dataset biases are preferred by the model if they were easier to be learned than the intended features. Based on these observations, we proposed a novel algorithm, pseudo bias-balanced learning, which first captures and predicts per-sample bias labels via generalized cross entropy loss and then trains a debiased model using pseudo bias labels and bias-balanced softmax function. We constructed several chest X-ray datasets with various dataset bias situations and demonstrated with extensive experiments that our proposed method achieved consistent improvements over other state-of-the-art approaches.