论文标题
学习不变特征表示,以改善跨胸部X射线数据集的概括
Learning Invariant Feature Representation to Improve Generalization across Chest X-ray Datasets
论文作者
论文摘要
胸部射线照相是医院筛查和诊断的最常见医学图像检查。在入门级放射科医生水平上对胸部X射线的自动解释可以极大地使工作优先级,并有助于分析较大的人群。随后,已经提出了一些数据集和基于深度学习的解决方案,以根据胸部X射线图像识别疾病。但是,这些方法被证明很容易受到数据源的变化:在与培训数据相同的数据集上测试时,深度学习模型的性能良好,当从其他源在数据集上测试时,它们的性能开始差。在这项工作中,我们通过强迫网络学习源不变表示形式来解决对新来源的概括挑战。通过采用对抗性培训策略,我们表明网络可以被迫学习源不变的表示。通过多源胸部X射线数据集的肺炎分类实验,我们表明该算法有助于提高X射线数据集的新源上的分类精度。
Chest radiography is the most common medical image examination for screening and diagnosis in hospitals. Automatic interpretation of chest X-rays at the level of an entry-level radiologist can greatly benefit work prioritization and assist in analyzing a larger population. Subsequently, several datasets and deep learning-based solutions have been proposed to identify diseases based on chest X-ray images. However, these methods are shown to be vulnerable to shift in the source of data: a deep learning model performing well when tested on the same dataset as training data, starts to perform poorly when it is tested on a dataset from a different source. In this work, we address this challenge of generalization to a new source by forcing the network to learn a source-invariant representation. By employing an adversarial training strategy, we show that a network can be forced to learn a source-invariant representation. Through pneumonia-classification experiments on multi-source chest X-ray datasets, we show that this algorithm helps in improving classification accuracy on a new source of X-ray dataset.