论文标题
半监督肺癌筛查的深度期望最大化
Deep Expectation-Maximization for Semi-Supervised Lung Cancer Screening
论文作者
论文摘要
我们提出了一种半监督算法,用于肺癌筛查,其中3D卷积神经网络(CNN)使用预期最大化(EM)元词素进行训练。半监督学习允许将较小的标记数据集与未标记的数据集结合使用,以提供更大,更多样化的培训样本。 EM允许该算法同时计算CNN训练系数的最大似然估计以及未标记的训练集的标签,该标签被定义为潜在可变空间。我们通过国家肺肺筛查试验(NLST)数据集评估了CNN的半监督EM算法的模型性能。我们的结果表明,半监督的EM算法极大地提高了跨域肺癌筛查的分类精度,尽管结果比完全监督的方法低于完全监督的方法,从而获得了无处可比样品的其他标记数据。因此,我们证明了半监督的EM是使用3D CNN提高肺癌筛查模型准确性的宝贵技术。
We present a semi-supervised algorithm for lung cancer screening in which a 3D Convolutional Neural Network (CNN) is trained using the Expectation-Maximization (EM) meta-algorithm. Semi-supervised learning allows a smaller labelled data-set to be combined with an unlabeled data-set in order to provide a larger and more diverse training sample. EM allows the algorithm to simultaneously calculate a maximum likelihood estimate of the CNN training coefficients along with the labels for the unlabeled training set which are defined as a latent variable space. We evaluate the model performance of the Semi-Supervised EM algorithm for CNNs through cross-domain training of the Kaggle Data Science Bowl 2017 (Kaggle17) data-set with the National Lung Screening Trial (NLST) data-set. Our results show that the Semi-Supervised EM algorithm greatly improves the classification accuracy of the cross-domain lung cancer screening, although results are lower than a fully supervised approach with the advantage of additional labelled data from the unsupervised sample. As such, we demonstrate that Semi-Supervised EM is a valuable technique to improve the accuracy of lung cancer screening models using 3D CNNs.