半监督肺癌筛查的深度期望最大化

论文标题

半监督肺癌筛查的深度期望最大化

Deep Expectation-Maximization for Semi-Supervised Lung Cancer Screening

论文作者

Menon, Sumeet, Chapman, David, Nguyen, Phuong, Yesha, Yelena, Morris, Michael, Saboury, Babak

论文摘要

我们提出了一种半监督算法，用于肺癌筛查，其中3D卷积神经网络（CNN）使用预期最大化（EM）元词素进行训练。半监督学习允许将较小的标记数据集与未标记的数据集结合使用，以提供更大，更多样化的培训样本。 EM允许该算法同时计算CNN训练系数的最大似然估计以及未标记的训练集的标签，该标签被定义为潜在可变空间。我们通过国家肺肺筛查试验（NLST）数据集评估了CNN的半监督EM算法的模型性能。我们的结果表明，半监督的EM算法极大地提高了跨域肺癌筛查的分类精度，尽管结果比完全监督的方法低于完全监督的方法，从而获得了无处可比样品的其他标记数据。因此，我们证明了半监督的EM是使用3D CNN提高肺癌筛查模型准确性的宝贵技术。

We present a semi-supervised algorithm for lung cancer screening in which a 3D Convolutional Neural Network (CNN) is trained using the Expectation-Maximization (EM) meta-algorithm. Semi-supervised learning allows a smaller labelled data-set to be combined with an unlabeled data-set in order to provide a larger and more diverse training sample. EM allows the algorithm to simultaneously calculate a maximum likelihood estimate of the CNN training coefficients along with the labels for the unlabeled training set which are defined as a latent variable space. We evaluate the model performance of the Semi-Supervised EM algorithm for CNNs through cross-domain training of the Kaggle Data Science Bowl 2017 (Kaggle17) data-set with the National Lung Screening Trial (NLST) data-set. Our results show that the Semi-Supervised EM algorithm greatly improves the classification accuracy of the cross-domain lung cancer screening, although results are lower than a fully supervised approach with the advantage of additional labelled data from the unsupervised sample. As such, we demonstrate that Semi-Supervised EM is a valuable technique to improve the accuracy of lung cancer screening models using 3D CNNs.

下载PDF全文

下载文献需遵守相关版权规定

论文标题