通过完全卷积神经网络对医疗图像分割的转移学习的批判性评估

论文标题

通过完全卷积神经网络对医疗图像分割的转移学习的批判性评估

Critical Assessment of Transfer Learning for Medical Image Segmentation with Fully Convolutional Neural Networks

论文作者

Karimi, Davood, Warfield, Simon K., Gholipour, Ali

论文摘要

转移学习广泛用于培训机器学习模型。在这里，我们研究了转移学习在训练完全卷积网络（FCN）中的作用，以进行医学图像分割。我们的实验表明，尽管转移学习减少了目标任务的训练时间，但分割精度的提高是高度任务/数据依赖性的。当分割任务更具挑战性并且目标训练数据较小时，可以观察到准确性的更大提高。我们观察到FCN的卷积过滤器在训练医疗图像分割过程中很少发生变化，并且仍然是随机的收敛性。我们进一步表明，可以通过以随机值冻结网络的编码部分，而仅训练解码器部分来构建非常准确的FCN。至少对于医学图像细分，这一发现挑战了共同的信念，即编码器部分需要学习数据/特定于任务的表示。我们研究了FCN表示的演变，以更好地了解转移学习对训练动态的影响。我们的分析表明，尽管通过转移学习训练的FCN学习与接受随机初始化训练的FCN不同，但通过转移学习训练的FCN的变异性可能与接受随机初始化训练的FCN一样高。此外，功能重用不仅限于早期编码器层。相反，它在更深的层中可能更为重要。这些发现提供了新的见解，并提出了培训FCN的替代方法，以进行医学图像细分。

Transfer learning is widely used for training machine learning models. Here, we study the role of transfer learning for training fully convolutional networks (FCNs) for medical image segmentation. Our experiments show that although transfer learning reduces the training time on the target task, the improvement in segmentation accuracy is highly task/data-dependent. Larger improvements in accuracy are observed when the segmentation task is more challenging and the target training data is smaller. We observe that convolutional filters of an FCN change little during training for medical image segmentation, and still look random at convergence. We further show that quite accurate FCNs can be built by freezing the encoder section of the network at random values and only training the decoder section. At least for medical image segmentation, this finding challenges the common belief that the encoder section needs to learn data/task-specific representations. We examine the evolution of FCN representations to gain a better insight into the effects of transfer learning on the training dynamics. Our analysis shows that although FCNs trained via transfer learning learn different representations than FCNs trained with random initialization, the variability among FCNs trained via transfer learning can be as high as that among FCNs trained with random initialization. Moreover, feature reuse is not restricted to the early encoder layers; rather, it can be more significant in deeper layers. These findings offer new insights and suggest alternative ways of training FCNs for medical image segmentation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题