通过卷积神经网络分布的域泛化方法12铅ECG分类

论文标题

通过卷积神经网络分布的域泛化方法12铅ECG分类

A Domain Generalization Approach for Out-Of-Distribution 12-lead ECG Classification with Convolutional Neural Networks

论文作者

Ballas, Aristotelis, Diou, Christos

论文摘要

在过去的几年中，深度学习系统取得了巨大的成功，甚至在某些情况下超越了人类智能。最近，他们还在生物医学和医疗保健领域中建立了自己的希望，在那里他们表现出了很多希望，但尚未获得广泛的采用。这部分是由于以下事实：大多数方法在被要求对数据的决定源于与训练的数据不同的数据（即分布式分布（OOD）数据）时无法保持其性能。例如，在生物信号分类的情况下，由于不同数据源之间的分布差异，模型通常无法在不同医院的数据集上概括。我们的目标是通过利用在深层神经网络的整个结构中提取的信息并捕获信号的潜在结构来证明不同医院数据库之间存在的域泛化问题，并提出了一种对12铅心电图（ECG）进行异常的方法。为此，我们采用RESNET-18作为骨干模型，并从网络的几个中间卷积层中提取特征。为了评估我们的方法，我们从四个来源采用公开可用的ECG数据集，并将其作为单独的域处理。为了模拟现实世界中存在的分配变化，我们在域的子集上训练模型，并遗漏其余的模型。然后，我们在训练时间（分数内）和固定数据（分布式）上评估我们的模型，在大多数情况下，取得了有希望的结果并超过了香草残留网络的基线。

Deep Learning systems have achieved great success in the past few years, even surpassing human intelligence in several cases. As of late, they have also established themselves in the biomedical and healthcare domains, where they have shown a lot of promise, but have not yet achieved widespread adoption. This is in part due to the fact that most methods fail to maintain their performance when they are called to make decisions on data that originate from a different distribution than the one they were trained on, namely Out-Of-Distribution (OOD) data. For example, in the case of biosignal classification, models often fail to generalize well on datasets from different hospitals, due to the distribution discrepancy amongst different sources of data. Our goal is to demonstrate the Domain Generalization problem present between distinct hospital databases and propose a method that classifies abnormalities on 12-lead Electrocardiograms (ECGs), by leveraging information extracted across the architecture of a Deep Neural Network, and capturing the underlying structure of the signal. To this end, we adopt a ResNet-18 as the backbone model and extract features from several intermediate convolutional layers of the network. To evaluate our method, we adopt publicly available ECG datasets from four sources and handle them as separate domains. To simulate the distributional shift present in real-world settings, we train our model on a subset of the domains and leave-out the remaining ones. We then evaluate our model both on the data present at training time (intra-distribution) and the held-out data (out-of-distribution), achieving promising results and surpassing the baseline of a vanilla Residual Network in most of the cases.

下载PDF全文

下载文献需遵守相关版权规定

论文标题