论文标题
不断发展的Simgans改善异常心电图分类
Evolving SimGANs to Improve Abnormal Electrocardiogram Classification
论文作者
论文摘要
机器学习模型用于各种领域。但是,机器学习方法通常需要大量数据才能成功。这在收集现实世界数据困难和/或昂贵的域中尤其麻烦。这些域中确实存在数据模拟器,但是由于缺乏现实世界噪声等因素,它们没有充分反映现实世界数据。最近对生成对抗网络(GAN)进行了修改,以使用SIMGAN方法将模拟图像数据完善到更好地适合现实世界分布的数据中。虽然已将进化计算用于GAN演化,但目前尚无框架可以进化SIMGAN。在本文中,我们(1)将SIMGAN方法扩展到完善一维数据,(2)修改Easy Cartesian遗传编程(EZCGP),一种进化计算框架(EZCGP),以创建更准确地完善模拟数据的SIMGAN,并(3)创建基于新功能的定量衡量指标来评估精制数据。我们还使用框架来增强心电图(ECG)数据集,该数据集遇到了前面提到的问题。特别是,尽管可以模拟健康的心电图,但目前没有异常的心电图模拟器。我们表明,通过使用进化的SIMGAN来完善模拟的Healthy ECG数据,以模仿现实世界中的异常ECG,我们可以提高异常ECG分类器的准确性。
Machine Learning models are used in a wide variety of domains. However, machine learning methods often require a large amount of data in order to be successful. This is especially troublesome in domains where collecting real-world data is difficult and/or expensive. Data simulators do exist for many of these domains, but they do not sufficiently reflect the real world data due to factors such as a lack of real-world noise. Recently generative adversarial networks (GANs) have been modified to refine simulated image data into data that better fits the real world distribution, using the SimGAN method. While evolutionary computing has been used for GAN evolution, there are currently no frameworks that can evolve a SimGAN. In this paper we (1) extend the SimGAN method to refine one-dimensional data, (2) modify Easy Cartesian Genetic Programming (ezCGP), an evolutionary computing framework, to create SimGANs that more accurately refine simulated data, and (3) create new feature-based quantitative metrics to evaluate refined data. We also use our framework to augment an electrocardiogram (ECG) dataset, a domain that suffers from the issues previously mentioned. In particular, while healthy ECGs can be simulated there are no current simulators of abnormal ECGs. We show that by using an evolved SimGAN to refine simulated healthy ECG data to mimic real-world abnormal ECGs, we can improve the accuracy of abnormal ECG classifiers.