论文标题
Optimam乳房X线摄影图像数据库:乳房X线摄影图像和临床数据的大规模资源
OPTIMAM Mammography Image Database: a large scale resource of mammography images and clinical data
论文作者
论文摘要
医学成像研究的主要障碍,尤其是人工智能(AI)的发展是缺乏与其他研究人员共享图像的大型医学图像数据库。没有这样的数据库,就无法培训可通用的AI算法,并且花费大量时间和资金用于在单个研究中心收集较小的数据集。开发了Optimam图像数据库(OMI-DB)来克服这些障碍。 OMI-DB由几个关系数据库和云存储系统组成,其中包含乳房X线摄影图像以及相关的临床和病理信息。该数据库包含从三个英国乳房筛查中心收集的173,319名妇女的250万张图像。其中包括154,832名乳房正常的妇女,6909名具有良性发现的妇女,9690名患有筛查癌症的妇女和1888名伴随癌症的女性。收集正在进行中,所有妇女都经过跟进,其临床状况根据随后的筛查情节进行了更新。先前筛选乳房X线照片和间隔癌症的可用性是AI开发的重要资源。自2014年以来,来自OMI-DB的数据已与30多个研究小组和公司共享。通过筹集者和批准的学术和商业研究小组之间的共享协议,这种进步的方法是可能的。 OMI-DB等研究数据集为研究提供了强大的资源。
A major barrier to medical imaging research and in particular the development of artificial intelligence (AI) is a lack of large databases of medical images which share images with other researchers. Without such databases it is not possible to train generalisable AI algorithms, and large amounts of time and funding is spent collecting smaller datasets at individual research centres. The OPTIMAM image database (OMI-DB) has been developed to overcome these barriers. OMI-DB consists of several relational databases and cloud storage systems, containing mammography images and associated clinical and pathological information. The database contains over 2.5 million images from 173,319 women collected from three UK breast screening centres. This includes 154,832 women with normal breasts, 6909 women with benign findings, 9690 women with screen-detected cancers and 1888 women with interval cancers. Collection is on-going and all women are followed-up and their clinical status updated according to subsequent screening episodes. The availability of prior screening mammograms and interval cancers is a vital resource for AI development. Data from OMI-DB has been shared with over 30 research groups and companies, since 2014. This progressive approach has been possible through sharing agreements between the funder and approved academic and commercial research groups. A research dataset such as the OMI-DB provides a powerful resource for research.