论文标题

SSIM引导的CGAN结构,用于临床驱动的多重空间蛋白质组学通道的临床驱动生成图像合成

A SSIM Guided cGAN Architecture For Clinically Driven Generative Image Synthesis of Multiplexed Spatial Proteomics Channels

论文作者

Saurav, Jillur Rahman, Nasr, Mohammad Sadegh, Koomey, Paul, Robben, Michael, Huber, Manfred, Weidanz, Jon, Ryan, Bríd, Ruppin, Eytan, Jiang, Peng, Luber, Jacob M.

论文摘要

在这里,我们提出了结构相似性指数度量(SSIM)引导条件性生成对抗网络(CGAN),该网络有条件地执行图像到图像图像(I2i)合成以在多路复用空间蛋白质组学图像中生成光准确的蛋白质通道。该方法可以用于准确地产生缺失的空间蛋白质组学通道,这些通道在台式或诊所的实验数据收集过程中未包含。来自人类生物分子图(Hubmap)的实验空间蛋白质组学数据用于通过基于U-NET的图像合成管道来生成缺失蛋白质的空间表示。 Hubmap通道在层次上由(SSIM)作为启发式,以获得概括由蛋白质空间景观所代表的基本生物学所需的最小集合。随后,我们证明了我们的基于SSIM的体系结构允许将生成图像合成缩放到具有多达100个通道的幻灯片,这比限于具有11个通道的数据的当前状态要好。我们通过从人类肺腺癌组织切片中生成新的实验空间蛋白质组学数据集来验证这些主张,并表明在Hubmap上训练的模型可以准确地从我们的新数据集中合成通道。从稀疏染色的含有空间蛋白质组学的稀疏染色的多重组织学载玻片中概括实验数据的能力将对医学诊断和药物开发产生巨大影响,并提出了有关利用临床环境中生成图像合成产生的数据的医学伦理学的重要问题。我们在本文中提出的算法将使研究人员和临床医生可以节省基于蛋白质组学的组织学染色的时间和成本,同时还增加了通过实验可以生成的数据量。

Here we present a structural similarity index measure (SSIM) guided conditional Generative Adversarial Network (cGAN) that generatively performs image-to-image (i2i) synthesis to generate photo-accurate protein channels in multiplexed spatial proteomics images. This approach can be utilized to accurately generate missing spatial proteomics channels that were not included during experimental data collection either at the bench or the clinic. Experimental spatial proteomic data from the Human BioMolecular Atlas Program (HuBMAP) was used to generate spatial representations of missing proteins through a U-Net based image synthesis pipeline. HuBMAP channels were hierarchically clustered by the (SSIM) as a heuristic to obtain the minimal set needed to recapitulate the underlying biology represented by the spatial landscape of proteins. We subsequently prove that our SSIM based architecture allows for scaling of generative image synthesis to slides with up to 100 channels, which is better than current state of the art algorithms which are limited to data with 11 channels. We validate these claims by generating a new experimental spatial proteomics data set from human lung adenocarcinoma tissue sections and show that a model trained on HuBMAP can accurately synthesize channels from our new data set. The ability to recapitulate experimental data from sparsely stained multiplexed histological slides containing spatial proteomic will have tremendous impact on medical diagnostics and drug development, and also raises important questions on the medical ethics of utilizing data produced by generative image synthesis in the clinical setting. The algorithm that we present in this paper will allow researchers and clinicians to save time and costs in proteomics based histological staining while also increasing the amount of data that they can generate through their experiments.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源