Paper Title

ROIBIN-SZ: Fast and Science-Preserving Compression for Serial Crystallography

Authors

Robert Underwood, Chun Yoon, Ali Gok, Sheng Di, Franck Cappello

Abstract

Crystallography is the leading technique for studying the atomic structures of proteins, and it produces enormous volumes of data that can strain the storage and data transfer capabilities of synchrotron and free-electron laser light sources. Lossy compression has been identified as a possible means to cope with the growing data volumes; however, prior approaches have not produced sufficient quality at a sufficient rate to meet scientific needs. This paper presents Region Of Interest BINning with SZ lossy compression (ROIBIN-SZ), a novel, parallel, and accelerated compression scheme that separates the dynamically selected preservation of key regions from the lossy compression of background information. We present an extensive evaluation of the performance and quality results achieved by the co-design of this compression scheme. We achieve compression ratios of up to 196x and 46.44x on lysozyme and selenobiotinyl-streptavidin, respectively, while preserving the data sufficiently to reconstruct the structure, at bandwidths and scales that approach the needs of upcoming light sources.
