Paper Title

Accelerating X-Ray Tracing for Exascale Systems using Kokkos

Authors

Wittwer, Felix; Sauter, Nicholas K.; Mendez, Derek; Poon, Billy K.; Brewster, Aaron S.; Holton, James M.; Wall, Michael E.; Hart, William E.; Bard, Deborah J.; Blaschke, Johannes P.

Abstract

The upcoming exascale computing systems Frontier and Aurora will draw much of their computing power from GPU accelerators. The hardware for these systems will be provided by AMD and Intel, respectively, each supporting their own GPU programming model. The challenge for applications that harness one of these exascale systems will be to avoid lock-in and to preserve performance portability. We report here on our results of using Kokkos to accelerate a real-world application on NERSC's Perlmutter Phase 1 (using NVIDIA A100 accelerators) and the testbed system for OLCF's Frontier (using AMD MI250X). By porting to Kokkos, we were able to successfully run the same X-ray tracing code on both systems and achieved speed-ups between 13% and 66% compared to the original CUDA code. These results are a highly encouraging demonstration of using Kokkos to accelerate production science code.
