论文标题

funcx:一种联合功能,用于科学的面料

funcX: A Federated Function Serving Fabric for Science

论文作者

Chard, Ryan, Babuji, Yadu, Li, Zhuozhao, Skluzacek, Tyler, Woodard, Anna, Blaiszik, Ben, Foster, Ian, Chard, Kyle

论文摘要

爆炸数据量和速度,新的计算方法和平台以及无处不在的连通性需要科学中的新计算方法。这些新方法必须使计算能够移动,例如,它可以在数据附近发生,并由事件(例如,新数据到达)触发,将其卸载到专用加速器上,或者在可用资源的地方远程运行。他们还需要新的设计方法,其中可以将单片应用分解为较小的组件,而这些应用程序可能会单独执行和最合适的资源执行。为了满足这些需求,我们提出了funcx ---分布式功能作为服务(FAAS)平台,可实现灵活,可扩展和高性能远程功能执行。 Funcx的端点软件可以将现有的云,簇和超级计算机转换为功能服务系统,而FunCX的云托管服务可在端点的联合生态系统中提供透明,安全且可靠的功能执行。我们通过几项科学案例研究激发了对Funcx的需求,介绍了我们的原型设计和实施,显示出每秒超过100万个功能的吞吐量的优化,并通过对两个超级计算机进行实验来证明Funcx可以扩展到超过130000名并发工人。

Exploding data volumes and velocities, new computational methods and platforms, and ubiquitous connectivity demand new approaches to computation in the sciences. These new approaches must enable computation to be mobile, so that, for example, it can occur near data, be triggered by events (e.g., arrival of new data), be offloaded to specialized accelerators, or run remotely where resources are available. They also require new design approaches in which monolithic applications can be decomposed into smaller components, that may in turn be executed separately and on the most suitable resources. To address these needs we present funcX---a distributed function as a service (FaaS) platform that enables flexible, scalable, and high performance remote function execution. funcX's endpoint software can transform existing clouds, clusters, and supercomputers into function serving systems, while funcX's cloud-hosted service provides transparent, secure, and reliable function execution across a federated ecosystem of endpoints. We motivate the need for funcX with several scientific case studies, present our prototype design and implementation, show optimizations that deliver throughput in excess of 1 million functions per second, and demonstrate, via experiments on two supercomputers, that funcX can scale to more than more than 130000 concurrent workers.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源