论文标题
评估用于描述,复制和重复科学工作流程的工具
Evaluation of tools for describing, reproducing and reusing scientific workflows
论文作者
论文摘要
在计算科学和工程领域中,工作流通常需要应用各种软件的应用,例如进行模拟或预处理。通常,这些组件必须在任意复杂的工作流程中合并,以解决特定的研究问题。为了使同伴研究人员理解,复制和(重新)使用科学出版物的发现,必须解决一些挑战。例如,必须自动化所采用的工作流程,并且所有使用的软件的信息都必须用于复制结果。此外,结果必须是可追溯的,并且可以记录和可读的工作流程,以允许外部验证和更大的信任。在本文中,讨论了现有的工作流管理系统(WFMS),以描述,再现和重复使用科学工作流程。为此,我们认为从我们认为与计算科学和工程领域相关的用户故事中得出的一系列wfmsswere的一般要求。根据示例性的工作流程实施,在GitHub公开托管(https://github.com/bamresearch/nfdi4ingscientificworkflowrequirements),与这些要求相比,将选择不同的WFMS的选择与这些要求相比,以支持同伴确定最适合其需求的同伴。
In the field of computational science and engineering, workflows often entail the application of various software, for instance, for simulation or pre- and postprocessing. Typically, these components have to be combined in arbitrarily complex workflows to address a specific research question. In order for peer researchers to understand, reproduce and (re)use the findings of a scientific publication, several challenges have to be addressed. For instance, the employed workflow has to be automated and information on all used software must be available for a reproduction of the results. Moreover, the results must be traceable and the workflow documented and readable to allow for external verification and greater trust. In this paper, existing workflow management systems (WfMSs) are discussed regarding their suitability for describing, reproducing and reusing scientific workflows. To this end, a set of general requirements for WfMSswere deduced from user stories that we deem relevant in the domain of computational science and engineering. On the basis of an exemplary workflow implementation, publicly hosted at GitHub (https:// github.com/BAMresearch/NFDI4IngScientificWorkflowRequirements), a selection of different WfMSs is compared with respect to these requirements, to support fellow scientists in identifying the WfMSs that best suit their requirements.