Paper Title

Referencing Source Code Artifacts: a Separate Concern in Software Citation

Authors

Roberto Di Cosmo, Morane Gruenpeter, Stefano Zacchiroli

Abstract

Among the entities involved in software citation, software source code requires special attention because of the role it plays in ensuring scientific reproducibility. To reference source code we need identifiers that are not only unique and persistent, but also support integrity checking intrinsically. Suitable identifiers must guarantee that the denoted objects will always stay the same, without relying on external third parties or administrative processes. We analyze the role of identifiers for digital objects (IDOs), whose properties are different from, and complementary to, those of the various digital identifiers of objects (DIOs) that are today popular building blocks of software and data citation toolchains. We argue that both kinds of identifiers are needed, and we detail the syntax, semantics, and practical implementation of the persistent identifiers (PIDs) adopted by the Software Heritage project to reference billions of software source code artifacts such as source code files, directories, and commits.
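
The idea of an identifier that supports integrity checking intrinsically can be illustrated with a minimal sketch. The abstract does not spell out the hashing scheme, so the code below assumes the one given in the public SWHID specification for file contents: a SHA-1 digest computed over the bytes in Git blob form, prefixed with `swh:1:cnt:`. The function names `content_swhid` and `verify` are hypothetical and chosen only for this illustration.

```python
import hashlib


def content_swhid(data: bytes) -> str:
    """Compute an intrinsic identifier for a file's content.

    Assumption: content identifiers use the Git blob hash, i.e.
    SHA-1 over the header b"blob <length>\\0" followed by the raw bytes.
    """
    header = b"blob %d\0" % len(data)
    digest = hashlib.sha1(header + data).hexdigest()
    return f"swh:1:cnt:{digest}"


def verify(data: bytes, swhid: str) -> bool:
    """Check that `data` is exactly the object denoted by `swhid`.

    No registry or third party is consulted: the identifier is
    recomputed from the bytes themselves and compared.
    """
    return content_swhid(data) == swhid


if __name__ == "__main__":
    original = b"int main(void) { return 0; }\n"
    pid = content_swhid(original)
    print(pid)
    print(verify(original, pid))            # unchanged bytes match
    print(verify(original + b"// x", pid))  # any modification is detected
```

Because the identifier is derived from the content alone, anyone holding a copy of the file can check it against the cited identifier. An identifier assigned by a registry needs a resolver to map it to an object, which is why the abstract treats the two kinds as complementary.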
