论文标题

敏捷开源软件项目的多功能数据集

A Versatile Dataset of Agile Open Source Software Projects

论文作者

Tawosi, Vali, Al-Subaihin, Afnan, Moussa, Rebecca, Sarro, Federica

论文摘要

如今,敏捷软件开发已成为开源和工业软件项目中广泛采用的实践。敏捷团队通常在很大程度上依靠问题管理工具来记录新问题并跟踪杰出问题,此外还存储了他们的技术细节,努力估算,对开发人员的分配等等。先前的工作将存储在问题管理系统中的历史信息用于各种目的;但是,当研究人员公开其经验数据时,通常与研究的目标仅相关。在本文中,我们提出了一个更全面,更广泛的数据集,其中包含有关44个开源敏捷软件的500,000多个问题的大量信息,使其非常适合几种研究途径,以及其中的交叉分析,包括努力估算,发行优先级优先级,发行,发行分配等等。我们在GitHub上公开提供此数据,以促进易于使用,维护和可扩展性。

Agile software development is nowadays a widely adopted practise in both open-source and industrial software projects. Agile teams typically heavily rely on issue management tools to document new issues and keep track of outstanding ones, in addition to storing their technical details, effort estimates, assignment to developers, and more. Previous work utilised the historical information stored in issue management systems for various purposes; however, when researchers make their empirical data public, it is usually relevant solely to the study's objective. In this paper, we present a more holistic and versatile dataset containing a wealth of information on more than 500,000 issues from 44 open-source Agile software, making it well-suited to several research avenues, and cross-analyses therein, including effort estimation, issue prioritization, issue assignment and many more. We make this data publicly available on GitHub to facilitate ease of use, maintenance, and extensibility.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源