论文标题
共享开发人员之间沟通对开源软件项目生产率的影响
Influence of Communication Among Shared Developers on the Productivity of Open Source Software Projects
论文作者
论文摘要
许多软件开发人员依靠开源软件来开发其应用程序并编写其源代码。对于许多技术公司来说,衡量独立项目的整体生产力仍然是一个空旷的问题。在这个项目中,我们致力于弥合分析差距,这是预测基于生产力系统的最重要特征。我们选择通过其应用程序编程界面(API)从GitHub收集数据,并分析我们收集的数据,以了解关闭问题的平均时间与我们收集的功能之间的关系。由于我们收集的大多数数据不是高斯,因此我们必须在统计建模之前使用离群值检测和应用转换进行预处理。我们观察到的最好的模型是高度5学位的多项式回归。总体而言,我们注意到软件开发的许多方面使开发人员提高了其生产率。
Many software developers rely on open source software for developing their applications and writing their source codes. Measuring an independent project's overall productivity is still an open problem for many technology companies. In this project, we address to bridge the gap of analyzing which are the most important features for prediction of a productivity based system. We have chosen to collect data from GitHub via their application programming interfaces (API) and analyze the data we gathered to understand the relation between the average time to close an issue and the features that we collected. Since most of the data we gathered were not Gaussian, we had to preprocess the data using outlier detection and applying transformations before statistical modeling. The best model we observed was polynomial regression with degree 5. Overall, we noticed that there are many aspects of software development that make developers increase their productivity.