使用分层贝叶斯网络提高可追溯性链接恢复的有效性

论文标题

使用分层贝叶斯网络提高可追溯性链接恢复的有效性

Improving the Effectiveness of Traceability Link Recovery using Hierarchical Bayesian Networks

论文作者

Moran, Kevin, Palacio, David N., Bernal-Cárdenas, Carlos, McCrystal, Daniel, Poshyvanyk, Denys, Shenefiel, Chris, Johnson, Jeff

论文摘要

可追溯性是现代软件开发过程的基本组成部分，该过程有助于确保正确运行，安全的程序。由于手动建立痕量链接的高成本，研究人员开发了自动化的方法，这些方法使用相似性措施在成对的文本软件工件之间建立了关系。但是，此类技术的有效性通常受到限制，因为它们仅利用一种伪像的相似性度量，并且不能同时建模不同的开发文物组之间（隐式和显式）关系。在本文中，我们说明了如何通过使用量身定制的概率模型来克服这些局限性。为此，我们设计并实施了能够推断候选跟踪链接的软件可追溯性（彗星）的层次概率模型。彗星能够通过结合多种文本相似性测量的互补观察能力来建模工件之间的关系。此外，我们的模型可以整体上合并来自各种来源的信息，包括开发人员的反馈和软件伪像之间的传播（通常是隐式）关系，以提高推理准确性。我们对彗星进行了全面的经验评估，该评估说明了一组最佳配置的基准在最佳情况下约为$ 14％，在所有受试者中，就平均精度而言，$ \ $ \ $ 5％。通常不可能使用最佳配置的彗星在实践中的比较有效性可能会更高。最后，我们说明了与使用原型詹金斯插件的Cisco Systems的开发人员进行的调查中的实际适用性的彗星。

Traceability is a fundamental component of the modern software development process that helps to ensure properly functioning, secure programs. Due to the high cost of manually establishing trace links, researchers have developed automated approaches that draw relationships between pairs of textual software artifacts using similarity measures. However, the effectiveness of such techniques are often limited as they only utilize a single measure of artifact similarity and cannot simultaneously model (implicit and explicit) relationships across groups of diverse development artifacts. In this paper, we illustrate how these limitations can be overcome through the use of a tailored probabilistic model. To this end, we design and implement a HierarchiCal PrObabilistic Model for SoftwarE Traceability (Comet) that is able to infer candidate trace links. Comet is capable of modeling relationships between artifacts by combining the complementary observational prowess of multiple measures of textual similarity. Additionally, our model can holistically incorporate information from a diverse set of sources, including developer feedback and transitive (often implicit) relationships among groups of software artifacts, to improve inference accuracy. We conduct a comprehensive empirical evaluation of Comet that illustrates an improvement over a set of optimally configured baselines of $\approx$14% in the best case and $\approx$5% across all subjects in terms of average precision. The comparative effectiveness of Comet in practice, where optimal configuration is typically not possible, is likely to be higher. Finally, we illustrate Comets potential for practical applicability in a survey with developers from Cisco Systems who used a prototype Comet Jenkins plugin.

下载PDF全文

下载文献需遵守相关版权规定

论文标题