论文标题

关于软件工程系统文献评论的混合搜索策略的性能

On the Performance of Hybrid Search Strategies for Systematic Literature Reviews in Software Engineering

论文作者

Mourão, Erica, Pimentel, João Felipe, Murta, Leonardo, Kalinowski, Marcos, Mendes, Emilia, Wohlin, Claes

论文摘要

环境:进行系统文献综述(SLR)时,研究人员通常会面临设计搜索策略的挑战,该搜索策略适当地平衡了质量和审查工作。仅使用数字图书馆(或数据库)搜索或仅滚雪球可能不足以获得高质量的结果。另一方面,使用数字图书馆搜索并共同滚雪球可能会增加整体审查工作。 目的:这项研究的目的是提出和评估混合搜索策略,这些策略有选择地将数据库搜索与滚雪球结合在一起。 方法:我们提出了四种混合搜索策略,将数字库中的数据库搜索与迭代,平行或顺序向后和向前滚雪球相结合。我们模拟了SE中三个现有SLR的策略,这些策略都采用了数据库搜索和滚雪球。我们使用精确,召回和F量表来研究每种策略的性能,比较了数字图书馆搜索,滚雪球和混合策略的结果。 结果:我们的结果表明,对于经过分析的SLR,将Scopus数字库中的数据库搜索与并行或顺序的滚雪球结合在一起,达到了精确和回忆的最合适平衡。 结论:我们提出,根据SLR的目标和可用资源的目标,使用涉及代表性数字图书馆的混合搜索策略和平行或顺序的滚雪球倾向于代表在SLR中搜索证据时要使用的适当替代方案。

Context: When conducting a Systematic Literature Review (SLR), researchers usually face the challenge of designing a search strategy that appropriately balances result quality and review effort. Using digital library (or database) searches or snowballing alone may not be enough to achieve high-quality results. On the other hand, using both digital library searches and snowballing together may increase the overall review effort. Objective: The goal of this research is to propose and evaluate hybrid search strategies that selectively combine database searches with snowballing. Method: We propose four hybrid search strategies combining database searches in digital libraries with iterative, parallel, or sequential backward and forward snowballing. We simulated the strategies over three existing SLRs in SE that adopted both database searches and snowballing. We compared the outcome of digital library searches, snowballing, and hybrid strategies using precision, recall, and F-measure to investigate the performance of each strategy. Results: Our results show that, for the analyzed SLRs, combining database searches from the Scopus digital library with parallel or sequential snowballing achieved the most appropriate balance of precision and recall. Conclusion: We put forward that, depending on the goals of the SLR and the available resources, using a hybrid search strategy involving a representative digital library and parallel or sequential snowballing tends to represent an appropriate alternative to be used when searching for evidence in SLRs.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源