论文标题
实践中的视觉GUI测试:扩展的工业案例研究
Visual GUI testing in practice: An extended industrial case study
论文作者
论文摘要
上下文:视觉GUI测试(VGT)被称为最新一代GUI测试。它是一种工具驱动的技术,它使用图像识别来与正在测试的系统的行为进行交互和断言。由一家大型土耳其软件和系统公司在国防和IT领域提供解决方案的工业需求,最近启动了一个行动研究项目,以在公司的几个团队和项目中实施VGT。 目的:为了满足上述需求,我们计划并进行了一项实证研究,目的是使用两种工具(Sikuli和Jautomate)评估VGT。目的是确定公司中给定项目(软件产品)VGT的合适方法和工具,增加公司测试团队的专业知识。 方法:使用动作研究案例研究设计,我们研究了在研究组织中使用VGT的。具体而言,使用两种选定的VGT工具,我们进行了VGT的定量和定性评估。 结果:通过评估先前工作中提出的挑战,问题和局限性(CPL)的列表,在我们的经验研究的背景下,我们发现与测试工具和SUT相关的CPL相当可与先前的经验研究相当可比。在评估测试维护活动的类型时,在下一个SUTS上执行自动测试案例时,对于两个测试工具,我们发现下一个版本中约有一半的测试用例(59.1%和47.8%)失败了。 结论:通过我们的结果,我们在进行VGT时确认了一些先前报告的问题。此外,我们在使用VGT时重点介绍了测试维护方面的一些其他挑战。
Context: Visual GUI testing (VGT) is referred to as the latest generation GUI-based testing. It is a tool-driven technique, which uses image recognition for interacting with and asserting the behavior of the system under test. Motivated by the industrial need of a large Turkish software and systems company providing solutions in the areas of defense and IT sector, an action-research project was recently initiated to implement VGT in several teams and projects in the company. Objective: To address the above needs, we planned and carried out an empirical investigation with the goal of assessing VGT using two tools (Sikuli and JAutomate). The purpose was to determine a suitable approach and tool for VGT of a given project (software product) in the company, increase the know-how in the company's test teams. Method: Using an action-research case-study design, we investigated the use of VGT in the studied organization. Specifically, using the two selected VGT tools, we conducted a quantitative and a qualitative evaluation of VGT. Results: By assessing the list of Challenges, Problems and Limitations (CPL), proposed in previous work, in the context of our empirical study, we found that test-tool- and SUT-related CPLs were quite comparable to a previous empirical study, e.g., the synchronization between SUT and test tools were not always robust and there were failures in test tools' image recognition features. When assessing the types of test maintenance activities, when executing the automated test cases on next versions of the SUTs, for the case of the two test tools, we found that about half of the test cases (59.1% and 47.8%) failed in the next version. Conclusion: By our results, we confirm some of the previously-reported issues when conducting VGT. Further, we highlight some additional challenges in test maintenance when using VGT.