论文标题

测试支持工具进行窃检测

Testing of Support Tools for Plagiarism Detection

论文作者

Foltýnek, Tomáš, Dlabolová, Dita, Anohina-Naumeca, Alla, Razı, Salim, Kravjar, Július, Kamzola, Laima, Guerrero-Dib, Jean, Çelik, Özgür, Weber-Wulff, Debora

论文摘要

人们普遍认为,软件必须能够轻松地完成人类觉得困难的事情。由于在文本中找到窃的资源并不是一件容易的事,因此人们普遍期望软件必须简单地确定文本是否被窃。软件无法确定窃,但它可以作为支持工具来确定可能构成窃的文本相似性。但是,各种系统的工作效果如何?本文报告了15个基于Web的文本匹配系统的协作测试,该系统可在怀疑窃时使用。它是由来自七个国家 /地区的研究人员使用八种不同语言的测试材料进行的,评估了系统对单源和多源文档的有效性。还进行了可用性检查。清醒的结果表明,尽管某些系统确实可以帮助识别一些窃内容,但显然没有发现所有窃,有时还可以将非跨性材料识别为有问题。

There is a general belief that software must be able to easily do things that humans find difficult. Since finding sources for plagiarism in a text is not an easy task, there is a wide-spread expectation that it must be simple for software to determine if a text is plagiarized or not. Software cannot determine plagiarism, but it can work as a support tool for identifying some text similarity that may constitute plagiarism. But how well do the various systems work? This paper reports on a collaborative test of 15 web-based text-matching systems that can be used when plagiarism is suspected. It was conducted by researchers from seven countries using test material in eight different languages, evaluating the effectiveness of the systems on single-source and multi-source documents. A usability examination was also performed. The sobering results show that although some systems can indeed help identify some plagiarized content, they clearly do not find all plagiarism and at times also identify non-plagiarized material as problematic.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源