论文标题
COVID-19-Infodemic:人群可以客观地判断最近的错误信息吗?
The COVID-19 Infodemic: Can the Crowd Judge Recent Misinformation Objectively?
论文作者
论文摘要
错误信息是一个越来越多的问题,很难为研究界解决,并对整个社会产生负面影响。最近,通过基于众包的方法来扩展标签工作的方法解决了问题:为了评估声明的真实性,而不是依靠一些专家,而是(非专家)法官的人群被利用。我们遵循相同的方法来研究众包是否是一种评估大流行期间陈述真实性的有效和可靠的方法。我们专门针对与Covid-19卫生紧急情况有关的陈述,该紧急情况仍在研究时仍在进行中,并且可以说是导致在网上扩散的错误信息的增加(一种使用了“ Infodecity”一词)。通过这样做,我们能够解决与敏感和个人问题(如健康)相关的(MIS)信息,以及与做出判断的何时相比,最近的信息:在相关工作中尚未分析的两个问题。在我们的实验中,要求人群工人评估陈述的真实性,并为评估作为URL和文本辩护提供证据。除了表明人群能够准确地判断陈述的真实性外,我们还报告了许多不同方面的结果,包括:工人之间的一致性,不同聚合功能的效果,量表转换的影响以及工人的背景 /偏见。我们还根据提交的查询,找到 /选择的URL,文本理由以及其他行为数据(例如通过临时记录器收集的点击和鼠标动作)分析工人的行为。
Misinformation is an ever increasing problem that is difficult to solve for the research community and has a negative impact on the society at large. Very recently, the problem has been addressed with a crowdsourcing-based approach to scale up labeling efforts: to assess the truthfulness of a statement, instead of relying on a few experts, a crowd of (non-expert) judges is exploited. We follow the same approach to study whether crowdsourcing is an effective and reliable method to assess statements truthfulness during a pandemic. We specifically target statements related to the COVID-19 health emergency, that is still ongoing at the time of the study and has arguably caused an increase of the amount of misinformation that is spreading online (a phenomenon for which the term "infodemic" has been used). By doing so, we are able to address (mis)information that is both related to a sensitive and personal issue like health and very recent as compared to when the judgment is done: two issues that have not been analyzed in related work. In our experiment, crowd workers are asked to assess the truthfulness of statements, as well as to provide evidence for the assessments as a URL and a text justification. Besides showing that the crowd is able to accurately judge the truthfulness of the statements, we also report results on many different aspects, including: agreement among workers, the effect of different aggregation functions, of scales transformations, and of workers background / bias. We also analyze workers behavior, in terms of queries submitted, URLs found / selected, text justifications, and other behavioral data like clicks and mouse actions collected by means of an ad hoc logger.