论文标题
以功能驱动的方法来识别致病社交媒体帐户
A Feature-Driven Approach for Identifying Pathogenic Social Media Accounts
论文作者
论文摘要
在过去的几年中,我们观察到了不同的媒体通过构建信息来支持促进其目标的叙述来改变公众舆论的尝试。恶意用户称为“致病社交媒体”(PSM)帐户更有可能通过将错误信息传播到病毒比例来扩大这种现象。因此,从帐户级别的角度了解错误信息的传播是一个紧迫的问题。在这项工作中,我们旨在提出一种以功能驱动的方法来检测社交媒体中的PSM帐户。受文献的启发,我们着手从三个广泛的角度评估PSM:(1)与用户相关的信息(例如,用户活动,个人资料特征),(2)与源相关信息(即通过用户共享的URL链接的信息)和(3)与内容相关的信息(例如,Temersications)。对于用户相关信息,我们使用因果分析(即用户经常是病毒级联原因)和个人资料特征(例如,关注者的数量等)调查恶意信号。对于与源相关的信息,我们探讨了与URL相关的各种恶意属性(例如,URL地址,关联网站的内容等)。最后,对于与内容相关的信息,我们从用户发布的推文中检查了属性(例如,主题标签,可疑主题标签等)。来自不同国家的现实世界Twitter数据的实验证明了拟议方法在识别PSM用户方面的有效性。
Over the past few years, we have observed different media outlets' attempts to shift public opinion by framing information to support a narrative that facilitate their goals. Malicious users referred to as "pathogenic social media" (PSM) accounts are more likely to amplify this phenomena by spreading misinformation to viral proportions. Understanding the spread of misinformation from account-level perspective is thus a pressing problem. In this work, we aim to present a feature-driven approach to detect PSM accounts in social media. Inspired by the literature, we set out to assess PSMs from three broad perspectives: (1) user-related information (e.g., user activity, profile characteristics), (2) source-related information (i.e., information linked via URLs shared by users) and (3) content-related information (e.g., tweets characteristics). For the user-related information, we investigate malicious signals using causality analysis (i.e., if user is frequently a cause of viral cascades) and profile characteristics (e.g., number of followers, etc.). For the source-related information, we explore various malicious properties linked to URLs (e.g., URL address, content of the associated website, etc.). Finally, for the content-related information, we examine attributes (e.g., number of hashtags, suspicious hashtags, etc.) from tweets posted by users. Experiments on real-world Twitter data from different countries demonstrate the effectiveness of the proposed approach in identifying PSM users.