通过多个实例学习的谣言验证和立场检测的弱监督传播模型

论文标题

通过多个实例学习的谣言验证和立场检测的弱监督传播模型

A Weakly Supervised Propagation Model for Rumor Verification and Stance Detection with Multiple Instance Learning

论文作者

Yang, Ruichao, Ma, Jing, Lin, Hongzhan, Gao, Wei

论文摘要

谣言对微博的扩散通常遵循传播树结构，该结构提供了有关用户如何随时间传输和响应原始消息的宝贵线索。最近的研究表明，谣言检测和立场检测是两个不同但相关的任务，可以共同增强彼此，例如，可以通过交叉检查其相关的微博哨所所传达的立场来揭穿谣言，并且立场也基于谣言的性质。但是，大多数立场检测方法都需要大量的后立场标签进行训练，鉴于大量职位，这是劳动密集型的。通过多个实例学习（MIL）方案的启发，我们首先代表了自下而上和自上而下的树木的索赔的扩散，然后提出了两个树结构的弱监督框架，以共同对谣言和立场进行分类，其中只需要关于索赔真实性的行李级标签。具体而言，我们将多级问题转换为基于多MIL的二进制分类问题，其中每个二进制模型都致力于区分目标立场或谣言类型和其他类型。最后，我们提出了一种分层注意机制来汇总二进制预测，包括（1）自下而上或自上而下的树木注意层，将二进制立场汇总为二进制真实性；（2）将二进制类别汇总成细粒类别的歧视性注意力层。与最先进的方法相比，在三个基于Twitter的数据集上进行的广泛实验表明，我们模型在索赔级谣言检测和后立场分类方面表现出色。

The diffusion of rumors on microblogs generally follows a propagation tree structure, that provides valuable clues on how an original message is transmitted and responded by users over time. Recent studies reveal that rumor detection and stance detection are two different but relevant tasks which can jointly enhance each other, e.g., rumors can be debunked by cross-checking the stances conveyed by their relevant microblog posts, and stances are also conditioned on the nature of the rumor. However, most stance detection methods require enormous post-level stance labels for training, which are labor-intensive given a large number of posts. Enlightened by Multiple Instance Learning (MIL) scheme, we first represent the diffusion of claims with bottom-up and top-down trees, then propose two tree-structured weakly supervised frameworks to jointly classify rumors and stances, where only the bag-level labels concerning claim's veracity are needed. Specifically, we convert the multi-class problem into a multiple MIL-based binary classification problem where each binary model focuses on differentiating a target stance or rumor type and other types. Finally, we propose a hierarchical attention mechanism to aggregate the binary predictions, including (1) a bottom-up or top-down tree attention layer to aggregate binary stances into binary veracity; and (2) a discriminative attention layer to aggregate the binary class into finer-grained classes. Extensive experiments conducted on three Twitter-based datasets demonstrate promising performance of our model on both claim-level rumor detection and post-level stance classification compared with state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题