论文标题
公共智慧很重要!情绪含义的双曲线傅立叶共同注意社会文本分类
Public Wisdom Matters! Discourse-Aware Hyperbolic Fourier Co-Attention for Social-Text Classification
论文作者
论文摘要
社交媒体已成为各种沟通形式的支点。对假新闻,谣言,讽刺等等社交文本进行分类,引起了人们的重大关注。社会文本本身表达的表面级信号可能不足以适合此类任务。因此,最近的方法试图合并其他内在信号,例如用户行为和基础图结构。通常,通过对社会文本的评论/答复表达的“公共智慧”作为众包观点的替代品,可能会为我们提供互补的信号。关于社会文本分类的最新方法倾向于忽略这种丰富的层次结构信号。在这里,我们提出了一种连字符,这是一种话语感知的双曲光谱共同注意网络。连字符是双曲线图表示学习与一种新颖的傅立叶共同注意机制的融合,试图通过结合公共话语来概括社会文本分类任务。我们将公共话语作为一种抽象含义表示(AMR)图形解析,并使用强大的双曲几何表示形式将其用于具有层次结构的模型图。最后,我们为其配备了一种新颖的傅立叶共同注意机制,以捕获源邮报和公共话语之间的相关性。对四个不同的社交文本分类任务进行了广泛的实验,即检测到假新闻,仇恨言论,谣言和讽刺,表明连字符概括,并在十个基准数据集中实现了最先进的结果。我们还采用了句子级的事实检查和注释数据集来评估Hyphen如何能够产生解释作为与最终预测的类似证据。
Social media has become the fulcrum of all forms of communication. Classifying social texts such as fake news, rumour, sarcasm, etc. has gained significant attention. The surface-level signals expressed by a social-text itself may not be adequate for such tasks; therefore, recent methods attempted to incorporate other intrinsic signals such as user behavior and the underlying graph structure. Oftentimes, the `public wisdom' expressed through the comments/replies to a social-text acts as a surrogate of crowd-sourced view and may provide us with complementary signals. State-of-the-art methods on social-text classification tend to ignore such a rich hierarchical signal. Here, we propose Hyphen, a discourse-aware hyperbolic spectral co-attention network. Hyphen is a fusion of hyperbolic graph representation learning with a novel Fourier co-attention mechanism in an attempt to generalise the social-text classification tasks by incorporating public discourse. We parse public discourse as an Abstract Meaning Representation (AMR) graph and use the powerful hyperbolic geometric representation to model graphs with hierarchical structure. Finally, we equip it with a novel Fourier co-attention mechanism to capture the correlation between the source post and public discourse. Extensive experiments on four different social-text classification tasks, namely detecting fake news, hate speech, rumour, and sarcasm, show that Hyphen generalises well, and achieves state-of-the-art results on ten benchmark datasets. We also employ a sentence-level fact-checked and annotated dataset to evaluate how Hyphen is capable of producing explanations as analogous evidence to the final prediction.