论文标题

无监督的可区分的多主观网络嵌入

Unsupervised Differentiable Multi-aspect Network Embedding

论文作者

Park, Chanyoung, Yang, Carl, Zhu, Qi, Kim, Donghyun, Yu, Hwanjo, Han, Jiawei

论文摘要

网络嵌入是一种有影响力的图挖掘技术,用于表示图形中的节点作为分布式向量。但是,大多数网络嵌入方法着重于学习每个节点的单个向量表示,最近批评该方法无法对节点的多个方面进行建模。为了捕获每个节点的多个方面,现有研究主要依赖于在实际嵌入之前执行的离线图聚类,这导致每个节点的群集成员资格(即节点方面分布)固定在整个嵌入模型的训练中。我们认为,这不仅使每个节点始终具有相同的方面分布,无论其动态上下文如何,而且都阻碍了模型的端到端训练,最终导致最终嵌入质量在很大程度上取决于聚类。在本文中,我们提出了一个用于多光值网络嵌入的新型端到端框架,称为ASP2VEC,其中每个节点的各个方面都是根据其本地上下文动态分配的。更确切地说,在多个方面中,我们根据当前上下文动态为每个节点分配一个方面,而我们的方面选择模块是通过Gumbel-Softmax技巧端到端区分的。我们还介绍了正则化框架,以在相关性和多样性方面捕获多个方面之间的相互作用。我们进一步证明,我们提出的框架可以很容易地扩展到异质网络。针对各种类型的均质网络和异质网络的各种下游任务的广泛实验证明了ASP2VEC的优势。

Network embedding is an influential graph mining technique for representing nodes in a graph as distributed vectors. However, the majority of network embedding methods focus on learning a single vector representation for each node, which has been recently criticized for not being capable of modeling multiple aspects of a node. To capture the multiple aspects of each node, existing studies mainly rely on offline graph clustering performed prior to the actual embedding, which results in the cluster membership of each node (i.e., node aspect distribution) fixed throughout training of the embedding model. We argue that this not only makes each node always have the same aspect distribution regardless of its dynamic context, but also hinders the end-to-end training of the model that eventually leads to the final embedding quality largely dependent on the clustering. In this paper, we propose a novel end-to-end framework for multi-aspect network embedding, called asp2vec, in which the aspects of each node are dynamically assigned based on its local context. More precisely, among multiple aspects, we dynamically assign a single aspect to each node based on its current context, and our aspect selection module is end-to-end differentiable via the Gumbel-Softmax trick. We also introduce the aspect regularization framework to capture the interactions among the multiple aspects in terms of relatedness and diversity. We further demonstrate that our proposed framework can be readily extended to heterogeneous networks. Extensive experiments towards various downstream tasks on various types of homogeneous networks and a heterogeneous network demonstrate the superiority of asp2vec.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源