节点复制：有效图采样的随机图模型

论文标题

节点复制：有效图采样的随机图模型

Node Copying: A Random Graph Model for Effective Graph Sampling

论文作者

Regol, Florence, Pal, Soumyasundar, Sun, Jianing, Zhang, Yingxue, Geng, Yanhui, Coates, Mark

论文摘要

基于观察到的图，对在关系结构数据上应用机器学习技术的兴趣增加了。通常，该图并不能完全代表节点之间的真实关系。在这些设置中，构建以观测图为条件的生成模型可以考虑图形不确定性。各种现有技术要么依赖于限制性假设，无法在样本中保留拓扑特性，要么在较大的图表中昂贵。在这项工作中，我们介绍了用于在图形上构建分布的节点复制模型。随机图的采样是通过通过随机采样类似节点的邻居替换每个节点的邻居来进行的。采样图保留图形结构的关键特征，而无需明确定位它们。此外，该模型的采样非常简单，并且与节点线性缩放。我们在三个任务中显示了复制模型的有用性。首先，在节点分类中，基于节点复制的贝叶斯公式在稀疏数据设置中实现了更高的精度。其次，我们采用建议的模型来减轻对抗攻击对图形拓扑的影响。最后，在推荐系统设置中纳入模型可以改善对最新方法的回忆。

There has been an increased interest in applying machine learning techniques on relational structured-data based on an observed graph. Often, this graph is not fully representative of the true relationship amongst nodes. In these settings, building a generative model conditioned on the observed graph allows to take the graph uncertainty into account. Various existing techniques either rely on restrictive assumptions, fail to preserve topological properties within the samples or are prohibitively expensive for larger graphs. In this work, we introduce the node copying model for constructing a distribution over graphs. Sampling of a random graph is carried out by replacing each node's neighbors by those of a randomly sampled similar node. The sampled graphs preserve key characteristics of the graph structure without explicitly targeting them. Additionally, sampling from this model is extremely simple and scales linearly with the nodes. We show the usefulness of the copying model in three tasks. First, in node classification, a Bayesian formulation based on node copying achieves higher accuracy in sparse data settings. Second, we employ our proposed model to mitigate the effect of adversarial attacks on the graph topology. Last, incorporation of the model in a recommendation system setting improves recall over state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题