算法和系统共同设计，用于有效的基于子图的图表学习

论文标题

算法和系统共同设计，用于有效的基于子图的图表学习

Algorithm and System Co-design for Efficient Subgraph-based Graph Representation Learning

论文作者

Yin, Haoteng, Zhang, Muhan, Wang, Yanbang, Wang, Jianguo, Li, Pan

论文摘要

最近提出了基于子图的图表学习（SGRL）来应对规范图神经网络（GNN）遇到的一些基本挑战，并在许多重要的数据科学应用中都证明了优势，例如链接，关系和主题预测。但是，当前的SGRL方法遇到了可伸缩性问题，因为它们需要为每个培训或测试查询提取子图。扩大规范GNN的最新解决方案可能不适用于SGRL。在这里，我们通过共同设计学习算法及其系统支持，为可扩展的SGRL提出了一种新颖的框架Surel。 Surel采用基于步行的子图表分解，并将步行重新形成子图，从而大大降低了子图提取的冗余，并支持并行计算。具有数百万个节点和边缘的六个同质，异质和高阶图的实验证明了Surel的有效性和可扩展性。特别是，与SGRL基线相比，Surel可以达到10 $ \ times $加速，并具有可比甚至更好的预测性能；与规范GNN相比，Surel可实现50％的预测准确性。

Subgraph-based graph representation learning (SGRL) has been recently proposed to deal with some fundamental challenges encountered by canonical graph neural networks (GNNs), and has demonstrated advantages in many important data science applications such as link, relation and motif prediction. However, current SGRL approaches suffer from scalability issues since they require extracting subgraphs for each training or test query. Recent solutions that scale up canonical GNNs may not apply to SGRL. Here, we propose a novel framework SUREL for scalable SGRL by co-designing the learning algorithm and its system support. SUREL adopts walk-based decomposition of subgraphs and reuses the walks to form subgraphs, which substantially reduces the redundancy of subgraph extraction and supports parallel computation. Experiments over six homogeneous, heterogeneous and higher-order graphs with millions of nodes and edges demonstrate the effectiveness and scalability of SUREL. In particular, compared to SGRL baselines, SUREL achieves 10$\times$ speed-up with comparable or even better prediction performance; while compared to canonical GNNs, SUREL achieves 50% prediction accuracy improvement.

下载PDF全文

下载文献需遵守相关版权规定

论文标题