论文标题

将符号领域知识纳入图神经网络

Incorporating Symbolic Domain Knowledge into Graph Neural Networks

论文作者

Dash, Tirtharaj, Srinivasan, Ashwin, Vig, Lovekesh

论文摘要

我们的兴趣是具有以下特征的科学问题:(1)数据自然表示为图; (2)可用的数据量通常很少; (3)有重要的领域知识,通常以某种符号形式表达。通过归纳逻辑编程(ILP)有效地解决了这类问题,这是有2个重要特征的:(a)使用一种表示语言,可以轻松捕获图形结构数据中编码的关系,以及(b)包含先前信息的域特异性关系,可减轻与范围稀缺性关系的问题,并构建了数据稀缺性关系,并构建了新的关系。最近的进步已经看到了专门为图形结构数据(基于图的神经网络或GNN)开发的深神经网络的出现。尽管已证明GNN能够处理图形结构化数据,但减少了研究域知识的包含。在这里,我们通过采用操作我们称为“顶点增强”,并将相应的gnns表示为“ vegnns”,从经验上研究了GNNS的这一方面。使用70多个现实世界数据集和大量符号域知识,我们检查了5种不同的GNN变体跨顶点增强的结果。我们的结果为以下方面提供了支持:(a)通过顶点添加来包含域知识,可以显着提高GNN的性能。也就是说,在所有GNN变体中,性能vegnns明显好于GNN。 (b)使用ILP构建的域特异性关系的纳入所有GNN变体中的VEGNN的性能。综上所述,结果提供了证据表明,可以将符号领域知识纳入GNN,并且ILP可以在提供GNN不容易发现的高级关系中发挥重要作用。

Our interest is in scientific problems with the following characteristics: (1) Data are naturally represented as graphs; (2) The amount of data available is typically small; and (3) There is significant domain-knowledge, usually expressed in some symbolic form. These kinds of problems have been addressed effectively in the past by Inductive Logic Programming (ILP), by virtue of 2 important characteristics: (a) The use of a representation language that easily captures the relation encoded in graph-structured data, and (b) The inclusion of prior information encoded as domain-specific relations, that can alleviate problems of data scarcity, and construct new relations. Recent advances have seen the emergence of deep neural networks specifically developed for graph-structured data (Graph-based Neural Networks, or GNNs). While GNNs have been shown to be able to handle graph-structured data, less has been done to investigate the inclusion of domain-knowledge. Here we investigate this aspect of GNNs empirically by employing an operation we term "vertex-enrichment" and denote the corresponding GNNs as "VEGNNs". Using over 70 real-world datasets and substantial amounts of symbolic domain-knowledge, we examine the result of vertex-enrichment across 5 different variants of GNNs. Our results provide support for the following: (a) Inclusion of domain-knowledge by vertex-enrichment can significantly improve the performance of a GNN. That is, the performance VEGNNs is significantly better than GNNs across all GNN variants; (b) The inclusion of domain-specific relations constructed using ILP improves the performance of VEGNNs, across all GNN variants. Taken together, the results provide evidence that it is possible to incorporate symbolic domain knowledge into a GNN, and that ILP can play an important role in providing high-level relationships that are not easily discovered by a GNN.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源