将符号领域知识纳入图神经网络

论文标题

将符号领域知识纳入图神经网络

Incorporating Symbolic Domain Knowledge into Graph Neural Networks

论文作者

Dash, Tirtharaj, Srinivasan, Ashwin, Vig, Lovekesh

论文摘要

我们的兴趣是具有以下特征的科学问题：（1）数据自然表示为图；（2）可用的数据量通常很少；（3）有重要的领域知识，通常以某种符号形式表达。通过归纳逻辑编程（ILP）有效地解决了这类问题，这是有2个重要特征的：（a）使用一种表示语言，可以轻松捕获图形结构数据中编码的关系，以及（b）包含先前信息的域特异性关系，可减轻与范围稀缺性关系的问题，并构建了数据稀缺性关系，并构建了新的关系。最近的进步已经看到了专门为图形结构数据（基于图的神经网络或GNN）开发的深神经网络的出现。尽管已证明GNN能够处理图形结构化数据，但减少了研究域知识的包含。在这里，我们通过采用操作我们称为“顶点增强”，并将相应的gnns表示为“ vegnns”，从经验上研究了GNNS的这一方面。使用70多个现实世界数据集和大量符号域知识，我们检查了5种不同的GNN变体跨顶点增强的结果。我们的结果为以下方面提供了支持：（a）通过顶点添加来包含域知识，可以显着提高GNN的性能。也就是说，在所有GNN变体中，性能vegnns明显好于GNN。（b）使用ILP构建的域特异性关系的纳入所有GNN变体中的VEGNN的性能。综上所述，结果提供了证据表明，可以将符号领域知识纳入GNN，并且ILP可以在提供GNN不容易发现的高级关系中发挥重要作用。

Our interest is in scientific problems with the following characteristics: (1) Data are naturally represented as graphs; (2) The amount of data available is typically small; and (3) There is significant domain-knowledge, usually expressed in some symbolic form. These kinds of problems have been addressed effectively in the past by Inductive Logic Programming (ILP), by virtue of 2 important characteristics: (a) The use of a representation language that easily captures the relation encoded in graph-structured data, and (b) The inclusion of prior information encoded as domain-specific relations, that can alleviate problems of data scarcity, and construct new relations. Recent advances have seen the emergence of deep neural networks specifically developed for graph-structured data (Graph-based Neural Networks, or GNNs). While GNNs have been shown to be able to handle graph-structured data, less has been done to investigate the inclusion of domain-knowledge. Here we investigate this aspect of GNNs empirically by employing an operation we term "vertex-enrichment" and denote the corresponding GNNs as "VEGNNs". Using over 70 real-world datasets and substantial amounts of symbolic domain-knowledge, we examine the result of vertex-enrichment across 5 different variants of GNNs. Our results provide support for the following: (a) Inclusion of domain-knowledge by vertex-enrichment can significantly improve the performance of a GNN. That is, the performance VEGNNs is significantly better than GNNs across all GNN variants; (b) The inclusion of domain-specific relations constructed using ILP improves the performance of VEGNNs, across all GNN variants. Taken together, the results provide evidence that it is possible to incorporate symbolic domain knowledge into a GNN, and that ILP can play an important role in providing high-level relationships that are not easily discovered by a GNN.

下载PDF全文

下载文献需遵守相关版权规定

论文标题