论文标题

从动态图中进行定向的无环结构学习

Directed Acyclic Graph Structure Learning from Dynamic Graphs

论文作者

Fan, Shaohua, Zhang, Shuyang, Wang, Xiao, Shi, Chuan

论文摘要

估计特征(变量)的定向无环图(DAG)的结构在揭示潜在数据生成过程并提供各种应用中的因果见解方面起着至关重要的作用。尽管有许多关于结构学习的研究,但尚未探索动态图上的结构学习,因此我们研究了这种无处不在的动态图数据的节点特征生成机制的学习问题。在动态图中,我们建议同时估计节点特征之间的同时关系和时置的相互作用关系。这两种关系形成了DAG,可以有效地以简洁的方式表征特征生成过程。为了学习这样的DAG,我们将学习问题作为基于连续得分的优化问题,该问题由可区分的分数函数组成,以衡量学到的DAG的有效性和平滑的无符合性约束,以确保学习dag的无效性。这两个组件被转化为一个不可约束的增强拉格朗日目标,可以通过成熟的连续优化技术最小化。所得的算法(名为cheaphnotears)在现实世界应用程序中可能遇到的广泛设置上的模拟数据上的表现优于基线。我们还将提出的方法应用于从现实世界Yelp数据集构建的两个动态图上,证明我们的方法可以学习节点特征之间的连接,这与域知识相符。

Estimating the structure of directed acyclic graphs (DAGs) of features (variables) plays a vital role in revealing the latent data generation process and providing causal insights in various applications. Although there have been many studies on structure learning with various types of data, the structure learning on the dynamic graph has not been explored yet, and thus we study the learning problem of node feature generation mechanism on such ubiquitous dynamic graph data. In a dynamic graph, we propose to simultaneously estimate contemporaneous relationships and time-lagged interaction relationships between the node features. These two kinds of relationships form a DAG, which could effectively characterize the feature generation process in a concise way. To learn such a DAG, we cast the learning problem as a continuous score-based optimization problem, which consists of a differentiable score function to measure the validity of the learned DAGs and a smooth acyclicity constraint to ensure the acyclicity of the learned DAGs. These two components are translated into an unconstraint augmented Lagrangian objective which could be minimized by mature continuous optimization techniques. The resulting algorithm, named GraphNOTEARS, outperforms baselines on simulated data across a wide range of settings that may encounter in real-world applications. We also apply the proposed approach on two dynamic graphs constructed from the real-world Yelp dataset, demonstrating our method could learn the connections between node features, which conforms with the domain knowledge.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源