论文标题
单通道对比度学习可以适用于同质和异性图
Single-Pass Contrastive Learning Can Work for Both Homophilic and Heterophilic Graph
论文作者
论文摘要
现有的图形对比学习(GCL)技术通常需要两个向前传球才能构建对比度损失,这对于捕获节点特征的低频信号有效。这种双通行设计在同粒细胞图上显示出经验成功,但其在直接连接的节点通常具有不同标签的异质图上的有效性尚不清楚。此外,现有的GCL方法无法提供强大的性能保证。再加上GCL方法在异性图上的不可预测性,它们在现实世界中的适用性是有限的。然后,出现一个自然的问题:我们可以设计一种适用于具有性能保证的同粒细胞和异性图的GCL方法吗?为了回答这个问题,我们从理论上研究了通过邻里聚集在同粒细胞和异性图上获得的特征的浓度特性,从该特性介绍了基于该特性的单通行的无扩增图对比度学习损失,并为下游任务损失的最小化器提供绩效保证。作为我们的分析的直接结果,我们实施了单通道图对比度学习方法(SP-GCL)。从经验上讲,在14个基准数据集具有不同程度的同质性的基准数据集上,SP-GCL所学的功能可以匹配或胜过现有的强基础,并且计算机开销明显较小,这证明了我们在现实情况下发现的有用性。
Existing graph contrastive learning (GCL) techniques typically require two forward passes for a single instance to construct the contrastive loss, which is effective for capturing the low-frequency signals of node features. Such a dual-pass design has shown empirical success on homophilic graphs, but its effectiveness on heterophilic graphs, where directly connected nodes typically have different labels, is unknown. In addition, existing GCL approaches fail to provide strong performance guarantees. Coupled with the unpredictability of GCL approaches on heterophilic graphs, their applicability in real-world contexts is limited. Then, a natural question arises: Can we design a GCL method that works for both homophilic and heterophilic graphs with a performance guarantee? To answer this question, we theoretically study the concentration property of features obtained by neighborhood aggregation on homophilic and heterophilic graphs, introduce the single-pass augmentation-free graph contrastive learning loss based on the property, and provide performance guarantees for the minimizer of the loss on downstream tasks. As a direct consequence of our analysis, we implement the Single-Pass Graph Contrastive Learning method (SP-GCL). Empirically, on 14 benchmark datasets with varying degrees of homophily, the features learned by the SP-GCL can match or outperform existing strong baselines with significantly less computational overhead, which demonstrates the usefulness of our findings in real-world cases.