论文标题
有监督的超图重建
Supervised Hypergraph Reconstruction
论文作者
论文摘要
我们研究了图形数据分析通常看到的一个问题:许多涉及高阶相互作用的现实世界复杂系统最好由超图表编码;但是,他们的数据集通常最终仅以预测的形式出版或研究(具有二元边缘)。要了解这个问题,我们首先建立一个理论框架来表征此问题的含义和最坏情况。该分析激发了我们对新任务的制定,受监督的超图重建:在应用程序域的一些现有知识的帮助下,从其投影图中重建现实世界中的超图。 为了重建HyperGraph数据,我们首先分析投影中的HyperEDGE分布,基于该数据,我们创建一个包含两个模块的框架:(1)处理潜在超越的巨大搜索空间,我们设计了一种具有有效性的采样策略,可确保有效地将空间缩小到较小的候选人集中; (2)为了识别候选人的超核,我们进一步设计了一个良好工作变体中的超边缘分类器,这些变体捕获了投影中的结构特征。广泛的实验验证了我们的主张,方法和扩展。值得注意的是,我们的方法在硬数据集上的准确性均优于所有基准。我们的代码和数据可以从bit.ly/shyre下载。
We study an issue commonly seen with graph data analysis: many real-world complex systems involving high-order interactions are best encoded by hypergraphs; however, their datasets often end up being published or studied only in the form of their projections (with dyadic edges). To understand this issue, we first establish a theoretical framework to characterize this issue's implications and worst-case scenarios. The analysis motivates our formulation of the new task, supervised hypergraph reconstruction: reconstructing a real-world hypergraph from its projected graph, with the help of some existing knowledge of the application domain. To reconstruct hypergraph data, we start by analyzing hyperedge distributions in the projection, based on which we create a framework containing two modules: (1) to handle the enormous search space of potential hyperedges, we design a sampling strategy with efficacy guarantees that significantly narrows the space to a smaller set of candidates; (2) to identify hyperedges from the candidates, we further design a hyperedge classifier in two well-working variants that capture structural features in the projection. Extensive experiments validate our claims, approach, and extensions. Remarkably, our approach outperforms all baselines by an order of magnitude in accuracy on hard datasets. Our code and data can be downloaded from bit.ly/SHyRe.