论文标题
多层网络中密集子图发现的随机解决方案
Stochastic Solutions for Dense Subgraph Discovery in Multilayer Networks
论文作者
论文摘要
网络分析在知识发现和数据挖掘中起着关键作用。在近年来,在许多现实世界中,我们对挖掘多层网络感兴趣,在该网络中,我们有许多称为图层的边缘集,它们在同一顶点上编码不同类型的连接和/或时间相关的连接。在许多网络分析技术中,旨在在网络中找到密集组件的密集子图发现是一种必不可少的原始性,在不同域中具有多种应用。在本文中,我们引入了一个新颖的优化模型,用于多层网络中的密集子图发现。我们的模型旨在找到一个随机解决方案,即,在顶点子集的家族的概率分布,而不是单个顶点子集,而它也可以用于获得单个顶点子集。对于我们的模型,我们设计了一种基于LP的多项式精确算法。此外,为了处理大规模网络,我们还设计了一种简单,可扩展的预处理算法,该算法通常会大大降低输入网络的大小,并导致大量加速。计算实验证明了我们的模型的有效性和算法的有效性。
Network analysis has played a key role in knowledge discovery and data mining. In many real-world applications in recent years, we are interested in mining multilayer networks, where we have a number of edge sets called layers, which encode different types of connections and/or time-dependent connections over the same set of vertices. Among many network analysis techniques, dense subgraph discovery, aiming to find a dense component in a network, is an essential primitive with a variety of applications in diverse domains. In this paper, we introduce a novel optimization model for dense subgraph discovery in multilayer networks. Our model aims to find a stochastic solution, i.e., a probability distribution over the family of vertex subsets, rather than a single vertex subset, whereas it can also be used for obtaining a single vertex subset. For our model, we design an LP-based polynomial-time exact algorithm. Moreover, to handle large-scale networks, we also devise a simple, scalable preprocessing algorithm, which often reduces the size of the input networks significantly and results in a substantial speed-up. Computational experiments demonstrate the validity of our model and the effectiveness of our algorithms.