论文标题
大型网络中的分布式社区检测
Distributed Community Detection in Large Networks
论文作者
论文摘要
大型网络的社区发现构成了高度计算成本以及异质社区结构的挑战。 在本文中,我们考虑了与``分组社区''(或``组结构'')的广泛存在的现实世界网络,其中分组社区内的节点密切相关,并且整个社区之间的节点相对宽松地连接。 我们为此类网络提出了一种两步的社区检测方法。 首先,我们利用模块化优化方法将网络划分为组间连接较低的组。 其次,我们采用随机块模型(SBM)或经度校正的SBM(DCSBM)来将各组进一步划分为社区,从而使社区间连接的水平有所不同。 通过结合这种两步结构,我们引入了一种新颖的分裂和诱导算法,该算法渐近地恢复了组的结构和社区结构。 数值研究证实,我们的方法在实现竞争性能的同时大大降低了计算成本。 该框架为检测具有分组社区的网络中的社区结构提供了全面的解决方案,为各种应用程序提供了有价值的工具。
Community detection for large networks poses challenges due to the high computational cost as well as heterogeneous community structures. In this paper, we consider widely existing real-world networks with ``grouped communities'' (or ``the group structure''), where nodes within grouped communities are densely connected and nodes across grouped communities are relatively loosely connected. We propose a two-step community detection approach for such networks. Firstly, we leverage modularity optimization methods to partition the network into groups, where between-group connectivity is low. Secondly, we employ the stochastic block model (SBM) or degree-corrected SBM (DCSBM) to further partition the groups into communities, allowing for varying levels of between-community connectivity. By incorporating this two-step structure, we introduce a novel divide-and-conquer algorithm that asymptotically recovers both the group structure and the community structure. Numerical studies confirm that our approach significantly reduces computational costs while achieving competitive performance. This framework provides a comprehensive solution for detecting community structures in networks with grouped communities, offering a valuable tool for various applications.