论文标题
有向图的高阶光谱聚类
Higher-Order Spectral Clustering of Directed Graphs
论文作者
论文摘要
聚类是算法中的重要主题,并且在机器学习,计算机视觉,统计和其他几个研究学科中都有许多应用。图形聚类的传统目标是查找电导率低的簇。这些目标不仅适用于无向图,而且还无法考虑群集之间的关系,这对于许多应用程序可能至关重要。为了克服这些弊端,我们研究了定向图(Digraphs),其簇彼此之间表现出进一步的“结构”信息。基于Digraphs的Hermitian矩阵表示形式,我们提出了一种几乎线性的时间算法,用于挖掘集群,并进一步表明我们提出的算法可以在合理假设下以sublerear的时间实现。我们的理论工作的重要性是通过对联合国跨贸易数据集的广泛实验结果证明的:我们算法的输出聚类不仅表现出群集(国家集合(集合)在进出口记录方面如何相互关系,而且这些杂物随着时间的推移如何随着时间的推移而在国际贸易中发展。
Clustering is an important topic in algorithms, and has a number of applications in machine learning, computer vision, statistics, and several other research disciplines. Traditional objectives of graph clustering are to find clusters with low conductance. Not only are these objectives just applicable for undirected graphs, they are also incapable to take the relationships between clusters into account, which could be crucial for many applications. To overcome these downsides, we study directed graphs (digraphs) whose clusters exhibit further "structural" information amongst each other. Based on the Hermitian matrix representation of digraphs, we present a nearly-linear time algorithm for digraph clustering, and further show that our proposed algorithm can be implemented in sublinear time under reasonable assumptions. The significance of our theoretical work is demonstrated by extensive experimental results on the UN Comtrade Dataset: the output clustering of our algorithm exhibits not only how the clusters (sets of countries) relate to each other with respect to their import and export records, but also how these clusters evolve over time, in accordance with known facts in international trade.