论文标题
结构化的图形学习用于聚类和半监督分类
Structured Graph Learning for Clustering and Semi-supervised Classification
论文作者
论文摘要
在过去的十年中,图表在建模结构和相互作用中变得越来越流行。基于图的聚类和半监督分类技术表现出令人印象深刻的性能。本文提出了一个图形学习框架,以保留数据的本地和全局结构。具体而言,我们的方法使用样本的自我表达性来捕获全局结构和自适应邻居方法来尊重局部结构。此外,大多数现有基于图的方法在从原始数据矩阵中学到的图形上进行群集和半监督分类,该矩阵没有明确的群集结构,因此它们可能无法实现最佳性能。通过考虑等级约束,如果有$ C $簇或类,则已实现的图将具有$ C $连接的组件。作为此的副产品,图形学习和标签推理是共同的,并以原则性的方式迭代实施。从理论上讲,我们表明我们的模型等同于在某些条件下的内核K-均值和K-均值方法的组合。关于聚类和半监督分类的广泛实验表明,所提出的方法的表现优于其他最新方法。
Graphs have become increasingly popular in modeling structures and interactions in a wide variety of problems during the last decade. Graph-based clustering and semi-supervised classification techniques have shown impressive performance. This paper proposes a graph learning framework to preserve both the local and global structure of data. Specifically, our method uses the self-expressiveness of samples to capture the global structure and adaptive neighbor approach to respect the local structure. Furthermore, most existing graph-based methods conduct clustering and semi-supervised classification on the graph learned from the original data matrix, which doesn't have explicit cluster structure, thus they might not achieve the optimal performance. By considering rank constraint, the achieved graph will have exactly $c$ connected components if there are $c$ clusters or classes. As a byproduct of this, graph learning and label inference are jointly and iteratively implemented in a principled way. Theoretically, we show that our model is equivalent to a combination of kernel k-means and k-means methods under certain condition. Extensive experiments on clustering and semi-supervised classification demonstrate that the proposed method outperforms other state-of-the-art methods.