论文标题
GridShift:用于图像分割和对象跟踪的更快的模式寻求算法
GridShift: A Faster Mode-seeking Algorithm for Image Segmentation and Object Tracking
论文作者
论文摘要
在机器学习和计算机视觉中,平均移位(MS)符合用于聚类和图像分割的最流行模式寻求算法之一。它迭代地将每个数据转移到其邻居数据点的加权平均值。找到每个数据点的邻居所需的计算成本与数据点数量是二次的。因此,对于大型数据集而言,香草MS似乎非常慢。为了解决这个问题,我们提出了一种称为GridShift的模式寻求算法,并具有很大的加速,主要基于MS。为了加速,GridShift采用基于网格的邻居搜索方法,该方法在数据点数中是线性的。此外,GridShift将活跃的网格单元(与至少一个数据点相关的网格单元)代替数据点朝着较高的密度移动,这一步骤提供了更多的加速。 GridShift的运行时间在活动网格单元的数量中是线性的,并且特征数量为指数。因此,它是大规模低维应用的理想选择,例如对象跟踪和图像分割。通过广泛的实验,我们在基于MS的其他基于MS的算法和最先进的算法上展示了GridShift的出色性能,以用于图像分割的基准数据集上的准确性和运行时。最后,我们基于GridShift提供了一种新的对象跟踪算法,并显示出与凸轮矩和刻痕++相比的对象跟踪的有希望的结果。
In machine learning and computer vision, mean shift (MS) qualifies as one of the most popular mode-seeking algorithms used for clustering and image segmentation. It iteratively moves each data point to the weighted mean of its neighborhood data points. The computational cost required to find the neighbors of each data point is quadratic to the number of data points. Consequently, the vanilla MS appears to be very slow for large-scale datasets. To address this issue, we propose a mode-seeking algorithm called GridShift, with significant speedup and principally based on MS. To accelerate, GridShift employs a grid-based approach for neighbor search, which is linear in the number of data points. In addition, GridShift moves the active grid cells (grid cells associated with at least one data point) in place of data points towards the higher density, a step that provides more speedup. The runtime of GridShift is linear in the number of active grid cells and exponential in the number of features. Therefore, it is ideal for large-scale low-dimensional applications such as object tracking and image segmentation. Through extensive experiments, we showcase the superior performance of GridShift compared to other MS-based as well as state-of-the-art algorithms in terms of accuracy and runtime on benchmark datasets for image segmentation. Finally, we provide a new object-tracking algorithm based on GridShift and show promising results for object tracking compared to CamShift and meanshift++.