Graph-based clustering methods like spectral clustering and SpectralNet are very efficient in detecting clusters of non-convex shapes. Unlike the popular $k$-means, graph-based clustering methods do not assume that each cluster has a single mean. However, these methods need a graph where vertices in the same cluster are connected by edges of large weights. To achieve this goal, many studies have proposed graph reduction methods with parameters. Unfortunately, these parameters have to be tuned for every dataset. We introduce a graph reduction method that does not require any parameters. First, the distances from every point $p$ to its neighbors are filtered using an adaptive threshold to only keep neighbors with similar surrounding density. Second, the similarities with close neighbors are computed and only high similarities are kept. The edges that survive these two filtering steps form the constructed graph that was passed to spectral clustering and SpectralNet. The experiments showed that our method provides a stable alternative, where other methods performance fluctuated according to the setting of their parameters.
翻译:光谱群集和光谱网等基于图形的集群方法在探测非convex形状群集方面非常有效。 与流行的 $k$ 值不同, 图形群集方法并不假定每个群集有一个单一的平均值。 但是, 这些方法需要一张图表, 在同一组群中, 脊椎通过大重量的边缘连接在一起。 为了实现这一目标, 许多研究都提出了带有参数的图形减少方法。 不幸的是, 这些参数必须针对每个数据集调整。 我们引入了一种不需要任何参数的图形减少方法。 首先, 每点美元到其邻居的距离都用一个适应性阈值过滤, 以便只保持周围密度相似的邻居。 其次, 计算与近邻的相似点, 并且只保持高度的相似点 。 在这两个过滤步骤中幸存的边缘, 被传递到光谱集和光谱网的图形。 实验显示, 我们的方法提供了一种稳定的替代方法, 在那里, 其他方法的性能随参数的设置而波动。</s>