Multi-view clustering is an important research topic due to its capability to utilize complementary information from multiple views. However, there are few methods to consider the negative impact caused by certain views with unclear clustering structures, resulting in poor multi-view clustering performance. To address this drawback, we propose self-supervised discriminative feature learning for deep multi-view clustering (SDMVC). Concretely, deep autoencoders are applied to learn embedded features for each view independently. To leverage the multi-view complementary information, we concatenate all views' embedded features to form the global features, which can overcome the negative impact of some views' unclear clustering structures. In a self-supervised manner, pseudo-labels are obtained to build a unified target distribution to perform multi-view discriminative feature learning. During this process, global discriminative information can be mined to supervise all views to learn more discriminative features, which in turn are used to update the target distribution. Besides, this unified target distribution can make SDMVC learn consistent cluster assignments, which accomplishes the clustering consistency of multiple views while preserving their features' diversity. Experiments on various types of multi-view datasets show that SDMVC outperforms 14 competitors including classic and state-of-the-art methods. The code is available at https://github.com/SubmissionsIn/SDMVC.
翻译:多视图聚类因其能够利用来自多个视图的互补信息而成为一个重要的研究课题。然而,现有方法很少考虑某些视图因聚类结构不清晰而带来的负面影响,这导致了多视图聚类性能不佳。为克服这一缺陷,我们提出了面向深度多视图聚类的自监督判别性特征学习方法(SDMVC)。具体而言,我们采用深度自编码器独立学习每个视图的嵌入特征。为利用多视图的互补信息,我们将所有视图的嵌入特征拼接形成全局特征,这能够克服部分视图聚类结构不清晰带来的负面影响。通过自监督方式,我们获取伪标签以构建统一的目标分布,进而执行多视图判别性特征学习。在此过程中,可以挖掘全局判别性信息以监督所有视图学习更具判别性的特征,而这些特征又用于更新目标分布。此外,该统一目标分布可使SDMVC学习到一致的聚类分配,从而在保持各视图特征多样性的同时实现多视图的聚类一致性。在多种类型的多视图数据集上的实验表明,SDMVC优于包括经典方法和前沿方法在内的14种竞争方法。代码可在 https://github.com/SubmissionsIn/SDMVC 获取。