We study the computational problem of computing a fair means clustering of discrete vectors, which admits an equivalent formulation as editing a colored matrix into one with few distinct color-balanced rows by changing at most $k$ values. While NP-hard in both the fairness-oblivious and the fair settings, the problem is well-known to admit a fixed-parameter algorithm in the former ``vanilla'' setting. As our first contribution, we exclude an analogous algorithm even for highly restricted fair means clustering instances. We then proceed to obtain a full complexity landscape of the problem, and establish tractability results which capture three means of circumventing our obtained lower bound: placing additional constraints on the problem instances, fixed-parameter approximation, or using an alternative parameterization targeting tree-like matrices.
翻译:我们研究了计算离散向量公平均值聚类的计算问题,该问题可等价表述为通过改变至多$k$个值,将彩色矩阵编辑为具有少量不同颜色平衡行的矩阵。尽管在忽略公平性和公平性设置下该问题均为NP难问题,但众所周知在前者“经典”设置中该问题存在固定参数算法。作为我们的首个贡献,我们排除了即使在高度受限的公平均值聚类实例中也存在类似算法的可能性。随后,我们全面刻画了该问题的复杂性图景,并建立了可处理性结果,这些结果捕捉了规避我们所得下界的三种途径:对问题实例施加额外约束、固定参数近似、或采用针对树状矩阵的替代参数化方法。