通过数据管理镜头:对公平分类的实验分析和评价 (Through the Data Management Lens: Experimental Analysis and Evaluation of Fair Classification)

Classification, a heavily-studied data-driven machine learning task, drives an increasing number of prediction systems involving critical human decisions such as loan approval and criminal risk assessment. However, classifiers often demonstrate discriminatory behavior, especially when presented with biased data. Consequently, fairness in classification has emerged as a high-priority research area. Data management research is showing an increasing presence and interest in topics related to data and algorithmic fairness, including the topic of fair classification. The interdisciplinary efforts in fair classification, with machine learning research having the largest presence, have resulted in a large number of fairness notions and a wide range of approaches that have not been systematically evaluated and compared. In this paper, we contribute a broad analysis of 13 fair classification approaches and additional variants, over their correctness, fairness, efficiency, scalability, and stability, using a variety of metrics and real-world datasets. Our analysis highlights novel insights on the impact of different metrics and high-level approach characteristics on different aspects of performance. We also discuss general principles for choosing approaches suitable for different practical settings, and identify areas where data-management-centric solutions are likely to have the most impact.

翻译：分类是一项大量研究数据驱动的机器学习任务,它促使越来越多的预测系统涉及关键的人类决定,例如贷款批准和犯罪风险评估,然而,分类者往往表现出歧视性的行为,特别是在提出偏差数据时。因此,分类的公平性已成为一个高度优先的研究领域。数据管理研究显示,对数据和算法公正相关专题,包括公平分类专题,越来越有存在和兴趣。公平分类的跨学科努力,以机器学习研究为主,产生了大量公平概念和广泛的办法,而这些办法尚未系统地评估和比较。在本文件中,我们广泛分析了13种公平的分类办法和其他变式,其正确性、公平性、效率、可缩放性和稳定性,使用了各种计量数据和真实世界数据集。我们的分析突出了关于不同计量和高层次方法特点对业绩不同方面的影响的新见解。我们还讨论了选择适合不同实际环境的方法的一般原则,并确定了数据管理解决办法可能产生最大影响的领域。