We make three contributions to conformal prediction. First, we propose fuzzy conformal prediction sets that offer a degree of exclusion, generalizing beyond the binary inclusion/exclusion offered by classical prediction sets. We connect fuzzy prediction sets to e-values to show this degree of exclusion is equivalent to an exclusion at different confidence levels, capturing precisely what e-values bring to conformal prediction. We show that a fuzzy prediction set is a predictive distribution with an arguably more appropriate error guarantee. Second, we derive optimal conformal prediction sets by interpreting the minimization of the expected measure of a prediction set as an optimal testing problem against a particular alternative. We use this to characterize exactly in what sense traditional conformal prediction is optimal, and show how this may generally be used to construct optimal (fuzzy) prediction sets. Third, we generalize the inheritance of guarantees by subsequent minimax decisions from prediction sets to fuzzy prediction sets. All results generalize beyond the conformal setting to prediction sets for arbitrary models. In particular, we find that constructing a (fuzzy) prediction set for a model is equivalent to constructing a test (e-value) for that model as a hypothesis.
翻译:本文对共形预测领域做出三项贡献。首先,我们提出模糊共形预测集,其提供了一种排除度量的概念,超越了传统预测集仅提供二元包含/排除的局限。通过将模糊预测集与e值建立联系,我们证明这种排除度量等价于在不同置信水平下的排除操作,精确刻画了e值为共形预测带来的优势。我们表明模糊预测集是一种预测分布,其具备更为合理的误差保证。其次,我们通过将预测集期望测度最小化问题解释为针对特定备择假设的最优检验问题,推导出最优共形预测集的构造方法。基于此,我们精确刻画了传统共形预测在何种意义上具有最优性,并展示如何将此框架推广至构建最优(模糊)预测集。第三,我们将后续极小极大决策的保证继承性从传统预测集推广至模糊预测集。所有结果均可超越共形预测框架,适用于任意模型的预测集构造。特别地,我们发现为模型构建(模糊)预测集等价于将该模型作为假设构造检验统计量(e值)。