Conformal prediction is a popular method to construct prediction intervals for black-box machine learning models with marginal coverage guarantees. In applications with potentially high-impact events, such as flooding or financial crises, regulators often require very high confidence for such intervals. However, if the desired level of confidence is too large relative to the amount of data used for calibration, then classical conformal methods provide infinitely wide, thus, uninformative prediction intervals. In this paper, we propose a new method to overcome this limitation. We bridge extreme value statistics and conformal prediction to provide reliable and informative prediction intervals with high-confidence coverage, which can be constructed using any black-box extreme quantile regression method. A weighted version of our approach can account for nonstationary data. The advantages of our extreme conformal prediction method are illustrated in a simulation study and in an application to flood risk forecasting.
翻译:共形预测是一种为黑盒机器学习模型构建具有边际覆盖保证的预测区间的常用方法。在涉及潜在高影响事件(如洪水或金融危机)的应用中,监管机构通常要求此类区间具备极高的置信度。然而,若所需置信水平相对于用于校准的数据量过大,则经典共形方法会提供无限宽度的预测区间,从而失去信息价值。本文提出一种新方法以克服此限制。我们结合极值统计与共形预测,通过任意黑盒极端分位数回归方法,构建具有高置信覆盖的可靠且信息丰富的预测区间。该方法的加权版本可处理非平稳数据。通过仿真研究及洪水风险预测应用,展示了极端共形预测方法的优势。