Economically responsible mitigation of multivariate extreme risks -- extreme rainfall in a large area, huge variations of many stock prices, widespread breakdowns in transportation systems -- requires estimates of the probabilities that such risks will materialize in the future. This paper develops a new method, Wasserstein--Aitchison Generative Adversarial Networks (WA-GAN) to, which provides simulated values of $d$-dimensional multivariate extreme events and which can hence be used to give estimates of such probabilities. The main hypothesis is that, after transforming the observations to the unit-Pareto scale, their distribution is regularly varying in the sense that the distributions of their radial and angular components (with respect to the $L_1$-norm) converge and become asymptotically independent as the radius gets large. The method is a combination of standard extreme value analysis modeling of the tails of the marginal distributions with nonparametric GAN modeling of the angular distribution. For the latter, the angular values are transformed to Aitchison coordinates in a full $(d-1)$-dimensional linear space, and a Wasserstein GAN is trained on these coordinates and used to generate new values. A reverse transformation is then applied to these values and gives simulated values on the original data scale. Our method is applied to simulated data and to a financial data set from the Kenneth French Data Library. The method shows good performance compared to other existing methods in the literature, both in terms of capturing the dependence structure of the extremes in the data and in generating accurate new extremes.
翻译:对多元极端风险(如大范围极端降雨、多股票价格剧烈波动、交通系统大规模瘫痪)进行经济合理的缓解,需要预估此类风险未来发生的概率。本文提出一种新方法——Wasserstein-Aitchison生成对抗网络(WA-GAN),该方法可生成d维多元极端事件的模拟值,从而用于估计此类概率。核心假设是:将观测值转换至单位帕累托尺度后,其分布具有正则变差特性,即其径向分量与角分量(基于L1范数)的分布在半径趋大时收敛且渐近独立。该方法结合了边缘分布尾部的标准极值分析建模与角分布的非参数GAN建模。对于后者,角值被转换为完整(d-1)维线性空间中的Aitchison坐标,随后在此坐标上训练Wasserstein GAN并用于生成新值。通过对这些值进行逆变换,可获得原始数据尺度上的模拟值。我们将该方法应用于模拟数据及Kenneth French金融数据库中的实际数据集。与文献中现有方法相比,该方法在捕捉数据极端值依赖结构及生成精确新极端值方面均表现出优越性能。