A novel methodology is developed for the solution of the data-driven Monge optimal transport barycenter problem, where the pushforward condition is formulated in terms of the statistical independence between two sets of random variables: the factors $z$ and a transformed outcome $y$. Relaxing independence to the uncorrelation between all functions of $z$ and $y$ within suitable finite-dimensional spaces leads to an adversarial formulation, for which the adversarial strategy can be found in closed form through the first principal components of a small-dimensional matrix. The resulting pure minimization problem can be solved very efficiently through gradient descent driven flows in phase space. The methodology extends beyond scenarios where only discrete factors affect the outcome, to multivariate sets of both discrete and continuous factors, for which the corresponding barycenter problems have infinitely many marginals. Corollaries include a new framework for the solution of the Monge optimal transport problem, a procedure for the data-based simulation and estimation of conditional probability densities, and a nonparametric methodology for Bayesian inference.
翻译:本文针对数据驱动的蒙日最优传输重心问题提出了一种新颖的求解方法,其中前推条件通过两组随机变量——因子$z$与变换后结果$y$——之间的统计独立性来表述。将独立性松弛为在适当有限维空间中所有$z$和$y$函数之间的不相关性,导出了一个对抗性表述形式,其对抗策略可通过低维矩阵的第一主成分以闭式解形式获得。由此得到的纯最小化问题可通过相空间中的梯度下降流高效求解。该方法不仅适用于仅离散因子影响结果的场景,还可扩展至包含离散与连续因子的多元集合,对应的重心问题具有无穷多个边缘分布。推论包括:一种求解蒙日最优传输问题的新框架、基于数据的条件概率密度模拟与估计流程,以及一种用于贝叶斯推断的非参数化方法。