This work presents a multilevel variant of Stein variational gradient descent to more efficiently sample from target distributions. The key ingredient is a sequence of distributions with growing fidelity and costs that converges to the target distribution of interest. For example, such a sequence of distributions is given by a hierarchy of ever finer discretization levels of the forward model in Bayesian inverse problems. The proposed multilevel Stein variational gradient descent moves most of the iterations to lower, cheaper levels with the aim of requiring only a few iterations on the higher, more expensive levels when compared to the traditional, single-level Stein variational gradient descent variant that uses the highest-level distribution only. Under certain assumptions, in the mean-field limit, the error of the proposed multilevel Stein method decays by a log factor faster than the error of the single-level counterpart with respect to computational costs. Numerical experiments with Bayesian inverse problems show speedups of more than one order of magnitude of the proposed multilevel Stein method compared to the single-level variant that uses the highest level only.
翻译:这项工作展示了Stein变异梯度下降到目标分布中更高效抽样的多层次变量。 关键要素是分布序列,其忠诚度和成本不断提高,与目标利益分布相匹配。 例如,巴伊西亚反面问题中远方模型的细细分级等级给出了这种分布顺序。 拟议的多层次斯坦度变异梯度下降将大多数迭代降低到更低、更便宜的水平, 目的是与仅使用最高水平分配的传统、 单级斯坦度变异梯度变量相比,只需要在较高、 更昂贵的水平上进行几次迭代。 根据某些假设,在平均场限度中,拟议多层次施泰因法的错误由比单级对应方计算成本错误更快的日志系数衰减。 与巴伊西亚反面问题相比, 数值实验显示,与仅使用最高水平的单级版本相比,拟议多层次施泰因方法的加速度超过一个级级级数。