简单和最佳方法处理随机可变不平等,一:操作者外推法 (Simple and optimal methods for stochastic variational inequalities, I: operator extrapolation)

In this paper we first present a novel operator extrapolation (OE) method for solving deterministic variational inequality (VI) problems. Similar to the gradient (operator) projection method, OE updates one single search sequence by solving a single projection subproblem in each iteration. We show that OE can achieve the optimal rate of convergence for solving a variety of VI problems in a much simpler way than existing approaches. We then introduce the stochastic operator extrapolation (SOE) method and establish its optimal convergence behavior for solving different stochastic VI problems. In particular, SOE achieves the optimal complexity for solving a fundamental problem, i.e., stochastic smooth and strongly monotone VI, for the first time in the literature. We also present a stochastic block operator extrapolations (SBOE) method to further reduce the iteration cost for the OE method applied to large-scale deterministic VIs with a certain block structure. Numerical experiments have been conducted to demonstrate the potential advantages of the proposed algorithms. In fact, all these algorithms are applied to solve generalized monotone variational inequality (GMVI) problems whose operator is not necessarily monotone. We will also discuss optimal OE-based policy evaluation methods for reinforcement learning in a companion paper.

翻译：在本文中,我们首先提出一种解决确定性差异不平等问题的新操作者外推法(OE) 。与梯度(操作者)预测方法类似,OE通过解决每个迭代中单一预测子问题,更新单一搜索序列。我们显示,OE能够以比现有方法简单得多的方式实现解决各种六种问题的最佳趋同率。然后我们引入随机操作者外推法(OE),并确立其解决不同随机性六种问题的最佳趋同行为。特别是,SOE实现了解决一个基本问题的最佳复杂性,即平滑和强烈单体六,这是第一次在文献中。我们还提出了一种对块操作者外推法(SBOEE),以进一步降低对具有某种块状结构的大型确定性六种使用OE方法的迭代法成本。进行了数值实验,以展示拟议的算法的潜在优势。事实上,所有这些算法都用于解决统一性单体政策的强化性软体变换方法。