以非高加索噪音抽样对自治系统进行强力控制 (Sampling-Based Robust Control of Autonomous Systems with Non-Gaussian Noise)

Controllers for autonomous systems that operate in safety-critical settings must account for stochastic disturbances. Such disturbances are often modelled as process noise, and common assumptions are that the underlying distributions are known and/or Gaussian. In practice, however, these assumptions may be unrealistic and can lead to poor approximations of the true noise distribution. We present a novel planning method that does not rely on any explicit representation of the noise distributions. In particular, we address the problem of computing a controller that provides probabilistic guarantees on safely reaching a target. First, we abstract the continuous system into a discrete-state model that captures noise by probabilistic transitions between states. As a key contribution, we adapt tools from the scenario approach to compute probably approximately correct (PAC) bounds on these transition probabilities, based on a finite number of samples of the noise. We capture these bounds in the transition probability intervals of a so-called interval Markov decision process (iMDP). This iMDP is robust against uncertainty in the transition probabilities, and the tightness of the probability intervals can be controlled through the number of samples. We use state-of-the-art verification techniques to provide guarantees on the iMDP, and compute a controller for which these guarantees carry over to the autonomous system. Realistic benchmarks show the practical applicability of our method, even when the iMDP has millions of states or transitions.

翻译：在安全关键环境下运行的自主系统控制者必须说明在安全关键环境下运行的自动系统的干扰。这种扰动往往是以过程噪音为模范,通常的假设是,基本分布为人所知和/或高斯。但在实践中,这些假设可能是不现实的,可能导致真实噪音分布的近似值差。我们提出了一个新的规划方法,并不依赖噪音分布的任何明确表述。特别是,我们解决了计算一个控制器的问题,该控制器为安全达到目标提供了概率保障。首先,我们将连续系统转化为一个独立状态模型,通过各州之间的概率过渡捕捉噪音。作为一项关键贡献,我们从假设方法中调整工具,以便根据噪音分布的有限数量大致正确(PAC)地理解这些过渡的概率。我们从一个所谓的间隙马可(iMDP)决策过程(iMDP)的过渡概率间隔中捕捉到这些界限。这个iMDP的概率间隔很紧,因为过渡的概率间隔期很不稳定,而且能够通过数字的IDP检验方法来控制。我们使用这些自动核查方法,我们用这种系统进行真正的核查。