Bertrand et al. introduced a model of parameterised systems, where each agent is represented by a finite state system, and studied the following control problem: for any number of agents, does there exist a controller able to bring all agents to a target state? They showed that the problem is decidable and EXPTIME-complete in the adversarial setting, and posed as an open problem the stochastic setting, where the agent is represented by a Markov decision process. In this paper, we show that the stochastic control problem is decidable. Our solution makes significant uses of well quasi orders, of the max-flow min-cut theorem, and of the theory of regular cost functions. We introduce an intermediate problem of independent interest called the sequential flow problem, and study the complexity of solving it.
翻译:Bertrand等人引入了一个参数化系统模型,其中每种物剂都有一定的状态系统作为代表,并研究了以下控制问题:对于任何数个物剂来说,是否有一个控制器能够将所有物剂带入目标状态?它们表明,这个问题在对抗状态下是可以分解的,EXPTIME是完整的,并作为一个开放的问题提出,该物剂由Markov决策程序作为代表。在本文中,我们表明,随机控制问题是可以分解的。我们的解决办法大量利用了良好的准定单、最大流量的微积分定理和常规成本功能理论。我们引入了一个称为连续流问题的中间独立利益问题,并研究解决这一问题的复杂性。