用于控制和共同设计机器人和应用物理系统中的系统 (Stochastic Spatio-Temporal Optimization for Control and Co-Design of Systems in Robotics and Applied Physics)

Correlated with the trend of increasing degrees of freedom in robotic systems is a similar trend of rising interest in Spatio-Temporal systems described by Partial Differential Equations (PDEs) among the robotics and control communities. These systems often exhibit dramatic under-actuation, high dimensionality, bifurcations, and multimodal instabilities. Their control represents many of the current-day challenges facing the robotics and automation communities. Not only are these systems challenging to control, but the design of their actuation is an NP-hard problem on its own. Recent methods either discretize the space before optimization, or apply tools from linear systems theory under restrictive linearity assumptions in order to arrive at a control solution. This manuscript provides a novel sampling-based stochastic optimization framework based entirely in Hilbert spaces suitable for the general class of \textit{semi-linear} SPDEs which describes many systems in robotics and applied physics. This framework is utilized for simultaneous policy optimization and actuator co-design optimization. The resulting algorithm is based on variational optimization, and performs joint episodic optimization of the feedback control law and the actuation design over episodes. We study first and second order systems, and in doing so, extend several results to the case of second order SPDEs. Finally, we demonstrate the efficacy of the proposed approach with several simulated experiments on a variety of SPDEs in robotics and applied physics including an infinite degree-of-freedom soft robotic manipulator.

翻译：与机器人系统自由度上升趋势相关联的是,机器人系统自由度上升的趋势与机器人系统对Spatio-Temporal系统的兴趣上升趋势相似,这种趋势与机器人和控制群体对部分差异(PDEs)所描述的Spatio-Temporal系统的兴趣上升趋势相似。这些系统往往表现出惊人的低活度、高维度、双向和多式不稳定性。它们的控制代表了机器人和自动化社区当前面临的许多挑战。这些系统不仅对控制具有挑战性,而且它们的自由操作的设计本身也是一个软硬的问题。最近采用的方法要么在优化前将空间分离,要么在限制性线性线性假设下应用线性系统理论的工具,以达成控制解决方案。这个手稿提供了全新的基于抽样的随机优化框架,完全基于适用于一般等级的Hilbert空间。它们描述了机器人和应用物理学的许多系统。这个框架建议用于同步的政策优化和动作模拟模拟模拟优化。由此产生的算法的第二次演算以变异性优化为基础,并进行联合的直线系统缩缩缩缩缩缩缩缩图。