Co:通过监督学习实现在线混合内插控制 (CoCo: Online Mixed-Integer Control via Supervised Learning)

Many robotics problems, from robot motion planning to object manipulation, can be modeled as mixed-integer convex programs (MICPs). However, state-of-the-art algorithms are still unable to solve MICPs for control problems quickly enough for online use and existing heuristics can typically only find suboptimal solutions that might degrade robot performance. In this work, we turn to data-driven methods and present the Combinatorial Offline, Convex Online (CoCo) algorithm for quickly finding high quality solutions for MICPs. CoCo consists of a two-stage approach. In the offline phase, we train a neural network classifier that maps the problem parameters to a (logical strategy), which we define as the discrete arguments and relaxed big-M constraints associated with the optimal solution for that problem. Online, the classifier is applied to select a candidate logical strategy given new problem parameters; applying this logical strategy allows us to solve the original MICP as a convex optimization problem. We show through numerical experiments how CoCo finds near optimal solutions to MICPs arising in robot planning and control with 1 to 2 orders of magnitude solution speedup compared to other data-driven approaches and solvers.

翻译：许多机器人问题,从机器人运动规划到天体操纵,都可以以混合内插式 convex 程序(MICPs)为模型。然而,最先进的算法仍然无法迅速解决MICP的在线控制问题,而现有的超自然学通常只能找到可能降低机器人性能的亚最佳解决方案。在这项工作中,我们转向数据驱动方法,并推出组合式离线、Convex Online(CoCoCo)算法,以迅速为 MICPs找到高质量解决方案。CoCo由两阶段方法组成。在离线阶段,我们训练一个神经网络分类器,将问题参数映射到一个(逻辑战略),我们把这个问题定义为离散的参数和与该问题最佳解决方案相关的放松的大M限制。Online,这个分类器用于选择一个符合新问题参数的候选逻辑战略;应用这一逻辑战略,我们就可以解决原始的MICP作为同系优化问题。我们通过数字实验,通过机器人的规划和控制,用1至2个驱动式的数据速度方法,将其他解算算出最接近最佳的解决方案。