贝耶斯学习系统动态学中的控制障碍 (Control Barriers in Bayesian Learning of System Dynamics) - 专知论文

会员服务 ·

0

控制器 · 学成 · MoDELS · 泛函 · 估计/估计量 ·

2021 年 8 月 28 日

Control Barriers in Bayesian Learning of System Dynamics

翻译：贝耶斯学习系统动态学中的控制障碍

Vikas Dhiman,Mohammad Javad Khojasteh,Massimo Franceschetti,Nikolay Atanasov

from arxiv, Submitted to IEEE Transactions on Automatic Control. Journal extension of 1912.10116. arXiv admin note: text overlap with arXiv:1912.10116

This paper focuses on learning a model of system dynamics online while satisfying safety constraints. Our objective is to avoid offline system identification or hand-specified models and allow a system to safely and autonomously estimate and adapt its own model during operation. Given streaming observations of the system state, we use Bayesian learning to obtain a distribution over the system dynamics. Specifically, we propose a new matrix variate Gaussian process (MVGP) regression approach with an efficient covariance factorization to learn the drift and input gain terms of a nonlinear control-affine system. The MVGP distribution is then used to optimize the system behavior and ensure safety with high probability, by specifying control Lyapunov function (CLF) and control barrier function (CBF) chance constraints. We show that a safe control policy can be synthesized for systems with arbitrary relative degree and probabilistic CLF-CBF constraints by solving a second order cone program (SOCP). Finally, we extend our design to a self-triggering formulation, adaptively determining the time at which a new control input needs to be applied in order to guarantee safety.

翻译：本文侧重于在满足安全限制的同时在网上学习系统动态模型。我们的目标是避免离线系统识别或手指定的模型,并允许一个系统在运行期间安全自主地估计和调整自己的模型。根据系统状态的不断观测,我们利用巴伊西亚学习获得系统动态的分布。具体地说,我们建议采用新的矩阵变式高萨进程(MVGP)回归法,并采用高效的共变系数,学习非线性控制-芬菲系统的漂移和输入增益条件。然后,MVGP分布被用于优化系统行为并确保高度概率的安全,具体指定控制 Lyapunov 函数和控制屏障功能(CBF) 。我们表明,可以通过解决第二个调控调程序(SOP),将安全控制政策综合到任意相对和具有概率性的系统。最后,我们把设计扩大到自触发式配制,适应性地决定需要应用新的控制输入的时间,以保证安全。

0

相关内容

控制器

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【开放书】贝叶斯推理与机器学习，690页pdf，Bayesian Reasoning and Machine Learning

【开放书】贝叶斯推理与机器学习，690页pdf，Bayesian Reasoning and Machine Learning

专知会员服务

191+阅读 · 2020年5月30日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

强化学习最优表示的几何视角（A Geometric Perspective on Optimal Representations for Reinforcement Learning）

强化学习最优表示的几何视角（A Geometric Perspective on Optimal Representations for Reinforcement Learning）

专知会员服务

9+阅读 · 2019年12月24日

【电子书推荐】机器学习导论Introduction to Machine Learning，斯坦福大学 | Nils J. Nilsson

【电子书推荐】机器学习导论Introduction to Machine Learning，斯坦福大学 | Nils J. Nilsson

专知会员服务

46+阅读 · 2019年11月19日

【DLBM-SS暑期课程】深度学习与贝叶斯方法 Deep Learning and Bayesian Methods

【DLBM-SS暑期课程】深度学习与贝叶斯方法 Deep Learning and Bayesian Methods

专知会员服务

67+阅读 · 2019年11月10日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

carla 体验效果及代码

carla 体验效果及代码

CreateAMind

7+阅读 · 2018年2月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

最大熵原理（一）

最大熵原理（一）

深度学习探索

12+阅读 · 2017年8月3日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System

Arxiv

0+阅读 · 2021年10月21日

Targeted Active Learning for Bayesian Decision-Making

Arxiv

0+阅读 · 2021年10月20日

Interactive simulation for easy decision-making in fluid dynamics

Arxiv

0+阅读 · 2021年10月20日

Feedback Linearization of Car Dynamics for Racing via Reinforcement Learning

Arxiv

0+阅读 · 2021年10月20日

Quadrotor Trajectory Tracking with Learned Dynamics: Joint Koopman-based Learning of System Models and Function Dictionaries

Arxiv

0+阅读 · 2021年10月20日

BNPdensity: Bayesian nonparametric mixture modeling in R

Arxiv

0+阅读 · 2021年10月19日

Improving Robustness of Reinforcement Learning for Power System Control with Adversarial Training

Arxiv

0+阅读 · 2021年10月19日

A Tour of Reinforcement Learning: The View from Continuous Control

Arxiv

6+阅读 · 2018年6月25日

Safety-aware Adaptive Reinforcement Learning with Applications to Brushbot Navigation

Arxiv

4+阅读 · 2018年1月29日

Multiple Object Detection, Tracking and Long-Term Dynamics Learning in Large 3D Maps

Arxiv

6+阅读 · 2018年1月28日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【开放书】贝叶斯推理与机器学习，690页pdf，Bayesian Reasoning and Machine Learning

【开放书】贝叶斯推理与机器学习，690页pdf，Bayesian Reasoning and Machine Learning

专知会员服务

191+阅读 · 2020年5月30日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

强化学习最优表示的几何视角（A Geometric Perspective on Optimal Representations for Reinforcement Learning）

强化学习最优表示的几何视角（A Geometric Perspective on Optimal Representations for Reinforcement Learning）

专知会员服务

9+阅读 · 2019年12月24日

【电子书推荐】机器学习导论Introduction to Machine Learning，斯坦福大学 | Nils J. Nilsson

【电子书推荐】机器学习导论Introduction to Machine Learning，斯坦福大学 | Nils J. Nilsson

专知会员服务

46+阅读 · 2019年11月19日

【DLBM-SS暑期课程】深度学习与贝叶斯方法 Deep Learning and Bayesian Methods

【DLBM-SS暑期课程】深度学习与贝叶斯方法 Deep Learning and Bayesian Methods

专知会员服务

67+阅读 · 2019年11月10日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

【NTU博士论文】利用强化学习与生成模型推进可靠且可泛化的决策

美海军研发“增强侦察与态势评估系统（ARES）”应用程序以优化作战规划（附研究论文）

【NeurIPS2025】DNA-DetectLLM：基于 DNA 启发的“突变-修复”范式揭示 AI 生成文本

面向深度研究系统的强化学习基础：综述

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

carla 体验效果及代码

carla 体验效果及代码

CreateAMind

7+阅读 · 2018年2月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

最大熵原理（一）

最大熵原理（一）

深度学习探索

12+阅读 · 2017年8月3日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System

Arxiv

0+阅读 · 2021年10月21日

Targeted Active Learning for Bayesian Decision-Making

Arxiv

0+阅读 · 2021年10月20日

Interactive simulation for easy decision-making in fluid dynamics

Arxiv

0+阅读 · 2021年10月20日

Feedback Linearization of Car Dynamics for Racing via Reinforcement Learning

Arxiv

0+阅读 · 2021年10月20日

Quadrotor Trajectory Tracking with Learned Dynamics: Joint Koopman-based Learning of System Models and Function Dictionaries

Arxiv

0+阅读 · 2021年10月20日

BNPdensity: Bayesian nonparametric mixture modeling in R

Arxiv

0+阅读 · 2021年10月19日

Improving Robustness of Reinforcement Learning for Power System Control with Adversarial Training

Arxiv

0+阅读 · 2021年10月19日

A Tour of Reinforcement Learning: The View from Continuous Control

Arxiv

6+阅读 · 2018年6月25日

Safety-aware Adaptive Reinforcement Learning with Applications to Brushbot Navigation

Arxiv

4+阅读 · 2018年1月29日

Multiple Object Detection, Tracking and Long-Term Dynamics Learning in Large 3D Maps

Arxiv

6+阅读 · 2018年1月28日

微信扫码咨询专知VIP会员