Stability Guarantees for Continuous RL Control - 专知 (Zhuanzhi) Papers

September 15, 2022

Stability Guarantees for Continuous RL Control

Bing Song, Jean-Jacques Slotine, Quang-Cuong Pham

Lack of stability guarantees strongly limits the use of reinforcement learning (RL) in safety-critical robotic applications. Here we propose a control system architecture for continuous RL control and derive corresponding stability theorems via contraction analysis, yielding constraints on the network weights to ensure stability. The control architecture can be implemented in general RL algorithms and can improve their stability, robustness, and sample efficiency. We demonstrate the importance and benefits of such guarantees for RL on two standard examples, PPO learning of a 2D problem and HIRO learning of maze tasks.
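
To make the idea of "constraints on the network weights" concrete, here is a minimal sketch of a textbook contraction argument. It is not the architecture or the theorems of the paper, whose details are not given in the abstract. It assumes a hypothetical closed-loop system x' = -x + W tanh(x). The Jacobian is -I + W D(x), where D(x) is diagonal with entries in (0, 1], so the largest eigenvalue of its symmetric part is at most -1 + ||W||_2. Keeping the spectral norm of W below 1 is therefore a sufficient condition for contraction in the identity metric. The function names and the 0.9 bound are illustrative assumptions.

```python
import numpy as np


def contraction_rate_bound(W):
    """Lower bound on the contraction rate of x' = -x + W tanh(x).

    Positive when the spectral norm of W is below 1 (identity metric).
    """
    return 1.0 - np.linalg.norm(W, 2)


def project_spectral_norm(W, max_norm=0.9):
    """Rescale W so that its spectral norm is at most max_norm (assumed bound)."""
    s = np.linalg.norm(W, 2)
    return W if s <= max_norm else W * (max_norm / s)


def simulate(W, x0, dt=0.01, steps=2000):
    """Forward-Euler rollout of the assumed dynamics x' = -x + W tanh(x)."""
    x = np.array(x0, dtype=float)
    for _ in range(steps):
        x = x + dt * (-x + W @ np.tanh(x))
    return x


rng = np.random.default_rng(0)
# Random weights, then projected onto the constraint set ||W||_2 <= 0.9.
W = project_spectral_norm(rng.normal(size=(4, 4)))

# Two different initial states; contraction means the trajectories converge
# toward each other regardless of where they start.
x0a = rng.normal(size=4)
x0b = rng.normal(size=4)
xa = simulate(W, x0a)
xb = simulate(W, x0b)

print("rate bound:", contraction_rate_bound(W))
print("initial gap:", np.linalg.norm(x0a - x0b))
print("final gap:  ", np.linalg.norm(xa - xb))
```

In a training loop, a projection of this kind would be applied after each gradient step so that the constraint holds throughout learning. Whether the paper enforces its constraints by projection, by parameterization, or by a penalty term is not stated in the abstract.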
