基于强化学习的分布参数系统数据驱动控制 - 专知基金

会员服务 ·

0

分布参数系统 · 强化学习 ·

2015 年 12 月 31 日

基于强化学习的分布参数系统数据驱动控制

国家自然科学基金

国家自然科学基金委员会

项目名称： 基于强化学习的分布参数系统数据驱动控制

项目编号： No.61503377

项目类型： 青年科学基金项目

立项/批准年度： 2016

项目学科： 自动化技术、计算机技术

项目作者： 罗彪

作者单位： 中国科学院自动化研究所

项目金额： 21万元

中文摘要： 大部分实际工业过程均为分布参数系统(DPS)，它们本质上由复杂的非线性偏微分方程描述，DPS具有无穷维自由度的特征，所以现有针对集中参数系统的控制理论与方法无法直接用于DPS。而且，随着科学技术的快速发展，工业系统的规模越来越大，生产过程越来越复杂，导致精确建立DPS数学模型的代价非常大，或是模型非常复杂而无法用于控制器设计。为解决这一困难，本项目拟引入强化学习的思想，研究DPS数据驱动控制问题，建立相应的性能分析与稳定性理论，并通过计算机仿真，验证方法的有效性，和探讨实际应用的可行性。通过对本项目的研究，将为DPS的控制设计提供一些新的、有效的方法和理论依据，促进数据驱动控制理论的发展，具有重要的科学意义和应用价值。

中文关键词： 数据驱动控制；分布参数系统；强化学习；偏微分方程

英文摘要： Most of practical industrial processes are distributed parameter system (DPS), which are essentially described by a set of complex nonlinear partial differential equations. Due to the infinite-dimensional nature of the DPSs, the direct application of the control theories and methods for lumped parameter systems to them is impossible. Moreover, with the fast developments of science technologies, many industrial processes become more and more complicated due to their large scale and complex manufacturing techniques, equipments and procedures. Therefore, the accurately modeling and identification of these processes are often costly to conduct, or the established models are too complicated to support controller design. To overcome these difficulties, this project aims at studying reinforcement learning methods for data-driven control problem of the DPSs, and establishing its theories for performance and stability analysis. The effectiveness and the practical feasibility of the methods will be overified with computer simulations. Through the research of this project, some novel and effective methods and theories will be provided for control design of DPSs, which are extremely important for the development of data-driven control theories and meaningful in both scientific researches and real engineering applications.

英文关键词： Data-driven control;distributed parameter systems;reinforcement learning;partial differential equation

成为VIP会员查看完整内容

5

相关内容

分布参数系统

分布参数系统

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

专知会员服务

32+阅读 · 2022年1月31日

牛津大学发布60页最新《强化学习金融应用进展》综述论文

牛津大学发布60页最新《强化学习金融应用进展》综述论文

专知会员服务

114+阅读 · 2021年12月12日

Kyoto大学Toshiyuki：快速复杂控制系统的实时优化，133页ppt

Kyoto大学Toshiyuki：快速复杂控制系统的实时优化，133页ppt

专知会员服务

39+阅读 · 2021年12月7日

专家控制系统

专知会员服务

29+阅读 · 2021年9月14日

逆优化: 理论与应用

逆优化: 理论与应用

专知会员服务

35+阅读 · 2021年9月13日

强化学习的自动驾驶控制技术研究进展

专知会员服务

127+阅读 · 2021年2月17日

【2020新书】基于人工智能的机器人安全学习和控制，138页pdf

【2020新书】基于人工智能的机器人安全学习和控制，138页pdf

专知会员服务

49+阅读 · 2020年12月12日

基于深度强化学习的组合优化研究进展

专知会员服务

81+阅读 · 2020年12月11日

【UCLA】基于深度神经网络的工业大模型预测控制，36页ppt

【UCLA】基于深度神经网络的工业大模型预测控制，36页ppt

专知会员服务

46+阅读 · 2020年5月23日

《强化学习》简介小册，24页pdf

《强化学习》简介小册，24页pdf

专知会员服务

263+阅读 · 2020年4月19日

我的天，强化学习还能用在自动驾驶领域？

我的天，强化学习还能用在自动驾驶领域？

夕小瑶的卖萌屋

1+阅读 · 2022年3月24日

【2022新书】强化学习工业应用

【2022新书】强化学习工业应用

专知

15+阅读 · 2022年2月3日

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

专知

3+阅读 · 2022年1月31日

Kyoto大学Toshiyuki：快速复杂控制系统的实时优化，133页ppt

Kyoto大学Toshiyuki：快速复杂控制系统的实时优化，133页ppt

专知

0+阅读 · 2021年12月7日

人工神经网络在材料科学中的研究进展

人工神经网络在材料科学中的研究进展

专知

0+阅读 · 2021年5月7日

【APC】先进过程控制系统（APC: Advanced Process Control）

【APC】先进过程控制系统（APC: Advanced Process Control）

产业智能官

57+阅读 · 2020年7月12日

ICLR 2019论文解读：深度学习应用于复杂系统控制

ICLR 2019论文解读：深度学习应用于复杂系统控制

机器之心

11+阅读 · 2019年1月10日

生成对抗网络的研究进展与趋势

生成对抗网络的研究进展与趋势

中国计算机学会

35+阅读 · 2018年11月14日

论强化学习和概率推断的等价性：一种全新概率模型

论强化学习和概率推断的等价性：一种全新概率模型

机器之心

24+阅读 · 2018年5月5日

【强化学习】「ICML教程」深度强化学习，决策与控制（117 PPT）

【强化学习】「ICML教程」深度强化学习，决策与控制（117 PPT）

产业智能官

40+阅读 · 2017年8月11日

数据信息驱动的非高斯随机分布系统抗干扰控制

国家自然科学基金

0+阅读 · 2014年12月31日

基于平行系统的游梁式抽油机采油系统建模与控制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

复杂非线性系统数据驱动自学习最优控制理论与方法研究

国家自然科学基金

2+阅读 · 2012年12月31日

分布参数系统的迭代学习控制及其应用

国家自然科学基金

0+阅读 · 2012年12月31日

粒子滤波在复杂系统模型辨识与状态估计中的应用研究

国家自然科学基金

0+阅读 · 2009年12月31日

随机分布系统两步建模及自适应、自学习控制方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

分布参数系统的H-无穷控制理论

国家自然科学基金

0+阅读 · 2009年12月31日

基于过程输入输出模型的非线性非高斯随机系统输出概率密度函数控制

国家自然科学基金

0+阅读 · 2009年12月31日

分布参数系统逻辑切换自适应控制及应用

国家自然科学基金

0+阅读 · 2009年12月31日

基于支持向量机的复杂连续系统强化学习控制研究

国家自然科学基金

11+阅读 · 2008年12月31日

State machines for large scale computer software and systems

Arxiv

0+阅读 · 2022年4月19日

Learning-Based Approaches for Graph Problems: A Survey

Arxiv

0+阅读 · 2022年4月17日

Efficient Attribute Unlearning: Towards Selective Removal of Input Attributes from Feature Representations

Arxiv

0+阅读 · 2022年4月16日

Large-Scale Deep Learning Optimizations: A Comprehensive Survey

Arxiv

21+阅读 · 2021年11月2日

Survey: Transformer based Video-Language Pre-training

Arxiv

20+阅读 · 2021年9月21日

Embedding-based Retrieval in Facebook Search

Arxiv

12+阅读 · 2020年6月20日

Distributed Machine Learning on Mobile Devices: A Survey

Distributed Machine Learning on Mobile Devices: A Survey

Arxiv

35+阅读 · 2019年9月18日

Optimization Models for Machine Learning: A Survey

Arxiv

18+阅读 · 2019年1月16日

Notes on Deep Learning for NLP

Arxiv

22+阅读 · 2018年8月30日

Multiagent Soft Q-Learning

Arxiv

11+阅读 · 2018年4月25日

阅读: 0 点赞: 0

小贴士

登录享主题订阅及个性化推荐

相关主题

分布参数系统

热门VIP内容

相关VIP内容

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

专知会员服务

32+阅读 · 2022年1月31日

牛津大学发布60页最新《强化学习金融应用进展》综述论文

牛津大学发布60页最新《强化学习金融应用进展》综述论文

专知会员服务

114+阅读 · 2021年12月12日

Kyoto大学Toshiyuki：快速复杂控制系统的实时优化，133页ppt

Kyoto大学Toshiyuki：快速复杂控制系统的实时优化，133页ppt

专知会员服务

39+阅读 · 2021年12月7日

专家控制系统

专知会员服务

29+阅读 · 2021年9月14日

逆优化: 理论与应用

逆优化: 理论与应用

专知会员服务

35+阅读 · 2021年9月13日

强化学习的自动驾驶控制技术研究进展

专知会员服务

127+阅读 · 2021年2月17日

【2020新书】基于人工智能的机器人安全学习和控制，138页pdf

【2020新书】基于人工智能的机器人安全学习和控制，138页pdf

专知会员服务

49+阅读 · 2020年12月12日

基于深度强化学习的组合优化研究进展

专知会员服务

81+阅读 · 2020年12月11日

【UCLA】基于深度神经网络的工业大模型预测控制，36页ppt

【UCLA】基于深度神经网络的工业大模型预测控制，36页ppt

专知会员服务

46+阅读 · 2020年5月23日

《强化学习》简介小册，24页pdf

《强化学习》简介小册，24页pdf

专知会员服务

263+阅读 · 2020年4月19日

相关资讯

我的天，强化学习还能用在自动驾驶领域？

我的天，强化学习还能用在自动驾驶领域？

夕小瑶的卖萌屋

1+阅读 · 2022年3月24日

【2022新书】强化学习工业应用

【2022新书】强化学习工业应用

专知

15+阅读 · 2022年2月3日

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

专知

3+阅读 · 2022年1月31日

Kyoto大学Toshiyuki：快速复杂控制系统的实时优化，133页ppt

Kyoto大学Toshiyuki：快速复杂控制系统的实时优化，133页ppt

专知

0+阅读 · 2021年12月7日

人工神经网络在材料科学中的研究进展

人工神经网络在材料科学中的研究进展

专知

0+阅读 · 2021年5月7日

【APC】先进过程控制系统（APC: Advanced Process Control）

【APC】先进过程控制系统（APC: Advanced Process Control）

产业智能官

57+阅读 · 2020年7月12日

ICLR 2019论文解读：深度学习应用于复杂系统控制

ICLR 2019论文解读：深度学习应用于复杂系统控制

机器之心

11+阅读 · 2019年1月10日

生成对抗网络的研究进展与趋势

生成对抗网络的研究进展与趋势

中国计算机学会

35+阅读 · 2018年11月14日

论强化学习和概率推断的等价性：一种全新概率模型

论强化学习和概率推断的等价性：一种全新概率模型

机器之心

24+阅读 · 2018年5月5日

【强化学习】「ICML教程」深度强化学习，决策与控制（117 PPT）

【强化学习】「ICML教程」深度强化学习，决策与控制（117 PPT）

产业智能官

40+阅读 · 2017年8月11日

相关基金

数据信息驱动的非高斯随机分布系统抗干扰控制

国家自然科学基金

0+阅读 · 2014年12月31日

基于平行系统的游梁式抽油机采油系统建模与控制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

复杂非线性系统数据驱动自学习最优控制理论与方法研究

国家自然科学基金

2+阅读 · 2012年12月31日

分布参数系统的迭代学习控制及其应用

国家自然科学基金

0+阅读 · 2012年12月31日

粒子滤波在复杂系统模型辨识与状态估计中的应用研究

国家自然科学基金

0+阅读 · 2009年12月31日

随机分布系统两步建模及自适应、自学习控制方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

分布参数系统的H-无穷控制理论

国家自然科学基金

0+阅读 · 2009年12月31日

基于过程输入输出模型的非线性非高斯随机系统输出概率密度函数控制

国家自然科学基金

0+阅读 · 2009年12月31日

分布参数系统逻辑切换自适应控制及应用

国家自然科学基金

0+阅读 · 2009年12月31日

基于支持向量机的复杂连续系统强化学习控制研究

国家自然科学基金

11+阅读 · 2008年12月31日

相关论文

State machines for large scale computer software and systems

Arxiv

0+阅读 · 2022年4月19日

Learning-Based Approaches for Graph Problems: A Survey

Arxiv

0+阅读 · 2022年4月17日

Efficient Attribute Unlearning: Towards Selective Removal of Input Attributes from Feature Representations

Arxiv

0+阅读 · 2022年4月16日

Large-Scale Deep Learning Optimizations: A Comprehensive Survey

Arxiv

21+阅读 · 2021年11月2日

Survey: Transformer based Video-Language Pre-training

Arxiv

20+阅读 · 2021年9月21日

Embedding-based Retrieval in Facebook Search

Arxiv

12+阅读 · 2020年6月20日

Distributed Machine Learning on Mobile Devices: A Survey

Distributed Machine Learning on Mobile Devices: A Survey

Arxiv

35+阅读 · 2019年9月18日

Optimization Models for Machine Learning: A Survey

Arxiv

18+阅读 · 2019年1月16日

Notes on Deep Learning for NLP

Arxiv

22+阅读 · 2018年8月30日

Multiagent Soft Q-Learning

Arxiv

11+阅读 · 2018年4月25日

微信扫码咨询专知VIP会员