Recent Wave Energy Converters (WECs) are equipped with multiple legs and generators to maximize energy generation. Traditional controllers have shown limitations in capturing complex wave patterns, yet the controller must efficiently maximize energy capture. This paper introduces a Multi-Agent Reinforcement Learning (MARL) controller that outperforms the traditionally used spring-damper controller. Our initial studies show that the complex nature of the problem makes it hard for training to converge. Hence, we propose a novel skip-training approach that enables MARL training to overcome performance saturation and converge to better-performing controllers than default MARL training, boosting power generation. We also present another novel approach, hybrid training initialization (STHTI), in which the individual agents of the MARL controller are first trained individually against the baseline Spring Damper (SD) controller, and then trained one agent at a time or all together in subsequent iterations to accelerate convergence. Using the Asynchronous Advantage Actor-Critic (A3C) algorithm, the proposed MARL controllers achieve double-digit gains in energy efficiency over the baseline Spring Damper controller.
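The staged training schedule described above can be sketched as follows. This is a minimal illustrative outline, not the paper's implementation: the class names, the `train_step` interface, and the spring-damper gains are all assumptions made for the example.

```python
# Hypothetical sketch of the hybrid training initialization (STHTI)
# schedule: each agent first trains alone against the baseline
# spring-damper (SD) controller on the other legs, then agents are
# trained one at a time (or jointly) in later iterations.
# All names and signatures here are illustrative assumptions.

def spring_damper_force(velocity, position=0.0, stiffness=1.0, damping=0.5):
    """Baseline SD control law: F = -k*x - c*v (gains are placeholders)."""
    return -stiffness * position - damping * velocity

class LegAgent:
    """Placeholder for one A3C actor-critic agent controlling one WEC leg."""
    def __init__(self, leg_id):
        self.leg_id = leg_id

    def act(self, obs):
        # In a real controller this would be the policy network's output.
        return 0.0

    def train_step(self, env, partner_policies):
        # One training iteration while the other legs follow the given
        # policies (SD baselines in phase 1, co-agents in phase 2).
        pass

def sthti_schedule(agents, env, phase1_iters, phase2_iters):
    # Phase 1: each agent trains individually; the remaining legs run
    # the baseline SD controller.
    for agent in agents:
        partners = {a.leg_id: spring_damper_force
                    for a in agents if a is not agent}
        for _ in range(phase1_iters):
            agent.train_step(env, partners)

    # Phase 2: agents continue training one at a time, each against the
    # current policies of the other agents (they could equally be
    # updated all together here).
    for _ in range(phase2_iters):
        for agent in agents:
            partners = {a.leg_id: a.act for a in agents if a is not agent}
            agent.train_step(env, partners)
    return agents
```

The key design point is that phase 1 gives every agent a sensible starting policy relative to the known SD baseline before the harder joint multi-agent optimization begins, which is what the abstract credits for the accelerated convergence.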