以元加强学习为基础的自我适应系统方法 (A Meta Reinforcement Learning-based Approach for Self-Adaptive System)

A self-learning adaptive system (SLAS) uses machine learning to enable and enhance its adaptability. Such systems are expected to perform well in dynamic situations. For learning high-performance adaptation policy, some assumptions must be made on the environment-system dynamics when information about the real situation is incomplete. However, these assumptions cannot be expected to be always correct, and yet it is difficult to enumerate all possible assumptions. This leads to the problem of incomplete-information learning. We consider this problem as multiple model problem in terms of finding the adaptation policy that can cope with multiple models of environment-system dynamics. This paper proposes a novel approach to engineering the online adaptation of SLAS. It separates three concerns that are related to the adaptation policy and presents the modeling and synthesis process, with the goal of achieving higher model construction efficiency. In addition, it designs a meta-reinforcement learning algorithm for learning the meta policy over the multiple models, so that the meta policy can quickly adapt to the real environment-system dynamics. At last, it reports the case study on a robotic system to evaluate the adaptability of the approach.

翻译：自学适应系统(SLAS)使用机器学习来扶持和加强其适应能力,这种系统在动态情况下可望运作良好。为了学习高性能适应政策,在真实情况信息不完整时,必须对环境系统动态进行一些假设,但不能预期这些假设总是正确,但很难列举所有可能的假设。这导致了信息学习不全的问题。我们认为,从寻找适应政策能够应对环境系统动态的多种模型的角度来看,这个问题是一个多重模式问题。本文提出了设计系统在线适应的新办法。它分离了三个与适应政策有关的关切,并提出了模型和综合进程,目的是实现更高的模型建设效率。此外,它设计了一个元化强化学习算法,用于学习多重模型的元政策,以便元政策能够迅速适应实际环境系统动态。最后,它报告了关于机器人系统评估方法适应性的案例研究。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【斯坦福大学课程】2021年深度多任务学习与元学习，CS 330: Deep Multi-Task and Meta Learning

专知会员服务

110+阅读 · 2022年3月2日

【DeepMind】基于模型的强化学习，174页ppt，Model-Based Reinforcement Learning

专知会员服务

89+阅读 · 2021年1月12日

【牛津大学博士论文】基于强化学习的无地图机器人导航，Reinforcement Learning Based MRN

专知会员服务

122+阅读 · 2020年5月18日

元学习(meta learning) 最新进展综述论文

专知会员服务

281+阅读 · 2020年5月8日