学习通过适应计划模型在开放世界中操作 (Learning to Operate in Open Worlds by Adapting Planning Models) - 专知论文

会员服务 ·

0

开放世界 · 操作 · 环境模型 · 启发式 · 推断 ·

2023 年 3 月 24 日

Learning to Operate in Open Worlds by Adapting Planning Models

翻译：学习通过适应计划模型在开放世界中操作

Wiktor Piotrowski,Roni Stern,Yoni Sher,Jacob Le,Matthew Klenk,Johan deKleer,Shiwali Mohan

from arxiv, To appears in the Proceedings of the 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023)

Planning agents are ill-equipped to act in novel situations in which their domain model no longer accurately represents the world. We introduce an approach for such agents operating in open worlds that detects the presence of novelties and effectively adapts their domain models and consequent action selection. It uses observations of action execution and measures their divergence from what is expected, according to the environment model, to infer existence of a novelty. Then, it revises the model through a heuristics-guided search over model changes. We report empirical evaluations on the CartPole problem, a standard Reinforcement Learning (RL) benchmark. The results show that our approach can deal with a class of novelties very quickly and in an interpretable fashion.

翻译：计划代理在其领域模型不再准确表示世界的新情况下往往无法有效地行动。我们引入了一种适用于在开放世界中操作的代理的方法，该方法检测到新颖性的存在并有效地调整其领域模型和结果动作选择。它使用动作执行的观察结果，并根据环境模型测量其与预期的差异来推断新颖性的存在。然后，通过启发式引导搜索模型变化来修订模型。我们在CartPole问题上进行经验评估，这是一个标准的强化学习（RL）基准。结果表明，我们的方法可以快速处理一类新颖性，并且具有可解释性。

0

相关内容

开放世界

终身学习如何构建？NeurIPS2022《终身学习机》教程，70页ppt

终身学习如何构建？NeurIPS2022《终身学习机》教程，70页ppt

专知会员服务

46+阅读 · 2023年1月26日

【深度学习中的不确定性-贝叶斯CNN | TensorFlow概率】Uncertainty In Deep Learning — Bayesian CNN | TensorFlow Probability

【深度学习中的不确定性-贝叶斯CNN | TensorFlow概率】Uncertainty In Deep Learning — Bayesian CNN | TensorFlow Probability

专知会员服务

40+阅读 · 2022年3月19日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

【2022新书】强化学习工业应用，408页pdf

【2022新书】强化学习工业应用，408页pdf

专知会员服务

231+阅读 · 2022年2月3日

不可错过！Pisa大学最新《持续学习》课程，带你学习最新深度架构持续学习进展

不可错过！Pisa大学最新《持续学习》课程，带你学习最新深度架构持续学习进展

专知会员服务

29+阅读 · 2021年12月16日

【Manning新书】迁移学习自然语言处理，266页pdf，Transfer Learning for NLP

【Manning新书】迁移学习自然语言处理，266页pdf，Transfer Learning for NLP

专知会员服务

137+阅读 · 2021年11月6日

【Manning新书】现代Java实战，592页pdf

【Manning新书】现代Java实战，592页pdf

专知会员服务

101+阅读 · 2020年5月22日

【牛津大学ICLR2020】通过元学习的贝叶斯自适应深度RL, VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

【牛津大学ICLR2020】通过元学习的贝叶斯自适应深度RL, VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

专知会员服务

25+阅读 · 2020年2月28日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新6篇视觉问答（VQA）相关论文—目标推理、深度循环模型、可解释性、数据可视化、Triplet学习、基准

【论文推荐】最新6篇视觉问答（VQA）相关论文—目标推理、深度循环模型、可解释性、数据可视化、Triplet学习、基准

专知

15+阅读 · 2018年2月3日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

NOX2- ROS-线粒体在高血糖增加局麻药周围神经毒性中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

有机磷农药暴露对大鼠海马神经细胞的毒性效应及对映体选择性

国家自然科学基金

0+阅读 · 2013年12月31日

四溴双酚A对斑马鱼胚胎发育和神经行为毒性效应研究

国家自然科学基金

0+阅读 · 2013年12月31日

吸附-电催化氧化与纳滤法的耦合过程研究

国家自然科学基金

0+阅读 · 2012年12月31日

不确定环境下强化学习和决策的神经机制

国家自然科学基金

11+阅读 · 2012年12月31日

高维数据的图模型学习与统计推断

国家自然科学基金

8+阅读 · 2012年12月31日

Eulerian bond-cubic 模型渗流性质的数值研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于贝叶斯推理的模糊逻辑强化学习模型研究

国家自然科学基金

18+阅读 · 2012年12月31日

大气环境中黑碳在臭氧(O3)作用下的老化过程研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于"非监督-监督-激励"集成学习模式的机器人行为自主学习系统研究

国家自然科学基金

1+阅读 · 2010年12月31日

An Empirical Study on the Language Modal in Visual Question Answering

Arxiv

0+阅读 · 2023年5月17日

Task and Motion Planning with Large Language Models for Object Rearrangement

Arxiv

0+阅读 · 2023年5月16日

Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?

Arxiv

0+阅读 · 2023年5月16日

An Ensemble Approach for Automated Theorem Proving Based on Efficient Name Invariant Graph Neural Representations

Arxiv

0+阅读 · 2023年5月15日

A Control Approach for Human-Robot Ergonomic Payload Lifting

Arxiv

0+阅读 · 2023年5月15日

Towards Open World NeRF-Based SLAM

Arxiv

0+阅读 · 2023年5月14日

AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

Arxiv

0+阅读 · 2023年5月12日

Probabilistic Traversability Model for Risk-Aware Motion Planning in Off-Road Environments

Arxiv

0+阅读 · 2023年5月11日

Multimodal Prompting with Missing Modalities for Visual Recognition

Arxiv

11+阅读 · 2023年3月6日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

VIP会员

文章信息

相关主题

相关VIP内容

终身学习如何构建？NeurIPS2022《终身学习机》教程，70页ppt

终身学习如何构建？NeurIPS2022《终身学习机》教程，70页ppt

专知会员服务

46+阅读 · 2023年1月26日

【深度学习中的不确定性-贝叶斯CNN | TensorFlow概率】Uncertainty In Deep Learning — Bayesian CNN | TensorFlow Probability

【深度学习中的不确定性-贝叶斯CNN | TensorFlow概率】Uncertainty In Deep Learning — Bayesian CNN | TensorFlow Probability

专知会员服务

40+阅读 · 2022年3月19日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

【2022新书】强化学习工业应用，408页pdf

【2022新书】强化学习工业应用，408页pdf

专知会员服务

231+阅读 · 2022年2月3日

不可错过！Pisa大学最新《持续学习》课程，带你学习最新深度架构持续学习进展

不可错过！Pisa大学最新《持续学习》课程，带你学习最新深度架构持续学习进展

专知会员服务

29+阅读 · 2021年12月16日

【Manning新书】迁移学习自然语言处理，266页pdf，Transfer Learning for NLP

【Manning新书】迁移学习自然语言处理，266页pdf，Transfer Learning for NLP

专知会员服务

137+阅读 · 2021年11月6日

【Manning新书】现代Java实战，592页pdf

【Manning新书】现代Java实战，592页pdf

专知会员服务

101+阅读 · 2020年5月22日

【牛津大学ICLR2020】通过元学习的贝叶斯自适应深度RL, VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

【牛津大学ICLR2020】通过元学习的贝叶斯自适应深度RL, VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

专知会员服务

25+阅读 · 2020年2月28日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型中的检索与结构化增强生成综述

《实现多层防御多轮交战机制的扩展型随机齐射模型》2025年最新83页

【CMU博士论文】交互驱动的人体动作估计与生成

如何避免生成式人工智能在作战中失控失效

相关资讯

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新6篇视觉问答（VQA）相关论文—目标推理、深度循环模型、可解释性、数据可视化、Triplet学习、基准

【论文推荐】最新6篇视觉问答（VQA）相关论文—目标推理、深度循环模型、可解释性、数据可视化、Triplet学习、基准

专知

15+阅读 · 2018年2月3日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

An Empirical Study on the Language Modal in Visual Question Answering

Arxiv

0+阅读 · 2023年5月17日

Task and Motion Planning with Large Language Models for Object Rearrangement

Arxiv

0+阅读 · 2023年5月16日

Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?

Arxiv

0+阅读 · 2023年5月16日

An Ensemble Approach for Automated Theorem Proving Based on Efficient Name Invariant Graph Neural Representations

Arxiv

0+阅读 · 2023年5月15日

A Control Approach for Human-Robot Ergonomic Payload Lifting

Arxiv

0+阅读 · 2023年5月15日

Towards Open World NeRF-Based SLAM

Arxiv

0+阅读 · 2023年5月14日

AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners

Arxiv

0+阅读 · 2023年5月12日

Probabilistic Traversability Model for Risk-Aware Motion Planning in Off-Road Environments

Arxiv

0+阅读 · 2023年5月11日

Multimodal Prompting with Missing Modalities for Visual Recognition

Arxiv

11+阅读 · 2023年3月6日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

相关基金

NOX2- ROS-线粒体在高血糖增加局麻药周围神经毒性中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

有机磷农药暴露对大鼠海马神经细胞的毒性效应及对映体选择性

国家自然科学基金

0+阅读 · 2013年12月31日

四溴双酚A对斑马鱼胚胎发育和神经行为毒性效应研究

国家自然科学基金

0+阅读 · 2013年12月31日

吸附-电催化氧化与纳滤法的耦合过程研究

国家自然科学基金

0+阅读 · 2012年12月31日

不确定环境下强化学习和决策的神经机制

国家自然科学基金

11+阅读 · 2012年12月31日

高维数据的图模型学习与统计推断

国家自然科学基金

8+阅读 · 2012年12月31日

Eulerian bond-cubic 模型渗流性质的数值研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于贝叶斯推理的模糊逻辑强化学习模型研究

国家自然科学基金

18+阅读 · 2012年12月31日

大气环境中黑碳在臭氧(O3)作用下的老化过程研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于"非监督-监督-激励"集成学习模式的机器人行为自主学习系统研究

国家自然科学基金

1+阅读 · 2010年12月31日

微信扫码咨询专知VIP会员