Robot Task Planning for Low Entropy Belief States - 专知 Papers

Robotics · Performer · Estimation/Estimator · State estimation · AIM

November 18, 2020

Robot Task Planning for Low Entropy Belief States

Alphonsus Adu-Bredu, Zhen Zeng, Neha Pusalkar, Odest Chadwicke Jenkins

Recent advances in computational perception have significantly improved the ability of autonomous robots to perform state estimation with low entropy. Such advances motivate a reconsideration of robot decision-making under uncertainty. Current approaches to solving sequential decision-making problems model states as inhabiting the extremes of the perceptual entropy spectrum. As such, these methods are either incapable of overcoming perceptual errors or asymptotically inefficient in solving problems with low perceptual entropy. With low entropy perception in mind, we aim to explore a happier medium that balances computational efficiency with the forms of uncertainty we now observe from modern robot perception. We propose FastDownward Replanner (FD-Replan) as an efficient task planning method for goal-directed robot reasoning. FD-Replan combines belief space representation with the fast, goal-directed features of classical planning to efficiently plan for low entropy goal-directed reasoning tasks. We compare FD-Replan with current classical planning and belief space planning approaches by solving low entropy goal-directed grocery packing tasks in simulation. FD-Replan shows positive results and promise with respect to planning time, execution time, and task success rate in our simulation experiments.
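To make the phrase "low entropy belief state" concrete, the following is a minimal sketch that computes the Shannon entropy of a discrete belief over object labels and picks the most likely label. It is not the authors' FD-Replan implementation. The `Belief` type, the function names, and the grocery labels with their probabilities are all hypothetical, chosen only for illustration.

```python
import math
from typing import Dict

# Hypothetical representation: object label -> probability (sums to 1).
Belief = Dict[str, float]


def entropy_bits(belief: Belief) -> float:
    """Shannon entropy of a discrete belief, in bits."""
    # Zero-probability entries contribute nothing and are skipped
    # to avoid evaluating log2(0).
    return -sum(p * math.log2(p) for p in belief.values() if p > 0.0)


def most_likely(belief: Belief) -> str:
    """Determinize a belief by taking its highest-probability label."""
    return max(belief, key=belief.get)


# Illustrative beliefs about which grocery item a detection corresponds to.
confident = {"cereal": 0.97, "pasta": 0.02, "soup": 0.01}  # low entropy
uniform = {"cereal": 1 / 3, "pasta": 1 / 3, "soup": 1 / 3}  # maximum entropy

print(f"confident: {entropy_bits(confident):.3f} bits, "
      f"guess = {most_likely(confident)}")
print(f"uniform:   {entropy_bits(uniform):.3f} bits")
```

A sharply peaked belief has entropy near zero, so planning on its most likely state is usually correct. A uniform belief over three labels reaches the maximum of log2(3) bits, where a single determinized guess is often wrong.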
