推推世界:使用工具和动产障碍物进行操纵规划的基准 (PushWorld: A benchmark for manipulation planning with tools and movable obstacles)

While recent advances in artificial intelligence have achieved human-level performance in environments like Starcraft and Go, many physical reasoning tasks remain challenging for modern algorithms. To date, few algorithms have been evaluated on physical tasks that involve manipulating objects when movable obstacles are present and when tools must be used to perform the manipulation. To promote research on such tasks, we introduce PushWorld, an environment with simplistic physics that requires manipulation planning with both movable obstacles and tools. We provide a benchmark of more than 200 PushWorld puzzles in PDDL and in an OpenAI Gym environment. We evaluate state-of-the-art classical planning and reinforcement learning algorithms on this benchmark, and we find that these baseline results are below human-level performance. We then provide a new classical planning heuristic that solves the most puzzles among the baselines, and although it is 40 times faster than the best baseline planner, it remains below human-level performance.

翻译：虽然最近人工智能的进步在Starcraft和Go等环境中取得了人类层面的绩效,但许多物理推理任务对现代算法仍然具有挑战性。到目前为止,在存在移动障碍和必须使用工具进行操纵时,很少有算法被评估到涉及操纵物体的物理任务。为了推动对此类任务的研究,我们引入了普什世界,这是一个简单物理学环境,需要用移动障碍和工具进行操纵规划。我们在PDDL和OpenAI Gym环境中提供了一个200多个推式世界拼图的基准。我们评估了这一基准的最先进的古典规划和强化学习算法,我们发现这些基线结果低于人类层面的绩效。我们随后提供了一种新的经典规划精华,解决了基线中最棘手的问题,尽管比最好的基线规划员快40倍,但它仍然低于人类层面的绩效。

相关内容

TOOLS

关注 0

这个新版本的工具会议系列恢复了从1989年到2012年的50个会议的传统。工具最初是“面向对象语言和系统的技术”，后来发展到包括软件技术的所有创新方面。今天许多最重要的软件概念都是在这里首次引入的。2019年TOOLS 50+1在俄罗斯喀山附近举行，以同样的创新精神、对所有与软件相关的事物的热情、科学稳健性和行业适用性的结合以及欢迎该领域所有趋势和社区的开放态度，延续了该系列。官网链接：http://tools2019.innopolis.ru/

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日