LLM as A Robotic Brain: Unifying Egocentric Memory and Control - Zhuanzhi (专知) Papers


Robots · Language models · Systems · Multi-round dialogue · Closed-loop

April 19, 2023

LLM as A Robotic Brain: Unifying Egocentric Memory and Control


Jinjie Mai, Jun Chen, Bing Li, Guocheng Qian, Mohamed Elhoseiny, Bernard Ghanem

Embodied AI studies and develops intelligent systems that have a physical or virtual embodiment (i.e., robots) and can interact dynamically with their environment. Memory and control are the two essential parts of an embodied system, and each usually requires its own modeling framework. In this paper, we propose a novel and generalizable framework called LLM-Brain, which uses a large-scale language model (LLM) as a robotic brain to unify egocentric memory and control. The LLM-Brain framework integrates multiple multimodal language models for robotic tasks, using a zero-shot learning approach. All components within LLM-Brain communicate in natural language through closed-loop multi-round dialogues that encompass perception, planning, control, and memory. The core of the system is an embodied LLM that maintains egocentric memory and controls the robot. We demonstrate LLM-Brain on two downstream tasks: active exploration and embodied question answering. Active exploration requires the robot to explore an unknown environment extensively within a limited number of actions. Embodied question answering requires the robot to answer questions based on observations acquired during prior exploration.

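The abstract describes a closed loop in which perception, planning, control, and memory exchange natural-language messages, but it gives no implementation details. The following is only a minimal sketch of that loop under stated assumptions: the toy corridor world, the `ToyRobot` and `StubBrain` classes, and the hard-coded "move forward" rule are all hypothetical stand-ins, and no real language model is called. It is meant to illustrate how a dialogue history can double as egocentric memory for both exploration and question answering, not to reproduce the authors' system.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical toy world: a corridor of rooms, each holding one object.
ROOMS = ["sofa", "fridge", "bed", "desk"]


@dataclass
class ToyRobot:
    """Stand-in for the robot body: perception and control stubs."""
    position: int = 0

    def observe(self) -> str:
        # Perception stub: describe the current view in natural language.
        return f"I am in room {self.position} and I see a {ROOMS[self.position]}."

    def act(self, action: str) -> None:
        # Control stub: execute a natural-language action command.
        if action == "move forward" and self.position < len(ROOMS) - 1:
            self.position += 1
        elif action == "move backward" and self.position > 0:
            self.position -= 1


@dataclass
class StubBrain:
    """Stand-in for the embodied LLM; the dialogue history is its memory."""
    memory: List[str] = field(default_factory=list)

    def plan(self, observation: str) -> str:
        self.memory.append(f"Eye: {observation}")
        # Placeholder rule; a real system would prompt an LLM with self.memory.
        action = "move forward"
        self.memory.append(f"Brain: {action}")
        return action

    def answer(self, target_object: str) -> str:
        # Embodied-QA stub: search remembered observations for the object.
        for turn in self.memory:
            if turn.startswith("Eye:") and f"a {target_object}." in turn:
                return turn[len("Eye: "):]
        return "I have not seen that."


def explore(robot: ToyRobot, brain: StubBrain, max_actions: int) -> List[str]:
    """Closed loop: observe, plan, act, repeated within an action budget."""
    for _ in range(max_actions):
        action = brain.plan(robot.observe())
        robot.act(action)
    return brain.memory
```

With a budget of three actions, the robot observes rooms 0 through 2 before stopping in room 3, so `brain.answer("bed")` can be served from memory while a question about the desk cannot. Swapping the placeholder rule in `plan` for a prompt to an actual model is where a real system would differ.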

