Developing robots that are capable of many skills and able to generalize to unseen scenarios requires progress on two fronts: efficient collection of large and diverse datasets, and training of high-capacity policies on the collected data. While large datasets have propelled progress in other fields like computer vision and natural language processing, collecting data of comparable scale is particularly challenging for physical systems like robots. In this work, we propose a framework to bridge this gap and better scale up robot learning, through the lens of multi-task, multi-scene robot manipulation in kitchen environments. Our framework, named CACTI, has four stages that separately handle data collection, data augmentation, visual representation learning, and imitation policy training. In the CACTI framework, we highlight the benefit of adapting state-of-the-art models for image generation as part of the augmentation stage, and the significant improvement in training efficiency gained by using pretrained out-of-domain visual representations at the compression stage. Experimentally, we demonstrate that 1) on a real robot setup, CACTI enables efficient training of a single policy that can perform 10 manipulation tasks involving kitchen objects and is robust to varying layouts of distractor objects; and 2) in a simulated kitchen environment, CACTI trains a single policy on 18 semantic tasks across up to 50 layout variations per task. The simulation task benchmark and the augmented datasets in both real and simulated environments will be released to facilitate future research.