From loco-motion to dextrous manipulation, humanoid robots have made remarkable strides in demonstrating complex full-body capabilities. However, the majority of current robot learning datasets and benchmarks mainly focus on stationary robot arms, and the few existing humanoid datasets are either confined to fixed environments or limited in task diversity, often lacking human-humanoid interaction and lower-body locomotion. Moreover, there are a few standardized evaluation platforms for benchmarking learning-based policies on humanoid data. In this work, we present Humanoid Everyday, a large-scale and diverse humanoid manipulation dataset characterized by extensive task variety involving dextrous object manipulation, human-humanoid interaction, locomotion-integrated actions, and more. Leveraging a highly efficient human-supervised teleoperation pipeline, Humanoid Everyday aggregates high-quality multimodal sensory data, including RGB, depth, LiDAR, and tactile inputs, together with natural language annotations, comprising 10.3k trajectories and over 3 million frames of data across 260 tasks across 7 broad categories. In addition, we conduct an analysis of representative policy learning methods on our dataset, providing insights into their strengths and limitations across different task categories. For standardized evaluation, we introduce a cloud-based evaluation platform that allows researchers to seamlessly deploy their policies in our controlled setting and receive performance feedback. By releasing Humanoid Everyday along with our policy learning analysis and a standardized cloud-based evaluation platform, we intend to advance research in general-purpose humanoid manipulation and lay the groundwork for more capable and embodied robotic agents in real-world scenarios. Our dataset, data collection code, and cloud evaluation website are made publicly available on our project website.
翻译:从运动到灵巧操作,人形机器人在展现复杂全身能力方面取得了显著进展。然而,当前大多数机器人学习数据集和基准测试主要集中于固定机械臂,而现存的少数人形数据集要么局限于固定环境,要么任务多样性有限,通常缺乏人-人形交互及下半身运动功能。此外,目前缺乏用于评估基于学习策略在人形数据上性能的标准化评估平台。本研究提出"人形机器人日常"——一个大规模、多样化的人形操作数据集,其特点在于涵盖涉及灵巧物体操作、人-人形交互、运动整合动作等广泛任务类型。通过采用高效的人工监督遥操作流程,该数据集聚合了包括RGB、深度、激光雷达和触觉输入在内的高质量多模态感知数据,并辅以自然语言标注,共包含10.3万条轨迹和超过300万帧数据,涵盖7个大类260项任务。此外,我们在数据集上对代表性策略学习方法进行分析,揭示其在不同任务类别中的优势与局限。为实现标准化评估,我们推出了基于云的评估平台,使研究人员能够在受控环境中无缝部署其策略并获取性能反馈。通过公开发布"人形机器人日常"数据集、策略学习分析及标准化云评估平台,我们旨在推动通用人形操作研究的发展,为现实场景中更具能力与具身性的机器人智能体奠定基础。本项目的网站已公开提供数据集、数据采集代码及云端评估平台。