Hrs Humanoids In-the-Wild Dataset

Tycho871 pts0 comments

HIW-500: Humanoids In-the-Wild Dataset

Your browser does not support the video tag.

HIW-500 Humanoids In-the-Wild Dataset

An in-the-wild dataset

for whole-body humanoid robot learning.

II

Contributors

Overview

Built for learning from real homes, not lab-only scenes.

HIW-500: Humanoids In-the-Wild Dataset focuses on task execution in natural environments where layout,<br>object state, lighting, clutter, and human operating style vary from episode to episode.

The dataset is collected in real homes in Southeast Asia through human whole-body teleoperation of Unitree G1 ,<br>providing demonstrations for mobile manipulation, bimanual interaction, and long-horizon household skills.

0+<br>Hours

0+<br>Episodes

Data

0+<br>Tasks

Homes

Hardware

Acquisition setup

The hardware stack uses grippers for household manipulation, with a head camera<br>and wrist-mounted cameras for visual observations.

01<br>Gripper manipulation

End-effector grippers are used for household grasping, placing, and object interaction.

02<br>Head camera

Head-mounted camera provides a wider scene view for task context and navigation.

03<br>Hand stereo IR cameras

Wrist-mounted stereo IR cameras capture close-range observations around the robot hands.

Dataset

What data is collected?

Each episode records human whole-body teleoperation of Unitree G1 in real homes,<br>combining camera streams, robot states, action traces, and language annotations.

Head camera

RGB

Stereo

480P

30 FPS

Wrist camera

RGB

IR

Stereo

480P

30 FPS

State and actions

29-DoF Joints

End-Effector

IMU

Odometry

Metadata

Language annotation

Episode info

Camera intrinsics & extrinsics

Live demo

Open the Rerun preview in a new tab

Dataset Statistics

Task Coverage

Building children table

Hang hanger

Clean up the room

Setting the table

Restocking fridge

Kitchen organization

Hang keys on a hook

Move pillow to sofa

Sweep floor

Picking trash

Clothes washing

Task Episodes

Average Duration

Subtask Labels

161<br>Total subtask labels

148k+<br>Subtask annotation

Top 40<br>Shown in treemap

Every robot subtask is annotated with a fine-grained action label. Hover tiles for details.

Roadmap

Open Source timeline

June 2026

Release V1: 500+ hours of data

V2

Release more tasks and environments

LeRobot Integration

We host the dataset on Hugging Face in two formats: raw ROS bag / MCAP recordings and<br>a version in LeRobot format for robot learning workflows.

View raw dataset

View dataset in LeRobot format (Coming Soon)

Citation

@misc{hiw500_2026,<br>title={HIW-500: Humanoids In-the-Wild Dataset for Robot Learning},<br>author={BitRobot and Unitree and Hugging Face},<br>year={2026},<br>howpublished={\url{https://bitrobot-foundation.github.io/humanoids-in-the-wild-500-hours/}}

License Humanoid Data for Training & Evaluation

Access our growing in-the-wild humanoid dataset or request custom data collection

Request Data Access

dataset wild camera humanoids robot data

Related Articles