Adversarial Locomotion and Motion Imitation for Humanoid Policy Learning

Video

Abstract

Humans exhibit diverse and expressive whole-body movements. However, attaining human-like whole-body coordination in humanoid robots remains challenging, as conventional approaches that mimic whole-body motions often neglect the distinct roles of upper and lower body. This oversight leads to computationally intensive policy learning and frequently causes robot instability and falls during real-world execution. To address these issues, we propose Adversarial Locomotion and Motion Imitation (ALMI), a novel framework that enables adversarial policy learning between upper and lower body. Specifically, the lower body aims to provide robust locomotion capabilities to follow velocity commands while the upper body tracks various motions. Conversely, the upper-body policy ensures effective motion tracking when the robot executes velocity-based movements. Through iterative updates, these policies achieve coordinated whole-body control, which can be extended to loco-manipulation tasks with teleoperation systems. Extensive experiments demonstrate that our method achieves robust locomotion and precise motion tracking in both simulation and on the full-size Unitree H1 robot. Additionally, we release a large-scale whole-body motion control dataset featuring high-quality episodic trajectories from MuJoCo simulations deployable on real robots.

Method

图片1

ALMI Deployment Setting

We conduct experiments using the Unitree-H1-2 robot equipped with the ROBOTERA XHAND robotic hand, applying ALMI-trained policies to the lower body control and various interfaces (i.e., open-loop controller, ALMI policy, and VR device) for the upper body control


图片1


Experiments


Motion Tracking

Shake hand and stand still
Punch and stand still
Salute and stand still
Wave both hands and stand still
Play the guitar and go forward
Play the violin and go forward
Wave left hand and go forward
Wave right hand and turn right
Play tennis with left hand and turn right
Play tennis with right hand and go forward
Drink and go backward
High five and go forward
Play golf and go backward
Wash arm and turn right
Go backward to the left
Make a circle and go forward

Loco-manipulation Tasks


Kitchen Manipulation Tasks
Pouring Water Task
Carrying Box Task
Cleaning the Floor Task
Office Task
Opening Door Task



ALMI-X Dataset and Foundational Model

We also release a large-scale whole-body motion control dataset - ALMI-X, featuring high-quality episodic trajectories from MuJoCo simulations deployable on real robots, based on our humanoid control policy - ALMI.


图片1

BibTeX