ArXiv

The paper presents a complete real-time whole-body teleoperation pipeline that…

Authors
Hamza Ahmed Durrani, Suleman Khan
Categories
cs.RO
arXiv
https://arxiv.org/abs/2605.12347v1
PDF
https://arxiv.org/pdf/2605.12347v1

Brief

The paper tackles low-latency, whole-body humanoid teleoperation by mapping Virdyn IMU suit data to a Unitree G1 using a custom motion-processing, kinematic-retargeting, and control pipeline that avoids offline buffering and learning-based modules. Validated in MuJoCo then transferred unchanged to the physical robot, the system reportedly achieves stable, synchronized reproduction of a wide motion repertoire; summary based on the abstract and metadata.

Why it matters

The paper presents a complete real-time whole-body teleoperation pipeline that maps a Virdyn IMU-based full-body motion-capture suit directly onto a Unitree G1 humanoid; validated first in MuJoCo (sim2sim) and then deployed without modification on the real robot (sim2real), reproducing walking, standing, sitting, turning, bowing, and coordinated expressive gestures with stable, synchronized performance.

Key details

  • The system uses a custom motion-processing, kinematic-retargeting, and control pipeline engineered for continuous, low-latency operation with no offline buffering or learning-based components; authored by Hamza Ahmed Durrani and Suleman Khan (arXiv:2605.12347v1, 2026-05-12; 8 pages, 4 figures).
Source evidence

Abstract

Stable, low-latency whole-body teleoperation of humanoid robots is an open research challenge, complicated by kinematic mismatches between human and robot morphologies, accumulated inertial sensor noise, non-trivial control latency, and persistent sim-to-real transfer gaps. This paper presents a complete real-time whole-body teleoperation system that maps human motion, recorded with a Virdyn IMU-based full-body motion capture suit, directly onto a Unitree G1 humanoid robot. We introduce a custom motion-processing, kinematic retargeting, and control pipeline engineered for continuous, low-latency operation without any offline buffering or learning-based components. The system is first validated in simulation using the MuJoCo physics model of the Unitree G1 (sim2sim), and then deployed without modification on the physical platform (sim2real). Experimental results demonstrate stable, synchronized reproduction of a broad motion repertoire, including walking, standing, sitting, turning, bowing, and coordinated expressive full-body gestures. This work establishes a practical, scalable framework for whole-body humanoid teleoperation using commodity wearable motion capture hardware.

Comment: 8 pages, 4 figures