Thanks to visit codestin.com
Credit goes to text-op.github.io

TextOp: Real-time Interactive Text-Driven Humanoid Robot Motion Generation and Control

1Institute of Artificial Intelligence (TeleAI), China Telecom
2Shanghai Jiao Tong University 3East China University of Science and Technology
*Equal contribution Corresponding author
TeleAI SJTU ECUST

Video

Abstract

Recent advances in humanoid whole-body motion tracking have enabled the execution of diverse and highly coordinated motions on real hardware. However, existing controllers are commonly driven either by predefined motion trajectories, which offer limited flexibility when user intent changes, or by continuous human teleoperation, which requires constant human involvement and limits autonomy. This work addresses the problem of how to drive a universal humanoid controller in a real-time and interactive manner. We present TextOp, a real-time text-driven humanoid motion generation and control framework that supports streaming language commands and on-the-fly instruction modification during execution. TextOp adopts a two-level architecture in which a high-level autoregressive motion diffusion model continuously generates short-horizon kinematic trajectories conditioned on the current text input, while a low-level motion tracking policy executes these trajectories on a physical humanoid robot. By bridging interactive motion generation with robust whole-body control, TextOp unlocks free-form intent expression and enables smooth transitions across multiple challenging behaviors such as dancing and jumping, within a single continuous motion execution. Extensive real-robot experiments and offline evaluations demonstrate instant responsiveness, smooth whole-body motion, and precise control. The project page and the open-source code are available at https://text-op.github.io/ .

Teaser Image

Method

Method Image

Real Robot Experiments

One-Take Demonstration

Multi Skill Demonstration

Daily Life
Dancing
Gestures
Musical Instruments
Jumping
Kungfu
Locomotion
Sports
Stretching

Robustness to Perturbation

Citation

@article{xie2025textop,
  title={TextOp: Real-time Interactive Text-Driven Humanoid Robot Motion Generation and Control},
  author={Xie, Weiji and Zheng, Jiakun and Han, Jinrui and Shi, Jiyuan and Zhang, Weinan and Bai, Chenjia and Li, Xuelong},
  journal={arXiv preprint},
  year={2026}
}