-
HKUST-GZ
- GuangZhou
- https://www.hkust-gz.edu.cn/zh/
Stars
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Eagle: Frontier Vision-Language Models with Data-Centric Strategies
SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation
Block Diffusion for Ultra-Fast Speculative Decoding
UniVideo: Unified Understanding, Generation, and Editing for Videos
VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Any4D: Unified Feed-Forward Metric 4D Reconstruction
A general physic-based retargeting framework.
UniTacHand: Unified Spatio-Tactile Representation for Human-to-Dexterous-Hand Skill Transfer
Ctrl-World: A Controllable Generative World Model for Robot Manipualtion
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
SOTAMak1r / Infinite-Forcing
Forked from guandeh17/Self-ForcingInfinite-Forcing: Towards Infinite-Long Video Generation
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model
Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"
The official repository of "Astra : General Interactive World Model with Autoregressive Denoising"