-
The Hong Kong University of Science and Technology (Guangzhou)
-
01:37
(UTC +08:00) - https://yipko.com
Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Stars
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning
WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine, reason, and act in the physical world. Unlike passive vide…
starVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation"
Implementation of deep learning papers.
[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
Wan: Open and Advanced Large-Scale Video Generative Models
Official repo for GraspGen: A Diffusion-based Framework for 6-DOF Grasping
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Legacy-Mess Detector – assess the “legacy-mess level” of your code and output a beautiful report | 屎山代码检测器,评估代码的“屎山等级”并输出美观的报告
ModelTC / Wan2.2-Lightning
Forked from Wan-Video/Wan2.2Wan2.2-Lightning: Speed up wan2.2 model with distillation
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI
Converts a depth map image to a normal map image using Python
An intuitive and low-overhead instrumentation tool for Python
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
M2T2: Multi-Task Masked Transformer for Object-centric Pick and Plac
Code and website for "GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation"
A python module to repair invalid JSON from LLMs