Stars
View disk space usage and delete unwanted data, fast.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
verl: Volcano Engine Reinforcement Learning for LLMs
Official implementation of SwiftSketch
List of Hackathon Hackers' personal sites.
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
GUI Grounding for Professional High-Resolution Computer Use
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
Web application generating interactive and highly customizable maps
The source code of the Medieval Fantasy City Generator
Home of StarCoder: fine-tuning & inference!
Procedurally generated Chinese landscape painting.
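The PDF version of "Deepseek: From Beginner to Mastery" that has been circulating widely online, from the Metaverse Culture Lab, New Media Research Center, School of Journalism and Communication, Tsinghua University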
Janus-Series: Unified Multimodal Understanding and Generation Models
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Official repo of Text-Free Learning of a Natural Language Interface for Pretrained Face Generators
Motion-conditional image animation for video editing
CoTracker is a model for tracking any point (pixel) on a video.
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction