Stars
Generate game character animations with AI. Text to sprite sheet in seconds.
The official SpeakerVid-5M data curation code.
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
Official implementation of the paper "Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model"
Database of Steam video game data from October 2024, including game details, genres, reviews, tags, and SteamSpy insights
A generative world for general-purpose robotics & embodied AI learning.
An implementation of the principle of Maximal Coding Rate Reduction (MCR2).
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Productive, portable, and performant GPU programming in Python.
Companion repository for the paper "Representation Learning via Manifold Flattening and Reconstruction"
This is the official implementation for Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models.
Hardware-synchronized device for FAST-LIVO (Handheld & UAV).
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
An open-source impl. of Large Reconstruction Models
Code for CRATE (Coding RAte reduction TransformEr).
[NeurIPS 2023] Official implementation of the paper "DreamWaltz: Make a Scene with Complex 3D Animatable Avatars".
Python 3.8+ toolbox for submitting jobs to Slurm
PyTorch implementation of ``User-Controllable Latent Transformer for StyleGAN Image Layout Editing'' [Computer Graphics Forum (Proc. of Pacific Graphics 2022)]
A unified framework for 3D content generation.
PyTorch implementation of the ICCV paper "3D-aware Image Generation using 2D Diffusion Models"
ImageBind One Embedding Space to Bind Them All
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
[CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…