Stars
[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
OmniGen2: Exploration to Advanced Multimodal Generation.
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
A lightweight LMM-based Document Parsing Model
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
[NeurIPS 2025] Direct3D‑S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention
Official Code Release for [SIGGRAPH 2025] RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination
A Python-embedded modeling language for convex optimization problems.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Agent S: an open agentic framework that uses computers like a human
基于 FastAPI 构建的企业级后端架构解决方案
fastapi + pydantic-v2 + sqlalchemy 2.0 + alembic + mysql + redis
A ComfyUI custom node designed for advanced image background removal and object, face, clothes, and fashion segmentation, utilizing multiple models including RMBG-2.0, INSPYRENET, BEN, BEN2, BiRefN…
本仓库将BiRefNet最新模型封装为ComfyUI节点来使用,相较于旧模型来说,最新模型的抠图精度更高更好。This repository wraps the latest BiRefNet model as ComfyUI nodes. Compared to the previous model, the latest model offers higher and better ma…
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
OpenMMLab Detection Toolbox and Benchmark
[CVPR 2025] VideoWorld is a simple generative model that learns purely from unlabeled videos—much like how babies learn by observing their environment.
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
Project page of Motionshop-2: An advanced version of the Motionshop; We add a new fancy feature: 3D Animation Engine; AI for computer graphics