Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"

Python 279 12 Updated Apr 23, 2025
Python 50 1 Updated Apr 28, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,068 518 Updated Jun 9, 2025

Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation

Python 109 Updated Apr 16, 2025

[NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.

Python 50 Updated Oct 14, 2024

High-resolution models for human tasks.

Python 5,207 303 Updated Nov 18, 2024

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,770 76 Updated Oct 22, 2025

(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator

Python 114 Updated Mar 21, 2025
Python 359 15 Updated Oct 21, 2024
Python 131 7 Updated Aug 10, 2024

CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)

Python 348 8 Updated Jul 26, 2024

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,470 544 Updated Nov 10, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 27,846 2,767 Updated Apr 30, 2025

AvatarStudio: High-fidelity and Animatable 3D Avatar Creation from Text

117 7 Updated Nov 30, 2023

[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"

Python 10,876 1,107 Updated Aug 29, 2025

[NeurIPS 2023] XAGen: 3D Expressive Human Avatars Generation

Python 77 8 Updated Apr 5, 2024

[ICCV 2023] GETAvatar: Generative Textured Meshes for Animatable Human Avatars

Python 113 9 Updated Jun 18, 2025

[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Python 1,132 57 Updated Sep 13, 2025

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

1,888 78 Updated Dec 24, 2024

MagicEdit: High-Fidelity Temporally Coherent Video Editing

1,805 102 Updated Aug 29, 2023

MagicAvatar: Multimodal Avatar Generation and Animation

624 34 Updated Aug 29, 2023

[CVPR 2024] ViT-Lens: Towards Omni-modal Representations

Python 183 11 Updated Feb 3, 2025

[NeurIPS2023] DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

Python 320 15 Updated Nov 3, 2023

[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding

Python 369 34 Updated May 8, 2024

[CVPR2024, Highlight] Official code for DragDiffusion

Python 1,239 96 Updated Jan 29, 2024

The repository for paper Unsupervised Volumetric Animation

Python 69 1 Updated Sep 22, 2023
43 3 Updated Nov 12, 2024

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Python 4,367 399 Updated Oct 25, 2023
Jupyter Notebook 81 3 Updated Aug 1, 2023

[Image 2 Text Para] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

Python 821 55 Updated Apr 28, 2023