Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View hejingwenhejingwen's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report hejingwenhejingwen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025] Improving Video Generation with Human Feedback

Python 313 6 Updated Sep 24, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,524 82 Updated Oct 28, 2025

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Python 84 3 Updated Sep 12, 2025

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,706 175 Updated Oct 4, 2025

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,576 75 Updated Oct 23, 2025

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 1,090 78 Updated Mar 29, 2025

[SIGGRAPH Asia 2024, Best Paper Honorable Mention] This is the official implementation of our SIGGRAPH Asia journal artical: TEXGen: a Generative Diffusion Model for Mesh Textures

Python 313 9 Updated Dec 18, 2024

A light-weight and high-efficient training framework for accelerating diffusion tasks.

Python 50 2 Updated Sep 14, 2024

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 916 23 Updated Mar 17, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,078 1,204 Updated Sep 7, 2025

In 2024, the strongest open-source implementation of asymmetric magvit_v2 supports inference code but excludes VQVAE. It supports the joint encoding of images and videos, accommodating arbitrary vi…

Python 150 1 Updated Jul 30, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,987 395 Updated Jul 10, 2024

Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 556 31 Updated Sep 16, 2024
62 Updated Jun 25, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,236 94 Updated Feb 16, 2025

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,910 382 Updated Mar 14, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,290 86 Updated Oct 16, 2025

An Open-source Toolkit for LLM Development

Python 2,790 176 Updated Jan 13, 2025

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,217 195 Updated Oct 31, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 27,687 2,745 Updated Apr 30, 2025

[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.

Python 1,881 189 Updated Oct 30, 2025

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Python 881 51 Updated Jan 3, 2025

[ICCV 2023] MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond.

Python 303 13 Updated Jun 5, 2024

[CVPR 2024] CoSeR: Bridging Image and Language for Cognitive Super-Resolution

350 11 Updated Aug 5, 2024

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Python 532 21 Updated Jan 18, 2024

[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

Python 1,375 79 Updated Sep 27, 2024

[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.

Python 507 27 Updated Mar 7, 2024

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 3,142 272 Updated Jan 10, 2025

Implementation of MagViT2 Tokenizer in Pytorch

Python 645 34 Updated Jan 12, 2025

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Python 912 88 Updated Feb 29, 2024
Next