Stars
[NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation
GoatWu / Self-Forcing-Plus
Forked from guandeh17/Self-ForcingUnofficial extension implementation of Self-Forcing to support I2V && 14B training.
SGLang is a fast serving framework for large language models and vision language models.
Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
A curated list of Diffusion Model in RL resources (continually updated)
[NeurIPS 2025] Improving Video Generation with Human Feedback
Tarsier -- a family of large-scale video-language models, which is designed to generate high-quality video descriptions , together with good capability of general video understanding.
A pipeline parallel training script for diffusion models.
Open-Sora: Democratizing Efficient Video Production for All
Official Implementation for CVPR2023 paper "GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning"
ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information
qaneel / kohya-trainer
Forked from Linaqruf/kohya-trainerAdapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning
[CVPR 2024] DreamComposer: Controllable 3D Object Generation via Multi-View Conditions
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
The state-of-the-art image restoration model without nonlinear activation functions.
Unofficial implementation of Image Super-Resolution via Iterative Refinement by Pytorch
CVPR2023 - Activating More Pixels in Image Super-Resolution Transformer TPAMI - HAT: Hybrid Attention Transformer for Image Restoration
[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
X-Super-Resolution is dedicated to presenting the research efforts of XPixel in the realm of image super-resolution.
Awesome List of Attention Modules and Plug&Play Modules in Computer Vision
ECCV2020 - Practical Deep Raw Image Denoising on Mobile Devices
Official repository of "Deep Coupled Feedback Network for Joint Exposure Fusion and Image Super-Resolution"
This is the code for "Multi-Exposure Image Fusion via Deep Perceptual Enhancement".
A PyTorch implementation of 'Deep Bilateral Learning for Real-Time Image Enhancement'
A simple and light-weight camera image processing pipeline