- NanJing
-
DF-GAN Public
[CVPR2022 oral] A Simple and Effective Baseline for Text-to-Image Synthesis
-
A Survey on End-to-End (One-Step) Visual Generative Models
1 UpdatedJul 7, 2025 -
A Survey on World Models for Autonomous Driving
-
-
StoryImager Public
[ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
-
CoIn Public
[ACM MM 2024] A fast and effective Story Visualization and Continuation Model
-
-
GALIP Public
[CVPR2023] A faster, smaller, and better text-to-image model for large-scale training
-
Transformers-Tutorials Public
Forked from NielsRogge/Transformers-TutorialsThis repository contains demos I made with the Transformers library by HuggingFace.
-
GLIGEN Public
Forked from gligen/GLIGENOpen-Set Grounded Text-to-Image Generation
Python UpdatedMar 4, 2023 -
DE-Net Public
[AAAI 2023] Dynamic Text-guided Image Editing Adversarial Networks
-
-
Awesome-Text-to-Image Public
Forked from Yutong-Zhou-cv/Awesome-Text-to-ImageA Survey on Text-to-Image Generation/Synthesis.
-
disco-diffusion Public
Forked from alembics/disco-diffusionJupyter Notebook Other UpdatedMay 26, 2022 -
mmgeneration Public
Forked from open-mmlab/mmgenerationMMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
Python Apache License 2.0 UpdatedMay 7, 2022 -
CLIP-Guided-Diffusion Public
Forked from nerdyrodent/CLIP-Guided-DiffusionJust playing with getting CLIP Guided Diffusion running locally, rather than having to use colab.
Python MIT License UpdatedMay 1, 2022 -
DenseCLIP Public
Forked from raoyongming/DenseCLIP[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting
Python UpdatedApr 14, 2022 -
MaskGIT-pytorch Public
Forked from dome272/MaskGIT-pytorchPytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)
Python MIT License UpdatedFeb 23, 2022 -
intro_dgm Public
Forked from jmtomczak/intro_dgmAn Introduction to Deep Generative Modeling: Examples
Jupyter Notebook MIT License UpdatedFeb 21, 2022 -
blended-diffusion Public
Forked from omriav/blended-diffusionOfficial implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
Jupyter Notebook MIT License UpdatedFeb 15, 2022 -
-
BLIP Public
Forked from salesforce/BLIPPyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedFeb 3, 2022 -
lang-seg Public
Forked from isl-org/lang-segLanguage-Driven Semantic Segmentation
Jupyter Notebook MIT License UpdatedJan 11, 2022 -
vit-pytorch Public
Forked from lucidrains/vit-pytorchImplementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Python MIT License UpdatedDec 31, 2021 -
ViLT Public
Forked from dandelin/ViLTCode for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
Python Apache License 2.0 UpdatedDec 30, 2021 -
gansformer Public
Forked from dorarad/gansformerGenerative Adversarial Transformers
Python MIT License UpdatedDec 28, 2021 -
aphantasia Public
Forked from eps696/aphantasiaCLIP + FFT/DWT/RGB = text to image/video
Python MIT License UpdatedNov 27, 2021 -
-
annotated_deep_learning_paper_implementations Public
Forked from labmlai/annotated_deep_learning_paper_implementations🧑🏫 50! Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cycle…
Jupyter Notebook MIT License UpdatedOct 22, 2021 -
Swin-Transformer Public
Forked from microsoft/Swin-TransformerThis is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Python MIT License UpdatedOct 13, 2021