    Repositories list

    • physical-ai-bench

      Public
      PAI-Bench: A Comprehensive Benchmark for Physical AI
      Python
      04010 Updated Dec 2, 2025
    • Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models, 2023
      Python
      813570 Updated Oct 22, 2025
    • VisPer-LM

      Public
      [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation
      Python
      16920 Updated Oct 17, 2025
    • IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025
      Python
      33010 Updated Oct 1, 2025
    • StyleNAT

      Public
      New flexible and efficient image generation framework that sets new SOTA on FFHQ-256 with FID 2.05, 2022
      Python
      1310100 Updated Jun 26, 2025
    • Python
      12720 Updated Apr 8, 2025
    • Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment, arXiv 2024 / CVPR 2025
      Python
      23810 Updated Mar 1, 2025
    • Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)
      Python
      8453972 Updated Nov 5, 2024
    • Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models, arXiv 2023 / CVPR 2024
      Python
      7353130 Updated Sep 24, 2024
    • CuMo

      Public
      CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
      Python
      816201 Updated Jun 8, 2024
    • Neighborhood Attention Transformer, arXiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arXiv 2022
      Python
      881.2k50 Updated May 15, 2024
    • VCoder

      Public
      [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models
      Python
      1628041 Updated Apr 17, 2024
    • Rethinking-Text-Segmentation

      Public
      [CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach
      Python
      29270130 Updated Dec 2, 2023
    • Matting Anything Model (MAM), an efficient and versatile framework for estimating the alpha matte of any instance in an image with flexible and interactive visual or linguistic user prompt guidance.
      Python
      4968981 Updated Nov 18, 2023
    • Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arXiv 2023 / CVPR 2024
      Python
      38757151 Updated Nov 16, 2023
    • VIM

      Public
      Python
      46340 Updated Nov 8, 2023
    • Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
      Python
      851.3k91 Updated Aug 10, 2023
    • [Colab Demo Code] OneFormer: One Transformer to Rule Universal Image Segmentation.
      Python
      101410 Updated May 24, 2023
    • PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models, 2023
      Python
      21300 Updated May 19, 2023
    • a copy of "Text-to-Image Diffusion Models are Zero-Shot Video Generators", ICCV 2023
      Python
      389200 Updated May 6, 2023
    • Python
      158260 Updated Apr 10, 2023
    • SH-GAN

      Public
      [WACV 2023] Image Completion with Heterogeneously Filtered Spectral Hints
      Python
      46930 Updated Mar 28, 2023
    • Boosted Dynamic Neural Networks, AAAI 2023
      Python
      3810 Updated Dec 1, 2022
    • VMFormer

      Public
      [Preprint] VMFormer: End-to-End Video Matting with Transformer
      Python
      912080 Updated Nov 30, 2022
    • [CVPR 2020] Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation
      Python
      149252 Updated Nov 22, 2022
    • [Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 2021
      Python
      1616740 Updated Oct 11, 2022
    • [CVPR 2022 Oral] Towards Layer-wise Image Vectorization
      Python
      63200 Updated Jun 10, 2022
    • [CVPR 2022] VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
      Python
      23000 Updated Jun 9, 2022
    • SinNeRF

      Public
      "SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image", Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Humphrey Shi, Zhangyang Wang
      Python
      25000 Updated May 3, 2022
    • Python
      10000 Updated May 1, 2022