Starred repositories
Tensors and Dynamic neural networks in Python with strong GPU acceleration
😘 Makes you "love" GitHub: fixes broken images and slow loading when accessing it. (No installation required)
Official Project Page for Deep Delta Learning (https://huggingface.co/papers/2601.00417)
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
🐍 Geometric Computer Vision Library for Spatial AI
Effortless data labeling with AI support from Segment Anything and other awesome models.
Datasets for deep learning with satellite & aerial imagery
Techniques for deep learning with satellite & aerial imagery
High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
[CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
[CVPR 2025] The official implementation of EMRDM, which is a novel diffusion model for cloud removal of remote sensing images.
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
Models and examples built with TensorFlow
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Light Image Video Generation Inference Framework
Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" and Awesome-Text2X-Resources. Watch this repos…
A collection of diffusion model papers categorized by their subareas
⏰ Collaboratively track worldwide conference deadlines (Website, Python CLI, WeChat Applet) / If you find it useful, please star this project, thanks~
Interactive visualizations of the geometric intuition behind diffusion models.
Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision
Implementation of popular deep learning networks with TensorRT network definition API
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
EarthVL: A Progressive Earth Vision-Language Understanding and Generation Framework
[AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering