-
FG-CLIP Public
Forked from 360CVGroup/FG-CLIPNew generation of CLIP with fine grained discrimination capability, ICML2025
Python Apache License 2.0 UpdatedJul 10, 2025 -
Qwen2-VL-Finetune Public
Forked from 2U1/Qwen-VL-Series-FinetuneAn open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.
Python Apache License 2.0 UpdatedJun 2, 2025 -
CapArena Public
Forked from njucckevin/CapArenaAn Arena-style Automated Evaluation Benchmark for Detailed Captioning
Python UpdatedJun 1, 2025 -
MegaPairs Public
Forked from VectorSpaceLab/MegaPairsMegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval
Jupyter Notebook MIT License UpdatedMay 22, 2025 -
VLM_survey Public
Forked from jingyi0000/VLM_surveyCollection of AWESOME vision-language models for vision tasks
UpdatedMay 19, 2025 -
self-llm Public
Forked from datawhalechina/self-llm《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
Jupyter Notebook Apache License 2.0 UpdatedMay 8, 2025 -
tiny-universe Public
Forked from datawhalechina/tiny-universe《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
Jupyter Notebook UpdatedApr 30, 2025 -
HunyuanVideo Public
Forked from Tencent-Hunyuan/HunyuanVideoHunyuanVideo: A Systematic Framework For Large Video Generation Model
Python Other UpdatedApr 27, 2025 -
describe-anything Public
Forked from NVlabs/describe-anythingImplementation for Describe Anything: Detailed Localized Image and Video Captioning
Python Apache License 2.0 UpdatedApr 24, 2025 -
LLM_RethinkFun Public
Forked from XihWang/LLM_RethinkFunRethinkFun大模型个人学习笔记
Jupyter Notebook UpdatedApr 7, 2025 -
easy-dataset Public
Forked from ConardLi/easy-datasetA powerful tool for creating fine-tuning datasets for LLM
JavaScript UpdatedMar 30, 2025 -
AIInfra Public
Forked from Infrasys-AI/AIInfraAIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
Python Apache License 2.0 UpdatedMar 29, 2025 -
-
Qwen2.5-VL-Fine-Tuning Public
Forked from libing64/Qwen2.5-VL-Fine-TuningPython UpdatedMar 2, 2025 -
minimind Public
Forked from jingyaogong/minimind「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!
Python Apache License 2.0 UpdatedOct 13, 2024 -
LAVIS Public
Forked from salesforce/LAVISLAVIS - A One-stop Library for Language-Vision Intelligence
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedOct 11, 2024 -
minimind-v Public
Forked from jingyaogong/minimind-v「大模型」3小时从0训练27M参数的视觉多模态VLM,个人显卡即可推理训练!
Python Apache License 2.0 UpdatedOct 10, 2024 -
BEVBlip Public
Forked from BaranEkin/BEVBlipEfficient and lightweight Vision-Language model for Visual Question Answering in autonomous driving scenarios. The approach replaces images in BLIP's architecture with spatio-temporal BEV feature maps
Python UpdatedSep 30, 2024 -
-
End-to-end-Autonomous-Driving Public
Forked from OpenDriveLab/End-to-end-Autonomous-Driving[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
MIT License UpdatedAug 15, 2024 -
Chinese-CLIP Public
Forked from OFA-Sys/Chinese-CLIPChinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Python MIT License UpdatedAug 6, 2024 -
CLIP4Clip Public
Forked from ArrowLuo/CLIP4ClipAn official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Python MIT License UpdatedApr 12, 2024 -
Open-World-Papers Public
Forked from SarahRastegar/Open-World-PapersA comprehensive collection of open world papers from top tier conferences and journals
UpdatedMar 15, 2024 -
Awesome-Table-Recognition Public
Forked from cv-small-snails/Awesome-Table-RecognitionA curated list of resources dedicated to table recognition
UpdatedJan 28, 2024 -
3DAL_PyTorch Public
Forked from jacky121298/3DAL_PyTorchThis is the pytorch implementation of 3DAL proposed by Qi et. al, "Offboard 3D Object Detection from Point Cloud Sequences", CVPR 2021.
Python UpdatedOct 31, 2023 -
-
data-centric-AI Public
Forked from daochenzha/data-centric-AIA curated, but incomplete, list of data-centric AI resources.
UpdatedMay 11, 2023 -
AugSeg Public
Forked from ZhenZHAO/AugSeg[CVPR'23] Augmentation Matters: A Simple-yet-Effective Approach to Semi-supervised Semantic Segmentation
Python UpdatedMar 30, 2023 -
ssl_detection Public
Forked from google-research/ssl_detectionSemi-supervised learning for object detection
Python Apache License 2.0 UpdatedMar 24, 2023 -
Semi-supervised-learning Public
Forked from microsoft/Semi-supervised-learningA Unified Semi-Supervised Learning Codebase (NeurIPS'22)
Python MIT License UpdatedMar 18, 2023