-
LAVIS Public
Forked from salesforce/LAVISLAVIS - A One-stop Library for Language-Vision Intelligence
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedJan 15, 2024 -
vit-pytorch Public
Forked from lucidrains/vit-pytorchImplementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Python MIT License UpdatedDec 23, 2023 -
Chinese-CLIP Public
Forked from OFA-Sys/Chinese-CLIPChinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Python MIT License UpdatedNov 29, 2023 -
-
you-get Public
Forked from soimort/you-get⏬ Dumb downloader that scrapes the web
Python Other UpdatedAug 3, 2023 -
bubogpt Public
Forked from magic-research/bubogptBuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
Python BSD 3-Clause "New" or "Revised" License UpdatedJul 21, 2023 -
SegFormer Public
Forked from NVlabs/SegFormerOfficial PyTorch implementation of SegFormer
Python Other UpdatedJun 13, 2023 -
BLIP Public
Forked from salesforce/BLIPPyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
-
Pretrained-Language-Model Public
Forked from huawei-noah/Pretrained-Language-ModelPretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
Python UpdatedMay 21, 2023 -
MiniGPT-4 Public
Forked from Vision-CAIR/MiniGPT-4MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
-
AliceMind Public
Forked from alibaba/AliceMindALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Python Apache License 2.0 UpdatedFeb 20, 2023 -
chatGPT-multimodal-bot Public
Forked from sahil280114/chatGPT-multimodal-botPython MIT License UpdatedJan 5, 2023 -
Cream Public
Forked from microsoft/CreamThis is a collection of our NAS and Vision Transformer work.
Python MIT License UpdatedDec 9, 2022 -
disco-diffusion Public
Forked from alembics/disco-diffusionJupyter Notebook Other UpdatedOct 24, 2022 -
-
unilm Public
Forked from microsoft/unilmLarge-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License UpdatedSep 23, 2022 -
Video-Captioning Public
Forked from Shreyz-max/Video-CaptioningVideo Captioning is an encoder decoder mode based on sequence to sequence learning
Python UpdatedAug 19, 2022 -
VideoX Public
Forked from microsoft/VideoXVideoX: a collection of video cross-modal models
Python Other UpdatedAug 9, 2022 -
dlrm Public
Forked from facebookresearch/dlrmAn implementation of a deep learning recommendation model (DLRM)
Python MIT License UpdatedJul 28, 2022 -
-
CLIP4Clip Public
Forked from ArrowLuo/CLIP4ClipAn official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Python MIT License UpdatedJun 1, 2022 -
mae Public
Forked from facebookresearch/maePyTorch implementation of MAE https//arxiv.org/abs/2111.06377
-
DALLE2-pytorch Public
Forked from lucidrains/DALLE2-pytorchImplementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Python MIT License UpdatedApr 17, 2022 -
OFA Public
Forked from OFA-Sys/OFAOfficial repository of OFA. Paper: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Python Apache License 2.0 UpdatedMar 29, 2022 -
t5-pegasus-chinese Public
Forked from SunnyGJing/t5-pegasus-chinese基于GOOGLE T5中文生成式模型的摘要生成/指代消解,支持batch批量生成,多进程
Python MIT License UpdatedMar 22, 2022 -
MASTER-pytorch Public
Forked from wenwenyu/MASTER-pytorchCode for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
Python MIT License UpdatedDec 26, 2021 -
PartialLabelingCSL Public
Forked from Alibaba-MIIL/PartialLabelingCSLOfficial implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"
Python MIT License UpdatedOct 29, 2021 -
CSRA Public
Forked from Kevinz-code/CSRAOfficial code of ICCV2021 paper "Residual Attention: A Simple but Effective Method for Multi-Label Recognition"
Python GNU Affero General Public License v3.0 UpdatedOct 8, 2021 -
MobileModels Public
Forked from KHwang9883/MobileModels手机品牌型号汇总 | Mobile Models | This repository is licensed under CC BY-NC-SA 4.0
UpdatedSep 16, 2021 -
Informer2020 Public
Forked from zhouhaoyi/Informer2020The GitHub repository for the paper "Informer" accepted by AAAI 2021.
Python Apache License 2.0 UpdatedAug 12, 2021