Stars
vlm
3 repositories
LAVIS - A One-stop Library for Language-Vision Intelligence
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch