Highlights
- Pro
Stars
Code release for "Reading Recognition in the Wild".
Code release for "Reading Recognition in the Wild".
Character-aware audio-only subtitling
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
[ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman
Dataset page for Look, Listen and Recognise : character-aware audio-visual subtitling (ICASSP 2024)
The official PyTorch implementation of Google's Gemma models
Large World Model -- Modeling Text and Video with Millions Context
Includes FSC-147-D and the code for training and testing the CounTX model from the paper Open-world Text-specified Object Counting.
AI & parametric QR code generator. AI & 参数化二维码生成器。https://qrbtf.com
A high-throughput and memory-efficient inference and serving engine for LLMs
QLoRA: Efficient Finetuning of Quantized LLMs
ImageBind One Embedding Space to Bind Them All
PyTorch code and models for the DINOv2 self-supervised learning method.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Source code for the X Recommendation Algorithm
An open-source framework for training large multimodal models.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
Examples and guides for using the OpenAI API
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Instruct-tune LLaMA on consumer hardware
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"