Stars
Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions
Refine high-quality datasets and visual AI models
FairMOT for Multi-Class MOT using YOLOX as Detector