-
linzhiqiu.github.io Public
Forked from RayeRen/acad-homepage.github.ioZhiqiu Lin's site
-
-
-
MiraData Public
Forked from mira-space/MiraDataOfficial repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
Python Other UpdatedNov 1, 2025 -
ShareGPT4Video Public
Forked from ShareGPT4Omni/ShareGPT4Video[NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"
Python UpdatedNov 1, 2025 -
-
t2v_metrics Public
Evaluating text-to-image/video/3D models with VQAScore
-
-
cross_modal_adaptation Public
Cross-modal few-shot adaptation with CLIP
-
-
lmms-eval Public
Forked from EvolvingLMMs-Lab/lmms-evalAccelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Python Other UpdatedFeb 14, 2025 -
streamlit-video-captioning Public
Forked from streamlit/llm-examplesStreamlit LLM app
Python Apache License 2.0 UpdatedJan 30, 2025 -
pytorchvideo Public
Forked from facebookresearch/pytorchvideoA deep learning library for video understanding research.
Python Apache License 2.0 UpdatedJan 25, 2025 -
streamlit-feedback-video Public
Forked from trubrics/streamlit-feedbackCollect user feedback from within your Streamlit app
JavaScript MIT License UpdatedJan 13, 2025 -
CLIP-FlanT5 Public
Training code for CLIP-FlanT5
-
llm-can-optimize-vlm.github.io Public
Forked from llm-can-optimize-vlm/llm-can-optimize-vlm.github.ioJavaScript UpdatedMay 5, 2024 -
LLaVA Public
Forked from haotian-liu/LLaVA[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
Python Apache License 2.0 UpdatedDec 14, 2023 -
PerceptualSimilarity Public
Forked from richzhang/PerceptualSimilarityLPIPS metric. pip install lpips
Python BSD 2-Clause "Simplified" License UpdatedOct 27, 2023 -
visual_gpt_score Public
VisualGPTScore for visio-linguistic reasoning
-
avalanche Public
Forked from ContinualAI/avalancheAvalanche: an End-to-End Library for Continual Learning.
Python MIT License UpdatedSep 16, 2023 -
vision-language-models-are-bows Public
Forked from mertyg/vision-language-models-are-bowsExperiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023
Python MIT License UpdatedApr 9, 2023 -
debiased-pseudo-labeling Public
Forked from frank-xwang/debiased-pseudo-labeling[CVPR 2022] Debiased Learning from Naturally Imbalanced Pseudo-Labels
Jupyter Notebook MIT License UpdatedFeb 19, 2023 -
why-winoground-hard Public
Forked from ajd12342/why-winoground-hardCode for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
Python MIT License UpdatedFeb 4, 2023 -
-
-
-
mmselfsup Public
Forked from open-mmlab/mmselfsupOpenMMLab Self-Supervised Learning Toolbox and Benchmark
Python Apache License 2.0 UpdatedAug 11, 2022 -
HRNet-Semantic-Segmentation Public
Forked from HRNet/HRNet-Semantic-SegmentationThe OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
Python Other UpdatedAug 1, 2022 -
-
clear-benchmark.github.io Public
Forked from clear-benchmark/clear-benchmark.github.ioHTML UpdatedJul 6, 2022