-
21:37
(UTC -12:00) - LJungang.github.io
-
WebClone Public
Forked from franskey-0112/WebCloneAn Offline Evaluation Toolkit for Dynamic Assessment of Computer-Use Agents
JavaScript UpdatedJan 26, 2026 -
๐ฅAn open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.
-
RTV-Bench Public
[NeurIPS 2025] ๐ก๐ฃ๐ฅ-๐๐ฎ๐ท๐ฌ๐ฑ: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video.
-
JavisGPT Public
Forked from JavisVerse/JavisGPT[NeurIPS 2025 Spotlight] JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
-
CMAT Public
[Applied Soft Computing, 2025] CMAT: A Cross-Model Adversarial Texture for Scanned Document Privacy Protection.
-
TimeLens Public
Forked from TencentARC/TimeLensTimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
Python Other UpdatedDec 19, 2025 -
Awesome-Streaming-Video-Understanding Public
Forked from sotayang/Awesome-Streaming-Video-Understanding[Awesome] ๐ฅ๐ฅ๐ฅ Latest Papers, Codes and Datasets on Streaming / Online Video Understanding
-
Awesome-Video-LMM-Post-Training Public
Forked from yunlong10/Awesome-Video-LMM-Post-Training๐ฅ๐ฅ๐ฅ Latest Papers, Codes and Datasets on Video-LMM Post-Training
-
MotionSight Public
Forked from NJU-PCALab/MotionSightPython Apache License 2.0 UpdatedSep 26, 2025 -
-
-
SAVEn-Vid Public
SAVEn-Vid: Synergistic Audio-Visual Integration for Enhanced Understanding in Long Video Context
5 UpdatedDec 21, 2024 -
NExT-GPT Public
Forked from NExT-GPT/NExT-GPTCode and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Python BSD 3-Clause "New" or "Revised" License UpdatedNov 10, 2024 -
Artista-o Public
Forked from showlab/Show-oFollowing Show-o, Artista advances multimodal capabilities with a unified architecture that seamlessly integrates understanding and generation across modalities, acting like a versatile artist.
Python Apache License 2.0 UpdatedOct 28, 2024 -
LLaVA Public
Forked from haotian-liu/LLaVAFollowing LLaVA for Personal Learning.
Python Apache License 2.0 UpdatedAug 12, 2024