-
APL Public
Forked from zhangbin-ai/APL
[2024 AAAI] Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
-
AVSBench Public
Forked from OpenNLPLab/AVSBench
[2022 ECCV] Audio-Visual Segmentation
-
awesome-audiovisual-learning Public
Forked from GeWu-Lab/awesome-audiovisual-learning
A curated list of audio-visual learning methods and datasets.
Updated Jul 3, 2024
-
CPSP Public
[2023 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line
-
FAVDBench Public
Forked from OpenNLPLab/FAVDBench
[CVPR 2023] Official implementation of the paper: Fine-grained Audible Video Description
Python Apache License 2.0 Updated Dec 4, 2023
-
LEAP Public
[2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing
-
Mettle Public
[2025 Arxiv] Mettle: Meta-Token Learning for Memory-Efficient Audio-Visual Adaptation
1 Updated Aug 5, 2025
-
OV-AVEL Public
[2025 CVPR] Towards Open-Vocabulary Audio-Visual Event Localization
-
PSP_CVPR_2021 Public
[2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line
-
TGS-Agent Public
[2025 Arxiv] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation
8 Updated Aug 13, 2025
-
video_features Public
Forked from v-iashin/video_features
Extract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as I3D, R(2+1)D, VGGish, ResNet features.
Python GNU General Public License v3.0 Updated May 27, 2022