GeWu-Lab

50 repositories

MokA
Public
MokA: Multimodal Low-Rank Adaptation for MLLMs
Python
•4•60•11•0•Updated Dec 30, 2025
Crab
Public
[CVPR 2025] Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation
mllms
Python
•2•80•4•0•Updated Dec 24, 2025
awesome-balanced-multimodal-learning
Public
A curated list of balanced multimodal learning methods.
5•147•1•0•Updated Dec 22, 2025
InfoReg_CVPR2025
Public
This is the repo for "Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition", CVPR2025.
Python
•5•19•4•0•Updated Dec 22, 2025
action_preference_optimization
Public
JavaScript
•0•0•0•0•Updated Oct 26, 2025
Action-Preference-Optimization
Public
Python
•
MIT License
•2•7•2•0•Updated Oct 26, 2025
gewu-lab.github.io
Public
HTML
•1•0•0•0•Updated Oct 15, 2025
Ref-AVS
Public
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
Python
•
MIT License
•2•49•0•0•Updated Oct 12, 2025
hapo_human_assisted_preference_optimization
Public
JavaScript
•0•0•0•0•Updated Sep 25, 2025
OGM-GE_CVPR2022
Public
The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)
Python
•
MIT License
•23•302•36•0•Updated Sep 22, 2025
MGIPF
Public
The repo for "MGIPF: Multi-Granularity Interest Prediction Framework for Personalized Recommendation", SIGIR 2025
Python
•
MIT License
•1•2•0•0•Updated Jul 26, 2025
WCAE
Public
Python
•0•0•0•0•Updated Jul 1, 2025
MS-Bot
Public
The official repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)
Python
•3•19•1•0•Updated Jun 25, 2025
AnyTouch
Public
The repo for "AnyTouch: Learning Unified Static-Dynamic Representation across Multiple Visuo-tactile Sensors", ICLR 2025
Python
•
MIT License
•7•77•2•0•Updated Jun 25, 2025
RollingQ_ICML2025
Public
Official repo for ICML 2025 paper "RollingQ: Reviving the Cooperation Dynamics in Multimodal Transformer"
Python
•2•13•3•0•Updated Jun 21, 2025
Certifiable-Robust-Multi-modal-Training
Public
A Python implementation of Certifiable Robust Multi-modal Training
Python
•0•19•0•0•Updated Jun 21, 2025
Patch-Matters
Public
[CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception
Python
•0•19•2•0•Updated Jun 17, 2025
LSMI_Estimator
Public
The official repo for "Efficient Quantification of Multimodal Interaction at Sample Level", ICML 2025
Python
•1•7•1•0•Updated Jun 5, 2025
Motion-based-Self-Reflection-Framework
Public
Python
•0•12•1•0•Updated Apr 30, 2025
LFAV
Public
Towards Long Form Audio-visual Video Understanding
Python
•
MIT License
•0•14•1•0•Updated Apr 27, 2025
Sounding-Object-Segmentation-Preference
Public
The official repo for "Can Textual Semantics Mitigate Sounding Object Segmentation Preference?", ECCV 2024
Python
•0•6•1•0•Updated Mar 1, 2025
BalanceBenchmark
Public
Python
•0•36•4•0•Updated Feb 23, 2025
awesome-audiovisual-learning
Public
A curated list of audio-visual learning methods and datasets.
awesome awesome-list
20•281•1•0•Updated Dec 3, 2024
Valuate-and-Enhance-Multimodal-Cooperation
Public
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
Python
•4•59•7•0•Updated Nov 5, 2024
TSPM
Public
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
Python
•1•16•4•0•Updated Oct 25, 2024
Keystate_Online_Imitation
Public
The repo for "KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance", CoRL 2024
Python
•1•9•0•0•Updated Oct 17, 2024
Stepping-Stones
Public
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
Python
•2•18•1•0•Updated Oct 11, 2024
bias_in_AVS
Public
Official repository for "Unveiling and Mitigating Bias in Audio Visual Segmentation" in ACM MM 2024
Python
•0•6•0•0•Updated Oct 10, 2024
BML_TPAMI2024
Public
The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024
Python
•1•18•3•0•Updated Sep 29, 2024
DepthHelps-IROS2024
Public
Python
•1•18•3•0•Updated Aug 21, 2024