Thanks to visit codestin.com
Credit goes to github.com

Skip to content
@LaVi-Lab

LaVi Lab

We are the Language and Vision (LaVi) Lab in CSE@CUHK led by Prof. Liwei Wang.

Popular repositories Loading

  1. Video-3D-LLM Video-3D-LLM Public

    [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.

    Python 171 10

  2. VG-LLM VG-LLM Public

    The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

    Jupyter Notebook 145 2

  3. CLEVA CLEVA Public

    [EMNLP 2023 Demo] "CLEVA: Chinese Language Models EVAluation Platform"

    Shell 62 3

  4. NaviLLM NaviLLM Public

    Forked from zd11024/NaviLLM

    [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'

    Python 53 3

  5. AIM AIM Public

    [ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"

    Python 40 2

  6. Visual-Table Visual-Table Public

    [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"

    Python 20 1

Repositories

Showing 10 of 14 repositories
  • LaVi-Lab/LaVi-Lab.github.io’s past year of commit activity
    JavaScript 0 BSD-3-Clause 0 0 0 Updated Oct 21, 2025
  • AIM Public

    [ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"

    LaVi-Lab/AIM’s past year of commit activity
    Python 40 Apache-2.0 2 0 0 Updated Oct 9, 2025
  • VG-LLM Public

    The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

    LaVi-Lab/VG-LLM’s past year of commit activity
    Jupyter Notebook 145 2 9 0 Updated Oct 9, 2025
  • EgoMask Public

    [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"

    LaVi-Lab/EgoMask’s past year of commit activity
    Python 17 0 0 0 Updated Aug 4, 2025
  • Video-3D-LLM Public

    [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.

    LaVi-Lab/Video-3D-LLM’s past year of commit activity
    Python 171 Apache-2.0 10 7 0 Updated Jun 4, 2025
  • C2LEVA Public

    [Findings of ACL 2025] "C2LEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation"

    LaVi-Lab/C2LEVA’s past year of commit activity
    2 0 0 0 Updated May 27, 2025
  • CLEVA Public

    [EMNLP 2023 Demo] "CLEVA: Chinese Language Models EVAluation Platform"

    LaVi-Lab/CLEVA’s past year of commit activity
    Shell 62 3 1 0 Updated May 16, 2025
  • FTTT Public

    [ACL 2025] Official code for ''Learning to Reason from Feedback at Test-Time''.

    LaVi-Lab/FTTT’s past year of commit activity
    Python 15 MIT 0 0 0 Updated May 16, 2025
  • Visual-Table Public

    [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"

    LaVi-Lab/Visual-Table’s past year of commit activity
    Python 20 Apache-2.0 1 0 0 Updated Oct 17, 2024
  • TG-Vid Public

    [EMNLP 2024] Official code for "Enhancing Temporal Modeling of Video LLMs via Time Gating"

    LaVi-Lab/TG-Vid’s past year of commit activity
    Python 6 0 0 0 Updated Oct 10, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics