Stars
Reinforcement Learning of Vision Language Models with Self Visual Perception Reward
A frontier collection and survey of vision-language model papers and model GitHub repositories. Continuously updated.
Eagle: Frontier Vision-Language Models with Data-Centric Strategies
✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning
[ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning
FuxiaoLiu / awesome-Large-MultiModal-Hallucination
Forked from xieyuquanxx/awesome-Large-MultiModal-Hallucination
😎 Up-to-date & curated list of awesome LMM hallucination papers, methods & resources.
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
[CSCWD] Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead.
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Open-source datasets for the paper "Fairness in Graph Mining: A Survey".
Open-source code for "Graph Neural Networks with Adaptive Frequency Response Filter".
Open-source code for the paper "EDITS: Modeling and Mitigating Data Bias for Graph Neural Networks".
Open-source Library PyGDebias: Graph Datasets and Fairness-Aware Graph Mining Algorithms
The repository for the survey paper "Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity"
An automatic MLLM hallucination detection framework
CoNLI: a plug-and-play framework for ungrounded hallucination detection and reduction
A paper list on multimodal and large language models, used only to record papers I read from the daily arXiv for personal reference.
The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"
Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"
The repository for the survey "A Survey on Image-Text Multimodal Models".
A survey of multimodal learning research.