xingling0

xingling0

Stars

inst-it / inst-it

[NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning"

Python 39 Updated Feb 20, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,415 230 Updated Nov 12, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,199 2,686 Updated Aug 12, 2024

WisconsinAIVision / ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Python 335 21 Updated Jul 17, 2024

YU-deep / Awesome-Latent-Space

A paper list of Awesome Latent Space.

248 10 Updated Dec 22, 2025

ThinkMorph / ThinkMorph

The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"

Jupyter Notebook 125 3 Updated Dec 22, 2025

Wakals / CoVT

Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"

Python 225 11 Updated Dec 9, 2025

mbzuai-oryx / Video-CoM

Video-CoM: Interactive Video Reasoning via Chain of Manipulations

16 Updated Dec 1, 2025

CSU-JPG / Glance

Glance: Accelerating Diffusion Models with 1 Sample

Python 135 1 Updated Dec 15, 2025

Video-Reason / Awesome-Video-Reasoning

This is a collection of recent papers on reasoning in video generation models.

86 1 Updated Dec 15, 2025

christykl / saia

Jupyter Notebook 3 Updated Oct 24, 2025

Meituan-AutoML / MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,315 85 Updated Apr 15, 2024

jbhuang0604 / awesome-tips

4,451 240 Updated Dec 8, 2025

showlab / VisInContext

Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

Python 27 3 Updated Oct 30, 2024

thu-coai / Glyph

Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"

Python 525 50 Updated Nov 4, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 21,527 1,926 Updated Oct 25, 2025

krennic999 / ARsample

Code for paper "Towards Better & Faster Autoregressive Image Generation: From the Perspective of Entropy" [NeurIPS 2025] .

Python 12 Updated Dec 6, 2025

allenai / molmo

Code for the Molmo Vision-Language Model

Python 839 80 Updated Dec 12, 2024

CSU-JPG / Awesome-VLM-Reasoning

21 1 Updated May 19, 2025

sigmorphon / 2022SegmentationST

SIGMORPHON 2022 Shared Task on Morpheme Segmentation

Jupyter Notebook 30 13 Updated Mar 26, 2023

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,312 7,793 Updated Dec 21, 2025

UKPLab / naacl2019-like-humans-visual-attacks

Python 26 7 Updated Nov 21, 2022

TianxingChen / Embodied-AI-Guide

[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide

10,101 681 Updated Dec 3, 2025

floatai / TKEval

[EMNLP-Findings'24] Tokenization Falling Short: On Subword Robustness in Large Language Models

Python 9 Updated Mar 7, 2025

mulanai / MuLan

MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)

Python 144 3 Updated Jan 24, 2025

fe1ixxu / ALMA

State-of-the-art LLM-based translation models.

Ruby 570 45 Updated Apr 9, 2025

slone-nlp / myv-nmt

Jupyter Notebook 27 Updated Mar 9, 2025

Shark-NLP / OpenICL

OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.

Python 583 30 Updated Oct 3, 2023

NJUNLP / MMT-LLM

Python 35 1 Updated Jun 15, 2023

facebookresearch / flores

Facebook Low Resource (FLoRes) MT Benchmark

Python 757 133 Updated Nov 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly