Stars
Statistical Learning course at USTC. Review materials for the USTC Statistical Learning course (taught by Liu Dong).
Collection of papers about video-audio understanding
Official implementation of RLFR: Extending Reinforcement Learning for LLMs with Flow Environment
A Comprehensive Dataset for Advanced Image Generation and Editing
Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning
Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grained visual understanding".
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
A new generation of CLIP with fine-grained discrimination capability, ICML 2025
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
Anole: An Open, Autoregressive, and Native Multimodal Model for Interleaved Image-Text Generation
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
[ICCV 2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
A repo tracking the latest autoregressive visual generation papers.
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
A collection of awesome text-to-image generation studies.
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models
Align Anything: Training All-modality Model with Feedback
📖 A repository organizing papers, code, and other resources related to unified multimodal models.
[ICCV 2025] VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.
Paper List of Inference/Test Time Scaling/Computing