Zhejiang University
- [email protected]
Stars
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
微舆 (Weiyu): a multi-agent public-opinion analysis assistant anyone can use. It breaks information cocoons, reconstructs the full picture of public sentiment, predicts future trends, and supports decision-making. Implemented from scratch, without relying on any framework.
GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.
Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation"
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
NOFX: Defining the Next-Generation AI Trading Operating System. A multi-exchange AI trading platform (Binance/Hyperliquid/Aster) with multi-AI competition (DeepSeek/Qwen/Claude), self-evolution, and re…
myscius / Awesome-Multimodal-Large-Language-Models
Forked from BradyFU/Awesome-Multimodal-Large-Language-Models. ✨✨ Latest Advances on Multimodal Large Language Models
Native Multimodal Models are World Learners
"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
EvoSyn: Generalizable Evolutionary Data Synthesis for Verifiable Learning
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
A simple yet powerful agent framework that delivers with open-source models
A SOTA open-source image editing model that aims to deliver performance comparable to closed-source models such as GPT-4o and Gemini 2 Flash.
Official repo of the paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-effective, self-iterative optimization loop.
Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
Post-training with Tinker
This is the official Python version of Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.