fengzx99

木辛 fengzx99

6 followers · 12 following

Starred repositories

rednote-hilab / dots.ocr

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 6,121 609 Updated Dec 27, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,416 3,046 Updated Jan 16, 2026

virattt / ai-hedge-fund

An AI Hedge Fund Team

Python 45,303 7,937 Updated Dec 1, 2025

AccumulateMore / CV

✔ (Completed) Extremely comprehensive deep learning notes [Tudui PyTorch] [Li Mu, Dive into Deep Learning] [Andrew Ng, Deep Learning] [Dafei, LLM Agents]

Jupyter Notebook 16,047 1,876 Updated Jan 12, 2026

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,955 1,381 Updated Jan 12, 2026

opendatalab / OmniDocBench

[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation

Python 1,366 132 Updated Dec 19, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 5,059 329 Updated Jan 16, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,803 851 Updated Jan 8, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,682 12,638 Updated Jan 16, 2026

punkpeye / awesome-mcp-servers

A collection of MCP servers.

79,034 6,816 Updated Jan 16, 2026

moonlight-stream / moonlight-qt

GameStream client for PCs (Windows, Mac, Linux, and Steam Link)

C++ 15,848 951 Updated Jan 16, 2026

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 52,240 4,347 Updated Jan 14, 2026

opendatalab / UniMERNet

UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition

Python 448 38 Updated Sep 28, 2025

poloclub / unitable

UniTable: Towards a Unified Table Foundation Model

Jupyter Notebook 521 40 Updated Jun 4, 2024

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,821 1,526 Updated Jan 4, 2026

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,739 316 Updated Jan 16, 2026

mfdycs / markdown-webviewer

A Python web app to show and edit Markdown files

JavaScript 4 Updated Mar 31, 2025

allenai / olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Python 16,751 1,327 Updated Jan 16, 2026

RapidAI / RapidTable

Inference library for sequence-based table recognition, integrating table recognition algorithms from PP-Structure, modelscope, and others.

Python 407 37 Updated Sep 4, 2025

FreedomIntelligence / InstructionZoo

282 24 Updated Apr 26, 2024

Infrasys-AI / AIInfra

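AIInfra (AI infrastructure) refers to the AI system stack, from low-level hardware such as chips up to the upper software stack, that supports training and inference of large AI models.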

Jupyter Notebook 5,754 795 Updated Dec 22, 2025

Zjh-819 / LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

3,340 228 Updated Nov 28, 2023

CrazyBoyM / llama3-Chinese-chat

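Repository of Chinese post-trained versions of Llama3 and Llama3.1 - interesting fine-tuned and modified weights & tutorial videos and docs for training, inference, evaluation, and deployment.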

Python 4,159 337 Updated Jan 6, 2026

togethercomputer / RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,914 368 Updated Dec 7, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,174 3,504 Updated Jan 26, 2025

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,477 2,153 Updated Jan 16, 2026

ggml-org / llama.cpp

LLM inference in C/C++

C++ 93,107 14,503 Updated Jan 16, 2026

yanx27 / Pointnet_Pointnet2_pytorch

PointNet and PointNet++ implemented in PyTorch (pure Python), on ModelNet, ShapeNet and S3DIS.

Python 4,672 999 Updated Apr 24, 2024

qicosmos / rest_rpc

Modern C++ (C++20), simple, easy-to-use RPC framework

C++ 1,975 397 Updated Jan 9, 2026

chriskohlhoff / asio

Asio C++ Library

C++ 5,703 1,336 Updated Nov 30, 2025
