leedewdew

leedewdew

3 followers · 23 following

Stars

RLHF-V / RLAIF-V

[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Python 424 19 Updated May 14, 2025

meituan-longcat / LongCat-Flash-Thinking

246 21 Updated Oct 31, 2025

ByteDance-Seed / seed-oss

Python 832 44 Updated Sep 15, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 20,334 2,110 Updated Nov 3, 2025

mjun0812 / flash-attention-prebuild-wheels

Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions

Python 380 33 Updated Nov 4, 2025

xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 8,691 759 Updated Nov 4, 2025

HKUDS / LightRAG

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 22,371 3,357 Updated Nov 4, 2025

ollama / ollama

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 155,375 13,535 Updated Nov 4, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,484 58 Updated Jun 14, 2025

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

66,599 7,508 Updated Jun 4, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,244 1,758 Updated Oct 13, 2025

microsoft / generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook 101,342 53,748 Updated Nov 3, 2025

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,439 1,285 Updated Oct 6, 2025

bytedance / UI-TARS-desktop

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 19,395 1,838 Updated Nov 4, 2025

kamranahmedse / developer-roadmap

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

TypeScript 342,755 43,292 Updated Nov 4, 2025

Byaidu / PDFMathTranslate

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译，支持 Google/DeepL/Ollama/OpenAI 等服务，提供 CLI/GUI/MCP/Docker/Zotero

Python 29,563 2,620 Updated Oct 31, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,096 2,421 Updated Nov 4, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,616 70 Updated May 11, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,663 440 Updated Nov 4, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,416 70 Updated Feb 8, 2025

StellarCN / scp_zh

恒星共识协议中文翻译

TeX 149 43 Updated Dec 17, 2021

Osilly / Vision-R1

This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reas…

Python 722 19 Updated Sep 10, 2025