Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View leedewdew's full-sized avatar

Block or report leedewdew

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Python 424 19 Updated May 14, 2025
Python 832 44 Updated Sep 15, 2025

Fast and memory-efficient exact attention

Python 20,334 2,110 Updated Nov 3, 2025

Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions

Python 380 33 Updated Nov 4, 2025

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 8,691 759 Updated Nov 4, 2025

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 22,371 3,357 Updated Nov 4, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 155,375 13,535 Updated Nov 4, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,484 58 Updated Jun 14, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

66,599 7,508 Updated Jun 4, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,244 1,758 Updated Oct 13, 2025

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook 101,342 53,748 Updated Nov 3, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,439 1,285 Updated Oct 6, 2025

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 19,395 1,838 Updated Nov 4, 2025

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

TypeScript 342,755 43,292 Updated Nov 4, 2025

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 29,563 2,620 Updated Oct 31, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,096 2,421 Updated Nov 4, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,616 70 Updated May 11, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,663 440 Updated Nov 4, 2025

A fork to add multimodal model training to open-r1

Python 1,416 70 Updated Feb 8, 2025

恒星共识协议中文翻译

TeX 149 43 Updated Dec 17, 2021

This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reas…

Python 722 19 Updated Sep 10, 2025

Train transformer language models with reinforcement learning.

Python 16,150 2,272 Updated Nov 4, 2025

Fully open reproduction of DeepSeek-R1

Python 25,606 2,400 Updated Sep 8, 2025
Jupyter Notebook 34 3 Updated Mar 6, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,243 99 Updated Oct 29, 2025

Curated list of datasets and tools for post-training.

3,837 318 Updated Jul 27, 2025

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 4,966 457 Updated Oct 6, 2025
Python 8,117 570 Updated Oct 30, 2025
Next