  • CAD&CG State Key Laboratory
  • Zhejiang Univ, Hangzhou, Zhejiang, China

A Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online at https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 12,885 2,140 Updated Sep 6, 2025

Scalable toolkit for efficient model alignment

Python 843 102 Updated Oct 6, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,841 374 Updated Oct 17, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,208 51 Updated Nov 16, 2024

Train transformer language models with reinforcement learning.

Python 16,098 2,261 Updated Nov 1, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,824 135 Updated Jan 17, 2025

Distributed platform for building autonomic network functions.

C++ 920 244 Updated Oct 31, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,581 1,069 Updated Oct 31, 2025

A collection of high-quality full-stack LLM resources

Shell 649 73 Updated Jul 15, 2025

[EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"

Python 103 7 Updated Nov 9, 2024

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,799 76 Updated Oct 31, 2025

DataComp for Language Models

HTML 1,384 127 Updated Sep 9, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,652 316 Updated Aug 19, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,204 1,754 Updated Oct 13, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,337 467 Updated Aug 7, 2024

A curated list for Efficient Large Language Models

Python 1,887 144 Updated Jun 17, 2025

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,644 573 Updated Jan 16, 2025

Plug-and-play implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that elevates model reasoning by at least 70%

Python 4,553 374 Updated Jul 29, 2025
Python 36 17 Updated Dec 18, 2024

Pytorch❤️ Keras 😋😋

Jupyter Notebook 1,998 255 Updated Sep 22, 2025

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,791 147 Updated Jun 17, 2025

A curated list of reinforcement learning with human feedback resources (continually updated)

4,188 249 Updated Sep 19, 2025

🆓 List of free ChatGPT mirror sites, continuously updated.

Python 20,534 1,397 Updated Jun 23, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,720 483 Updated Jan 8, 2024

A toolbox of vision models and algorithms based on MindSpore

Python 261 147 Updated Jul 24, 2025

MindSpore large-scale recommender system library.

Python 10 Updated Dec 21, 2023

Technical Documents

2 1 Updated Apr 4, 2023

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,909 31,006 Updated Oct 31, 2025

Language-Agnostic SEntence Representations

Jupyter Notebook 3,649 463 Updated May 2, 2024