Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View openingelevator's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report openingelevator

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

DeepConf: Deep Think with Confidence

Python 276 38 Updated Sep 18, 2025

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 262 23 Updated Sep 5, 2025

everything about llm & aigc

Jupyter Notebook 106 12 Updated Sep 24, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,602 68 Updated May 11, 2025

This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems

Python 91 8 Updated Mar 21, 2025

A library for advanced large language model reasoning

Python 2,292 201 Updated Jun 10, 2025

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,632 571 Updated Jan 16, 2025

《Reinforcement Learning: An Introduction》(第二版)中文翻译

Python 609 108 Updated Apr 9, 2022

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,365 1,233 Updated Oct 18, 2025

Inference code for Llama models

Python 58,873 9,815 Updated Jan 26, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,564 430 Updated Oct 24, 2025

Python Implementation of Reinforcement Learning: An Introduction

Python 14,364 4,958 Updated Aug 9, 2024

Train your Agent model via our easy and efficient framework

Python 1,582 143 Updated Oct 24, 2025
Jupyter Notebook 19 Updated Aug 20, 2025

个人构建MoE大模型:从预训练到DPO的完整实践

Python 1,667 130 Updated Oct 21, 2025

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 923 71 Updated Feb 16, 2025

Must-read Papers on LLM Agents.

2,735 159 Updated Oct 24, 2025
Python 44 2 Updated Sep 27, 2025

Code for paper: Optimizing Length Compression in Large Reasoning Models

Python 26 4 Updated Oct 20, 2025

Paper list for Efficient Reasoning.

706 25 Updated Oct 25, 2025

Pocket Flow: 100-line LLM framework. Let Agents build Agents!

Python 8,698 981 Updated Aug 13, 2025

LLMs-from-scratch项目中文翻译

Jupyter Notebook 1,818 296 Updated Oct 15, 2025

卡码网-23种设计模式精讲,每种设计模式都配套代码练习题,支持 Java,CPP,Python,Go🔥

837 168 Updated Jul 4, 2025

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 26,105 6,710 Updated Oct 24, 2025

Resource for published videos

HTML 59 11 Updated Oct 8, 2025
Next