-
South China University of Technology
- China
-
01:17
(UTC -12:00) - https://www.scut.edu.en
Lists (14)
Sort Name ascending (A-Z)
Starred repositories
Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
An Open-source RL System from ByteDance Seed and Tsinghua AIR
This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
A library for advanced large language model reasoning
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
《Reinforcement Learning: An Introduction》(第二版)中文翻译
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Democratizing Reinforcement Learning for LLMs
Python Implementation of Reinforcement Learning: An Introduction
Train your Agent model via our easy and efficient framework
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
Code for paper: Optimizing Length Compression in Large Reasoning Models
Paper list for Efficient Reasoning.
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
LLMs-from-scratch项目中文翻译
卡码网-23种设计模式精讲,每种设计模式都配套代码练习题,支持 Java,CPP,Python,Go🔥
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…