Thanks to visit codestin.com
Credit goes to github.com

mktal

Follow

Xiaocheng Tang mktal

Follow

AI Research Scientist

55 followers · 115 following

Achievements

Achievements

Stars

dmarcotte / easy-move-resize

Adds "modifier key + mouse drag" move and resize to OSX

Objective-C 1,223 84 Updated Sep 14, 2025

THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 3,130 224 Updated Nov 17, 2025

latentcat / qrbtf

AI & parametric QR code generator. AI & 参数化二维码生成器。https://qrbtf.com

TypeScript 6,847 583 Updated Apr 17, 2025

openai / prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,085 122 Updated Jun 1, 2023

Liuhong99 / Sophia

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 981 57 Updated Jan 30, 2024

anthropics / hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,810 151 Updated Jun 17, 2025

guidance-ai / guidance

A guidance language for controlling large language models.

Jupyter Notebook 21,227 1,143 Updated Jan 28, 2026

Mooler0410 / LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

10,142 783 Updated May 31, 2024

Lightning-AI / lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,093 527 Updated Jul 1, 2025

AgileRL / AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary hyperparameter optimization.

Python 868 66 Updated Jan 28, 2026

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,975 974 Updated Jul 8, 2025

maxreciprocate / offline

Offline RL experiments

Python 15 Updated Oct 1, 2022

grapeot / VoiceNoteTaker

Python 32 9 Updated Jan 11, 2024

ymcui / Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,970 1,871 Updated Jul 15, 2025

openai / chatgpt-retrieval-plugin

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,227 3,666 Updated Jul 4, 2024

epfml / llm-baselines

nanoGPT-like codebase for LLM training

Python 113 37 Updated Nov 7, 2025

qwopqwop200 / GPTQ-for-LLaMa

4 bits quantization of LLaMA using GPTQ

Python 3,076 457 Updated Jul 13, 2024

wilicc / gpu-burn

Multi-GPU CUDA stress test

C++ 2,079 387 Updated Nov 4, 2025

mansimov / chatgpt_cli

Lightweight wrapper of the official ChatGPT API in your terminal

Shell 43 2 Updated Mar 10, 2023

google / seqio

Task-based datasets, preprocessing, and evaluation for sequence models.

Python 594 60 Updated Jan 14, 2026

lllyasviel / ControlNet

Let us control diffusion models!

Python 33,605 2,999 Updated Feb 25, 2024

google-research / FLAN

Python 1,560 159 Updated Jan 22, 2026

tysam-code / hlb-CIFAR10

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Python 1,299 78 Updated Dec 18, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 13,315 1,229 Updated Nov 4, 2025

Div-Infinity / IQ-Learn

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

Python 376 44 Updated Nov 28, 2022

langchain-ai / langchain

🦜🔗 The platform for reliable agents.

Python 125,387 20,636 Updated Jan 28, 2026

openai / guided-diffusion

Python 7,272 891 Updated Jul 2, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 17,187 2,458 Updated Jan 29, 2026

allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences

Python 2,376 201 Updated Mar 1, 2024

kazukiosawa / asdl

ASDL: Automatic Second-order Differentiation Library for PyTorch

Python 191 18 Updated Dec 5, 2024