Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View mktal's full-sized avatar

Block or report mktal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Adds "modifier key + mouse drag" move and resize to OSX

Objective-C 1,223 84 Updated Sep 14, 2025

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 3,130 224 Updated Nov 17, 2025

AI & parametric QR code generator. AI & 参数化二维码生成器。https://qrbtf.com

TypeScript 6,847 583 Updated Apr 17, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,085 122 Updated Jun 1, 2023

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 981 57 Updated Jan 30, 2024

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,810 151 Updated Jun 17, 2025

A guidance language for controlling large language models.

Jupyter Notebook 21,227 1,143 Updated Jan 28, 2026

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

10,142 783 Updated May 31, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,093 527 Updated Jul 1, 2025

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary hyperparameter optimization.

Python 868 66 Updated Jan 28, 2026

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,975 974 Updated Jul 8, 2025

Offline RL experiments

Python 15 Updated Oct 1, 2022
Python 32 9 Updated Jan 11, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,970 1,871 Updated Jul 15, 2025

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,227 3,666 Updated Jul 4, 2024

nanoGPT-like codebase for LLM training

Python 113 37 Updated Nov 7, 2025

4 bits quantization of LLaMA using GPTQ

Python 3,076 457 Updated Jul 13, 2024

Multi-GPU CUDA stress test

C++ 2,079 387 Updated Nov 4, 2025

Lightweight wrapper of the official ChatGPT API in your terminal

Shell 43 2 Updated Mar 10, 2023

Task-based datasets, preprocessing, and evaluation for sequence models.

Python 594 60 Updated Jan 14, 2026

Let us control diffusion models!

Python 33,605 2,999 Updated Feb 25, 2024
Python 1,560 159 Updated Jan 22, 2026

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Python 1,299 78 Updated Dec 18, 2024

An open source implementation of CLIP.

Python 13,315 1,229 Updated Nov 4, 2025

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

Python 376 44 Updated Nov 28, 2022

🦜🔗 The platform for reliable agents.

Python 125,387 20,636 Updated Jan 28, 2026

Train transformer language models with reinforcement learning.

Python 17,187 2,458 Updated Jan 29, 2026

A modular RL library to fine-tune language models to human preferences

Python 2,376 201 Updated Mar 1, 2024

ASDL: Automatic Second-order Differentiation Library for PyTorch

Python 191 18 Updated Dec 5, 2024
Next