glistering96

2 followers · 2 following

Lists (1)

Reinforcement learning

1 repository

Stars

langgenius / dify

Production-ready platform for agentic workflow development.

Python 125,425 19,504 Updated Jan 10, 2026

ggml-org / llama.cpp

LLM inference in C/C++

C++ 92,762 14,428 Updated Jan 10, 2026

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

72,786 8,365 Updated Dec 22, 2025

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 65,254 6,577 Updated Nov 11, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,316 4,544 Updated Dec 22, 2025

Aider-AI / aider

aider is AI pair programming in your terminal

Python 39,668 3,816 Updated Jan 4, 2026

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,432 2,149 Updated Jan 9, 2026

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,141 2,678 Updated Nov 3, 2025

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,598 2,003 Updated Jan 11, 2026

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,804 871 Updated Jun 10, 2024

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

Python 9,039 1,201 Updated Dec 1, 2025

bitsandbytes-foundation / bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Python 7,880 813 Updated Jan 8, 2026

zhouhaoyi / Informer2020

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

Python 6,383 1,285 Updated Jun 20, 2025

leela-zero / leela-zero

Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper.

C++ 5,550 1,018 Updated May 2, 2024

suragnair / alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Jupyter Notebook 4,351 1,142 Updated Jan 1, 2025

opendilab / DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,570 422 Updated Dec 7, 2025

pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,246 431 Updated Jan 10, 2026

seungeunrho / minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Python 3,124 486 Updated Apr 22, 2023

werner-duvaud / muzero-general

MuZero

Python 2,752 667 Updated Sep 3, 2024

thuml / Autoformer

Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008

Jupyter Notebook 2,389 489 Updated Feb 28, 2025

nikhilbarhate99 / PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python 2,290 416 Updated Jul 9, 2024

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 2,251 70 Updated May 21, 2025

lucidrains / lion-pytorch

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,182 56 Updated Nov 27, 2024

Thinklab-SJTU / awesome-ml4co

Awesome machine learning for combinatorial optimization papers.

Python 2,055 230 Updated Nov 7, 2025

wouterkool / attention-learn-to-route

Attention based model for learning to solve different routing problems

Jupyter Notebook 1,321 382 Updated Aug 4, 2024

ericyangyu / PPO-for-Beginners

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python 1,192 157 Updated Oct 1, 2024

OpenLMLab / LOMO

LOMO: LOw-Memory Optimization

Python 988 68 Updated Jul 2, 2024

lucidrains / PaLM-pytorch

Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways

Python 828 82 Updated Nov 9, 2022

shunyaoshih / TPA-LSTM

Temporal Pattern Attention for Multivariate Time Series Forecasting

Python 735 191 Updated Nov 29, 2018

OptMLGroup / VRP-RL

Reinforcement Learning for Solving the Vehicle Routing Problem

Python 698 234 Updated May 5, 2021
