Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View glistering96's full-sized avatar

Block or report glistering96

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Neural Combinatorial Optimization

Python 84 8 Updated Nov 15, 2025

Production-ready platform for agentic workflow development.

TypeScript 122,446 19,038 Updated Dec 22, 2025

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,223 427 Updated Dec 18, 2025

LLM inference in C/C++

C++ 91,843 14,195 Updated Dec 22, 2025

[ACL 2024 Findings] Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning

Python 6 1 Updated Jun 25, 2024

The official implementation of Self-Play Preference Optimization (SPPO)

Python 583 47 Updated Jan 23, 2025

🧭 COMPASS: Combinatorial Optimization with Policy Adaptation using Latent Space Search

Python 42 4 Updated Jun 21, 2024

aider is AI pair programming in your terminal

Python 39,127 3,759 Updated Dec 18, 2025
Jupyter Notebook 69 13 Updated Mar 21, 2024

Schedule-Free Optimization in PyTorch

Python 2,241 69 Updated May 21, 2025

This is the official code for the published paper 'Solve routing problems with a residual edge-graph attention neural network'

Python 264 35 Updated Sep 5, 2023

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python 1,176 157 Updated Oct 1, 2024

Awesome machine learning for combinatorial optimization papers.

Python 2,045 230 Updated Nov 7, 2025

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Python 3,561 421 Updated Dec 7, 2025

This repository contains the implementation of paper Online 3D Bin Packing with Constrained Deep Reinforcement Learning.

Python 627 90 Updated Nov 17, 2023

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,448 1,971 Updated Dec 23, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 7,845 804 Updated Dec 12, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,791 869 Updated Jun 10, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,320 2,131 Updated Dec 18, 2025

Making large AI models cheaper, faster and more accessible

Python 41,299 4,546 Updated Dec 22, 2025

LOMO: LOw-Memory Optimization

Python 991 68 Updated Jul 2, 2024

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,178 56 Updated Nov 27, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

71,008 8,124 Updated Dec 22, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 1 Updated Jan 4, 2024

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,103 2,672 Updated Nov 3, 2025

Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper.

C++ 5,548 1,019 Updated May 2, 2024

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

509 36 Updated Nov 11, 2025

Codebase for SEFS: Self-Supervision Enhanced Feature Selection with Correlated Gates

Python 24 7 Updated Sep 11, 2023

ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802

Python 97 7 Updated Aug 18, 2023

Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways

Python 827 82 Updated Nov 9, 2022
Next