Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View itsnamgyu's full-sized avatar
🌝
Excited
🌝
Excited

Block or report itsnamgyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Nano vLLM

Python 11,202 1,464 Updated Nov 3, 2025

GPUGrants - a list of GPU grants that I can think of

65 5 Updated Sep 13, 2025

Why is this running?

Go 12,180 289 Updated Jan 24, 2026

[NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"

Python 23 1 Updated Jan 3, 2026

Two Heads Are Better Than One: Audio-Visual Speech Error Correction with Dual Hypotheses

Python 9 Updated Oct 15, 2025

Sends virtual input commands

Python 2,089 280 Updated Aug 12, 2025

Train transformer language models with reinforcement learning.

Python 17,189 2,458 Updated Jan 29, 2026

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,938 297 Updated Aug 9, 2025

Fully open reproduction of DeepSeek-R1

Python 25,844 2,411 Updated Nov 24, 2025

Efficient Triton Kernels for LLM Training

Python 6,085 474 Updated Jan 27, 2026

Robust recipes to align language models with human and AI preferences

Python 5,481 467 Updated Sep 8, 2025

Official Code implementation of "Flex-Judge: Text-Only Reasoning Unleashes Zero-Shot Multimodal Evaluators"

Python 11 1 Updated Sep 19, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,381 1,862 Updated Jan 9, 2026

Code for using and evaluating SpanBERT.

Python 903 177 Updated Jul 25, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 68,948 13,002 Updated Jan 29, 2026

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)

Python 538 79 Updated Sep 26, 2025

A list of free LLM inference resources accessible via API.

Python 8,049 781 Updated Jan 29, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,805 3,132 Updated Jan 29, 2026

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,720 85 Updated Jan 24, 2026

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,157 233 Updated Aug 17, 2024

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,510 257 Updated Aug 13, 2024

A PyTorch native platform for training generative AI models

Python 5,018 682 Updated Jan 29, 2026

[ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring

Python 267 21 Updated Jul 6, 2025

Gemma open-weight LLM library, from Google DeepMind

Python 3,990 647 Updated Jan 23, 2026

JAX-based neural network library

Python 3,178 280 Updated Jan 24, 2026

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 548 70 Updated Jan 13, 2026

An interactive HTML pretty-printer for machine learning research in IPython notebooks.

Python 458 23 Updated Aug 8, 2025

Modular, scalable library to train ML models

Python 203 20 Updated Jan 29, 2026

Kanana: Compute-efficient Bilingual Language Models

278 15 Updated Jul 23, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,752 271 Updated Jul 18, 2025
Next