giladturok
😃
frantically tuning hyper-parameters

Causal, Higher-Order, Probabilistic Programming

Julia 175 17 Updated Oct 21, 2025

[NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference

Python 13 2 Updated Oct 28, 2025

Python 19 1 Updated Sep 14, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 7,700 791 Updated Oct 28, 2025

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 852 122 Updated Oct 29, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,163 269 Updated Oct 29, 2025

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,536 183 Updated Jul 12, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,837 366 Updated Dec 7, 2024

TL;DR: We only have one life. Let's stop wasting it on YouTube shorts.

JavaScript 93 4 Updated Oct 23, 2025

Fast and memory-efficient exact attention

Python 20,228 2,095 Updated Oct 28, 2025

Minimal and annotated implementations of key ideas from modern deep learning research.

Python 1,180 95 Updated Sep 28, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,916 144 Updated Oct 29, 2025

Debug PyTorch code using PySnooper

Python 801 43 Updated Apr 28, 2021

A comprehensive JAX/NNX library for diffusion and flow matching generative algorithms, featuring DiT (Diffusion Transformer) and its variants as the primary backbone with support for ImageNet train…

Python 109 6 Updated Oct 16, 2025

Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`

Python 47 5 Updated May 31, 2024

📊 Save matplotlib figures as TikZ/PGFplots for smooth integration into LaTeX.

Python 2,541 240 Updated Aug 16, 2024

Python 139 9 Updated Oct 9, 2025

Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"

Python 330 23 Updated Dec 22, 2024

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 2,115 149 Updated Oct 14, 2025

Post-training with Tinker

Python 1,292 90 Updated Oct 29, 2025

Code for "Variational Reasoning for Language Models"

Python 51 1 Updated Sep 29, 2025

Muon is Scalable for LLM Training

1,346 70 Updated Aug 3, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

8,403 552 Updated Sep 11, 2025

List of AI Internships

128 8 Updated Oct 6, 2023

JAX implementation of MeanFlow

Python 460 17 Updated Jul 30, 2025

[ICML 2025] Customizing the Inductive Biases of Softmax Attention using Structured Matrices

Jupyter Notebook 8 Updated Jul 14, 2025

EDM2 and Autoguidance -- Official PyTorch implementation

Python 778 49 Updated Dec 9, 2024

Supporting code for the blog post on modular manifolds.

Python 97 14 Updated Sep 26, 2025