Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View sanchitintel's full-sized avatar
đź’­
Please send a message on Slack/MS Teams if I miss a notification. Thanks!
đź’­
Please send a message on Slack/MS Teams if I miss a notification. Thanks!
  • San Francisco Bay Area

Block or report sanchitintel

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU.

Python 678 57 Updated Oct 27, 2025

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda 630 133 Updated Oct 20, 2025

SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs

C++ 43 64 Updated Oct 24, 2025

Puzzles for learning Triton

Jupyter Notebook 2,078 170 Updated Nov 18, 2024

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 508 51 Updated Oct 27, 2025

[ICLR 2025] Palu: Compressing KV-Cache with Low-Rank Projection

Python 143 8 Updated Feb 20, 2025

Curated collection of papers in MoE model inference

292 11 Updated Oct 20, 2025

Perplexity GPU Kernels

C++ 509 69 Updated Sep 19, 2025

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 580 30 Updated Aug 12, 2025

Reproducing R1 for Code with Reliable Rewards

Python 261 16 Updated May 5, 2025

Tile primitives for speedy kernels

Cuda 2,841 190 Updated Oct 24, 2025

s1: Simple test-time scaling

Python 6,582 764 Updated Jun 25, 2025
Python 2 Updated Apr 5, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,248 3,805 Updated Jul 23, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,728 274 Updated Jul 18, 2025

oneAPI - Data Parallel C++ course for students

C++ 44 12 Updated Nov 4, 2024

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 373 18 Updated Apr 13, 2025

A batched offline inference oriented version of segment-anything

Python 1,251 75 Updated Aug 22, 2025

Applied AI experiments and examples for PyTorch

Python 301 29 Updated Aug 22, 2025

Helpful tools and examples for working with flex-attention

Python 1,034 64 Updated Oct 23, 2025

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,617 249 Updated Sep 10, 2025

PyTorch media decoding and encoding

Python 769 66 Updated Oct 26, 2025

A PyTorch native platform for training generative AI models

Python 4,600 573 Updated Oct 27, 2025

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 23,504 9,761 Updated Sep 1, 2025

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Python 1,658 157 Updated Oct 20, 2025

Train transformer language models with reinforcement learning.

Python 16,025 2,254 Updated Oct 25, 2025

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 12,866 1,345 Updated Oct 20, 2025

Go ahead and axolotl questions

Python 10,677 1,176 Updated Oct 26, 2025

PyTorch native post-training library

Python 5,561 680 Updated Oct 26, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 76,446 11,260 Updated Oct 22, 2025
Next