sanchitintel

💭

Please send a message on Slack/MS Teams if I miss a notification. Thanks!

sanchitintel

💭

Please send a message on Slack/MS Teams if I miss a notification. Thanks!

18 followers · 162 following

San Francisco Bay Area

Lists (2)

Sort

🔮 Future ideas

✨ Inspiration

Stars

intel / auto-round

Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU.

Python 678 57 Updated Oct 27, 2025

Dao-AILab / causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Cuda 630 133 Updated Oct 20, 2025

intel / sycl-tla

Forked from NVIDIA/cutlass

SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs

C++ 43 64 Updated Oct 24, 2025

srush / Triton-Puzzles

Puzzles for learning Triton

Jupyter Notebook 2,078 170 Updated Nov 18, 2024

pytorch / helion

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 508 51 Updated Oct 27, 2025

shadowpa0327 / Palu

[ICLR 2025] Palu: Compressing KV-Cache with Low-Rank Projection

Python 143 8 Updated Feb 20, 2025

MoE-Inf / awesome-moe-inference

Curated collection of papers in MoE model inference

292 11 Updated Oct 20, 2025

perplexityai / pplx-kernels

Perplexity GPU Kernels

C++ 509 69 Updated Sep 19, 2025

BobMcDear / attorch

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 580 30 Updated Aug 12, 2025

ganler / code-r1

Reproducing R1 for Code with Reliable Rewards

Python 261 16 Updated May 5, 2025

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 2,841 190 Updated Oct 24, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 6,582 764 Updated Jun 25, 2025

jgong5 / llm_finetune_study

Python 2 Updated Apr 5, 2024

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 31,248 3,805 Updated Jul 23, 2024

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,728 274 Updated Jul 18, 2025

pengzhao-intel / oneAPI_course

oneAPI - Data Parallel C++ course for students

C++ 44 12 Updated Nov 4, 2024

HanGuo97 / flute

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

C++ 373 18 Updated Apr 13, 2025

meta-pytorch / segment-anything-fast

A batched offline inference oriented version of segment-anything

Python 1,251 75 Updated Aug 22, 2025

meta-pytorch / applied-ai

Applied AI experiments and examples for PyTorch

Python 301 29 Updated Aug 22, 2025

meta-pytorch / attention-gym

Helpful tools and examples for working with flex-attention

Python 1,034 64 Updated Oct 23, 2025

pytorch / torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,617 249 Updated Sep 10, 2025

meta-pytorch / torchcodec

PyTorch media decoding and encoding

Python 769 66 Updated Oct 26, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,600 573 Updated Oct 27, 2025

pytorch / examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 23,504 9,761 Updated Sep 1, 2025

facebookresearch / multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Python 1,658 157 Updated Oct 20, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,025 2,254 Updated Oct 25, 2025

Lightning-AI / litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 12,866 1,345 Updated Oct 20, 2025

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 10,677 1,176 Updated Oct 26, 2025

meta-pytorch / torchtune

PyTorch native post-training library

Python 5,561 680 Updated Oct 26, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 76,446 11,260 Updated Oct 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly