Nano vLLM

Python 9,950 1,249 Updated Nov 3, 2025

Build RL environments for LLM training

Python 521 31 Updated Dec 21, 2025

A PyTorch native platform for training generative AI models

Python 4,863 648 Updated Dec 22, 2025

Python 614 58 Updated Dec 19, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 336 34 Updated Dec 21, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 931 87 Updated Sep 23, 2025

My learning notes for ML SYS.

Python 4,748 300 Updated Dec 22, 2025

Post-training with Tinker

Python 2,599 254 Updated Dec 20, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,072 522 Updated May 5, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,694 1,355 Updated Dec 17, 2025

VS Code extension for Pyroscope continuous profiling with inline metrics and heat maps

TypeScript 10 Updated Oct 31, 2025

Thermodynamic Hypergraphical Model Library in JAX

Python 966 119 Updated Nov 16, 2025

PyTorch-native post-training at scale

Python 572 71 Updated Dec 22, 2025

Expert Parallelism Load Balancer

Python 1,321 195 Updated Mar 24, 2025

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 692 89 Updated Dec 22, 2025

Contexts Optical Compression

Python 21,530 1,926 Updated Oct 25, 2025

Fastest kernels written from scratch

Cuda 500 61 Updated Sep 18, 2025

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Python 779 51 Updated Oct 15, 2025

The best ChatGPT that $100 can buy.

Python 39,046 4,950 Updated Dec 9, 2025

A novel data compression framework

C 2,854 122 Updated Dec 22, 2025

Perplexity GPU Kernels

C++ 540 74 Updated Nov 7, 2025

Simple high-throughput inference library

Python 153 10 Updated May 14, 2025

A Lightweight LLM Post-Training Library

Python 2,043 204 Updated Dec 22, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,670 751 Updated Dec 22, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,463 478 Updated Dec 22, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,702 2,867 Updated Dec 22, 2025

Language modeling with linear-cost context

Jupyter Notebook 114 12 Updated Sep 25, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,944 12,122 Updated Dec 22, 2025

Ongoing research training transformer models at scale

Python 14,671 3,404 Updated Dec 22, 2025

The Meta Spatial SDK Samples is a collection of code samples and projects that demonstrate the capabilities of the Meta Spatial SDK. Meta Spatial SDK enables mobile developers to build Meta Horizon…

Kotlin 242 57 Updated Dec 18, 2025