johnrachwan123

John Rachwan johnrachwan123

CTO @ Pruna AI

22 followers · 11 following

Starred repositories

PrunaAI / ai-efficiency-courses

Courses on building, compressing, evaluating, and deploying efficient AI models.

Jupyter Notebook 65 5 Updated Nov 10, 2025

enkeejunior1 / min-pi-flow

Python 52 3 Updated Nov 6, 2025

vlad-ds / anki-german-citizen-test

Questions and answers to the Germany Citizenship Test (Einbürgerungstest) in Anki format.

22 3 Updated Jul 20, 2025

lldacing / ComfyUI_BiRefNet_ll

Python 266 28 Updated Jun 1, 2025

partial-model-collapse-unlearning / pmc-unlearning

Implementation of our unlearning method "Partial Model Collapse" introduced in the paper: "Model Collapse Is Not a Bug but a Feature in Machine Unlearning for LLMs" (Preprint).

Python 27 Updated Jan 4, 2026

alexgenovese / docker-pruna

Download and Compile Any Diffusion Models in your Endpoint

Python 7 Updated Aug 21, 2025

quentin-py / awesome-pricing

A curated list of the best software pricing pages and useful resources for pricing research

15 2 Updated Nov 28, 2025

Neuralk-AI / TabBench

TabBench is a benchmark built to evaluate machine learning models on tabular data, focusing on real-world industry use cases.

Jupyter Notebook 108 1 Updated Sep 29, 2025

xuyang-liu16 / Awesome-Token-level-Model-Compression

📚 Collection of token-level model compression resources.

189 8 Updated Sep 3, 2025

PrunaAI / pruna

Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.

Python 1,075 77 Updated Jan 22, 2026

PrunaAI / replicate-example

Python 6 Updated Apr 7, 2025

PrunaAI / awesome-ai-efficiency

A curated list of materials on AI efficiency

203 19 Updated Dec 14, 2025

PrunaAI / ComfyUI_pruna

This is a ComfyUI node that integrates pruna

Python 65 3 Updated Sep 8, 2025

PrunaAI / tritonserver

This repository describes how to use pruna with tritonserver

Python 7 Updated May 28, 2025

wangkai930418 / awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

2,124 98 Updated Jan 23, 2026

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,461 931 Updated Jan 18, 2026

xlite-dev / Awesome-DiT-Inference

📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉

Python 504 25 Updated Jan 18, 2026

merrymercy / awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,719 324 Updated Oct 19, 2024

johannaSommer / generalization-neural-co-solvers

Official Repository for the ICLR 2022 paper "Generalization of Neural Combinatorial Solvers through the Lens of Adversarial Robustness"

Jupyter Notebook 14 1 Updated Nov 20, 2022

TUM-DAML / MAGNet

Official Implementation of the Paper "MAGNet: Motif-Agnostic Generation of Molecules from Shapes"

Python 15 Updated Nov 25, 2023

johnrachwan123 / Winning-The-Lottery-Ahead-of-Time

Code for Winning the Lottery Ahead of Time: Efficient Early Network Pruning (ICML 2022)

Python 30 3 Updated Nov 15, 2023

HigherOrderCO / Bend

A massively parallel, high-level programming language

Rust 19,144 469 Updated Jun 3, 2025

openai / openai-cookbook

Examples and guides for using the OpenAI API

Jupyter Notebook 71,101 11,895 Updated Jan 22, 2026

Zeqiang-Lai / OpenDMD

Open source implementation and models of One-step Diffusion with Distribution Matching Distillation

Python 180 14 Updated May 26, 2024

HuangOwen / Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

1,761 117 Updated Nov 10, 2025

SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving for Local Deployment

C++ 8,591 477 Updated Aug 2, 2025

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,935 337 Updated Jan 18, 2026

leetcode-mafia / cheetah

Mac app for crushing tech interviews with AI

Swift 4,265 304 Updated Jan 14, 2025

Efficient-ML / Awesome-Efficient-AIGC

A list of papers, docs, and code about efficient AIGC. This repo aims to provide information for efficient AIGC research, including language and vision; we are continuously improving the project. Wel…

204 10 Updated Feb 10, 2025

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,718 2,037 Updated Jan 24, 2026

spike-sorting