🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

2,821 184 Updated Aug 5, 2025

NVIDIA / cccl

CUDA Core Compute Libraries

C++ 2,121 320 Updated Jan 13, 2026

ASLP-lab / SongEval

A song aesthetic evaluation toolkit trained on SongEval.

Python 269 22 Updated Jun 15, 2025

wangshusen / RecommenderSystem

3,820 522 Updated Feb 7, 2024

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,943 152 Updated Aug 26, 2025

NVIDIA / recsys-examples

Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

Python 202 41 Updated Jan 12, 2026

computerhistory / AlexNet-Source-Code

This package contains the original 2012 AlexNet code.

Cuda 2,816 365 Updated Mar 12, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,768 776 Updated Jan 13, 2026

ASLP-lab / OSUM

OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.

Python 469 31 Updated Nov 23, 2025

slow-steppers / NeighborHash

A faster int-to-int hashmap implemented in C++.

C++ 50 9 Updated Jan 6, 2025

Shenggan / xfold

Democratizing AlphaFold3: an PyTorch reimplementation to accelerate protein structure prediction

Python 54 10 Updated Dec 16, 2024

NVIDIA / Model-Optimizer

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 1,803 234 Updated Jan 13, 2026

jeng1220 / cuda_examples

Simple CUDA Examples

Cuda 3 Updated Jan 5, 2025

HeKun-NVIDIA / CUDA-Programming-Guide-in-Chinese

This is a Chinese translation of the CUDA programming guide

1,817 272 Updated Nov 13, 2024

leimao / ONNX-Python-Examples

ONNX Python Examples

Dockerfile 16 6 Updated Sep 13, 2022

triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend

911 133 Updated Jan 12, 2026

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,613 2,005 Updated Jan 13, 2026

EdVince / whisper-trtllm

Whisper in TensorRT-LLM

C++ 17 2 Updated Sep 21, 2023

Tlntin / Qwen-TensorRT-LLM

Python 622 57 Updated Jul 31, 2024

yuekaizhang / Triton-ASR-Client

ASR client for Triton ASR Service

Python 36 8 Updated Jan 12, 2026

DC-Shi / cudaNppSample

C 1 1 Updated Mar 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

黄(Huáng)瓒(Zàn) pmixer

Achievements

Achievements

Highlights

Block or report pmixer

Stars

benenzhu / learn-ptx

yjmade / datapipe

NVIDIA / cutile-python

thynics / MapReduce

volcengine / MineContext

SJTU-ReArch-Group / Paper-Reading-List

ai-dynamo / aiconfigurator

cherichy / tilecute

SJTU-Liquid / Awesome-GraphRAG

Meirtz / Awesome-Context-Engineering