pmixer
㊗️
stay safe

Highlights

  • Pro

Showing results
Python 7 Updated Dec 19, 2025

A data processing framework

Python 11 Updated Jul 19, 2017

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,804 98 Updated Jan 12, 2026

lab of map reduce

Go 1 Updated Dec 4, 2025

MineContext is your proactive context-aware AI partner (Context-Engineering + ChatGPT Pulse)

Python 4,754 337 Updated Jan 5, 2026

Offline optimization of your disaggregated Dynamo graph

Python 147 49 Updated Jan 13, 2026
C++ 32 2 Updated Jul 2, 2025

A list of papers for Graph Retrieval-Augmented Generation (GraphRAG).

9 Updated Mar 13, 2025

🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. Hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.

2,821 184 Updated Aug 5, 2025

CUDA Core Compute Libraries

C++ 2,121 320 Updated Jan 13, 2026

A song aesthetic evaluation toolkit trained on SongEval.

Python 269 22 Updated Jun 15, 2025

Minimalistic 4D-parallelism distributed training framework for educational purposes

Python 1,943 152 Updated Aug 26, 2025

Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

Python 202 41 Updated Jan 12, 2026

This package contains the original 2012 AlexNet code.

Cuda 2,816 365 Updated Mar 12, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,768 776 Updated Jan 13, 2026

OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.

Python 469 31 Updated Nov 23, 2025

A faster int-to-int hashmap implemented in C++.

C++ 50 9 Updated Jan 6, 2025

Democratizing AlphaFold3: a PyTorch reimplementation to accelerate protein structure prediction

Python 54 10 Updated Dec 16, 2024

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 1,803 234 Updated Jan 13, 2026

Simple CUDA Examples

Cuda 3 Updated Jan 5, 2025

This is a Chinese translation of the CUDA programming guide

1,817 272 Updated Nov 13, 2024

ONNX Python Examples

Dockerfile 16 6 Updated Sep 13, 2022

The Triton TensorRT-LLM Backend

911 133 Updated Jan 12, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,613 2,005 Updated Jan 13, 2026

Whisper in TensorRT-LLM

C++ 17 2 Updated Sep 21, 2023
Python 622 57 Updated Jul 31, 2024

ASR client for Triton ASR Service

Python 36 8 Updated Jan 12, 2026
C 1 1 Updated Mar 4, 2023