Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View beoy's full-sized avatar

Block or report beoy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 769 112 Updated Feb 27, 2026

A framework for efficient model inference with omni-modality models

Python 2,837 465 Updated Feb 27, 2026

Fast low-bit matmul kernels in Triton

Python 435 31 Updated Feb 1, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 86,185 13,066 Updated Feb 19, 2026

[Lumina具身智能社区] 具身智能技术指南 Embodied-AI-Guide

12,033 799 Updated Feb 26, 2026

Cloud native networking and network security

Go 7,092 1,545 Updated Feb 27, 2026

Tools for building GPU clusters

Shell 1,421 350 Updated Feb 23, 2026

A PyTorch native platform for training generative AI models

Python 5,097 719 Updated Feb 27, 2026

CUDA Python: Performance meets Productivity

Cython 3,174 250 Updated Feb 27, 2026

A next generation Python CMake adaptor and Python API for plugins

Python 445 82 Updated Feb 26, 2026

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 1,268 227 Updated Feb 25, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,151 880 Updated Feb 27, 2026

A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS

253 12 Updated May 6, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 104,389 11,930 Updated Feb 27, 2026

making the official triton tutorials actually comprehensible

Python 123 27 Updated Aug 25, 2025

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 1,008 102 Updated Jul 29, 2024

Fully open reproduction of DeepSeek-R1

Python 25,907 2,417 Updated Nov 24, 2025

Open-source search and retrieval database for AI applications.

Rust 26,345 2,077 Updated Feb 27, 2026

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python 28,242 4,558 Updated Feb 26, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,327 1,700 Updated Feb 27, 2026

PyZMQ: Python bindings for zeromq

Python 4,104 661 Updated Feb 2, 2026

Distributed Task Queue (development branch)

Python 28,154 4,965 Updated Feb 25, 2026

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 163,585 14,691 Updated Feb 27, 2026

a language for fast, portable data-parallel computation

C++ 6,571 1,093 Updated Feb 26, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 71,396 13,762 Updated Feb 27, 2026

Ghidra is a software reverse engineering (SRE) framework

Java 65,067 7,180 Updated Feb 25, 2026

TensorFlow/TensorRT integration

Jupyter Notebook 743 223 Updated Nov 30, 2023

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 15,266 1,706 Updated Jun 25, 2025

Protocol Buffers - Google's data interchange format

C++ 70,752 16,037 Updated Feb 27, 2026
Next