Repositories list
645 repositories
TensorRT-LLM
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate inference execution in a performant way.
- Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
- A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, and vLLM to optimize inference speed.
- Optimized primitives for collective multi-GPU communication
- NVIDIA Federated Learning Application Runtime Environment
- cuEquivariance is a math library providing a collection of low-level primitives and tensor ops to accelerate widely used models based on equivariant neural networks, such as DiffDock, MACE, Allegro, and NEQUIP. It also includes kernels for accelerated structure prediction.
- A Python framework for accelerated simulation, data generation and spatial computing.
- Ongoing research training transformer models at scale
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. It uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts, and images that you can use in downstream generative applications.
- BioNeMo Framework: For building and adapting AI models in drug discovery at scale
- NVIDIA device plugin for Kubernetes
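The device plugin's role can be illustrated with a minimal pod spec: once the plugin is deployed to a cluster, containers request GPUs through the extended `nvidia.com/gpu` resource in their limits. A sketch (the pod name and CUDA image tag are illustrative, not prescribed by the plugin):

```yaml
# Hypothetical pod requesting one GPU via the device plugin's
# extended resource. GPU resources are specified under limits.
apiVersion: v1
kind: Pod
metadata:
  name: gpu-example
spec:
  restartPolicy: Never
  containers:
    - name: cuda-container
      image: nvidia/cuda:12.4.1-base-ubuntu22.04   # illustrative image
      command: ["nvidia-smi"]
      resources:
        limits:
          nvidia.com/gpu: 1   # one GPU, allocated by the device plugin
```

The scheduler places the pod only on nodes where the plugin has advertised available `nvidia.com/gpu` capacity.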