Repositories list
643 repositories
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
- A unified library of SOTA model optimization techniques such as quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, and vLLM to optimize inference speed.
- BioNeMo Framework: For building and adapting AI models in drug discovery at scale
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada, and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
- CUDA Core Compute Libraries
- Ongoing research training transformer models at scale
- C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
- AIStore: scalable storage for AI applications
- Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
- GPU-accelerated decision optimization
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
- Open-source deep-learning framework for exploring, building, and deploying AI weather/climate workflows.
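Several entries above mention quantization as a core compression technique. As a rough illustration of the idea (not the API of any library listed here), the following is a minimal sketch of symmetric per-tensor int8 weight quantization in plain NumPy; all function names are illustrative:

```python
import numpy as np

def quantize_int8(weights):
    """Symmetric per-tensor int8 quantization: map floats onto [-127, 127]."""
    scale = np.max(np.abs(weights)) / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q, scale):
    """Recover approximate float weights from int8 values and the scale."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.normal(size=(4, 4)).astype(np.float32)
q, s = quantize_int8(w)
w_hat = dequantize_int8(q, s)
# Per-tensor symmetric rounding bounds the error by half a quantization step.
max_err = float(np.max(np.abs(w - w_hat)))
```

Production quantizers (e.g. in the model-optimization library above) additionally use per-channel scales, calibration data, and quantization-aware training to limit accuracy loss; this sketch only shows the core rounding-and-rescaling step.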