Thanks to visit codestin.com
Credit goes to github.com

Skip to content
Change the repository type filter

All

    Repositories list

    • NeMo-Agent-Toolkit

      Public
      The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      4621.6k6334Updated Dec 19, 2025Dec 19, 2025
    • cuda-python

      Public
      CUDA Python: Performance meets Productivity
      Cython
      2333.1k20216Updated Dec 19, 2025Dec 19, 2025
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      74367210214Updated Dec 19, 2025Dec 19, 2025
    • TensorRT-LLM

      Public
      TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      Python
      2k12k538477Updated Dec 19, 2025Dec 19, 2025
    • gpu-operator

      Public
      NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4312.5k9476Updated Dec 19, 2025Dec 19, 2025
    • Model-Optimizer

      Public
      A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
      Python
      2181.7k5552Updated Dec 19, 2025Dec 19, 2025
    • JAX-Toolbox

      Public
      JAX-Toolbox
      Python
      683688039Updated Dec 19, 2025Dec 19, 2025
    • cuCollections

      Public
      C++
      1026065513Updated Dec 19, 2025Dec 19, 2025
    • bionemo-framework

      Public
      BioNeMo Framework: For building and adapting AI models in drug discovery at scale
      Jupyter Notebook
      10860760109Updated Dec 19, 2025Dec 19, 2025
    • TransformerEngine

      Public
      A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
      Python
      5833k280101Updated Dec 19, 2025Dec 19, 2025
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      3092.1k1.1k200Updated Dec 19, 2025Dec 19, 2025
    • NV-Kernels

      Public
      Ubuntu kernels which are optimized for NVIDIA server systems
      C
      477307Updated Dec 19, 2025Dec 19, 2025
    • Megatron-LM

      Public
      Ongoing research training transformer models at scale
      Python
      3.4k15k344250Updated Dec 19, 2025Dec 19, 2025
    • cuda-quantum

      Public
      C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      31387640580Updated Dec 19, 2025Dec 19, 2025
    • nvidia-resiliency-ext

      Public
      NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures and interruptions.
      Python
      39239217Updated Dec 19, 2025Dec 19, 2025
    • aistore

      Public
      AIStore: scalable storage for AI applications
      Go
      2311.7k10Updated Dec 19, 2025Dec 19, 2025
    • physicsnemo

      Public
      Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
      Python
      5202.2k3944Updated Dec 19, 2025Dec 19, 2025
    • apex

      Public
      A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
      Python
      1.5k8.9k67779Updated Dec 19, 2025Dec 19, 2025
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      1046268534Updated Dec 19, 2025Dec 19, 2025
    • NVSentinel

      Public
      NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      29127348Updated Dec 19, 2025Dec 19, 2025
    • TensorRT-Incubator

      Public
      Experimental projects related to TensorRT
      MLIR
      221173711Updated Dec 19, 2025Dec 19, 2025
    • DALI

      Public
      A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
      C++
      6555.6k22226Updated Dec 19, 2025Dec 19, 2025
    • OSMO

      Public
      The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML
      Python
      6611412Updated Dec 19, 2025Dec 19, 2025
    • OWL

      Public
      The OptiX Wrappers Library
      C++
      31700Updated Dec 19, 2025Dec 19, 2025
    • stdexec

      Public
      `std::execution`, the proposed C++ framework for asynchronous and parallel programming.
      C++
      2222.1k11413Updated Dec 19, 2025Dec 19, 2025
    • earth2studio

      Public
      Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
      Python
      85316107Updated Dec 19, 2025Dec 19, 2025
    • edk2-edkrepo-manifest

      Public
      NVIDIA fork of tianocore/edk2-edkrepo-manifest
      6601Updated Dec 19, 2025Dec 19, 2025
    • edk2-nvidia

      Public
      NVIDIA EDK2 platform support
      C
      53118166Updated Dec 19, 2025Dec 19, 2025
    • edk2

      Public
      NVIDIA fork of tianocore/edk2
      C
      1626015Updated Dec 19, 2025Dec 19, 2025
    • edk2-nvidia-non-osi

      Public
      NVIDIA EDK2 non-OSI licensed content
      BitBake
      2400Updated Dec 19, 2025Dec 19, 2025