Thanks to visit codestin.com
Credit goes to github.com

Skip to content
Change the repository type filter

All

    Repositories list

    • cuda-quantum

      Public
      C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      31387640580Updated Dec 20, 2025Dec 20, 2025
    • OSMO

      Public
      The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML
      Python
      6611412Updated Dec 20, 2025Dec 20, 2025
    • TensorRT-Incubator

      Public
      Experimental projects related to TensorRT
      MLIR
      221163711Updated Dec 20, 2025Dec 20, 2025
    • Model-Optimizer

      Public
      A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
      Python
      2181.7k5551Updated Dec 20, 2025Dec 20, 2025
    • cudnn-frontend

      Public
      cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
      C++
      138656361Updated Dec 20, 2025Dec 20, 2025
    • k8s-driver-manager

      Public
      The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.
      Go
      204655Updated Dec 20, 2025Dec 20, 2025
    • bionemo-framework

      Public
      BioNeMo Framework: For building and adapting AI models in drug discovery at scale
      Jupyter Notebook
      10860761109Updated Dec 20, 2025Dec 20, 2025
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      3092.1k1.1k198Updated Dec 20, 2025Dec 20, 2025
    • Ongoing research training transformer models at scale
      Python
      3.4k15k344249Updated Dec 20, 2025Dec 20, 2025
    • NeMo-Agent-Toolkit

      Public
      The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      4631.6k6330Updated Dec 19, 2025Dec 19, 2025
    • skyhook

      Public
      A Kubernetes Operator to manage Node OS customizations.
      Go
      33400Updated Dec 19, 2025Dec 19, 2025
    • nvidia-monitor-eventing

      Public
      Monitor and react to platform events
      C++
      0500Updated Dec 19, 2025Dec 19, 2025
    • NVFlare

      Public
      NVIDIA Federated Learning Application Runtime Environment
      Python
      2268521519Updated Dec 19, 2025Dec 19, 2025
    • cuCollections

      Public
      C++
      1026065513Updated Dec 19, 2025Dec 19, 2025
    • NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      29127349Updated Dec 19, 2025Dec 19, 2025
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      74367210211Updated Dec 19, 2025Dec 19, 2025
    • TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      Python
      2k12k535479Updated Dec 19, 2025Dec 19, 2025
    • warp

      Public
      A Python framework for accelerated simulation, data generation and spatial computing.
      Python
      4005.9k1763Updated Dec 19, 2025Dec 19, 2025
    • NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4312.5k9476Updated Dec 19, 2025Dec 19, 2025
    • The CUDA target for Numba
      Python
      5123310125Updated Dec 19, 2025Dec 19, 2025
    • RAPIDS Accelerator JNI For Apache Spark
      Cuda
      77528710Updated Dec 19, 2025Dec 19, 2025
    • nv-ingest

      Public
      NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
      Python
      2802.8k10131Updated Dec 19, 2025Dec 19, 2025
    • NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures and interruptions.
      Python
      39239217Updated Dec 19, 2025Dec 19, 2025
    • CUDA Python: Performance meets Productivity
      Cython
      2333.1k20216Updated Dec 19, 2025Dec 19, 2025
    • The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.
      Shell
      671482539Updated Dec 19, 2025Dec 19, 2025
    • This repo contains CUDA-Q Academic materials, including self-paced Jupyter notebook modules for building and optimizing hybrid quantum-classical algorithms using CUDA-Q.
      Jupyter Notebook
      6623428Updated Dec 19, 2025Dec 19, 2025
    • TileGym

      Public
      Helpful kernel tutorials and examples for tile-based GPU programming
      Python
      2345601Updated Dec 19, 2025Dec 19, 2025
    • Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
      Python
      85316106Updated Dec 19, 2025Dec 19, 2025
    • HPC Container Maker
      Python
      99500144Updated Dec 19, 2025Dec 19, 2025
    • aistore

      Public
      AIStore: scalable storage for AI applications
      Go
      2311.7k10Updated Dec 19, 2025Dec 19, 2025