    Repositories list

    • nv-ingest

      Public
      NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. It uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts, and images for use in downstream generative applications.
      Python
      Updated Dec 19, 2025
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      Updated Dec 19, 2025
    • Ongoing research on training transformer models at scale
      Python
      Updated Dec 19, 2025
    • CUDA Python: Performance meets Productivity
      Cython
      Updated Dec 19, 2025
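      A minimal sketch of the low-level driver-API bindings, assuming the classic "from cuda import cuda" namespace (newer releases also expose the same bindings under cuda.bindings); each call returns a tuple whose first element is the CUresult error code:

          # query the number of visible CUDA devices via the driver API
          from cuda import cuda

          (err,) = cuda.cuInit(0)
          assert err == cuda.CUresult.CUDA_SUCCESS

          err, n_devices = cuda.cuDeviceGetCount()
          assert err == cuda.CUresult.CUDA_SUCCESS
          print(f"CUDA devices visible: {n_devices}")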
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      Updated Dec 19, 2025
    • The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      Updated Dec 19, 2025
    • physicsnemo

      Public
      Open-source deep-learning framework for building, training, and fine-tuning models with state-of-the-art Physics-ML methods
      Python
      Updated Dec 19, 2025
    • cloudai

      Public
      CloudAI Benchmark Framework
      Python
      Updated Dec 19, 2025
    • NVSentinel is a cross-platform fault-remediation service designed to rapidly resolve runtime node-level issues in GPU-accelerated computing environments.
      Go
      Updated Dec 19, 2025
    • TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
      Python
      Updated Dec 19, 2025
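      A minimal sketch of the high-level Python LLM API described above, assuming the documented tensorrt_llm.LLM entry point; the Hugging Face checkpoint name is only a placeholder:

          from tensorrt_llm import LLM, SamplingParams

          # any supported checkpoint works; this name is a placeholder
          llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")
          params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

          outputs = llm.generate(["What is the capital of France?"], params)
          for out in outputs:
              print(out.outputs[0].text)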
    • Model-Optimizer

      Public
      A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, and speculative decoding. It compresses deep learning models for downstream deployment frameworks such as TensorRT-LLM, TensorRT, and vLLM to optimize inference speed.
      Python
      Updated Dec 19, 2025
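      A rough sketch of post-training INT8 quantization with modelopt.torch.quantization, assuming the mtq.quantize(model, config, forward_loop) entry point and the INT8_DEFAULT_CFG preset from the Model Optimizer docs; the toy model and calibration data are stand-ins:

          import torch
          import modelopt.torch.quantization as mtq

          # toy model and calibration batches; real use passes the production model
          model = torch.nn.Sequential(
              torch.nn.Linear(64, 64), torch.nn.ReLU(), torch.nn.Linear(64, 8)
          )
          calib_data = [torch.randn(4, 64) for _ in range(8)]

          def forward_loop(m):
              # run representative batches to collect calibration statistics
              for batch in calib_data:
                  m(batch)

          model = mtq.quantize(model, mtq.INT8_DEFAULT_CFG, forward_loop)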
    • JAX-Toolbox
      Python
      Updated Dec 19, 2025
    • kokoro

      Public
      JavaScript
      Updated Dec 19, 2025
    • The CUDA target for Numba
      Python
      Updated Dec 19, 2025
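      A minimal sketch of a Numba CUDA kernel, assuming the usual "from numba import cuda" target that this package provides:

          import numpy as np
          from numba import cuda

          @cuda.jit
          def axpy(a, x, y, out):
              # one thread per element
              i = cuda.grid(1)
              if i < out.size:
                  out[i] = a * x[i] + y[i]

          n = 1 << 20
          x = np.random.rand(n).astype(np.float32)
          y = np.random.rand(n).astype(np.float32)
          out = np.zeros_like(x)

          threads = 256
          blocks = (n + threads - 1) // threads
          # NumPy arrays are copied to and from the GPU implicitly
          axpy[blocks, threads](2.0, x, y, out)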
    • Ubuntu kernels optimized for NVIDIA server systems
      C
      Updated Dec 19, 2025
    • Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools
      Python
      Updated Dec 19, 2025
    • DALI

      Public
      A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
      C++
      Updated Dec 19, 2025
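      A minimal sketch of a DALI image-loading pipeline using the documented pipeline_def decorator; the data directory is a placeholder:

          from nvidia.dali import pipeline_def
          import nvidia.dali.fn as fn
          import nvidia.dali.types as types

          @pipeline_def(batch_size=32, num_threads=4, device_id=0)
          def image_pipeline(data_dir):
              # read JPEG files and labels from a directory tree
              jpegs, labels = fn.readers.file(file_root=data_dir, random_shuffle=True)
              # "mixed" decodes on the GPU; resize also runs on the GPU
              images = fn.decoders.image(jpegs, device="mixed", output_type=types.RGB)
              images = fn.resize(images, resize_x=224, resize_y=224)
              return images, labels

          pipe = image_pipeline("/path/to/images")  # placeholder path
          pipe.build()
          images, labels = pipe.run()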
    • NVTX

      Public
      The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.
      C++
      Updated Dec 19, 2025
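      The core NVTX API is C, but the same project ships Python bindings; a minimal sketch of annotating ranges with the nvtx package (the ranges appear in Nsight Systems timelines):

          import time
          import nvtx

          @nvtx.annotate("preprocess", color="green")
          def preprocess():
              time.sleep(0.01)  # stand-in for real work

          with nvtx.annotate("training epoch", color="blue"):
              for _ in range(3):
                  preprocess()

          # explicit push/pop ranges are also available
          nvtx.push_range("checkpoint")
          time.sleep(0.01)
          nvtx.pop_range()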
    • C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      Updated Dec 19, 2025
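      A minimal sketch of the Python side of the CUDA Quantum (CUDA-Q) programming model, building and sampling a Bell state with the documented cudaq.kernel decorator:

          import cudaq

          @cudaq.kernel
          def bell():
              qubits = cudaq.qvector(2)     # allocate two qubits
              h(qubits[0])                  # Hadamard on qubit 0
              x.ctrl(qubits[0], qubits[1])  # controlled-X entangles the pair
              mz(qubits)                    # measure in the Z basis

          counts = cudaq.sample(bell, shots_count=1000)
          print(counts)  # expect roughly half "00" and half "11"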
    • edk2

      Public
      NVIDIA fork of tianocore/edk2
      C
      Updated Dec 19, 2025
    • C++
      Updated Dec 19, 2025
    • cuEquivariance is a math library of low-level primitives and tensor ops that accelerates widely used equivariant-neural-network models such as DiffDock, MACE, Allegro, and NEQUIP. It also includes kernels for accelerated structure prediction.
      Python
      Updated Dec 19, 2025
    • NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      Updated Dec 19, 2025
    • doca-platform

      Public
      DOCA Platform manages provisioning and service orchestration for Bluefield DPUs
      Go
      Updated Dec 19, 2025
    • AMGX

      Public
      Distributed multigrid linear solver library on GPU
      Cuda
      Updated Dec 19, 2025
    • Experimental projects related to TensorRT
      MLIR
      Updated Dec 19, 2025
    • earth2studio

      Public
      Open-source deep-learning framework for exploring, building, and deploying AI weather/climate workflows.
      Python
      Updated Dec 19, 2025
    • TileGym

      Public
      Helpful kernel tutorials and examples for tile-based GPU programming
      Python
      Updated Dec 19, 2025
    • stdexec

      Public
      `std::execution`, the proposed C++ framework for asynchronous and parallel programming.
      C++
      Updated Dec 19, 2025
    • OSMO

      Public
      The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute, unifying training GPUs, simulation clusters, and edge devices in a simple YAML spec
      Python
      Updated Dec 19, 2025