Thanks to visit codestin.com
Credit goes to github.com

Skip to content
#

fp16

Here are 32 public repositories matching this topic...

A reproducible GPU benchmarking lab that compares FP16 vs FP32 training on MNIST using PyTorch, CuPy, and Nsight profiling tools. This project blends performance engineering with cinematic storytelling—featuring NVTX-tagged training loops, fused CuPy kernels, and a profiler-driven README that narrates the GPU’s inner workings frame by frame.

  • Updated Sep 5, 2025
  • Python

Improve this page

Add a description, image, and links to the fp16 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the fp16 topic, visit your repo's landing page and select "manage topics."

Learn more