- nvidia developer
- docs
- source
- samples
- github: nvidia/cuda-samples
- repos
nvidia-smi --query-gpu=compute_cap --format=csv
compute_cap
8.6git clone https://github.com/NVIDIA/cuda-samples.git- Introduction: c++11_cuda
cd Samples/0_Introduction/c++11_cudamake HOST_COMPILER=clang++ SMS="86" dbg=1
make HOST_COMPILER=g++ SMS="86" dbg=1
make HOST_COMPILER=g++-13 SMS="86" dbg=1./c++11_cuda
GPU Device 0: "Ampere" with compute capability 8.6
Read 3223503 byte corpus from ./warandpeace.txt
counted 107310 instances of 'x', 'y', 'z', or 'w' in "./warandpeace.txt"- Hello CUDA: hello_cuda, hello_cuda with C++
- Thread: thread_layout
- Device: device_query
- Vector: vector_add
- Matrix
- TensorRT: tensorrt
- Sync: sync, streams + event