Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Tags: NVIDIA/nccl-tests

Tags

v2.17.2

Toggle v2.17.2's commit message
Check if sufficient GPUs are available

The CUDA error message "Test CUDA failure util.cu:706 'invalid device ordinal'"
is not as helpful. Test this explicitly and guide the user.

v2.17.1

Toggle v2.17.1's commit message
Fix compilation for old NCCL versions

Fix compilation failure on ctaPolicy with NCCL <= 2.26.
Fix compilation failure on local_register with NCCL <= 2.18.
Fix ctaPolicy behavior if the tests are compiled with NCCL <= 2.26
but run with NCCL >= 2.27.

v2.17.0

Toggle v2.17.0's commit message
Update to align with the NCCL 2.28 release

Added Device API infrastructure and example kernels
Two new command line arguments:

  -D <num> device kernel implementation to use <0/1/2/3/4>
  -V <num> number of CTAs to launch device kernels with

Added new CTA Policy command line option:

  -x <policy> set the CTA Policy <0/1/2>

v2.16.9

Toggle v2.16.9's commit message
Update NVCUFLAGS and CXXFLAGS to use -std=c++14

v2.16.8

Toggle v2.16.8's commit message
Modified warmup to run for more message sizes

Loops between minBytes and maxBytes doubling size each time

Reduced default warmup iteration count to 1 (was 5)

v2.16.7

Toggle v2.16.7's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Merge pull request #316 from martin-belanger/print-program-name

Print the name of the program being executed before and after test output

v2.16.6

Toggle v2.16.6's commit message
Add extra reserved space during maxBytes calculation

Also, don't allow minBytes > maxBytes

v2.16.5

Toggle v2.16.5's commit message
Minor fix to Makefile

Move comments to separate lines

v2.16.4

Toggle v2.16.4's commit message
Add Turing (SM75) support to CUDA 13.0 builds

v2.16.3

Toggle v2.16.3's commit message
Wrap ncclCommWindowRegister() calls within ncclGroup