Codestin Search App

v2.17.2

Check if sufficient GPUs are available

The CUDA error message "Test CUDA failure util.cu:706 'invalid device ordinal'"
is not as helpful. Test this explicitly and guide the user.

Oct 2, 2025
abc4677
zip
tar.gz

v2.17.1

Fix compilation for old NCCL versions

Fix compilation failure on ctaPolicy with NCCL <= 2.26.
Fix compilation failure on local_register with NCCL <= 2.18.
Fix ctaPolicy behavior if the tests are compiled with NCCL <= 2.26
but run with NCCL >= 2.27.

Sep 5, 2025
9a5c154
zip
tar.gz

v2.17.0

Update to align with the NCCL 2.28 release

Added Device API infrastructure and example kernels
Two new command line arguments:

  -D <num> device kernel implementation to use <0/1/2/3/4>
  -V <num> number of CTAs to launch device kernels with

Added new CTA Policy command line option:

  -x <policy> set the CTA Policy <0/1/2>

Sep 5, 2025
e12dbb0
zip
tar.gz

v2.16.9

Update NVCUFLAGS and CXXFLAGS to use -std=c++14

Aug 29, 2025
c2cb96f
zip
tar.gz

v2.16.8

Modified warmup to run for more message sizes

Loops between minBytes and maxBytes doubling size each time

Reduced default warmup iteration count to 1 (was 5)

Aug 25, 2025
f2015cb
zip
tar.gz

v2.16.7

Merge pull request #316 from martin-belanger/print-program-name

Print the name of the program being executed before and after test output

Jul 24, 2025
fae7cb4
zip
tar.gz

v2.16.6

Add extra reserved space during maxBytes calculation

Also, don't allow minBytes > maxBytes

Jul 23, 2025
6edafa0
zip
tar.gz

v2.16.5

Minor fix to Makefile

Move comments to separate lines

Jul 23, 2025
def2d36
zip
tar.gz

v2.16.4

Add Turing (SM75) support to CUDA 13.0 builds

Jun 5, 2025
97ee098
zip
tar.gz

v2.16.3

Wrap ncclCommWindowRegister() calls within ncclGroup

Jun 3, 2025
e7c8825
zip
tar.gz

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v2.17.2

v2.17.1

v2.17.0

v2.16.9

v2.16.8

v2.16.7

v2.16.6

v2.16.5

v2.16.4

v2.16.3

Tags: NVIDIA/nccl-tests