Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Tags: microsoft/msccl

Tags

v0.7.4

Toggle v0.7.4's commit message
fixing memory ordering issue -- clipping -- adding ncclGetLastError

v0.7.3

Toggle v0.7.3's commit message
MSCCL with CUDA graph support

v0.6.3

Toggle v0.6.3's commit message
well optimized MSCCL interpreter with NCCL 2.8.4

v0.7.2

Toggle v0.7.2's commit message
0.7.2 MSCCL 2.12.12 with CUDA graphs and reduced compilation time

v0.7.1

Toggle v0.7.1's commit message
MSCCL 2.12 with CUDA graph support

v0.7

Toggle v0.7's commit message
MSCCL with NCCL 2.12

v1.0

Toggle v1.0's commit message
fully capable MSCCL runtime

v0.6.2

Toggle v0.6.2's commit message
increasing the limit for ar_ll128

v0.6.1

Toggle v0.6.1's commit message
minor bug fix for how scratchpad is allocated

v0.6

Toggle v0.6's commit message
MSCCL optimized allreduce for up-to 256KB with 8xA100