Thanks to visit codestin.com
Credit goes to github.com

Skip to content

v0.30.1

Latest

Choose a tag to compare

@angeloskath angeloskath released this 18 Dec 00:32
· 13 commits to main since this release
c215b6f

Highlights

  • RDMA over thunderbolt with the JACCL backend (macOS >= 26.2) (some numbers)
  • NAX with JIT so that they can be used in MLX Swift
  • CUDA improvements
    • Many improvements to SDPA (masking, T_q != T_kv)
    • Faster quantize/dequantize
    • QQMM to make use of faster tensor cores
    • Fix in col reduce speeds up training

What's Changed

New Contributors

Full Changelog: v0.30.0...v0.30.1