Releases: gomlx/gopjrt
v0.10.0 - Added XLA Shardy support.
- Package `cmd/gopjrt_installer`:
  - Link `libcublasLt.so.13` and `libcublas.so.13` to the `lib` subdirectory of the installation directory given.
    Nvidia needs them for some models, but doesn't know how to find them within the provided SDK path.
- Package `xlabuilder`: removed from CI: it requires manual installation of the older library version.
- Package `pjrt`:
  - Fixed wrong struct size set for `PJRT_Event_Destroy_Args`, in #65 (@timkaye11)
  - Fixed `buffer.Destroy` to release the `client` pointer in the wrapper.
  - Added a required `runtime.KeepAlive(program)` on a CGO call to compile `program`.
  - Added new Shardy support for distributed (across multiple devices) execution.
  - Added old SPMD support (see `pjrt.Compile().WithSPMD`)
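The `runtime.KeepAlive` fix above follows a general Go pattern for wrappers with finalizers. The sketch below is only an illustration of that pattern, not gopjrt's code: the `handle` type and the `newHandle`, `checksum` and `compile` functions are hypothetical, and a plain Go function stands in for the CGO call.

```go
package main

import (
	"fmt"
	"runtime"
)

// handle is a hypothetical stand-in for a Go wrapper around memory that a
// foreign (C) call reads. It is not a gopjrt type.
type handle struct {
	data []byte
}

// newHandle attaches a finalizer that releases the resource once the
// garbage collector finds the wrapper unreachable.
func newHandle(data []byte) *handle {
	h := &handle{data: data}
	runtime.SetFinalizer(h, func(h *handle) { h.data = nil })
	return h
}

// checksum stands in for the foreign call that only sees the raw memory.
func checksum(raw []byte) int {
	sum := 0
	for _, b := range raw {
		sum += int(b)
	}
	return sum
}

// compile hands only h.data to the "foreign" call. Without KeepAlive, the
// runtime may consider h unreachable right after h.data is read and run the
// finalizer while the call is still in flight.
func compile(h *handle) int {
	raw := h.data
	runtime.GC() // mimic a collection happening in the middle of the call
	result := checksum(raw)
	runtime.KeepAlive(h) // h stays reachable until this point
	return result
}

func main() {
	fmt.Println(compile(newHandle([]byte{1, 2, 3}))) // prints 6
}
```

With real C memory, a finalizer that runs too early would free the buffer under the callee, which is why the call is followed by `runtime.KeepAlive`.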
v0.9.1 2025/11/07: More multi-device support; updated CPU PJRT; dropped static CPU PJRT linking.
- Package `pjrt`:
  - Added `Buffer.CopyToDevice()`.
- Updated and rebuilt XLA's CPU PJRT and XlaBuilder for new release.
- Static builds of PJRT CPU no longer working (Bazel issues).
- XlaBuilder no longer included – making the binary packages smaller.
v0.9.0 2025/11/06: Multi-device SPMD execution; TPU installation support; Deprecating `xlabuilder`.
- `gopjrt_installer`: Added TPU installation.
- Package `pjrt`:
  - Renamed `Client.Devices` to `Client.AllDevices`.
  - Added `Client.NumDevices` to get the number of addressable devices.
  - Added `Compile().WithSPMD` for multi-device SPMD execution.
  - Default compilation is "portable" (can be run in any device), but it allows for device specific as well.
  - Arena pools for powers-of-2 sizes.
- Replaced `xlabuilder` with `stablehlo` in examples:
  - Enabled darwin tests using `stablehlo`
- Updated PJRT C API and protos from XLA sources.
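The "arena pools for powers-of-2 sizes" item can be pictured with a small sketch. The code below is an assumption about what such pooling generally looks like, not gopjrt's implementation: `sizeClass`, `bufferPools`, `Get` and `Put` are hypothetical names, and plain `[]byte` buffers stand in for arenas. Requests are rounded up to the next power of two so that buffers of similar size share a pool.

```go
package main

import (
	"fmt"
	"math/bits"
	"sync"
)

// sizeClass returns k such that 2^k is the smallest power of two >= n.
func sizeClass(n int) int {
	if n <= 1 {
		return 0
	}
	return bits.Len(uint(n - 1))
}

// bufferPools keeps one sync.Pool per power-of-two capacity (hypothetical type).
type bufferPools struct {
	classes [32]sync.Pool
}

// Get returns a buffer of length n (n >= 0) whose capacity is the next power of two.
func (p *bufferPools) Get(n int) []byte {
	k := sizeClass(n)
	if k >= len(p.classes) {
		return make([]byte, n) // too large to pool
	}
	if v := p.classes[k].Get(); v != nil {
		return v.([]byte)[:n]
	}
	return make([]byte, n, 1<<k)
}

// Put returns a buffer to the pool that matches its capacity.
// Buffers whose capacity is not a power of two are simply dropped.
func (p *bufferPools) Put(buf []byte) {
	c := cap(buf)
	if c == 0 || c&(c-1) != 0 {
		return
	}
	k := bits.TrailingZeros(uint(c))
	if k >= len(p.classes) {
		return
	}
	p.classes[k].Put(buf[:0])
}

func main() {
	var pools bufferPools
	buf := pools.Get(5)
	fmt.Println(len(buf), cap(buf)) // prints "5 8"
	pools.Put(buf)
}
```

Rounding to powers of two wastes at most half of each buffer, and in exchange keeps the number of distinct pools small.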
v0.8.5 - Fixed `xlabuilder` for newer C compilers.
- GitHub workflows:
  - Renamed to more meaningful names.
  - Added `darwin_build_cpu_pjrt.yaml`: workflow to automatically build the Darwin CPU PJRT plugin.
  - Removed unnecessary `apt install` in linux tests.
- Package `xlabuilder`:
  - Added `-std=gnu11` to `CFLAGS` to avoid incompatibility introduced in `c23`.
  - Temporary fix, as `xlabuilder` is deprecated and will be removed in the future.
v0.8.4 - 2025/10/23 - Adding Darwin (Mac) support for CPU PJRT plugin
- Added "${HOME}/Library/Application Support/GoMLX/PJRT" as a default search path for PJRT plugins on macOS.
- Fixed cuda.go to be linux-only for now, with a safe default for other platforms.
- Removed the requirement for static linking of CUDA PJRT plugin for Darwin.
- `gopjrt_installer`: now also supports darwin/arm64.
  - Use `-tags=all` when building to include all installers (linux/darwin) if cross-platform installing.
v0.8.3 - 2025/10/02 New `gopjrt_installer`; Marked `xlabuilder` as deprecated.
- New `github.com/gomlx/gopjrt/cmd/gopjrt_installer` command-line tool to install Gopjrt plugins.
  - Doesn't do sudo by itself.
  - Doesn't depend on Python for CUDA plugins (automatic dependency traversing; sha256 checking).
  - Interactive, self-explained mode.
  - Optional version specification (both for Gopjrt and for CUDA PJRT).
  - All versions in one tool.
  - Faster
  - Doesn't download unnecessary jax files.
- Updated README.md, with links to `gopjrt_installer` for installation.
  - Moved `xlabuilder` documentation under `xlabuilder/README.md`, and marked it as deprecated.
  - Changed Mandelbrot example to use `github.com/gomlx/stablehlo` instead.
v0.8.2 - 2025/09/29 Updated CUDA drivers
- Updated PJRT's `pjrt_c_api.h` and protos from github.com/openxla/xla, and regenerated wrapper code.
- Added `cmd/install_cuda13.sh` script to install "jax[cuda13]" drivers.
- Updated README.md to start advertising the deprecation of `xlabuilder`.
- Moved `pjrt.SuppressAbseilLoggingHack` implementation to the "per-platform" files `pjrt/dynamiclib_posix.go` and
  `pjrt/dynamiclib_darwin.go`: the first using `syscall.Dup3()` and the second using `syscall.Dup2()`.
v0.8.1 - 2025/09/21 Changes in support of github.com/openxla/stablehlo
- Moved `stablehlo` to its own repository.
- Updated dependencies.
- Package `dtypes`:
  - Added `DType.Bits` field indicating the number of bits in the data type, supporting sub-byte types including
    quantized types.
  - Added `DType.IsPromotableTo(target)` to check if types are promotable to the target type; it follows the StableHLO rule.
  - Cleaned up generators for `dtypes` package.
- Package `xlabuilder`:
  - Deprecated `SelectAndScatterSum`.
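To see why a per-type bit count matters for sub-byte types, consider sizing a buffer. The sketch below is purely illustrative: the `dtype` struct and `storageBytes` function are hypothetical and are not the `dtypes` package API, and the tight packing it assumes may not match the layout PJRT actually uses.

```go
package main

import "fmt"

// dtype is a hypothetical stand-in for a data type that records its width
// in bits; it is not gopjrt's dtypes.DType.
type dtype struct {
	name string
	bits int
}

// storageBytes returns the bytes needed for numElements values, assuming
// (for illustration only) that values are packed tightly with no padding
// between them.
func storageBytes(d dtype, numElements int) int {
	return (numElements*d.bits + 7) / 8 // round up to whole bytes
}

func main() {
	int4 := dtype{name: "int4", bits: 4}
	f32 := dtype{name: "float32", bits: 32}
	fmt.Println(storageBytes(int4, 3)) // prints 2
	fmt.Println(storageBytes(f32, 3))  // prints 12
}
```

A size expressed only in whole bytes could not distinguish a 4-bit type from an 8-bit one, which is the gap a bit count fills.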
v0.8.0 - Convolution API Changes
- API change to convolutions: `ConvGeneralDilated()` renamed to `ConvGeneral()`. The parameter names were
  standardized, but the parameters themselves remain the same.
  - The `ConvolveAxesConfig` structure had its field names slightly changed: "Channel" was replaced
    by "Feature".
- Updated XLA protos.
v0.7.4 Added 0-dimension buffers/tensors support
- Added 0-dimension buffers/tensors support
- Refreshed XLA