Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Releases: ARM-software/ComputeLibrary

v52.6.0

31 Oct 16:12

Choose a tag to compare

v52.6.0 Public Minor Release

Feat

  • Enable F32 output in Quantized CpuGemmConv2d

Fix

  • Invalidate certain Cpu operations if tensor sizes are large
  • Missing output type validation in CpuGemmDirectConv2d
  • Handle padding updates after configure() in CpuActivation

Refactor

  • Flatten nested zip usage in validation/NEON.
  • Flatten nested combine and zip usage in validation/CL directory.
  • Flatten nested combine usage in validation/NEON directory.

Perf

  • Do only one iteration of refinement for FP16 inv

Documentation (API, build guide, contribution guide, errata, etc.) available here:
https://artificial-intelligence.sites.arm.com/computelibrary/v52.6.0/index.xhtml

v52.5.0

13 Oct 21:10

Choose a tag to compare

v52.5.0 Public Minor Release

Feat

  • Add profiling tracepoints to CPU and GPU platforms
  • Add Perfetto profiler as default backend
  • Further modernization in CMake build
  • Add CMakePresets.json

Fix

  • Handle padding updates after configure() in CpuActivation
  • Broken URLs in rendered non-released README.md
  • Linker errors on macOS when building with CMake

Perf

  • Add FP16 GEMM MMUL Reshaped Only Rhs Kernel

Documentation (API, build guide, contribution guide, errata, etc.) available here:
https://artificial-intelligence.sites.arm.com/computelibrary/v52.5.0/index.xhtml

v52.4.0

27 Aug 07:44

Choose a tag to compare

v52.4.0 Public Minor Release

Notice

  • The generation of pre-built binaries for macOS and Windows is currently under review and may be temporarily unavailable following this release.

Feat

  • Updates to operator CpuGEMMLowp for static quantization, and associated tests.

Fix

  • Potential null pointer access in CpuFullyConnected validate method.

Perf

  • Remove switch statements in activation kernels.

Documentation (API, build guide, contribution guide, errata, etc.) available here:
https://artificial-intelligence.sites.arm.com/computelibrary/v52.4.0/index.xhtml

v52.3.0

04 Jul 14:02

Choose a tag to compare

v52.3.0 Public Minor Release

Feat

  • Support QSYMM8_PER_CHANNEL in NEQuantizationLayer.
  • Add stateless wrapper for CpuFullyConnected.

Fix

  • Support mixed-type quantized matmul when updating quantization after configure.
  • Prevent overread when computing row sums in GEMM.
  • Resolve out-of-bounds access in Dimensions::collapse().

Perf

  • Remove switch in SVE activation.
  • Remove switch in SVE2 activation.

Documentation (API, build guide, contribution guide, errata, etc.) available here:
https://artificial-intelligence.sites.arm.com/computelibrary/v52.3.0/index.xhtml

v52.2.0

13 Jun 09:01

Choose a tag to compare

v52.2.0 Public Minor Release

Feat

  • Enable non-transposed BF16 reorders.

Fix

  • Reorder test failures on multi-isa builds.
  • Over-eager read ahead of operands in a64_hgemm_8x24.

Documentation (API, build guide, contribution guide, errata, etc.) available here:
https://artificial-intelligence.sites.arm.com/computelibrary/v52.2.0/index.xhtml

v52.1.0

02 Jun 09:04

Choose a tag to compare

v52.1.0 Public Minor Release

Feat

  • Restrict GEMM stateless execution to fixed-format kernels only
  • Add wrapper class to expose cpu::CpuPool2d functionality
  • Enable non-transposed F32 reorders

Documentation (API, build guide, contribution guide, errata, etc.) available here:
https://artificial-intelligence.sites.arm.com/computelibrary/v52.1.0/index.xhtml

v52.0.1

15 May 09:10

Choose a tag to compare

v52.0.1 Public Patch Release

Fix

  • Fill the padding area with zeros in CpuIm2ColKernel
  • Public header files pass -Wundef check
  • Limit thread split to the window size for run_parallel_pretranspose_B_array

Documentation (API, build guide, contribution guide, errata, etc.) available here:
https://artificial-intelligence.sites.arm.com/computelibrary/v52.0.1/index.xhtml

v52.0.0

01 May 15:32

Choose a tag to compare

v52.0.0 Public Major Release

Fix

v25.04

17 Apr 13:01

Choose a tag to compare

v25.04 Public Major Release

Feat

  • Add Neon(TM) and SVE hybrid FP16 matmul kernels using FP32 accumulation.

Fix

v25.03.1

04 Apr 14:05

Choose a tag to compare

v25.03.1 Public Major Release

Feat

  • Add experimental QNX(R) support.
  • Add matmul fp16->fp32 kernels to enable fp16 PyTorch attention through ACL.

Fix

  • Replace .word with .inst when encoding instructions.
  • Neon(TM) detection for Bare Metal.

Refactor