ENH: Add CPU feature detection for SVE2 #21638

EwoutH · 2022-05-30T18:40:22Z

Proposed new feature or change:

Add CPU feature detection for SVE2. On wide CPU cores the Scalable Vector Extension has the potential to increase performance manyfold compared to NEON.

SVE2 is supported on most Armv9 cores, including the Arm Cortex-A510, Cortex-A710, Cortex-X2 and Neoverse N2 CPU designs. This means it is (or will be) found in a huge number of devices.
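To illustrate what "feature detection" means here, below is a minimal sketch in Python, assuming a Linux AArch64 system where the kernel lists `sve` and `sve2` on the `Features` line of `/proc/cpuinfo`. This is not NumPy's detection code (that lives in C); the helper names `parse_arm_features`, `has_sve2` and `read_cpuinfo` are hypothetical and only for illustration.

```python
def parse_arm_features(cpuinfo_text):
    """Return the feature flags from the first 'Features' line as a set.

    Assumes the Linux arm64 /proc/cpuinfo layout, e.g.
    'Features : fp asimd sve sve2'. Returns an empty set if no such
    line exists (for example on x86, where the line is called 'flags').
    """
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "Features":
            return set(value.split())
    return set()


def has_sve2(cpuinfo_text):
    """True if the exact flag 'sve2' is listed.

    Set membership avoids false positives from longer flags such as
    'sveaes' or 'svebitperm'.
    """
    return "sve2" in parse_arm_features(cpuinfo_text)


def read_cpuinfo(path="/proc/cpuinfo"):
    """Read the cpuinfo text, or return '' where the file is unavailable."""
    try:
        with open(path, encoding="utf-8") as f:
            return f.read()
    except OSError:
        return ""


if __name__ == "__main__":
    print("SVE2 reported by kernel:", has_sve2(read_cpuinfo()))
```

A real implementation would more likely query the hardware capability bits from the auxiliary vector instead of parsing text, but the check is the same: the feature is used only when the OS reports it.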

Arm documentation: https://developer.arm.com/Architectures/SVE

Introducing SVE2

This section introduces the Scalable Vector Extension version two (SVE2) of the Arm AArch64 architecture.

Following the development of the Neon architecture extension, which has a fixed 128-bit vector length for the instruction set, Arm designed the Scalable Vector Extension (SVE). SVE is a new Single Instruction Multiple Data (SIMD) instruction set that is used as an extension to AArch64, to allow for flexible vector length implementations. SVE improves the suitability of the architecture for High Performance Computing (HPC) applications, which require very large quantities of data processing.

SVE2 is a superset of SVE and Neon. SVE2 allows for more function domains in data-level parallelism. SVE2 inherits the concept, vector registers, and operation principles of SVE. SVE and SVE2 define 32 scalable vector registers. Silicon partners can choose a suitable vector length design implementation for hardware that varies between 128 bits and 2048 bits, at 128-bit increments. The advantage of SVE and SVE2 is that only one vector instruction set uses the scalable variables.
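As a small worked example of the vector-length range quoted above, this sketch lists the lengths permitted by the "128 bits to 2048 bits, at 128-bit increments" rule and the number of elements that fit in one register for a given element width. The function names are hypothetical and only for illustration.

```python
def valid_sve_vector_lengths():
    """All vector lengths in bits: 128 to 2048 in 128-bit steps."""
    return list(range(128, 2048 + 1, 128))


def lanes(vector_bits, element_bits):
    """Number of elements of element_bits width in one vector register."""
    if vector_bits not in valid_sve_vector_lengths():
        raise ValueError(f"{vector_bits} is not a valid SVE vector length")
    if element_bits not in (8, 16, 32, 64):
        raise ValueError(f"unsupported element width: {element_bits}")
    return vector_bits // element_bits


if __name__ == "__main__":
    for bits in (128, 256, 512, 2048):
        print(bits, "bits:", lanes(bits, 32), "float32 lanes,",
              lanes(bits, 64), "float64 lanes")
```

At 128 bits the lane count matches Neon (four float32 values); the same code on a 512-bit implementation processes sixteen float32 values per register.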

This enhancement might be similar to #20821 and #20552.

EwoutH · 2022-05-31T10:36:46Z

Some other projects that have implemented SVE2 support, which might or might not be useful resources:

rgommers · 2022-06-12T14:34:14Z

Sounds like a good idea to me.

EwoutH · 2022-06-29T07:25:17Z

Arm’s client CPU cores targeted at 2023 devices have been announced: the Cortex-X3, Cortex-A715 and a refreshed Cortex-A510. All boast faster SVE2 implementations (especially at the decoding stage). The maximum CPU cluster size has been expanded from 8 to 12 cores, making it more likely that these cores will also end up in laptops.

kawakami-k · 2022-09-07T00:41:58Z

Hi, @EwoutH
I am working on an implementation for SVE. Here is the source code, and I have confirmed that it works on Fujitsu A64FX (Armv8.2-A + SVE, 512 bits).

I am planning to submit a pull request in the near future.
Are you working on an SVE/SVE2 implementation?

kawakami-k · 2022-09-15T15:53:35Z

I opened the PR: #22265

EwoutH · 2022-09-16T11:33:52Z

Awesome, thanks a lot! This is also great timing with the announcement of Neoverse V2 and E2 CPU cores!

Do you by any chance have any performance benchmarks? (Maybe see the benchmark docs.)

EwoutH · 2024-05-24T11:53:50Z

The Apple M4 has been talked about a lot in the last week. Apparently it's a full-fledged Armv9.4 CPU with SME2 support, which would imply SME and SVE2 support. Apparently the SME2 support will also replace Apple's own AMX.

See

johnnynunez · 2025-06-12T10:17:02Z

How is this going?

rgommers added the component: SIMD (Issues in SIMD (fast instruction sets) code or machinery) and 01 - Enhancement labels on Jun 12, 2022

EwoutH mentioned this issue on Aug 30, 2022 in BUG, SIMD: Fix detecting NEON/ASIMD on aarch64 #21749 (merged)
