-
-
Notifications
You must be signed in to change notification settings - Fork 11k
ENH: Add CPU feature detection for SVE2 #21638
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Some other projects that have implemented SVE2 support, which might or might not be useful resources: |
Sounds like a good idea to me. |
Arm’s client CPU cores targeted for 2023 devices have been announced, the Cortex-X3, Cortex-A715 and refreshed Cortex-A510, and all boast faster SVE2 implementations (especially at the decoding stage). The maximum CPU cluster size has been expanded from 8 to 12 cores, making it more likely these cores will also end up in laptops. |
Hi, @EwoutH I am planning to submit a pull request in the near future. |
I did the PR. #22265 |
Awesome, thanks a lot! This is also great timing with the announcement of Neoverse V2 and E2 CPU cores! Do you by any chance have any performance benchmarks? (see maybe benchmark docs) |
The Apple M4 is talked a lot about in the last week, apparently it's a full-fledged ARMv9.4 CPU with SME2 support, which would imply SME and SVE2 support. And apparently the SME2 support will replace Apple's own AMX. See |
how is it going this? |
Proposed new feature or change:
Add CPU feature detection for SVE2. On wide CPU cores the Scalable Vector Extension has the potential to increase performance manyfold compared to NEON.
SVE2 is supported on most Armv9 cores, including the Arm Cortex-A510, Cortex-A710, Cortex-X2 and Neoverse N2 CPU designs. This means it's (to be) found in a huge amount of devices.
Arm documentation: https://developer.arm.com/Architectures/SVE
This enhancement might be similar to #20821 and #20552.
The text was updated successfully, but these errors were encountered: