`<atomic>`: Fix ARM64EC and CHPE codegen #4222

StephanTLavavej · 2023-12-01T00:54:25Z

This mirrors @mcfi's MSVC-PR-511288 as of Iteration 10, fixing VSO-1918426 which tracks the following issues reported internally by Geoffrey Antos:

std::atomic<T>::load() is still generating the original/slow ldr+barrier instructions and not the lda[p]r instructions intended by MSVC-PR-449792 / <atomic>: Improve ARM64 performance #3399.
Neither overload of std::atomic<T>::load() appears to guarantee sequential consistency when requested for ARM64 CHPE builds.
std::atomic_thread_fence() does not appear to guarantee the requested ordering for CHPE builds (other than for std::memory_order_seq_cst).
(Performance) On CHPE builds, std::atomic<T>::exchange(), std::atomic<T>::compare_exchange_*() and std::atomic<T>::fetch_*() appear to force sequentially consistent behavior.

MSVC-PR-511288 Iteration 10.

593a9c0

StephanTLavavej added bug Something isn't working ARM64 Related to the ARM64 architecture labels Dec 1, 2023

StephanTLavavej requested a review from a team as a code owner December 1, 2023 00:54

CaseyCarter approved these changes Dec 1, 2023

View reviewed changes

StephanTLavavej merged commit 8a8dda8 into microsoft:main Dec 1, 2023

StephanTLavavej deleted the dev/beniu/atomicfixes branch December 1, 2023 21:32

StephanTLavavej mentioned this pull request Dec 1, 2023

<atomic>: ADL-proof implementation of atomic and atomic_ref #4221

Merged

Provide feedback