[X86] Manage atomic load of fp -> int promotion in DAG #120386

jofrn · 2024-12-18T08:41:42Z

When lowering atomic <1 x T> vector types with floats, selection can fail since
this pattern is unsupported. To support this, floats can be casted to
an integer type of the same size.

Stack:

⚠️ Part of a stack created by spr. Do not merge manually using the UI - doing so may have unexpected results.

llvmbot · 2024-12-18T08:42:17Z

@llvm/pr-subscribers-backend-x86

Author: None (jofrn)

Changes

When lowering atomic <1 x T> vector types with floats, selection can fail since
this pattern is unsupported. To support this, floats can be casted to
an integer type of the same size.

Stack:

#120387
#120386 ⬅
#120385
#120384

⚠️ Part of a stack created by spr. Do not merge manually using the UI - doing so may have unexpected results.

Full diff: https://github.com/llvm/llvm-project/pull/120386.diff

2 Files Affected:

(modified) llvm/lib/Target/X86/X86ISelLowering.cpp (+4)
(modified) llvm/test/CodeGen/X86/atomic-load-store.ll (+74-1)

diff --git a/llvm/lib/Target/X86/X86ISelLowering.cpp b/llvm/lib/Target/X86/X86ISelLowering.cpp
index 2571873dba8483..8006d32d077a65 100644
--- a/llvm/lib/Target/X86/X86ISelLowering.cpp
+++ b/llvm/lib/Target/X86/X86ISelLowering.cpp
@@ -2595,6 +2595,10 @@ X86TargetLowering::X86TargetLowering(const X86TargetMachine &TM,
         setOperationAction(Op, MVT::f32, Promote);
   }
 
+  setOperationPromotedToType(ISD::ATOMIC_LOAD, MVT::f16, MVT::i16);
+  setOperationPromotedToType(ISD::ATOMIC_LOAD, MVT::f32, MVT::i32);
+  setOperationPromotedToType(ISD::ATOMIC_LOAD, MVT::f64, MVT::i64);
+
   // We have target-specific dag combine patterns for the following nodes:
   setTargetDAGCombine({ISD::VECTOR_SHUFFLE,
                        ISD::SCALAR_TO_VECTOR,
diff --git a/llvm/test/CodeGen/X86/atomic-load-store.ll b/llvm/test/CodeGen/X86/atomic-load-store.ll
index 9cac8167542d8b..2bde0d2ffd06ad 100644
--- a/llvm/test/CodeGen/X86/atomic-load-store.ll
+++ b/llvm/test/CodeGen/X86/atomic-load-store.ll
@@ -1,12 +1,17 @@
 ; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
 ; RUN: llc < %s -mtriple=x86_64-apple-macosx10.7.0 -verify-machineinstrs | FileCheck %s
-; RUN: llc < %s -mtriple=x86_64-apple-macosx10.7.0 -verify-machineinstrs -O0 | FileCheck %s
+; RUN: llc < %s -mtriple=x86_64-apple-macosx10.7.0 -verify-machineinstrs -O0 | FileCheck %s --check-prefix=CHECK0
 
 define void @test1(ptr %ptr, i32 %val1) {
 ; CHECK-LABEL: test1:
 ; CHECK:       ## %bb.0:
 ; CHECK-NEXT:    xchgl %esi, (%rdi)
 ; CHECK-NEXT:    retq
+;
+; CHECK0-LABEL: test1:
+; CHECK0:       ## %bb.0:
+; CHECK0-NEXT:    xchgl %esi, (%rdi)
+; CHECK0-NEXT:    retq
   store atomic i32 %val1, ptr %ptr seq_cst, align 4
   ret void
 }
@@ -16,6 +21,11 @@ define void @test2(ptr %ptr, i32 %val1) {
 ; CHECK:       ## %bb.0:
 ; CHECK-NEXT:    movl %esi, (%rdi)
 ; CHECK-NEXT:    retq
+;
+; CHECK0-LABEL: test2:
+; CHECK0:       ## %bb.0:
+; CHECK0-NEXT:    movl %esi, (%rdi)
+; CHECK0-NEXT:    retq
   store atomic i32 %val1, ptr %ptr release, align 4
   ret void
 }
@@ -25,6 +35,11 @@ define i32 @test3(ptr %ptr) {
 ; CHECK:       ## %bb.0:
 ; CHECK-NEXT:    movl (%rdi), %eax
 ; CHECK-NEXT:    retq
+;
+; CHECK0-LABEL: test3:
+; CHECK0:       ## %bb.0:
+; CHECK0-NEXT:    movl (%rdi), %eax
+; CHECK0-NEXT:    retq
   %val = load atomic i32, ptr %ptr seq_cst, align 4
   ret i32 %val
 }
@@ -34,6 +49,64 @@ define <1 x i32> @atomic_vec1_i32(ptr %x) {
 ; CHECK:       ## %bb.0:
 ; CHECK-NEXT:    movl (%rdi), %eax
 ; CHECK-NEXT:    retq
+;
+; CHECK0-LABEL: atomic_vec1_i32:
+; CHECK0:       ## %bb.0:
+; CHECK0-NEXT:    movl (%rdi), %eax
+; CHECK0-NEXT:    retq
   %ret = load atomic <1 x i32>, ptr %x acquire, align 4
   ret <1 x i32> %ret
 }
+
+define <1 x half> @atomic_vec1_half(ptr %x) {
+; CHECK-LABEL: atomic_vec1_half:
+; CHECK:       ## %bb.0:
+; CHECK-NEXT:    movzwl (%rdi), %eax
+; CHECK-NEXT:    pinsrw $0, %eax, %xmm0
+; CHECK-NEXT:    retq
+;
+; CHECK0-LABEL: atomic_vec1_half:
+; CHECK0:       ## %bb.0:
+; CHECK0-NEXT:    movw (%rdi), %cx
+; CHECK0-NEXT:    ## implicit-def: $eax
+; CHECK0-NEXT:    movw %cx, %ax
+; CHECK0-NEXT:    ## implicit-def: $xmm0
+; CHECK0-NEXT:    pinsrw $0, %eax, %xmm0
+; CHECK0-NEXT:    retq
+  %ret = load atomic <1 x half>, ptr %x acquire, align 4
+  ret <1 x half> %ret
+}
+
+define <1 x float> @atomic_vec1_float(ptr %x) {
+; CHECK-LABEL: atomic_vec1_float:
+; CHECK:       ## %bb.0:
+; CHECK-NEXT:    movss {{.*#+}} xmm0 = mem[0],zero,zero,zero
+; CHECK-NEXT:    retq
+;
+; CHECK0-LABEL: atomic_vec1_float:
+; CHECK0:       ## %bb.0:
+; CHECK0-NEXT:    movss {{.*#+}} xmm0 = mem[0],zero,zero,zero
+; CHECK0-NEXT:    retq
+  %ret = load atomic <1 x float>, ptr %x acquire, align 4
+  ret <1 x float> %ret
+}
+
+define <1 x bfloat> @atomic_vec1_bfloat(ptr %x) {
+; CHECK-LABEL: atomic_vec1_bfloat:
+; CHECK:       ## %bb.0:
+; CHECK-NEXT:    movzwl (%rdi), %eax
+; CHECK-NEXT:    pinsrw $0, %eax, %xmm0
+; CHECK-NEXT:    retq
+;
+; CHECK0-LABEL: atomic_vec1_bfloat:
+; CHECK0:       ## %bb.0:
+; CHECK0-NEXT:    movw (%rdi), %cx
+; CHECK0-NEXT:    ## implicit-def: $eax
+; CHECK0-NEXT:    movw %cx, %ax
+; CHECK0-NEXT:    ## implicit-def: $xmm0
+; CHECK0-NEXT:    pinsrw $0, %eax, %xmm0
+; CHECK0-NEXT:    retq
+  %ret = load atomic <1 x bfloat>, ptr %x acquire, align 4
+  ret <1 x bfloat> %ret
+}
+

jyknight · 2024-12-18T23:13:22Z

llvm/lib/Target/X86/X86ISelLowering.cpp

@@ -2595,6 +2595,10 @@ X86TargetLowering::X86TargetLowering(const X86TargetMachine &TM,
        setOperationAction(Op, MVT::f32, Promote);
  }

+  setOperationPromotedToType(ISD::ATOMIC_LOAD, MVT::f16, MVT::i16);


Presumably similar changes to other backends are also required?

Handle bf16 as well?

bf16 is already lowered properly without promotion.

And yes, other backends would either have to promote these here or implement them explicitly.

SelectionDAG making anything legal by default was a terrible mistake but we're not going to fix that here

When lowering atomic <1 x T> vector types with floats, selection can fail since this pattern is unsupported. To support this, floats can be casted to an integer type of the same size. commit-id:f9d761c5

This was referenced Dec 18, 2024

[X86] Add atomic vector tests for unaligned >1 sizes. #120387

Open

[SelectionDAG] Legalize <1 x T> vector types for atomic load #120385

Open

IR/Verifier: Allow vector type in atomic load and store #120384

Open

llvmbot added the backend:X86 label Dec 18, 2024

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from f799ee0 to 141279f Compare December 18, 2024 08:54

jofrn changed the title ~~[SelectionDAG][X86] Add floating point promotion.~~ [X86] Manage atomic load of fp -> int promotion in DAG Dec 18, 2024

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from 141279f to 70bb5b9 Compare December 18, 2024 11:45

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 7263545 to 5a3a12d Compare December 18, 2024 19:11

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch 2 times, most recently from dac7f1e to df5e28c Compare December 18, 2024 20:47

jyknight reviewed Dec 18, 2024

View reviewed changes

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 66eca4b to 4d3fcb3 Compare December 19, 2024 02:29

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch 4 times, most recently from b336c25 to 7ef2576 Compare December 19, 2024 16:01

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 4d0be71 to 601c009 Compare December 19, 2024 16:01

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from 7ef2576 to b4f0562 Compare December 19, 2024 16:24

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 601c009 to 3796cf7 Compare December 19, 2024 16:24

jofrn mentioned this pull request Dec 19, 2024

[SelectionDAG] Widen <2 x T> vector types for atomic load #120598

Open

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 3796cf7 to 5f30edf Compare December 19, 2024 16:42

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from b4f0562 to e0a02b6 Compare December 19, 2024 16:42

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 5f30edf to 99296f3 Compare December 19, 2024 19:15

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch 2 times, most recently from 40392eb to 7ca91cc Compare December 19, 2024 19:36

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 99296f3 to 245acf7 Compare December 19, 2024 19:36

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from 7ca91cc to 0d6882e Compare December 19, 2024 19:43

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch 2 times, most recently from a3e83cb to faa0e03 Compare December 19, 2024 21:28

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from 6ac3c17 to 03a726d Compare January 22, 2025 18:19

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from a1143a0 to 2a1b149 Compare January 22, 2025 18:19

jofrn changed the base branch from users/jofrn/spr/main/5c36cc8c to main February 2, 2025 20:25

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from 03a726d to f096c88 Compare February 2, 2025 20:25

jofrn mentioned this pull request Feb 2, 2025

[SelectionDAG][X86] Remove unused elements from atomic vector. #125432

Open

jofrn changed the base branch from main to users/jofrn/spr/main/5c36cc8c February 2, 2025 20:25

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from f096c88 to edd2af8 Compare March 3, 2025 23:26

arsenm approved these changes Mar 4, 2025

View reviewed changes

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 24d9628 to e9820bf Compare April 25, 2025 20:51

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch 2 times, most recently from 47d8c3a to 02dd787 Compare April 26, 2025 04:07

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 4b9e4d3 to d6cac89 Compare April 26, 2025 07:57

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from 5a5f241 to cb2e5bc Compare May 1, 2025 04:14

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 7f2115c to ab9f3f2 Compare May 1, 2025 04:39

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from cb2e5bc to 14c8155 Compare May 1, 2025 04:39

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from ab9f3f2 to 4a47d3f Compare May 1, 2025 06:45

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch 2 times, most recently from 0824a27 to 6078905 Compare May 5, 2025 17:45

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch 2 times, most recently from 5189e84 to e7805ff Compare May 6, 2025 03:50

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from 6078905 to fdc2107 Compare May 6, 2025 03:50

jofrn changed the base branch from users/jofrn/spr/main/5c36cc8c to main May 6, 2025 06:03

jofrn mentioned this pull request May 6, 2025

[X86] Remove extra MOV after widening atomic load #138635

Open

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from fdc2107 to 5005b94 Compare May 6, 2025 06:03

jofrn changed the base branch from main to users/jofrn/spr/main/5c36cc8c May 6, 2025 06:04

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 2d7f7dc to 1e2a179 Compare May 6, 2025 15:04

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from 5005b94 to c7d4433 Compare May 6, 2025 15:04

[X86] Manage atomic load of fp -> int promotion in DAG

531bc05

When lowering atomic <1 x T> vector types with floats, selection can fail since this pattern is unsupported. To support this, floats can be casted to an integer type of the same size. commit-id:f9d761c5

jofrn force-pushed the users/jofrn/spr/main/5c36cc8c branch from 1e2a179 to 08e39f2 Compare May 7, 2025 12:53

jofrn force-pushed the users/jofrn/spr/main/f9d761c5 branch from c7d4433 to 531bc05 Compare May 7, 2025 12:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[X86] Manage atomic load of fp -> int promotion in DAG #120386

[X86] Manage atomic load of fp -> int promotion in DAG #120386

jofrn commented Dec 18, 2024 •

edited

Loading

llvmbot commented Dec 18, 2024

jyknight Dec 18, 2024

RKSimon Dec 19, 2024

jofrn Dec 19, 2024

arsenm Dec 20, 2024

[X86] Manage atomic load of fp -> int promotion in DAG #120386

Are you sure you want to change the base?

[X86] Manage atomic load of fp -> int promotion in DAG #120386

Conversation

jofrn commented Dec 18, 2024 • edited Loading

llvmbot commented Dec 18, 2024

jyknight Dec 18, 2024

Choose a reason for hiding this comment

RKSimon Dec 19, 2024

Choose a reason for hiding this comment

jofrn Dec 19, 2024

Choose a reason for hiding this comment

arsenm Dec 20, 2024

Choose a reason for hiding this comment

jofrn commented Dec 18, 2024 •

edited

Loading