[SelectionDAG] Fix incorrect fold condition in foldSetCCWithFunnelShift. #137637

Ruhung · 2025-04-28T14:16:22Z

Proposed by 2ed1598:

fshl X, (or X, Y), C ==/!= 0 --> or (srl Y, BW-C), X ==/!= 0

This transformation is valid when (C % BitWidth) != 0, as verified by Alive2.

Fixes #136746
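
As an independent sanity check of the condition above (not the Alive2 proof itself), the fold can be tested exhaustively on 8-bit values. This is only a minimal sketch: `fshl8` and `foldHoldsForAllNonZeroAmounts` are hypothetical names that do not appear in LLVM, and the model deliberately covers only 0 < C < 8.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical reference model of fshl on 8-bit values.
// Only amounts with 0 < c < 8 are modelled, so neither shift is out of range.
static uint8_t fshl8(uint8_t hi, uint8_t lo, unsigned c) {
  assert(c > 0 && c < 8 && "model only covers non-zero, in-range amounts");
  return static_cast<uint8_t>((hi << c) | (lo >> (8 - c)));
}

// Checks, for every x, y and every 0 < c < 8, that
//   fshl(x, x | y, c) == 0   <=>   ((y >> (8 - c)) | x) == 0
static bool foldHoldsForAllNonZeroAmounts() {
  for (unsigned c = 1; c < 8; ++c) {
    for (unsigned x = 0; x < 256; ++x) {
      for (unsigned y = 0; y < 256; ++y) {
        bool before = fshl8(static_cast<uint8_t>(x),
                            static_cast<uint8_t>(x | y), c) == 0;
        bool after = (((y >> (8 - c)) | x) & 0xFFu) == 0;
        if (before != after)
          return false;
      }
    }
  }
  return true;
}
```

C == 0 is left out on purpose: the rewritten side would then shift Y by the full bit width, which is exactly the case the new guard rejects.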

llvmbot · 2025-04-29T07:40:48Z

@llvm/pr-subscribers-backend-aarch64

@llvm/pr-subscribers-llvm-selectiondag

Author: Rux124 (Ruhung)

Changes

Proposed by 2ed1598:

fshl X, (or X, Y), C ==/!= 0 --> or (srl Y, BW-C), X ==/!= 0

This transformation is valid when C != 0, as verified by Alive2.

Fixes #136746

Full diff: https://github.com/llvm/llvm-project/pull/137637.diff

2 Files Affected:

(modified) llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp (+2-1)
(modified) llvm/test/CodeGen/AArch64/setcc-fsh.ll (+12)

diff --git a/llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp b/llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
index 6930b54ddb14a..1e9fb1aa2ea61 100644
--- a/llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
@@ -4462,7 +4462,8 @@ static SDValue foldSetCCWithFunnelShift(EVT VT, SDValue N0, SDValue N1,
 
   unsigned BitWidth = N0.getScalarValueSizeInBits();
   auto *ShAmtC = isConstOrConstSplat(N0.getOperand(2));
-  if (!ShAmtC || ShAmtC->getAPIntValue().uge(BitWidth))
+  APInt AmtVal = ShAmtC->getAPIntValue();
+  if (!ShAmtC || AmtVal.uge(BitWidth) || AmtVal.isZero())
     return SDValue();
 
   // Canonicalize fshr as fshl to reduce pattern-matching.
diff --git a/llvm/test/CodeGen/AArch64/setcc-fsh.ll b/llvm/test/CodeGen/AArch64/setcc-fsh.ll
index 08bfe282703ff..f0cf775f5c2fa 100644
--- a/llvm/test/CodeGen/AArch64/setcc-fsh.ll
+++ b/llvm/test/CodeGen/AArch64/setcc-fsh.ll
@@ -248,3 +248,15 @@ define i1 @fshl_or_ne_2(i32 %x, i32 %y) {
   %r = icmp ne i32 %f, 2
   ret i1 %r
 }
+
+define i1 @fshr_0_or_eq_0(i16 %x, i16 %y) {
+; CHECK-LABEL: fshr_0_or_eq_0:
+; CHECK:       // %bb.0:
+; CHECK-NEXT:    tst w0, #0xffff
+; CHECK-NEXT:    cset w0, eq
+; CHECK-NEXT:    ret
+  %or = or i16 %x, %y
+  %f = call i16 @llvm.fshr.i16(i16 %or, i16 %x, i16 0)
+  %r = icmp eq i16 %f, 0
+  ret i1 %r
+}

davemgreen · 2025-04-29T07:57:37Z

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp

+  APInt AmtVal = ShAmtC->getAPIntValue();
+  if (!ShAmtC || AmtVal.uge(BitWidth) || AmtVal.isZero())


You can't access ShAmtC (through ShAmtC->getAPIntValue()) before testing that it is valid (through !ShAmtC).

jayfoad · 2025-04-29

A more precise way to implement this check would be to first reduce the shift amount modulo the bitwidth:

unsigned ShAmt = ShAmtC->getAPIntValue().urem(BitWidth);

and then bail out if it is zero.

Ruhung · 2025-04-30

You can't access ShAmtC (through ShAmtC->getAPIntValue()) before testing that it is valid (through !ShAmtC).

@davemgreen Done. Thanks.

Ruhung · 2025-04-30

A more precise way to implement this check would be to first reduce the shift amount modulo the bitwidth:

unsigned ShAmt = ShAmtC->getAPIntValue().urem(BitWidth);

and then bail out if it is zero.

The shift amount needs to be less than BitWidth, so a modulo operation may not be necessary?

davemgreen · 2025-04-30

A funnel shift will rotate around (so the amount can be > bitwidth); a plain shift will produce poison if the shift amount is >= bitwidth. It looks like it might only be fshr that is incorrect, not fshl? We should have canonicalized the constant shift amount, so it is probably OK either way so long as we fix the bug in non-canonical forms.

jayfoad · 2025-05-02

It looks like it might only be fshr that is incorrect

What is "incorrect" here? (I was not pointing out a bug, just a way to make the implementation a bit simpler and handle more cases.)

davemgreen · 2025-05-02

I didn't mean your suggestion was incorrect, just that it was only the fshr by 0 case that was incorrect in the original code (as it turns into a shift by 32-0, which turns into poison). The other cases did not look worth supporting to me, as handling them is more likely to introduce issues than letting the funnel shift by a constant canonicalize before optimizing it again. But if you want to go that route, it sounds OK so long as the other code works with it.

Ruhung · 2025-05-03

@jayfoad Sorry for the initial misunderstanding. It's OK to take the modulo first and then check whether it's zero. I've fixed it. Thanks!
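
The funnel-shift semantics discussed in this thread can be sketched with a small reference model. This is an illustration under stated assumptions, not LLVM code: `fshl16`, `fshr16` and `fshrMatchesFshlRewrite` are hypothetical helpers on 16-bit values (the width used by the new test), and the amount is reduced modulo the bit width as the thread describes.

```cpp
#include <cassert>
#include <cstdint>

constexpr unsigned BW = 16;

// fshl: concatenate a:b, shift left by (c % BW), keep the high BW bits.
static uint16_t fshl16(uint16_t a, uint16_t b, unsigned c) {
  uint32_t concat = (static_cast<uint32_t>(a) << BW) | b;
  return static_cast<uint16_t>((concat << (c % BW)) >> BW);
}

// fshr: concatenate a:b, shift right by (c % BW), keep the low BW bits.
static uint16_t fshr16(uint16_t a, uint16_t b, unsigned c) {
  uint32_t concat = (static_cast<uint32_t>(a) << BW) | b;
  return static_cast<uint16_t>(concat >> (c % BW));
}

// Rewriting fshr by c as fshl by (BW - c) is only sound when c % BW != 0:
// fshr by 0 yields b, while fshl by BW wraps to 0 and yields a.
static bool fshrMatchesFshlRewrite(uint16_t a, uint16_t b, unsigned c) {
  return fshr16(a, b, c) == fshl16(a, b, BW - (c % BW));
}
```

In this model, `fshr16(x | y, x, 0)` is just `x`, which matches the new `fshr_0_or_eq_0` test comparing only the low 16 bits of `w0` against zero.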

davemgreen

LGTM if there are no other comments. Thanks

RKSimon · 2025-05-02T14:15:52Z

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp

+    return SDValue();
+
+  APInt AmtVal = ShAmtC->getAPIntValue();
+  if (AmtVal.uge(BitWidth) || AmtVal.isZero())
    return SDValue();


Why not:

uint64_t AmtVal = ShAmtC->getAPIntValue().urem(BitWidth);
if (AmtVal == 0)
  return SDValue();

davemgreen · 2025-05-02

From my point of view, it is just because it should not come up outside of non-canonical code, IIUC, so fixing the bug is the more important issue; trying to handle other cases could lead to further bugs for little benefit. But it sounds OK so long as the rest of the code handles it correctly.

Ruhung · 2025-05-03

@RKSimon You're right. It's OK to take the modulo first. I've fixed it. Thanks!
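
To make the difference between the two guards concrete, here is a hedged sketch on plain integers. The helper names `rejectsRangeOrZero` and `reducedAmountOrZero` are hypothetical stand-ins for the two versions of the check discussed in this review; the actual code operates on APInt values inside foldSetCCWithFunnelShift.

```cpp
#include <cassert>
#include <cstdint>

// Earlier revision of the guard: bail out on out-of-range or zero amounts.
static bool rejectsRangeOrZero(uint64_t amt, unsigned bitWidth) {
  return amt >= bitWidth || amt == 0;
}

// Guard suggested in review: reduce modulo the bit width first.
// Returns the reduced amount, or 0 to signal that the fold must be skipped.
static unsigned reducedAmountOrZero(uint64_t amt, unsigned bitWidth) {
  return static_cast<unsigned>(amt % bitWidth);
}
```

Both versions skip the fold for an amount of 0 (and for exact multiples of the bit width). They differ only for amounts such as 17 on a 16-bit type: the earlier guard bails out, while the modulo form proceeds with an effective amount of 1.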

RKSimon

LGTM - cheers

Ruhung marked this pull request as ready for review April 29, 2025 07:40

llvmbot added the backend:AArch64 and llvm:SelectionDAG labels Apr 29, 2025

davemgreen reviewed Apr 29, 2025

Ruhung force-pushed the fix-136746 branch from 4dbe104 to df204b0 Compare April 30, 2025 11:52

davemgreen requested a review from efriedma-quic April 30, 2025 19:08

davemgreen approved these changes Apr 30, 2025

efriedma-quic approved these changes Apr 30, 2025

RKSimon reviewed May 2, 2025

Ruhung added 2 commits May 3, 2025 15:45

Pre-commit tests.

342c7e1

[SelectionDAG] Fix incorrect fold condition in foldSetCCWithFunnelShift.

cd8ae16

Ruhung force-pushed the fix-136746 branch from df204b0 to cd8ae16 Compare May 3, 2025 07:49

RKSimon approved these changes May 4, 2025


