[NVPTX] Implement `isTruncateFree(EVT FromVT, EVT ToVT)` #138605

justinfargnoli · 2025-05-05T22:51:55Z

This PR also makes NFC changes to isTruncateFree(Type *SrcTy, Type *DstTy) so that it models the HW more accurately.

Copilot

Pull Request Overview

This pull request improves the target lowering for NVPTX by updating the truncation-free checks to more accurately model the hardware behavior for both LLVM IR types and EVT types.

Modifies the integer type checks in isTruncateFree for both Type* and EVT overloads.
Updates the bit-size conditions and removes the outdated comment regarding 64-to-32-bit truncation in SASS.
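
The revised width rule can be sketched outside LLVM as a small predicate on bit widths. This is a minimal model, not the patch itself: `truncateIsFreeModel` is a hypothetical name, it takes plain bit widths rather than `Type *` or `EVT`, and it assumes both operands have already passed the scalar-integer check.

```cpp
// Hypothetical stand-in for the width logic in the patch. It works on raw bit
// widths and assumes both operands are scalar integers.
constexpr bool truncateIsFreeModel(unsigned FromBits, unsigned ToBits) {
  // Not a narrowing conversion: the hook must report "not free" here.
  if (FromBits <= ToBits)
    return false;
  // The patch reports a truncation as free when the result width is a
  // multiple of 32 bits.
  return ToBits % 32 == 0;
}

static_assert(truncateIsFreeModel(64, 32), "i64 -> i32 is reported free");
static_assert(truncateIsFreeModel(128, 64), "i128 -> i64 is reported free");
static_assert(!truncateIsFreeModel(32, 16), "i32 -> i16 is not reported free");
static_assert(!truncateIsFreeModel(32, 64), "widening is never free");
static_assert(!truncateIsFreeModel(32, 32), "same width is never free");
```

Under this model, i64 to i32 and i128 to i64 count as free, while i32 to i16 and any non-narrowing pair do not.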

Files not reviewed (1)

llvm/test/CodeGen/NVPTX/i128-array.ll: Language not supported

llvmbot · 2025-05-05T22:52:30Z

@llvm/pr-subscribers-backend-nvptx

Author: Justin Fargnoli (justinfargnoli)

Changes

This PR also makes NFC changes to isTruncateFree(Type *SrcTy, Type *DstTy) so that it models the HW more accurately.

Full diff: https://github.com/llvm/llvm-project/pull/138605.diff

2 Files Affected:

(modified) llvm/lib/Target/NVPTX/NVPTXISelLowering.h (+12-4)
(modified) llvm/test/CodeGen/NVPTX/i128-array.ll (+6-6)

diff --git a/llvm/lib/Target/NVPTX/NVPTXISelLowering.h b/llvm/lib/Target/NVPTX/NVPTXISelLowering.h
index 7a8bf3bf33a94..680ff13d8f936 100644
--- a/llvm/lib/Target/NVPTX/NVPTXISelLowering.h
+++ b/llvm/lib/Target/NVPTX/NVPTXISelLowering.h
@@ -155,11 +155,19 @@ class NVPTXTargetLowering : public TargetLowering {
                              Instruction *I = nullptr) const override;
 
   bool isTruncateFree(Type *SrcTy, Type *DstTy) const override {
-    // Truncating 64-bit to 32-bit is free in SASS.
-    if (!SrcTy->isIntegerTy() || !DstTy->isIntegerTy())
+    if (!(SrcTy->isIntegerTy() && DstTy->isIntegerTy()))
       return false;
-    return SrcTy->getPrimitiveSizeInBits() == 64 &&
-           DstTy->getPrimitiveSizeInBits() == 32;
+    if (SrcTy->getPrimitiveSizeInBits() <= DstTy->getPrimitiveSizeInBits())
+      return false;
+    return DstTy->getPrimitiveSizeInBits() % 32 == 0;
+  }
+
+  bool isTruncateFree(EVT FromVT, EVT ToVT) const override {
+    if (!(FromVT.isScalarInteger() && ToVT.isScalarInteger()))
+      return false;
+    if (FromVT.getSizeInBits() <= ToVT.getSizeInBits())
+      return false;
+    return ToVT.getSizeInBits() % 32 == 0;
   }
 
   EVT getSetCCResultType(const DataLayout &DL, LLVMContext &Ctx,
diff --git a/llvm/test/CodeGen/NVPTX/i128-array.ll b/llvm/test/CodeGen/NVPTX/i128-array.ll
index dd6d48bd5862c..f25d451590bed 100644
--- a/llvm/test/CodeGen/NVPTX/i128-array.ll
+++ b/llvm/test/CodeGen/NVPTX/i128-array.ll
@@ -8,13 +8,13 @@ define [2 x i128] @foo(i64 %a, i32 %b) {
 ; CHECK-NEXT:    .reg .b64 %rd<5>;
 ; CHECK-EMPTY:
 ; CHECK-NEXT:  // %bb.0:
-; CHECK-NEXT:    ld.param.u32 %r1, [foo_param_1];
 ; CHECK-NEXT:    ld.param.u64 %rd1, [foo_param_0];
-; CHECK-NEXT:    shr.s64 %rd2, %rd1, 63;
-; CHECK-NEXT:    cvt.s64.s32 %rd3, %r1;
-; CHECK-NEXT:    shr.s64 %rd4, %rd3, 63;
-; CHECK-NEXT:    st.param.v2.b64 [func_retval0], {%rd1, %rd2};
-; CHECK-NEXT:    st.param.v2.b64 [func_retval0+16], {%rd3, %rd4};
+; CHECK-NEXT:    ld.param.s32 %rd2, [foo_param_1];
+; CHECK-NEXT:    cvt.u32.u64 %r1, %rd2;
+; CHECK-NEXT:    shr.s64 %rd3, %rd1, 63;
+; CHECK-NEXT:    shr.s64 %rd4, %rd2, 63;
+; CHECK-NEXT:    st.param.v2.b64 [func_retval0], {%rd1, %rd3};
+; CHECK-NEXT:    st.param.v2.b64 [func_retval0+16], {%rd2, %rd4};
 ; CHECK-NEXT:    ret;
   %1 = sext i64 %a to i128
   %2 = sext i32 %b to i128

AlexMaclean

Would it be possible to include some tests that demonstrate positive effects on code-gen as a result of this change? The only impact currently seems to be some re-ordering.
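
One way to look for candidate test inputs is to compare the old and new width rules from the diff directly. The sketch below models only the bit-width logic; `oldRule`, `newRule`, and `rulesDisagree` are hypothetical names, and whether any disagreeing pair actually changes the generated PTX is not established here.

```cpp
// Width-only models of the two versions of the Type* overload in the diff.
// All names are hypothetical, and both rules assume integer operands.
constexpr bool oldRule(unsigned SrcBits, unsigned DstBits) {
  // Before the patch: only the exact pair i64 -> i32 was reported free.
  return SrcBits == 64 && DstBits == 32;
}

constexpr bool newRule(unsigned SrcBits, unsigned DstBits) {
  // After the patch: any narrowing whose result is a multiple of 32 bits.
  if (SrcBits <= DstBits)
    return false;
  return DstBits % 32 == 0;
}

// True when the two rules give different answers for a width pair.
constexpr bool rulesDisagree(unsigned SrcBits, unsigned DstBits) {
  return oldRule(SrcBits, DstBits) != newRule(SrcBits, DstBits);
}

static_assert(!rulesDisagree(64, 32), "both rules report i64 -> i32 free");
static_assert(rulesDisagree(128, 64), "only the new rule reports i128 -> i64 free");
static_assert(rulesDisagree(128, 32), "only the new rule reports i128 -> i32 free");
static_assert(!rulesDisagree(32, 16), "neither rule reports i32 -> i16 free");
```

Pairs where `rulesDisagree` holds, such as i128 to i64, would be natural starting points for a test that aims to show a code-gen difference.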

AlexMaclean · 2025-05-05T23:11:46Z

llvm/lib/Target/NVPTX/NVPTXISelLowering.h

-    if (!SrcTy->isIntegerTy() || !DstTy->isIntegerTy())
+    if (!(SrcTy->isIntegerTy() && DstTy->isIntegerTy()))
      return false;
-    return SrcTy->getPrimitiveSizeInBits() == 64 &&
-           DstTy->getPrimitiveSizeInBits() == 32;
+    if (SrcTy->getPrimitiveSizeInBits() <= DstTy->getPrimitiveSizeInBits())
+      return false;


Would it be valid to call isTruncateFree if either of these conditions were not already met?

kalxr · 2025-05-06T00:53:35Z

I believe so. The second condition is explicitly mentioned in llvm-project/llvm/include/llvm/CodeGen/TargetLowering.h (line 3025 in 1c1238d):

/// Targets must return false when FromTy <= ToTy.

Most targets that override this have a check for the first condition as well.

AlexMaclean · 2025-05-06

Interesting. What about the check for isScalarInteger? If the vector element sizes meet the criteria for being free, won't the eventual expansion be free too? Do we ever expect to see non-integer types?

justinfargnoli · 2025-05-06

Yeah, that's a good point. When expressed in PTX, the vectors become registers, so their contiguity is not guaranteed.

Notes: https://godbolt.org/z/dTo9aGEEb
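
The effect of the `isScalarInteger` guard can be illustrated with a toy value-type model. `ToyVT` and `toyIsTruncateFree` are hypothetical stand-ins for `EVT` and the new overload. The model only mirrors the three checks in the diff and says nothing about how vector truncations are actually lowered.

```cpp
// Hypothetical stand-in for llvm::EVT: just enough state for the three checks.
struct ToyVT {
  bool IsInteger;    // integer element type?
  unsigned NumElts;  // 1 for a scalar, more than 1 for a vector
  unsigned EltBits;  // width of one element in bits

  constexpr bool isScalarInteger() const { return IsInteger && NumElts == 1; }
  constexpr unsigned getSizeInBits() const { return NumElts * EltBits; }
};

// Mirrors the EVT overload from the diff on the toy type.
constexpr bool toyIsTruncateFree(ToyVT From, ToyVT To) {
  if (!(From.isScalarInteger() && To.isScalarInteger()))
    return false;  // vectors and non-integer types are rejected up front
  if (From.getSizeInBits() <= To.getSizeInBits())
    return false;  // not a narrowing conversion
  return To.getSizeInBits() % 32 == 0;
}

constexpr ToyVT I64{true, 1, 64}, I32{true, 1, 32};
constexpr ToyVT V2I64{true, 2, 64}, V2I32{true, 2, 32};
constexpr ToyVT F64{false, 1, 64}, F32{false, 1, 32};

static_assert(toyIsTruncateFree(I64, I32), "scalar i64 -> i32 passes");
static_assert(!toyIsTruncateFree(V2I64, V2I32), "vector pair is rejected");
static_assert(!toyIsTruncateFree(F64, F32), "non-integer pair is rejected");
```

Note that the v2i64 to v2i32 pair is rejected by the guard even though its element widths alone would satisfy the width rule, which is the case the question above is about.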

AlexMaclean · 2025-05-05T23:12:33Z

llvm/lib/Target/NVPTX/NVPTXISelLowering.h

+    if (!(FromVT.isScalarInteger() && ToVT.isScalarInteger()))
+      return false;
+    if (FromVT.getSizeInBits() <= ToVT.getSizeInBits())
+      return false;


Same question as above.

kalxr · 2025-05-06T00:52:10Z

llvm/lib/Target/NVPTX/NVPTXISelLowering.h

@@ -155,11 +155,19 @@ class NVPTXTargetLowering : public TargetLowering {
                             Instruction *I = nullptr) const override;

  bool isTruncateFree(Type *SrcTy, Type *DstTy) const override {


It seems Hexagon is the only target that does it this way, but forwarding to the EVT overload seems simpler. From llvm-project/llvm/lib/Target/Hexagon/HexagonISelLowering.cpp (line 2145 in 1c1238d):

return isTruncateFree(EVT::getEVT(Ty1), EVT::getEVT(Ty2));
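
The forwarding pattern suggested here can be sketched with toy types. `ToyType`, `ToyVT`, and `toVT` are hypothetical stand-ins for `llvm::Type`, `llvm::EVT`, and `EVT::getEVT`. Whether `EVT::getEVT` accepts every type that can reach this hook on NVPTX is an assumption this sketch does not check.

```cpp
// Hypothetical stand-ins for llvm::Type and llvm::EVT.
struct ToyType { bool IsIntegerTy; unsigned PrimitiveBits; };
struct ToyVT   { bool IsScalarInteger; unsigned Bits; };

// Stand-in for EVT::getEVT(Ty): maps an IR-level type to a value type.
constexpr ToyVT toVT(ToyType Ty) {
  return ToyVT{Ty.IsIntegerTy, Ty.PrimitiveBits};
}

// The value-type overload holds the only copy of the logic.
constexpr bool isTruncateFree(ToyVT From, ToyVT To) {
  if (!(From.IsScalarInteger && To.IsScalarInteger))
    return false;
  if (From.Bits <= To.Bits)
    return false;
  return To.Bits % 32 == 0;
}

// The IR-type overload forwards to it, as the Hexagon line does.
constexpr bool isTruncateFree(ToyType Src, ToyType Dst) {
  return isTruncateFree(toVT(Src), toVT(Dst));
}

static_assert(isTruncateFree(ToyType{true, 64}, ToyType{true, 32}),
              "forwarded i64 -> i32 is reported free");
static_assert(!isTruncateFree(ToyType{false, 64}, ToyType{true, 32}),
              "non-integer source is rejected through the forwarder");
```

With a single copy of the checks, the two overloads cannot drift apart, which is the simplification the comment points at.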

kalxr · 2025-05-06T00:53:35Z

llvm/lib/Target/NVPTX/NVPTXISelLowering.h

-    if (!SrcTy->isIntegerTy() || !DstTy->isIntegerTy())
+    if (!(SrcTy->isIntegerTy() && DstTy->isIntegerTy()))
      return false;
-    return SrcTy->getPrimitiveSizeInBits() == 64 &&
-           DstTy->getPrimitiveSizeInBits() == 32;
+    if (SrcTy->getPrimitiveSizeInBits() <= DstTy->getPrimitiveSizeInBits())
+      return false;


I believe so. The second condition is explicitly mentioned in llvm-project/llvm/include/llvm/CodeGen/TargetLowering.h (line 3025 in 1c1238d):

/// Targets must return false when FromTy <= ToTy.

Most targets that override this have a check for the first condition as well.

Implement isTruncateFree(EVT FromVT, EVT ToVT)

5e1852a

justinfargnoli requested review from Artem-B, AlexMaclean, kalxr and Copilot May 5, 2025 22:51

llvmbot added the backend:NVPTX label May 5, 2025

Copilot AI reviewed May 5, 2025

AlexMaclean reviewed May 5, 2025

kalxr reviewed May 6, 2025
