[ScheduleDAG] Allow disabling the SchedModel / Itineraries during Scheduling #138057

jrbyrnes · 2025-05-01T00:41:23Z

This adds the `disable-schedmodel-in-sched-mi` flag, which disables the SchedModel / Itineraries during scheduling. As a result, scheduling decisions use no latency or hardware resource information.

The existing `schedmodel` flag disables the SchedModel for all passes. The new flag disables it only for scheduling and preserves the behavior of other passes (e.g. MachineLICM). It is conceptually similar to other flags such as `enable-aa-sched-mi`.
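
A minimal, self-contained sketch of the per-pass idea, not LLVM's actual classes: `ToySchedModel` and its members are hypothetical stand-ins for `TargetSchedModel`. It illustrates how a disable bit passed at initialization can turn the model off for the scheduler's instance while another pass, which initializes its own instance, still sees it.

```cpp
// Toy stand-in for TargetSchedModel; every name here is hypothetical.
struct ToySchedModel {
  bool TargetHasSchedModel;  // what the subtarget provides
  bool TargetHasItineraries; // what the subtarget provides
  bool Disabled;             // per-instance switch, set at init time

  constexpr ToySchedModel(bool HasModel, bool HasItins, bool Disable = false)
      : TargetHasSchedModel(HasModel), TargetHasItineraries(HasItins),
        Disabled(Disable) {}

  // Both queries report "absent" once the instance is disabled.
  constexpr bool hasInstrSchedModel() const {
    return TargetHasSchedModel && !Disabled;
  }
  constexpr bool hasInstrItineraries() const {
    return TargetHasItineraries && !Disabled;
  }
};

// The scheduler's instance is initialized with the disable bit set ...
constexpr ToySchedModel SchedulerModel(true, true, /*Disable=*/true);
// ... while another pass initializes its own instance without it.
constexpr ToySchedModel OtherPassModel(true, true);

static_assert(!SchedulerModel.hasInstrSchedModel(), "scheduler sees no model");
static_assert(OtherPassModel.hasInstrSchedModel(), "other pass keeps the model");
```

A global option, by contrast, would be read by every instance, which is the behavior this change is trying to avoid.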

Change-Id: I34b84c83b5de73a93911641a26a4260f156128d6

llvmbot · 2025-05-01T00:42:27Z

@llvm/pr-subscribers-backend-amdgpu

Author: Jeffrey Byrnes (jrbyrnes)

Full diff: https://github.com/llvm/llvm-project/pull/138057.diff

5 Files Affected:

(modified) llvm/include/llvm/CodeGen/TargetSchedule.h (+4-1)
(modified) llvm/lib/CodeGen/ScheduleDAGInstrs.cpp (+5-1)
(modified) llvm/lib/CodeGen/TargetSchedule.cpp (+7-3)
(modified) llvm/test/CodeGen/AMDGPU/mai-hazards-gfx942.mir (+1)
(added) llvm/test/CodeGen/AMDGPU/sched-no-schedmodel.mir (+50)

diff --git a/llvm/include/llvm/CodeGen/TargetSchedule.h b/llvm/include/llvm/CodeGen/TargetSchedule.h
index bfe4234abf8eb..0314940cbafd5 100644
--- a/llvm/include/llvm/CodeGen/TargetSchedule.h
+++ b/llvm/include/llvm/CodeGen/TargetSchedule.h
@@ -45,6 +45,8 @@ class TargetSchedModel {
 
   unsigned computeInstrLatency(const MCSchedClassDesc &SCDesc) const;
 
+  bool DisableItinerariesAndSchedModel = false;
+
 public:
   TargetSchedModel() : SchedModel(MCSchedModel::Default) {}
 
@@ -53,7 +55,8 @@ class TargetSchedModel {
   /// The machine model API keeps a copy of the top-level MCSchedModel table
   /// indices and may query TargetSubtargetInfo and TargetInstrInfo to resolve
   /// dynamic properties.
-  void init(const TargetSubtargetInfo *TSInfo);
+  void init(const TargetSubtargetInfo *TSInfo,
+            bool DisableItinerariesAndSchedModel = false);
 
   /// Return the MCSchedClassDesc for this instruction.
   const MCSchedClassDesc *resolveSchedClass(const MachineInstr *MI) const;
diff --git a/llvm/lib/CodeGen/ScheduleDAGInstrs.cpp b/llvm/lib/CodeGen/ScheduleDAGInstrs.cpp
index a26804707dd1f..c6d3a0be1dfa5 100644
--- a/llvm/lib/CodeGen/ScheduleDAGInstrs.cpp
+++ b/llvm/lib/CodeGen/ScheduleDAGInstrs.cpp
@@ -69,6 +69,10 @@ static cl::opt<bool>
 static cl::opt<bool> UseTBAA("use-tbaa-in-sched-mi", cl::Hidden,
     cl::init(true), cl::desc("Enable use of TBAA during MI DAG construction"));
 
+static cl::opt<bool> DisableSchedModel(
+    "disable-schedmodel-in-sched-mi", cl::Hidden, cl::init(false),
+    cl::desc("Disable SchedModel / Itineraries during MI scheduling"));
+
 // Note: the two options below might be used in tuning compile time vs
 // output quality. Setting HugeRegion so large that it will never be
 // reached means best-effort, but may be slow.
@@ -121,7 +125,7 @@ ScheduleDAGInstrs::ScheduleDAGInstrs(MachineFunction &mf,
   DbgValues.clear();
 
   const TargetSubtargetInfo &ST = mf.getSubtarget();
-  SchedModel.init(&ST);
+  SchedModel.init(&ST, DisableSchedModel);
 }
 
 /// If this machine instr has memory reference information and it can be
diff --git a/llvm/lib/CodeGen/TargetSchedule.cpp b/llvm/lib/CodeGen/TargetSchedule.cpp
index db884b4940395..98cbeed9f03a3 100644
--- a/llvm/lib/CodeGen/TargetSchedule.cpp
+++ b/llvm/lib/CodeGen/TargetSchedule.cpp
@@ -40,19 +40,23 @@ static cl::opt<bool> ForceEnableIntervals(
     cl::desc("Force the use of resource intervals in the schedule model"));
 
 bool TargetSchedModel::hasInstrSchedModel() const {
-  return EnableSchedModel && SchedModel.hasInstrSchedModel();
+  return EnableSchedModel && SchedModel.hasInstrSchedModel() &&
+         !DisableItinerariesAndSchedModel;
 }
 
 bool TargetSchedModel::hasInstrItineraries() const {
-  return EnableSchedItins && !InstrItins.isEmpty();
+  return EnableSchedItins && !InstrItins.isEmpty() &&
+         !DisableItinerariesAndSchedModel;
 }
 
-void TargetSchedModel::init(const TargetSubtargetInfo *TSInfo) {
+void TargetSchedModel::init(const TargetSubtargetInfo *TSInfo, bool Disable) {
   STI = TSInfo;
   SchedModel = TSInfo->getSchedModel();
   TII = TSInfo->getInstrInfo();
   STI->initInstrItins(InstrItins);
 
+  DisableItinerariesAndSchedModel = Disable;
+
   unsigned NumRes = SchedModel.getNumProcResourceKinds();
   ResourceFactors.resize(NumRes);
   ResourceLCM = SchedModel.IssueWidth;
diff --git a/llvm/test/CodeGen/AMDGPU/mai-hazards-gfx942.mir b/llvm/test/CodeGen/AMDGPU/mai-hazards-gfx942.mir
index d029043f90a85..dc57d421ee03f 100644
--- a/llvm/test/CodeGen/AMDGPU/mai-hazards-gfx942.mir
+++ b/llvm/test/CodeGen/AMDGPU/mai-hazards-gfx942.mir
@@ -1,5 +1,6 @@
 # RUN: llc -mtriple=amdgcn -mcpu=gfx942 -verify-machineinstrs -run-pass post-RA-hazard-rec %s -o - | FileCheck -check-prefixes=GCN,GFX942 %s
 # RUN: llc -mtriple=amdgcn -mcpu=gfx950 -verify-machineinstrs -run-pass post-RA-hazard-rec %s -o - | FileCheck -check-prefixes=GCN,GFX950 %s
+# RUN: llc -mtriple=amdgcn -mcpu=gfx950 -verify-machineinstrs -run-pass post-RA-hazard-rec --disable-schedmodel-in-sched-mi=1 %s -o - | FileCheck -check-prefixes=GCN,GFX950 %s
 
 # GCN-LABEL: name: valu_write_vgpr_sgemm_mfma_read
 # GCN:      V_MOV_B32
diff --git a/llvm/test/CodeGen/AMDGPU/sched-no-schedmodel.mir b/llvm/test/CodeGen/AMDGPU/sched-no-schedmodel.mir
new file mode 100644
index 0000000000000..685b20ddd1156
--- /dev/null
+++ b/llvm/test/CodeGen/AMDGPU/sched-no-schedmodel.mir
@@ -0,0 +1,50 @@
+# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 5
+# RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx942 -misched-cluster=false --misched-prera-direction=topdown -run-pass=machine-scheduler --disable-schedmodel-in-sched-mi=0 -o - %s | FileCheck -check-prefix=GCN %s
+# RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx942 -misched-cluster=false --misched-prera-direction=topdown -run-pass=machine-scheduler --disable-schedmodel-in-sched-mi=1 -o - %s | FileCheck -check-prefix=GCN-NO-SCHEDMODEL %s
+
+---
+name: sched_group_barrier_1_VMEM_READ_1_VALU_5_MFMA_1_VMEM_READ_3_VALU_2_VMEM_WRITE
+tracksRegLiveness: true
+body: |
+  bb.0:
+
+    ; GCN-LABEL: name: sched_group_barrier_1_VMEM_READ_1_VALU_5_MFMA_1_VMEM_READ_3_VALU_2_VMEM_WRITE
+    ; GCN: [[DEF:%[0-9]+]]:vreg_128_align2 = IMPLICIT_DEF
+    ; GCN-NEXT: [[DEF1:%[0-9]+]]:vreg_128_align2 = IMPLICIT_DEF
+    ; GCN-NEXT: early-clobber %2:vreg_512_align2 = contract V_MFMA_F32_32X32X16_FP8_FP8_vgprcd_e64 [[DEF]].sub0_sub1, [[DEF1]].sub0_sub1, 0, 0, 0, 0, implicit $mode, implicit $exec
+    ; GCN-NEXT: [[DEF2:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
+    ; GCN-NEXT: dead [[DS_READ_U16_gfx9_:%[0-9]+]]:vgpr_32 = DS_READ_U16_gfx9 [[DEF2]], 0, 0, implicit $exec
+    ; GCN-NEXT: [[DEF3:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
+    ; GCN-NEXT: dead [[DS_READ_U16_gfx9_1:%[0-9]+]]:vgpr_32 = DS_READ_U16_gfx9 [[DEF3]], 0, 0, implicit $exec
+    ; GCN-NEXT: [[DEF4:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
+    ; GCN-NEXT: dead [[DS_READ_U16_gfx9_2:%[0-9]+]]:vgpr_32 = DS_READ_U16_gfx9 [[DEF4]], 0, 0, implicit $exec
+    ; GCN-NEXT: [[V_MUL_LO_U32_e64_:%[0-9]+]]:vgpr_32 = nsw V_MUL_LO_U32_e64 %2.sub0, %2.sub1, implicit $exec
+    ; GCN-NEXT: early-clobber %3:vreg_512_align2 = contract V_MFMA_F32_32X32X16_FP8_FP8_vgprcd_e64 [[DEF]].sub0_sub1, [[DEF1]].sub0_sub1, 0, 0, 0, 0, implicit $mode, implicit $exec
+    ; GCN-NEXT: S_ENDPGM 0, implicit %2, implicit %3, implicit [[V_MUL_LO_U32_e64_]]
+    ;
+    ; GCN-NO-SCHEDMODEL-LABEL: name: sched_group_barrier_1_VMEM_READ_1_VALU_5_MFMA_1_VMEM_READ_3_VALU_2_VMEM_WRITE
+    ; GCN-NO-SCHEDMODEL: [[DEF:%[0-9]+]]:vreg_128_align2 = IMPLICIT_DEF
+    ; GCN-NO-SCHEDMODEL-NEXT: [[DEF1:%[0-9]+]]:vreg_128_align2 = IMPLICIT_DEF
+    ; GCN-NO-SCHEDMODEL-NEXT: early-clobber %2:vreg_512_align2 = contract V_MFMA_F32_32X32X16_FP8_FP8_vgprcd_e64 [[DEF]].sub0_sub1, [[DEF1]].sub0_sub1, 0, 0, 0, 0, implicit $mode, implicit $exec
+    ; GCN-NO-SCHEDMODEL-NEXT: early-clobber %3:vreg_512_align2 = contract V_MFMA_F32_32X32X16_FP8_FP8_vgprcd_e64 [[DEF]].sub0_sub1, [[DEF1]].sub0_sub1, 0, 0, 0, 0, implicit $mode, implicit $exec
+    ; GCN-NO-SCHEDMODEL-NEXT: [[V_MUL_LO_U32_e64_:%[0-9]+]]:vgpr_32 = nsw V_MUL_LO_U32_e64 %2.sub0, %2.sub1, implicit $exec
+    ; GCN-NO-SCHEDMODEL-NEXT: [[DEF2:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
+    ; GCN-NO-SCHEDMODEL-NEXT: dead [[DS_READ_U16_gfx9_:%[0-9]+]]:vgpr_32 = DS_READ_U16_gfx9 [[DEF2]], 0, 0, implicit $exec
+    ; GCN-NO-SCHEDMODEL-NEXT: [[DEF3:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
+    ; GCN-NO-SCHEDMODEL-NEXT: dead [[DS_READ_U16_gfx9_1:%[0-9]+]]:vgpr_32 = DS_READ_U16_gfx9 [[DEF3]], 0, 0, implicit $exec
+    ; GCN-NO-SCHEDMODEL-NEXT: [[DEF4:%[0-9]+]]:vgpr_32 = IMPLICIT_DEF
+    ; GCN-NO-SCHEDMODEL-NEXT: dead [[DS_READ_U16_gfx9_2:%[0-9]+]]:vgpr_32 = DS_READ_U16_gfx9 [[DEF4]], 0, 0, implicit $exec
+    ; GCN-NO-SCHEDMODEL-NEXT: S_ENDPGM 0, implicit %2, implicit %3, implicit [[V_MUL_LO_U32_e64_]]
+    %0:vreg_128_align2 = IMPLICIT_DEF
+    %1:vreg_128_align2 = IMPLICIT_DEF
+    %2:vreg_512_align2 = contract V_MFMA_F32_32X32X16_FP8_FP8_vgprcd_e64 %0.sub0_sub1:vreg_128_align2, %1.sub0_sub1:vreg_128_align2, 0, 0, 0, 0, implicit $mode, implicit $exec
+    %3:vreg_512_align2 = contract V_MFMA_F32_32X32X16_FP8_FP8_vgprcd_e64 %0.sub0_sub1:vreg_128_align2, %1.sub0_sub1:vreg_128_align2, 0, 0, 0, 0, implicit $mode, implicit $exec
+    %4:vgpr_32 = nsw V_MUL_LO_U32_e64 %2.sub0, %2.sub1, implicit $exec
+    %5:vgpr_32 = IMPLICIT_DEF
+    %6:vgpr_32 = DS_READ_U16_gfx9 %5, 0, 0, implicit $exec
+    %7:vgpr_32 = IMPLICIT_DEF
+    %8:vgpr_32 = DS_READ_U16_gfx9 %7, 0, 0, implicit $exec
+    %9:vgpr_32 = IMPLICIT_DEF
+    %10:vgpr_32 = DS_READ_U16_gfx9 %9, 0, 0, implicit $exec
+    S_ENDPGM 0, implicit %2, implicit %3, implicit %4
+...

arsenm

I don't know why you would want to disable this only during a specific pass. In general, we have too many of these old debug flags that were used for bring-up and have no real use case.

llvm/lib/CodeGen/ScheduleDAGInstrs.cpp

llvm/lib/CodeGen/TargetSchedule.cpp

Change-Id: I2c7080bce7fadbb7b6c471457edbc0606c1b0bb0

jrbyrnes · 2025-05-01T18:14:04Z

> I don't know why you would want to only disable this during a specific pass

Certain passes may need this info for correctness. I've ported the existing flags into the scheduler so that they only affect the scheduling pass.

github-actions · 2025-05-01T18:16:35Z

✅ With the latest revision this PR passed the C/C++ code formatter.

Change-Id: Ied902da014ca3dff4fc47f2a0871523b0dcd97da

jrbyrnes · 2025-05-01T18:31:29Z

The only user of this flag that I can find is X86/sink-hoist.ll, and the port has no effect on it: https://github.com/llvm/llvm-project/blob/main/llvm/test/CodeGen/X86/sink-hoist.ll

RKSimon · 2025-05-02T12:21:29Z

I'd be much more in favour of getting rid of these kludge flags once and for all tbh.

arsenm · 2025-05-02T13:02:31Z

llvm/lib/CodeGen/ScheduleDAGInstrs.cpp

+static cl::opt<bool>
+    EnableSchedModel("schedmodel", cl::Hidden, cl::init(true),
+                     cl::desc("Use TargetSchedModel for latency lookup"));
+
+static cl::opt<bool>
+    EnableSchedItins("scheditins", cl::Hidden, cl::init(true),
+                     cl::desc("Use InstrItineraryData for latency lookup"));


These are mutually exclusive though? What happens if you set both?

As far as I can tell -- I don't see this mutual exclusion constraint encoded. It seems like the API handles the case of having neither --

Default if both are missing --

llvm-project/llvm/lib/CodeGen/TargetSchedule.cpp

Line 179 in 173ec72

if (!hasInstrSchedModel() && !hasInstrItineraries())

Default if both are missing --

llvm-project/llvm/lib/CodeGen/TargetSchedule.cpp

Line 269 in 173ec72

return TII->defaultDefLatency(SchedModel, *MI);
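
The fallback described above can be sketched as follows. This is a simplified model under stated assumptions, not the code in `TargetSchedule.cpp`: the function name `lookupLatency`, the latency values, and the choice to consult itineraries first when both sources are present are hypothetical.

```cpp
// Hypothetical latency lookup; the values and the priority order are
// assumptions made for illustration only.
constexpr unsigned DefaultLatency = 1; // stand-in for a target default

constexpr unsigned lookupLatency(bool HasSchedModel, bool HasItineraries,
                                 unsigned ModelLatency, unsigned ItinLatency) {
  return HasItineraries  ? ItinLatency     // assumed to win when both are set
         : HasSchedModel ? ModelLatency
                         : DefaultLatency; // neither source is available
}

static_assert(lookupLatency(false, false, 4, 8) == 1, "default when both missing");
static_assert(lookupLatency(true, false, 4, 8) == 4, "model only");
static_assert(lookupLatency(true, true, 4, 8) == 8, "both set: itineraries assumed first");
```

The point of the sketch is that the both-missing case is well defined, while the both-set case depends on an ordering that the two booleans themselves do not express.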

arsenm · 2025-05-02T13:03:11Z

llvm/include/llvm/CodeGen/TargetSchedule.h

+  bool EnableSchedModel = true;
+  bool EnableSchedItins = true;


Document these, maybe should just make it an enum for which type to use
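
One way the enum suggestion could look, as a hedged sketch only: `LatencySource` and `fromFlags` are hypothetical names that do not appear in the patch, and preferring itineraries when both booleans are set is an arbitrary assumption.

```cpp
// Hypothetical replacement for the EnableSchedModel / EnableSchedItins pair.
enum class LatencySource { None, Itineraries, SchedModel };

// Maps the two existing booleans onto the enum. The both-true case has to
// pick one source; preferring itineraries here is an assumption.
constexpr LatencySource fromFlags(bool EnableSchedModel,
                                  bool EnableSchedItins) {
  return EnableSchedItins   ? LatencySource::Itineraries
         : EnableSchedModel ? LatencySource::SchedModel
                            : LatencySource::None;
}

static_assert(fromFlags(false, false) == LatencySource::None, "neither enabled");
static_assert(fromFlags(true, false) == LatencySource::SchedModel, "model only");
```

With a single enum member, the ambiguous both-enabled state can no longer be represented, so the question of which source wins is answered once, at the conversion point.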

Change-Id: I132cdb3b5709ac84ae858fa1aecee399abcec63f

jrbyrnes · 2025-05-02T17:44:56Z

> I'd be much more in favour of getting rid of these kludge flags once and for all tbh.

Presently I find that they are a good way to experiment with latency / hazard agnostic scheduling. But I don't disagree with you: I think the original intent of these flags was to prefer Itins over SchedModel or vice-versa -- for that purpose, I agree that we shouldn't have these flags.

[ScheduleDAG] Allow disabling the SchedModel during Scheduling

9074bc3

Change-Id: I34b84c83b5de73a93911641a26a4260f156128d6

jrbyrnes requested review from arsenm and kerbowa May 1, 2025 00:41

llvmbot added the backend:AMDGPU label May 1, 2025

jrbyrnes changed the title ~~[ScheduleDAG] Allow disabling the SchedModel during Scheduling~~ [ScheduleDAG] Allow disabling the SchedModel / Itineraries during Scheduling May 1, 2025

arsenm reviewed May 1, 2025

llvm/lib/CodeGen/ScheduleDAGInstrs.cpp (outdated)

llvm/lib/CodeGen/TargetSchedule.cpp (outdated)

Port existing TargetSchedule flags to ScheduleDAG

c279a9d

Change-Id: I2c7080bce7fadbb7b6c471457edbc0606c1b0bb0

Formatting

d7fa3f4

Change-Id: Ied902da014ca3dff4fc47f2a0871523b0dcd97da

jrbyrnes requested a review from RKSimon May 1, 2025 18:29

arsenm reviewed May 2, 2025

Review comments

fb92404

Change-Id: I132cdb3b5709ac84ae858fa1aecee399abcec63f

arsenm added the llvm:codegen label May 5, 2025

arsenm approved these changes May 5, 2025

jrbyrnes merged commit 00e7a02 into llvm:main May 5, 2025
12 checks passed
