[mlir][xegpu] SIMT distribution patterns for XeGPU CreateNdTdesc, LoadNd, StoreNd and Dpas Ops. #135271

charithaintc · 2025-04-10T22:19:28Z

This PR adds the SIMT distribution patterns for create_nd_tdesc, load_nd, store_nd and dpas XeGPU ops.

Depends on: #135116

fschlimb · 2025-04-29T10:08:57Z

mlir/lib/Dialect/XeGPU/Transforms/XeGPUSubgroupDistribute.cpp

  TypeSwitch<Operation *>(op)
      .Case<xegpu::DpasOp>(
          [&](auto dpasOp) { visitDpasOp(dpasOp, operands, results); })


Similar to the sharding propagation, you might consider using an OpInterface for this. It allows for a nicer code structure (the code is with the operation, not the lattice), for easier extension and for prettier dispatch code.

I will take a look at your code. I think we can try to incorporate those ideas as well in future.
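To make the suggestion above concrete, here is a minimal sketch in plain C++ of the difference it describes: the per-op logic lives with each op type and the visitor makes one call, instead of keeping a case list. This is only an analogy using virtual functions; it is not MLIR's OpInterface machinery, and `LayoutPropagatable`, `DpasLike`, `LoadNdLike` and `visitAll` are hypothetical names that do not appear in the PR.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <vector>

// Hypothetical stand-in for an op interface: each op type carries its own
// propagation logic, so the analysis does not need to switch on op types.
struct LayoutPropagatable {
  virtual ~LayoutPropagatable() = default;
  // Returns an illustrative label for what this op did when visited.
  virtual std::string propagateLayout() const = 0;
};

// Illustrative op types; real XeGPU ops are not modelled here.
struct DpasLike : LayoutPropagatable {
  std::string propagateLayout() const override { return "visited dpas"; }
};

struct LoadNdLike : LayoutPropagatable {
  std::string propagateLayout() const override { return "visited load_nd"; }
};

// Dispatch is a single virtual call; supporting a new op means adding a new
// type, not another case in the visitor.
std::vector<std::string>
visitAll(const std::vector<std::unique_ptr<LayoutPropagatable>> &ops) {
  std::vector<std::string> log;
  for (const auto &op : ops) {
    assert(op && "null op in visit list");
    log.push_back(op->propagateLayout());
  }
  return log;
}
```

With this shape, the visitor does not change when support for another op is added, which is the "easier extension" point raised in the comment.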

mlir/lib/Dialect/XeGPU/Utils/XeGPUUtils.cpp

mlir/include/mlir/Dialect/XeGPU/Utils/XeGPUUtils.h

mlir/lib/Dialect/XeGPU/Transforms/XeGPUSubgroupDistribute.cpp

charithaintc · 2025-04-29T18:05:59Z

Hi @fschlimb, this is Charitha from the IMEX team. We have the initial part of the XeGPU subgroup - SIMT distribution work ready for review. If you are interested and have the bandwidth, please have a look and give us feedback/approval. Thanks!

Thanks for getting in touch, interesting!

I added a few comments. This is not an area that I typically work in, so they are mostly on monkey level.

While distribution is different in this context than in the context of distributed memory, there are commonalities (similar to the similarities between tiling and sharding). It seems to me that tiling/vector-distribution are special cases of general sharding/spmdization. Unification might be worth considering.

Hi Frank, thanks very much for the review. I tried to address everything as best I can. Please take a look.

Out of curiosity, in addition to the question about the forward propagation, I wonder how tensor shapes that do not evenly divide to lane-sizes would be treated.

Good question. For the operators handled in this PR, we expect perfect distribution. If the high-level work-group-level computation does not map evenly to lane sizes, it is the responsibility of the work-group-to-subgroup distribution or subsequent optimizations to ensure perfect distribution at the SIMT level. I think this will be done using masked gather/scatter-type loads.

We also check this requirement during the lowering. If the vector shape is not distributable, the pass will report that and fail.
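The "perfect distribution" requirement described above can be sketched as a divisibility check. The helper below is a hypothetical illustration in plain C++, not the check implemented in the PR: `distributedShape` and its parameters are assumed names, and it assumes each vector dimension is split across the lane count given for that dimension.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical helper (not part of the XeGPU API): computes the per-lane shape
// of a vector distributed over a lane layout, or std::nullopt if the vector is
// not perfectly distributable.
std::optional<std::vector<int64_t>>
distributedShape(const std::vector<int64_t> &shape,
                 const std::vector<int64_t> &laneLayout) {
  // The layout must name a lane count for every vector dimension.
  if (shape.size() != laneLayout.size())
    return std::nullopt;
  std::vector<int64_t> perLane;
  perLane.reserve(shape.size());
  for (std::size_t i = 0; i < shape.size(); ++i) {
    // A non-positive lane count or an uneven split means "not distributable".
    if (laneLayout[i] <= 0 || shape[i] % laneLayout[i] != 0)
      return std::nullopt;
    perLane.push_back(shape[i] / laneLayout[i]);
  }
  assert(perLane.size() == shape.size());
  return perLane;
}
```

For example, under these assumptions an 8x16 vector over a 1x16 lane layout gives each lane an 8x1 slice, while an 8x15 vector over the same layout is rejected. A caller could turn the empty result into a reported failure.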

mlir/lib/Dialect/XeGPU/Transforms/XeGPUSubgroupDistribute.cpp

adam-smnk

Overall structure LGTM % open comments.
I lack the experience in distribution itself to give an in-depth review of the core logic, but nothing obvious catches my eye.

mlir/lib/Dialect/XeGPU/Transforms/XeGPUSubgroupDistribute.cpp

kazutakahirata · 2025-05-01T19:41:53Z

@charithaintc I've landed b2e2ae8 to fix warnings from this PR. Thanks!

charithaintc · 2025-05-05T14:44:43Z

@charithaintc I've landed b2e2ae8 to fix warnings from this PR. Thanks!

Thanks!

charithaintc and others added 30 commits March 18, 2025 20:32

save work

39dcf9d

moving all ops to region working

2058773

moving all ops to region working

14233fa

save work

f599873

save work

220ed1f

save work

2a8070f

extend sg_map from subgroup to workgroup

4838b52

format code

cb26979

remove changes to prefetch op

273fc40

refine the doc for TensorDesc

504d274

save work

90e0704

save work

3abe7cb

Merge branch 'main' into xegpu_simt_dist

7c87319

update doc

596c953

save work

2065764

refine docs

899439b

refine docs

8636d15

refine util

0190418

refine convert_layout docs

32f9272

save work

fe11c79

save work

6e1ef3e

save work

55c272c

Merge branch 'gpu_dialect_changes' into xegpu_simt_dist

ee56a3e

save work

1ffe5c8

save work before merging with Chao's PR

e5521f9

Merge branch 'users/chencha3/xegpu/extend_sg_map' into xegpu_simt_dist

350b581

merge xegpu changes

5700c81

Merge branch 'main' into xegpu_simt_dist

1619fcf

refactor names

2334a97

drop ScopeAttr and refine 1D layout support

9bddeb6

fschlimb reviewed Apr 29, 2025

adam-smnk reviewed Apr 29, 2025

mlir/lib/Dialect/XeGPU/Utils/XeGPUUtils.cpp (resolved)

mlir/include/mlir/Dialect/XeGPU/Utils/XeGPUUtils.h (outdated, resolved)

adam-smnk reviewed Apr 29, 2025

Merge branch 'main' into xegpu_simt_dist

9b28449

charithaintc added 5 commits April 29, 2025 18:16

address comments

1464adb

address comments

14468b5

address comments

b84c2f9

address comments

bff1f5e

address comments

36206bb

fschlimb reviewed Apr 30, 2025

mlir/lib/Dialect/XeGPU/Transforms/XeGPUSubgroupDistribute.cpp (outdated, resolved)

charithaintc added 2 commits April 30, 2025 15:05

save work

0328684

save work

466712f

adam-smnk approved these changes Apr 30, 2025

mlir/lib/Dialect/XeGPU/Transforms/XeGPUSubgroupDistribute.cpp (outdated, resolved)

mlir/lib/Dialect/XeGPU/Transforms/XeGPUSubgroupDistribute.cpp (outdated, resolved)

charithaintc and others added 9 commits April 30, 2025 17:17

save work

0baef66

save work

cadc078

add missing lib

24635e0

add missing lib

bed781b

Merge branch 'main' into xegpu_simt_dist

afdf394

Merge branch 'main' into xegpu_simt_dist

84afb20

Merge branch 'main' into xegpu_simt_dist

7e0d753

Merge branch 'main' into xegpu_simt_dist

d4abd69

Merge branch 'main' into xegpu_simt_dist

3722dec

charithaintc merged commit d30554b into llvm:main Apr 30, 2025
6 of 9 checks passed
