[FlexAttention] Allow num_warps 8 when block size >= 128 #143299
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/143299
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit a5420d8 with merge base 7ab3177.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Fixes pytorch#143331
Pull Request resolved: pytorch#143344
Approved by: https://github.com/Chillee
ghstack dependencies: pytorch#143299

# Summary
Fixes pytorch#143290
We already strip bad configs here: https://github.com/pytorch/pytorch/blob/e0e763e33135d2ad25c613007aa5f2fee6d2cc24/torch/_inductor/kernel/flex_attention.py#L2299
So this shouldn't be needed. Confirming that the 64 x 128 case is valid; otherwise we can just change the default config.
Pull Request resolved: pytorch#143299
Approved by: https://github.com/yanboliang
ghstack-source-id: 6992e3a
Pull Request resolved: pytorch/pytorch#143299
Adding to 2.6.1 as requested by Runway.
Is that not worth making a patch release 2.6.1?
Stack from ghstack (oldest at bottom):
Summary
Fixes #143290
We already strip bad configs here: https://github.com/pytorch/pytorch/blob/e0e763e33135d2ad25c613007aa5f2fee6d2cc24/torch/_inductor/kernel/flex_attention.py#L2299
So this shouldn't be needed. Confirming that the 64 x 128 case is valid; otherwise we can just change the default config.
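For illustration, here is a minimal, hypothetical sketch of the kind of config pruning this relies on. The `FlexConfig` tuple, the `prune_configs` helper, and the 128 threshold are assumptions made for the example; this is not the actual code in torch/_inductor/kernel/flex_attention.py.

```python
# Hypothetical sketch only -- not the real torch/_inductor implementation.
# Idea: instead of forbidding num_warps=8 up front, rely on a pruning pass
# that drops 8-warp configs whose tiles are too small (< 128 in both dims).
from collections import namedtuple

FlexConfig = namedtuple("FlexConfig", ["block_m", "block_n", "num_stages", "num_warps"])

def prune_configs(configs):
    """Keep only configs expected to compile and perform reasonably."""
    kept = []
    for cfg in configs:
        # Assumption for this sketch: 8 warps only make sense once at least
        # one tile dimension reaches 128; smaller tiles with 8 warps are dropped.
        if cfg.num_warps == 8 and max(cfg.block_m, cfg.block_n) < 128:
            continue
        kept.append(cfg)
    return kept

candidates = [
    FlexConfig(128, 64, 3, 8),   # kept: block_m reaches 128
    FlexConfig(64, 128, 3, 8),   # kept: block_n reaches 128 (the 64 x 128 case above)
    FlexConfig(64, 64, 3, 8),    # dropped: 8 warps on a small tile
    FlexConfig(64, 64, 3, 4),    # kept: fewer warps are fine for small tiles
]
print(prune_configs(candidates))
```

Under that assumption, allowing num_warps 8 in the candidate list is safe, because any problematic small-tile combination is stripped before autotuning rather than being rejected by a separate guard.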
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov @Chillee @yanboliang @BoyuanFeng