[Quant][PT2E] change flatten recipe for X86InductorQuantizer #136298
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/136298
Note: Links to docs will display an error until the docs builds have been completed. ✅ You can merge normally! (4 unrelated failures.) As of commit aed4b1d with merge base 0cdc6a8, the following jobs failed but were likely due to flakiness present on trunk (FLAKY).
This comment was automatically generated by Dr. CI and updates every 15 minutes.
    elif (
        node.target is torch.ops.aten.flatten.using_ints
        and len(node.users) > 0
        and not is_any_users_connected_to_quantizable_op(node.users)
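For context, a hedged sketch of what the helper checked here might look like; its real body is not shown in this excerpt, and the members of quantizable_ops below are assumptions for illustration only:

    import torch

    # Assumed for illustration: the set of op overloads that
    # X86InductorQuantizer treats as quantizable.
    quantizable_ops = {
        torch.ops.aten.conv2d.default,
        torch.ops.aten.linear.default,
    }

    def is_any_users_connected_to_quantizable_op(users) -> bool:
        # users is node.users from torch.fx: a dict keyed by the user
        # nodes, so iterating over it yields the user nodes directly.
        return any(user.target in quantizable_ops for user in users)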
nit: this could be inlined as not any(user.target in quantizable_ops for user in node.users), since is_any_users_connected_to_quantizable_op is only used here.
Please also add a comment here for the special recipe of flatten.
Please add the summary of this PR and the description of this special flatten quant recipe.
Thank you for your review. The code and description have been updated.
What model does this change help with?
ViT
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
[Quant][PT2E] change flatten recipe for X86InductorQuantizer (#136298). Pull Request resolved: pytorch#136298. Approved by: https://github.com/leslie-fang-intel, https://github.com/jgong5
Stack from ghstack (oldest at bottom):
This PR modifies the flatten recipe: if none of the users of the flatten node are quantizable ops, int8 flatten will be disabled to avoid unnecessary dtype conversions.
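A minimal sketch of the pattern this recipe targets; the module and the choice of softmax as the non-quantizable consumer are illustrative assumptions, not taken from the PR:

    import torch

    class FlattenToNonQuantizable(torch.nn.Module):
        # Assumed example: flatten's only user is softmax, which the
        # quantizer does not treat as quantizable. Under this recipe the
        # flatten stays in fp32, so no dequant/quant pair is inserted
        # around it just to run the flatten in int8.
        def forward(self, x):
            x = torch.flatten(x, 1)          # lowers to aten.flatten.using_ints
            return torch.softmax(x, dim=-1)  # non-quantizable user

By contrast, a flatten that feeds a linear layer (the usual classifier-head pattern in ViT) still has a quantizable user, so its int8 recipe is unchanged.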
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang