-
Notifications
You must be signed in to change notification settings - Fork 24.1k
[inductor] [compile async] Don't compile in eager #152507
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152507
Note: Links to docs will display an error until the docs builds have been completed. ❗ 1 Active SEVsThere are 1 currently active SEVs. If your PR is affected, please view them below: ❌ 22 New FailuresAs of commit c8a3815 with merge base 9c7b902 ( NEW FAILURES - The following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
@pytorchbot label "topic: not user facing" |
@pytorchbot rebase |
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
Rebase failed due to Command
Raised by https://github.com/pytorch/pytorch/actions/runs/14825161606 |
I'm not sure if I like this change. My original intention for this dict was so we could tell the difference between the stats for different types of compile modes. With this change we can no longer tell the difference between the types/modes of compile. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks like tests are failing, you should also address @aorenste's concerns.
Then maybe I misunderstood your idea. I thought it can be a method to decrease the latency of In my local test, it can reduce the latency of |
Previously we will compile in eager mode.
This looks not intentional according to the test. There is a check to check the number of compilations (in current process) to be 0. But maybe due to an oversight, the number it checks is always a zero.
In _InProcessFxCompile and _SerializedFxCompile, we increment the number of
codegen_and_compile
byself
, which is a member variable attached to the instance. But in test, we check the number ofcodegen_and_compile
by the class. I think we should increment the number ofcodegen_and_compile
by the class. Then the test will fail now.See torch/_inductor/compile_fx_async.py for the fix.
CC @aorenste
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov