[inductor] [compile async] Don't compile in eager #152507

ChuanqiXu9 · 2025-04-30T02:14:41Z

Previously we will compile in eager mode.

This looks not intentional according to the test. There is a check to check the number of compilations (in current process) to be 0. But maybe due to an oversight, the number it checks is always a zero.

In _InProcessFxCompile and _SerializedFxCompile, we increment the number of codegen_and_compile by self, which is a member variable attached to the instance. But in test, we check the number of codegen_and_compile by the class. I think we should increment the number of codegen_and_compile by the class. Then the test will fail now.

See torch/_inductor/compile_fx_async.py for the fix.

CC @aorenste

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

pytorch-bot · 2025-04-30T02:14:45Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152507

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

CI workflows being skipped on PR

❌ 22 New Failures

As of commit c8a3815 with merge base 9c7b902 ():

NEW FAILURES - The following jobs have failed:

Lint / Link checks / Lint URLs / linux-job (gh)
RuntimeError: Command docker exec -t 1961bc9116a00e7fb36ff0b84d03a3138dcdd865604e27569b72139b437dc116 /exec failed with exit code 1
Lint / lintrunner-noclang / linux-job (gh)
>>> Lint for torch/_inductor/compile_fx.py:
pull / linux-focal-cuda12.6-py3.10-gcc11-sm89 / test (default, 2, 5, lf.ephemeral.linux.g6.4xlarge.experimental.nvidia.gpu) (gh)
inductor/test_compile_subprocess.py::GPUTests::test_aoti_eager_support_out_cuda
pull / linux-focal-cuda12.6-py3.10-gcc11-sm89 / test (default, 3, 5, lf.ephemeral.linux.g6.4xlarge.experimental.nvidia.gpu) (gh)
inductor/test_compile_subprocess.py::GPUTests::test_aoti_eager_cache_hit_cuda
pull / linux-focal-py3.13-clang10 / test (default, 2, 5, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_with_persistent_cache_cpu
pull / linux-focal-py3.13-clang10 / test (default, 3, 5, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_support_out_cpu
pull / linux-focal-py3.13-clang10 / test (default, 4, 5, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_support_str_cpu
pull / linux-focal-py3.13-clang10 / test (default, 5, 5, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_cache_hit_cpu
pull / linux-focal-py3.9-clang10 / test (default, 2, 5, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_with_persistent_cache_cpu
pull / linux-focal-py3.9-clang10 / test (default, 3, 5, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_support_out_cpu
pull / linux-focal-py3.9-clang10 / test (default, 4, 5, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_support_str_cpu
pull / linux-focal-py3.9-clang10 / test (default, 5, 5, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_cache_hit_cpu
pull / linux-jammy-py3.10-clang15-asan / test (default, 1, 6, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_support_out_cpu
pull / linux-jammy-py3.10-clang15-asan / test (default, 2, 6, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_fallback_mutable_op_list_cpu
pull / linux-jammy-py3.10-clang15-asan / test (default, 3, 6, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_cache_hit_cpu
pull / linux-jammy-py3.10-clang15-asan / test (default, 4, 6, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_support_str_cpu
pull / linux-jammy-py3.10-clang15-asan / test (default, 5, 6, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_fallback_mutable_op_no_mutated_tensors_cpu
pull / linux-jammy-py3.10-clang15-asan / test (default, 6, 6, lf.ephemeral.linux.4xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_with_persistent_cache_cpu
pull / linux-jammy-py3.9-gcc11 / test (default, 2, 5, lf.ephemeral.linux.2xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_with_persistent_cache_cpu
pull / linux-jammy-py3.9-gcc11 / test (default, 3, 5, lf.ephemeral.linux.2xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_support_out_cpu
pull / linux-jammy-py3.9-gcc11 / test (default, 4, 5, lf.ephemeral.linux.2xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_support_str_cpu
pull / linux-jammy-py3.9-gcc11 / test (default, 5, 5, lf.ephemeral.linux.2xlarge) (gh)
inductor/test_compile_subprocess.py::CpuTests::test_aoti_eager_cache_hit_cpu

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ChuanqiXu9 · 2025-04-30T02:15:52Z

@pytorchbot label "topic: not user facing"

jansel · 2025-05-04T21:03:55Z

@pytorchbot rebase

pytorchmergebot · 2025-05-04T21:05:24Z

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

pytorchmergebot · 2025-05-04T21:05:26Z

Rebase failed due to Command git -C /home/runner/work/pytorch/pytorch rebase refs/remotes/origin/viable/strict pull/152507/head returned non-zero exit code 1

Rebasing (1/1)
Auto-merging torch/_inductor/compile_fx.py
CONFLICT (content): Merge conflict in torch/_inductor/compile_fx.py
error: could not apply c8a3815c6f4... [inductor] [compile] [async] Don't compile in eager
hint: Resolve all conflicts manually, mark them as resolved with
hint: "git add/rm <conflicted_files>", then run "git rebase --continue".
hint: You can instead skip this commit: run "git rebase --skip".
hint: To abort and get back to the state before "git rebase", run "git rebase --abort".
hint: Disable this message with "git config set advice.mergeConflict false"
Could not apply c8a3815c6f4... [inductor] [compile] [async] Don't compile in eager

Raised by https://github.com/pytorch/pytorch/actions/runs/14825161606

aorenste · 2025-05-05T00:42:44Z

I'm not sure if I like this change. My original intention for this dict was so we could tell the difference between the stats for different types of compile modes. With this change we can no longer tell the difference between the types/modes of compile.

jansel

Looks like tests are failing, you should also address @aorenste's concerns.

ChuanqiXu9 · 2025-05-06T02:58:11Z

I'm not sure if I like this change. My original intention for this dict was so we could tell the difference between the stats for different types of compile modes. With this change we can no longer tell the difference between the types/modes of compile.

Then maybe I misunderstood your idea. I thought it can be a method to decrease the latency of torch.compile.

In my local test, it can reduce the latency of torch.compile by 30%-70% (end to end). I feel this is worthy. How do you feel about to add this idea to be a different compile modes?

[inductor] [compile] [async] Don't compile in eager

c8a3815

pytorch-bot bot added the module: inductor label Apr 30, 2025

pytorch-bot bot added the topic: not user facing topic category label Apr 30, 2025

pytorchbot added the open source label Apr 30, 2025

HDCharles requested review from jansel and jamesjwu May 2, 2025 03:50

HDCharles added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label May 2, 2025

jansel approved these changes May 4, 2025

View reviewed changes

jansel requested review from aorenste and jansel and removed request for jansel May 5, 2025 00:43

jansel requested changes May 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[inductor] [compile async] Don't compile in eager #152507

[inductor] [compile async] Don't compile in eager #152507

ChuanqiXu9 commented Apr 30, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Apr 30, 2025 •

edited

Loading

ChuanqiXu9 commented Apr 30, 2025

jansel commented May 4, 2025

pytorchmergebot commented May 4, 2025

pytorchmergebot commented May 4, 2025

aorenste commented May 5, 2025

jansel left a comment

ChuanqiXu9 commented May 6, 2025

[inductor] [compile async] Don't compile in eager #152507

Are you sure you want to change the base?

[inductor] [compile async] Don't compile in eager #152507

Conversation

ChuanqiXu9 commented Apr 30, 2025 • edited by pytorch-bot bot Loading

pytorch-bot bot commented Apr 30, 2025 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152507

❗ 1 Active SEVs

❌ 22 New Failures

ChuanqiXu9 commented Apr 30, 2025

jansel commented May 4, 2025

pytorchmergebot commented May 4, 2025

pytorchmergebot commented May 4, 2025

aorenste commented May 5, 2025

jansel left a comment

Choose a reason for hiding this comment

ChuanqiXu9 commented May 6, 2025

ChuanqiXu9 commented Apr 30, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Apr 30, 2025 •

edited

Loading