Optimize AOTInductor: Caching, Reduced Decompositions, and Improved JSON Handling #148616

devsashidhar · 2025-03-05T22:36:57Z

PR Description:
This PR improves AOTInductor's compilation performance by introducing caching, limiting unnecessary decompositions, and optimizing JSON handling.

Changes:
Added persistent caching to avoid redundant recompilation.
Restricted decompositions to only necessary operators (aten::add, aten::mul).
Optimized JSON metadata updates to prevent unnecessary file writes.

Impact:
Reduces compilation time for repeated runs.
Improves efficiency by only updating metadata when needed.
Helps prevent excessive decompositions, leading to better overall performance.

Testing:
Ran pytest test/inductor to check for regressions.
Verified that AOT compilation is significantly faster on repeated runs.

cc @H-Huang @awgu @kwen2501 @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @c-p-i-o @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

… handling

pytorch-bot · 2025-03-05T22:37:00Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148616

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 7890c7b with merge base 4b35139 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

devsashidhar · 2025-03-05T22:38:56Z

@pytorchbot label "topic: not user facing"

desertfire

I will let @EikanWang to comment the aoti_eager part.

desertfire · 2025-03-06T13:55:07Z

args_any_results.txt

@@ -0,0 +1,259 @@
+torch/_higher_order_ops/utils.py:    operator: OperatorBase, delayed_error: bool, *args: Any, **kwargs: Any


What is this file for?

desertfire · 2025-03-06T13:56:51Z

vision

Please revert this third-party change.

desertfire · 2025-03-06T13:57:18Z

torch/nn/parallel/data_parallel.py

@@ -168,7 +173,7 @@ def __init__(
        if len(self.device_ids) == 1:
            self.module.to(self.src_device_obj)

-    def forward(self, *inputs: Any, **kwargs: Any) -> Any:
+    def forward(self, *inputs: P.args, **kwargs: P.kwargs) -> R:


Is this relevant to this PR?

github-actions · 2025-07-01T17:38:20Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

devsashidhar added 2 commits February 20, 2025 22:31

Refactor typing: Replace Any with ParamSpec for better type safety

0ce2203

Optimize AOTInductor: caching, reduced decompositions, optimized JSON…

7890c7b

… handling

devsashidhar requested review from albanD, jbschlosser and mikaylagawarecki as code owners March 5, 2025 22:36

pytorch-bot bot added module: inductor oncall: distributed Add this issue/PR to distributed oncall triage queue labels Mar 5, 2025

pytorch-bot bot added the topic: not user facing topic category label Mar 5, 2025

pytorchbot added the open source label Mar 5, 2025

bdhirsh requested a review from desertfire March 6, 2025 01:53

bdhirsh added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Mar 6, 2025

desertfire requested a review from EikanWang March 6, 2025 13:54

desertfire reviewed Mar 6, 2025

View reviewed changes

mikaylagawarecki removed their request for review May 2, 2025 16:54

github-actions bot added the Stale label Jul 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize AOTInductor: Caching, Reduced Decompositions, and Improved JSON Handling #148616

Optimize AOTInductor: Caching, Reduced Decompositions, and Improved JSON Handling #148616

Uh oh!

devsashidhar commented Mar 5, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Mar 5, 2025 •

edited

Loading

Uh oh!

devsashidhar commented Mar 5, 2025

Uh oh!

desertfire left a comment

Uh oh!

desertfire Mar 6, 2025

Uh oh!

desertfire Mar 6, 2025

Uh oh!

desertfire Mar 6, 2025

Uh oh!

github-actions bot commented Jul 1, 2025

Uh oh!

Uh oh!

		@@ -0,0 +1,259 @@
		torch/_higher_order_ops/utils.py: operator: OperatorBase, delayed_error: bool, args: Any, *kwargs: Any

Optimize AOTInductor: Caching, Reduced Decompositions, and Improved JSON Handling #148616

Are you sure you want to change the base?

Optimize AOTInductor: Caching, Reduced Decompositions, and Improved JSON Handling #148616

Uh oh!

Conversation

devsashidhar commented Mar 5, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Mar 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148616

✅ No Failures

Uh oh!

devsashidhar commented Mar 5, 2025

Uh oh!

desertfire left a comment

Choose a reason for hiding this comment

Uh oh!

desertfire Mar 6, 2025

Choose a reason for hiding this comment

Uh oh!

desertfire Mar 6, 2025

Choose a reason for hiding this comment

Uh oh!

desertfire Mar 6, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Jul 1, 2025

Uh oh!

Uh oh!

devsashidhar commented Mar 5, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Mar 5, 2025 •

edited

Loading