cpp_wrapper: Use runtime dispatched fallbacks for complex ops #143223
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/143223
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 218a34c with merge base bb5e439.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
Removes 4 fallback ops that are currently not possible to codegen; removing them does not break ABI-compatibility.

1. `_cudnn_rnn_backward` and `_histogramdd_bin_edges` both return `Tensor[]`, which we cannot codegen with the current design (see the quick illustration below).
2. `_sparse_coo_tensor_with_dims_and_tensors` only supplies a Sparse operator, which we don't support.
3. `zeros.names` requires a `Dimname` input, which we can't currently codegen.

Removing these ops from the list will improve test performance, since the fallback op generation will use the Python proxy executor instead of calling non-existent C functions.

Pull Request resolved: #143421
Approved by: https://github.com/desertfire
ghstack dependencies: #141371, #143223
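For context on item 1, a quick illustration of why `Tensor[]` returns are awkward for the C shim, using the public `torch.histogramdd` API: the bin edges come back as a variable-length sequence of tensors rather than a fixed number of outputs.

```python
import torch

x = torch.randn(100, 2)
hist, bin_edges = torch.histogramdd(x, bins=[3, 4])
print(hist.shape)                     # torch.Size([3, 4])
print(len(bin_edges))                 # 2: one edges tensor per input dimension
print([e.shape for e in bin_edges])   # [torch.Size([4]), torch.Size([5])]
```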
Since #143223 enabled runtime dispatch for fft_c2c in AOTI mode, fft_c2c, which has no XPU implementation, can now fall back to CPU for XPU, and the test case passes.

Pull Request resolved: #144238
Approved by: https://github.com/jansel
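A minimal repro-style sketch (not the exact test from these PRs) of the class of case this enables: with `cpp_wrapper` turned on, `torch.fft.fft` on a complex input lowers to a fallback of the complex-only `_fft_c2c` op, which now goes through the runtime dispatcher. The config flag and call pattern below are standard inductor/`torch.compile` usage; exact behavior depends on the installed PyTorch build.

```python
import torch
import torch._inductor.config as inductor_config

# Assumption for this sketch: enable the C++ wrapper codegen globally.
inductor_config.cpp_wrapper = True

@torch.compile
def f(x):
    return torch.fft.fft(x)

x = torch.randn(8, dtype=torch.complex64)
# The compiled result should match eager execution.
print(torch.allclose(f(x), torch.fft.fft(x)))
```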
Stack from ghstack (oldest at bottom):
When calling a fallback op in cpp_wrapper mode where any of the inputs are complex numbers, utilize the runtime-dispatched fallback. This properly handles the Conjugate and Negative dispatch keys, if present, in exchange for a performance pessimization in complex arithmetic.
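A minimal sketch of the selection logic, with a hypothetical helper name (the actual decision lives in inductor's cpp_wrapper fallback-kernel codegen):

```python
import torch

def _needs_runtime_dispatch(args) -> bool:
    # Hypothetical helper illustrating the idea from this PR: if any tensor
    # argument to a fallback op is complex, route the call through the
    # runtime-dispatched fallback rather than a direct C-shim call, so that
    # dispatch keys such as Conjugate and Negative are honored.
    return any(
        isinstance(a, torch.Tensor) and a.is_complex() for a in args
    )
```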
This PR additionally fixes some cascading failure modes that this change exposed in our aot_inductor tests.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov
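To illustrate why the Conjugate dispatch key mentioned in the description matters: for complex tensors, `conj()` is a lazy view that only sets a bit on the tensor, and the dispatcher is what resolves it when an op runs. A small example using standard PyTorch APIs:

```python
import torch

x = torch.randn(4, dtype=torch.complex64)
y = x.conj()               # lazy view: sets the Conjugate dispatch key bit
print(y.is_conj())         # True
z = y.resolve_conj()       # materializes the conjugation into new storage
print(z.is_conj())         # False
# Ops routed through the dispatcher see the same (conjugated) values either way.
print(torch.allclose(y * 2, z * 2))
```

A fallback call that bypasses runtime dispatch could operate on the unconjugated storage and silently return wrong results, which is why complex inputs now take the runtime-dispatched path.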