[Don't merge]Upgrade submodule oneDNN to v3.7 (#147498)(Z7) #148163

xuhancn · 2025-02-28T06:01:21Z

Fixes #ISSUE_NUMBER

cc @gujinghui @PenghuiCheng @XiaobingSuper @jianyuh @jgong5 @mingfeima @sanchitintel @ashokei @jingxu10 @min-jean-cho @yanbing-j @Guobing-Chen @Xia-Weiwen @snadampal @voznesenskym @penguinwu @EikanWang @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

This PR is to upgrade submodule oneDNN to v3.7. ## Improvements - Improved performance of convolution and matmul primitives on Intel Xeon processors with Intel AMX instruction set support (formerly Sapphire Rapids and Granite Rapids). - Improved performance of int8 and fp32 forward convolution primitive on processors with Intel AVX2 instruction set support. - Improved performance of fp8 matmul primitives with bf16 and fp16 bias data type on Intel Xeon processors with Intel AMX instruction set support (formerly Sapphire Rapids and Granite Rapids). - Introduced initial optimizations for Intel GPUs based on Xe3 architecture. - Added bfloat16 support for SDPA, implemented fp16 and bf16 gemm kernel in SDPA. - Fixed f16 matmul accuracy, the issue of SDPA cannot dispatched to ukernel, bf16/fp16/fp32 conv performance, INT8 Kernel trigger page fault, deconvolution precision issue on complex128 and fp64 and gemm correctness issue in float16 issues. - Improved bf16 matmul performance with fp32 destination with Arm Compute Library (ACL). - Improved bf16 to fp32 reorder performance. - Improved bf16 reorder performance. - Improved bf16 convolution with ACL. Fixes pytorch#136348. ## Validation results on CPU 1. NLP models accuracy/inference/training ![image](https://github.com/user-attachments/assets/859279b8-1631-4268-b226-7de9ac5870d8) ![image](https://github.com/user-attachments/assets/30ec7151-41ca-482a-9d2d-0c4850e75bab) 2. Torchbench cpu userbenchmark inference & training ![image](https://github.com/user-attachments/assets/71c9807c-caf9-4385-9990-d2ab637031cd) 3. Inductor quantization ![image](https://github.com/user-attachments/assets/3d2a3bd3-82fa-4566-8050-7ea5d6b61675) 4. Dynamo benchmarks ![image](https://github.com/user-attachments/assets/554ecce3-c85c-4a0e-88f1-2e73983c5dcd) ![image](https://github.com/user-attachments/assets/148c88f8-4367-4428-bb54-ce8a4deefd1b) ![image](https://github.com/user-attachments/assets/f2e744f4-d710-4699-acf4-1f130ecfadf1) ![image](https://github.com/user-attachments/assets/97128b80-4d0e-495a-aeda-dde3e70c96fd) ![image](https://github.com/user-attachments/assets/a9afce37-684c-45c0-b938-6dd7e0383805) ![image](https://github.com/user-attachments/assets/b8714236-9681-4fbe-8d98-be93deedab88) ![image](https://github.com/user-attachments/assets/4423061f-d133-45ba-98bd-d2f739e50431) ![image](https://github.com/user-attachments/assets/7955da10-3d23-493e-99fa-658f7f40035b) ## Validation results on XPU Accuracy is same as baseline. Performance is shown below. ![image](https://github.com/user-attachments/assets/7645304d-5b1d-43f9-b840-9f846ed380a0) ## Validation results on ARM ![image](https://github.com/user-attachments/assets/080f7c02-0238-436f-ad20-5a9e3f6aafbb) ![image](https://github.com/user-attachments/assets/443742aa-ca61-41de-ae80-5d4c65cd0c87) Pull Request resolved: pytorch#147498 Approved by: https://github.com/fadara01, https://github.com/mingfeima, https://github.com/atalman

pytorch-bot · 2025-02-28T06:01:27Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148163

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit 081d7ff with merge base b533bb4 ():

NEW FAILURE - The following job has failed:

macos-arm64-binary-wheel / wheel-py3_13t-cpu-build (gh)
No files were found with the provided path: /Users/runner/work/_temp/artifacts. No artifacts will be uploaded.

FLAKY - The following job failed but was likely due to flakiness present on trunk:

trunk / win-vs2022-cpu-py3 / test (default, 3, 3, lf.windows.4xlarge.nonephemeral) (gh) (similar failure)
'test/test_mkldnn.py::TestMkldnnCPU::test_mul_cpu'

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-04-30T13:39:26Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

yanbing-j and others added 3 commits February 26, 2025 10:15

genarate and package PDB to wheel.

df5447b

build PDB with Z7.

081d7ff

xuhancn added ciflow/binaries Trigger all binary build and upload jobs on the PR topic: not user facing topic category ci-no-td Do not run TD on this PR labels Feb 28, 2025

pytorch-bot bot added ciflow/inductor ciflow/linux-aarch64 linux aarch64 CI workflow module: inductor module: mkldnn Related to Intel IDEEP or oneDNN (a.k.a. mkldnn) integration labels Feb 28, 2025

pytorchbot added the open source label Feb 28, 2025

xuhancn added the ciflow/trunk Trigger trunk jobs on your pull request label Feb 28, 2025

github-actions bot added the Stale label Apr 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Don't merge]Upgrade submodule oneDNN to v3.7 (#147498)(Z7) #148163

[Don't merge]Upgrade submodule oneDNN to v3.7 (#147498)(Z7) #148163

xuhancn commented Feb 28, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Feb 28, 2025 •

edited

Loading

github-actions bot commented Apr 30, 2025

[Don't merge]Upgrade submodule oneDNN to v3.7 (#147498)(Z7) #148163

Are you sure you want to change the base?

[Don't merge]Upgrade submodule oneDNN to v3.7 (#147498)(Z7) #148163

Conversation

xuhancn commented Feb 28, 2025 • edited by pytorch-bot bot Loading

pytorch-bot bot commented Feb 28, 2025 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/148163

❌ 1 New Failure, 1 Unrelated Failure

github-actions bot commented Apr 30, 2025

xuhancn commented Feb 28, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Feb 28, 2025 •

edited

Loading