Fixing aliasing behavior for slice in AQT TensorCoreTiledLayout #2174

jerryzh168 · 2025-05-05T22:48:57Z

Summary:
The slice op is supposed to preserve aliasing (the output of a slice should alias its input), but this does not hold for TensorCoreTiledLayout (used by int4wo) or for some other layouts such as gemlite.

The reason is that slicing currently does unpacking, padding and prepacking, which creates new tensors.

This PR fixes it by slicing the packed inner tensors directly, specifically packed_weight and scale_and_zero in TensorCoreTiledLayout.
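
To illustrate why the unpack/repack path breaks aliasing while slicing the packed storage preserves it, here is a minimal sketch in plain Python. It is not the torchao implementation: the helpers `pack_int4`, `unpack_int4`, `slice_via_repack` and `slice_packed` are hypothetical names, and the two-nibbles-per-byte layout is an assumption that is much simpler than the real tensor core tiled format.

```python
def pack_int4(values):
    """Pack pairs of 4-bit values (0..15) into bytes, low nibble first."""
    assert len(values) % 2 == 0
    return bytearray(lo | (hi << 4) for lo, hi in zip(values[0::2], values[1::2]))


def unpack_int4(packed):
    """Expand every byte back into its two 4-bit values."""
    out = []
    for b in packed:
        out.append(b & 0xF)
        out.append(b >> 4)
    return out


def slice_via_repack(packed, start, end):
    """Unpack, slice, repack: the result lives in a NEW buffer (no aliasing)."""
    return pack_int4(unpack_int4(packed)[start:end])


def slice_packed(packed, start, end):
    """Slice the packed storage directly: the result is a view of `packed`."""
    # Logical indices must land on byte boundaries in this toy layout.
    assert start % 2 == 0 and end % 2 == 0
    return memoryview(packed)[start // 2 : end // 2]


weights = [1, 2, 3, 4, 5, 6, 7, 8]
packed = pack_int4(weights)

copy_slice = slice_via_repack(packed, 2, 6)
view_slice = slice_packed(packed, 2, 6)

# Both paths agree on the logical content of the slice.
same_content = unpack_int4(copy_slice) == unpack_int4(view_slice) == [3, 4, 5, 6]

# Mutate the parent buffer: only the view observes the change.
packed[1] = 0xFF
view_sees_write = unpack_int4(view_slice)[:2] == [15, 15]
copy_is_stale = unpack_int4(copy_slice)[:2] == [3, 4]
```

In the tensor setting, the analogous check would compare the storage pointers of the sliced and original inner tensors, which is presumably what the "add check for data_ptr" commit in this PR covers.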

Test Plan:
python test/dtypes/test_affine_quantized.py -k test_slice_and_copy_int4wo

Reviewers:

Subscribers:

Tasks:

Tags:

pytorch-bot · 2025-05-05T22:49:00Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2174

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

CI workflows being skipped on PR

⏳ No Failures, 6 Pending

As of commit 2fb26de with merge base 94e2e05:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[reland] Fixing aliasing behavior for slice in AQT TensorCoreTiledLayout (#2174)

Commits included in the reland:

* simplify code
* add check for data_ptr
* format
* avoid div by zero
* format
* fix shape

facebook-github-bot added the CLA Signed label May 5, 2025

jerryzh168 requested review from bdhirsh, drisspg and mobicham May 5, 2025 22:49

jerryzh168 added the topic: bug fix label May 5, 2025

simplify code

801d76f

drisspg reviewed May 5, 2025

test/dtypes/test_affine_quantized.py (resolved)

jerryzh168 added 4 commits May 5, 2025 16:02

add check for data_ptr

d06f1a1

format

df064d0

avoid div by zero

9d70683

format

2fb26de

drisspg approved these changes May 6, 2025

jerryzh168 merged commit 95119bb into main May 6, 2025
14 of 18 checks passed

jerryzh168 added a commit that referenced this pull request May 6, 2025

Revert "Fixing aliasing behavior for slice in AQT TensorCoreTiledLayout (#2174)"

e8139cb

This reverts commit 95119bb.

jerryzh168 mentioned this pull request May 6, 2025

Revert "Fixing aliasing behavior for slice in AQT TensorCoreTiledLayout" #2175

Merged

jerryzh168 added a commit that referenced this pull request May 6, 2025

Revert "Fixing aliasing behavior for slice in AQT TensorCoreTiledLayout" (#2175)

a83636d

Revert "Fixing aliasing behavior for slice in AQT TensorCoreTiledLayout (#2174)"

This reverts commit 95119bb.

jerryzh168 mentioned this pull request May 6, 2025

[reland] Fixing aliasing behavior for slice in AQT int4wo layout #2176

Merged

jerryzh168 mentioned this pull request May 28, 2025

int4_weight_only get plain weight are padded #2249

Open

liangel-02 pushed a commit that referenced this pull request Aug 25, 2025

Revert "Fixing aliasing behavior for slice in AQT TensorCoreTiledLayout" (#2175)

da8c7cd

Revert "Fixing aliasing behavior for slice in AQT TensorCoreTiledLayout (#2174)"

This reverts commit 95119bb.
