Add all fbgemm kernel Tensors into Int4WeightOnlyConfig and Float8DynamicActivationInt4WeightConfig #2474

jerryzh168 · 2025-07-02T01:58:39Z

Stacked PRs:

Support optional_tensor_names in TorchAOBaseTensor #2710
Align Int4Tensor implementation details with the design of Float8Tensor #2687
->Add all fbgemm kernel Tensors into Int4WeightOnlyConfig and Float8DynamicActivationInt4WeightConfig #2474
Check numerical equivalence / closeness between different kernel preferences #2651


Summary:

* FbgemmConfig will be deprecated later, since it is specific to a single kernel.
* We'd like to categorize tensors by derived dtype + packing format, e.g. int4 preshuffled, float8 plain.
* Added PackingFormat, with preshuffled and plain options, to Version 2 of Int4WeightOnlyConfig; the older AQT tensor remains in Version 1.
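
To illustrate the "derived dtype + packing format" categorization, here is a minimal, self-contained sketch of how a versioned config could select a tensor type. It does not import torchao and is not the code from this PR. The class `Int4WeightOnlyConfigSketch`, its field names, and the helper `select_tensor_kind` are hypothetical stand-ins. Only the names `PackingFormat`, `Int4Tensor`, `Int4PreshuffledTensor`, and the Version 1 / Version 2 split come from this thread.

```python
from dataclasses import dataclass
from enum import Enum


class PackingFormat(str, Enum):
    """Stand-in for the packing formats named in the summary (assumed values)."""

    PLAIN = "plain"
    PRESHUFFLED = "preshuffled"


@dataclass
class Int4WeightOnlyConfigSketch:
    """Hypothetical config; field names are assumptions, not torchao's API."""

    group_size: int = 128
    packing_format: PackingFormat = PackingFormat.PLAIN
    version: int = 1


def select_tensor_kind(config: Int4WeightOnlyConfigSketch) -> str:
    """Return the name of the tensor type a config would map to in this sketch."""
    if config.version == 1:
        # Version 1 keeps the older AQT (AffineQuantizedTensor) path.
        return "AffineQuantizedTensor"
    if config.version == 2:
        # Version 2 dispatches on the packing format of the int4 weight.
        by_format = {
            PackingFormat.PLAIN: "Int4Tensor",
            PackingFormat.PRESHUFFLED: "Int4PreshuffledTensor",
        }
        return by_format[PackingFormat(config.packing_format)]
    raise ValueError(f"Unsupported version: {config.version}")


if __name__ == "__main__":
    for cfg in (
        Int4WeightOnlyConfigSketch(),
        Int4WeightOnlyConfigSketch(version=2),
        Int4WeightOnlyConfigSketch(
            packing_format=PackingFormat.PRESHUFFLED, version=2
        ),
    ):
        print(cfg, "->", select_tensor_kind(cfg))
```

In this sketch the default config stays on the Version 1 path, so existing callers would see no change unless they opt into Version 2 and pick a packing format.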

Test Plan:
python test/quantization/quantize_/workflows/int4/test_int4_tensor.py
python test/quantization/quantize_/workflows/int4/test_int4_preshuffled_tensor.py
python test/quantization/quantize_/workflows/float8/test_float8_tensor.py


…amicActivationInt4WeightConfig Summary: we will * deprecate FbgemmConfig since it's a single kernel (later). * we'd like to categorize things to derived dtype + packed format, e.g. int4 preshuffled, float8 plain * Added PackingFormat that has preshuffled, plain in Version 2 of Int4WeightOnlyConfig, the older AQT tensor will remain in Version 1 Test Plan: python test/quantization/quantize_/workflows/int4/test_int4_tensor.py python test/quantization/quantize_/workflows/int4/test_int4_preshuffled_tensor.py python test/quantization/quantize_/workflows/float8/test_float8_tensor.py Reviewers: Subscribers: Tasks: Tags: stack-info: PR: #2474, branch: jerryzh168/stack/10

pytorch-bot · 2025-07-02T01:58:43Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2474

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 5bb2fd4 with merge base 1114ca0:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…micActivationInt4WeightConfig Summary: att, we will deprecate FbgemmConfig since it's a single kernel. we'd like to categorize things to derived dtype + packed format Test Plan: python test/quantization/quantize_/test_int4_groupwise_preshuffle.py Reviewers: Subscribers: Tasks: Tags: stack-info: PR: #2474, branch: jerryzh168/stack/10



jerryzh168 force-pushed the jerryzh168/stack/10 branch from a3d0835 to 4b0c7c7 Compare July 2, 2025 01:58

This was referenced Jul 2, 2025

Add support for Int4GroupwisePreshuffleTensor for fbgemm #2421

Merged

Remove transpose_input from fbgemm configs #2422

Merged

Add support for float8 activation for Int4PreshuffledTensor #2437

Merged

Add Float8Tensor #2463

Merged

facebook-github-bot added the CLA Signed label Jul 2, 2025

jerryzh168 added the topic: new feature label Jul 2, 2025

jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 2, 2025 20:35

jerryzh168 force-pushed the jerryzh168/stack/10 branch from 4b0c7c7 to f5977ce Compare July 2, 2025 20:36

jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 2, 2025 20:36

jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 2, 2025 21:42

jerryzh168 force-pushed the jerryzh168/stack/10 branch 2 times, most recently from 04ce2c5 to afd8703 Compare July 2, 2025 21:42

jerryzh168 mentioned this pull request Jul 2, 2025

Rename torchao.float8.Float8Tensor to torchao.float8.Float8TrainingTensor #2479

Merged

jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 2, 2025 21:42

jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 2, 2025 23:44

jerryzh168 force-pushed the jerryzh168/stack/10 branch from afd8703 to ff4682e Compare July 2, 2025 23:44

jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 2, 2025 23:44

jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 3, 2025 00:09

jerryzh168 force-pushed the jerryzh168/stack/10 branch from ff4682e to 58f8a2a Compare July 3, 2025 00:09

jerryzh168 changed the base branch from main to jerryzh168/stack/9 July 3, 2025 00:09

jerryzh168 changed the base branch from jerryzh168/stack/9 to main July 3, 2025 02:18

jerryzh168 changed the base branch from jerryzh168/stack/15 to main August 6, 2025 22:16

jerryzh168 force-pushed the jerryzh168/stack/10 branch from 1cb0ee1 to c41527a Compare August 6, 2025 22:16

jerryzh168 changed the base branch from main to jerryzh168/stack/15 August 6, 2025 22:16

jerryzh168 changed the base branch from jerryzh168/stack/15 to main August 6, 2025 23:27

jerryzh168 force-pushed the jerryzh168/stack/10 branch from c41527a to 89b3fac Compare August 6, 2025 23:27

jerryzh168 changed the base branch from main to jerryzh168/stack/15 August 6, 2025 23:27

jerryzh168 changed the base branch from jerryzh168/stack/15 to main August 7, 2025 02:57

jerryzh168 force-pushed the jerryzh168/stack/10 branch from 89b3fac to 7868bcf Compare August 7, 2025 02:57

jerryzh168 changed the base branch from main to jerryzh168/stack/15 August 7, 2025 02:57

jerryzh168 mentioned this pull request Aug 7, 2025

Support optional_tensor_names in TorchAOBaseTensor #2710

Merged

jerryzh168 force-pushed the jerryzh168/stack/15 branch from c7f8ff0 to 847259b Compare August 7, 2025 02:58

jerryzh168 force-pushed the jerryzh168/stack/10 branch from 7868bcf to ceac84c Compare August 7, 2025 02:58

jerryzh168 changed the base branch from jerryzh168/stack/15 to main August 7, 2025 03:37

jerryzh168 force-pushed the jerryzh168/stack/10 branch from ceac84c to 36840f0 Compare August 7, 2025 03:37

jerryzh168 changed the base branch from main to jerryzh168/stack/15 August 7, 2025 03:37

jerryzh168 changed the base branch from jerryzh168/stack/15 to main August 7, 2025 03:51

jerryzh168 force-pushed the jerryzh168/stack/10 branch from 36840f0 to 5ad0522 Compare August 7, 2025 03:51

jerryzh168 changed the base branch from main to jerryzh168/stack/15 August 7, 2025 03:51

jerryzh168 changed the base branch from jerryzh168/stack/15 to main August 7, 2025 04:29

jerryzh168 force-pushed the jerryzh168/stack/10 branch from 5ad0522 to 497c62a Compare August 7, 2025 04:29

jerryzh168 changed the base branch from main to jerryzh168/stack/15 August 7, 2025 04:29

jerryzh168 changed the base branch from jerryzh168/stack/15 to main August 7, 2025 20:56

jerryzh168 force-pushed the jerryzh168/stack/10 branch from 497c62a to 8fb7215 Compare August 7, 2025 20:56

jerryzh168 changed the base branch from main to jerryzh168/stack/15 August 7, 2025 20:56

jerryzh168 force-pushed the jerryzh168/stack/10 branch from 8fb7215 to 5bb2fd4 Compare August 7, 2025 23:08

jerryzh168 changed the base branch from jerryzh168/stack/15 to main August 7, 2025 23:08

jerryzh168 merged commit bfe34b5 into main Aug 7, 2025
8 checks passed

namgyu-youn mentioned this pull request Aug 10, 2025

replace FbgemmConfig with Int4WeightOnlyConfig #2727

Closed


3 participants
