Add Float8Tensor #2463
Summary:
Splits out the float8 rowwise quantized path (both activation and weight) of AQT into Float8RowwiseTensor.
Next: could potentially incorporate the per-tensor activation path there as well.
Next: we can split the per-tensor weight path into another Tensor as well, so we can deprecate the AQT path for float8.
Test Plan:
python test/dtypes/test_affine_quantized_float.py
python test/quantization/quantize_/test_float8_rowwise_tensor.py
Reviewers:
Subscribers:
Tasks:
Tags:
stack-info: PR: #2463, branch: jerryzh168/stack/9
Summary:
* Added Float8Tensor that uses fbgemm kernels and scaled_mm (a minimal sketch of the underlying scaled_mm call follows this list):
  * per-row activation + per-row weight linear, calling the torch._scaled_mm op (for compatibility with SM 8.9)
  * per-tensor activation + per-tensor weight quantized linear, calling the torch._scaled_mm op (for compatibility with SM 8.9)
  * per-row activation + per-row weight bmm, calling the torch.ops.fbgemm.f8f8bf16_rowwise_batched kernel (only works for SM 9.0+); can switch to batched scaled mm from torch when it's supported: pytorch/pytorch#157950
* dynamic quantization kwargs are added to Float8Tensor directly
* Added QuantizeTensorKwargs and QuantizeTensorToFloat8Kwargs to store keyword args for Float8Tensor.to_float8
* Updated Float8DynamicActivationFloat8WeightConfig and Float8WeightOnlyConfig to use Float8Tensor
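The per-row linear path above essentially lowers to a single torch._scaled_mm call on float8_e4m3fn data with row-wise scales. The snippet below is a minimal sketch of that call, not the code added in this PR; it assumes a recent PyTorch with row-wise scaled_mm support and an SM 8.9+ GPU, and the quant_rowwise helper is made up for illustration.

```python
# Minimal sketch (not the exact code in this PR) of the per-row float8 matmul
# that the Float8Tensor linear path dispatches to via torch._scaled_mm.
import torch

def quant_rowwise(t: torch.Tensor):
    # Per-row absmax scaling into float8_e4m3fn; helper name is illustrative.
    amax = t.abs().amax(dim=-1, keepdim=True).clamp(min=1e-12)
    scale = amax.float() / torch.finfo(torch.float8_e4m3fn).max
    return (t / scale).to(torch.float8_e4m3fn), scale

M, K, N = 16, 64, 32
x = torch.randn(M, K, device="cuda", dtype=torch.bfloat16)   # activation
w = torch.randn(N, K, device="cuda", dtype=torch.bfloat16)   # linear weight

x_fp8, x_scale = quant_rowwise(x)   # x_scale: (M, 1)
w_fp8, w_scale = quant_rowwise(w)   # w_scale: (N, 1)

# mat2 must be column-major (hence the transpose of the row-major weight);
# row-wise scales have shapes (M, 1) for mat1 and (1, N) for mat2.
y = torch._scaled_mm(
    x_fp8,
    w_fp8.t(),
    scale_a=x_scale,
    scale_b=w_scale.t(),
    out_dtype=torch.bfloat16,
)  # y: (M, N) in bfloat16
```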
Test Plan:
python test/dtypes/test_affine_quantized_float.py
python test/quantization/quantize_/workflows/float8/test_float8_tensor.py
Reviewers:
Subscribers:
Tasks:
Tags:
stack-info: PR: #2463, branch: jerryzh168/stack/9
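For reference, a hedged end-to-end usage sketch of the updated configs mentioned in the summary; the import paths and the granularity argument follow current torchao conventions and may differ slightly from the merged API.

```python
# Usage sketch of the configs this PR moves onto Float8Tensor; check exact
# names/arguments against the merged torchao API.
import torch
from torchao.quantization import (
    quantize_,
    Float8DynamicActivationFloat8WeightConfig,
    Float8WeightOnlyConfig,
    PerRow,
)

model = torch.nn.Sequential(torch.nn.Linear(64, 128)).cuda().to(torch.bfloat16)

# Dynamic per-row float8 activation + per-row float8 weight (torch._scaled_mm path).
quantize_(model, Float8DynamicActivationFloat8WeightConfig(granularity=PerRow()))

# Weight-only float8 would instead be:
# quantize_(model, Float8WeightOnlyConfig())

x = torch.randn(4, 64, device="cuda", dtype=torch.bfloat16)
out = model(x)  # the linear weight is now a Float8Tensor subclass
```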
Summary:
We have recently updated our design for structuring tensor subclasses in torchao to remove unnecessary abstractions, reduce indirection, and provide a structure that aligns better with people's intuitive understanding of different quantization use cases. Examples using the new design: #2463, #2687.
Test Plan:
check generated doc
Reviewers:
Subscribers:
Tasks:
Tags: