Thanks to visit codestin.com
Credit goes to Github.com

Skip to content

[RFC] What is missing from ONNX's representation of quantized models? #7435

@justinchuby

Description

@justinchuby

What is missing from the QDQ representation? Block quantize axes? 2bit weights? Quantized attention?

@onnx/sig-operators @onnx/sig-optimizations

Metadata

Metadata

Assignees

No one assigned

    Labels

    rfcRequest for Commentstopic: operatorIssues related to ONNX operators

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions