Thanks to visit codestin.com
Credit goes to github.com

Skip to content

GPTQ / ExLlamaV2 (EXL2) quantisation #4165

Closed
@0xdevalias

Description

@0xdevalias

Feature Description

Please provide a detailed written description of what you were trying to do, and what you expected llama.cpp to do as an enhancement.

Motivation

It sounds like it's a fast/useful quantisation method:

Possible Implementation

N/A

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions