quantize: Use UINT32 if there's an INT KV override #14197
Merged
When quantising a model and overriding integer metadata parameters at the same time, the overridden keys are written with type `int` even though their original type is `unsigned int`. In certain cases this behaviour triggers an exception when loading the quantised model.

For example, using the model available here:

```
llama-quantize --override-kv qwen3moe.expert_used_count=int:16 Qwen3-30B-A3B-BF16.gguf Qwen3-30B-A3B-Q4_K_M.gguf Q4_K_M
llama-simple -m Qwen3-30B-A3B-Q4_K_M.gguf "Hello, world!"
```

will lead to the following exception when loading the quantised model:

```
error loading model hyperparameters: key qwen3moe.expert_used_count has wrong type i32 but expected type u32
```
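The failure itself comes from the loader's strict type check on GGUF metadata: a key stored as `INT32` does not satisfy a `UINT32` read. The snippet below is not the actual llama.cpp loader code, just a minimal standalone sketch using the public gguf C API that reproduces the same kind of check:

```cpp
// Sketch only: illustrates why a key written as i32 is rejected when the
// loader expects u32. Uses the public gguf C API from ggml (gguf.h).
#include <cstdio>
#include "gguf.h"

int main(int argc, char ** argv) {
    if (argc < 2) {
        fprintf(stderr, "usage: %s model.gguf\n", argv[0]);
        return 1;
    }

    struct gguf_init_params params = { /*no_alloc =*/ true, /*ctx =*/ nullptr };
    struct gguf_context * ctx = gguf_init_from_file(argv[1], params);
    if (!ctx) {
        fprintf(stderr, "failed to open %s\n", argv[1]);
        return 1;
    }

    const char * key = "qwen3moe.expert_used_count";
    const int64_t id = gguf_find_key(ctx, key);
    if (id >= 0 && gguf_get_kv_type(ctx, id) != GGUF_TYPE_UINT32) {
        // This is the condition an --override-kv ...=int:16 override trips:
        // the value was stored as INT32, but the loader expects UINT32.
        fprintf(stderr, "key %s has wrong type (expected u32)\n", key);
    }

    gguf_free(ctx);
    return 0;
}
```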
This PR changes the `if (params->kv_overrides)` logic in llama-quant.cpp to use `uint32` if there are any `int` overrides, so that

```
llama-quantize --override-kv qwen3moe.expert_used_count=int:16 Qwen3-30B-A3B-BF16.gguf Qwen3-30B-A3B-Q4_K_M.gguf Q4_K_M
```

generates a functioning model.

More context here.
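A minimal sketch of the idea, assuming the KV-override dispatch in llama-quant.cpp follows the pattern below (the helper name `apply_kv_overrides` is made up for illustration; in llama.cpp this logic lives inline in the quantisation routine):

```cpp
// Sketch of the KV-override write path affected by this PR. The key point is
// that integer overrides are written as UINT32 rather than INT32, matching
// how integer hyperparameters such as *.expert_used_count are declared in GGUF.
#include <cstdint>
#include <vector>

#include "llama.h"   // llama_model_kv_override, LLAMA_KV_OVERRIDE_TYPE_*
#include "gguf.h"    // gguf_set_val_*

static void apply_kv_overrides(struct gguf_context * ctx_out,
                               const std::vector<llama_model_kv_override> & overrides) {
    for (const auto & o : overrides) {
        if (o.key[0] == 0) break;  // end-of-list sentinel
        switch (o.tag) {
            case LLAMA_KV_OVERRIDE_TYPE_FLOAT:
                gguf_set_val_f32(ctx_out, o.key, (float) o.val_f64);
                break;
            case LLAMA_KV_OVERRIDE_TYPE_INT:
                // Previously written with gguf_set_val_i32, which produced the
                // i32/u32 mismatch at load time; write the value as u32 instead.
                gguf_set_val_u32(ctx_out, o.key, (uint32_t) o.val_i64);
                break;
            case LLAMA_KV_OVERRIDE_TYPE_BOOL:
                gguf_set_val_bool(ctx_out, o.key, o.val_bool);
                break;
            case LLAMA_KV_OVERRIDE_TYPE_STR:
                gguf_set_val_str(ctx_out, o.key, o.val_str);
                break;
        }
    }
}
```

Note the cast from the override's `int64_t` value to `uint32_t`: overrides parsed as `int:` are stored as signed 64-bit values, so writing them as u32 assumes they fit in an unsigned 32-bit range, which holds for hyperparameters like `expert_used_count`.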