Conversation


@Aznix07 Aznix07 commented Nov 4, 2025

What does this PR do?

Fixes the issue where autocast_adapter_dtype=False was being ignored when using quantized models with BitsAndBytes.

Fixes #2889

Problem

When a model is quantized using BitsAndBytes (e.g., 4-bit quantization), LoRA adapters were always initialized with float32 dtype, even when:

  • autocast_adapter_dtype=False was explicitly specified
  • The model's compute dtype was set to float16

This silently overrode the user's setting and could lead to unnecessary memory use and extra dtype casts at runtime.
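
A minimal repro sketch of the behavior, assuming a small causal LM (facebook/opt-125m used here purely as a placeholder) with q_proj/v_proj modules, a CUDA device, and bitsandbytes installed:

```python
# Minimal repro sketch; model id and target modules are placeholders.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

bnb_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16)
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-125m", quantization_config=bnb_config, device_map="auto"
)

peft_model = get_peft_model(
    model,
    LoraConfig(r=8, target_modules=["q_proj", "v_proj"]),
    autocast_adapter_dtype=False,  # explicitly disabled, but ignored before the fix
)

for name, param in peft_model.named_parameters():
    if "lora_A" in name:
        # Before the fix this prints torch.float32 despite compute_dtype=float16.
        print(name, param.dtype)
        break
```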

Solution

Added a _get_weight_dtype() helper method to the LoraLayer class (a minimal sketch follows this list) that:

  1. Checks for a compute_dtype attribute (present on BitsAndBytes quantized layers)
  2. Falls back to weight.dtype for regular layers
  3. Uses the resulting dtype when creating the lora_A and lora_B Linear layers in update_layer()
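
A minimal sketch of that helper, assuming it lives on LoraLayer and relies on the existing get_base_layer() accessor plus the compute_dtype attribute exposed by BitsAndBytes layers; the exact names and structure in the PR diff may differ:

```python
# Hedged sketch of the helper described above; everything except compute_dtype
# and get_base_layer() is illustrative rather than the exact PR diff.
import torch


def _get_weight_dtype(self) -> torch.dtype:
    """Return the dtype that newly created LoRA weights should use."""
    base_layer = self.get_base_layer()
    # BitsAndBytes layers (e.g. bnb.nn.Linear4bit) expose compute_dtype;
    # matching it keeps the adapters in the quantized layer's compute dtype.
    compute_dtype = getattr(base_layer, "compute_dtype", None)
    if compute_dtype is not None:
        return compute_dtype
    # Regular layers: fall back to the dtype of the base weight.
    return base_layer.weight.dtype


# Inside update_layer(), the result is then used when creating the adapter
# matrices, roughly like (simplified):
#   dtype = self._get_weight_dtype()
#   self.lora_A[adapter_name] = torch.nn.Linear(self.in_features, r, bias=False, dtype=dtype)
#   self.lora_B[adapter_name] = torch.nn.Linear(r, self.out_features, bias=False, dtype=dtype)
```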

Testing

✅ Verified the fix works correctly with a custom test script (sketched below):

  • Quantized model (4-bit with compute_dtype=float16) -> LoRA params are float16
  • Non-quantized model (dtype=float16) -> LoRA params are float16
  • Default behavior (autocast_adapter_dtype=True) still works as expected

✅ Ran 131 LoRA config tests locally - all passed
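
A hedged sketch of the kind of dtype check the custom test script performs; the model id and target modules are assumptions, and the quantized case needs a CUDA device with bitsandbytes installed:

```python
import pytest
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

MODEL_ID = "facebook/opt-125m"  # assumption: any small model with q_proj/v_proj


@pytest.mark.parametrize("quantized", [True, False])
def test_lora_params_follow_compute_dtype(quantized):
    if quantized:
        # 4-bit quantized base model with compute_dtype=float16
        quant_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16)
        model = AutoModelForCausalLM.from_pretrained(
            MODEL_ID, quantization_config=quant_config, device_map="auto"
        )
    else:
        # Non-quantized base model loaded directly in float16
        model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=torch.float16)

    config = LoraConfig(r=8, target_modules=["q_proj", "v_proj"])
    peft_model = get_peft_model(model, config, autocast_adapter_dtype=False)

    lora_dtypes = {p.dtype for n, p in peft_model.named_parameters() if "lora_" in n}
    # With the fix, the adapters follow the compute/weight dtype instead of float32.
    assert lora_dtypes == {torch.float16}
```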

@BenjaminBossan
Member

Thanks for proposing this fix @Aznix07. However, applying this broadly requires a lot more changes. I have worked on those in #2893. I think this PR can be closed. Still, your contribution is appreciated.
