Conversation

@hqsiswiliam

Feature request

This request proposes integrating HiRA (Hadamard High-Rank Adaptation), as described in the ICLR 2025 oral paper (https://openreview.net/pdf?id=TwJrTz9cRS, https://iclr.cc/virtual/2025/oral/31839) and implemented in the hqsiswiliam/hira repository, into the core PEFT library. This would let users apply HiRA through the familiar get_peft_model API and benefit from its high-rank updates without any additional inference overhead.
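As a rough sketch of the intended usage (HiraConfig and its fields are illustrative here, mirroring LoraConfig; the final API may differ):

from transformers import AutoModelForCausalLM
from peft import get_peft_model
from peft import HiraConfig  # proposed config class, named by analogy with LoraConfig

base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
# r is the rank of the low-rank factors A and B; target_modules selects the layers to adapt
config = HiraConfig(r=32, target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM")
model = get_peft_model(base_model, config)
model.print_trainable_parameters()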

Motivation

General Motivation

PEFT methods like LoRA achieve parameter-efficient fine-tuning by injecting low-rank updates into pre-trained weights. While effective, purely low-rank adaptation can struggle to capture complex patterns in large language models.

1. Expressiveness grows with the rank

Empirically, increasing the LoRA rank in LLM training yields better downstream performance:

LoRA performance vs. rank
Higher LoRA rank correlates with improved task accuracy.

2. HiRA: Hadamard high-rank updates without extra parameters

HiRA sidesteps the expressiveness constraint by computing a Hadamard-enhanced update:

$$ \Delta W = W_0 \odot (A B) $$

HiRA update formula
HiRA uses the Hadamard product to inject high-rank structure into the frozen weight matrix $W_0$ via the low-rank matrices $A$ and $B$.
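For intuition, here is a minimal PyTorch sketch of the update (variable names and shapes are illustrative, not taken from the paper's code):

import torch

d_out, d_in, r = 64, 64, 4                 # layer dimensions and a small LoRA-style rank
W0 = torch.randn(d_out, d_in)              # frozen pre-trained weight
A = torch.randn(r, d_in)                   # trainable low-rank factor
B = torch.randn(d_out, r)                  # trainable low-rank factor

low_rank = B @ A                           # rank at most r, exactly as in LoRA
delta_W = W0 * low_rank                    # Hadamard product with W0 lifts the rank of the update
print(torch.linalg.matrix_rank(low_rank))  # at most 4
print(torch.linalg.matrix_rank(delta_W))   # typically close to 64
W_adapted = W0 + delta_W                   # effective weight; can be merged, hence no inference overhead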

3. Singular-value patterns

After training, HiRA exhibits a rich singular-value spectrum, akin to full fine-tuning (FFT), indicating that it can model complex transformations without the corresponding computational overhead:

Singular-value pattern comparison
HiRA’s singular-value distribution closely mirrors that of FFT.

4. Performance gains

Across commonsense reasoning benchmarks, HiRA outperforms LoRA and other PEFT baselines:

Commonsense reasoning performance
HiRA delivers notable accuracy improvements over baseline adapters.

5. No extra parameter or compute cost

Despite its high-rank behaviour, HiRA introduces no additional trainable parameters compared to LoRA:

Resource comparison: LoRA vs. HiRA
HiRA matches LoRA’s GPU memory usage and training hours.

6. Complementary with LoRA (HiLoRA)

Combining HiRA and LoRA into a hybrid “HiLoRA” setup yields even stronger results than either method alone:

HiLoRA concept
HiLoRA performance gains
HiLoRA leverages both low-rank and Hadamard high-rank updates for better expressiveness.
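Read from the figures, the combined update presumably takes the form

$$ \Delta W = W_0 \odot (A_1 B_1) + A_2 B_2 $$

with a Hadamard high-rank (HiRA) branch and a standard additive low-rank (LoRA) branch; the exact parameterisation is given in the paper.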


By integrating HiRA into PEFT, users gain richer adaptation capability without sacrificing the parameter efficiency and usability that PEFT provides.

Your contribution

We would be pleased to submit a pull request integrating the HiRA implementation into the PEFT framework. We welcome suggestions for alternative integration approaches and appreciate any guidance on best practices.

@BenjaminBossan BenjaminBossan (Member) left a comment

Thanks for this PR to add HiRA to PEFT. The method looks promising and the provided code is already quite mature.

When I started reading the paper, I was at first reminded of FedPara, aka LoHa, which is already integrated into PEFT, as that method also relies on the Hadamard product. However, IIUC, the two methods are still distinct: HiRA basically corresponds to LoRA, but instead of adding dW, we multiply it. In that way, it is much closer to LoRA than to LoHa. Still, I wanted to flag this, as I'm not sure you are aware of it (your paper doesn't seem to reference FedPara).
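For comparison, LoHa parameterises the update as the Hadamard product of two learned low-rank matrices, roughly $\Delta W = (A_1 B_1) \odot (A_2 B_2)$, whereas HiRA modulates a single learned low-rank term with the frozen weight, $\Delta W = W_0 \odot (A B)$.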

At the moment, I haven't done a full in-depth review, but I think that makes more sense once we have completed the next step.

I noticed that you have formatted some unrelated files in method_comparison; could you please undo those changes? Usually, when you run make style, that directory should not be included.

I think a good next step is to add HiRA to the testing matrix we have in PEFT. For now, let's add some entries similar to the ones you can find here:

("Vanilla MLP 1 LoRA", "MLP", LoraConfig, {"target_modules": "lin0"}),
("Vanilla MLP 2 LoRA", "MLP", LoraConfig, {"target_modules": ["lin0"]}),
("Vanilla MLP 3 LoRA", "MLP", LoraConfig, {"target_modules": ["lin1"]}),

Since you also support embedding and conv layers, please make sure to include examples with those layers as well (basically, copy the relevant examples from LoRA and adjust them).
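For instance, the new entries could look roughly like this (HiraConfig is the assumed config class name; copy the exact test model names from the corresponding LoRA entries):

("Vanilla MLP 1 HiRA", "MLP", HiraConfig, {"target_modules": "lin0"}),
("Vanilla MLP 2 HiRA", "MLP", HiraConfig, {"target_modules": ["lin0"]}),
("Vanilla MLP 3 HiRA", "MLP", HiraConfig, {"target_modules": ["lin1"]}),
("Embedding + transformers Conv1D 1 HiRA", "EmbConv1D", HiraConfig, {"target_modules": ["conv1d"]}),
("Conv2d 1 HiRA", "Conv2d", HiraConfig, {"target_modules": ["conv2d"]}),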

Then, please run pytest tests/test_custom_models.py -k "hira and not shira" -v and see if those tests pass. Once we get there, we can discuss the best next steps.

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

@BenjaminBossan
Member

@hqsiswiliam Do you still plan on working on this PR?

@hqsiswiliam
Author

@hqsiswiliam Do you still plan on working on this PR?

Hi, BenjaminBossan. Thanks for checking in! I’ll continue working on this PR over the next few days.

@hqsiswiliam
Author

Hi, sorry for the long delay. I have updated the code and synced it with the latest changes in the repository. Could you please reopen this PR so I can push the updates? Thanks a lot for your time and help.

@BenjaminBossan BenjaminBossan reopened this Nov 5, 2025
@BenjaminBossan
Member

Thanks, done @hqsiswiliam

@hqsiswiliam
Author

Thanks, done @hqsiswiliam

Hi, thanks for reopening the PR! The latest changes have been pushed, and I’ve synced everything with the most recent updates in the repository. Please let me know if there are any additional suggestions or further steps needed. 😄

@BenjaminBossan BenjaminBossan (Member) left a comment

Thanks for your work on adding HiRA to PEFT. In this review I focused on the core implementation for now. Many of my comments stem from the fact that this PR is based on the LoRA code with changes made to accommodate HiRA, and some of that LoRA code doesn't make sense here. Please check my comments on this.

I also thought about suggesting to add HiRA as a LoRA variant, similar to how DoRA is currently implemented. That could save a lot of code. Conceptually, however, I think HiRA is sufficiently different from LoRA that I wouldn't consider it a LoRA variant. LMK what you think about it.

Please merge with/rebase on the latest main branch and, once you finish your changes, please call make style.

Member

We also need an entry in the toctree for this to show up in the docs.

Member

This file is based on lora/model.py with a few changes for HiRA. However, lora/model.py has changed significantly since then; could you please update hira/model.py based on the latest lora/model.py? It should be simpler now because we removed many methods.

`Conv1D` which stores weights like (fan_in, fan_out) and hence this should be set to `True`.
modules_to_save (`List[str]`):
List of modules apart from adapter layers to be set as trainable and saved in the final checkpoint.
init_hira_weights (`bool` | `Literal["gaussian"]`):
Member

Let's rename this to init_weights.

from .config import HiRAConfig


class HiRALayer(BaseTunerLayer):
Member

Let's rename this to HiraLayer.

self.active_adapter = adapter_name

@contextmanager
def _enable_peft_forward_hooks(self, *args, **kwargs):
Member

I'd say let's remove _enable_peft_forward_hooks from HiRA for now. It's not trivial to support; we should get the basics in place first, and later we can think about whether we want to add it or not.

msg = "Cannot pass `adapter_names` when there are merged adapters, please call `unmerge_adapter` first."
raise ValueError(msg)

def _mixed_batch_forward(
Member

This method and its calls below can be removed if we drop _enable_peft_forward_hooks.

weight_A = weight_A.float()
weight_B = weight_B.float()
output_tensor = transpose((weight_B @ weight_A), self.fan_in_fan_out)
assert self.get_base_layer().weight.shape == output_tensor.shape
Member

Let's not use any asserts in the code (except for tests). I think this check can be removed; if you want to keep it, raise a proper error instead.
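For example, the check could be turned into an explicit error, along these lines (sketch only):

if self.get_base_layer().weight.shape != output_tensor.shape:
    raise ValueError(
        f"HiRA delta weight has shape {output_tensor.shape}, but the base layer weight "
        f"has shape {self.get_base_layer().weight.shape}."
    )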

Comment on lines +70 to +94
elif isinstance(base_layer, nn.MultiheadAttention):
if not base_layer._qkv_same_embed_dim:
raise ValueError(f"Only same dim for query/key/value is supported as of now for {self.__class__}.")
in_features, out_features = base_layer.embed_dim, 3 * base_layer.embed_dim
elif hasattr(base_layer, "infeatures") and hasattr(base_layer, "outfeatures"):
# QuantLinear
in_features, out_features = base_layer.infeatures, base_layer.outfeatures
elif hasattr(base_layer, "input_size") and hasattr(base_layer, "output_size"):
# Megatron ColumnParallelLinear,RowParallelLinear
in_features, out_features = base_layer.input_size, base_layer.output_size
elif hasattr(base_layer, "codebooks") and base_layer.__class__.__name__ == "QuantizedLinear":
# AQLM QuantLinear
in_features, out_features = base_layer.in_features, base_layer.out_features
elif hasattr(base_layer, "w_bit") and base_layer.__class__.__name__ == "WQLinear_GEMM":
# Awq layers
in_features, out_features = base_layer.in_features, base_layer.out_features
elif base_layer.__class__.__name__ == "EetqLinear":
# Eetq layers
in_features, out_features = base_layer.in_features, base_layer.out_features
elif hasattr(base_layer, "W_q") and base_layer.__class__.__name__ == "HQQLinear":
# HQQ layers
in_features, out_features = base_layer.in_features, base_layer.out_features
elif base_layer.__class__.__name__ == "PatchedLinear":
# INC layers
in_features, out_features = base_layer.in_features, base_layer.out_features
Member

Suggested change: remove the quoted lines above.

Since all these are not supported, let's remove them.

Comment on lines +480 to +485
if hasattr(target, "unload_and_optionally_merge_module"):
# if layers have special unloading method, like MultiheadAttention, use that
unloaded_module = target.unload_and_optionally_merge_module(
merge=merge, safe_merge=safe_merge, adapter_names=adapter_names
)
self._replace_module(parent, target_name, unloaded_module, target)
Member

We have no layers with unload_and_optionally_merge_module in HiRA, so let's remove this.

from peft.tuners.hira.layer import Conv2d as HiraConv2d


def test_hira_linear_merge_unmerge_basic():
Member

I believe these tests are redundant with the existing tests we have in PEFT. Or do you see a gap in the PEFT tests that would require them?

The only one that could be worth keeping is test_manual_hira_linear_equivalence.
