[LoRA PEFT] fix LoRA loading so that correct alphas are parsed #6135
Conversation
Co-authored-by: pacman100 <13534540+pacman100@users.noreply.github.com>
Implementation-wise this looks good; I only have a few non-critical comments.

Regarding the overall problem: with some recent updates we made to PEFT, it should now be possible to pass `lora_alpha` to `forward`, similar to the non-PEFT LoRA layers of diffusers. So we could theoretically remove the "workaround" we have in diffusers and use the same mechanism as for non-PEFT to load and pass alphas. However, at this point I'm not sure whether making that change is worth it or whether we should live with the current situation.
@BenjaminBossan thanks for your comments. If you could provide an example for me here, that'd be helpful for us to gauge whether it's worthwhile to move forward with the proposed changes.
Makes sense to me, thanks very much @sayakpaul for leading this effort!
> If you could provide an example for me here, that'd be helpful for us to gauge whether it's worthwhile to move forward with the proposed changes.
So what I mean here is that when the PEFT integration was added, we had to treat the scaling differently for PEFT because we couldn't pass the `scale` argument to PEFT layers. This led to cases like these:
diffusers/src/diffusers/models/transformer_2d.py (lines 404 to 417 at 93ea26f):

```python
if not self.use_linear_projection:
    hidden_states = hidden_states.reshape(batch, height, width, inner_dim).permute(0, 3, 1, 2).contiguous()
    hidden_states = (
        self.proj_out(hidden_states, scale=lora_scale)
        if not USE_PEFT_BACKEND
        else self.proj_out(hidden_states)
    )
else:
    hidden_states = (
        self.proj_out(hidden_states, scale=lora_scale)
        if not USE_PEFT_BACKEND
        else self.proj_out(hidden_states)
    )
    hidden_states = hidden_states.reshape(batch, height, width, inner_dim).permute(0, 3, 1, 2).contiguous()
```

and:

```python
args = () if USE_PEFT_BACKEND else (scale,)
```
Now, in PEFT, we allow passing arbitrary extra arguments to `forward`, and the PEFT layer will pass them on to the base layer that's being adapted, as e.g. here:
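The linked snippet isn't reproduced above; purely as an illustrative sketch of that pass-through behavior (the toy module, the `scale` kwarg, and the config values are assumptions, not diffusers code, and a recent PEFT version with arg pass-through is assumed):

```python
# Illustrative sketch only: PEFT's LoRA layer forwards unknown positional and
# keyword arguments to the base layer it wraps.
import torch
import torch.nn as nn
from peft import LoraConfig, inject_adapter_in_model


class ScaledLinear(nn.Linear):
    """Toy base layer that accepts an extra `scale` kwarg, similar in spirit
    to the non-PEFT LoRA-compatible layers in diffusers."""

    def forward(self, x, scale: float = 1.0):
        return super().forward(x) * scale


class ToyBlock(nn.Module):
    def __init__(self):
        super().__init__()
        self.proj_out = ScaledLinear(8, 8)

    def forward(self, x, scale: float = 1.0):
        return self.proj_out(x, scale=scale)


model = ToyBlock()
model = inject_adapter_in_model(
    LoraConfig(r=4, lora_alpha=4, target_modules=["proj_out"]), model
)

# `scale` is not consumed by the PEFT wrapper; it is passed through to the
# adapted base layer's ScaledLinear.forward.
out = model(torch.randn(2, 8), scale=0.5)
```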
Therefore, I think we could remove all the workarounds we have where we have to scale and unscale the PEFT layer weights.
That said, I haven't tested it and it would be a lot of work to unwind these changes. Maybe it's just not worth it and we can keep the current implementation, as it works.
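For reference, a rough sketch (not actual diffusers code; `call_with_lora_scale` is a made-up helper name) of the scale/unscale workaround being discussed, using the `scale_lora_layers`/`unscale_lora_layers` utilities from `diffusers.utils`:

```python
# Rough sketch of the current workaround: with the PEFT backend, the LoRA
# scale is applied by temporarily scaling the PEFT layer weights around the
# call instead of passing `scale` into forward.
from diffusers.utils import USE_PEFT_BACKEND, scale_lora_layers, unscale_lora_layers


def call_with_lora_scale(module, hidden_states, lora_scale=1.0):
    if USE_PEFT_BACKEND:
        scale_lora_layers(module, lora_scale)    # multiply LoRA weights in place
        out = module(hidden_states)              # plain forward, no `scale` kwarg
        unscale_lora_layers(module, lora_scale)  # undo the scaling afterwards
        return out
    # Non-PEFT LoRA-compatible layers accept the scale directly in forward.
    return module(hidden_states, scale=lora_scale)
```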
@BenjaminBossan I agree with your assessment here. Also, I think I have addressed your comments. Let me know.
Thanks. Looks all good from my side.
@younesbelkada @BenjaminBossan @patrickvonplaten I would appreciate another round of review here. Benjamin, feel free to ignore the changes introduced in the training-related parts. I would wait for all three of you to approve this PR, as this change is quite impactful.
Hello @sayakpaul, great work on fixing the loading of LoRA weights with PEFT training support. This preserves the status quo of having a single safetensors weight file while correctly saving the related config. LGTM 🔥🚀✨! Also, thank you for taking the time to go through my suggestions and related code.
Impressive and inspiring work, @sayakpaul! Thanks for taking the lead on this!
One minor suggestion: it would be nice to detail this approach in the docs for people who are curious about how PEFT configs are saved internally in the safetensors checkpoints. What do you think?
I have edited the PR description so the community can refer to the details there. I think that might be better. WDYT @younesbelkada?
Sounds great @sayakpaul, thanks!
Closing this PR in favor of a simpler alternative, as described below. Simplified PR: #6225. We didn't have a concept of But that is NOT the case with So, @pacman100 suggested a simpler alternative. Just set the
What does this PR do?
Fixes #6087.
This PR ensures that the relevant `LoraConfig` is also serialized when the state dict is serialized. Otherwise, even if the LoRA state dict is passed properly with the `peft` backend, its underlying config might not be parsed correctly. So, we explicitly pass the config dictionary to `save_lora_weights()`, and `load_lora_weights()` takes care of parsing the config accordingly in a backward-compatible way.

To check, first generate a `peft` LoRA and then load it back with `load_lora_weights()`; a rough sketch of that round trip is given below.
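Not the PR's original snippets: a minimal sketch of the round trip, assuming the usual diffusers/PEFT training-script APIs (`add_adapter`, `get_peft_model_state_dict`, `convert_state_dict_to_diffusers`). The repo id, output directory, and LoRA config values are illustrative placeholders, and the extra config argument this PR adds to `save_lora_weights()` is not shown.

```python
# Minimal sketch (not from the PR): create a PEFT LoRA on the UNet with
# lora_alpha != rank, save it via the pipeline, then load it back.
from diffusers import StableDiffusionPipeline
from diffusers.utils import convert_state_dict_to_diffusers
from peft import LoraConfig
from peft.utils import get_peft_model_state_dict

pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")

# Attach a PEFT LoRA adapter to the UNet. lora_alpha differs from r, which is
# exactly the case where the alpha must survive serialization.
unet_lora_config = LoraConfig(
    r=4,
    lora_alpha=8,
    init_lora_weights="gaussian",
    target_modules=["to_k", "to_q", "to_v", "to_out.0"],
)
pipe.unet.add_adapter(unet_lora_config)

# ... train the adapter here ...

# Serialize the LoRA weights in the diffusers format.
unet_lora_state_dict = convert_state_dict_to_diffusers(get_peft_model_state_dict(pipe.unet))
StableDiffusionPipeline.save_lora_weights(
    save_directory="my-peft-lora",  # hypothetical output directory
    unet_lora_layers=unet_lora_state_dict,
)

# To load:
pipe.load_lora_weights("my-peft-lora")
```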
TODO
Inspired by @pacman100's https://github.com/pacman100/peft-dreambooth-ui/blob/main/train_dreambooth_peft.py#L136-L212 (hence he is also a co-author here).