add: support for `peft` in ddpo. #1165

sayakpaul · 2024-01-01T10:41:35Z

It's time.

HuggingFaceDocBuilderDev · 2024-01-01T10:47:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul · 2024-01-01T10:48:20Z

trl/models/modeling_sd_base.py

+            lora_config = LoraConfig(
+                r=4,
+                lora_alpha=4,
+                init_lora_weights="gaussian",
+                target_modules=["to_k", "to_q", "to_v", "to_out.0"],
+            )


This matches with what was done previously.

sayakpaul · 2024-01-01T11:40:13Z

trl/models/modeling_sd_base.py

-            # Set correct lora layers
-            lora_attn_procs = {}
-            for name in self.sd_pipeline.unet.attn_processors.keys():
-                cross_attention_dim = (
-                    None if name.endswith("attn1.processor") else self.sd_pipeline.unet.config.cross_attention_dim
-                )
-                if name.startswith("mid_block"):
-                    hidden_size = self.sd_pipeline.unet.config.block_out_channels[-1]
-                elif name.startswith("up_blocks"):
-                    block_id = int(name[len("up_blocks.")])
-                    hidden_size = list(reversed(self.sd_pipeline.unet.config.block_out_channels))[block_id]
-                elif name.startswith("down_blocks"):
-                    block_id = int(name[len("down_blocks.")])
-                    hidden_size = self.sd_pipeline.unet.config.block_out_channels[block_id]
-
-                lora_attn_procs[name] = LoRAAttnProcessor(
-                    hidden_size=hidden_size, cross_attention_dim=cross_attention_dim
-                )
-            self.sd_pipeline.unet.set_attn_processor(lora_attn_procs)


No crazy layer iteration and state dict munging. Pretty please. Thanks to peft.

Pretty swell that this is being cleanly replaced. Sweet stuff

metric-space · 2024-01-02T05:07:11Z

hey @sayakpaul things look splendid. Nothing has really changed in theory but would be nice to have a test run that shows convergence but I'll leave it to your and @younesbelkada's discretion to do without

sayakpaul · 2024-01-02T05:43:01Z

@metric-space yes, will do! Thanks for your reviews.

sayakpaul · 2024-01-02T09:00:59Z

WandB run page: https://wandb.ai/sayakpaul/stable_diffusion_training/runs/7ebll3fb?workspace=user-sayakpaul.

This pig is too cute:

kashif · 2024-01-02T09:43:23Z

LGTM! thanks

younesbelkada · 2024-01-08T04:25:08Z

Awesome work @sayakpaul and team !

* add: support for peft in ddpo. * revert to the original modeling_base. * style * specify weight_name * explicitly specify weight_name * fix: parameter parsing * fix: trainable_layers. * parameterize use_lora. * fix one more trainable_layers * debug * debug * more fixes. * manually set unet of sd_pipeline * make trainable_layers cleaner. * more fixes * remove prints. * tester class for LoRA too.

sayakpaul added 2 commits January 1, 2024 16:09

add: support for peft in ddpo.

f7bdda2

revert to the original modeling_base.

e047d32

style

054e78f

sayakpaul commented Jan 1, 2024

View reviewed changes

sayakpaul added 2 commits January 1, 2024 17:02

specify weight_name

ef82af0

explicitly specify weight_name

2d0abdf

sayakpaul commented Jan 1, 2024

View reviewed changes

metric-space approved these changes Jan 2, 2024

View reviewed changes

sayakpaul added 10 commits January 2, 2024 12:40

fix: parameter parsing

25f4d7c

fix: trainable_layers.

7d21527

parameterize use_lora.

f0879f7

fix one more trainable_layers

dd1d43c

debug

77a6354

debug

0d09b63

more fixes.

a29204d

manually set unet of sd_pipeline

02d36d4

make trainable_layers cleaner.

900b629

more fixes

c3dc0cc

sayakpaul added 2 commits January 2, 2024 14:32

remove prints.

359c94f

tester class for LoRA too.

f65fea6

kashif self-requested a review January 2, 2024 11:51

kashif approved these changes Jan 2, 2024

View reviewed changes

kashif merged commit 20428c4 into huggingface:main Jan 2, 2024
9 checks passed

sayakpaul deleted the harmonize-lora-ddpo branch January 2, 2024 12:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add: support for `peft` in ddpo. #1165

add: support for `peft` in ddpo. #1165

sayakpaul commented Jan 1, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Jan 1, 2024

sayakpaul Jan 1, 2024

sayakpaul Jan 1, 2024

metric-space Jan 2, 2024 •

edited

Loading

metric-space commented Jan 2, 2024

sayakpaul commented Jan 2, 2024

sayakpaul commented Jan 2, 2024 •

edited

Loading

kashif commented Jan 2, 2024

younesbelkada commented Jan 8, 2024

add: support for peft in ddpo. #1165

add: support for peft in ddpo. #1165

Conversation

sayakpaul commented Jan 1, 2024 • edited Loading

HuggingFaceDocBuilderDev commented Jan 1, 2024

sayakpaul Jan 1, 2024

Choose a reason for hiding this comment

sayakpaul Jan 1, 2024

Choose a reason for hiding this comment

metric-space Jan 2, 2024 • edited Loading

Choose a reason for hiding this comment

metric-space commented Jan 2, 2024

sayakpaul commented Jan 2, 2024

sayakpaul commented Jan 2, 2024 • edited Loading

kashif commented Jan 2, 2024

younesbelkada commented Jan 8, 2024

add: support for `peft` in ddpo. #1165

add: support for `peft` in ddpo. #1165

sayakpaul commented Jan 1, 2024 •

edited

Loading

metric-space Jan 2, 2024 •

edited

Loading

sayakpaul commented Jan 2, 2024 •

edited

Loading