Prodigy optimizer_type #277

teebarjunk opened this issue Aug 12, 2023 · 4 comments

@teebarjunk

I see #242 says it's implemented, but GitHub search can't find any evidence.

Tried installing with !pip install -U prodigyopt and manually entering Prodigy as the optimizer_type, but it didn't work.

Anyone had success?
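That attempt amounts to something like this in the Colab (a sketch; the exact cell layout is an assumption):

# Colab cell: install the Prodigy optimizer package
!pip install -U prodigyopt

# then, in the trainer settings:
optimizer_type = "Prodigy"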

@Reviem0 commented Aug 13, 2023

Tried this as well, no success here.

@MetroByte commented Aug 16, 2023

If this is relevant for you:
For the "kohya-trainer" Google Colab:

  1. Add a code block (on top, for example) with the code !pip install -U prodigyopt
  2. Proceed with the setup as usual up to 5.3
  3. You have to change your settings a bit in 5.3-5.5 (see the sketch after this list):
  • In optimizer_args, set the following (you can change these if you know what you are doing): ["decouple=True", "weight_decay=0.01", "d_coef=2", "use_bias_correction=True", "safeguard_warmup=True"]
  • Set unet_lr and text_encoder_lr to 1 (important)
  4. Then you have to change your config files in the folder LoRA/config so that config_file.toml uses the same optimizer settings.
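For reference, the 5.3-5.5 fields would then look roughly like this (a sketch in Python; the exact field names depend on the notebook version you're using):

optimizer_type = "Prodigy"
optimizer_args = ["decouple=True", "weight_decay=0.01", "d_coef=2", "use_bias_correction=True", "safeguard_warmup=True"]
# Prodigy adapts its own effective step size, so both learning rates
# are left at 1 and act as multipliers on the estimated step size.
unet_lr = 1.0
text_encoder_lr = 1.0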

Then you have to change a bit in kohya-trainer/train_network.py:

  1. Near the top (around line 12-13, with the other imports), add from prodigyopt import Prodigy
  2. Find and comment out (with #) the line containing optimizer_name, optimizer_args, optimizer = … (around line 224-227)
  3. Add the following lines below the commented-out line:

optimizer_name = "Prodigy"
optimizer_args = "['decouple': True, 'weight_decay': 0.01, 'd_coef': 2, 'use_bias_correction': True, 'safeguard_warmup': True]"
optimizer = Prodigy(trainable_params)

(the first two are for metadata printing only, though)

Then training should run fine (I hope)
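To sanity-check the install outside the trainer, a minimal standalone run (assuming only torch and prodigyopt are available; the toy model is made up) would be:

import torch
from prodigyopt import Prodigy

model = torch.nn.Linear(4, 1)
# lr stays at its default of 1.0; Prodigy estimates the step size itself
opt = Prodigy(model.parameters(), decouple=True, weight_decay=0.01,
              d_coef=2, use_bias_correction=True, safeguard_warmup=True)

x, y = torch.randn(8, 4), torch.randn(8, 1)
for _ in range(20):
    opt.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    loss.backward()
    opt.step()
print("final loss:", loss.item())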

If you re-run steps 5.2-5.4, the config file will be replaced, so it is better to change the config files directly.

I tested it, but I get overfitting results so early; I don't know how to set Prodigy correctly…

@teebarjunk (Author)

> I tested it, but I get overfitting results so early; I don't know how to set Prodigy correctly…

This guide says to use more epochs than repeats, and that he gets good results.

@teebarjunk (Author) commented Aug 18, 2023

To whom it may concern: Prodigy works nicely and fast, but can easily overcook. So epochs > repeats seems best.
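To see why (a sketch with a hypothetical 20-image dataset; kohya computes steps per epoch as images × repeats / batch size):

images, batch_size = 20, 1               # hypothetical dataset
print(images * 2 * 100 // batch_size)    # repeats=2,   epochs=100 -> 4000 steps
print(images * 100 * 2 // batch_size)    # repeats=100, epochs=2   -> 4000 steps
# Total steps are the same either way, but more epochs means more
# per-epoch checkpoints, so you can pick one from just before it overcooks.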

In section 5.2:

Set dataset_repeats to 2.

In section 5.4:

Set num_epochs to something higher, like 100.

Set save_n_epochs_type_value to 10

Set optimizer_args to ["decouple=True", "weight_decay=0.01", "d_coef=2", "use_bias_correction=True", "safeguard_warmup=False", "betas=0.9,0.999"]

And in the code for 5.4, look for sample_every_n_epochs and set it to something like 10, so you don't waste time generating previews every single epoch.
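Taken together, the relevant fields would look roughly like this (a sketch; the variable names follow the notebook sections mentioned above):

# 5.2
dataset_repeats = 2
# 5.4
num_epochs = 100
save_n_epochs_type_value = 10
optimizer_type = "Prodigy"
optimizer_args = ["decouple=True", "weight_decay=0.01", "d_coef=2", "use_bias_correction=True", "safeguard_warmup=False", "betas=0.9,0.999"]
sample_every_n_epochs = 10  # edited in the 5.4 code cell, not the settings form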

I also found this guide to keep an eye on.
