Seed is not applied for DPO recipes #2335

Closed
bogdansalyp opened this issue Feb 3, 2025 · 3 comments · Fixed by #2367
Labels: bug (Something isn't working), triaged (This issue has been assigned an owner and appropriate label)

Comments

@bogdansalyp (Contributor) commented Feb 3, 2025

TL;DR

Launching the same config twice with seed: 42 results in two different loss curves.

[Screenshot: loss curves diverging across two runs with seed: 42]

Affected recipes

full_dpo_distributed - seed is not set

(The full DPO recipe was added in #2275.)

[Screenshot: full_dpo_distributed loss curves differing across runs]

lora_dpo_distributed - seed is not set

[Screenshot: lora_dpo_distributed loss curves differing across runs]

Unaffected recipes

full_finetune_distributed - works fine

[Screenshot: full_finetune_distributed loss curves matching across runs]

lora_finetune_distributed - works fine

[Screenshot: lora_finetune_distributed loss curves matching across runs]
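
For reference, the unaffected recipes apply the seed once at recipe init. Below is a minimal sketch of that pattern, assuming torchtune's training.set_seed helper; the class name is hypothetical, and this illustrates the pattern rather than reproducing the recipes' exact code:

```python
from torchtune import training


class LoRADPORecipeDistributed:  # hypothetical name, for illustration only
    def __init__(self, cfg) -> None:
        # The unaffected finetune recipes seed the RNGs once here; the DPO
        # recipes skip this call, so a configured seed never takes effect.
        self.seed = training.set_seed(seed=cfg.seed)
```

Adding the equivalent call to the two DPO recipes is what would make seed: 42 meaningful there.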
@acisseJZhong (Contributor) commented:

Hi @bogdansalyp, could you share more about the run information, e.g. the config and run command?

@bogdansalyp (Contributor, Author) commented Feb 3, 2025

> Hi @bogdansalyp, could you share more about the run information, e.g. the config and run command?

Yes, it's just the standard llama3_1/8B_lora_dpo.yaml config, but with seed: 42 instead of seed: null.
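
For concreteness, the change relative to the stock config looks like this (illustrative excerpt; everything else in llama3_1/8B_lora_dpo.yaml is left at its defaults):

```yaml
# llama3_1/8B_lora_dpo.yaml (excerpt)
seed: 42  # was: seed: null
```

No run command was given in the thread; a typical invocation would be something like `tune run --nproc_per_node 2 lora_dpo_distributed --config llama3_1/8B_lora_dpo`, with the GPU count being an assumption here.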

I haven't tested other recipes yet.

Update: I've added the recipes I've tried to the description.

@bogdansalyp changed the title from "Seed is not applied to some recipes" to "Seed is not applied for DPO recipes" on Feb 3, 2025
@joecummings added the bug and triaged labels on Feb 4, 2025
@ebsmothers (Contributor) commented:

@bogdansalyp agree this looks a bit weird. One observation, though: the y-axis range in some of the plots is really small, so I wonder whether you also see variation of ~1e-3 in the unaffected recipes' losses (exact numerical parity is not achievable with bf16). For debugging, I would suggest inspecting two things: (1) are the model weights the same across runs? (2) are the samples seen the same across runs? If both (1) and (2) hold, my guess would be that it's just accumulated numerical error. For (1) in the LoRA DPO recipe especially: we don't load LoRA weights, so those are randomly initialized, and it's definitely worth checking whether they're identical across runs. (It's also possible that there's another source of randomness I haven't yet accounted for.)
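
A minimal sketch of checks (1) and (2), assuming each run dumps its initial model state_dict and first-batch token ids to files (the run_a/run_b paths and file names below are hypothetical):

```python
import torch

# (1) Are the model weights (including the randomly initialized LoRA params)
# identical across the two runs?
sd_a = torch.load("run_a/initial_state.pt", map_location="cpu")
sd_b = torch.load("run_b/initial_state.pt", map_location="cpu")
mismatched = [k for k in sd_a if not torch.equal(sd_a[k], sd_b[k])]
print("mismatched params:", mismatched or "none")

# (2) Are the samples seen the same? Save batch["tokens"] for the first few
# steps inside each run, then compare the dumps.
tokens_a = torch.load("run_a/first_batch_tokens.pt")
tokens_b = torch.load("run_b/first_batch_tokens.pt")
print("identical first batch:", torch.equal(tokens_a, tokens_b))
```

If the LoRA A/B matrices show up in the mismatch list, the missing seeding at recipe init is the likely culprit; if only the batches differ, the dataloader/sampler seeding is the place to look.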
