add manual seeding for RL experiments #118
Conversation
The documentation is not available anymore as the PR was closed or merged.
Thanks a lot for this PR and for taking care of the randomness!
I am very much OK with these changes, but I have a few questions:
1- Does this ensure exact reproducibility? I think you still need to set the manual seed before creating each model. Maybe we should cover this in more detail in the documentation.
2- Maybe we should set the seed only if the seed is not None, but no strong opinion here; maybe let's see what @lvwerra says.
Thanks again!
I think we should try it and see. It covers most use cases; I just don't really know how model loading etc. is handled in transformers. My intuition is that they're all pretrained and using torch, so the torch setting should work. Not so hard to check 🤗 (I can double-check that)
As @younesbelkada pointed out, we initialize the models before setting up the trainer.
How is the model initialized? I'm confused because I thought we were fine-tuning existing models. What parts get initialized? @lvwerra @younesbelkada -- seems like a gap in my new NLP knowledge :)
@natolambert it's actually the RL part: the value head is randomly initialized :)
As a side note, from what I have discovered, you need to declare the seed before initializing any new module, i.e. in this case before initializing the value head:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
linear_1 = nn.Linear(10, 10)
linear_2 = nn.Linear(10, 10)
# check weights are not the same
assert not torch.allclose(linear_1.weight, linear_2.weight)

torch.manual_seed(0)
linear_1 = nn.Linear(10, 10)
torch.manual_seed(0)
linear_2 = nn.Linear(10, 10)
# check weights are the same
assert torch.allclose(linear_1.weight, linear_2.weight)
```

If we make sure users do this, maybe it will be possible to ensure reproducibility - thus worth documenting it!
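To connect this to TRL, here is a minimal sketch of seeding before model loading. The `AutoModelForCausalLMWithValueHead` entry point is TRL's, but the `v_head.summary` attribute path and the "gpt2" checkpoint are assumptions for illustration:

```python
# Sketch: seed the torch RNG *before* loading, so that the randomly
# initialized value head comes out identical on every run.
import torch
from trl import AutoModelForCausalLMWithValueHead  # TRL's model-with-value-head wrapper

torch.manual_seed(0)
model_a = AutoModelForCausalLMWithValueHead.from_pretrained("gpt2")  # checkpoint is illustrative

torch.manual_seed(0)
model_b = AutoModelForCausalLMWithValueHead.from_pretrained("gpt2")

# The pretrained transformer weights come from the checkpoint either way; only
# the value head is drawn from the RNG, so it should now match across loads.
# (The attribute path `v_head.summary` is an assumption about the value-head layout.)
assert torch.allclose(model_a.v_head.summary.weight, model_b.v_head.summary.weight)
```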
I don't think we want all the weights to be the same, no? The initial seed should be enough to maintain reproducibility across runs -- does that make sense?
If we encounter issues later on, we could also incorporate set_full_determinism.
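For reference, a rough sketch of the kind of settings a full-determinism helper typically flips; this is an illustration of the idea, not the exact implementation behind the helper mentioned above:

```python
import os
import random

import numpy as np
import torch


def full_determinism_sketch(seed: int) -> None:
    """Illustrative only: seed every RNG and force deterministic kernels."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    # Deterministic CUDA kernels; this can slow training down and will raise
    # on ops that have no deterministic implementation.
    os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"
    torch.use_deterministic_algorithms(True)
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False
```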
trl/trainer/ppo_trainer.py
Outdated
```python
random.seed(seed)
# np.random.seed(seed)  # if np is added to functionality, add this line
torch.manual_seed(seed)
torch.cuda.manual_seed_all(seed)
```
(optional) This is personal taste, but I feel it's better not to force users to set a seed by default:
Suggested change:

```diff
-random.seed(seed)
-# np.random.seed(seed)  # if np is added to functionality, add this line
-torch.manual_seed(seed)
-torch.cuda.manual_seed_all(seed)
+if seed is not None and isinstance(seed, int):
+    random.seed(seed)
+    # np.random.seed(seed)  # if np is added to functionality, add this line
+    torch.manual_seed(seed)
+    torch.cuda.manual_seed_all(seed)
```
Thanks for your work on this @natolambert
I left a few minor comments. IMO we should not force users to set a seed by default, but maybe @lvwerra has a different opinion on this; or, if you think it's OK to force users to set a seed by default, then it's fine for me!
Co-authored-by: Younes Belkada <[email protected]>
I personally just don't really see an issue with the default seed being 0? It's very normal practice in RL, IMO, and non-research users generally just won't touch it. Maybe you didn't see the default seed in the constructor, which would explain some of the confusion. I don't know how a default seed is particularly different from no seeding.
LGTM thanks for this! 🚀
Yes your explanation makes sense!
Looks good to me, just two minor comments. I think without setting the seed before initializing the value head (i.e. loading the models, which happens outside the trainer), the training will behave differently no matter the internal seeding. That's why I would expose set_seed as a standalone function that's used both internally and externally.
For the default value, the transformers Trainer uses 42 :)
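A minimal sketch of what that standalone helper could look like; the body mirrors the lines from the diff above plus the numpy call discussed below, while the core.py placement follows the review suggestion rather than the merged implementation:

```python
# trl/core.py (sketch)
import random

import numpy as np
import torch


def set_seed(seed: int) -> None:
    """Seed python, numpy, and torch RNGs so repeated runs are reproducible."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
```

Used externally, it would be called once before the models (and their value heads) are created; internally, the trainer can call the same function with its seed argument.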
trl/trainer/ppo_trainer.py
Outdated
```python
    seed (`int`): The seed to set.
"""
random.seed(seed)
# np.random.seed(seed)  # if np is added to functionality, add this line
```
Let's maybe add it - I don't know if torch or some other library is using numpy somewhere.
trl/trainer/ppo_trainer.py
Outdated
```diff
@@ -875,6 +879,17 @@ def create_model_card(self, path: str, model_name: Optional[str] = "TRL Model")
         with open(os.path.join(path, "README.md"), "w", encoding="utf-8") as f:
             f.write(model_card_content)

+    def set_seed(self, seed: Optional[int]):
```
I would move the function to core.py and expose it to the user via __init__.py, so if we wanted we could use it in the script too.
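As an illustration of that usage from a training script (a hypothetical snippet, assuming set_seed gets re-exported in trl/__init__.py via something like `from .core import set_seed`):

```python
# Hypothetical user script, assuming `set_seed` is re-exported from trl/__init__.py
from trl import set_seed

set_seed(42)
# Models (and their randomly initialized value heads) are created *after*
# seeding, so repeated runs start from identical initial weights.
```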
@lvwerra or @younesbelkada is there any case where a model will be initialized randomly? If so, we need to make sure the seeds are set too.
Also, I ran make style and it modified those two lines of setup, so we can change them here or elsewhere.