Small changes when integrating into H4 #216

Merged 2 commits into main from nol_nits on Mar 14, 2023
Conversation

@natolambert (Contributor) commented Mar 14, 2023

Two changes:

  1. Pass the optimizer in the sentiment example (currently the variable was not passed into the trainer).
  2. [I think] fix the kwarg option for the wandb config of Accelerate. See this docs page, where init_kwargs is handled differently. When trying to use this with the code as is, wandb gets read as a kwarg and is not handled correctly by this line. If this is different for tensorboard, it may just be incompatible. (See the sketch below.)
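
For reference, a minimal sketch (not the TRL code itself; the project name, run name, and hyperparameters below are made up) of how Accelerate's init_trackers expects tracker-specific options nested under the tracker name via init_kwargs:

from accelerate import Accelerator

accelerator = Accelerator(log_with="wandb")
accelerator.init_trackers(
    project_name="trl",                         # hypothetical project name
    config={"learning_rate": 1.47e-5},          # hyperparameters to log
    init_kwargs={"wandb": {"name": "my-run"}},  # forwarded to wandb.init()
)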

Let me know if I'm wrong!

Fixes: #215

@natolambert (Contributor, Author)

Closes #215 if I'm correct on point 1, @younesbelkada!

@HuggingFaceDocBuilderDev commented Mar 14, 2023

The documentation is not available anymore as the PR was closed or merged.

@natolambert (Contributor, Author)

I tested the logging change with my code in H4 (https://github.com/huggingface/h4/pull/73), and it fixed my problem!

@younesbelkada (Contributor) left a comment

Thanks a lot for the PR!
Agreed on the first point, great catch!
Regarding the second point, I have slight doubts that it may break things with tensorboard. If it's not vital, we can leave it for a follow-up PR to test properly; otherwise, you can quickly test any script with log_with="tensorboard" and see if the training runs.
Thanks!

@natolambert (Contributor, Author)

I'll test tensorboard today. FYI this is needed for the script in H4, so I'll be motivated to get this working soon.

If tensorboard doesn't work, I'll prolly do an if statement.

@natolambert (Contributor, Author)

@younesbelkada I think I ran this with tensorboard (just changed the config as follows and it didn't error). Seems good to me?

The tracker_kwargs field I changed actually wasn't used anywhere in TRL to date.

from trl import PPOConfig

config = PPOConfig(
    model_name="ybelkada/gpt-j-6b-sharded-bf16",
    learning_rate=(1.47e-5) * 2,
    # log_with="wandb",
    log_with="tensorboard",
    accelerator_kwargs={"logging_dir": "/home/nathan/logs/"},
    batch_size=32,
    forward_batch_size=1,
)
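
For comparison, a rough sketch of what the wandb side might look like after this change, assuming tracker_kwargs is forwarded to Accelerate's init_trackers as init_kwargs (the nesting under "wandb" and the run name are my assumptions, not something tested in this PR):

config = PPOConfig(
    model_name="ybelkada/gpt-j-6b-sharded-bf16",
    learning_rate=(1.47e-5) * 2,
    log_with="wandb",
    # hypothetical wandb-specific options, forwarded to wandb.init()
    tracker_kwargs={"wandb": {"name": "gpt-j-sentiment"}},
    batch_size=32,
    forward_batch_size=1,
)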

@younesbelkada (Contributor) left a comment

Thanks a lot for fixing! 🔥

@younesbelkada (Contributor)

Thanks a lot for experimenting @natolambert ! LGTM

@natolambert natolambert merged commit 357730f into main Mar 14, 2023
@natolambert natolambert deleted the nol_nits branch March 14, 2023 21:15
Successfully merging this pull request may close these issues: Extra code in toxicity example