Grad accumulation and memory bugfix #220

edbeeching · 2023-03-14T12:32:34Z

Adds command line arg passing to sentiment and toxicity examples
Adds gradient accumulation as a command line arg (see Add gradient accumulation #218)
Fixes tensors not being detached from the graph when stored in stats, which led to excessive memory use.
Updates forward_batch_size -> minibatch size in a number of examples
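
To illustrate the gradient accumulation item above, here is a minimal pure-Python sketch, not the code from this PR. The flag names `--mini_batch_size` and `--gradient_accumulation_steps` and the helper functions are assumptions made for the example. It shows that summing per-minibatch gradients, each scaled by `1 / steps`, reproduces the gradient of the full batch when the minibatches have equal size.

```python
import argparse


def parse_args(argv=None):
    # Flag names are illustrative; the example scripts may name them differently.
    parser = argparse.ArgumentParser()
    parser.add_argument("--mini_batch_size", type=int, default=4)
    parser.add_argument("--gradient_accumulation_steps", type=int, default=1)
    return parser.parse_args(argv)


def grad_mse(w, xs, ys):
    # Gradient w.r.t. w of mean((w * x - y) ** 2) over the given samples.
    return sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)


def accumulated_grad(w, xs, ys, mini_batch_size, steps):
    # Accumulate one scaled gradient per minibatch; a single optimizer
    # step would follow after the loop.
    total = 0.0
    for i in range(steps):
        lo = i * mini_batch_size
        hi = lo + mini_batch_size
        total += grad_mse(w, xs[lo:hi], ys[lo:hi]) / steps
    return total


args = parse_args(["--mini_batch_size", "2", "--gradient_accumulation_steps", "3"])
effective_batch_size = args.mini_batch_size * args.gradient_accumulation_steps

xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [2.0, 4.1, 5.9, 8.2, 9.8, 12.1]
full = grad_mse(0.5, xs, ys)
accum = accumulated_grad(
    0.5, xs, ys, args.mini_batch_size, args.gradient_accumulation_steps
)
```

With these settings only two samples are processed at a time, while the update corresponds to an effective batch of six.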

By the way, I think there may be other places we use excessive memory due to storing attached tensors for too long. I will investigate further.
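
A small sketch of the memory issue described above, assuming PyTorch is installed. The `detach_stats` helper and the stats keys are hypothetical and are not the trainer's actual code. A tensor stored with its `grad_fn` keeps a reference to its autograd graph, while a detached copy keeps only the values.

```python
import torch


def detach_stats(stats):
    # Hypothetical helper: drop autograd references from every tensor entry.
    return {
        key: value.detach() if isinstance(value, torch.Tensor) else value
        for key, value in stats.items()
    }


model = torch.nn.Linear(4, 1)
attached_history = []
detached_history = []

for step in range(3):
    x = torch.randn(8, 4)
    loss = model(x).pow(2).mean()
    stats = {"loss": loss, "step": step}

    attached_history.append(stats)                # still references the graph
    detached_history.append(detach_stats(stats))  # values only

    loss.backward()
    model.zero_grad()
```

Entries in `attached_history` still carry a `grad_fn`, so the graph objects they reference cannot be garbage collected for as long as the history is kept.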

edbeeching · 2023-03-14T12:34:30Z

I forgot to run style / quality. I am not on my dev machine at the moment. I will run this in an hour.

HuggingFaceDocBuilderDev · 2023-03-14T12:36:41Z

The documentation is not available anymore as the PR was closed or merged.

lvwerra

Thanks @edbeeching, looks very clean to me. Just one small comment.

examples/sentiment/scripts/gpt-neox-20b_peft/gpt-neo-20b_sentiment_peft.py

younesbelkada

Thanks a lot for this!
I left a single comment; otherwise I believe the fix proposed in #216 will fail.

trl/trainer/ppo_trainer.py

edbeeching requested a review from lvwerra March 14, 2023 12:32

lvwerra reviewed Mar 14, 2023

examples/sentiment/scripts/gpt-neox-20b_peft/gpt-neo-20b_sentiment_peft.py (outdated)

edbeeching added 10 commits March 15, 2023 11:31

adds args and grad accum steps to sentiment examples

bdab2f1

updates to minibatch size in peft 20b example

9cc5222

adds arg and grad acc to toxicity example

2e603e2

adds detach to all entries in the step stats to reduce memory usage

c028e6a

style

16995de

style2

4939869

adds accelerator accumulation context

543432c

style

a17d49a

makes gradient_accumulation_steps part of the PPOConfig

cf2db12

style

89c02aa

edbeeching force-pushed the grad-accu-memory-bugfix branch from 7a19275 to 89c02aa March 15, 2023 10:33

younesbelkada reviewed Mar 15, 2023

trl/trainer/ppo_trainer.py (outdated)

edbeeching and others added 2 commits March 15, 2023 21:15

Update trl/trainer/ppo_trainer.py

c7d0731

Co-authored-by: Younes Belkada <[email protected]>

style

6cef869

lvwerra approved these changes Mar 16, 2023

edbeeching merged commit 768c389 into main Mar 16, 2023

edbeeching deleted the grad-accu-memory-bugfix branch March 16, 2023 08:56

edbeeching mentioned this pull request Mar 16, 2023

adds a missing detach to the ratio #224

Merged

younesbelkada mentioned this pull request Jun 6, 2023

Fix correct gradient accumulation #407

Merged
