Improvements 1a #70
Conversation
Thanks @edbeeching, this looks great! I left a few minor comments; some of them can also be addressed in a follow-up PR.
# define a reward for response
# (this could be any reward such as human feedback or output from another model)
- reward = [torch.tensor(1.0)]
+ reward = [torch.tensor(1.0).to(device)]
We should let accelerate handle device placement internally. (Can address it in a follow-up PR if you want.)
I think the reward would normally be from a preference model, so it would probably be on the correct device. But in this example that is not the case. I'll leave it to another PR.
Yes, I discussed with @younesbelkada that we should add some device placement steps to PPOTrainer.step.
Yes, this will be addressed in a PR right after merging this one.
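For context, here is a minimal sketch of what such a device-placement step could look like. The helper name and the idea of reading the target device from the trainer are assumptions for illustration, not the API added by the follow-up PR.

```python
import torch

def move_rewards_to_device(rewards, device):
    """Hypothetical helper (not TRL's actual API): move scalar rewards onto
    the device the trainer runs on before they enter the PPO step."""
    return [
        r.to(device) if isinstance(r, torch.Tensor) else torch.tensor(float(r), device=device)
        for r in rewards
    ]

# Example: a CPU reward (e.g. from human feedback) is placed on the training
# device, so callers would no longer need to write `.to(device)` themselves.
device = "cuda" if torch.cuda.is_available() else "cpu"
rewards = move_rewards_to_device([torch.tensor(1.0)], device)
```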
# isort: off
from .utils import AdaptiveKLController, FixedKLController

# isort: on
Why are the utils imports separate from the others?
Sorting these imports with isort causes a circular import in PPOTrainer, so I added a comment. Alternatively, we could exclude them from the __init__.py file and/or use a relative import in PPOTrainer.
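As a rough sketch of the relative-import alternative mentioned above (the module path and package layout are assumed for illustration, not taken from this PR):

```python
# trl/trainer/ppo_trainer.py -- hypothetical layout, for illustration only.
#
# A relative import from the sibling utils module never re-enters
# trl/__init__.py, so isort could reorder the package-level imports without
# triggering the circular import described above.
from .utils import AdaptiveKLController, FixedKLController
```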
Looks good - thanks @edbeeching!
Thanks for taking care of the improvements, @edbeeching!
This PR: