[KTO] fix tokenization bugs #1418

kawine · 2024-03-12T00:51:15Z

fixed the tokenization bugs mentioned here Bos token in prompt_input_ids but not in completion_input_ids? #1401
made some minor changes to the doc to clarify this issue KTOTrainer vs kto_loss in DPO-Trainer #1387

cc @kashif

…e batch_size losses

add reference to paper Co-authored-by: lewtun <[email protected]>

Co-authored-by: Kashif Rasul <[email protected]>

Co-authored-by: lewtun <[email protected]>

kashif · 2024-03-12T08:09:19Z

thanks @kawine checking now

HuggingFaceDocBuilderDev · 2024-03-12T08:13:34Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

lewtun

Thanks a lot for the fix @kawine ! Would you mind adding a small unit test for the tokenize_row() method to ensure we don't regress on this in future 🙏 ?

kashif · 2024-03-12T13:58:49Z

@kawine you can use this test:

    def test_kto_trainer_tokenize_row(self):
        with tempfile.TemporaryDirectory() as tmp_dir:
            training_args = KTOConfig(
                output_dir=tmp_dir,
                per_device_train_batch_size=2,
                max_steps=3,
                remove_unused_columns=False,
                gradient_accumulation_steps=1,
                learning_rate=9e-1,
                evaluation_strategy="steps",
                beta=0.1,
            )

            dummy_dataset = self._init_dummy_dataset()

            trainer = KTOTrainer(
                model=self.model,
                ref_model=self.ref_model,
                args=training_args,
                tokenizer=self.tokenizer,
                train_dataset=dummy_dataset,
                eval_dataset=dummy_dataset,
            )

            row = dummy_dataset[0]

            # test that the row can be tokenized
            tokenized_row = trainer.tokenize_row(row)

            # Assert bos_token_id
            assert tokenized_row["prompt_input_ids"][0] == self.tokenizer.bos_token_id
            assert tokenized_row["completion_input_ids"][0] == self.tokenizer.bos_token_id

kawine · 2024-03-12T21:45:39Z

thanks! test has been added @kashif

tests/test_kto_trainer.py

* add warning for imbalanced data * update documentation * update script commands to be same as in dpo * use batch_size KL examples and batch_size target examples to calculate batch_size losses * fix deepspeed issue * speed up forward with no_grad for KL * add some removed metrics * Update trl/trainer/kto_trainer.py * Update trl/trainer/kto_trainer.py * Update trl/trainer/kto_trainer.py add reference to paper Co-authored-by: lewtun <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * Update trl/trainer/kto_trainer.py Co-authored-by: Kashif Rasul <[email protected]> * add more detailed comments * convert assert to ValueError * Update kto_trainer.py * precommit formatting * remove nans in metrics by gathering across machines * fix formatting * fix choice of mismatched examples for KL term * describe weights * fix hanging issue in distributed training * linting * move metrics to cpu * Update trl/trainer/kto_trainer.py Co-authored-by: lewtun <[email protected]> * Update trl/trainer/kto_trainer.py * Update trl/trainer/kto_trainer.py * fix tokenization error: lack of bos * change user warning for weight hyperparams * minor update to docs * reshape attention mask * reformat * add test for bos/eos tokens * move dependency location * Update tests/test_kto_trainer.py --------- Co-authored-by: Kashif Rasul <[email protected]> Co-authored-by: lewtun <[email protected]>

kawine and others added 30 commits February 24, 2024 18:19

add warning for imbalanced data

6ee3be4

update documentation

22dd810

update script commands to be same as in dpo

8d14930

use batch_size KL examples and batch_size target examples to calculat…

8a490af

…e batch_size losses

fix deepspeed issue

f826600

speed up forward with no_grad for KL

688ed6c

Merge branch 'huggingface:main' into main

587517b

add some removed metrics

e128f09

Update trl/trainer/kto_trainer.py

2d860b8

Update trl/trainer/kto_trainer.py

48d25ff

Update trl/trainer/kto_trainer.py

392bcc0

add reference to paper Co-authored-by: lewtun <[email protected]>

Update trl/trainer/kto_trainer.py

a42049f

Co-authored-by: Kashif Rasul <[email protected]>

Update trl/trainer/kto_trainer.py

5696814

Co-authored-by: Kashif Rasul <[email protected]>

Update trl/trainer/kto_trainer.py

000d5d8

Co-authored-by: Kashif Rasul <[email protected]>

Update trl/trainer/kto_trainer.py

2738d1f

Co-authored-by: Kashif Rasul <[email protected]>

Update trl/trainer/kto_trainer.py

d7f63c5

Co-authored-by: Kashif Rasul <[email protected]>

Update trl/trainer/kto_trainer.py

824da55

Co-authored-by: Kashif Rasul <[email protected]>

Update trl/trainer/kto_trainer.py

4399af4

Co-authored-by: Kashif Rasul <[email protected]>

Update trl/trainer/kto_trainer.py

69094be

Co-authored-by: Kashif Rasul <[email protected]>

Update trl/trainer/kto_trainer.py

73f7ed7

Co-authored-by: Kashif Rasul <[email protected]>

Update trl/trainer/kto_trainer.py

5b95aca

Co-authored-by: Kashif Rasul <[email protected]>

Update trl/trainer/kto_trainer.py

3102901

Co-authored-by: Kashif Rasul <[email protected]>

add more detailed comments

ca68f24

convert assert to ValueError

94fb375

Update kto_trainer.py

8f7e788

precommit formatting

ed19ed5

Merge branch 'main' of https://github.com/kawine/trl into main

310bd97

Merge branch 'huggingface:main' into main

639f4de

remove nans in metrics by gathering across machines

ee7d6a4

fix formatting

7ae95c2

kawine and others added 15 commits March 5, 2024 18:55

fix hanging issue in distributed training

1f145b9

linting

83ed882

Merge branch 'main' of https://github.com/kawine/trl into main

9c5480d

move metrics to cpu

15251ff

Update trl/trainer/kto_trainer.py

8f9fdfe

Co-authored-by: lewtun <[email protected]>

Update trl/trainer/kto_trainer.py

600aad8

Update trl/trainer/kto_trainer.py

8b5367e

Merge branch 'huggingface:main' into main

5cc6fed

Merge branch 'huggingface:main' into main

03dfe90

fix tokenization error: lack of bos

1680de6

change user warning for weight hyperparams

80fa86d

minor update to docs

8f112ce

reshape attention mask

0cc2d8f

reformat

eed3044

Merge branch 'main' of https://github.com/kawine/trl into main

5d7fdd1

lewtun reviewed Mar 12, 2024

View reviewed changes

add test for bos/eos tokens

0bfd326

kawine added 2 commits March 13, 2024 18:40

Merge branch 'huggingface:main' into main

86af5dc

move dependency location

a1dfa81

kashif reviewed Mar 14, 2024

View reviewed changes

tests/test_kto_trainer.py Outdated Show resolved Hide resolved

Update tests/test_kto_trainer.py

19afc89

kashif approved these changes Mar 14, 2024

View reviewed changes

kashif merged commit fb6ebb1 into huggingface:main Mar 14, 2024
2 of 9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[KTO] fix tokenization bugs #1418

[KTO] fix tokenization bugs #1418

kawine commented Mar 12, 2024

kashif commented Mar 12, 2024

HuggingFaceDocBuilderDev commented Mar 12, 2024

lewtun left a comment

kashif commented Mar 12, 2024

kawine commented Mar 12, 2024

[KTO] fix tokenization bugs #1418

[KTO] fix tokenization bugs #1418

Conversation

kawine commented Mar 12, 2024

kashif commented Mar 12, 2024

HuggingFaceDocBuilderDev commented Mar 12, 2024

lewtun left a comment

Choose a reason for hiding this comment

kashif commented Mar 12, 2024

kawine commented Mar 12, 2024