[core / xxxTrainer] Automatic tagging #1329
Conversation
Verified that this PR does not create any conflict with the previous tagging logic already in place:

```python
import datasets
import peft
import transformers
import trl

model_dir = "HuggingFaceM4/tiny-random-LlamaForCausalLM"

tokenizer = transformers.AutoTokenizer.from_pretrained(model_dir)
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = "right"

model = transformers.AutoModelForCausalLM.from_pretrained(model_dir)

ds_train = datasets.load_dataset("imdb", split="train[:10]")

trainer = trl.SFTTrainer(
    model=model,
    args=transformers.TrainingArguments(
        output_dir="test-automatic-tagging-from-trainer",
        max_steps=1,
        remove_unused_columns=True,
    ),
    peft_config=peft.LoraConfig(
        lora_alpha=16,
        lora_dropout=0.1,
        r=8,
        bias="none",
        task_type="CAUSAL_LM",
    ),
    train_dataset=ds_train,
    tokenizer=tokenizer,
    dataset_text_field="text",
    max_seq_length=8,
)

model.push_to_hub("ybelkada/test-automatic-tagging")
trainer.push_to_hub()
```

https://huggingface.co/ybelkada/test-automatic-tagging-from-trainer
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Good idea!
Very nice!
cc @kashif to also add in KTO Trainer
Ah yes, thanks! Added to #1181
* automatic tagging
* add comments
* fix tests
* fix
What does this PR do?
This PR injects `trl` / `dpo` / `sft` etc. tags on the model at the trainer's init. That way, models pushed with `model.push_to_hub()` will also get the correct tags, not only models pushed by users who call `trainer.push_to_hub()`.
cc @lvwerra @osanseviero for awareness
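The mechanism described above can be sketched in plain Python, with no dependency on TRL itself. This is a hypothetical, simplified illustration (the class names `TinyModel`, `TinySFTTrainer`, and the method `add_model_tags` are stand-ins, not the actual TRL/transformers implementation): the trainer attaches its tags to the model at construction time, so any later push of the model carries them along.

```python
# Hypothetical sketch of trainer-side automatic tagging, not TRL's real code.

class TinyModel:
    """Stand-in for a Hub model that accumulates model-card tags."""

    def __init__(self):
        self.model_tags = []

    def add_model_tags(self, tags):
        # Deduplicate while preserving order, so tagging twice
        # (e.g. by two trainers) does not create conflicts.
        for tag in tags:
            if tag not in self.model_tags:
                self.model_tags.append(tag)


class TinySFTTrainer:
    """Stand-in trainer that injects its tags at init time."""

    _tag_names = ["trl", "sft"]

    def __init__(self, model):
        self.model = model
        # Tags are attached here, in __init__, so that a later
        # model.push_to_hub() picks them up even if the user never
        # calls trainer.push_to_hub().
        self.model.add_model_tags(self._tag_names)


model = TinyModel()
trainer = TinySFTTrainer(model)
print(model.model_tags)  # ['trl', 'sft']
```

Tagging at init (rather than at push time) is what makes both push paths consistent: the tags live on the model object itself, so whichever `push_to_hub` the user calls sees the same metadata.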