VLM dpo bug #1972

liuchaohu · 2024-08-26T12:00:42Z

trl/trainer/dpo_trainer.py, line 542:
The tokenizer argument passed to super().__init__() should be self.tokenizer instead of tokenizer; otherwise the earlier is_vision_model handling is undone.

qgallouedec · 2024-08-26T12:15:25Z

What do you mean by "otherwise the previous is_vision_model will be invalid"?

When the model is a VLM, transformers.Trainer expects a processor for the tokenizer argument, not a tokenizer.
In other words:

Trainer(
    ...,
    # tokenizer=processor.tokenizer, # NO
    tokenizer=processor, # YES
)

liuchaohu · 2024-08-26T14:30:39Z

In lines 341-343, if self.is_vision_model is True, then self.processor = tokenizer (i.e., the processor) and self.tokenizer = tokenizer.tokenizer.
However, in lines 536-548, super().__init__(...) sets self.tokenizer = tokenizer (i.e., the processor) again.
As a result, both self.tokenizer and self.processor end up pointing to the processor.
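
The overwrite described above can be reproduced without trl or transformers. This is a minimal sketch, not the actual DPOTrainer code: FakeTokenizer, FakeProcessor, BaseTrainer, SketchDPOTrainer and the pass_self_tokenizer flag are all hypothetical stand-ins, and BaseTrainer assumes only the one behavior reported here, namely that the base class stores its tokenizer argument on self.tokenizer. It shows the attribute aliasing only; it does not settle which object the base Trainer should receive for a VLM.

```python
class FakeTokenizer:
    """Hypothetical stand-in for a text tokenizer."""


class FakeProcessor:
    """Hypothetical stand-in for a VLM processor that wraps a tokenizer."""

    def __init__(self):
        self.tokenizer = FakeTokenizer()


class BaseTrainer:
    """Assumed behavior only: the base class stores its `tokenizer` argument."""

    def __init__(self, tokenizer=None):
        self.tokenizer = tokenizer


class SketchDPOTrainer(BaseTrainer):
    """Hypothetical subclass mirroring the two steps described in the comment."""

    def __init__(self, tokenizer=None, is_vision_model=False, pass_self_tokenizer=False):
        self.is_vision_model = is_vision_model
        if self.is_vision_model:
            # Step 1: split the processor from its inner tokenizer.
            self.processor = tokenizer
            self.tokenizer = tokenizer.tokenizer
        else:
            self.tokenizer = tokenizer
        # Step 2: the base class assigns self.tokenizer again.
        chosen = self.tokenizer if pass_self_tokenizer else tokenizer
        super().__init__(tokenizer=chosen)


processor = FakeProcessor()

# Reported behavior: the raw argument is forwarded, so step 1 is overwritten.
reported = SketchDPOTrainer(tokenizer=processor, is_vision_model=True)
print(reported.tokenizer is reported.processor)  # True

# Proposed change: forward self.tokenizer, so the split from step 1 survives.
proposed = SketchDPOTrainer(
    tokenizer=processor, is_vision_model=True, pass_self_tokenizer=True
)
print(proposed.tokenizer is processor.tokenizer)  # True
```

With the proposed change the base class receives the inner tokenizer rather than the processor, which is the trade-off raised in the earlier reply about what transformers.Trainer expects for a VLM.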

qgallouedec · 2024-10-20T17:06:56Z

We are refactoring DPO data processing. The new implementation should be more readable.

qgallouedec added the 👁️ VLM Related to Visual Language Models label Aug 26, 2024

qgallouedec added the 🏋 DPO Related to DPO label Oct 7, 2024

qgallouedec linked a pull request Oct 20, 2024 that will close this issue

Refactor DPO data processing #2209

Merged

5 tasks

qgallouedec closed this as completed in #2209 Oct 21, 2024
