Skip to content

[KTOTrainer] add BCO (reward shift and underlying distribution matching)#1599

Merged
younesbelkada merged 11 commits intohuggingface:mainfrom seanexp:unpaired_bcoApr 30, 2024

Commits

Commits on Apr 28, 2024

Commits on Apr 29, 2024

Commits on Apr 30, 2024