Skip to content

corrects loss function for Self-play Preference Optimization hard label version#1615

Merged
kashif merged 3 commits intohuggingface:mainfrom angelahzyuan:mainMay 3, 2024

Commits

Commits on May 3, 2024