Commit
Add support for FSDP+QLoRA and DeepSpeed ZeRO3+QLoRA (huggingface#1416)
* don't do mp casting
* don't use `prepare_for_kbit` when using fsdp+qlora or dsz3+qlora
* changes to enable fsdp+qlora and dsz3+qlora
* revert
* Update sft_trainer.py
* quality
* fix deprecation using changes from PR huggingface#1415
* fixes
* quality
* Update trl/trainer/sft_trainer.py

Co-authored-by: Younes Belkada <[email protected]>

* quality
* relaunch tests

---------

Co-authored-by: Younes Belkada <[email protected]>