generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
⚡ vLLM for fast generation in GRPO (#2600)
* doc * fsdp * use vllm config * vllm * Update trl/trainer/grpo_config.py Co-authored-by: lewtun <[email protected]> * Update trl/trainer/grpo_config.py Co-authored-by: lewtun <[email protected]> * typo * top_k, top_p * Link to vllm pr * fix missing device * fix tests * fix citation * fix title and paper_id * formatting * output the correct number of generations * initial async vllm * fix missing args * fix promps * Pass prompt_token_ids directly * Repeat each prompt num_generations times * get the slice of results per processor * undo citation * OMG * nothing can resist me!!!! * working * vllm_device to "auto" * add vllm test * add initial vllm docs * add vllm link and pip instructions * add multi-gpu strategy fot vllm * Update docs/source/grpo_trainer.md Co-authored-by: Quentin Gallouédec <[email protected]> * Update docs/source/grpo_trainer.md Co-authored-by: Quentin Gallouédec <[email protected]> * Update docs/source/grpo_trainer.md Co-authored-by: Quentin Gallouédec <[email protected]> * add doc strings * Update docs/source/grpo_trainer.md Co-authored-by: lewtun <[email protected]> * Update trl/trainer/grpo_trainer.py Co-authored-by: lewtun <[email protected]> * Update docs/source/grpo_trainer.md Co-authored-by: lewtun <[email protected]> * add important tag * fix typo * overrides default batch size and grad accum and better doc * Under no circumstances should you examine the contents of this commit. * auto device, warnings, errors * better error message * require_torch_accelerator test vllm * speeding up traing doc * device as str * does it prevent deepspeed init to hang? * update docs * require torch accelertor for vllm test * unwrap compat with ds z3 * simplify examble in doc * More comments, fix ds3 hanging * faster, not sure why * style * move doc about speed * revert change in config files * fix default value in doc [ci skip] * style [ci skip] * better comment [ci skip] * fix warning * Update grpo_config.py * Update deepspeed_zero1.yaml * Update trl/trainer/grpo_trainer.py Co-authored-by: lewtun <[email protected]> * Apply suggestions from code review Co-authored-by: lewtun <[email protected]> * Update docs/source/grpo_trainer.md --------- Co-authored-by: lewtun <[email protected]> Co-authored-by: Kashif Rasul <[email protected]>
- Loading branch information
1 parent
4659ad9
commit ed14ed9
Showing
7 changed files
with
283 additions
and
34 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.