Is it possible to convert a normal PyTorch checkpoint to a DeepSpeed-compatible checkpoint? #3027
My goal is to use DeepSpeed ZeRO inference on the bloomz model, or more generally on any model from the Hugging Face Hub. Specifically, I would like to download model weights from a Hub repo (e.g. bloomz) and convert them to DeepSpeed-compatible checkpoints with a script (basically the reverse of zero_to_fp32.py). Are there existing scripts for this or a similar purpose?
Replies: 1 comment 1 reply
This should work out of the box; please see https://github.com/huggingface/transformers-bloom-inference/blob/main/bloom-inference-scripts/bloom-ds-zero-inference.py
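For anyone who wants the gist without opening the script, here is a rough sketch of the pattern it follows, as far as I understand it: build a ZeRO stage 3 config, create an `HfDeepSpeedConfig` before calling `from_pretrained`, then wrap the model with `deepspeed.initialize`. The helper names `build_zero3_inference_config` and `load_zero3_model` are hypothetical, the bucket-size values are illustrative assumptions, and the import path of `HfDeepSpeedConfig` depends on your transformers version, so treat this as a starting point and not a drop-in replacement for the script.

```python
def build_zero3_inference_config(hidden_size, world_size=1, cpu_offload=False, dtype="bf16"):
    """Build a DeepSpeed ZeRO stage 3 config dict for inference (illustrative values)."""
    if dtype not in ("bf16", "fp16"):
        raise ValueError("dtype must be 'bf16' or 'fp16'")
    config = {
        "fp16": {"enabled": dtype == "fp16"},
        "bf16": {"enabled": dtype == "bf16"},
        "zero_optimization": {
            "stage": 3,  # partition parameters across ranks
            "overlap_comm": True,
            "contiguous_gradients": True,
            # Assumption: size the buckets from the model's hidden size.
            "reduce_bucket_size": hidden_size * hidden_size,
            "stage3_prefetch_bucket_size": int(0.9 * hidden_size * hidden_size),
            "stage3_param_persistence_threshold": 0,
        },
        "steps_per_print": 2000,
        "train_micro_batch_size_per_gpu": 1,
        # DeepSpeed expects the global batch size even for inference-only use.
        "train_batch_size": 1 * world_size,
        "wall_clock_breakdown": False,
    }
    if cpu_offload:
        # Optional: keep partitioned parameters in CPU memory.
        config["zero_optimization"]["offload_param"] = {"device": "cpu", "pin_memory": True}
    return config


def load_zero3_model(model_name, ds_config, dtype="bf16"):
    """Load a Hub checkpoint directly under ZeRO stage 3 (needs deepspeed, torch, transformers)."""
    import deepspeed
    import torch
    from transformers import AutoModelForCausalLM

    try:  # newer transformers releases
        from transformers.integrations import HfDeepSpeedConfig
    except ImportError:  # older releases
        from transformers.deepspeed import HfDeepSpeedConfig

    torch_dtype = {"bf16": torch.bfloat16, "fp16": torch.float16}[dtype]
    # Must be created before from_pretrained and kept alive by the caller,
    # so that the weights are partitioned while they are being loaded.
    hf_ds_config = HfDeepSpeedConfig(ds_config)
    model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch_dtype)
    engine = deepspeed.initialize(model=model, config_params=ds_config)[0]
    engine.module.eval()
    return engine.module, hf_ds_config
```

In use, you would read `hidden_size` from the model's config (for example via `AutoConfig.from_pretrained(model_name).hidden_size`), pass the resulting dict to `load_zero3_model`, and start the script with the `deepspeed` launcher, e.g. `deepspeed --num_gpus 1 your_script.py`. The point is that the regular Hub checkpoint is loaded as-is, so no separate conversion script should be needed.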