-
Notifications
You must be signed in to change notification settings - Fork 527
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to convert fine-tuned .pt to huggingface .safetensors #2118
Comments
hey @yuzhenmao , I am sorry that you are having issues with it. We are about to land this PR that will solve this problem for our users: #2074 Is it ok if you wait until the EoD and retrain the model? If you need it right away or retraining is a big lift, then you can check this similar issue and follow the steps: #2048 |
Hi @felipemello1, thank you for the reply! I can wait until the EoD. Please keep me posted, thanks! |
hey @yuzhenmao , PR is merged: #2074 Now it should be much easier to use vllm/huggingface. Instructions are in the readme. We will update the docs soon. Let us know if you find any issues and thanks for your patience :). |
Hi @felipemello1, thank you so much for your help! I have a quick question regarding the PR: In section 3.1 (Using Hugging Face), if an alternative approach involves something similar to 3.2—specifically, deleting adapter_model.safetensors from the folder—will Hugging Face then automatically use ft-model-00001-of-00001.safetensors? Thanks again! |
Np! Please feel free to ask more or reopen the issue. If you want to use the model (not adapter) with hf, there’s no need to delete the adapter. Hugginface will use the file index.safetensor.json to know which ckpt to load @yuzhenmao |
@yuzhenmao , I updated the PR description. There was an issue with the file name that was fixed today. You can use it with huggingface like this:
|
Hi @felipemello1, thank you so much for letting me know the update! |
Hi, I have successfully fine-tuned the Llama 3.1 model and obtained the fine-tuned checkpoint in the format
hf_model_000*_2.pt
. However, I haven’t been able to find a solution for converting the.pt
file into the.safetensors
format used in Huggingface. I have noticed some similar issues: #878 and #832. I would greatly appreciate any guidance on how to achieve this.The text was updated successfully, but these errors were encountered: