Uploading/Sharing large models to HuggingFace #7398

Closed
MXueguang opened this issue Sep 25, 2020 · 7 comments · Fixed by #8324

Comments

@MXueguang

Hi,

I am trying to upload a t5-3b based model to HuggingFace. The folder to upload is 11G.
When I upload it, the upload fails with 'Connection aborted.', BrokenPipeError(32, 'Broken pipe').
Is this because the model is too large and there is a size limit? How can I deal with that?
Thank you for your help!

@julien-c
Member

There is no limit on file sizes on the model hub. However, for uploads that large, even a slightly unstable connection can indeed make the upload fail.

If you have another host (an S3 bucket or similar) that you can upload the file to, I can handle copying it to your namespace on huggingface.co.

@MXueguang
Author

MXueguang commented Sep 25, 2020

Actually, it aborts at the very beginning of the upload process for the large file, every time. All my other, smaller models uploaded smoothly, so I suspect it is not a network issue on my side.
My pytorch_model.bin is about 11G. I tried using truncate to shrink the file and noticed that the upload keeps aborting until I truncate the file to 5G.
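
The truncation experiment described above could be sketched roughly as follows. This is only an illustration, not the commands that were actually run: `make_truncated_copy`, `largest_passing_size`, and the `succeeds` callback (which would wrap a real upload attempt) are hypothetical names, and the search assumes that every size up to some threshold uploads fine and every size above it fails.

```python
import os


def make_truncated_copy(src_path, dst_path, size_bytes, chunk_size=1024 * 1024):
    """Copy the first size_bytes bytes of src_path into dst_path.

    Returns the size of the resulting file, which is smaller than
    size_bytes if the source file is shorter than that.
    """
    remaining = size_bytes
    with open(src_path, "rb") as src, open(dst_path, "wb") as dst:
        while remaining > 0:
            chunk = src.read(min(chunk_size, remaining))
            if not chunk:  # reached the end of the source file
                break
            dst.write(chunk)
            remaining -= len(chunk)
    return os.path.getsize(dst_path)


def largest_passing_size(succeeds, low, high):
    """Binary-search the largest size in [low, high] for which succeeds(size) is True.

    succeeds is a hypothetical callback, e.g. one that builds a truncated
    copy of that size and tries to upload it. Assumes success is monotonic
    in size. Returns None if even the smallest size fails.
    """
    if not succeeds(low):
        return None
    while low < high:
        mid = (low + high + 1) // 2  # round up so the loop always makes progress
        if succeeds(mid):
            low = mid
        else:
            high = mid - 1
    return low
```

With a callback that truncates pytorch_model.bin to the requested number of gigabytes and attempts the upload, `largest_passing_size(callback, 1, 11)` would narrow down the size at which uploads start to fail in a handful of attempts instead of a manual sweep.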

@MXueguang
Author

MXueguang commented Sep 25, 2020

By the way, we have a public Google Cloud Storage host. Would that work for you if I am still unable to upload the model?

@julien-c
Member

I can indeed reproduce this. For now, can you upload to a GCS or S3 bucket and post the URL here, and I'll cp the file?

Will take a note to investigate/fix this in the future.

@MXueguang
Author

gs://ron-random/castorini/monot5-3b-med-msmarco/
gs://ron-random/castorini/monot5-3b-msmarco/

Could you help us cp these two models to our organization, castorini?

Thank you very much for your help!

@julien-c
Member

Here you go: https://huggingface.co/castorini

@julien-c
Member

Will close this for now, but we are tracking the "large file upload" issue internally.

@julien-c julien-c linked a pull request Nov 5, 2020 that will close this issue