Resize embeds (with Deepspeed) is still not fixed in version 4.43.3 #32287
Comments
cc @ArthurZucker, I do confirm that previous PR is included in both v4.43.2 and v4.43.3, could you see what's acting up here?
@LysandreJik @ArthurZucker The issue is resolved by PR #32214, but this PR was not included in the patch (4.43.3). Please note that the patch notes mention this PR, but the actual patch does not include it. Could you please do some additional verification? Relevant PRs: #32192, #32214 (continuation of #32192)
May I ask for further follow-up on this issue? Any additional validation or assistance would be greatly appreciated.
Hey! Yes, I either forgot it, or there was an issue that was introduced on
There will be a release tomorrow!
Unless you need this urgently today!
@ArthurZucker I think the sooner, the better! The issue has been active for two weeks in the official releases, despite the patch notes indicating it is resolved. I also prefer the official release due to reproducibility concerns.
I just wanted to send a friendly reminder! If @ArthurZucker doesn't have the bandwidth to handle this issue, any additional assistance from other maintainers would be greatly appreciated.
Sorry sir! It was my bad, I should have done a patch instantly. I was supposed to do a release on Friday, but I'm pushing it back to today / Wednesday. So I'll patch in a bit!
https://github.com/huggingface/transformers/commits/v4.43.4/ Sorry for the delay.
Thank you so much for your consistent follow-up and for patching the issue, @ArthurZucker! I confirmed that the test script works as expected in the newer version 😄
Thanks for being thorough on your side! 🤗
I'm still getting this issue in version 4.45.2. Any update on this?
Can you open a new issue with a new script? The one shared by @seokhyunan does not produce the error. It might be something in your settings?
System Info
Who can help?
@ArthurZucker @LysandreJik
Information
Tasks
examples folder (such as GLUE/SQuAD, ...)
Reproduction
resize_token_embeddings still does not work in version 4.43.3. Although PR #32214 resolved the issue, it seems that the actual patch does not include this PR, despite being mentioned in the latest patch notes. I confirmed that the test scripts below still do not work in 4.43.3 but do work in the main branch that includes the PR.
If I resize the token embedding to be greater than or equal to the original vocab size, vocab_size is set to zero. Otherwise, another error occurs: RuntimeError: start (0) + length (525336576) exceeds dimension size (524943360).
test.sh:
test.py:
test_ds_config.conf:
output:
Expected behavior
Correctly update vocab_size.
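For anyone reading the RuntimeError quoted in the reproduction, here is a rough sketch of how its two element counts can be interpreted. It assumes an embedding width (hidden size) of 4096. That is an assumption, since the report does not name the model, and the helper `rows` is purely illustrative. Under that assumption both numbers divide evenly into row counts, which would be consistent with a flattened embedding weight being sliced with the old, larger row count after a resize to a smaller vocabulary. This is one reading of the message, not a diagnosis confirmed in the thread.

```python
# Element counts quoted in the RuntimeError from the report.
requested = 525_336_576  # "length" in the error message
available = 524_943_360  # "dimension size" in the error message

# ASSUMPTION: hidden size of 4096; the report does not name the model.
hidden_size = 4096


def rows(numel, width):
    """Return the row count if numel divides evenly by width, else None."""
    quotient, remainder = divmod(numel, width)
    return quotient if remainder == 0 else None


requested_rows = rows(requested, hidden_size)
available_rows = rows(available, hidden_size)

# Under the assumed width, the gap between the two is a whole number of rows.
print(requested_rows, available_rows, requested_rows - available_rows)
# prints: 128256 128160 96
```

If the width assumption does not match the model being used, `rows` returns None for at least one of the counts, which is a quick sign that a different hidden size should be tried.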