Limit position embeddings in inference #1598
Conversation
…rom (huggingface#57) model code. Co-authored-by: Adam Stachowicz <[email protected]>
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
LGTM.
@bhargaveede Can you also address the unresolved comments in #1501 please? At least move parallel_state.py to optimum/habana/distributed. Using more recent versions of Llama for CI can be done after the release, but moving parallel_state.py should be done before the release; otherwise it will be a breaking change in the release after, and we should avoid that. It's just a matter of moving one file and updating a couple of imports.
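To illustrate the kind of change involved, here is a minimal sketch of the import update; the exact previous location of parallel_state.py is an assumption, only the target path optimum/habana/distributed is stated in the comment above.

```python
# Hypothetical before/after import for the file move.

# Before (assumed: parallel_state.py lived alongside the example scripts):
# import parallel_state

# After (parallel_state.py moved into the package, per the review comment):
from optimum.habana.distributed import parallel_state
```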
Moving the max_position_embeddings truncation out of the model code, as it interferes with training.
Moved it to text-generation/utils.py, which is a better place for this inference-only logic.
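As a rough illustration of the idea, here is a minimal, self-contained sketch of limiting generation length by the model's max_position_embeddings at inference time rather than inside the model code. The helper name and signature are hypothetical, not the actual utils.py code.

```python
def clamp_generation_length(input_length: int, max_new_tokens: int,
                            max_position_embeddings: int) -> int:
    """Return a max_new_tokens value that keeps total positions in range.

    Hypothetical helper: the total sequence length (prompt + generated
    tokens) must not exceed the number of position embeddings the model
    was configured with, so the token budget is clamped accordingly.
    """
    budget = max_position_embeddings - input_length
    return max(0, min(max_new_tokens, budget))

# Example: a 4096-position model with a 4000-token prompt can generate
# at most 96 new tokens, even if 200 were requested.
assert clamp_generation_length(4000, 200, 4096) == 96
```

Doing this clamping in the generation utilities keeps the limit out of the training path, which is the motivation stated above.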
Fixes # (issue)
Before submitting