[Models] Support Qwen model with PP #6974

andoorve · 2024-07-31T06:41:24Z

Adds support for QWen 1 model. Tested locally using test_pipeline_parallel.py with 1.8B

github-actions · 2024-07-31T06:41:37Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by unblocking the steps in your fast-check build on Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

andoorve · 2024-08-01T01:15:04Z

/ready

andoorve · 2024-08-01T01:17:01Z

@youkaichao PTAL!

youkaichao

thanks for the hard working!

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Signed-off-by: Muralidhar Andoorveedu <[email protected]> Signed-off-by: Alvant <[email protected]>

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

andoorve mentioned this pull request Jul 31, 2024

[Model] Pipeline parallel support for Qwen2 #6924

Merged

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 1, 2024

youkaichao approved these changes Aug 1, 2024

View reviewed changes

andoorve and others added 8 commits August 1, 2024 06:34

Support Qwen model

8b6c5de

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Add prefix

9987309

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Change prefix

1d50eca

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Fix name

3556334

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Fix name

0f5b096

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Format

9501b11

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Format

f5702dd

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Update distributed_serving.rst

ca1cadf

andoorve force-pushed the qwen-pp branch from c870596 to ca1cadf Compare August 1, 2024 06:35

andoorve added 2 commits August 1, 2024 06:42

Revert config

06aa6f0

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Add QWenLMHeadModel to supported models

ec09d4d

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

andoorve mentioned this pull request Aug 1, 2024

[Feature]: Pipeline parallelism support for qwen model #6471

Closed

youkaichao merged commit fc912e0 into vllm-project:main Aug 1, 2024
61 of 63 checks passed

andoorve deleted the qwen-pp branch August 1, 2024 19:48

dtrifiro mentioned this pull request Aug 5, 2024

Sync with [email protected] opendatahub-io/vllm#120

Closed

kylesayrs pushed a commit to neuralmagic/vllm that referenced this pull request Aug 17, 2024

[Models] Support Qwen model with PP (vllm-project#6974)

7089f47

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Models] Support Qwen model with PP (vllm-project#6974)

0fb98ae

Signed-off-by: Muralidhar Andoorveedu <[email protected]> Signed-off-by: Alvant <[email protected]>

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Models] Support Qwen model with PP (vllm-project#6974)

63dd014

Signed-off-by: Muralidhar Andoorveedu <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Models] Support Qwen model with PP #6974

[Models] Support Qwen model with PP #6974

andoorve commented Jul 31, 2024 •

edited

Loading

github-actions bot commented Jul 31, 2024

andoorve commented Aug 1, 2024

andoorve commented Aug 1, 2024

youkaichao left a comment •

edited

Loading

[Models] Support Qwen model with PP #6974

[Models] Support Qwen model with PP #6974

Conversation

andoorve commented Jul 31, 2024 • edited Loading

github-actions bot commented Jul 31, 2024

andoorve commented Aug 1, 2024

andoorve commented Aug 1, 2024

youkaichao left a comment • edited Loading

Choose a reason for hiding this comment

andoorve commented Jul 31, 2024 •

edited

Loading

youkaichao left a comment •

edited

Loading