[Feature]: Pipeline Parallelism support for the Vision Language Models #7684
Comments
You can follow #7168 to add some if you want.
Hi @youkaichao, thanks a lot for the response. Is there any way to know when the PR will be merged? It doesn't seem to be updated with the latest changes from the main branch. Additionally, could you suggest any other frameworks that support pipeline parallelism? I need to implement the scale-out (pipeline parallelism) feature in my project as soon as possible. Thanks again!
@youkaichao, any ideas on frameworks other than vLLM that support pipeline parallelism?
Sorry, I don't know.
The rest of the vision language models still don't support PP yet, so I'm reopening this. |
🚀 The feature, motivation and pitch
If I am not mistaken, vLLM currently supports pipeline parallelism only for language models, not for vision-language models.
NotImplementedError: Pipeline parallelism is only supported for the following architectures: ['AquilaModel', 'AquilaForCausalLM', 'DeepseekV2ForCausalLM', 'InternLMForCausalLM', 'JAISLMHeadModel', 'LlamaForCausalLM', 'LLaMAForCausalLM', 'MistralForCausalLM', 'Phi3ForCausalLM', 'GPT2LMHeadModel', 'MixtralForCausalLM', 'NemotronForCausalLM', 'Qwen2ForCausalLM', 'Qwen2MoeForCausalLM', 'QWenLMHeadModel'].
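The gate behind this error can be sketched as a simple membership check against the supported-architecture list from the message above. This is a minimal illustrative sketch, not vLLM's actual implementation; the `PP_SUPPORTED_ARCHS` set and `check_pp_support` helper are hypothetical names.

```python
# Hypothetical sketch of the architecture gate that raises the
# NotImplementedError above. The set mirrors the error message;
# check_pp_support is not part of vLLM's public API.
PP_SUPPORTED_ARCHS = {
    "AquilaModel", "AquilaForCausalLM", "DeepseekV2ForCausalLM",
    "InternLMForCausalLM", "JAISLMHeadModel", "LlamaForCausalLM",
    "LLaMAForCausalLM", "MistralForCausalLM", "Phi3ForCausalLM",
    "GPT2LMHeadModel", "MixtralForCausalLM", "NemotronForCausalLM",
    "Qwen2ForCausalLM", "Qwen2MoeForCausalLM", "QWenLMHeadModel",
}

def check_pp_support(architectures):
    """Raise if any requested architecture lacks pipeline-parallel support."""
    unsupported = [a for a in architectures if a not in PP_SUPPORTED_ARCHS]
    if unsupported:
        raise NotImplementedError(
            "Pipeline parallelism is only supported for the following "
            f"architectures: {sorted(PP_SUPPORTED_ARCHS)}; "
            f"got unsupported: {unsupported}"
        )
```

A vision-language architecture such as a LLaVA-style model would fail this check, while a plain language model like `LlamaForCausalLM` would pass.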
This feature would greatly benefit teams and projects working with vision-language models, allowing them to scale out their workloads efficiently and maintain performance as model sizes continue to grow.
It would also be very helpful if someone could point me to other options for pipeline parallelism. Thanks in advance.
Alternatives
No response
Additional context
No response