[Neuron] Add custom_ops for neuron backend #13246

liangfu · 2025-02-13T23:48:47Z

As part of the effort in supporting vLLM V1 architecture for neuron backend (#11152), this PR intent to support activation, layernorm, rotary_embedding and logits_processor as a variant of existing modules that derive from CustomOp class.

These changes are tested in individual test cases that are ported from tests/kernels directory to be neuron backend specific.

Co-authored-by: George Novack (@gnovack)
Co-authored-by: Aoyu Zhang (@AoyuQC)

github-actions · 2025-02-13T23:48:58Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

tests/neuron/test_activation.py

tests/neuron/test_rotary_embedding.py

vllm/model_executor/layers/activation.py

vllm/model_executor/layers/rotary_embedding.py

mergify · 2025-02-21T18:45:10Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @liangfu.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Co-authored-by: George Novack <[email protected]> Co-authored-by: Aoyu Zhang <[email protected]> Signed-off-by: Liangfu Chen <[email protected]>

gnovack

Thanks @liangfu! lgtm

liangfu force-pushed the custom-op branch 2 times, most recently from ebef082 to ef47583 Compare February 17, 2025 22:57

liangfu marked this pull request as ready for review February 17, 2025 22:58

liangfu force-pushed the custom-op branch 2 times, most recently from 730bcc2 to 6b0f132 Compare February 18, 2025 07:07

gnovack reviewed Feb 21, 2025

View reviewed changes

tests/neuron/test_activation.py Outdated Show resolved Hide resolved

tests/neuron/test_rotary_embedding.py Show resolved Hide resolved

vllm/model_executor/layers/activation.py Outdated Show resolved Hide resolved

vllm/model_executor/layers/rotary_embedding.py Outdated Show resolved Hide resolved

mergify bot added the needs-rebase label Feb 21, 2025

liangfu mentioned this pull request Feb 21, 2025

[RFC][Exploratory]: vLLM Neuron Backend with V1 Architecture #11152

Open

6 tasks

liangfu force-pushed the custom-op branch from 6b0f132 to fe0f5f5 Compare February 25, 2025 00:49

mergify bot removed the needs-rebase label Feb 25, 2025

[Neuron] add custom_ops for neuron backend

c4ed25d

Co-authored-by: George Novack <[email protected]> Co-authored-by: Aoyu Zhang <[email protected]> Signed-off-by: Liangfu Chen <[email protected]>

liangfu force-pushed the custom-op branch from fe0f5f5 to c4ed25d Compare February 25, 2025 00:58

liangfu requested review from gnovack and aarondou February 25, 2025 01:03

gnovack approved these changes Feb 25, 2025

View reviewed changes

simon-mo merged commit f75aa72 into vllm-project:main Feb 25, 2025
19 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Neuron] Add custom_ops for neuron backend #13246

[Neuron] Add custom_ops for neuron backend #13246

liangfu commented Feb 13, 2025 •

edited by github-actions bot

Loading

github-actions bot commented Feb 13, 2025

mergify bot commented Feb 21, 2025

gnovack left a comment

[Neuron] Add custom_ops for neuron backend #13246

[Neuron] Add custom_ops for neuron backend #13246

Conversation

liangfu commented Feb 13, 2025 • edited by github-actions bot Loading

github-actions bot commented Feb 13, 2025

mergify bot commented Feb 21, 2025

gnovack left a comment

Choose a reason for hiding this comment

liangfu commented Feb 13, 2025 •

edited by github-actions bot

Loading