Dockerfile: use fixed vllm-provided nccl version #23

Merged on May 14, 2024 (1 commit)

Conversation

dtrifiro

nccl versions > 2.18 have a bug that greatly increases memory overhead, so a specific nccl version has to be installed.

See NVIDIA/nccl#1234 and vllm-project/vllm-nccl#1
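
As a rough illustration of the version constraint described above, here is a minimal sketch of a check that flags an nccl version above the 2.18 line. The helper names `parse_version` and `exceeds_pin` are hypothetical and not part of this PR, vLLM, or nccl. The sketch assumes plain dotted numeric version strings and reads "> 2.18" as "any major.minor release above 2.18".

```python
def parse_version(text):
    """Turn a dotted version string such as '2.18.1' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def exceeds_pin(version, ceiling=(2, 18)):
    """Return True when the major.minor part of `version` is above `ceiling`.

    Assumption: every patch release of 2.18 is acceptable, and anything
    from 2.19 onward is treated as affected by the memory overhead bug.
    """
    return parse_version(version)[:2] > ceiling


if __name__ == "__main__":
    # Hypothetical version strings, for illustration only.
    for candidate in ("2.17.1", "2.18.1", "2.19.3"):
        status = "above the 2.18 line" if exceeds_pin(candidate) else "ok"
        print(candidate, status)
```

A check like this could run as a build-time sanity step, which is one way to notice that a dependency bump has pulled in a newer nccl than intended.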

@openshift-ci openshift-ci bot requested review from heyselbi and rpancham May 14, 2024 13:53
@dtrifiro dtrifiro requested review from z103cb and removed request for heyselbi and rpancham May 14, 2024 13:53

@z103cb z103cb left a comment

/lgtm
/approve

openshift-ci bot commented May 14, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dtrifiro, z103cb

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-merge-bot openshift-merge-bot bot merged commit 3f5757e into opendatahub-io:ibm_main May 14, 2024
3 checks passed
@dtrifiro dtrifiro mentioned this pull request May 15, 2024
z103cb pushed a commit to z103cb/opendatahub_vllm that referenced this pull request May 16, 2024
…ubi (opendatahub-io#23)

Changes:
- vLLM v0.4.2 was published today, update our build to use pre-built libs from their wheel
- bump other dependencies in the image build (base UBI image, miniforge, flash attention, grpcio-tools, accelerate)
- little cleanup to remove `PYTORCH_` args that are no longer used

---------

Signed-off-by: Travis Johnson <[email protected]>
@dtrifiro dtrifiro deleted the use-vllm-nccl branch May 16, 2024 09:51
prarit pushed a commit to prarit/vllm that referenced this pull request Oct 18, 2024
Removed HIP specific matvec logic that is duplicated from tuned_gemm.py and doesn't support bf16