Skip to content
This repository has been archived by the owner on Oct 11, 2024. It is now read-only.

Upstream sync 2024 04 12 #183

Merged
merged 44 commits into from
Apr 15, 2024
Merged

Upstream sync 2024 04 12 #183

merged 44 commits into from
Apr 15, 2024

Conversation

andy-neuma
Copy link
Member

@andy-neuma andy-neuma commented Apr 12, 2024

SUMMARY:

  • upstream sync from "vllm" using cherry-pick. range b4543c8..7fd3949a0b1c6cd0dcd7066aca48d9d589f2f68e

NOTES:
vllm current 7fd3949 ... 7fd3949a0b1c6cd0dcd7066aca48d9d589f2f68e
vllm from last upstream sync b4543c8 ... b4543c8f6bf67a7f1a0d6d0fd6cf5697c7eeaabb

COMMAND:

git cherry-pick b4543c8..7fd3949a0b1c6cd0dcd7066aca48d9d589f2f68e

TEST PLAN:
runs on remote push and will trigger a NIGHTLY

egortolmachev and others added 30 commits April 12, 2024 09:03
…project#3643)

Co-authored-by: jpvillam <[email protected]>
Co-authored-by: Gregory Shtrasberg <[email protected]>
Co-authored-by: Woosuk Kwon <[email protected]>
…ect#3950)

[WIP][Core][Refactor] move vllm/model_executor/parallel_utils into vllm/distributed and vllm/device_communicators (vllm-project#3950)
)

[Core][Model] Use torch.compile to accelerate layernorm in commandr (vllm-project#3985)
…m-project#4014)

[Core][Distributed] make init_distributed_environment compatible with init_process_group (vllm-project#4014)
@robertgshaw2-redhat
Copy link
Collaborator

There was a change to the API for create_weights and apply_weights. Updated SparseW16A16LinearMethod to conform to this API.

Passing failed tests locally.

@SageMoore SageMoore merged commit c46ca5e into main Apr 15, 2024
6 of 9 checks passed
@SageMoore SageMoore deleted the upstream-sync-2024-04-12 branch April 15, 2024 13:26
@robertgshaw2-redhat robertgshaw2-redhat restored the upstream-sync-2024-04-12 branch April 15, 2024 13:34
@robertgshaw2-redhat
Copy link
Collaborator

just going to keep these branches around

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.