This repository has been archived by the owner on Oct 11, 2024. It is now read-only.

update install commands #264

Merged 2 commits on May 23, 2024
9 changes: 5 additions & 4 deletions Dockerfile
@@ -123,6 +123,11 @@ RUN --mount=type=bind,from=build,src=/workspace/dist,target=/vllm-workspace/dist \
--mount=type=cache,target=/root/.cache/pip \
pip install dist/*.whl --verbose

+# UPSTREAM SYNC: Install sparsity extras
+RUN --mount=type=bind,from=build,src=/workspace/dist,target=/vllm-workspace/dist \
+--mount=type=cache,target=/root/.cache/pip \
+pip install nm-magic-wand-nightly --extra-index-url https://pypi.neuralmagic.com/simple
+
RUN --mount=type=bind,from=flash-attn-builder,src=/usr/src/flash-attention-v2,target=/usr/src/flash-attention-v2 \
--mount=type=cache,target=/root/.cache/pip \
pip install /usr/src/flash-attention-v2/*.whl --no-cache-dir
@@ -135,10 +140,6 @@ FROM vllm-base AS test

ADD . /vllm-workspace/

-# UPSTREAM SYNC: Install sparsity extras
-RUN --mount=type=cache,target=/root/.cache/pip \
-pip3 install nm-magic-wand-nightly
-
# install development dependencies (for testing)
RUN --mount=type=cache,target=/root/.cache/pip \
pip install -r requirements-dev.txt
4 changes: 2 additions & 2 deletions README.md
@@ -12,12 +12,12 @@ The [nm-vllm PyPi package](https://pypi.org/project/nm-vllm/) includes pre-compi

Install it using pip:
```bash
-pip install nm-vllm
+pip install nm-vllm --extra-index-url https://pypi.neuralmagic.com/simple
```

For utilizing weight-sparsity kernels, such as through `sparsity="sparse_w16a16"`, you can extend the installation with the `sparsity` extras:
```bash
-pip install nm-vllm[sparse]
+pip install nm-vllm[sparse] --extra-index-url https://pypi.neuralmagic.com/simple
```

You can also build and install `nm-vllm` from source (this will take ~10 minutes):

@@ -5386,7 +5386,7 @@
}
],
"source": [
-"!pip install nm-vllm[sparse]"
+"!pip install nm-vllm[sparse] --extra-index-url https://pypi.neuralmagic.com/simple"
]
},
{
@@ -6219,4 +6219,4 @@
}
}
]
-}
+}

@@ -1506,7 +1506,7 @@
}
],
"source": [
-"!pip install nm-vllm"
+"!pip install nm-vllm --extra-index-url https://pypi.neuralmagic.com/simple"
]
},
{

@@ -7899,7 +7899,7 @@
{
"cell_type": "code",
"source": [
-"!pip install nm-vllm[sparse]"
+"!pip install nm-vllm[sparse] --extra-index-url https://pypi.neuralmagic.com/simple"
],
"metadata": {
"id": "Sz2Cs4BtPa7_"
@@ -8062,4 +8062,4 @@
"outputs": []
}
]
-}
+}
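
Note (not part of the diff): every changed install command gets the same `--extra-index-url` flag, so the pattern can be sketched once. This is a minimal illustration, assuming the index URL from the diff; `build_install_command` is a hypothetical helper, not something in the repository, and it only builds the command rather than running it.

```python
import shlex
import sys

# Index URL taken from the install commands in this diff.
EXTRA_INDEX_URL = "https://pypi.neuralmagic.com/simple"


def build_install_command(package="nm-vllm", extras=()):
    """Return a pip argv list for the package, optionally with extras.

    Hypothetical helper for illustration only.
    """
    requirement = package
    if extras:
        # e.g. ("sparse",) turns "nm-vllm" into "nm-vllm[sparse]"
        requirement += "[" + ",".join(extras) + "]"
    return [
        sys.executable, "-m", "pip", "install", requirement,
        "--extra-index-url", EXTRA_INDEX_URL,
    ]


if __name__ == "__main__":
    # Print the commands instead of running them, so nothing is installed here.
    print(shlex.join(build_install_command()))
    print(shlex.join(build_install_command(extras=("sparse",))))
```

To actually install, the returned list could be passed to `subprocess.run(cmd, check=True)`. Building an argv list rather than a shell string avoids quoting problems with the square brackets in `nm-vllm[sparse]`.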