Skip to content

Latest commit

 

History

History
34 lines (24 loc) · 811 Bytes

README.md

File metadata and controls

34 lines (24 loc) · 811 Bytes

Spyre plugin for vLLM

The vLLM Spyre plugin (vllm-spyre) is a dedicated backend extension that enables seamless integration of IBM Spyre Accelerator with vLLM. It follows the architecture describes in vLLM's Plugin System, making it easy to integrate IBM's advanced AI acceleration into existing vLLM workflows.

Installation

With Docker

First, download vllm-spyre

git clone https://github.com/IBM/vllm-spyre
cd vllm-spyre

Build image from source

docker build . -f Dockerfile.spyre -t vllm-spyre
docker run -it --rm vllm-spyre bash

In a local environment

# Install vllm
pip install vllm==0.7.3

# Install vllm-spyre
cd ..
git clone https://github.com/IBM/vllm-spyre.git
cd vllm-spyre
pip install -v -e .