Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add batched Llama model definition using vLLM paged attention (mlc-ai…
…#1134) * Add batched Llama model with vllm paged attention * update core.py * doc * minor * add e2e test * mv file * clean * Check if TVM has been built with USE_VLLM * update BuildArgs docstring
- Loading branch information