Skip to content

[Bugfix] Fix KV head calculation for MPT models when using GQA#5142

Merged
WoosukKwon merged 1 commit intovllm-project:mainfrom bfontain:mainJun 17, 2024

Commits

Commits on May 30, 2024