[Core] Support tensor parallelism for GGUF quantization #7520

Merged
mgoin merged 9 commits into vllm-project:main from Isotr0py:gguf-tp on Aug 19, 2024

Commits

Commits on Aug 9, 2024

Commits on Aug 14, 2024

Commits on Aug 15, 2024

Commits on Aug 16, 2024

Commits on Aug 18, 2024