Skip to content

[Kernel] Optimize FP8 support for MoE kernel / Mixtral via static scales#4343

Merged
robertgshaw2-redhat merged 33 commits intovllm-project:mainfrom pcmoritz:mixtral-fp8-staticApr 27, 2024

Commits

Commits on Apr 24, 2024

Commits on Apr 25, 2024

Commits on Apr 26, 2024