Skip to content

[Kernel][Model] logits_soft_cap for Gemma2 with flashinfer#6051

Merged
LiuXiaoxuanPKU merged 15 commits intovllm-project:mainfrom LiuXiaoxuanPKU:flashinfer-logit-soft-capJul 4, 2024

Commits

Commits on Jul 1, 2024

Commits on Jul 2, 2024

Commits on Jul 3, 2024