[Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode #4628

Merged
LiuXiaoxuanPKU merged 47 commits into vllm-project:main from LiuXiaoxuanPKU:flashinfer-prefill on Jun 28, 2024

Commits

Commits on May 6, 2024

Commits on May 7, 2024

Commits on May 28, 2024

Commits on May 29, 2024

Commits on May 30, 2024

Commits on Jun 4, 2024

Commits on Jun 5, 2024

Commits on Jun 11, 2024

Commits on Jun 13, 2024

Commits on Jun 14, 2024

Commits on Jun 17, 2024

Commits on Jun 18, 2024

Commits on Jun 19, 2024

Commits on Jun 20, 2024

Commits on Jun 21, 2024

Commits on Jun 22, 2024

Commits on Jun 23, 2024

Commits on Jun 25, 2024

Commits on Jun 26, 2024

Commits on Jun 27, 2024

Commits on Jun 28, 2024