[Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode #4628
Merged
LiuXiaoxuanPKU merged 47 commits into vllm-project:main from LiuXiaoxuanPKU:flashinfer-prefill on Jun 28, 2024
+313 -117
Commits
Commits on May 6, 2024
- committed by LiuXiaoxuanPKU
- committed by LiuXiaoxuanPKU
Commits on May 7, 2024
- committed by LiuXiaoxuanPKU
- committed by LiuXiaoxuanPKU
- committed by LiuXiaoxuanPKU
Commits on May 28, 2024
- committed by LiuXiaoxuanPKU
Commits on May 30, 2024
- committed by LiuXiaoxuanPKU
Commits on Jun 4, 2024
- committed by LiuXiaoxuanPKU
- committed by LiuXiaoxuanPKU
- committed by LiuXiaoxuanPKU
Commits on Jun 5, 2024
- committed by LiuXiaoxuanPKU
Commits on Jun 11, 2024
Commits on Jun 13, 2024
- committed
- committed
- committed
- committed
Commits on Jun 14, 2024
Commits on Jun 17, 2024
- committed
- committed
Commits on Jun 18, 2024
Commits on Jun 19, 2024
- committed
Commits on Jun 20, 2024
- committed
- committed
- committed
- committed
- committed
- committed
- committed
Commits on Jun 21, 2024
- committed
- committed
- committed
Commits on Jun 22, 2024
- committed
Commits on Jun 23, 2024
Commits on Jun 25, 2024
- committed
- committed
Commits on Jun 26, 2024
Commits on Jun 27, 2024
- committed
- committed