Skip to content

[V1][WIP] Hybrid allocator for full attention & sliding window attention interleaved models (Reference PR, do not merge)#11938

Draft
heheda12345 wants to merge 15 commits intovllm-project:mainfrom heheda12345:sliding_self_interleave

Commits

Commits on Dec 27, 2024

Commits on Dec 28, 2024

Commits on Dec 30, 2024

Commits on Dec 31, 2024

Commits on Jan 2, 2025

Commits on Jan 10, 2025