Commit

fix2
comaniac committed Jul 9, 2024
1 parent 5b032d1 commit 546504b
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion vllm/worker/model_runner.py
@@ -245,7 +245,8 @@ def add_seq_group(self, seq_group_metadata: SequenceGroupMetadata):
 # Note that prefix caching does not support sliding window.
 prefix_cache_hit = (computed_block_nums is not None
                     and len(computed_block_nums) > 0
-                    and self.sliding_window is None)
+                    and self.sliding_window is None
+                    and not is_prompt)
 if self.chunked_prefill_enabled and prefix_cache_hit:
     raise RuntimeError(
         "chunked prefill cannot be used with prefix caching now.")
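
The condition after this change can be read as a standalone predicate. The sketch below is only an illustration of the boolean logic shown in the hunk: the function names `is_prefix_cache_hit` and `check_chunked_prefill` are hypothetical and do not exist in `vllm/worker/model_runner.py`, and the arguments stand in for the local variable and attributes used in `add_seq_group`.

```python
from typing import Optional, Sequence


def is_prefix_cache_hit(computed_block_nums: Optional[Sequence[int]],
                        sliding_window: Optional[int],
                        is_prompt: bool) -> bool:
    """Hypothetical helper mirroring the updated condition in the hunk."""
    return (computed_block_nums is not None
            and len(computed_block_nums) > 0
            # Prefix caching does not support sliding window.
            and sliding_window is None
            # Term added by this commit.
            and not is_prompt)


def check_chunked_prefill(chunked_prefill_enabled: bool,
                          prefix_cache_hit: bool) -> None:
    """Hypothetical helper mirroring the guard that follows the condition."""
    if chunked_prefill_enabled and prefix_cache_hit:
        raise RuntimeError(
            "chunked prefill cannot be used with prefix caching now.")


if __name__ == "__main__":
    # Same cached blocks, differing only in the is_prompt flag.
    print(is_prefix_cache_hit([0, 1], None, is_prompt=False))
    print(is_prefix_cache_hit([0, 1], None, is_prompt=True))
```

With the extra term, a sequence group for which `is_prompt` is true never counts as a prefix cache hit in this sketch, so the `RuntimeError` guard cannot fire for it.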