Commit

hey ho let's go
kzawora-intel committed Feb 24, 2025
1 parent 83b568f commit 7ad7226
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions vllm/v1/worker/hpu_model_runner.py
@@ -654,6 +654,7 @@ def __init__(
         self.max_prefill_batch_size = 16 # TODO(kzawora): add knob for that
         self.padding_aware_scheduling = True # TODO(kzawora): add knob for that
         self.padding_ratio_threshold = 0.9 # TODO(kzawora): add knob for that
+        os.environ['VLLM_CONTIGUOUS_PA'] = 'false' # NOTE(kzawora): this is a workaround
         self.use_contiguous_pa = os.environ.get('VLLM_CONTIGUOUS_PA',
                                                 'true').lower() == 'true'
         self.seen_configs: set = set()

Check failure on line 658 in vllm/v1/worker/hpu_model_runner.py
GitHub Actions / pre-commit
Ruff (E501)
vllm/v1/worker/hpu_model_runner.py:658:81: E501 Line too long (82 > 80)
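
The effect of the added line can be sketched in isolation. This is a minimal illustration, not code from the repository: the helper name `read_contiguous_pa` is hypothetical and only wraps the `os.environ.get(...)` expression shown in the hunk. It assumes nothing else touches `VLLM_CONTIGUOUS_PA` between the assignment and the read.

```python
import os


def read_contiguous_pa() -> bool:
    # Hypothetical helper: same expression as in the hunk.
    # Defaults to 'true' and compares case-insensitively.
    return os.environ.get('VLLM_CONTIGUOUS_PA', 'true').lower() == 'true'


# Variable unset: the default 'true' applies.
os.environ.pop('VLLM_CONTIGUOUS_PA', None)
default_value = read_contiguous_pa()

# Variable set by the user: the comparison ignores case.
os.environ['VLLM_CONTIGUOUS_PA'] = 'TRUE'
user_value = read_contiguous_pa()

# The commit assigns 'false' unconditionally right before the read,
# so any earlier user-supplied value is overwritten.
os.environ['VLLM_CONTIGUOUS_PA'] = 'false'
forced_value = read_contiguous_pa()
```

Because the assignment is unconditional, `self.use_contiguous_pa` ends up `False` in this sketch regardless of what the caller exported beforehand, which matches the "workaround" note in the added line.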
