Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a script for serving experiments & Collect system stats in scheduler #30

Merged
merged 36 commits into from
Apr 12, 2023

Conversation

WoosukKwon
Copy link
Collaborator

Example usage:

  • Generating a single completion: python benchmark/benchmark_text_completion.py --dataset alpaca_opt_text_completion.pkl --model facebook/opt-13b --request-rate 1.0 --duration 3600 --n1 1.0
  • Generating two completions in parallel: python benchmark/benchmark_text_completion.py --dataset alpaca_opt_text_completion.pkl --model facebook/opt-13b --request-rate 1.0 --duration 3600 --n2 1.0
  • Generating two completions with beam search: python benchmark/benchmark_text_completion.py --dataset alpaca_opt_text_completion.pkl --model facebook/opt-13b --request-rate 1.0 --duration 3600 --n2-beam 1.0

@WoosukKwon WoosukKwon requested a review from zhuohan123 April 6, 2023 09:46
@WoosukKwon
Copy link
Collaborator Author

Merging this PR to main, as we have too many branches.

@WoosukKwon WoosukKwon merged commit 84eee24 into main Apr 12, 2023
@WoosukKwon WoosukKwon deleted the experiment branch April 12, 2023 22:04
slyalin pushed a commit to slyalin/vllm that referenced this pull request Apr 19, 2024
…ce_artifacts

Revert "Produce artifacts for bare metal installation in Dockerfile.openvino"
dtrifiro pushed a commit to dtrifiro/vllm that referenced this pull request May 21, 2024
This PR logs all errors during validation or generation
for a request like TGIS does. 

Signed-off-by: Joe Runde <[email protected]>
z103cb pushed a commit to dtrifiro/vllm that referenced this pull request May 21, 2024
…ensions

Dockerfile.ubi: get rid of prebuilt-wheel stage
tianyil1 pushed a commit to tianyil1/vllm that referenced this pull request Jun 5, 2024
…um_wa

WA: Disable cumsum in HPU _prepare_prompt
fxmarty pushed a commit to fxmarty/vllm-public that referenced this pull request Jun 12, 2024
@alixiaodi alixiaodi mentioned this pull request Aug 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant