This repository has been archived by the owner on Oct 11, 2024. It is now read-only.
varun-sundar-rabindranath triggered nightly on refs/heads/main #49
nightly.yml
on: schedule
AWS-AVX2-32G-A10G-24G-Benchmark
/
BENCHMARK
6h 22m
NIGHTLY-MULTI
/
BUILD-TEST
3h 33m
NIGHTLY-SOLO
/
BUILD-TEST
3h 59m
AWS-AVX2-32G-A10G-24G-Accuracy
/
LM-EVAL
34m 47s
AWS-AVX2-32G-A10G-24G-Benchmark
/
NM_GH_ACTION_BENCHMARK
23s
Annotations
1 error and 1 warning
AWS-AVX2-32G-A10G-24G-Benchmark / NM_GH_ACTION_BENCHMARK
# :warning: **Performance Alert** :warning:
Possible performance regression was detected for benchmark **'smaller_is_better'**.
Benchmark result of this commit is worse than the previous benchmark result exceeding threshold `1.10`.
| Benchmark suite | Current: bdfdb774576b34b4cae98a200b146c19cd24d24c | Previous: 79f4e60848a43b6794fb489f422c7d45c4306ca4 | Ratio |
|-|-|-|-|
| `{"name": "median_ttft_ms", "description": "VLLM Serving - 2:4 Sparse\nmodel - neuralmagic/OpenHermes-2.5-Mistral-7B-pruned2.4\nmax-model-len - 4096\nsparsity - semi_structured_sparse_w16a16\nbenchmark_serving {\n \"nr-qps-pair_\": \"300,1\",\n \"dataset\": \"sharegpt\"\n}", "gpu_description": "NVIDIA A10G x 1", "vllm_version": "0.1.0", "python_version": "3.10.12 (main, Mar 7 2024, 18:39:53) [GCC 9.4.0]", "torch_version": "2.1.2+cu121"}` | `75.95981250051409` ms | `65.88031650062476` ms | `1.15` |
This comment was automatically generated by [workflow](https://github.com/neuralmagic/nm-vllm/actions?query=workflow%3ANightly) using [github-action-benchmark](https://github.com/marketplace/actions/continuous-benchmark).
Comment was generated at https://github.com/neuralmagic/nm-vllm/commit/bdfdb774576b34b4cae98a200b146c19cd24d24c#commitcomment-140365696
|
AWS-AVX2-32G-A10G-24G-Benchmark / NM_GH_ACTION_BENCHMARK
Performance alert! Previous value was 65.88031650062476 and current value is 75.95981250051409. It is 1.152997079177568x worse than previous exceeding a ratio threshold 1.1
|
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
8475302354-aws-avx2-192G-4-a10g-96G-cc-nm-vllm-html
Expired
|
1.17 MB |
|
8475302354-aws-avx2-32G-a10g-24G
Expired
|
127 KB |
|
8475302354-aws-avx2-32G-a10g-24G-cc-nm-vllm-html
Expired
|
1.17 MB |
|
gh_action_benchmark_jsons-8475302354-aws-avx2-32G-a10g-24G
Expired
|
30.6 KB |
|