Skip to content

Actions: mmoskal/vllm

clang-format

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
7 workflow runs
7 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Update benchmark_serving.py to read and write json-datasets, results …
clang-format #7: Commit 22f5851 pushed by mmoskal
October 1, 2024 23:12 22s main
October 1, 2024 23:12 22s
[Misc] Log spec decode metrics (#6454)
clang-format #6: Commit 160e1d8 pushed by GindaChen
July 16, 2024 21:08 27s main
July 16, 2024 21:08 27s
add benchmark for fix length input and output (#5857)
clang-format #5: Commit 333306a pushed by mmoskal
July 7, 2024 17:04 14s main
July 7, 2024 17:04 14s
[core][distributed] custom allreduce when pp size > 1 (#6117)
clang-format #4: Commit 3c6325f pushed by mmoskal
July 3, 2024 22:04 19s main
July 3, 2024 22:04 19s
[Model] Jamba support (#4115)
clang-format #3: Commit 9d6a8da pushed by mmoskal
July 3, 2024 00:16 21s main
July 3, 2024 00:16 21s
[Misc] add logging level env var (#5045)
clang-format #2: Commit 325c119 pushed by mmoskal
May 25, 2024 16:03 16s main
May 25, 2024 16:03 16s
[Misc] Take user preference in attention selector (#4960)
clang-format #1: Commit ee3eea0 pushed by mmoskal
May 23, 2024 00:05 22s main
May 23, 2024 00:05 22s