Truncation support for recent Mistrals to prevent AsyncEngineDeadError on input exceeding max_model_len w/ chunked prefill #4592
Triggered via pull request
February 24, 2025 04:51
nightflight-dk
opened
#13741
Status
Success
Total duration
12s
Artifacts
–
cleanup_pr_body.yml
on: pull_request_target
update-description
5s