[Bug]: OpenAI API Server always reports 0 tokens/s #4209
Labels
bug
Something isn't working
release-blocker
This PR/issue blocks the next release, therefore deserves highest priority
Your current environment
🐛 Describe the bug
It seems that the async engine logger in the openai api_server is not reporting tokens/s for either prompt or generation throughput.
Start the server with:
And submit requests with:
You should be able to see in the server logging output that there are requests running but no tokens/s reported:
The text was updated successfully, but these errors were encountered: