
Serverless Inference API OpenAI /v1/chat/completions route broken #2946

Open
1 of 4 tasks
pelikhan opened this issue Jan 23, 2025 · 1 comment

System Info

Accessing the serverless inference endpoints via the OpenAI-compatible route returns status 400 with the following error:

Invalid URL: missing field `name`

Information

  • Docker
  • The CLI directly

Tasks

  • An officially supported command
  • My own modifications

Reproduction

Here is a curl command to run inference; it requires HF_TOKEN to be set.

curl https://api-inference.huggingface.co/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $HF_TOKEN" \
-d '{
  "model": "meta-llama/Llama-3.3-70B-Instruct",
  "messages": [
    {
      "role": "user",
      "content": "Write a short poem."
    }
  ]
}'

Expected behavior

This endpoint should be OpenAI-compatible, i.e. accept a standard /v1/chat/completions request (with the model specified in the request body) and return a chat completion.

mzyil commented Jan 24, 2025

+1
The same request works with this URL format, though:

https://api-inference.huggingface.co/models/meta-llama/Llama-3.3-70B-Instruct/v1/chat/completions
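
For reference, a minimal sketch of the same request against that model-scoped route (same payload as above; assumes HF_TOKEN is set):

curl https://api-inference.huggingface.co/models/meta-llama/Llama-3.3-70B-Instruct/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $HF_TOKEN" \
-d '{
  "model": "meta-llama/Llama-3.3-70B-Instruct",
  "messages": [
    {
      "role": "user",
      "content": "Write a short poem."
    }
  ]
}'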
