[InferenceClient] Support proxy calls for 3rd party providers #2781

hanouticelina · 2025-01-24T14:13:10Z

Resolve #2780

This is a first draft of the implementation of proxied calls. For now, this is done directly in prepare_request method for each provider, as there might be some some particularities for some providers (e.g. we need to update headers for fal.ai when the call is proxied).

HuggingFaceDocBuilderDev · 2025-01-24T14:16:51Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

hanouticelina · 2025-01-24T14:17:12Z

tests/test_inference_client.py

@@ -1186,3 +1186,203 @@ def test_chat_completion_error_in_stream():
 def test_resolve_chat_completion_url(model_url: str, expected_url: str):
    url = _build_chat_completion_url(model_url)
    assert url == expected_url
+
+
+class TestHFInferenceProvider:


There might be some refactoring here as the tests are quite similar, but i opted to have separate test classes per provider to make adding tests for a new provider more straightforward.

(likely to be revisited at some point yes but not shocked by having some redundancy in these tests)

hanouticelina · 2025-01-24T14:17:39Z

src/huggingface_hub/inference/_client.py

@@ -2806,7 +2806,7 @@ def visual_question_answering(
        """
        provider_helper = get_provider_helper(self.provider, task="visual-question-answering")
        request_parameters = provider_helper.prepare_request(
-            inputs=None,
+            inputs=image,


not sure how the test did not fail before 😅

src/huggingface_hub/inference/_providers/fal_ai.py

Wauplin

Thanks! Mostly some high-level questions but overall logic looks good to me

tests/test_inference_client.py

Wauplin · 2025-01-24T16:01:09Z

tests/test_inference_client.py

@@ -1186,3 +1186,203 @@ def test_chat_completion_error_in_stream():
 def test_resolve_chat_completion_url(model_url: str, expected_url: str):
    url = _build_chat_completion_url(model_url)
    assert url == expected_url
+
+
+class TestHFInferenceProvider:


(likely to be revisited at some point yes but not shocked by having some redundancy in these tests)

Wauplin · 2025-01-24T16:06:35Z

src/huggingface_hub/inference/_client.py

@@ -2806,7 +2806,7 @@ def visual_question_answering(
        """
        provider_helper = get_provider_helper(self.provider, task="visual-question-answering")
        request_parameters = provider_helper.prepare_request(
-            inputs=None,
+            inputs=image,


Wauplin · 2025-01-24T16:09:03Z

src/huggingface_hub/inference/_providers/fal_ai.py

+                "Routing the call through Hugging Face's infrastructure using your HF token, "
+                "and the usage will be billed directly to your Hugging Face account"


Is it actually the case? My understanding was that if a user sets their api_key in their HF account, the proxy will use it. So usage is billed on HF account only if using proxy + haven't set an api_key in their user settings. cc @SBrandeis for confirmation

Suggested change

"Routing the call through Hugging Face's infrastructure using your HF token, "

"and the usage will be billed directly to your Hugging Face account"

"Calling fal.ai provider through Hugging Face proxy."

ah yes maybe you're right

My understanding was that if a user sets their api_key in their HF account, the proxy will use it.

correct (but this changed today, there was some back'n'forth)

tests/test_inference_client.py

src/huggingface_hub/inference/_providers/fal_ai.py

Wauplin

Let's go!

add proxy calls support and tests

6b93561

hanouticelina commented Jan 24, 2025

View reviewed changes

hanouticelina requested a review from Wauplin January 24, 2025 14:18

fix tests

5dce083

hanouticelina commented Jan 24, 2025

View reviewed changes

src/huggingface_hub/inference/_providers/fal_ai.py Outdated Show resolved Hide resolved

hanouticelina added 2 commits January 24, 2025 16:45

add logging

9b507e1

fix logging message

6fba042

Wauplin reviewed Jan 24, 2025

View reviewed changes

tests/test_inference_client.py Outdated Show resolved Hide resolved

move providers tests into a separate test file

2008109

hanouticelina marked this pull request as ready for review January 24, 2025 16:30

hanouticelina requested a review from Wauplin January 24, 2025 16:30

julien-c reviewed Jan 24, 2025

View reviewed changes

src/huggingface_hub/inference/_providers/fal_ai.py Outdated Show resolved Hide resolved

Wauplin approved these changes Jan 24, 2025

View reviewed changes

fix logging message

459e025

julien-c approved these changes Jan 24, 2025

View reviewed changes

fix tests

221d01e

hanouticelina merged commit 9410c22 into main Jan 24, 2025
17 checks passed

hanouticelina deleted the add-proxy-calls branch January 24, 2025 16:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[InferenceClient] Support proxy calls for 3rd party providers #2781

[InferenceClient] Support proxy calls for 3rd party providers #2781

hanouticelina commented Jan 24, 2025

HuggingFaceDocBuilderDev commented Jan 24, 2025

hanouticelina Jan 24, 2025

Wauplin Jan 24, 2025

hanouticelina Jan 24, 2025

Wauplin Jan 24, 2025

Wauplin left a comment

Wauplin Jan 24, 2025

Wauplin Jan 24, 2025

Wauplin Jan 24, 2025

Wauplin Jan 24, 2025

hanouticelina Jan 24, 2025

julien-c Jan 24, 2025

Wauplin left a comment

		"Routing the call through Hugging Face's infrastructure using your HF token, "
		"and the usage will be billed directly to your Hugging Face account"

	"Routing the call through Hugging Face's infrastructure using your HF token, "
	"and the usage will be billed directly to your Hugging Face account"
	"Calling fal.ai provider through Hugging Face proxy."

[InferenceClient] Support proxy calls for 3rd party providers #2781

[InferenceClient] Support proxy calls for 3rd party providers #2781

Conversation

hanouticelina commented Jan 24, 2025

HuggingFaceDocBuilderDev commented Jan 24, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Wauplin left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Wauplin left a comment

Choose a reason for hiding this comment