Adds EP-10494: AI APIs enhancement proposal #10495

npolshakova · 2025-01-23T18:14:17Z

Description

Adds Enhancement Proposal 10494 that proposes adding AI Gateway APIs support.

Supports #10494

Initial APIs can be found in this draft PR: #10493

docs/content/enhancements/10494.md

EItanya

This proposal LGTM

docs/content/enhancements/10494.md

npolshakova · 2025-01-28T21:57:29Z

@linsun / @yuval-k can I get another review?

docs/content/enhancements/10494.md

design/10494.md

andy-fong · 2025-02-12T00:10:27Z

design/10494.md

+spec:
+  ai:
+    vertexAi:
+      model: gemini-1.5-flash-001


Should we mention what would happen if the URL or the request body contains the model value that's not the same as the model value in the Upstream?

Yep, I'll add a sentence about if the model is not provided as well. We use the Upstream in the case of a mismatch, right?

design/10494.md

lgadban · 2025-02-14T19:29:32Z

design/10494.md

+          name: openai-secret
+```
+
+Notice that this Upstream does not specify a model, so kgateway will use the model value in the request to determine 


kgateway will use the model value in the request

What does this mean? Might be worth elaborating here.

lgadban · 2025-02-14T19:30:07Z

design/10494.md

+    filters:
+    - type: ExtensionRef
+      extensionRef:
+        group: gateway.kgateway.dev/v1alpha1
+        kind: RoutePolicy
+        name: open-ai-opt


Prefer using targetRef attachment on the RoutePolicy rather than ExtensionRef?

lgadban · 2025-02-14T19:31:58Z

design/10494.md

+The `pool` entries can either define a list of backends or a single backend.
+
+```yaml
+multi:


Does this live on the Upstream? More context would be help IMO

npolshakova requested review from yuval-k, danehans and EItanya January 23, 2025 19:36

linsun reviewed Jan 24, 2025

View reviewed changes

docs/content/enhancements/10494.md Outdated Show resolved Hide resolved

EItanya reviewed Jan 27, 2025

View reviewed changes

lgadban reviewed Jan 28, 2025

View reviewed changes

docs/content/enhancements/10494.md Outdated Show resolved Hide resolved

npolshakova requested a review from linsun January 28, 2025 21:57

npolshakova requested a review from lgadban January 31, 2025 16:29

yuval-k reviewed Feb 3, 2025

View reviewed changes

docs/content/enhancements/10494.md Outdated Show resolved Hide resolved

docs/content/enhancements/10494.md Outdated Show resolved Hide resolved

docs/content/enhancements/10494.md Outdated Show resolved Hide resolved

docs/content/enhancements/10494.md Outdated Show resolved Hide resolved

npolshakova mentioned this pull request Feb 6, 2025

Add support for AI APIs #10493

Open

4 tasks

npolshakova requested review from EItanya and yuval-k February 11, 2025 17:34

andy-fong suggested changes Feb 12, 2025

View reviewed changes

npolshakova added 9 commits February 12, 2025 15:46

add ai api enhancement proposal

e876a96

fix links

878dbca

reword

042307c

add additional sections and examples

b311dea

add deepseek custom hosting example, highlight egress gw usecase

bd88126

fix typos

18dc546

feedback

384fa3a

move EP

e1dc2c1

feedback

b1fe48b

npolshakova force-pushed the add-ai-api-enhancement-proposal branch from 3ab7979 to b1fe48b Compare February 12, 2025 20:46

andy-fong reviewed Feb 12, 2025

View reviewed changes

design/10494.md Outdated Show resolved Hide resolved

design/10494.md Show resolved Hide resolved

npolshakova added 2 commits February 12, 2025 17:32

address feedback

da5c601

fix path

da8ef8a

npolshakova requested a review from andy-fong February 12, 2025 22:37

andy-fong approved these changes Feb 13, 2025

View reviewed changes

Merge branch 'main' into add-ai-api-enhancement-proposal

a8b9b3e

npolshakova added 6 commits February 13, 2025 17:34

clean up examples, limit scope to extensionRefs

a2a8272

fix extensionRef

3f666c3

fmt

9a00936

reword

7c1daee

Merge branch 'main' into add-ai-api-enhancement-proposal

101daa4

Merge branch 'main' into add-ai-api-enhancement-proposal

f7cb884

npolshakova mentioned this pull request Feb 14, 2025

Add AI plugin #10627

Draft

4 tasks

lgadban reviewed Feb 14, 2025

View reviewed changes

andy-fong approved these changes Feb 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds EP-10494: AI APIs enhancement proposal #10495

Adds EP-10494: AI APIs enhancement proposal #10495

npolshakova commented Jan 23, 2025

EItanya left a comment

npolshakova commented Jan 28, 2025

andy-fong Feb 12, 2025

npolshakova Feb 12, 2025

lgadban Feb 14, 2025

lgadban Feb 14, 2025

lgadban Feb 14, 2025

Adds EP-10494: AI APIs enhancement proposal #10495

Are you sure you want to change the base?

Adds EP-10494: AI APIs enhancement proposal #10495

Conversation

npolshakova commented Jan 23, 2025

Description

EItanya left a comment

Choose a reason for hiding this comment

npolshakova commented Jan 28, 2025

andy-fong Feb 12, 2025

Choose a reason for hiding this comment

npolshakova Feb 12, 2025

Choose a reason for hiding this comment

lgadban Feb 14, 2025

Choose a reason for hiding this comment

lgadban Feb 14, 2025

Choose a reason for hiding this comment

lgadban Feb 14, 2025

Choose a reason for hiding this comment