Gateways supporting LLMServerPool as an HTTPRoute BackendRef #19
Comments
hey @kfswain, I've created envoyproxy/gateway#4423 to make a decision on supporting this in Envoy Gateway
The Kubernetes project currently lacks enough contributors to adequately respond to all issues. This bot triages un-triaged issues according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale
The Kubernetes project currently lacks enough active contributors to adequately respond to all issues. This bot triages un-triaged issues according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten
/remove-lifecycle rotten
Our current Envoy integration relies on `EnvoyExtensionPolicy` and `EnvoyPatchPolicy`. This is very manual and not sustainable. (See: #18)
We're trying to settle on a single gateway implementation that this project will extend to support LLMServerPool as a Gateway API backend. This will enable us to run e2e tests against these concepts and iterate more quickly. That implementation should be:
We propose extending this existing gateway implementation to act as the controller for the `LLMServerPool` object (see: https://github.com/kubernetes-sigs/llm-instance-gateway/blob/main/docs/proposals/002-api-proposal/proposal.md#llmserverpool), as well as updating `HTTPRoute` to support an `LLMServerPool` as a backendRef.
At a high level we expect this to look like:
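As a rough illustration of the `backendRef` half of this proposal, the sketch below builds an `HTTPRoute` manifest whose backend is an `LLMServerPool` instead of a `Service`. It is not taken from the proposal: the API group `example.llm-gateway.x-k8s.io`, the helper `http_route_with_pool`, and all resource names are hypothetical placeholders. Only the `group`/`kind`/`name` shape of a Gateway API `backendRef` is standard.

```python
import json

# ASSUMPTION: placeholder API group for LLMServerPool. The real group is
# whatever the linked API proposal defines; it is not stated in this issue.
ASSUMED_POOL_GROUP = "example.llm-gateway.x-k8s.io"


def http_route_with_pool(route_name, gateway_name, pool_name):
    """Build an HTTPRoute manifest (as a dict) whose single rule sends
    traffic to an LLMServerPool via a non-Service backendRef."""
    return {
        "apiVersion": "gateway.networking.k8s.io/v1",
        "kind": "HTTPRoute",
        "metadata": {"name": route_name},
        "spec": {
            # The Gateway this route attaches to.
            "parentRefs": [{"name": gateway_name}],
            "rules": [
                {
                    "backendRefs": [
                        {
                            # group + kind select a custom backend type
                            # rather than the default core Service.
                            "group": ASSUMED_POOL_GROUP,
                            "kind": "LLMServerPool",
                            "name": pool_name,
                        }
                    ]
                }
            ],
        },
    }


if __name__ == "__main__":
    # Hypothetical resource names, for illustration only.
    manifest = http_route_with_pool("llm-route", "inference-gateway", "my-pool")
    print(json.dumps(manifest, indent=2))
```

A gateway implementation that does not recognize the `LLMServerPool` kind would be expected to reject or ignore such a reference, which is why the implementation also has to act as the controller for the pool object.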