Rethink inactivity timeout under high load #2761

Open
tillrohrmann opened this issue Feb 20, 2025 · 0 comments

At the moment, the inactivity timeout defines when a bi-directional streaming invocation without activity is suspended. The current default value is 60s. The idea is to keep the invocation running in case a completion arrives that enables the invocation to make further progress. This works fine as long as the invoker has enough capacity to run other invocations. If, however, a concurrency limit is set and the system is under load, it can also slow down invocation processing, because idle invocations hold on to their slots while other invocations wait. See #2758 (comment) for more details.
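To make the slowdown concrete, here is a minimal sketch of how long a newly arrived invocation might wait when every slot is held by an idle invocation. It is not the invoker's actual code: the function name `wait_for_free_slot` and the model (a slot frees only when its holder's inactivity timeout expires) are assumptions for illustration. Only the 60s default comes from the description above.

```python
INACTIVITY_TIMEOUT_S = 60.0  # default value mentioned in this issue


def wait_for_free_slot(now, last_activity, timeout=INACTIVITY_TIMEOUT_S):
    """Seconds a new invocation waits if all slots are held by idle invocations.

    `last_activity` holds the last-activity timestamp (in seconds) of each
    invocation occupying a slot. Hypothetical model: a slot is released only
    when its holder has been inactive for `timeout` seconds.
    """
    # The first slot to free up belongs to the invocation that went idle first.
    earliest_release = min(t + timeout for t in last_activity)
    return max(0.0, earliest_release - now)
```

With a concurrency limit of 2 and both slots idle since t=10 and t=25, an invocation arriving at t=30 waits `wait_for_free_slot(30, [10, 25])` seconds under this model, even though no slot is doing any work.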

Under load, there is a tension between keeping an invocation running, which occupies its slot so that it can make progress quickly once a completion arrives, and freeing its slot so that waiting invocations can make progress. One idea to avoid occupying slots without making progress is to suspend in-flight invocations that are not making progress whenever a new invocation arrives that does not get a free slot.
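The idea could be sketched roughly as follows. This is an illustration under stated assumptions, not the existing invoker: the `Invoker` class, its fields, and the `mark_idle` / `on_arrival` methods are all hypothetical names, and "idle" here simply means the invocation is waiting for a completion.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Invoker:
    """Hypothetical invoker with a fixed number of concurrency slots."""
    concurrency_limit: int
    # invocation id -> True if the invocation is idle (awaiting a completion)
    running: dict = field(default_factory=dict)
    suspended: list = field(default_factory=list)
    waiting: deque = field(default_factory=deque)

    def mark_idle(self, inv_id):
        # Called when a running invocation has nothing to do until a completion arrives.
        self.running[inv_id] = True

    def on_arrival(self, inv_id):
        # A free slot exists: start the invocation right away.
        if len(self.running) < self.concurrency_limit:
            self.running[inv_id] = False
            return "started"
        # No free slot: suspend an idle invocation, if any, instead of
        # waiting for its inactivity timeout to expire.
        victim = next((i for i, idle in self.running.items() if idle), None)
        if victim is None:
            # Every slot is doing useful work, so the new invocation must wait.
            self.waiting.append(inv_id)
            return "queued"
        del self.running[victim]
        self.suspended.append(victim)
        self.running[inv_id] = False
        return "started"
```

In this sketch, idle invocations keep their slot for as long as nobody else needs it, so the fast path after a completion is preserved when the system is not under load. The inactivity timeout would then only matter as an upper bound when no new invocations arrive.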
