Fix gRPC streaming non-decoupled segfault if sending response and final flag separately #7265

kthui · 2024-05-24T00:51:05Z

When using gRPC streaming for non-decoupled models, the frontend does not properly handle the case which the response and final flag are sent separately. The gRPC server will release the state as soon as the response is received, resulting in a possible segfault when the final flag is later received.

The fix is to defer sending the response if a final flag is not coupled with the response, and leave the gRPC step in WRITEREADY. Then, when the final flag is received, it continue sending the response and move to step WRITTEN.

This PR should merge before #7311

qa/L0_backend_python/test.sh

Tabrizian

Can we target main and only include the related commits for this change? It looks like it is independent of other Python be changes.

qa/L0_backend_python/response_sender/response_sender_test.py

src/grpc/stream_infer_handler.cc

kthui · 2024-05-28T22:12:42Z

All non-related commits are removed and the PR is now targeting the main branch.

…al flag separately

…ponse

qa/L0_grpc_state_cleanup/cleanup_test.py

oandreeva-nv

LGTM, Thanks for elaborations!

oandreeva-nv

LGTM!

rmccorm4 · 2024-06-06T00:41:27Z

qa/L0_grpc_state_cleanup/cleanup_test.py

@@ -79,7 +80,7 @@ def _prepare_inputs_and_outputs(self, kind):
            self.outputs_.append(grpcclient.InferRequestedOutput("OUT"))
            self.outputs_.append(grpcclient.InferRequestedOutput("IDX"))
            self.requested_outputs_ = self.outputs_
-        elif kind == "simple" or kind == "streaming":
+        elif kind in ("simple", "streaming"):


Just curious, what is streaming compared to decoupled_streaming and non_decoupled_streaming?

The decoupled_streaming and non_decoupled_streaming are for the repeat_int32 and repeat_int32_non_decoupled models on repeat backend. The simple and streaming are for the custom_zero_1_float32 model on identity backend.

I think we could do some refactoring on how the inputs are prepared, i.e. for the repeat backend, we can have a function that returns a inputs that can be used directly on the client.async_stream_infer().

rmccorm4

LGTM, just one question: https://github.com/triton-inference-server/server/pull/7265/files#r1628592664. Doesn't need to be fixed/addressed here, just curious.

…al flag separately (#7265)

This was referenced May 24, 2024

Fix decoupled gpu output error handling triton-inference-server/python_backend#362

Merged

Update expected error message #7258

Merged

kthui marked this pull request as ready for review May 24, 2024 01:05

kthui requested review from tanmayv25, Tabrizian, rmccorm4 and oandreeva-nv May 24, 2024 01:05

oandreeva-nv reviewed May 24, 2024

View reviewed changes

qa/L0_backend_python/test.sh Outdated Show resolved Hide resolved

Tabrizian reviewed May 28, 2024

View reviewed changes

oandreeva-nv reviewed May 28, 2024

View reviewed changes

qa/L0_backend_python/response_sender/response_sender_test.py Outdated Show resolved Hide resolved

oandreeva-nv reviewed May 28, 2024

View reviewed changes

src/grpc/stream_infer_handler.cc Outdated Show resolved Hide resolved

rmccorm4 reviewed May 28, 2024

View reviewed changes

src/grpc/stream_infer_handler.cc Show resolved Hide resolved

kthui force-pushed the jacky-res-sender-fix-grpc branch from 03130e0 to 8a10ba4 Compare May 28, 2024 22:10

kthui changed the base branch from jacky-res-sender-main to main May 28, 2024 22:10

Tabrizian previously approved these changes May 30, 2024

View reviewed changes

kthui dismissed Tabrizian’s stale review via 8fd6f4e May 31, 2024 00:54

kthui force-pushed the jacky-res-sender-fix-grpc branch 3 times, most recently from c08292b to 8799aec Compare June 4, 2024 00:04

kthui added 3 commits June 3, 2024 17:05

Fix gRPC streaming non-decoupled segfault if sending response and fin…

d729557

…al flag separately

Enhance logging for non-decoupled on null response and final flag

b888f7e

Add test for non-decoupled gRPC streaming returning more than one res…

addab53

…ponse

kthui force-pushed the jacky-res-sender-fix-grpc branch from 8799aec to addab53 Compare June 4, 2024 00:06

kthui requested review from rmccorm4, Tabrizian and oandreeva-nv June 4, 2024 00:14

kthui mentioned this pull request Jun 4, 2024

Add support for response sender in the default mode #7311

Merged

Update copyright

3e823dc

Tabrizian previously approved these changes Jun 5, 2024

View reviewed changes

oandreeva-nv reviewed Jun 5, 2024

View reviewed changes

qa/L0_grpc_state_cleanup/cleanup_test.py Outdated Show resolved Hide resolved

oandreeva-nv previously approved these changes Jun 5, 2024

View reviewed changes

Use in tuple instead of or

178b65b

kthui dismissed stale reviews from oandreeva-nv and Tabrizian via 178b65b June 5, 2024 21:22

kthui requested review from Tabrizian and oandreeva-nv June 5, 2024 21:23

oandreeva-nv approved these changes Jun 5, 2024

View reviewed changes

tanmayv25 approved these changes Jun 5, 2024

View reviewed changes

rmccorm4 reviewed Jun 6, 2024

View reviewed changes

rmccorm4 approved these changes Jun 6, 2024

View reviewed changes

kthui merged commit 797d296 into main Jun 6, 2024
3 checks passed

kthui deleted the jacky-res-sender-fix-grpc branch June 6, 2024 00:53

pskiran1 pushed a commit that referenced this pull request Sep 19, 2024

Fix gRPC streaming non-decoupled segfault if sending response and fin…

575b98b

…al flag separately (#7265)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix gRPC streaming non-decoupled segfault if sending response and final flag separately #7265

Fix gRPC streaming non-decoupled segfault if sending response and final flag separately #7265

kthui commented May 24, 2024 •

edited

Loading

Tabrizian left a comment •

edited

Loading

kthui commented May 28, 2024 •

edited

Loading

oandreeva-nv left a comment

oandreeva-nv left a comment

rmccorm4 Jun 6, 2024

kthui Jun 6, 2024

rmccorm4 left a comment

Fix gRPC streaming non-decoupled segfault if sending response and final flag separately #7265

Fix gRPC streaming non-decoupled segfault if sending response and final flag separately #7265

Conversation

kthui commented May 24, 2024 • edited Loading

Tabrizian left a comment • edited Loading

Choose a reason for hiding this comment

kthui commented May 28, 2024 • edited Loading

oandreeva-nv left a comment

Choose a reason for hiding this comment

oandreeva-nv left a comment

Choose a reason for hiding this comment

rmccorm4 Jun 6, 2024

Choose a reason for hiding this comment

kthui Jun 6, 2024

Choose a reason for hiding this comment

rmccorm4 left a comment

Choose a reason for hiding this comment

kthui commented May 24, 2024 •

edited

Loading

Tabrizian left a comment •

edited

Loading

kthui commented May 28, 2024 •

edited

Loading