
Reset any queued stream on receipt of remote reset #258

Merged · 9 commits · Apr 16, 2018

Conversation

@hawkw (Collaborator) commented Apr 14, 2018

Fixes #256.

This PR changes state::recv_reset so that any closed stream with queued sends is immediately reset (and its send queue thus cleared) on receipt of a RST_STREAM frame from the remote.

This fixes the panic encountered by the test @goffrie posted in #256, where a stream is scheduled to close, receives a RST_STREAM frame, and has its buffered capacity set to 0, but isn't removed from the send queue, so we hit an assertion (or an overflow, if debug assertions are disabled) when subtracting a sent frame's size from the buffered size.
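For context, the failure mode looks roughly like this (a minimal, self-contained sketch; `Stream` and `on_frame_sent` are stand-ins for illustration, not actual h2 internals):

    // Stand-in for the per-stream send state h2 tracks.
    struct Stream {
        buffered_send_data: usize,
    }

    // When a queued frame is finally written out, its length is
    // subtracted from the stream's buffered byte count.
    fn on_frame_sent(stream: &mut Stream, len: usize) {
        // Mirrors the assertion from #256; with debug assertions
        // disabled, the subtraction below underflows and wraps instead.
        debug_assert!(stream.buffered_send_data >= len);
        stream.buffered_send_data -= len;
    }

    fn main() {
        // A remote RST_STREAM has already zeroed the buffered count,
        // but a 5-byte DATA frame was left in the send queue.
        let mut stream = Stream { buffered_send_data: 0 };
        // Sending the leftover frame now trips the assertion:
        // "assertion failed: stream.buffered_send_data >= len".
        on_frame_sent(&mut stream, 5);
    }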

I've determined that this does fix the panic in @goffrie's test, but the test then hangs forever instead of passing. I haven't yet been able to rewrite the test so that it hits the panic on master but doesn't hang once the issue is resolved. I intend to keep working on testing this, but I'm putting the change up for review now nonetheless.

@hawkw hawkw requested a review from carllerche April 14, 2018 01:19
    @@ -217,10 +217,11 @@ impl State {
         pub fn recv_reset(&mut self, reason: Reason, queued: bool) {
     
             match self.inner {
    -            Closed(Cause::EndStream) if queued => {
    +            Closed(prior_reason) if queued => {
Collaborator

this looks like the only functional change in this PR... can you explain how this resolves the issue?

Collaborator

+1. It would be good to include a comment explaining this as it is definitely subtle.

@olix0r (Collaborator) Apr 15, 2018

IIUC, the match can be condensed as:

    pub fn recv_reset(&mut self, reason: Reason, queued: bool) {
        match self.inner {
            Closed(..) if !queued => {},
            s => {
                trace!("recv_reset; reason={:?}; state={:?}", reason, s);
                self.inner = Closed(Cause::Proto(reason));
            }
        }
    }

I want to make sure I understand this properly, though. What are the states where we can be closed and have queued data?

https://github.com/carllerche/h2/blob/b1d14caa2251cfa6316d73ad77409fb0be4a41f1/src/proto/streams/state.rs#L73-L87

In particular, I'm not sure it makes sense to change the closed type when it's already Proto (it's probably harmless, but I'm not sure of the intended behavior). Also, I'm not sure we'd want to override an Io error (though I'm not sure it's actually possible to receive a reset after an Io error).

@hawkw (Collaborator, Author) Apr 16, 2018

This is fair, I can make the match more specific.

My understanding is that we can have queued data in the following cases:

  • when the cause is Cause::Scheduled,
  • when the cause is Cause::EndStream, which would indicate that we've enqueued an EOS frame but it has yet to be sent (either because there are other frames ahead of it in the send queue, or we just haven't popped it yet)

AFAIK, queued should not be true with any other Cause variants, but I'll change the pattern match to actually encode this nonetheless.
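For illustration, the more specific match might be shaped like this (a sketch only: it assumes Cause::Scheduled carries a payload and reuses the idiom from the condensed version above; the actual change is in 05a6017, referenced below):

    pub fn recv_reset(&mut self, reason: Reason, queued: bool) {
        match self.inner {
            // The only closed states that can still have frames queued:
            // a scheduled reset, or an enqueued-but-unsent EOS frame.
            // Honor the remote's RST_STREAM by resetting immediately.
            Closed(Cause::Scheduled(..)) | Closed(Cause::EndStream) if queued => {
                self.inner = Closed(Cause::Proto(reason));
            }
            // Any other closed stream keeps its original cause.
            Closed(..) => {}
            s => {
                trace!("recv_reset; reason={:?}; state={:?}", reason, s);
                self.inner = Closed(Cause::Proto(reason));
            }
        }
    }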

@hawkw (Collaborator, Author)

@olix0r, hopefully 05a6017 is clearer?

    //
    // In either of these cases, we want to honor the received
    // RST_STREAM by resetting the stream immediately (and clearing
    // the send queue).
Collaborator

we don't clear the sendq here. Would be clearer as:

... immediately. This causes the send queue to be cleared by prioritize.

(Or whatever is actually correct.)

@goffrie (Contributor) commented Apr 16, 2018

Hm, there's one more way we can call clear_queue: via recv_err. I believe this can cause a similar issue, where the stream's state doesn't necessarily become is_peer_reset and so the check here doesn't trigger. For example, I've seen this happen when receiving a GO_AWAY frame, which caused this assertion to trip because the frames had all been cleared.

@hawkw (Collaborator, Author) commented Apr 16, 2018

@goffrie Ah, thanks for pointing that out. I think we'll want to add similar logic to recv_err for the case where a stream has frames in the send queue; I'll look into that in a follow-up PR.
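Purely as illustration, that analogous handling might look something like the sketch below. This is hypothetical: the signature, the queued flag, and the proto::Error variant shapes are all assumptions, not code from any actual commit.

    pub fn recv_err(&mut self, err: &proto::Error, queued: bool) {
        match self.inner {
            // A closed stream with nothing queued keeps its cause.
            Closed(..) if !queued => {}
            // Otherwise, re-close with an error cause so that
            // prioritize clears the send queue, just as recv_reset
            // now does on a remote RST_STREAM.
            s => {
                trace!("recv_err; err={:?}; state={:?}", err, s);
                self.inner = Closed(match *err {
                    // Variant shapes assumed for illustration.
                    proto::Error::Proto(reason) => Cause::Proto(reason),
                    _ => Cause::Io,
                });
            }
        }
    }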

@carllerche (Collaborator)

Thanks for getting this done.

@carllerche carllerche deleted the eliza/rst_while_sending branch May 4, 2018 17:07
Successfully merging this pull request may close these issues:

  • assertion failed: stream.buffered_send_data >= len