[DNM] do not retry the txhashset msg send (do not retry any msg with an attachment) #3152

antiochp · 2019-11-29T16:04:04Z

Related #3145.

The existing retry_send logic retries any msg send if try_break! returns None.
try_break! does some funky stuff internally squashing some exceptions but letting others propagate. WouldBlock and TimedOut are both squashed, returning None.
So any msg send that times out will be retried.

This is fine for most (small) self-contained msgs but is definitely not fine for msgs that contain an attachment. For attachments we write repeatedly in a tight loop and any iteration of this loop could experience a timeout.
In situations like this we want to fail fast and not attempt to retry the msg send.

The receiver may have received a partial txhashset zip file and the only thing we can do here is to close the connection to signal to the recipient that we cannot reliably send the remaining bytes of the zip file.

Do not merge yet - needs discussing.
#3145 should be merged first.

We may want to rethink how we implement write_message to make sending the attachment more robust. But even if we do we still want to prevent msgs with attachments from being retried at the full msg level.

…achment)

hashmap · 2019-11-29T16:07:59Z

Sounds reasonable

antiochp · 2019-11-29T21:42:50Z

Going to take another pass at this.

try_break! actively works against us when writing messages onto the tcp stream, obfuscating errors and making it hard to really know what can and what cannot be safely retried.

And the per-msg "retry" logic feels flawed to me - we end up potentially retrying a msg forever under some error conditions (whereas other error conditions mean we drop the connection).

I think we can write this in a far simpler way by getting rid of try_break! and handling the "retry" internally in write_message, either successfully writing all the required bytes or failing and closing the connection.

There is no intermediate state that makes sense where "we could not write all the bytes but we'll just try sending the whole msg again and keep doing this until it gets more serious" which is effectively what we are attempting to handle currently.

antiochp · 2019-12-02T11:52:46Z

Closing this for now.
Related - #3154

Now that we have consistency across all the peer connections (blocking everywhere consistently) the "send" behavior needs investigating again to see if "retry send" is even necessary or makes sense.

I'd like to try and reduce our reliance on try_break! but will approach this in another PR.

do not retry the txhashset msg send (do not retry any msg with an att…

b1f8820

…achment)

antiochp requested a review from hashmap November 29, 2019 16:04

antiochp closed this Dec 2, 2019

antiochp deleted the no_retry_txhashset_msg branch December 2, 2019 11:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DNM] do not retry the txhashset msg send (do not retry any msg with an attachment) #3152

[DNM] do not retry the txhashset msg send (do not retry any msg with an attachment) #3152

antiochp commented Nov 29, 2019 •

edited

Loading

hashmap commented Nov 29, 2019

antiochp commented Nov 29, 2019

antiochp commented Dec 2, 2019

[DNM] do not retry the txhashset msg send (do not retry any msg with an attachment) #3152

[DNM] do not retry the txhashset msg send (do not retry any msg with an attachment) #3152

Conversation

antiochp commented Nov 29, 2019 • edited Loading

hashmap commented Nov 29, 2019

antiochp commented Nov 29, 2019

antiochp commented Dec 2, 2019

antiochp commented Nov 29, 2019 •

edited

Loading