Move connection management into networking layer #351

dirkmc · 2020-04-15T21:26:57Z

Part of the fix for #347

internal/messagequeue/messagequeue.go

Stebalien · 2020-04-16T22:58:30Z

network/ipfs_impl.go

+			return nil
+		case <-s.done:
+			return nil
+		case <-time.After(s.opts.SendErrorBackoff):


I'd avoid using time.After but this isn't too critical.

network/ipfs_impl.go

Stebalien · 2020-04-16T23:08:39Z

network/connecteventmanager.go

+	}
+	state.refs++
+
+	if state.refs == 1 && state.responsive {


I'd consider switching the peer back to "responsive" on connect.

That is:

if state.refs == 1 || !state.responsive { state.responsive = true ... }

We get into the unresponsive state if the remote peer fails to respond to several attempts to dial it, but it's still connected. So we're implicitly saying that we care about responsiveness more than connectivity.
I guess arguably if a peer opens a new connection it can be considered responsive?

Thinking about this more we probably want to keep it how it is - if for example a peer that doesn't support bitswap dials us, we will dial it when broadcasting to connected peers. If the peer responds with an error indicating protocol not supported, we shouldn't try to dial it again even if it connects to us again.

Given that we're making 3 attempts, it's probably fine. I'm concerned that it can sometimes take some time to know that a connection is actually dead. In that case, we could try several times, say "peer's dead!", then get the new connection, then see the old connection finally die.

network/connecteventmanager.go

internal/messagequeue/messagequeue.go

Stebalien · 2020-04-17T20:01:33Z

network/ipfs_impl.go

 	return s.stream, nil
 }

 // Reset the stream
 func (s *streamMessageSender) Reset() error {
 	if s.stream != nil {
 		err := s.stream.Reset()
-		s.stream = nil
+		s.connected = false


Why not just set the stream to nil? That will free up the resources as well.

I believe the crash was caused because we were calling SupportsHave() after a Reset():

func (s *streamMessageSender) SupportsHave() bool { return s.bsnet.SupportsHave(s.stream.Protocol()) }

Ah, I see. We shouldn't even construct a streamMessageSender till we have the stream.

Stebalien · 2020-04-17T20:02:02Z

panic: runtime error: invalid memory address or nil pointer dereference
[signal SIGSEGV: segmentation violation code=0x1 addr=0x28 pc=0x14356d7]
goroutine 151492 [running]:
github.com/ipfs/go-bitswap/network.(*streamMessageSender).SupportsHave(0xc004dc0a80, 0x1ef1520)
        pkg/mod/github.com/ipfs/[email protected]/network/ipfs_impl.go:137 +0x37
github.com/ipfs/go-bitswap/internal/messagequeue.(*MessageQueue).sendMessage(0xc00331ac40)
        pkg/mod/github.com/ipfs/[email protected]/internal/messagequeue/messagequeue.go:425 +0x1b8
github.com/ipfs/go-bitswap/internal/messagequeue.(*MessageQueue).sendIfReady(0xc00331ac40)
        pkg/mod/github.com/ipfs/[email protected]/internal/messagequeue/messagequeue.go:406 +0x4b
github.com/ipfs/go-bitswap/internal/messagequeue.(*MessageQueue).runQueue(0xc00331ac40)
        pkg/mod/github.com/ipfs/[email protected]/internal/messagequeue/messagequeue.go:350 +0x2b7
created by github.com/ipfs/go-bitswap/internal/messagequeue.(*MessageQueue).Startup
        pkg/mod/github.com/ipfs/[email protected]/internal/messagequeue/messagequeue.go:300 +0x98

Stebalien

There are some remaining questions but this PR fixes the immediate problem.

Move connection management into networking layer This commit was moved from ipfs/go-bitswap@9d9719e

dirkmc added 7 commits April 15, 2020 17:26

refactor: move connection management into networking layer

bfd6fe8

fix: stop sender when message queue shut down

b097d70

fix: tests

c1922c0

fix: don't hang on to disconnected peer refs

ba4b52e

fix: shutdown message queue when there's a send error

189564e

refactor: extract Connection Event Manager to own file and add tests

37301bc

test: add more testing for ipfs_impl

b62e7fd

dirkmc marked this pull request as ready for review April 16, 2020 21:00

dirkmc requested a review from Stebalien April 16, 2020 21:00

Stebalien suggested changes Apr 16, 2020

View reviewed changes

dirkmc added 4 commits April 17, 2020 11:49

feat: dont retry if connect error is multistream.ErrNotSupported

3b40d49

fix: copy opts in ipfs_impl

59e7aa4

fix: remove extraneous map writes in connectionEventManager

c233956

fix: perf improvement for connectEventManager

c26bd59

Stebalien reviewed Apr 17, 2020

View reviewed changes

internal/messagequeue/messagequeue.go Show resolved Hide resolved

dirkmc added 2 commits April 17, 2020 14:57

fix: simplify message queue shutdown

bdccb20

fix: use explicit connected bool for streamMessageSender

a8ed651

Stebalien reviewed Apr 17, 2020

View reviewed changes

dirkmc added 2 commits April 17, 2020 16:21

fix: ipfs_impl error handling

8894bb6

fix: mark wants sent when they are added to a message to be sent

e6bf8af

Stebalien mentioned this pull request Apr 18, 2020

PeerManager's lock is highly contended #352

Closed

Stebalien approved these changes Apr 18, 2020

View reviewed changes

Stebalien merged commit 9d9719e into master Apr 18, 2020

Stebalien deleted the refactor/conn-mgmt branch April 21, 2020 17:53

Jorropo pushed a commit to Jorropo/go-libipfs that referenced this pull request Jan 26, 2023

Merge pull request ipfs/go-bitswap#351 from ipfs/refactor/conn-mgmt

521c70c

Move connection management into networking layer This commit was moved from ipfs/go-bitswap@9d9719e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move connection management into networking layer #351

Move connection management into networking layer #351

dirkmc commented Apr 15, 2020 •

edited

Loading

Stebalien Apr 16, 2020

Stebalien Apr 16, 2020

Stebalien Apr 16, 2020

dirkmc Apr 17, 2020

dirkmc Apr 17, 2020

Stebalien Apr 17, 2020

Stebalien Apr 17, 2020

dirkmc Apr 17, 2020

Stebalien Apr 17, 2020

Stebalien commented Apr 17, 2020

Stebalien left a comment

Move connection management into networking layer #351

Move connection management into networking layer #351

Conversation

dirkmc commented Apr 15, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Stebalien commented Apr 17, 2020

Stebalien left a comment

Choose a reason for hiding this comment

dirkmc commented Apr 15, 2020 •

edited

Loading