improve connection scavenge logic by doing zero-byte read #50545

geoffkizer · 2021-04-01T06:34:29Z

Currently, during our periodic scavenge of idle connections, we check whether the connection is still valid (i.e. has not received EOF or invalid data) by either (a) doing a socket poll, if we have access to the underlying socket; or (b) issuing a read-ahead, when we don't have access to the underlying socket.

Approach (b) is problematic because it pins our read buffer for potentially a long period of time (on Windows), and prevents us from freeing this buffer back to the pool in the future. The negative impact of this approach can be seen here: #49431

Unfortunately, there are common cases where we don't have access to the underlying socket, and thus today have to fall back to approach (b). Most importantly, if a user supplies a ConnectCallback, then we never have access to the underlying socket, even if the ConnectCallback does trivial things like logging.

This PR addresses the problem as follows: Instead of doing a poll or a read-ahead in these cases, just do a zero-byte read. For Streams that implement proper zero-byte read support (that is, the read does not complete until data is available), this will avoid unnecessary buffer pinning. It's also probably cheaper than polling, since we only need to pend one zero-byte read in this case vs potentially multiple calls to poll.

Since we cannot depend on the Stream properly supporting zero-byte read semantics, we must issue an actual read-ahead when the zero-byte read completes. For non-compliant Streams, this results in the same behavior as previously: we will immediately pin the read buffer. For compliant Streams, the completion will only happen when either the connection is invalid (as occurs today) or when there is actual valid response data on the connection, in which case we would have issued a read-ahead when the connection is about to be reused anyway.

Note that we also issue a read-ahead when we retrieve an idle connection from the pool for reuse. This logic is essentially unchanged, but refactored a bit, and no longer shared with the scavenge logic since we want different behavior in these two cases.

Also, this PR changes the behavior in ReturnConnection (called when a connection is about to be returned to the pool) to not check for EOF or invalid data in the case where we are about to transfer the connection to a waiting request. Any invalid data would likely have been detected as part of response processing. And since we are about to reuse the connection anyway, the likelihood of detecting an invalid connection during the transfer is very small and highly timing-dependent. Note this only affects cases where a connection limit is set, since this is the only case in which we have waiting requests currently.

Related to #49431

@scalablecory @stephentoub @wfurt @dotnet/ncl

…te read

ghost · 2021-04-01T06:34:34Z

Tagging subscribers to this area: @dotnet/ncl
See info in area-owners.md if you want to be subscribed.

Issue Details

Currently, during our periodic scavenge of idle connections, we check whether the connection is still valid (i.e. has not received EOF or invalid data) by either (a) doing a socket poll, if we have access to the underlying socket; or (b) issuing a read-ahead, when we don't have access to the underlying socket.

Approach (b) is problematic because it pins our read buffer for potentially a long period of time (on Windows), and prevents us from freeing this buffer back to the pool in the future. The negative impact of this approach can be seen here: #49431

Unfortunately, there are common cases where we don't have access to the underlying socket, and thus today have to fall back to approach (b). Most importantly, if a user supplies a ConnectCallback, then we never have access to the underlying socket, even if the ConnectCallback does trivial things like logging.

This PR addresses the problem as follows: Instead of doing a poll or a read-ahead in these cases, just do a zero-byte read. For Streams that implement proper zero-byte read support (that is, the read does not complete until data is available), this will avoid unnecessary buffer pinning. It's also probably cheaper than polling, since we only need to pend one zero-byte read in this case vs potentially multiple calls to poll.

Since we cannot depend on the Stream properly supporting zero-byte read semantics, we must issue an actual read-ahead when the zero-byte read completes. For non-compliant Streams, this results in the same behavior as previously: we will immediately pin the read buffer. For compliant Streams, the completion will only happen when either the connection is invalid (as occurs today) or when there is actual valid response data on the connection, in which case we would have issued a read-ahead when the connection is about to be reused anyway.

Note that we also issue a read-ahead when we retrieve an idle connection from the pool for reuse. This logic is essentially unchanged, but refactored a bit, and no longer shared with the scavenge logic since we want different behavior in these two cases.

Also, this PR changes the behavior in ReturnConnection (called when a connection is about to be returned to the pool) to not check for EOF or invalid data in the case where we are about to transfer the connection to a waiting request. Any invalid data would likely have been detected as part of response processing. And since we are about to reuse the connection anyway, the likelihood of detecting an invalid connection during the transfer is very small and highly timing-dependent. Note this only affects cases where a connection limit is set, since this is the only case in which we have waiting requests currently.

Related to #49431

@scalablecory @stephentoub @wfurt @dotnet/ncl

Author:	geoffkizer
Assignees:	-
Labels:	`area-System.Net.Http`
Milestone:	-

geoffkizer · 2021-04-01T06:35:27Z

/azp run runtime-libraries-coreclr outerloop

azure-pipelines · 2021-04-01T06:35:41Z

Azure Pipelines successfully started running 1 pipeline(s).

stephentoub

Some nits, but otherwise LGTM.

src/libraries/System.Net.Http/src/System/Net/Http/SocketsHttpHandler/Http2Connection.cs

stephentoub · 2021-04-01T13:41:55Z

src/libraries/System.Net.Http/src/System/Net/Http/SocketsHttpHandler/HttpConnection.cs

 #pragma warning restore CA2012
-                }
            }


Nit:
Could be:

// We may already have a read-ahead task if we did a previous scavenge and haven't used the connection since. #pragma warning disable CA2012 // we're very careful to ensure the ValueTask is only consumed once, even though it's stored into a field _readAheadTask ??= ReadAheadWithZeroByteReadAsync(); #pragma warning restore CA2012

stephentoub · 2021-04-01T13:46:27Z

src/libraries/System.Net.Http/src/System/Net/Http/SocketsHttpHandler/HttpConnectionPool.cs

+                if (connection.LifetimeExpired(nowTicks, _poolManager.Settings._pooledConnectionLifetime))
                {
-                    if (NetEventSource.Log.IsEnabled()) connection.Trace("Found usable connection in pool.");
-                    return new ValueTask<HttpConnection?>(connection);
+                    if (NetEventSource.Log.IsEnabled()) connection.Trace("Found expired connection in pool.");
+                    connection.Dispose();
+                    continue;
                }
-                else
+
+                if (!connection.PrepareForReuse(async))
                {
                    if (NetEventSource.Log.IsEnabled()) connection.Trace("Found invalid connection in pool.");
                    connection.Dispose();
+                    continue;
                }


The bodies of these if blocks are identical except for one word in the traced message. If that distinction isn't important, we could combine these:

if (connection.LifetimeExpired(nowTicks, _poolManager.Settings._pooledConnectionLifetime) || !connection.PrepareForReuse(async)) { if (NetEventSource.Log.IsEnabled()) connection.Trace("Found expired or invalid connection in pool."); connection.Dispose(); continue; }

The distinction seems important to me.

src/libraries/System.Net.Http/src/System/Net/Http/SocketsHttpHandler/HttpConnectionPool.cs

src/libraries/System.Net.Http/src/System/Net/Http/SocketsHttpHandler/HttpConnection.cs

…andler/Http2Connection.cs Co-authored-by: Stephen Toub <[email protected]>

geoffkizer · 2021-04-04T07:17:23Z

/azp run runtime-libraries-coreclr outerloop

azure-pipelines · 2021-04-04T07:17:37Z

Azure Pipelines successfully started running 1 pipeline(s).

…shim_mono # By Aaron Robinson (10) and others # Via GitHub * upstream/main: (108 commits) [mbr] Add Apple sample (dotnet#50740) make EstablishProxyTunnelAsync throw on failure status code from proxy (dotnet#50763) Improve RGB Min Max evaluation performance by using 2 or 3 comparison… (dotnet#50622) [mono] More domain cleanups (dotnet#50479) Fix Crossgen2 of PlatformDefaultMemberFunction methods and calls. (dotnet#50754) Disable EventSource generator in design-time builds (dotnet#50741) Fix X509 test failures on Android (dotnet#50301) Do not confuse fgDispBasicBlocks in fgMorphBlocks (dotnet#50703) Enforce 64KB event payload size limit on EventPipe (dotnet#50600) Reorganize CoreCLR native build to reduce CMake reconfigures when the build system is untouched (dotnet#49906) [mbr] Turn on hot reload for iOS, tvOS and MacCatalyst (dotnet#50458) improve connection scavenge logic by doing zero-byte read (dotnet#50545) Resolve call mdtokens when making tier 1 inline observations (dotnet#50675) Annotate APIs in System.Private.Xml (dotnet#49682) Support compiling against OpenSSL 3 headers Change Configuration.Json to use a regular Dictionary. (dotnet#50611) Remove unused BigNumFromBinary P/Invoke (dotnet#50670) Make Ninja the default CMake generator on Windows for the repo (dotnet#49715) [AppleAppBuilder] Entitlements to run tests on catalyst using the JIT (dotnet#50637) [mono] Fix delegate invokes to dynamic methods in mixed mode. (dotnet#50547) ... # Conflicts: # src/mono/dlls/mscordbi/CMakeLists.txt

improve connection usability check in scavenge logic by doing zero-by…

75e0a73

…te read

dotnet-issue-labeler bot added the area-System.Net.Http label Apr 1, 2021

geoffkizer added this to the 6.0.0 milestone Apr 1, 2021

stephentoub approved these changes Apr 1, 2021

View reviewed changes

stephentoub reviewed Apr 1, 2021

View reviewed changes

src/libraries/System.Net.Http/src/System/Net/Http/SocketsHttpHandler/HttpConnection.cs Outdated Show resolved Hide resolved

Geoffrey Kizer and others added 4 commits April 4, 2021 00:06

fix assert re read buffer state in ReadAheadWithZeroByteReadAsync

8ac5e55

Update src/libraries/System.Net.Http/src/System/Net/Http/SocketsHttpH…

2891997

…andler/Http2Connection.cs Co-authored-by: Stephen Toub <[email protected]>

Update src/libraries/System.Net.Http/src/System/Net/Http/SocketsHttpH…

7a2b6bd

…andler/Http2Connection.cs Co-authored-by: Stephen Toub <[email protected]>

simplify waiter dequeue

34870de

geoffkizer merged commit 7ebc067 into dotnet:main Apr 5, 2021

runfoapp bot mentioned this pull request Apr 5, 2021

Microsoft.Extensions.Hosting.Tests.HostTests.CreateDefaultBuilder_SecretsDoesReload fails on Linux #50751

Closed

ghost locked as resolved and limited conversation to collaborators May 5, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve connection scavenge logic by doing zero-byte read #50545

improve connection scavenge logic by doing zero-byte read #50545

geoffkizer commented Apr 1, 2021

ghost commented Apr 1, 2021

geoffkizer commented Apr 1, 2021

azure-pipelines bot commented Apr 1, 2021

stephentoub left a comment

stephentoub Apr 1, 2021

stephentoub Apr 1, 2021

geoffkizer Apr 4, 2021

geoffkizer commented Apr 4, 2021

azure-pipelines bot commented Apr 4, 2021

improve connection scavenge logic by doing zero-byte read #50545

improve connection scavenge logic by doing zero-byte read #50545

Conversation

geoffkizer commented Apr 1, 2021

ghost commented Apr 1, 2021

geoffkizer commented Apr 1, 2021

azure-pipelines bot commented Apr 1, 2021

stephentoub left a comment

Choose a reason for hiding this comment

stephentoub Apr 1, 2021

Choose a reason for hiding this comment

stephentoub Apr 1, 2021

Choose a reason for hiding this comment

geoffkizer Apr 4, 2021

Choose a reason for hiding this comment

geoffkizer commented Apr 4, 2021

azure-pipelines bot commented Apr 4, 2021