-
Notifications
You must be signed in to change notification settings - Fork 853
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CredHub interactions take 97% of ATC CPU time #2373
Comments
Pretty sure this is similar, if not the same, as #2300 |
'ello @jama-pivotal! We think this is orthogonal. Proper engineers wot write proper code could tell you more, but it's to do with the way the stream is being read or something, so instantiation isn't relevant (EDIT: unless it establishes a connection on construction?). |
@jama-pivotal Hi, without the fix for #2300 then cloudfoundry/credhub-cli#45 will have no effect on ATC. They are two separate problems however. I actually had to downgrade concourse to v3.13 to expose this issue. The reason #2300 was so bad was because as part of the client construction credhub-cli reads and parses all of the system CA files. With neither issue fixed, the flame graph is all parsing system root certificates, with #2300 fixed, it's all in TLS handshakes to credhub, and with both fixed ATC can actually get on and do work 🎉 |
Hey @jama-pivotal, We (read: @takeyourhatoff) just realised the fix in #2300 doesn't work because the method that does the lazy initialisation had a non-pointer receiver, so was acting on a fresh instance of the We're just testing a fix for that, so disregard this until we've had a chance to address #2300. We'll verify it with a profiler before raising a PR. |
thanks for the context @takeyourhatoff @DanielJonesEB . Gonna watch the PR closely for the fix, looking forward to it 🙏 |
Bug Report
Given a Concourse with secrets backed by CredHub, we were seeing 97% of the ATC's CPU time going to TLS handshakes when it was looking up secrets.
This is because
credhub-cli
(understandably, as a CLI library) was not using HTTP Keepalive, forcing each secret lookup to use a new connection, and this a new TLS handshake. The issue was fixed in this PR: cloudfoundry/credhub-cli#45This has since been fixed as of commit cloudfoundry/credhub-cli@0887387. Bumping to this version will see a significant performance improvement for those using CredHub.
Before:
data:image/s3,"s3://crabby-images/3218a/3218a424a75a3d61c762f14766b650d7535824ef" alt="screen shot 2018-07-06 at 13 27 30"
After:
data:image/s3,"s3://crabby-images/c35b8/c35b86dec13de03352e1a9d8d60d1b7702f1c3ce" alt="screen shot 2018-07-06 at 13 27 37"
Difference:
data:image/s3,"s3://crabby-images/089f4/089f4de45a7af9d3e57f1e2d8d9fb6d7a9d3aa1a" alt="screen shot 2018-07-06 at 13 28 11"
The text was updated successfully, but these errors were encountered: