RUMM-1744 Add E2E tests for Kronos (NTP) #703
Merged: ncreated merged 1 commit into ncreated/RUMM-1744-embed-Kronos-directly-into-SDK from ncreated/RUMM-1744-stabilize-Kronos-tests on Dec 31, 2021
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,195 @@ | ||
/* | ||
* Unless explicitly stated otherwise all files in this repository are licensed under the Apache License Version 2.0. | ||
* This product includes software developed at Datadog (https://www.datadoghq.com/). | ||
* Copyright 2019-2020 Datadog, Inc. | ||
*/ | ||
|
||
@testable import Datadog | ||
|
||
class KronosE2ETests: E2ETests { | ||
/// The logger sending additional telemetry on Kronos execution. These logs are available in the Mobile Integrations org. | ||
private var logger: Logger! // swiftlint:disable:this implicitly_unwrapped_optional | ||
|
||
override func setUp() { | ||
super.setUp() | ||
logger = Logger | ||
.builder | ||
.set(loggerName: "kronos-e2e") | ||
.build() | ||
} | ||
|
||
override func tearDown() { | ||
logger = nil | ||
super.tearDown() | ||
} | ||
|
||
/// TODO: RUMM-1859: Add E2E tests for monitoring Kronos in nightly tests | ||
func test_kronos_clock_performs_sync_using_datadog_ntp_pool() { // E2E:wip | ||
/// The result of `KronosClock.sync()`. | ||
struct KronosSyncResult { | ||
/// First received server date. | ||
var firstReceivedDate: Date? = nil | ||
/// First received server offset. | ||
var firstReceivedOffset: TimeInterval? = nil | ||
/// Last received server date. | ||
var lastReceivedDate: Date? = nil | ||
/// Last received server offset. | ||
var lastReceivedOffset: TimeInterval? = nil | ||
/// Device date measured at the moment of receiving any server date. Used for additional debugging and comparison. | ||
var measuredDeviceDate = Date() | ||
} | ||
|
||
func performKronosSync(using pool: String) -> KronosSyncResult { | ||
KronosClock.reset() | ||
defer { KronosClock.reset() } | ||
|
||
// Given | ||
let numberOfSamplesForEachIP = 2 // exchange only 2 samples with each resolved IP, to keep the test quick | ||
|
||
// Each IP (each server) is queried in parallel, but its samples are obtained sequentially. | ||
// Here we compute the test timeout so that all (parallel) servers finish querying their (sequential) samples | ||
// within `testTimeout`, assuming a +50% margin. This should prevent flakiness caused by the test timing out. | ||
let testTimeout = kronosDefaultTimeout * Double(numberOfSamplesForEachIP) * 1.5 | ||
|
||
// When | ||
let completionExpectation = expectation(description: "KronosClock.sync() calls completion closure") | ||
var result = KronosSyncResult() | ||
|
||
KronosClock.sync( | ||
from: pool, | ||
samples: numberOfSamplesForEachIP, | ||
first: { date, offset in // this closure may never be called if all samples to all servers failed | ||
result.firstReceivedDate = date | ||
result.firstReceivedOffset = offset | ||
result.measuredDeviceDate = Date() | ||
}, | ||
completion: { date, offset in // this closure should always be called | ||
result.lastReceivedDate = date | ||
result.lastReceivedOffset = offset | ||
result.measuredDeviceDate = Date() | ||
completionExpectation.fulfill() | ||
} | ||
) | ||
|
||
// Then | ||
|
||
// We don't expect a timeout on `completionExpectation`. The number of samples and the individual sample timeout | ||
// are configured so that `KronosNTPClient` always fulfills the `completionExpectation`. | ||
waitForExpectations(timeout: testTimeout) | ||
|
||
return result | ||
} | ||
|
||
// Run test for each Datadog NTP pool: | ||
DateCorrector.datadogNTPServers.forEach { ddNTPPool in | ||
let result = measure(resourceName: DD.PerfSpanName.fromCurrentMethodName()) { | ||
performKronosSync(using: ddNTPPool) | ||
} | ||
|
||
// Report result for this pool: | ||
if result.firstReceivedDate != nil, result.firstReceivedOffset != nil, let serverDate = result.lastReceivedDate, let serverOffset = result.lastReceivedOffset { | ||
// We consider the `KronosClock.sync()` result to be consistent only if it has both `first` and `last` time values set. | ||
// We log a consistent result as an INFO log that can be seen in the Mobile Integrations org. | ||
logger.info("KronosClock.sync() completed with consistent result for \(ddNTPPool)", attributes: [ | ||
"serverOffset_measured": serverDate.timeIntervalSince(result.measuredDeviceDate), | ||
"serverOffset_received": serverOffset, | ||
"serverDate_received": iso8601DateFormatter.string(from: serverDate), | ||
]) | ||
} else { | ||
// An inconsistent result may indicate a flaky execution, e.g. if the network was unreachable or if **all** NTP calls timed out. | ||
// We track an inconsistent result as a WARN log that is watched by the E2E monitor. | ||
logger.warn("KronosClock.sync() completed with inconsistent result for \(ddNTPPool)", attributes: [ | ||
"serverDate_firstReceived": result.firstReceivedDate.flatMap { iso8601DateFormatter.string(from: $0) }, | ||
"serverDate_lastReceived": result.lastReceivedDate.flatMap { iso8601DateFormatter.string(from: $0) }, | ||
"serverOffset_firstReceived": result.firstReceivedOffset, | ||
"serverOffset_lastReceived": result.lastReceivedOffset, | ||
]) | ||
} | ||
} | ||
} | ||
|
||
/// TODO: RUMM-1859: Add E2E tests for monitoring Kronos in nightly tests | ||
func test_kronos_ntp_client_queries_both_ipv4_and_ipv6_ips() { // E2E:wip | ||
/// The result of `KronosNTPClient.query(pool:)`. | ||
struct KronosNTPClientQueryResult { | ||
/// Partial offsets received for each NTP packet sent to each resolved IP. | ||
var receivedOffsets: [TimeInterval?] = [] | ||
/// Expected number of NTP packets to send. | ||
var expectedNumberOfSamples = 0 | ||
/// Actual number of NTP packets that completed. | ||
var numberOfCompletedSamples = 0 | ||
} | ||
|
||
func performKronosNTPClientQuery() -> KronosNTPClientQueryResult { | ||
let testTimeout: TimeInterval = 30 | ||
|
||
// Given | ||
let pool = "2.datadog.pool.ntp.org" // a pool resolved to multiple IPv4 and IPv6 addresses (e.g. 4 + 4) | ||
let numberOfSamplesForEachIP = 2 // exchange only 2 samples with each resolved IP, to keep the test quick | ||
|
||
// Each IP (each server) is queried in parallel, but its samples are obtained sequentially. | ||
// Here we compute the individual sample timeout so that all (parallel) servers finish querying their (sequential) samples | ||
// within `testTimeout`, assuming a -30% margin. This should prevent flakiness caused by the test timing out. | ||
let timeoutForEachSample = (testTimeout / Double(numberOfSamplesForEachIP)) * 0.7 | ||
|
||
// When | ||
let completionExpectation = expectation(description: "It completes all samples for all IPs") | ||
var result = KronosNTPClientQueryResult() | ||
|
||
KronosNTPClient() | ||
.query( | ||
pool: pool, | ||
numberOfSamples: numberOfSamplesForEachIP, | ||
maximumServers: .max, // query all resolved IPs in the pool - to include both IPv4 and IPv6 | ||
timeout: timeoutForEachSample | ||
) { offset, completed, total in | ||
result.receivedOffsets.append(offset) | ||
result.numberOfCompletedSamples = completed | ||
result.expectedNumberOfSamples = total | ||
|
||
if completed == total { | ||
completionExpectation.fulfill() | ||
} | ||
} | ||
|
||
// Then | ||
|
||
// We don't expect a timeout on `completionExpectation`. The number of samples and the individual sample timeout | ||
// are configured so that `KronosNTPClient` always fulfills the `completionExpectation`. | ||
// In the worst case, it fulfills it while recording only `nil` offsets, which means that all NTP queries | ||
// timed out or failed. | ||
waitForExpectations(timeout: testTimeout) | ||
|
||
return result | ||
} | ||
|
||
// Run test: | ||
let result = measure(resourceName: DD.PerfSpanName.fromCurrentMethodName()) { | ||
performKronosNTPClientQuery() | ||
} | ||
|
||
// Report result: | ||
if result.receivedOffsets.contains(where: { offset in offset != nil }) { | ||
// We consider `KronosNTPClient.query(pool:)` result to be consistent if it received at least one offset. | ||
let receivedOffsets: [String] = result.receivedOffsets.map { offset in | ||
if let offset = offset { | ||
return "\(offset)" | ||
} else { | ||
return "(nil)" | ||
} | ||
} | ||
logger.info( | ||
"KronosNTPClient.query(pool:) completed with consistent result receiving \(result.numberOfCompletedSamples)/\(result.expectedNumberOfSamples) NTP packets", | ||
attributes: [ | ||
"offsets_received": receivedOffsets | ||
] | ||
) | ||
} else { | ||
// An inconsistent result may indicate a flaky execution, e.g. if the network was unreachable or if **all** NTP calls timed out. | ||
// We track an inconsistent result as a WARN log that is watched by the E2E monitor. | ||
logger.warn( | ||
"KronosNTPClient.query(pool:) completed with inconsistent result receiving \(result.numberOfCompletedSamples)/\(result.expectedNumberOfSamples) NTP packets" | ||
) | ||
} | ||
} | ||
} |
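A minimal sketch of the two timeout budgets used in the tests above, not part of the merged diff. The first test derives the overall test timeout from a fixed per-sample timeout (+50% margin); the second derives the per-sample timeout from a fixed 30 s test timeout (-30% margin). The helper names `overallTimeout` and `perSampleTimeout` are hypothetical, and `assumedDefaultSampleTimeout` is an assumed stand-in because the real value of `kronosDefaultTimeout` is not shown in this diff.

```swift
import Foundation

// ASSUMPTION: placeholder for `kronosDefaultTimeout`; the SDK's real value is not visible in this diff.
let assumedDefaultSampleTimeout: TimeInterval = 6
let samplesPerIP = 2

/// Hypothetical helper: overall test timeout derived from a fixed per-sample timeout.
/// Servers are queried in parallel, so only the sequential samples of one server add up.
func overallTimeout(sampleTimeout: TimeInterval, samplesPerIP: Int, margin: Double = 1.5) -> TimeInterval {
    return sampleTimeout * Double(samplesPerIP) * margin
}

/// Hypothetical helper: per-sample timeout derived from a fixed overall test timeout.
/// A margin below 1 leaves headroom so that all sequential samples finish before the test times out.
func perSampleTimeout(testTimeout: TimeInterval, samplesPerIP: Int, margin: Double = 0.7) -> TimeInterval {
    return (testTimeout / Double(samplesPerIP)) * margin
}

// Budget as in `test_kronos_clock_performs_sync_using_datadog_ntp_pool()`.
let syncTestTimeout = overallTimeout(sampleTimeout: assumedDefaultSampleTimeout, samplesPerIP: samplesPerIP)

// Budget as in `test_kronos_ntp_client_queries_both_ipv4_and_ipv6_ips()`.
let queryTestTimeout: TimeInterval = 30
let querySampleTimeout = perSampleTimeout(testTimeout: queryTestTimeout, samplesPerIP: samplesPerIP)

// Worst case for one server: every sequential sample runs into its timeout.
let worstCasePerServer = querySampleTimeout * Double(samplesPerIP)

print("sync test timeout: \(syncTestTimeout) s")
print("query sample timeout: \(querySampleTimeout) s, worst case per server: \(worstCasePerServer) s")
```

In both cases the worst-case duration for a single server (all of its sequential samples timing out) stays below the timeout passed to `waitForExpectations(timeout:)`, which is why the tests expect the completion to be reached even when every NTP query fails.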
Review comment (on diff lines +181 to +186): Example WARN log received from here: