Implementing metrics #125

devbugging · 2024-03-04T13:02:26Z

We need comprehensive metrics to measure the performance and resource usage of our APIs. This will help us understand the performance of different API methods and track various states and errors.

Performance

Give feedback

Measure end-to-end request/response time by method call (track the time taken from the start of a request to the return of the response for each method call to understand relative performance and user experience using percentiles).
Monitor time it takes for Flow transaction to be submitted and a result is returned, with the transaction status
API requests per time interval metric
API calls by API endpoint (most used to least used calls)
Options

Measuring performance can/should be done using tracing, so we can have multiple sub-calls measured as well. Ideally, we should have all the network calls as a sub-trace as well as any APIs. Traces should be enabled with a flag and not on by default.
Each API response time should also submit a simple metric measuring the time it took for the request to be processed.

Be careful to also include websocket request/responses metrics.

State

Give feedback

Ingestion index health is a boolean value that is being set to false if the latest indexed EVM height falls behind the latest EVM height by X
Execution EVM traces index health is a boolean value that should be set to false if there are any traces that failed to download
API errors should be submitted to a counter metric
Report fees paid on Flow and EVM side as a metric
Metric for users EVM contract addresses which are being called
Database size (folder size)
Options

Ingestion

Give feedback

EVM height should be submitted as a value on event ingestion
Trace download failures should be recorded
Options

We should use prometheus and open telemetry to collect the traces and metrics.

m-Peter · 2024-03-14T15:15:26Z

For JSON-RPC endpoints that are served over WebSocket, such as subscriptions and filtering of entities, we should add some dedicated metrics as well, e.g. active connections etc.

franklywatson · 2024-03-14T16:03:48Z

@m-Peter also suggested tracking DB size over time and also DB query time

devbugging · 2024-06-11T15:58:59Z

Add metrics for index health. Trace index health is dependent on the trace download success, if one is failed the index becomes unhealthy. Transaction index health is dependent on how far back the latest ingested event is from the latest height on the network. If too far behind the index is unhealthy.

devbugging · 2024-07-30T16:19:39Z

Another high priority metric is: #384

j1010001 · 2024-08-29T16:37:35Z

First set of metrics is implemented and Grafana Dashboard created: https://flowfoundation.grafana.net/d/PkvVJj4Mz/mainnet-general?from=now-24h&to=now&timezone=America%2FVancouver

github-project-automation bot added this to 🌊 Flow 4D Mar 4, 2024

devbugging mentioned this issue Mar 4, 2024

[EPIC] Production Stable Gateway #126

Closed

devbugging assigned m-Peter and devbugging Mar 29, 2024

onflow deleted a comment from m-Peter Jun 14, 2024

illia-malachyn moved this to 🧊 Backlog in 🌊 Flow 4D Jul 17, 2024

illia-malachyn moved this from 🧊 Backlog to 🔖 Ready for Pickup in 🌊 Flow 4D Jul 17, 2024

This was referenced Jul 17, 2024

Integrate prometheus to evm gateway #359

Closed

Integrate prometheus to evm gateway #360

Merged

j1010001 closed this as completed Aug 29, 2024

github-project-automation bot moved this from 🔖 Ready for Pickup to ✅ Done in 🌊 Flow 4D Aug 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementing metrics #125

Implementing metrics #125

devbugging commented Mar 4, 2024 •

edited

Loading

Performance

State

Ingestion

m-Peter commented Mar 14, 2024

franklywatson commented Mar 14, 2024

devbugging commented Jun 11, 2024

devbugging commented Jul 30, 2024

j1010001 commented Aug 29, 2024

Implementing metrics #125

Implementing metrics #125

Comments

devbugging commented Mar 4, 2024 • edited Loading

Performance

State

Ingestion

m-Peter commented Mar 14, 2024

franklywatson commented Mar 14, 2024

devbugging commented Jun 11, 2024

devbugging commented Jul 30, 2024

j1010001 commented Aug 29, 2024

devbugging commented Mar 4, 2024 •

edited

Loading