Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Export geth metrics on VM testnet #1351

Merged
merged 9 commits into from
Nov 11, 2019
Merged

Conversation

tkporter
Copy link
Contributor

Description

This sends geth metrics from VM geth nodes to stackdriver. Geth metrics are exposed in prometheus format directly on the VM on port 9200, which can only be reached from inside the VPC network. On the kubernetes side, 1 exporter pod is created for each VM node that sends the exposed prometheus metrics to stackdriver. Unfortunately creating 1 pod each was the only way to make pod_id on the stackdriver side be unique for each VM node & behave similar to the existing setup with K8-s based testnets.

Tested

Ran my own VM testnet, saw metrics in stackdriver. Made sure that the existing use of the prometheus-to-sd helm chart would behave the same

Other changes

Made sure the firewall rules only apply to the appropriate instances. Changed prometheus-to-sd helm chart to be a statefulset that can be used for geth metrics in addition to the kube metrics we use the chart for already.

Related issues

Backwards compatibility

Yes

Copy link
Contributor

@timmoreton timmoreton left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍 I think 1 prom-to-sd per VM is fine. setting resource limits might help limit the cost and bin pack better?

@timmoreton
Copy link
Contributor

looks like you need to do yarn run lint too

@timmoreton timmoreton assigned tkporter and unassigned timmoreton Nov 8, 2019
@tkporter
Copy link
Contributor Author

👍 cool, just added resource limits. They're usually at around 1m CPU and 10 Mi memory but I gave them some leeway

@tkporter tkporter added the automerge Have PR merge automatically when checks pass label Nov 11, 2019
@celo-ci-bot-user celo-ci-bot-user merged commit ba60964 into master Nov 11, 2019
@celo-ci-bot-user celo-ci-bot-user deleted the trevor/vm-geth-metrics branch November 11, 2019 17:13
aaronmgdr added a commit that referenced this pull request Nov 14, 2019
* master: (56 commits)
  Adjust e2e transfer and governance tests to match new fee distribution and eliminate ProposerFraction (#1585)
  [Wallet] Add more local currencies (#1698)
  Switch to correct cluster when fauceting (#1687)
  [Wallet] Use the country of the phone number for determining the default local currency (#1684)
  [Wallet] Limit QR code scanner to 1 code per second (#1676)
  Update Dark backgrounds text color (#1677)
  Remove integration sync test
  Minor attestation service fixes (#1680)
  [wallet] Fixed Native phone picker Use native API instead (#1669)
  Fix token addresses for notification service (#1674)
  Add golang to setup docs
  [wallet] Hide invite education copy after invite was redeemed (#1670)
  [Wallet] Add spinner, timer, and tip text to Verification input screen (#1656)
  [Wallet] Fix app deprecation check mechanism (#1358)
  Point end-to-end governance test back to master (#1665)
  Add EpochRewards smart contract to calculate epoch rewards and payments (#1558)
  Optimized Attestation view calls and removal of the reveal TX (#1578)
  Support claim signatures and support Keybase claims (#1575)
  [Wallet] Add timestamp to top banner messages (#1657)
  Export geth metrics on VM testnet (#1351)
  ...

# Conflicts:
#	yarn.lock
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
automerge Have PR merge automatically when checks pass
Projects
None yet
Development

Successfully merging this pull request may close these issues.

VM testnets SBAT export geth metrics
3 participants