Teleport 15 Test Plan #36663

r0mant · 2024-01-13T00:02:13Z

Manual Testing Plan

Below are the items that should be manually tested with each release of Teleport.
These tests should be run on both a fresh installation of the version to be released
as well as an upgrade of the previous version of Teleport.

User accounting @atburke

Verify that active interactive sessions are tracked in /var/run/utmp on Linux.
Verify that interactive sessions are logged in /var/log/wtmp on Linux.

Combinations @capnspacehook

For some manual testing, many combinations need to be tested. For example, for
interactive sessions the 12 combinations are below.

Add an agentless Node in a local cluster.
- Connect using OpenSSH.
- Connect using Teleport.
- Connect using the Web UI.
- Remove the Node (but keep its custom CA in sshd config).
  - Verify that it fails to connect when using OpenSSH.
  - Verify that it fails to connect when using Teleport.
  - Verify that it fails to connect when using the Web UI.
Add a Teleport Node in a local cluster.
- Connect using OpenSSH.
- Connect using Teleport.
- Connect using the Web UI.
Add an agentless Node in a remote (leaf) cluster.
- Connect using OpenSSH from root cluster.
- Connect using Teleport from root cluster.
- Connect using the Web UI from root cluster.
- Remove the Node (but keep its custom CA in sshd config).
  - Verify that it fails to connect when using OpenSSH from root cluster.
  - Verify that it fails to connect when using Teleport from root cluster.
  - Verify that it fails to connect when using the Web UI from root cluster.
Add a Teleport Node in a remote (leaf) cluster.
- Connect using OpenSSH from root cluster.
- Connect using Teleport from root cluster.
- Connect using the Web UI from root cluster.

Teleport with EKS/GKE @AntonAM

Deploy Teleport on a single EKS cluster
Deploy Teleport on two EKS clusters and connect them via trusted cluster feature
Deploy Teleport Proxy outside GKE cluster fronting connections to it (use this script to generate a kubeconfig)
Deploy Teleport Proxy outside EKS cluster fronting connections to it (use this script to generate a kubeconfig)

Teleport with multiple Kubernetes clusters @tigrato

Note: you can use GKE or EKS or minikube to run Kubernetes clusters.
Minikube is the only caveat - it's not reachable publicly so don't run a proxy there.

Kubernetes auto-discovery @AntonAM

Kubernetes Secret Storage @AntonAM

Kubernetes Secret storage for Agent's Identity
- Install Teleport agent with a short-lived token
  - Validate if the Teleport is installed as a Kubernetes Statefulset
  - Restart the agent after token TTL expires to see if it reuses the same identity.
- Force cluster CA rotation

Kubernetes Pod RBAC @AntonAM

Teleport with FIPS mode @bl-nero

Perform trusted clusters, Web and SSH sanity check with all teleport components deployed in FIPS mode.

ACME @bl-nero

Teleport can fetch TLS certificate automatically using ACME protocol.

Migrations @tigrato

Migrate trusted clusters from 2.4.0 to 2.5.0
- Migrate auth server on main cluster, then rest of the servers on main cluster
  SSH should work for both main and old clusters
- Migrate auth server on remote cluster, then rest of the remote cluster
  SSH should work

Command Templates

When interacting with a cluster, the following command templates are useful:

OpenSSH

# when connecting to the recording proxy, `-o 'ForwardAgent yes'` is required.
ssh -o "ProxyCommand ssh -o 'ForwardAgent yes' -p 3023 %[email protected] -s proxy:%h:%p" \
  node.example.com

# the above command only forwards the agent to the proxy, to forward the agent
# to the target node, `-o 'ForwardAgent yes'` needs to be passed twice.
ssh -o "ForwardAgent yes" \
  -o "ProxyCommand ssh -o 'ForwardAgent yes' -p 3023 %[email protected] -s proxy:%h:%p" \
  node.example.com

# when connecting to a remote cluster using OpenSSH, the subsystem request is
# updated with the name of the remote cluster.
ssh -o "ProxyCommand ssh -o 'ForwardAgent yes' -p 3023 %[email protected] -s proxy:%h:%[email protected]" \
  node.foo.com

Teleport

# when connecting to a OpenSSH node, remember `-p 22` needs to be passed.
tsh --proxy=proxy.example.com --user=<username> --insecure ssh -p 22 node.example.com

# an agent can be forwarded to the target node with `-A`
tsh --proxy=proxy.example.com --user=<username> --insecure ssh -A -p 22 node.example.com

# the --cluster flag is used to connect to a node in a remote cluster.
tsh --proxy=proxy.example.com --user=<username> --insecure ssh --cluster=foo.com -p 22 node.foo.com

Teleport with SSO Providers

GitHub External SSO @capnspacehook

Teleport OSS
- GitHub organization without external SSO succeeds
- GitHub organization with external SSO fails
Teleport Enterprise
- GitHub organization without external SSO succeeds
- GitHub organization with external SSO succeeds

`tctl sso` family of commands @flyinghermit

For help with setting up sso connectors, check out the [Quick GitHub/SAML/OIDC Setup Tips]

tctl sso configure helps to construct a valid connector definition:

tctl sso configure github ... creates valid connector definitions
tctl sso configure oidc ... creates valid connector definitions
tctl sso configure saml ... creates valid connector definitions

tctl sso test test a provided connector definition, which can be loaded from
file or piped in with tctl sso configure or tctl get --with-secrets. Valid
connectors are accepted, invalid are rejected with sensible error messages.

SSO login on remote host @atburke

SSO login on a remote host

tsh should be running on a remote host (e.g. over an SSH session) and use the
local browser to complete and SSO login. Run
tsh login --callback <remote.host>:<port> --bind-addr localhost:<port> --auth <auth>
on the remote host. Note that the --callback URL must be able to resolve to the
--bind-addr over HTTPS.

Teleport Plugins @EdwardDowling

Teleport Operator @hugoShaka

Test deploying a Teleport cluster with the teleport-cluster Helm chart and the operator enabled
Test deploying a standalone operator against Teleport Cloud
Test that operator can reconcile
- TeleportUser
- TeleportRole
- TeleportProvisionToken

AWS Node Joining @hugoShaka

Docs

On EC2 instance with ec2:DescribeInstances permissions for local account:
TELEPORT_TEST_EC2=1 go test ./integration -run TestEC2NodeJoin
On EC2 instance with any attached role:
TELEPORT_TEST_EC2=1 go test ./integration -run TestIAMNodeJoin
EC2 Join method in IoT mode with node and auth in different AWS accounts
IAM Join method in IoT mode with node and auth in different AWS accounts

Kubernetes Node Joining @hugoShaka

Join a Teleport node running in the same Kubernetes cluster via a Kubernetes in-cluster ProvisionToken
Join a tbot instance running in a different Kubernetes cluster as Teleport with a Kubernetes JWKS ProvisionToken

Azure Node Joining @marcoandredinis

Docs

Join a Teleport node running in an Azure VM

GCP Node Joining @hugoShaka

Docs

Join a Teleport node running in a GCP VM.

Cloud Labels @hugoShaka @marcoandredinis

Create an EC2 instance with tags in instance metadata enabled
and with tag foo: bar. Verify that a node running on the instance has label
aws/foo=bar.
Create an Azure VM with tag foo: bar. Verify that a node running on the
instance has label azure/foo=bar.

Passwordless @codingllama

This feature has additional build requirements, so it should be tested with a
pre-release build from Drone (eg:
https://get.gravitational.com/tsh-v10.0.0-alpha.2.pkg).

This sections complements "Users -> Managing MFA devices". tsh binaries for
each operating system (Linux, macOS and Windows) must be tested separately for
FIDO2 items.

Device Trust @codingllama

Device Trust requires Teleport Enterprise.

This feature has additional build requirements, so it should be tested with a
pre-release build from Drone (eg:
https://get.gravitational.com/teleport-ent-v10.0.0-alpha.2-linux-amd64-bin.tar.gz).

Client-side enrollment requires a signed tsh for macOS, make sure to use the
tsh binary from tsh.app.

A simple formula for testing device authorization is:

# Before enrollment.
# Replace with other kinds of access, as appropriate (db, kube, etc)
tsh ssh node-that-requires-device-trust
> ERROR: ssh: rejected: administratively prohibited (unauthorized device)

# Register the device.
# Get the serial number from `tsh device asset-tag`.
tctl devices add --os=macos --asset-tag=<SERIAL_NUMBER> --enroll

# Enroll the device.
tsh device enroll --token=<TOKEN_FROM_COMMAND_ABOVE>
tsh logout; tsh login

# After enrollment
tsh ssh node-that-requires-device-trust
> $

Hardware Key Support @jakule

Hardware Key Support is an Enterprise feature and is not available for OSS.

You will need a YubiKey 4.3+ to test this feature.

This feature has additional build requirements, so it should be tested with a pre-release build from Drone (eg: https://get.gravitational.com/teleport-ent-v11.0.0-alpha.2-linux-amd64-bin.tar.gz).

Server Access

These tests should be carried out sequentially. tsh tests should be carried out on Linux, MacOS, and Windows.

tsh login as user with Webauthn login and no hardware key requirement.
Request a role with role.role_options.require_session_mfa: hardware_key - tsh login --request-roles=hardware_key_required

Assuming the role should force automatic re-login with yubikey
tsh ssh
- Requires yubikey to be connected for re-login
- Prompts for per-session MFA

Request a role with role.role_options.require_session_mfa: hardware_key_touch - tsh login --request-roles=hardware_key_touch_required

Assuming the role should force automatic re-login with yubikey
- Prompts for touch if not cached (last touch within 15 seconds)
tsh ssh
- Requires yubikey to be connected for re-login
- Prompts for touch if not cached

tsh logout and tsh login as the user with no hardware key requirement.
Upgrade auth settings to auth_service.authentication.require_session_mfa: hardware_key

Using the existing login session (tsh ls) should force automatic re-login with yubikey
tsh ssh
- Requires yubikey to be connected for re-login
- Prompts for per-session MFA

Upgrade auth settings to auth_service.authentication.require_session_mfa: hardware_key_touch

Using the existing login session (tsh ls) should force automatic re-login with yubikey
- Prompts for touch if not cached
tsh ssh
- Requires yubikey to be connected for re-login
- Prompts for touch if not cached

Other

Set auth_service.authentication.require_session_mfa: hardware_key_touch in your cluster auth settings.

Database Access: tsh proxy db --tunnel

HSM Support @nklaassen

Docs

Moderated session @strideynet

Using tsh join an SSH session as two moderators (two separate terminals, role requires one moderator).

Ctrl+C in the Implement a prototype for a proxying SSH server that implements concepts expressed in readme #1 terminal should disconnect the moderator.
Ctrl+C in the Implement a functional prototype #2 terminal should disconnect the moderator and terminate the session as session has no moderator.

Using tsh join an SSH session as two moderators (two separate terminals, role requires one moderator).

t in any terminal should terminate the session for all participants.

Performance @rosstimothy @fspmarshall

Scaling Test

Scale up the number of nodes/clusters a few times for each configuration below.

Verify that there are no memory/goroutine/file descriptor leaks
Compare the baseline metrics with the previous release to determine if resource usage has increased
Restart all Auth instances and verify that all nodes/clusters reconnect

Perform reverse tunnel node scaling tests with actual nodes for Cloud:

DynamoDB - 30k

Perform simulated node scaling tests with actual nodes via
tctl loadtest node-heartbeats --count=15000 --ttl=2m --interval=1m --labels=2 --concurrency=32 for:

etcd - 30k
Firestore - 30k
Postgres - 30k

Perform the following additional scaling tests on etcd:

500 trusted clusters.

Soak Test

Run 30 minute soak test directly against direct and tunnel nodes
and via label based matching. Tests should only be run against a Cloud
tenant.

tsh bench ssh --duration=30m user@reverse-tunnel-node ls
tsh bench ssh --duration=30m user@foo=bar ls
tsh bench ssh --duration=30m --random user@foo ls

Concurrent Session Test

Cluster with 1k reverse tunnel nodes

Run a concurrent session test that will spawn 5 interactive sessions per node in the cluster. Tests should only be run against a Cloud tenant:

tsh bench web sessions --max=5000 user ls

Verify that all 5000 sessions are able to be established.
Verify that tsh and the web UI are still functional.

Robustness

Connectivity Issues:

Verify that a lack of connectivity to Auth does not prevent access to
resources which do not require a moderated session and in async recording
mode from an already issued certificate.
Verify that a lack of connectivity to Auth prevents access to resources
which require a moderated session and in async recording mode from an already
issued certificate.
Verify that an open session is not terminated when all Auth instances
are restarted.

Teleport with Cloud Providers

AWS @camscale

Deploy Teleport to AWS. Using DynamoDB & S3
Deploy Teleport Enterprise to AWS. Using HA Setup https://goteleport.com/docs/deploy-a-cluster/deployments/aws-ha-autoscale-cluster-terraform/

GCP @tigrato

Deploy Teleport to GCP. Using Cloud Firestore & Cloud Storage
Deploy Teleport to GKE. Google Kubernetes engine.
Deploy Teleport Enterprise to GCP.

IBM @hugoShaka

Deploy Teleport to IBM Cloud. Using IBM Database for etcd & IBM Object Store
Deploy Teleport to IBM Cloud Kubernetes.
Deploy Teleport Enterprise to IBM Cloud.

Application Access @mdwn

Database Access @greedy52 + team

TLS Routing @smallinsky

Verify that teleport proxy v2 configuration starts only a single listener for proxy service, in contrast with v1 configuration. @smallinsky
Given configuration:

version: v2
proxy_service:
  enabled: "yes"
  public_addr: ['root.example.com']
  web_listen_addr: 0.0.0.0:3080

There should be total of three listeners, with only *:3080 for proxy service. Given the configuration above, 3022 and 3025 will be opened for other services.

lsof -i -P | grep teleport | grep LISTEN
  teleport  ...  TCP *:3022 (LISTEN)
  teleport  ...  TCP *:3025 (LISTEN)
  teleport  ...  TCP *:3080 (LISTEN) # <-- proxy service

In contrast for the same configuration with version v1, there should be additional ports 3023 and 3024.

lsof -i -P | grep teleport | grep LISTEN
  teleport  ...  TCP *:3022 (LISTEN)
  teleport  ...  TCP *:3025 (LISTEN)
  teleport  ...  TCP *:3023 (LISTEN) # <-- extra proxy service port
  teleport  ...  TCP *:3024 (LISTEN) # <-- extra proxy service port
  teleport  ...  TCP *:3080 (LISTEN) # <-- proxy service

Run Teleport Proxy in multiplex mode auth_service.proxy_listener_mode: "multiplex" @smallinsky
- Trusted cluster
  - Setup trusted clusters using single port setup web_proxy_addr == tunnel_addr
```
kind: trusted_cluster
spec:
  ...
  web_proxy_addr: root.example.com:443
  tunnel_addr: root.example.com:443
  ...
```
Database Access
- Verify that tsh db connect works through proxy running in multiplex mode
  - Postgres @Tener
  - MySQL @Tener
  - MariaDB @greedy52
  - MongoDB @GavinFrazar
  - CockroachDB @Tener
  - Redis @greedy52
  - MSSQL @gabrielcorado
  - Snowflake @gabrielcorado
  - Elasticsearch. @Tener
  - OpenSearch. @Tener
  - Cassandra/ScyllaDB. @gabrielcorado
  - Oracle. @greedy52
- Verify connecting to a database through TLS ALPN SNI local proxy tsh proxy db with a GUI client. @greedy52
- Verify connecting to a database through Teleport Connect. @greedy52
Application Access @smallinsky
- Verify app access through proxy running in multiplex mode
SSH Access @smallinsky
- Connect to a OpenSSH server through a local ssh proxy ssh -o "ForwardAgent yes" -o "ProxyCommand tsh proxy ssh" [email protected]
- Connect to a OpenSSH server on leaf-cluster through a local ssh proxyssh -o "ForwardAgent yes" -o "ProxyCommand tsh proxy ssh --user=%r --cluster=leaf-cluster %h:%p" [email protected]
- Verify tsh ssh access through proxy running in multiplex mode
Kubernetes access: @smallinsky
- Verify kubernetes access through proxy running in multiplex mode, using tsh
- Verify kubernetes access through Teleport Connect
Teleport Proxy single port multiplex mode behind L7 load balancer
- Agent can join through Proxy and maintain reverse tunnel @greedy52
- tsh login and tctl @greedy52
- SSH Access: tsh ssh and tsh config @smallinsky
- Database Access: tsh proxy db and tsh db connect @greedy52
- Application Access: tsh proxy app and tsh aws @smallinsky
- Kubernetes Access: tsh proxy kube@smallinsky

IGS:

The text was updated successfully, but these errors were encountered:

r0mant · 2024-01-13T00:02:25Z

Desktop Access @probakowski @ibeckermayer

Binaries / OS compatibility @fheinecke

Verify that our software runs on the minimum supported OS versions as per
https://goteleport.com/docs/installation/#operating-system-support

Windows @ravicious

tsh runs on the minimum supported Windows version
Teleport Connect runs on the minimum supported Windows version

macOS @camscale

tsh runs on the minimum supported macOS version
tctl runs on the minimum supported macOS version
teleport runs on the minimum supported macOS version
tbot runs on the minimum supported macOS version
Teleport Connect runs on the minimum supported macOS version

Linux @camscale

tsh runs on the minimum supported Linux version
tctl runs on the minimum supported Linux version
teleport runs on the minimum supported Linux version
tbot runs on the minimum supported Linux version
Teleport Connect runs on the minimum supported Linux version

Machine ID @strideynet

Verify you are able to create a new bot user with tctl bots add robot --roles=access. Follow the instructions provided in the output to start tbot
- Directly connecting to the auth server
- Connecting to the auth server via the proxy reverse tunnel
Verify that after the renewal period (default 20m, but this can be reduced via configuration), that newly generated certificates are placed in the destination directory
Verify that sending both SIGUSR1 and SIGHUP to a running tbot process causes a renewal and new certificates to be generated

With an SSH node registered to the Teleport cluster:

Verify you are able to connect to the SSH node using openssh with the generated ssh_config in the destination directory
Verify you are able to connect to the SSH node using tsh with the identity file in the destination directory

With a Postgres DB registered to the Teleport cluster:

Verify you are able to interact with a database using tbot db connect with a database output
Verify you are able to connect to the database using tbot proxy db with a database output
Verify you are able to produce an authenticated tunnel using tbot proxy db --tunnel with a database output and then able to connect to the database through the tunnel without credentials

With a Kubernetes cluster registered to the Teleport cluster:

Verify the kubeconfig produced by a Kubernetes output can be used to run basic commands (e.g kubectl get pods)

With a HTTP application registered to the Teleport cluster:

Verify the certificates produced by an application output can be used directly against the proxy (e.g curl --cert ./out/tlscert --key ./out/key https://httpbin.teleport.example.com/headers)
Verify you are able to produce an authenticated tunnel using tbot proxy app httpbin with an application output and then able to connect to the application through the tunnel without credentials curl localhost:port/headers

Host users creation @lxea

Host users creation docs
Host users creation RFD

Verify host users creation functionality
- non-existing users are created automatically
- users are added to groups
  - non-existing configured groups are created
  - created users are added to the teleport-system group
- users are cleaned up after their session ends
  - cleanup occurs if a program was left running after session ends
- sudoers file creation is successful
  - Invalid sudoers files are not created
- existing host users are not modified
- setting disable_create_host_user: true stops user creation from occurring

CA rotations @fspmarshall

Verify the CA rotation functionality itself (by checking in the backend or with tctl get cert_authority)
- standby phase: only active_keys, no additional_trusted_keys
- init phase: active_keys and additional_trusted_keys
- update_clients and update_servers phases: the certs from the init phase are swapped
- standby phase: only the new certs remain in active_keys, nothing in additional_trusted_keys
- rollback phase (second pass, after completing a regular rotation): same content as in the init phase
- standby phase after rollback: same content as in the previous standby phase
Verify functionality in all phases (clients might have to log in again in lieu of waiting for credentials to expire between phases)
- SSH session in tsh from a previous phase
- SSH session in web UI from a previous phase
- New SSH session with tsh
- New SSH session with web UI
- New SSH session in a child cluster on the same major version
- New SSH session in a child cluster on the previous major version
- New SSH session from a parent cluster
- Application access through a browser
- Application access through curl with tsh apps login
- kubectl get po after tsh kube login
- Database access (no configuration change should be necessary if the database CA isn't rotated, other Teleport functionality should not be affected if only the database CA is rotated)

Proxy Peering

Proxy Peering docs

Verify that Proxy Peering works for the following protocols:
- SSH @rosstimothy
- Kubernetes @AntonAM
- Database @greedy52
- Windows Desktop @ibeckermayer
- App Access @mdwn

EC2 Discovery @marcoandredinis

EC2 Discovery docs

Verify EC2 instance discovery
- Only EC2 instances matching given AWS tags have the installer executed on them
- Only the IAM permissions mentioned in the discovery docs are required for operation
- Custom scripts specified in different matchers are executed
- Custom SSM documents specified in different matchers are executed
- New EC2 instances with matching AWS tags are discovered and added to the teleport cluster
  - Large numbers of EC2 instances (51+) are all successfully added to the cluster
- Nodes that have been discovered do not have the install script run on the node multiple times

Azure Discovery @marcoandredinis

Azure Discovery docs

Verify Azure VM discovery
- Only Azure VMs matching given Azure tags have the installer executed on them
- Only the IAM permissions mentioned in the discovery docs are required for operation
- Custom scripts specified in different matchers are executed
- New Azure VMs with matching Azure tags are discovered and added to the teleport cluster
  - Large numbers of Azure VMs (51+) are all successfully added to the cluster
- Nodes that have been discovered do not have the install script run on the node multiple times

GCP Discovery @atburke

GCP Discovery docs

Verify GCP instance discovery
- Only GCP instances matching given GCP tags have the installer executed on them
- Only the IAM permissions mentioned in the discovery docs are required for operation docs: updates for GCP discovery instructions #36952
- Custom scripts specified in different matchers are executed
- New GCP instances with matching GCP tags are discovered and added to the teleport cluster
  - Large numbers of GCP instances (51+) are all successfully added to the cluster
- Nodes that have been discovered do not have the install script run on the node multiple times

IP Pinning @AntonAM

Add a role with pin_source_ip: true (requires Enterprise) to test IP pinning.
Testing will require changing your IP (that Teleport Proxy sees).
Docs: IP Pinning

Verify that it works for SSH Access
- You can access tunnel node with tsh ssh on root cluster
- You can access direct access node with tsh ssh on root cluster
- You can access tunnel node from Web UI on root cluster
- You can access direct access node from Web UI on root cluster
- You can access tunnel node with tsh ssh on leaf cluster
- You can access direct access node with tsh ssh on leaf cluster
- You can access tunnel node from Web UI on leaf cluster
- You can access direct access node from Web UI on leaf cluster
- You can download files from nodes in Web UI (small arrows at top left corner)
- If you change your IP you no longer can access nodes.
Verify that it works for Kube Access
- You can access Kubernetes cluster through standalone Kube service on root cluster
- You can access Kubernetes cluster through agent inside Kubernetes on root cluster
- You can access Kubernetes cluster through standalone Kube service on leaf cluster
- You can access Kubernetes cluster through agent inside Kubernetes on leaf cluster
- If you change your IP you no longer can access Kube clusters.
Verify that it works for DB Access
- You can access DB servers on root cluster
- You can access DB servers on leaf cluster
- If you change your IP you no longer can access DB servers.
Verify that it works for App Access
- You can access App service on root cluster
- You can access App service on leaf cluster
- If you change your IP you no longer can access App services.
Verify that it works for Desktop Access
- You can access Desktop service on root cluster
- You can access Desktop service on leaf cluster
- If you change your IP you no longer can access Desktop services.

Assist @jakule @ryanclark @tigrato @xacrimon @justinas

Assist is not supported by tsh and WebUI is the only way to use it.
Assist test plan is in the core section instead of WebUI as most functionality is implemented in the core.

Configuration
- Assist is disabled by default (OSS, Enterprise)
- Assist can be enabled in the configuration file.
- Assist is disabled in the Cloud.
- Assist is enabled by default in the Cloud Team plan.
- Assist is always disabled when etcd is used as a backend.
Conversations
- A new conversation can be started.
- SSH command can be executed on one server.
- SSH command can be executed on multiple servers.
- SSH command can be executed on a node with per session MFA enabled.
- Execution output is explained when it fits the context window.
- Assist can list all nodes/execute a command on all nodes (using embeddings).
- Access request can be created.
- Access request is created when approved.
- Conversation title is set after the first message.
SSH integration
- Assist icon is visible in WebUI's Terminal
- A Bash command can be generated in the above window.
- When an output is selected in the Terminal "Explain" option is available, and it generates the summary.

Resources

Quick GitHub/SAML/OIDC Setup Tips

strideynet · 2024-01-16T14:05:01Z

#36732

codingllama · 2024-01-16T17:14:25Z

Relogin retries too aggressive, swallows legitimate errors:

tsh attempts relogin on a failure to remove the last MFA device #36749

Edit: I'm ticking off "with second_factor: on in auth_service, should fail" in the users section - technically the condition is working, the relogin issue is just bigger than it. I have drafted a fix at #36866.

Edit2: Fixed by #36866.

ibeckermayer · 2024-01-17T03:10:02Z

Desktop access recordings UI bug: #36781

awaiting: #36843

strideynet · 2024-01-17T12:57:48Z

Certificate presented by root proxy for leaf agentless nodes is not trusted by client #36801

bl-nero · 2024-01-17T18:32:07Z

No audit log entries when SCP denied: #36820

ibeckermayer · 2024-01-17T19:17:48Z

Desktop session clipboard and directory sharing icon state is unclear: #36825

ibeckermayer · 2024-01-17T19:38:25Z

Desktop access recording progress bar fails to reach the end in case of error: #36827

awaiting: #36843

rosstimothy · 2024-01-17T23:33:05Z

Login failure events aren't emitted if MFA is enabled: #36837

bl-nero · 2024-01-18T10:02:05Z

#31410, which I reported for v14, is still not fixed

tigrato · 2024-01-18T12:23:20Z

#36850 hides messages when users aren't allowed to use a certain ssh principal

[Alan]: Fixed by #36866.

ravicious · 2024-01-18T12:28:48Z

tsh does not work on Windows 10 rev. 1607. This is a regression introduced somewhere between 13.0.0 and 13.4.15.

tsh v13.4.15+ does not work on Windows 10 rev. 1607 #36851

[Alan]: Fixed by #36859.

tigrato · 2024-01-18T12:45:03Z

#36852

tigrato · 2024-01-18T19:20:40Z

#36881 which is fixed by #36882

GavinFrazar · 2024-01-18T20:07:03Z

agent locking is broken: https://github.com/gravitational/teleport-private/issues/1340

nklaassen · 2024-01-18T23:57:06Z

second auth with same YubiHSM can't create new CA keys #36838

bl-nero · 2024-01-19T13:10:44Z

A minor issue with tsh in FIPS mode (not sure if related to FIPS itself, though): #36922

bl-nero · 2024-01-19T14:22:54Z

Unable to use RDP in FIPS mode: #36928

hugoShaka · 2024-01-19T15:44:39Z

Nit: tctl bots add message doesn't support non-expiring tokens: #36932

hugoShaka · 2024-01-19T16:01:02Z

Helm chart teleport-cluster creates a token with wrong allow rules when deploying the operator: #36933

atburke · 2024-01-19T23:10:17Z

Jump host fails with unknown certificate authority: #36964

GavinFrazar · 2024-01-20T02:39:31Z

default --db-user selected for database auto-user provisioning via trusted cluster is invalid #36976

rosstimothy · 2024-01-22T14:59:34Z

Performance Test Results

Cloud

Load Tests

30k Resources

5K Concurrent Sessions

Soak Tests

Origin: us-east-1 Target: us-east-1

/usr/local/bin/tsh bench ssh --duration=30m root@node-agents-67588c8d58-26f2m-00 ls

* Requests originated: 17998
* Requests failed: 0

Histogram

Percentile Response Duration
---------- -----------------
25         219 ms
50         224 ms
75         229 ms
90         234 ms
95         236 ms
99         245 ms
100        1228 ms

/usr/local/bin/tsh bench ssh --duration=30m root@fullname=node-agents-67588c8d58-26f2m-00 ls

* Requests originated: 17996
* Requests failed: 0

Histogram

Percentile Response Duration
---------- -----------------
25         481 ms
50         488 ms
75         495 ms
90         501 ms
95         505 ms
99         514 ms
100        1522 ms

/usr/local/bin/tsh bench ssh --duration=30m --random root@all ls

* Requests originated: 17998
* Requests failed: 0

Histogram

Percentile Response Duration
---------- -----------------
25         218 ms
50         224 ms
75         232 ms
90         242 ms
95         251 ms
99         289 ms
100        784 ms

Origin: us-east-1 Target: us-west-2

/usr/local/bin/tsh bench ssh --duration=30m root@ip-172-31-35-106 ls

* Requests originated: 17993
* Requests failed: 0

Histogram

Percentile Response Duration
---------- -----------------
25         783 ms
50         801 ms
75         827 ms
90         850 ms
95         858 ms
99         878 ms
100        1102 ms

Origin: us-west-2 Target: us-east-1

/usr/local/bin/tsh bench ssh --duration=30m root@node-agents-67588c8d58-26f2m-00 ls

* Requests originated: 17992
* Requests failed: 0

Histogram

Percentile Response Duration
---------- -----------------
25         823 ms
50         836 ms
75         845 ms
90         851 ms
95         858 ms
99         870 ms
100        1980 ms

etcd¹

30k Resources

500 Trusted Clusters

Postgres¹

30k Resources

Note

The postgres backend exhibited some odd memory usage behaviors that were not observed when testing other backends.

Firestore¹

30k Resources

30k tests were performed using the simulated method described in the v14 Test Plan ↩ ↩² ↩³

hugoShaka · 2024-01-22T22:21:46Z

IBM changed its admin etcd login process, docs are not working anymore: #37059

nklaassen · 2024-01-22T22:36:42Z

Unable to login to UI when automatic upgrades is misconfigured #37060

nklaassen · 2024-01-23T00:06:55Z

External Audit Storage bootstrap fails #37062

nklaassen · 2024-01-23T01:32:59Z

Can't play session recordings directly from audit log page #37066

Tener · 2024-01-23T07:22:12Z

Cancelling running query doesn't work for CockroachDB #37074

justinas · 2024-01-24T12:23:04Z

Found a seemingly Cloud-specific issue with inviting new users to a cluster. #37159

tcsc · 2024-01-24T13:22:03Z

Okta integration installer doesn't create SSO connector: #37160

gabrielcorado · 2024-01-24T21:56:33Z

Goroutine leak on PostgreSQL database access: #37219

gabrielcorado · 2024-01-25T20:10:43Z

Database Access load test (PostgreSQL and MySQL)

Setup (same as previous test)

EKS with a single node group:

Min: 2, Max: 10 instances.
Instance class: m5.4xlarge
Kubernetes version: 1.27

Teleport cluster (all deployed on the EKS cluster):

DynamoDB backend
3 Auth servers
3 Proxies instances
1 Database Agent

Databases:

Single PostgreSQL RDS instance on a db.t4g.xlarge instance class. Accessed through RDS Proxy with single RW endpoint.
Single MySQL RDS instance on a db.t4g.xlarge instance class. Accessed through RDS Proxy with single RW endpoint.

Note: Databases were configured using discovery running inside the database agent.

tsh bench commands were executed inside the cluster.

MySQL

10 connections/second

# tsh bench mysql mysql-proxy-rdsproxy --db-user=mysql --db-name=mysql --rate=10 --duration=30m

* Requests originated: 18000
* Requests failed: 0
Histogram
Percentile Response Duration
---------- -----------------
25         54 ms
50         57 ms
75         61 ms
90         67 ms
95         72 ms
99         88 ms
100        1087 ms

50 connections/second

# tsh bench mysql mysql-proxy-rdsproxy --db-user=mysql --db-name=mysql --rate=50 --duration=30m

* Requests originated: 89931
* Requests failed: 5
* Last error: io.ReadFull(header) failed. err EOF: connection was bad
Histogram
Percentile Response Duration
---------- -----------------
25         515 ms
50         741 ms
75         983 ms
90         1175 ms
95         1287 ms
99         1539 ms
100        2649 ms

PostgreSQL

10 connections/second

# tsh bench postgres postgres-proxy-rdsproxy --db-user=postgres --db-name=postgres --rate=10 --duration=30m

* Requests originated: 18000
* Requests failed: 0
Histogram
Percentile Response Duration
---------- -----------------
25         69 ms
50         71 ms
75         75 ms
90         80 ms
95         85 ms
99         101 ms
100        2010 ms

50 connections/second

# tsh bench postgres postgres-proxy-rdsproxy --db-user=postgres --db-name=postgres --rate=50 --duration=30m

* Requests originated: 89914
* Requests failed: 21192
* Last error: failed to connect to `host=127.0.0.1 user=teleport database=postgres`: failed to receive message (unexpected EOF)
Histogram
Percentile Response Duration
---------- -----------------
25         731 ms
50         1144 ms
75         1398 ms
90         1648 ms
95         1788 ms
99         2127 ms
100        30815 ms

espadolini · 2024-01-26T17:48:07Z

SSH Connection Resumption

Verify that SSH works, and that resumable SSH is not interrupted across a Teleport Cloud tenant upgrade.

	Standard node	Non-resuming node	Peered node	v14 node	Agentless node
`tsh ssh`
`tsh ssh --no-resume`
`tsh ssh` v14
Teleport Connect
Web UI (not resuming)
OpenSSH (standard `tsh config`)
OpenSSH (adding `--no-resume`)

(resumed connections to peered nodes work with a local tsh after #37352)

Verify that SSH works, and that resumable SSH is not interrupted across a control plane restart (of either the root or the leaf cluster).

	Tunnel node	Direct dial node
`tsh ssh` (local cluster)
`tsh ssh --no-resume` (local cluster)
`tsh ssh` (root cluster)
`tsh ssh --no-resume` (root cluster)
OpenSSH (without `ProxyCommand`)	n/a
OpenSSH's `ssh-keyscan`	n/a

espadolini · 2024-01-26T17:50:50Z

The "SSH gRPC" transport client code doesn't unblock the connection on Close, hanging SSH connections when the background reconnection hits in certain situations (seems to only happen with proxy peering connections).

r0mant added the test-plan A list of tasks required to ship a successful product release. label Jan 13, 2024

rosstimothy added this to the 9.0 milestone Jan 17, 2024

rosstimothy removed this from the 9.0 milestone Jan 17, 2024

ptgott mentioned this issue Jan 23, 2024

Teleport v15 Docs Test Plan #37018

Closed

17 tasks

zmb3 closed this as completed Feb 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Teleport 15 Test Plan #36663

Teleport 15 Test Plan #36663

r0mant commented Jan 13, 2024 •

edited by hugoShaka

Loading

r0mant commented Jan 13, 2024 •

edited by fheinecke

Loading

strideynet commented Jan 16, 2024

codingllama commented Jan 16, 2024 •

edited

Loading

ibeckermayer commented Jan 17, 2024 •

edited

Loading

strideynet commented Jan 17, 2024

bl-nero commented Jan 17, 2024

ibeckermayer commented Jan 17, 2024

ibeckermayer commented Jan 17, 2024 •

edited

Loading

rosstimothy commented Jan 17, 2024

bl-nero commented Jan 18, 2024

tigrato commented Jan 18, 2024 •

edited by codingllama

Loading

ravicious commented Jan 18, 2024 •

edited by codingllama

Loading

tigrato commented Jan 18, 2024

tigrato commented Jan 18, 2024

GavinFrazar commented Jan 18, 2024

nklaassen commented Jan 18, 2024 •

edited

Loading

bl-nero commented Jan 19, 2024

bl-nero commented Jan 19, 2024

hugoShaka commented Jan 19, 2024

hugoShaka commented Jan 19, 2024

atburke commented Jan 19, 2024

GavinFrazar commented Jan 20, 2024

rosstimothy commented Jan 22, 2024

hugoShaka commented Jan 22, 2024

nklaassen commented Jan 22, 2024 •

edited

Loading

nklaassen commented Jan 23, 2024 •

edited

Loading

nklaassen commented Jan 23, 2024 •

edited

Loading

Tener commented Jan 23, 2024

justinas commented Jan 24, 2024

tcsc commented Jan 24, 2024

gabrielcorado commented Jan 24, 2024

gabrielcorado commented Jan 25, 2024 •

edited

Loading

espadolini commented Jan 26, 2024 •

edited

Loading

espadolini commented Jan 26, 2024 •

edited

Loading

Teleport 15 Test Plan #36663

Teleport 15 Test Plan #36663

Comments

r0mant commented Jan 13, 2024 • edited by hugoShaka Loading

Manual Testing Plan

User accounting @atburke

Combinations @capnspacehook

Teleport with EKS/GKE @AntonAM

Teleport with multiple Kubernetes clusters @tigrato

Kubernetes auto-discovery @AntonAM

Kubernetes Secret Storage @AntonAM

Kubernetes Pod RBAC @AntonAM

Teleport with FIPS mode @bl-nero

ACME @bl-nero

Migrations @tigrato

Command Templates

OpenSSH

Teleport

Teleport with SSO Providers

GitHub External SSO @capnspacehook

tctl sso family of commands @flyinghermit

SSO login on remote host @atburke

Teleport Plugins @EdwardDowling

Teleport Operator @hugoShaka

AWS Node Joining @hugoShaka

Kubernetes Node Joining @hugoShaka

Azure Node Joining @marcoandredinis

GCP Node Joining @hugoShaka

Cloud Labels @hugoShaka @marcoandredinis

Passwordless @codingllama

Device Trust @codingllama

Hardware Key Support @jakule

Server Access

Other

HSM Support @nklaassen

Moderated session @strideynet

Performance @rosstimothy @fspmarshall

Scaling Test

Soak Test

Concurrent Session Test

Robustness

Teleport with Cloud Providers

AWS @camscale

GCP @tigrato

IBM @hugoShaka

Application Access @mdwn

Database Access @greedy52 + team

TLS Routing @smallinsky

IGS:

r0mant commented Jan 13, 2024 • edited by fheinecke Loading

Desktop Access @probakowski @ibeckermayer

Binaries / OS compatibility @fheinecke

Windows @ravicious

macOS @camscale

Linux @camscale

Machine ID @strideynet

Host users creation @lxea

CA rotations @fspmarshall

Proxy Peering

EC2 Discovery @marcoandredinis

Azure Discovery @marcoandredinis

GCP Discovery @atburke

IP Pinning @AntonAM

Assist @jakule @ryanclark @tigrato @xacrimon @justinas

Resources

strideynet commented Jan 16, 2024

codingllama commented Jan 16, 2024 • edited Loading

ibeckermayer commented Jan 17, 2024 • edited Loading

strideynet commented Jan 17, 2024

bl-nero commented Jan 17, 2024

ibeckermayer commented Jan 17, 2024

ibeckermayer commented Jan 17, 2024 • edited Loading

rosstimothy commented Jan 17, 2024

bl-nero commented Jan 18, 2024

tigrato commented Jan 18, 2024 • edited by codingllama Loading

ravicious commented Jan 18, 2024 • edited by codingllama Loading

tigrato commented Jan 18, 2024

tigrato commented Jan 18, 2024

GavinFrazar commented Jan 18, 2024

nklaassen commented Jan 18, 2024 • edited Loading

r0mant commented Jan 13, 2024 •

edited by hugoShaka

Loading

`tctl sso` family of commands @flyinghermit

r0mant commented Jan 13, 2024 •

edited by fheinecke

Loading

codingllama commented Jan 16, 2024 •

edited

Loading

ibeckermayer commented Jan 17, 2024 •

edited

Loading

ibeckermayer commented Jan 17, 2024 •

edited

Loading

tigrato commented Jan 18, 2024 •

edited by codingllama

Loading

ravicious commented Jan 18, 2024 •

edited by codingllama

Loading

nklaassen commented Jan 18, 2024 •

edited

Loading

etcd¹

Postgres¹

Firestore¹

nklaassen commented Jan 22, 2024 •

edited

Loading

nklaassen commented Jan 23, 2024 •

edited

Loading

nklaassen commented Jan 23, 2024 •

edited

Loading

gabrielcorado commented Jan 25, 2024 •

edited

Loading

espadolini commented Jan 26, 2024 •

edited

Loading

espadolini commented Jan 26, 2024 •

edited

Loading