---
title: Cluster Topology Configuration
summary: Learn to configure cluster topology to maximize the capacity for disaster recovery.
---
PD schedules according to the topology of the TiKV cluster to maximize the TiKV cluster's capability for disaster recovery. It is recommended that TiKV nodes be physically distributed as much as possible. For example, TiKV nodes can be distributed on different racks or even in different data zones. Based on the topology information of TiKV, the PD scheduler automatically performs scheduling in the background to isolate the replicas of Regions as much as possible, thereby maximizing the capability for disaster recovery.
Before you begin, see Deploy TiDB Using TiDB Ansible (Recommended) and Deploy TiDB Using Docker.
TiKV reports the topological information to PD according to the startup parameter or configuration of TiKV.
Assuming that the topology has three layers: zone > rack > host, use labels to specify the following information:
Startup parameter:
tikv-server --labels zone=<zone>,rack=<rack>,host=<host>
Configuration:
[server]
labels = "zone=<zone>,rack=<rack>,host=<host>"
PD gets the topology of the TiKV cluster through the PD configuration.
[replication]
max-replicas = 3
location-labels = ["zone", "rack", "host"]
After the PD cluster is initialized, you need to use the pd-ctl tool to make online changes:
{{< copyable "shell-regular" >}}
pd-ctl config set location-labels zone,rack,host
The `location-labels` setting needs to correspond to the TiKV `labels` names so that PD can understand that the labels represent the TiKV topology.
Note:

You must configure `location-labels` for PD and `labels` for TiKV at the same time for `labels` to take effect.
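You can check the replication configuration that PD is currently using, which includes `max-replicas` and `location-labels`, to confirm the change. A minimal sketch with pd-ctl:

```shell
# Show PD's current replication configuration, including max-replicas
# and location-labels.
pd-ctl config show replication
```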
Having configured `location-labels`, you can further enhance the topological isolation requirements on TiKV clusters through the `isolation-level` parameter. Assume that you have made a three-layer cluster topology by configuring `location-labels` according to the instructions above: zone -> rack -> host, and have configured the `isolation-level` as follows:
{{< copyable "" >}}
[replication]
isolation-level = "zone"
After the PD cluster is initialized, you need to use the pd-ctl tool to make online changes:
{{< copyable "shell-regular" >}}
pd-ctl config set isolation-level zone
`isolation-level` needs to correspond to one of the `location-labels` names so that PD can understand that this label represents the TiKV topology.
Note:

`isolation-level` is empty by default, which means there is no mandatory isolation level restriction. To set it, you must first configure PD's `location-labels` parameter and ensure that the value of `isolation-level` is one of the `location-labels` names.
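Putting these two requirements together, the online changes would be applied in this order, first the labels and then the isolation level (a sketch that combines the pd-ctl commands shown above):

```shell
# isolation-level requires location-labels to be configured first, and its
# value must be one of the configured label names.
pd-ctl config set location-labels zone,rack,host
pd-ctl config set isolation-level zone
```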
PD makes optimal scheduling decisions according to the topological information. You only need to consider what kind of topology can achieve the desired effect.
If you use 3 replicas and want the TiDB cluster to remain highly available even when a data zone goes down, you need at least 4 data zones: with only 3 zones, the replica lost with the failed zone could not be recreated in a zone that does not already hold one.
Assume that you have 4 data zones, each zone has 2 racks, and each rack has 2 hosts. You can start a TiKV instance on each host:
# zone=z1
tikv-server --labels zone=z1,rack=r1,host=h1
tikv-server --labels zone=z1,rack=r1,host=h2
tikv-server --labels zone=z1,rack=r2,host=h1
tikv-server --labels zone=z1,rack=r2,host=h2
# zone=z2
tikv-server --labels zone=z2,rack=r1,host=h1
tikv-server --labels zone=z2,rack=r1,host=h2
tikv-server --labels zone=z2,rack=r2,host=h1
tikv-server --labels zone=z2,rack=r2,host=h2
# zone=z3
tikv-server --labels zone=z3,rack=r1,host=h1
tikv-server --labels zone=z3,rack=r1,host=h2
tikv-server --labels zone=z3,rack=r2,host=h1
tikv-server --labels zone=z3,rack=r2,host=h2
# zone=z4
tikv-server --labels zone=z4,rack=r1,host=h1
tikv-server --labels zone=z4,rack=r1,host=h2
tikv-server --labels zone=z4,rack=r2,host=h1
tikv-server --labels zone=z4,rack=r2,host=h2
In other words, 16 TiKV instances are distributed across 4 data zones, 8 racks and 16 machines.
In this case, PD schedules the replicas of each Region to different data zones.
- If one of the data zones goes down, the high availability of the TiDB cluster is not affected.
- If the data zone cannot recover within a period of time, PD removes the replicas from this data zone and supplements them in another data zone.
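If you want to see where the replicas of a particular Region actually landed, you can cross-reference the Region's peers with the stores' labels. A minimal sketch with pd-ctl; the Region ID and store ID below are hypothetical and used only for illustration:

```shell
# Show a Region's peers; each peer records the store (TiKV instance) it lives on.
pd-ctl region 1234

# Show that store's details, including the zone/rack/host labels it reported.
pd-ctl store 5
```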
However, if `isolation-level` is set to `zone`, PD ensures that different replicas of a Region are isolated from each other at the zone level, even if guaranteeing this restriction means that the requirement of `max-replicas` cannot be met. For example, a TiKV cluster is distributed across three data zones z1/z2/z3. Each Region has three replicas as required, and PD distributes the three replicas of the same Region to these three data zones respectively. If a power outage occurs in z1 and cannot be recovered within a period of time, PD determines that the Region replicas on z1 are no longer available. However, because `isolation-level` is set to `zone`, PD must strictly guarantee that different replicas of the same Region are not scheduled on the same data zone. Because both z2 and z3 already have replicas, PD does not perform any scheduling under the minimum isolation level restriction of `isolation-level`, even if there are only two replicas at this moment.
Similarly, when `isolation-level` is set to `rack`, the minimum isolation level applies to different racks in the same data zone. With this configuration, isolation at the zone level is guaranteed first if possible. When isolation at the zone level cannot be guaranteed, PD tries to avoid scheduling different replicas to the same rack in the same zone, and so on.
In summary, PD maximizes the disaster recovery capability of the cluster according to the current topology. Therefore, if you want to achieve a certain level of disaster recovery, deploy machines across more sites than the number of `max-replicas`, according to the topology. TiDB also provides the mandatory configuration item `isolation-level` so that you can control the topological isolation level of data according to different scenarios.