grafana: supply the explanation on the metrics of Flow-Control in TiKV. (#15670) (#15784)
ti-chi-bot authored Dec 18, 2023
1 parent a133f0e commit ec9fc13
Showing 2 changed files with 15 additions and 0 deletions.
15 changes: 15 additions & 0 deletions grafana-tikv-dashboard.md
@@ -180,6 +180,21 @@ title: TiKV Monitoring Metrics Explained

![TiKV Dashboard - Storage metrics](/media/tikv-dashboard-storage.png)

### Flow Control

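- Scheduler flow: The real-time scheduler traffic on each TiKV instance
- Scheduler discard ratio: The ratio of scheduler requests rejected on each TiKV instance. A non-zero ratio indicates that flow control is in effect. When Compaction pending bytes exceeds its threshold, TiKV increases Scheduler discard ratio linearly, in proportion to the amount by which the threshold is exceeded. Rejected requests are retried automatically by the client
- Throttle duration: The time for which the scheduler blocks request execution after too many L0 files trigger flow control. If this metric has data, flow control is in effect
- Scheduler throttled CF: The CF that triggers RocksDB throttling because the flow control threshold is reached
- Flow controller actions: The reason RocksDB throttling is triggered when the flow control threshold is reached
- Flush/L0 flow: The flush traffic and L0 compaction traffic of the different RocksDB CFs on each TiKV instance
- Flow control factors: The factors related to triggering RocksDB throttling
- Compaction pending bytes: The size of the data in RocksDB that is waiting for compaction, in real time, on each TiKV instance
- Txn command throttled duration: The time for which transaction-related commands are blocked because of throttling. Under normal circumstances, this metric is 0
- Non-txn command throttled duration: The time for which non-transaction commands are blocked because of throttling. Under normal circumstances, this metric is 0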

![TiKV Dashboard - Flow Control metrics](/media/tikv-dashboard-flow-control.png)
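
The linear relationship between Compaction pending bytes and Scheduler discard ratio described above can be illustrated with a minimal sketch. This is not TiKV's actual implementation: the function name `discard_ratio` and the parameters `soft_limit` and `hard_limit` are hypothetical, and the sketch assumes the ratio rises from 0 at a lower threshold to 1 at an upper bound.

```python
def discard_ratio(pending_bytes: int, soft_limit: int, hard_limit: int) -> float:
    """Illustrative linear discard ratio (hypothetical, not TiKV source code).

    pending_bytes: bytes currently waiting for compaction.
    soft_limit:    assumed threshold at which requests start being rejected.
    hard_limit:    assumed bound at which every request is rejected.
    """
    if hard_limit <= soft_limit:
        raise ValueError("hard_limit must be greater than soft_limit")
    if pending_bytes <= soft_limit:
        # Below the threshold there is no flow control.
        return 0.0
    if pending_bytes >= hard_limit:
        # At or beyond the upper bound everything is rejected.
        return 1.0
    # Grow linearly with the amount by which the threshold is exceeded.
    return (pending_bytes - soft_limit) / (hard_limit - soft_limit)


if __name__ == "__main__":
    GIB = 1024 ** 3
    for pending in (100 * GIB, 400 * GIB, 1200 * GIB):
        ratio = discard_ratio(pending, soft_limit=200 * GIB, hard_limit=1000 * GIB)
        print(f"{pending // GIB} GiB pending, discard ratio {ratio:.2f}")
```

Under these assumptions, a ratio of 0 on the panel corresponds to pending bytes at or below the threshold, and any non-zero value means some requests are being rejected and retried by the client.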

### Scheduler

- Scheduler stage total: The ops of each command at each stage. Under normal circumstances, a large number of errors should not occur within a short time
Binary file added media/tikv-dashboard-flow-control.png
