Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TiCDC admin job blocked, and etcd worker may be stuck #4211

Closed
3AceShowHand opened this issue Jan 4, 2022 · 7 comments
Closed

TiCDC admin job blocked, and etcd worker may be stuck #4211

3AceShowHand opened this issue Jan 4, 2022 · 7 comments
Assignees
Labels
area/ticdc Issues or PRs related to TiCDC. severity/moderate type/bug The issue is confirmed as a bug. type/enhancement The issue or PR belongs to an enhancement.

Comments

@3AceShowHand
Copy link
Contributor

3AceShowHand commented Jan 4, 2022

What did you do?

tiup cluster deploy lingjin-kafka-cluster-test v5.5.0-nightly-20220104 kafka-cluster-testing.yaml -p -u root

tiup ctl:v5.5.0-nightly-20220104 cdc changefeed create --pd=http://172.16.7.51:2379 --sink-uri="kafka://172.16.6.111:9092/normal-incremental-test?protocol=open-protocol"

tiup ctl:v5.5.0-nightly-20220104 cdc changefeed pause --pd=http://172.16.7.51:2379 --changefeed-id="bbadf644-73eb-4d07-a8f3-1f245f82b9a9"

What did you expect to see?

changefeed's state become stopped

What did you see instead?

it is still normal.

blocked for a few minutes, and then become stopped.

Versions of the cluster

Upstream TiDB cluster version (execute SELECT tidb_version(); in a MySQL client):

(paste TiDB cluster version here)

Git Hash:

fc70dbde8
@3AceShowHand 3AceShowHand added type/bug The issue is confirmed as a bug. area/ticdc Issues or PRs related to TiCDC. labels Jan 4, 2022
@3AceShowHand
Copy link
Contributor Author

cdc.log

@3AceShowHand 3AceShowHand added the type/enhancement The issue or PR belongs to an enhancement. label Jan 4, 2022
@3AceShowHand
Copy link
Contributor Author

it looks that the admin job was handled, but it was stucked when try to update the etcd.

@asddongmen
Copy link
Contributor

asddongmen commented Jan 4, 2022

[2022/01/04 15:18:56.439 +08:00] [WARN] [client.go:226] ["etcd client outCh blocking too long, the etcdWorker may be stuck"] [duration=3m55.999651547s]
[2022/01/04 15:18:56.992 +08:00] [WARN] [etcd_worker.go:242] ["EtcdWorker reactor tick took too long"] [duration=4m5.165662023s]

It seems that the owner is stucked.
The EtcdWorker reactor here is owner.

@3AceShowHand 3AceShowHand self-assigned this Jan 4, 2022
@3AceShowHand
Copy link
Contributor Author

kafka min.insync.replicas is set to 2, and replication-factor is 1

@3AceShowHand
Copy link
Contributor Author

server.log

@3AceShowHand
Copy link
Contributor Author

cdc-126.log

@Rustin170506
Copy link
Member

Duplicate of #4241

@Rustin170506 Rustin170506 marked this as a duplicate of #4241 Jan 10, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/ticdc Issues or PRs related to TiCDC. severity/moderate type/bug The issue is confirmed as a bug. type/enhancement The issue or PR belongs to an enhancement.
Projects
None yet
Development

No branches or pull requests

4 participants