KAFKA-9235; Ensure transaction coordinator resigns after replica deletion #7963

hachikuji · 2020-01-15T01:19:50Z

During a reassignment, it can happen that the current leader of a partition is demoted and removed from the replica set at the same time. In this case, we rely on the StopReplica request in order to stop replica fetchers and to clear the group coordinator cache. This patch adds similar logic to ensure that the transaction coordinator state cache also gets cleared.

Committer Checklist (excluded from commit message)

Verify design and implementation
Verify test coverage and CI build status
Verify documentation (including upgrade notes)

…eletion

hachikuji · 2020-01-15T01:27:39Z

core/src/main/scala/kafka/coordinator/transaction/TransactionStateManager.scala

@@ -90,9 +90,6 @@ class TransactionStateManager(brokerId: Int,
  /** partitions of transaction topic that are being loaded, state lock should be called BEFORE accessing this set */
  private[transaction] val loadingPartitions: mutable.Set[TransactionPartitionAndLeaderEpoch] = mutable.Set()

-  /** partitions of transaction topic that are being removed, state lock should be called BEFORE accessing this set */
-  private[transaction] val leavingPartitions: mutable.Set[TransactionPartitionAndLeaderEpoch] = mutable.Set()


Note I got rid of this because there didn't seem to be a strong reason to do transaction state unloading asynchronously (all we do is remove a key from a map).

hachikuji · 2020-01-15T01:37:18Z

core/src/test/scala/unit/kafka/coordinator/transaction/TransactionStateManagerTest.scala

    assertEquals(Right(Some(CoordinatorEpochAndTxnMetadata(coordinatorEpoch, txnMetadata1))),
      transactionManager.getTransactionState(transactionalId1))
-    assertEquals(Right(CoordinatorEpochAndTxnMetadata(coordinatorEpoch, txnMetadata1)),
-      transactionManager.putTransactionStateIfNotExists(transactionalId1, txnMetadata2))


Minor test bug fix here.

rajinisivaram

@hachikuji Thanks for the PR, LGTM

hachikuji · 2020-01-16T17:15:16Z

retest this please

…eletion (#7963) During a reassignment, it can happen that the current leader of a partition is demoted and removed from the replica set at the same time. In this case, we rely on the StopReplica request in order to stop replica fetchers and to clear the group coordinator cache. This patch adds similar logic to ensure that the transaction coordinator state cache also gets cleared. Reviewers: Rajini Sivaram <[email protected]>

Conflicts or compilation errors due to the fact that we temporarily reverted the commit that removes Scala 2.11 support: * AclCommand.scala: take upstream changes. * AclCommandTest.scala: take upstream changes. * TransactionCoordinatorTest.scala: don't use SAMs, but adjust mock call to putTransactionStateIfNotExists given new signature. * TransactionStateManagerTest: use Runnable instead of SAMs. * PartitionLockTest: use Runnable instead of SAMs. * docs/upgrade.html: take upstream changes excluding line that states that Scala 2.11 support has been removed. * apache-github/trunk: (28 commits) KAFKA-9457; Fix flaky test org.apache.kafka.common.network.SelectorTest.testGracefulClose (apache#7989) MINOR: Update AclCommand help message to match implementation (apache#7990) MINOR: Update introduction page in Kafka documentation MINOR: Use Math.min for StreamsPartitionAssignor#updateMinReceivedVersion method (apache#7954) KAFKA-9338; Fetch session should cache request leader epoch (apache#7970) KAFKA-9329; KafkaController::replicasAreValid should return error message (apache#7865) KAFKA-9449; Adds support for closing the producer's BufferPool. (apache#7967) MINOR: Handle expandIsr in PartitionLockTest and ensure read threads not blocked on write (apache#7973) MINOR: Fix typo in connect integration test class name (apache#7976) KAFKA-9218: MirrorMaker 2 can fail to create topics (apache#7745) KAFKA-8847; Deprecate and remove usage of supporting classes in kafka.security.auth (apache#7966) MINOR: Suppress DescribeConfigs Denied log during CreateTopics (apache#7971) [MINOR]: Fix typo in Fetcher comment (apache#7934) MINOR: Remove unnecessary call to `super` in `MetricConfig` constructor (apache#7975) MINOR: fix flaky StreamsUpgradeTestIntegrationTest (apache#7974) KAFKA-9431: Expose API in KafkaStreams to fetch all local offset lags (apache#7961) KAFKA-9235; Ensure transaction coordinator is stopped after replica deletion (apache#7963) KAFKA-9410; Make groupId Optional in KafkaConsumer (apache#7943) MINOR: Removed accidental double negation in error message. (apache#7834) KAFKA-6144: IQ option to query standbys (apache#7962) ...

…eletion (apache#7963) During a reassignment, it can happen that the current leader of a partition is demoted and removed from the replica set at the same time. In this case, we rely on the StopReplica request in order to stop replica fetchers and to clear the group coordinator cache. This patch adds similar logic to ensure that the transaction coordinator state cache also gets cleared. Reviewers: Rajini Sivaram <[email protected]>

KAFKA-9235; Ensure transaction coordinator is stopped after replica d…

66e378c

…eletion

hachikuji changed the title ~~KAFKA-9235; Ensure transaction coordinator is stopped after replica deletion~~ KAFKA-9235; Ensure transaction coordinator resigns after replica deletion Jan 15, 2020

hachikuji commented Jan 15, 2020

View reviewed changes

rajinisivaram approved these changes Jan 15, 2020

View reviewed changes

hachikuji merged commit fbe2e60 into apache:trunk Jan 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KAFKA-9235; Ensure transaction coordinator resigns after replica deletion #7963

KAFKA-9235; Ensure transaction coordinator resigns after replica deletion #7963

hachikuji commented Jan 15, 2020

hachikuji Jan 15, 2020

hachikuji Jan 15, 2020

rajinisivaram left a comment

hachikuji commented Jan 16, 2020

KAFKA-9235; Ensure transaction coordinator resigns after replica deletion #7963

KAFKA-9235; Ensure transaction coordinator resigns after replica deletion #7963

Conversation

hachikuji commented Jan 15, 2020

Committer Checklist (excluded from commit message)

hachikuji Jan 15, 2020

Choose a reason for hiding this comment

hachikuji Jan 15, 2020

Choose a reason for hiding this comment

rajinisivaram left a comment

Choose a reason for hiding this comment

hachikuji commented Jan 16, 2020