c2c: write a protected timestamp on the destination cluster #92093

adityamaru · 2022-11-17T22:41:23Z

Data is streamed from the source to the destination tenant over several partitions. A frontier timestamp, updated during ingestion, tracks the highest timestamp up to which all partitions on the destination cluster have completed streaming. As time progresses and more data is ingested, versions that fall below the destination tenant's GCThreshold become eligible for GC. Timestamps below the tenant's GCThreshold are ineligible for cutover, since those versions are no longer available to revert to.

To ensure that the user always has a generous window of timestamps to cut over to, we must write a protected timestamp over the ingesting tenant's keyspan that protects at FrontierTimestamp - C2CTTL, where C2CTTL can default to 25 hours to begin with. This can later be made a configurable value that is specified when the replication stream is being set up. As the frontier moves forward, this protected timestamp must be pulled up as well, so that data outside the C2CTTL window becomes eligible for GC.

Jira issue: CRDB-21570

Epic: CRDB-18749


blathers-crl · 2022-11-17T22:41:26Z

cc @cockroachdb/disaster-recovery

During C2C replication, as the destination tenant is ingesting KVs, we must protect a certain window of MVCC revisions from garbage collection so that the user can cut over to any of the timestamps that lie within this window.

To this effect we introduce a `ReplicationTTLSeconds` field to the replication job payload that governs the size of this window relative to the replication job's high watermark (frontier timestamp). On the first resumption of the replication job we write a protected timestamp (PTS) record on the destination tenant's keyspace protecting all revisions above `now()`. As the replication job updates its high watermark, the PTS record is pulled up to protect above `highWatermark - ReplicationTTLSeconds`. This active management of the PTS ensures that users can always cut over to any time in (highWatermark-ReplicationTTLSeconds, highWatermark], and that older revisions are gradually made eligible for GC as the frontier progresses.

The PTS is released if the replication job fails or is cancelled.

Fixes: cockroachdb#92093

Release note: None

92336: streamingest: write and manage PTS on the destination tenant r=stevendanna a=adityamaru

Fixes: #92093

Release note: None

Co-authored-by: adityamaru <[email protected]>

adityamaru added the C-enhancement and A-disaster-recovery labels Nov 17, 2022

blathers-crl bot added the T-disaster-recovery label Nov 17, 2022

adityamaru self-assigned this Nov 21, 2022

adityamaru mentioned this issue Nov 22, 2022

streamingest: write and manage PTS on the destination tenant #92336

Merged

craig bot closed this as completed in 721cc76 Nov 23, 2022

github-project-automation bot added this to Disaster Recovery Backlog Aug 28, 2024

github-project-automation bot moved this to Done in Disaster Recovery Backlog Aug 28, 2024
