
compact_level0_phase1: add back spawn_blocking support #4492

Closed
problame opened this issue Jun 13, 2023 · 0 comments · Fixed by #4537
problame commented Jun 13, 2023

This is a follow-up to #4441.

See #4441 and, more specifically,
#4441 (comment)

Idea for how to bring it back: move `deltas_to_compact` into the `spawn_blocking` closure and return it back to the caller afterwards?
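A minimal sketch of that idea, assuming `deltas_to_compact` is a `Vec` of some handle type (`DeltaLayerHandle` below is a hypothetical stand-in, not the real type):

```rust
use tokio::task::spawn_blocking;

// Hypothetical stand-in for whatever deltas_to_compact actually holds.
struct DeltaLayerHandle;

async fn phase1(deltas_to_compact: Vec<DeltaLayerHandle>) -> Vec<DeltaLayerHandle> {
    spawn_blocking(move || {
        // ... blocking phase-1 work that needs `deltas_to_compact` ...
        // Return the Vec so the async caller regains ownership afterwards.
        deltas_to_compact
    })
    .await
    .expect("spawn_blocking task panicked")
}
```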

@problame problame self-assigned this Jun 16, 2023
problame added a commit that referenced this issue Jun 16, 2023
The data will help decide whether it's ok
to keep holding Timeline::layers in shared mode until
after we've calculated the holes.

The other timings are there to show the general breakdown
of time spent in that function.

Context: #4492
problame added a commit that referenced this issue Jun 19, 2023
The data will help decide whether it's ok
to keep holding Timeline::layers in shared mode until
after we've calculated the holes.

The other timings are there to show the general breakdown
of time spent in that function.

Context: #4492
problame added a commit that referenced this issue Jun 26, 2023
The stats for `compact_level0_phase1` that I added in #4527 show the
following breakdown (24h of data from prod, only looking at compactions
with > 1 L1 produced):

* 10%ish of wall-clock time spent between the two read locks
* I learned that the `DeltaLayer::iter()` and `DeltaLayer::key_iter()`
calls actually do IO, even before we call `.next()`. I suspect that is
why they take so much time between the locks.
* 80+% of wall-clock time spent writing layer files
* Lock acquisition time is irrelevant (low double-digit microseconds at
most)
* Generating the holes holds the read lock for a relatively long time,
proportional to the number of keys / the IO required to iterate over
them (max: 110 ms in prod; staging (nightly benchmarks): multiple
seconds).

Below are screenshots from my ad-hoc spreadsheet plus some graphs.

<img width="1182" alt="image"
src="https://github.com/neondatabase/neon/assets/956573/81398b3f-6fa1-40dd-9887-46a4715d9194">

<img width="901" alt="image"
src="https://github.com/neondatabase/neon/assets/956573/e4ac0393-f2c1-4187-a5e5-39a8b0c394c9">

<img width="210" alt="image"
src="https://github.com/neondatabase/neon/assets/956573/7977ade7-6aa5-4773-a0a2-f9729aecee0d">


## Changes In This PR

This PR makes the following changes:

* rearrange the `compact_level0_phase1` code such that we build the
`all_keys_iter` and `all_values_iter` later than before
* only grab the `Timeline::layers` lock once, and hold it until we've
computed the holes
* run `compact_level0_phase1` in `spawn_blocking`, pre-grabbing the
`Timeline::layers` lock in the async code and passing it in as an
`OwnedRwLockReadGuard` (see the sketch after this list).
* the code inside `spawn_blocking` drops this guard after computing the
holes
* the `OwnedRwLockReadGuard` requires `Timeline::layers` to be
wrapped in an `Arc`. I think that's OK; the locking of the `RwLock` is
more heavyweight than an additional pointer indirection.
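A minimal sketch of that pattern, using tokio's `read_owned()`; `LayerMap` and `compute_holes` are hypothetical stand-ins for the real `Timeline::layers` contents and the hole computation:

```rust
use std::sync::Arc;
use tokio::sync::{OwnedRwLockReadGuard, RwLock};

// Hypothetical stand-ins for Timeline::layers and the hole computation.
struct LayerMap;
fn compute_holes(_layers: &LayerMap) -> Vec<std::ops::Range<u64>> {
    Vec::new()
}

async fn compact_level0_phase1_outer(layers: Arc<RwLock<LayerMap>>) {
    // Pre-grab the read lock in async code. `read_owned()` yields an
    // OwnedRwLockReadGuard, which is 'static and can move into spawn_blocking.
    let guard: OwnedRwLockReadGuard<LayerMap> = layers.read_owned().await;

    tokio::task::spawn_blocking(move || {
        // Compute the holes while still holding the shared lock.
        let _holes = compute_holes(&guard);
        // Drop the guard as soon as the holes are computed so other tasks
        // can take Timeline::layers while we do the long layer-file writes.
        drop(guard);
        // ... write out the new layer files (the 80+% of wall-clock time) ...
    })
    .await
    .expect("spawn_blocking task panicked");
}
```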
 
## Alternatives Considered

The naive alternative is to throw the entire function into
`spawn_blocking` and use `blocking_read` for `Timeline::layers` access.

What I've done in this PR is better because, with that naive alternative
(sketched after this list):
1. while we `blocking_read()`, we waste one slot in the `spawn_blocking`
pool, and
2. there's a deadlock risk because the `spawn_blocking` pool is a finite
resource.
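For contrast, a sketch of what that naive alternative would look like (again with a hypothetical `LayerMap` stand-in); the `blocking_read()` call is what wastes a pool slot while waiting:

```rust
use std::sync::Arc;
use tokio::sync::RwLock;

struct LayerMap; // hypothetical stand-in for Timeline::layers

async fn naive_compact_level0_phase1(layers: Arc<RwLock<LayerMap>>) {
    tokio::task::spawn_blocking(move || {
        // Occupies a spawn_blocking slot while merely waiting for the lock.
        // If the finite pool fills up with tasks blocked like this, the task
        // that would release the lock may never get to run: deadlock risk.
        let _guard = layers.blocking_read();
        // ... the entire phase 1 would run here, still holding the lock ...
    })
    .await
    .expect("spawn_blocking task panicked");
}
```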


![image](https://github.com/neondatabase/neon/assets/956573/46c419f1-6695-467e-b315-9d1fc0949058)

## Metadata

Fixes #4492