Dense reader: read next batch of tiles as others get processed. #3965
Conversation
This pull request has been linked to Shortcut Story #25396: Read tiles as others get unfiltered.
```cpp
clear_tiles(name, result_tiles);
compute_task = storage_manager_->compute_tp()->execute(
    [&,
     filtered_data = std::move(filtered_data),
```
filtered_data doesn't appear to actually be used in here, maybe irrelevant.
It is used indirectly. The memory that the filtered_data object contains needs to be kept alive until unfiltering is completed, which happens at the end of this task.
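For illustration, a minimal, hypothetical sketch of that lifetime point (not the actual TileDB code; `std::async` stands in for the compute thread pool, and `FilteredData` is a placeholder type):

```cpp
#include <future>
#include <utility>
#include <vector>

// Hypothetical stand-in for the filtered, not-yet-unfiltered tile bytes.
using FilteredData = std::vector<char>;

std::future<void> launch_unfilter(FilteredData filtered_data) {
  // Moving the buffer into the capture ties its lifetime to the task:
  // even if the lambda body never names filtered_data, the allocation
  // stays valid until the task finishes and the closure is destroyed.
  return std::async(
      std::launch::async, [filtered_data = std::move(filtered_data)]() {
        // ... unfilter tiles whose payloads point into filtered_data ...
      });
}
```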
Editorializing, there is an important general principle here that can be recognized independent of this (or any) particular application, namely overlapping I/O with computation. As long as the same processor (or a single thread) isn't doing both I/O and compute, it is generally a win and is especially useful for hiding latency. The general pattern is:
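A minimal sketch, with `read_next_batch` and `process` as hypothetical helpers and `std::async` standing in for a thread-pool task:

```cpp
#include <future>
#include <vector>

using Batch = std::vector<char>;

Batch read_next_batch();       // hypothetical I/O step
void process(const Batch& b);  // hypothetical compute step

void overlap_once(const Batch& current) {
  // Start the next read asynchronously...
  auto io = std::async(std::launch::async, read_next_batch);
  // ...compute on data already in memory while the read is in flight...
  process(current);
  // ...and only join once the compute is done, hiding the I/O latency.
  Batch next = io.get();
  process(next);
}
```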
If multiple I/O calls are required, one can do:
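Again as a hedged sketch with the same kind of hypothetical helpers: prime the pipeline with the first read, then alternate between joining batch i and issuing the read for batch i+1:

```cpp
#include <cstddef>
#include <future>
#include <vector>

using Batch = std::vector<char>;

Batch read_batch(size_t i);    // hypothetical I/O step
void process(const Batch& b);  // hypothetical compute step

void pipeline(size_t num_batches) {
  if (num_batches == 0)
    return;
  // Prime the pipeline with the first read.
  auto io = std::async(std::launch::async, read_batch, size_t{0});
  for (size_t i = 0; i < num_batches; i++) {
    Batch batch = io.get();  // wait for batch i to land
    if (i + 1 < num_batches) {
      // Issue the read for batch i+1 before computing on batch i, so the
      // next read overlaps with this batch's processing.
      io = std::async(std::launch::async, read_batch, i + 1);
    }
    process(batch);
  }
}
```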
The advantage of following this kind of pattern is that the concurrency between I/O and computation is localized and fairly easy to follow -- structured, in other words. In the PR, it seems like the compute is being made asynchronous and also being passed around, so it isn't clear where it is completing or where it is getting launched again. So we should try to figure out a way of making it more structured. (As an aside, the I/O is usually the thing that is made asynchronous, because most operating systems have I/O subsystems that support asynchronous operations. In our situation, the I/O is much more complicated than just doing an OS call.) (As an aside to the aside -- we should audit the code to find other overlap opportunities like this, but also develop a formula to realize the overlap in a more structured way.)
This change moves tile processing to another task so that the read can continue until another read operation is encountered. The reader then does the read, after which it waits for the running process operation to complete before kicking off the new one. This makes it so that most reads after the first one come for free. For large queries, rough benchmarking shows that we reduce query time by 30%.

TYPE: IMPROVEMENT
DESC: Dense reader: read next batch of tiles as others get processed.
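Roughly, the loop the description implies -- a hedged sketch in which only `compute_task` mirrors the member seen in the diff above; `more_batches`, `read_tiles`, and `unfilter_and_process` are placeholders, and `std::async` stands in for `compute_tp()->execute(...)`:

```cpp
#include <future>
#include <utility>
#include <vector>

using FilteredData = std::vector<char>;

bool more_batches();                       // placeholder loop condition
FilteredData read_tiles();                 // placeholder read step
void unfilter_and_process(FilteredData&);  // placeholder compute step

void read_loop() {
  std::future<void> compute_task;
  while (more_batches()) {
    // Do the next read while the previous batch is still being processed.
    FilteredData filtered_data = read_tiles();
    // Wait for the running process operation before kicking off a new one.
    if (compute_task.valid())
      compute_task.wait();
    compute_task = std::async(
        std::launch::async,
        [filtered_data = std::move(filtered_data)]() mutable {
          unfilter_and_process(filtered_data);
        });
  }
  if (compute_task.valid())
    compute_task.wait();  // drain the last in-flight batch
}
```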
force-pushed from f346b4b to 76ff3da
All of my immediate concerns were addressed. There are a few things (the pattern for creating and passing around compute_task) that need to be planned and executed more carefully, but they will also need to be part of a larger restructuring. I don't think we need to implement those things for this PR, as that restructuring will be a large task on its own.
```cpp
}

// Process all tiles in parallel.
auto status = parallel_for_2d(
```
Do we need to use `num_range_threads - 1` here to account for the extra compute thread?
The range threads are not actually the number of threads in the thread pool. They are only used when a read consists of a few large tiles; at that point, we will split the work for a tile across threads. Also, by the time the work of the parallel for in the compute task gets processed, the compute task should already mostly be in a waiting/yielding state.
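For illustration only, one hypothetical way to split a single tile's cells across `num_range_threads` -- the kind of partitioning described above, not the actual TileDB code:

```cpp
#include <algorithm>
#include <cstdint>
#include <utility>

// Thread t of num_range_threads handles the half-open cell range
// [start, end) of a tile with cell_num cells; chunk sizes differ by at
// most one cell because the remainder is spread over the first threads.
std::pair<uint64_t, uint64_t> cells_for_thread(
    uint64_t cell_num, uint64_t num_range_threads, uint64_t t) {
  const uint64_t chunk = cell_num / num_range_threads;
  const uint64_t rem = cell_num % num_range_threads;
  const uint64_t start = t * chunk + std::min(t, rem);
  const uint64_t end = start + chunk + (t < rem ? 1 : 0);
  return {start, end};
}
```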