Skip to content

Commit

Permalink
Optimize copy_cells, part 2
Browse files Browse the repository at this point in the history
This patch optimizes the `copy_cells` path for multi-fragment reads. The
following benchmarks are for the multi-fragment read scenario discussed
offline:

```
// Current
Read time: 4.75082 secs
  * Time to copy result attribute values: 3.54192 secs
    > Time to read attribute tiles: 0.311707 secs
    > Time to unfilter attribute tiles: 0.370434 secs
    > Time to copy fixed-sized attribute values: 0.898421 secs
    > Time to copy var-sized attribute values: 0.954925 secs
```

```
// With this patch
Read time: 3.04627 secs
  * Time to copy result attribute values: 1.83972 secs
    > Time to read attribute tiles: 0.274928 secs
    > Time to unfilter attribute tiles: 0.38196 secs
    > Time to copy fixed-sized attribute values: 0.517415 secs
    > Time to copy var-sized attribute values: 0.461847 secs
```

For context, here are the benchmark results for the single-fragment read. The
stats are similar with and without this patch:
```
Read time: 1.86883 secs
  * Time to copy result attribute values: 1.19411 secs
    > Time to read attribute tiles: 0.304055 secs
    > Time to unfilter attribute tiles: 0.351332 secs
    > Time to copy fixed-sized attribute values: 0.289661 secs
    > Time to copy var-sized attribute values: 0.142405 secs
```

This patch does three things:
1. Converts the `offset_offsets_per_cs` and `var_offsets_per_cs` in the var-sized
   path from a 2D array (vector<vector<uint64_t>>) to a 1D array (vector<uint64_t).
   The big win is construction and destruction time for the nested vector
   elements.
2. Partitions cell copying for both fixed and var-sized paths. The motivation
   is to reduce contention on the TBB threads and minimize time spent context
   switching between the individual cell slab copies.
3. Added a context cache for the fixed size path, similar to the existing
   var-sized context cache.
  • Loading branch information
Joe Maley committed Jun 22, 2020
1 parent fd46117 commit 1395b81
Show file tree
Hide file tree
Showing 5 changed files with 467 additions and 167 deletions.
10 changes: 10 additions & 0 deletions tiledb/sm/global_state/global_state.cc
Original file line number Diff line number Diff line change
Expand Up @@ -50,6 +50,8 @@ namespace tiledb {
namespace sm {
namespace global_state {

extern int tbb_nthreads_;

GlobalState& GlobalState::GetGlobalState() {
// This is thread-safe in C++11.
static GlobalState globalState;
Expand Down Expand Up @@ -120,6 +122,14 @@ std::set<StorageManager*> GlobalState::storage_managers() {
const std::string& GlobalState::cert_file() {
return cert_file_;
}

int GlobalState::tbb_threads() {
if (!initialized_)
return 0;

return tbb_nthreads_;
}

} // namespace global_state
} // namespace sm
} // namespace tiledb
6 changes: 6 additions & 0 deletions tiledb/sm/global_state/global_state.h
Original file line number Diff line number Diff line change
Expand Up @@ -92,6 +92,12 @@ class GlobalState {
*/
const std::string& cert_file();

/**
* Gets the number of threads that TBB was initialized with.
* @return the number of configured TBB threads.
*/
int tbb_threads();

private:
/** The TileDB configuration parameters. */
Config config_;
Expand Down
3 changes: 2 additions & 1 deletion tiledb/sm/global_state/tbb_state.cc
Original file line number Diff line number Diff line change
Expand Up @@ -50,7 +50,7 @@ namespace global_state {
static std::unique_ptr<tbb::task_scheduler_init> tbb_scheduler_;

/** The number of TBB threads the scheduler was configured with **/
static int tbb_nthreads_;
int tbb_nthreads_;

Status init_tbb(const Config* config) {
int nthreads;
Expand Down Expand Up @@ -95,6 +95,7 @@ Status init_tbb(const Config* config) {
return Status::Error(msg.str());
}
}

return Status::Ok();
}

Expand Down
Loading

0 comments on commit 1395b81

Please sign in to comment.