planner: increase default concurrency factor of cost computing #11752

eurekaka · 2019-08-15T11:01:33Z

What problem does this PR solve?

#10581 incurs 2.19% performance regression of oltp_read_write workload, this PR resolves this regression.

What is changed and how it works?

The performance regression is caused by the fact that: oltp_read_write contains a class of aggregation quires without Group By, the new cost model would prefer HashAgg which would run in parallel, while the old cost model prefers single-threaded StreamAgg. According to the actual benchmark, single-threaded StreamAgg is a little bit faster than multi-threaded HashAgg for this kind of query.

This PR adjust default concurrency factor to make planner prefer StreamAgg for this kind of query.

Check List

Tests

Unit test

Code changes

N/A

Side effects

N/A

Related changes

N/A

codecov · 2019-08-15T11:06:35Z

Codecov Report

Merging #11752 into master will decrease coverage by 0.2283%.
The diff coverage is n/a.

@@               Coverage Diff               @@
##            master     #11752        +/-   ##
===============================================
- Coverage   81.776%   81.5477%   -0.2284%     
===============================================
  Files          434        433         -1     
  Lines        94107      93506       -601     
===============================================
- Hits         76957      76252       -705     
- Misses       11684      11838       +154     
+ Partials      5466       5416        -50

eurekaka · 2019-08-15T11:14:26Z

/run-all-tests

alivxxx · 2019-08-16T03:00:09Z

cmd/explaintest/r/access_path_selection.result

@@ -23,24 +23,3 @@ id	count	task	operator info
 IndexReader_11	1104.45	root	index:Selection_10
 └─Selection_10	1104.45	cop	lt(test.access_path_selection.b, 3)
  └─IndexScan_9	3323.33	cop	table:access_path_selection, index:a, b, range:[-inf,3), keep order:false, stats:pseudo
-CREATE TABLE `outdated_statistics` (


Why remove it？

Because this case is intended to test that planner would prefer idx_ab over idx_a, but it is hard to say whether double read using idx_ab is better than table scan or not. With the new concurrency factor, planner would prefer table scan, because there are only 6 rows in total.

lzmhhh123

LGTM.

zz-jason

LGTM

sre-bot · 2019-08-16T09:33:19Z

/run-all-tests

planner: increase default concurrency factor of cost computing

dd36120

eurekaka added type/bugfix This PR fixes a bug. sig/planner SIG: Planner labels Aug 15, 2019

Merge branch 'master' into cost/concurrency

20b280f

eurekaka added the status/all tests passed label Aug 15, 2019

eurekaka marked this pull request as ready for review August 16, 2019 02:49

eurekaka requested review from zz-jason, alivxxx, lzmhhh123 and winoros August 16, 2019 02:50

alivxxx reviewed Aug 16, 2019

View reviewed changes

lzmhhh123 reviewed Aug 16, 2019

View reviewed changes

lzmhhh123 added the status/LGT1 Indicates that a PR has LGTM 1. label Aug 16, 2019

zz-jason reviewed Aug 16, 2019

View reviewed changes

alivxxx approved these changes Aug 16, 2019

View reviewed changes

alivxxx added status/LGT2 Indicates that a PR has LGTM 2. status/can-merge Indicates a PR has been approved by a committer. and removed status/LGT1 Indicates that a PR has LGTM 1. labels Aug 16, 2019

Merge branch 'master' into cost/concurrency

4951bad

sre-bot merged commit 2009741 into pingcap:master Aug 16, 2019

eurekaka deleted the cost/concurrency branch August 16, 2019 10:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

planner: increase default concurrency factor of cost computing #11752

planner: increase default concurrency factor of cost computing #11752

eurekaka commented Aug 15, 2019 •

edited

Loading

codecov bot commented Aug 15, 2019 •

edited

Loading

eurekaka commented Aug 15, 2019

alivxxx Aug 16, 2019

eurekaka Aug 16, 2019

lzmhhh123 left a comment

zz-jason left a comment

sre-bot commented Aug 16, 2019

planner: increase default concurrency factor of cost computing #11752

planner: increase default concurrency factor of cost computing #11752

Conversation

eurekaka commented Aug 15, 2019 • edited Loading

What problem does this PR solve?

What is changed and how it works?

Check List

codecov bot commented Aug 15, 2019 • edited Loading

Codecov Report

eurekaka commented Aug 15, 2019

alivxxx Aug 16, 2019

Choose a reason for hiding this comment

eurekaka Aug 16, 2019

Choose a reason for hiding this comment

lzmhhh123 left a comment

Choose a reason for hiding this comment

zz-jason left a comment

Choose a reason for hiding this comment

sre-bot commented Aug 16, 2019

eurekaka commented Aug 15, 2019 •

edited

Loading

codecov bot commented Aug 15, 2019 •

edited

Loading