opt: add rule to normalize Like to Range #53536

ipugh · 2020-08-27T04:10:14Z

It is easier to calculate stats for a Range expression than for
Like expressions. This PR adds a rule that converts a Like operator
with tight constraints to a Range operator.

Fixes #52153

Release note: None

Release justification: This change is small and is unlikely to break
anything.
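
As an illustration of the kind of rewrite described above, here is a minimal sketch, assuming the simplest tightly constrained case: a `LIKE` pattern made of a literal prefix followed by a single trailing `%`. Under bytewise string ordering, such a pattern matches exactly the strings in a half-open range. The helper `likePrefixRange` is hypothetical and is not taken from the CockroachDB code base or from this PR; it only shows the equivalence the rule relies on.

```go
package main

import (
	"fmt"
	"strings"
)

// likePrefixRange is a hypothetical helper (not CockroachDB code).
// For a pattern of the form "<literal>%" it returns the half-open range
// [start, end) that matches the same strings under bytewise ordering.
// ok is false when the pattern does not have that simple shape.
func likePrefixRange(pattern string) (start, end string, ok bool) {
	if !strings.HasSuffix(pattern, "%") {
		return "", "", false
	}
	prefix := strings.TrimSuffix(pattern, "%")
	// Reject an empty prefix and any remaining wildcard or escape character.
	if prefix == "" || strings.ContainsAny(prefix, "%_\\") {
		return "", "", false
	}
	// The exclusive upper bound is the prefix truncated after its last
	// non-0xff byte, with that byte incremented.
	b := []byte(prefix)
	for i := len(b) - 1; i >= 0; i-- {
		if b[i] != 0xff {
			b[i]++
			return prefix, string(b[:i+1]), true
		}
	}
	// Every byte is 0xff, so there is no finite upper bound.
	return "", "", false
}

func main() {
	for _, p := range []string{"abc%", "a_c%", "abc"} {
		start, end, ok := likePrefixRange(p)
		if !ok {
			fmt.Printf("LIKE %q: no simple range equivalent\n", p)
			continue
		}
		fmt.Printf("LIKE %q: x >= %q AND x < %q\n", p, start, end)
	}
}
```

For `abc%` the sketch yields the range from `abc` (inclusive) to `abd` (exclusive); patterns with other wildcards are left alone.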

cockroach-teamcity · 2020-08-27T04:10:17Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

Isaac Pugh seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.

blathers-crl · 2020-08-27T04:10:18Z

Thank you for contributing to CockroachDB. Please ensure you have followed the guidelines for creating a PR.

My owl senses detect your PR is good for review. Please keep an eye out for any test failures in CI.

🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is otan.

rytaft

Thanks for your contribution! This code is nice and clean and well-documented.

I'm not sure this change is fixing any problem right now, though. The stats that are visible haven't changed (I added comments below in the test files where the stats are already shown). You could see what the stats look like in the other files by adding the format=show-stats directive next to opt, or add some more tests in the memo/testdata/stats directory. My guess is that you won't see any change in the calculated stats for the other cases either.

This is because, if you look at the test files that changed, all of the calculated constraints stayed the same. Since the statisticsBuilder uses the constraints to estimate stats, it doesn't seem like this change actually makes things easier for the statisticsBuilder. The case this rule targets, where there are already tight constraints, is the easiest case for the statisticsBuilder. @RaduBerinde did you have something else in mind when you created the issue?

Reviewed 10 of 10 files at r1.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @ipugh)

pkg/sql/opt/memo/testdata/stats/select, line 320 at r1 (raw file):

select
 ├── columns: d_id:1(int!null) d_w_id:2(int!null) d_name:3(string!null)
 ├── stats: [rows=0.91, distinct(1)=0.91, null(1)=0, distinct(3)=0.91, null(3)=0, distinct(1,3)=0.91, null(1,3)=0]

stats didn't change here

pkg/sql/opt/memo/testdata/stats_quality/tpch/q20, line 109 at r1 (raw file):

           │         │         ├── lookup columns are key
           │         │         ├── immutable
           │         │         ├── stats: [rows=124.917659, distinct(14)=124.917659, null(14)=0, distinct(15)=124.917659, null(15)=0]

stats didn't change here

RaduBerinde · 2020-08-27T19:47:50Z

I didn't know we create constraints in this case. Yeah, it doesn't sound like this rule helps, sorry about that.

ipugh requested a review from a team as a code owner August 27, 2020 04:10

blathers-crl bot added the O-community Originated from the community label Aug 27, 2020

rytaft reviewed Aug 27, 2020

ipugh closed this Sep 5, 2020
