-
Notifications
You must be signed in to change notification settings - Fork 613
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
revert #7957 to validate the performance regression of q7 and try to improve if true #8510
Comments
Doubled the throughput after reverting the PR https://github.com/risingwavelabs/risingwave/tree/lz/revert_7957 |
check the performance of the queries that involve join: Let's wait and see more results... |
Probably it still depends on the real workload to determine whether it improves or makes the performance worse? Or |
Interesting 🤔 cc. @hzxa21 Any ideas about the reason? |
TL;DR, I think in the perf benchmark, the locality between the two stream is worse than expected so refilling cache on write in the join executor cause eviction on the old entries that are still needed, leading to higher operator cache miss. cc @KeXiangWang This is different from what we have seen before. Any thoughts? Findings after looking at grafana (up: 02/23 no |
I agree with Patrick. Nexmark Q7 is join
Like @lmatz mentioned, we may probably make decision depends on task and data patterns😕. |
This issue has been open for 60 days with no activity. Could you please update the status? Feel free to continue discussion or close as not planned. |
Later, we found that the 8 kafka-partition nexmark is generated by 16 threads, which can mess up the locality. I will try to re-enable it later and make it a configurable choice. |
This issue has been open for 60 days with no activity. If you think it is still relevant today, and needs to be done in the near future, you can comment to update the status, or just manually remove the You can also confidently close this issue as not planned to keep our backlog clean. |
#7957
Meanwhile q7-rewrite which uses
group top-n
does not change performance much. Didn't see other queries also affected.The text was updated successfully, but these errors were encountered: