Any Documentation on Sizing Suggestions #4569

alex-gadea · 2020-05-23T02:04:02Z

I am looking to gain an understanding of what are the biggest impacts on resource requirements for a YB cluster. Is there any documentation or guidance available that might provide rough guidance on what to take into consideration when calculating node, memory and CPU requirements in a Kubernetes cluster?

I know this is an almost impossible question to answer without real world data, so any type of guidance would be super helpful. Do the number of databases have any type of impact or can I pack as many as memory allows? What is the overhead per database? What is the expected # of concurrent connections per node? What is the memory requirement per connection? Do connections fork ala Postgres? How important is CPU to a YB cluster or is it really memory we have to push? Is it better to have lots of single VCPU nodes or fewer 8 VCPU core nodes (is this largely impacted by whether we go colocated or not due to inter-node traffic)? Is there a point where you expect to reach diminishing returns for each VCPU and its better to expand horizontally instead?

Our expected baseline is:

1000 customers each with their own db (from initial reading looks like we would want this data colocated)
10GB average database size - 20 tables per database, avg row size would be 4k
to keep it simple lets say all the databases are similar (no big whales creating imbalances)
at peak 250 read queries per second consuming an average of 500 rows of 2k each vs 10 write queries per second each pushing 4k (let's say no joins to keep it simple)
at peak 10 aggregation queries running per second (again no joins and doing simple grouping on one column and a sum against a 2nd column)
each database would have an average of 20 users
we would like to implement row level security at the database level
we would like to take advantage of CDC that would push out to Kafka

We have worked with Postgres for a long time and have a good, intuitive understanding of how to architect that environment to meet our needs and a rough idea of what the horsepower we have to bring to play would be. YB is completely new to us and so far we have not been able to find the type of guidelines that would help us understand what our starting point should be prior to moving toward benchmark and load testing so even the simplest guidance would help us at this stage.

ddorian · 2020-05-27T06:41:07Z

Hi @alex-gadea
I haven't forgotten, it was holiday over the weekend. I'll get back at you today with all answers.

ddorian · 2020-05-27T21:48:11Z

Is there any documentation or guidance available that might provide rough guidance on what to take into consideration when calculating node, memory and CPU requirements in a Kubernetes cluster?

Yes in https://docs.yugabyte.com/latest/deploy/checklist/. For kubernetes we also have best practices in https://docs.yugabyte.com/latest/deploy/kubernetes/best-practices/. In kubernetes it's also best to provide fixed hardware requirements. I'm also doing a pull request regarding general best practices in #4158 (will finish soon).

Do the number of databases have any type of impact or can I pack as many as memory allows? What is the overhead per database?

Master overhead should not be high memory, except during actually creating the database(s). Afterwards, they’re mostly a bunch of UUIDs in some maps.

What is the expected # of concurrent connections per node? What is the memory requirement per connection? Do connections fork ala Postgres?

Each new YSQL connection starts a new process just like PostgreSQL. Therefore the overhead will be roughly the same. We have a default limit of 300 max connections per yb-tserver https://docs.yugabyte.com/latest/reference/configuration/yb-tserver/#ysql-max-connections . New connections also add overheard from fetching metadata from the master.

How important is CPU to a YB cluster or is it really memory we have to push?

Usually CPU becomes a bottleneck before memory. See https://docs.yugabyte.com/latest/deploy/checklist/.

Is it better to have lots of single VCPU nodes or fewer 8 VCPU core nodes (is this largely impacted by whether we go colocated or not due to inter-node traffic)?

It's best to have at least the recommended node size mentioned https://docs.yugabyte.com/latest/deploy/checklist/. A bigger machine can support a bigger colocated database before it may need to be distributed. We're also working to pull out of big tables out of colocation and distribute them across the cluster.
We've tested with very big nodes https://blog.yugabyte.com/performance-benchmarks-tb-database-cluster-high-data-density-nodes/

Is there a point where you expect to reach diminishing returns for each VCPU and its better to expand horizontally instead?

Usually you also want to expand horizontally because very big machines may have more impact on the cluster when they go down.

1000 customers each with their own db (from initial reading looks like we would want this data colocated)

Yes on your data size.

at peak 250 read queries per second consuming an average of 500 rows of 2k each vs 10 write queries per second each pushing 4k (let's say no joins to keep it simple)

Yes. But we need to keep in mind the disk/network/cpu/ram requirements, hot data distribution (zipfian,random) etc.

at peak 10 aggregation queries running per second (again no joins and doing simple grouping on one column and a sum against a 2nd column)

This will depend on the number of rows that the query will group. We haven't optimized a lot aggregate queries but based on your mentioned data size it should work.

each database would have an average of 20 users
we would like to implement row level security at the database level

250/second read queries + 10/second aggregation queries will be ok since those are 260 concurrent connections.
It will be harder to pool connections so you will have more cpu overhead creating/dropping connections.

we would like to take advantage of CDC that would push out to Kafka

Kafka implementation isn't ready for production.

We have worked with Postgres for a long time and have a good, intuitive understanding of how to architect that environment to meet our needs and a rough idea of what the horsepower we have to bring to play would be. YB is completely new to us and so far we have not been able to find the type of guidelines that would help us understand what our starting point should be prior to moving toward benchmark and load testing

The same path will help with YugabyteDB too. Check the architecture section https://docs.yugabyte.com/latest/architecture/ to get a baseline. In this case it will be a combination of Postgresql knowledge (like the connections part) and YugabyteDB too.

so even the simplest guidance would help us at this stage

We can continue on this thread, forums or slack with each step in your application development process (schema, sharding, queries, load testing, production debug help etc).

ddorian · 2020-07-14T09:52:47Z

@alex-gadea please reopen the issue if you have further questions.

yugabyte-ci added the community/request Issues created by external users label May 23, 2020

ddorian self-assigned this May 26, 2020

ddorian added area/docdb YugabyteDB core features area/ysql Yugabyte SQL (YSQL) labels May 26, 2020

ddorian closed this as completed Jul 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Any Documentation on Sizing Suggestions #4569

Any Documentation on Sizing Suggestions #4569

alex-gadea commented May 23, 2020

ddorian commented May 27, 2020 •

edited

Loading

ddorian commented May 27, 2020

ddorian commented Jul 14, 2020

Any Documentation on Sizing Suggestions #4569

Any Documentation on Sizing Suggestions #4569

Comments

alex-gadea commented May 23, 2020

ddorian commented May 27, 2020 • edited Loading

ddorian commented May 27, 2020

ddorian commented Jul 14, 2020

ddorian commented May 27, 2020 •

edited

Loading