Sizing Guide
Usually, the total data set is partitioned horizontally into copysets where each copyset holds a fraction of the data. Since a copyset in production typically includes more than one node for redundancy (where each node is an exact replica of the data in that copyset), let us start with a simplifying assumption that the data resides on a single node per copyset.
The size of a copyset is determined by the following factors:
- The number of rows
- The size of a row in bytes (which is determined by number of columns, the column data types, and the actual values placed in each column)
- Indexes
- Example of a Sizing Calculation
Consider a scenario where the purchasing details of a customer are stored in the purchase table.
Copyright © Cloud Software Group, Inc. All rights reserved.