Storage Parameters Dialog Box

For operators that can generate an output table, the Storage Parameters dialog box allows the user to specify several additional parameters regarding storage method and compression.

This helps avoid proliferation of excessively large quantities of data, and is a convenience feature that allows for re-use of similar workflows without changing constant values in numerous places.
Important: Currently, this feature is available only for Greenplum databases.

Parameters
Append Only? Allows the operator's output data to be simply appended to the existing data source.

Default value: No.

Columnar Storage? Allows storage of the data in columns. This option is enabled only if Append Only is enabled.

Default value: No.

Compression? If you select Yes, you must set the compression level to a number between 1 and 9. This option is enabled only if Append Only is enabled.

Default value: No.

When Compression is enabled, the default compression level is 1.
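
The dependencies among the three options above can be sketched as a small validation routine. This is only an illustration, not the product's implementation: the function name storage_with_clause and its arguments are hypothetical, and the mapping of the dialog options to the Greenplum storage options appendonly, orientation, and compresslevel is an assumption about how such settings are typically expressed in a CREATE TABLE statement.

```python
def storage_with_clause(append_only=False, columnar=False,
                        compression=False, level=1):
    """Build a Greenplum-style WITH (...) clause from the dialog settings.

    Hypothetical helper: the option names in the returned string are an
    assumed mapping of the dialog fields, not taken from the product.
    """
    # Columnar Storage and Compression are enabled only with Append Only.
    if (columnar or compression) and not append_only:
        raise ValueError("Columnar Storage and Compression require Append Only")
    if not append_only:
        return ""  # all defaults (No): no storage clause is needed

    options = ["appendonly=true"]
    if columnar:
        options.append("orientation=column")
    if compression:
        # The dialog accepts a level between 1 and 9; the default is 1.
        if not 1 <= level <= 9:
            raise ValueError("compression level must be between 1 and 9")
        options.append(f"compresslevel={level}")
    return "WITH (" + ", ".join(options) + ")"


print(storage_with_clause(append_only=True, columnar=True,
                          compression=True, level=5))
```

With every option left at its default of No, the sketch returns an empty string, which corresponds to creating the output table without any storage clause.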

Distribution? Determines how the rows of the output table are distributed across the Greenplum segments.

Options:

  • Distributed randomly
  • Distributed by these columns

If set to Distributed by these columns, you must enter the distribution clause as text, in the form of a list of columns separated by commas. The columns specified cannot be of type array.

Default value: Distributed randomly.
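
As a rough illustration of the two distribution options, the following sketch turns the dialog choice into the corresponding Greenplum clause, DISTRIBUTED RANDOMLY or DISTRIBUTED BY (...). The function name distribution_clause, the mode strings, and the example column names are hypothetical. The sketch checks only that the comma-separated list is well formed; it does not check column types, so the restriction on array columns is not enforced here.

```python
def distribution_clause(mode="randomly", columns=""):
    """Return a Greenplum distribution clause for the chosen option.

    Hypothetical helper: `mode` is "randomly" or "by_columns", and
    `columns` is the comma-separated text entered in the dialog.
    """
    if mode == "randomly":
        return "DISTRIBUTED RANDOMLY"  # the dialog's default
    if mode != "by_columns":
        raise ValueError(f"unknown distribution mode: {mode!r}")

    # Split the text on commas and trim whitespace around each name.
    names = [name.strip() for name in columns.split(",")]
    if any(not name for name in names):
        raise ValueError("a comma-separated list of columns is required")
    # Note: column types are not inspected, so array columns are not rejected.
    return "DISTRIBUTED BY (" + ", ".join(names) + ")"


print(distribution_clause())
print(distribution_clause("by_columns", "customer_id, order_date"))
```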

Partition By This option is available for Greenplum databases, allowing the data to be distributed on disk by table partitioning.
Note: Table partitioning addresses the problem of supporting very large tables, such as fact tables, by allowing you to divide them into smaller and more manageable pieces.

Specify the desired partitioning method for the data on disk. See the Greenplum documentation for more details.