Correlation (DB)
Use to specify two or more numeric type attributes (columns) in a data set for relative analysis against each other by calculating the correlation between each pair of selected columns.
Information at a Glance
Parameter |
Description |
---|---|
Category | Explore |
Data source type | DB |
Send output to other operators | No |
Data processing tool | n/a |
Algorithm
The covariance between two variables (X and Y) is calculated as shown in the following formula:
where
and
are the mean values for
X and
Y, respectively.
The correlation is calculated by normalizing the covariance, as shown in the following formula:
For information about correlation and covariance, see Correlation and Covariance.
Input
A data set from the preceding operator.
Restrictions
The algorithm is relevant only when run on numeric data.
Configuration
Parameter | Description |
---|---|
Notes | Notes or helpful information about this operator's parameter settings. When you enter content in the Notes field, a yellow asterisk appears on the operator. |
Columns | The numeric columns for which the correlation should be calculated. |