Correlation (DB)
Use to specify two or more numeric type attributes (columns) in a data set for relative analysis against each other by calculating the correlation between each pair of selected columns.
Information at a Glance
|
Parameter |
Description |
|---|---|
| Category | Explore |
| Data source type | DB |
| Send output to other operators | No |
| Data processing tool | n/a |
Algorithm
The covariance between two variables (X and Y) is calculated as shown in the following formula:
where
and
are the mean values for
X and
Y, respectively.
The correlation is calculated by normalizing the covariance, as shown in the following formula:
For information about correlation and covariance, see Correlation and Covariance.
Input
A data set from the preceding operator.
Restrictions
The algorithm is relevant only when run on numeric data.
Configuration
| Parameter | Description |
|---|---|
| Notes | Notes or helpful information about this operator's parameter settings. When you enter content in the Notes field, a yellow asterisk appears on the operator. |
| Columns | The numeric columns for which the correlation should be calculated. |
Output
