Workspace Node: Data Health Check Summary - Specifications - Quick Tab

The Data Health Check workspace node can be accessed from the Feature Finder, ribbon bar, or Node Browser. The Quick tab of the specifications dialog box is displayed by default when you double-click the node.

Element Name Description
Variables Click this button to display a standard variable selection dialog box, where you select the variables for the analysis.
Variable selection options Use the options in this group box to specify whether variables are selected separately by variable type, i.e., categorical or continuous, or selected together in one list.
Select continuous and categorical lists Select this option button to specify the continuous and categorical variables separately for the analysis.
Let Statistica automatically separate into continuous and categorical lists Select this option button to let Statistica automatically separate the lists of variables into categorical and continuous lists.
Automatic variable classification If the Let Statistica automatically separate into continuous and categorical lists option button is selected, use these options to specify how Statistica should determine if a variable is either categorical or continuous.
Treat a variable as categorical if it has labels or is of type Text, Integer, or One-byte When this option button is selected, all variables of type text, integer, and byte, and all variables with text labels will be identified as categorical variables; others will be treated as continuous variables. A variable of type integer, text or byte is classified immediately as categorical.
Treat a variable as categorical if it has labels or is of type Text, Integer, or One-byte; or contains only integer values in the first _ cases Select this option button to allow Statistica to identify all variables as described above in Treat a variable as categorical if it has labels or is of type Text, Integer, or One-byte. In addition, up to k case values for all remaining unclassified variables of type double will be inspected. If all inspected values are integers, the variable will be classified as categorical.
Perform automatic classification of variables on first _ percent or _ cases Select an option button to specify the automatic variable classification be performed either on the first _ percent or first _ number of cases in the data set.
Set automatic variable classification as default measurement types Select this check box to apply the results of the automatic classification of variables in the downstream document. Note that this option has no effect if a downstream document is not created. A downstream is created by selecting the Display Data Diagnostic Report and Apply Data Cleaning option button on the Results tab.
Identify all categorical variables with more than _ levels Select this check box to have Statistica identify categorical variables with more than the user-specified number of levels. This is helpful in identifying those categorical variables that may require binning prior to analysis. Enter the number of levels in the adjacent box. The number must be greater than or equal to 1 and less than or equal to 100.
Remove or mark excluded all categorical variables with more than _ levels This option is only enabled if the Identify all categorical variables with more than _ levels check box (see above) is selected. The number in this option will automatically match the number entered in the Identify all categorical variables with more than _ levels box. When this check box is selected, qualified variables are removed or marked excluded in downstream documents depending on the option selected on the Results tab.
Show types of data on Variable Summary Select this check box to produce summary results of types of data for all selected variables. Clear the check box to significantly improve the performance on a large number of variables.

Options. See Common Options.

OK Click the OK button to accept all the specifications made in the dialog box and to close it.

See also, Specifications - Sparse Data tab, Specifications - Outliers tab, Specifications - Invariant Variables tab, Specifications - Redundancy tab, Specifications - Options tab, Results tab, and Home tab.