Workspace Node: Data Health Check Summary - Results Tab
In the Data Health Check Summary node dialog box, select the Results tab to access the following options.
Data cleaning and report options.
Element Name | Description |
---|---|
Display Data Diagnostic Report Only | Select this option button to produce a summary with the results of all of the specified health checks, i.e., sparse data check, outlier check, etc. |
Display Data Diagnostic Report And Apply Data Cleaning | Select this option button to produce a summary with the results of all of the specified health checks, i.e., sparse data check, outlier check, etc. In addition, the results of the check will be applied to the downstream document, that is, all sparse data will be removed as well as invariant variables, etc. |
Display Data Diagnostic Report and Mark Variables Excluded | Select this option button to produce a summary with the results of all the specified health checks, i.e., sparse data check, outlier check, etc. In addition, the results of the check will be applied to the downstream document, that is, all sparse data and outlying data will be removed; sparse variables, invariant variables, and redundant variables will not be removed, but marked as excluded. |
Continuous variables | Select the check boxes for the statistics/graphs to be computed and placed in the Reporting Documents after running (updating) the project. |
Graphical comparative summary display | Select this check box to display up to six variables on one graph, a quick way to create a summary graphical document that enables easy comparison between variables; they will be located in the Graphical comparative summary display folder. A histogram, box plot, and descriptive statistics are produced per variable. The histogram and box plot use the same scale. Also created is a set of graphs (histograms, box plots, and statistics) comparing each variable before and after the data cleaning application; they will be located in the Graphical comparisons of continuous variables before and after data cleaning folder. Comparisons are not created for variables that consist only of zeros and/or missing data. |
Normal probability plot | Select this check box to produce a cascade of normal probability plots for the selected variables, one plot per selected variable. For more information on how the standard normal probability plot is constructed, see Normal Probability Plots. |
Categorical variables | |
Histograms | Select this check box to produce a histogram for each categorical variable. |
Frequency tables | Select this check box to produce a frequency table for each categorical variable. |
All variables | |
Parallel coordinate plot | Select this check box to produce a parallel coordinate plot for all selected variables.
Options. See Common Options. |
OK | Click the OK button to accept all the specifications made in the dialog box and to close it. |
Copyright © 2021. Cloud Software Group, Inc. All Rights Reserved.