Optimal Binning Startup Panel
Ribbon bar. Select the Data Mining tab. In the Clustering/Grouping group, click Optimal Binning to display the Optimal Binning Startup Panel.
The Startup Panel contains two tabs: Quick and Advanced. See also, Optimal Binning for Predictive Data Mining Introductory Overview and Optimal Binning for Predictive Data Mining Program Overview.
Naming conventions for recoded variables
When you click the Summary button (see option description below), the program will automatically determine a best way to combine the classes in each categorical predictor to yield strong relationships to the dependent or outcome variable specified for the analysis. The recoded or aggregated class codes will automatically be placed into the input data file, into the variables designated for output. Those newly recoded variables will be named according to the following conventions:
Variable names
The variable names will be created from the respective categorical predictor variables (from which they were computed), followed by the further clarification (Grouped). For example, a variable computed from aggregated SIC Codes would be named SIC Codes(Grouped.
Code text labels
The code labels will be created as Group1(k1), Group2(k2), .., Groupn(kn), where
- the numbers following the common prefix Group designate the respective simple ordinal counts (first group, second group, etc.)
- the numbers in parentheses indicate the number of classes from the original categorical input variable that were combined into the respective group.
For example, Group4(53) would mean that the fourth group was created as a combination of 53 classes or categories from the original categorical predictor variable.
Code descriptions
In addition, for each newly created code, a long code description is created, with all codes aggregated into the respective category. Those codes can be reviewed in the Text Labels Editor. All information regarding the recoding of class variables is recorded in the respective newly created class codes. For additional details regarding text values in Statistica, see also Using the Text Labels Editor and Notes on Text Labels and Text Values.
Element Name | Description |
---|---|
Summary | Click the Summary button to begin the computations, and to recode the respective variables as specified on the Quick tab. Summary results spreadsheets with details regarding the recoding that was performed will display. |
Cancel | Click the Cancel button to close the Startup Panel without performing an analysis. |
Options | See Options Menu for descriptions of the commands on this menu. |
By Group | Click the By Group button to display the By Group specification dialog box. |
Open Data | Click the Open Data button to display the Select Data Source dialog box, which contains options to choose the spreadsheet on which to perform the analysis. The Select Data Source dialog box contains a list of the spreadsheets that are currently active. |
Select Cases | Click the Select Cases button to display the Analysis/Graph Case Selection Conditions dialog box, which contains options to create conditions for which cases will be included (or excluded) in the current analysis. More information is available in the case selection conditions overview, syntax summary, and dialog box description. |
W | Click the W (Weight) button to display the Analysis/Graph Case Weights dialog box, which contains options to adjust the contribution of individual cases to the outcome of the current analysis by weighting those cases in proportion to the values of a selected variable. |