Workspace Node: Random Forest Classification - Specifications - Classification Tab

In the Random Forest Classification node dialog box, under the Specifications heading, select the Classification tab to access the following options.

Element Name Description
Misclassification costs Use the options in this group box to assign greater importance to classification (misclassification) of certain classes as compared to others. For example, in medical diagnostics you may want to assign a greater cost for misclassifying malignant tissues as healthy compared to misclassifying healthy tissues as malignant. This implies that the misclassification cost matrix does not have to be symmetric, and in fact it rarely is.

The misclassification costs are combined with the prior probabilities when computing the classification probabilities (during estimation, as well as when computing the final classification probabilities); for details regarding these computations, see Friedman (1999a, p. 11; essentially, the cost will be applied as a relative weight to the classification probabilities, and final classifications will be determined from the products of the two).

Equal When you select the Equal option button, each off-diagonal element of the predicted class (row) by observed class (column) misclassification costs matrix is set equal to 1.0, and the specified prior probabilities for the classes on the dependent variable are not adjusted.
User specified Select this option button if more accurate classification is desired for some classes than others. Note that this option is available only if you have selected the dependent variable codes by clicking the Response codes button on the Specifications - Quick tab.
Prior probabilities Use the options in this group box to specify how likely it is, without using any knowledge derived from data of the values for the predictor variables in the model, that a case or object will fall into one of the classes. The prior probabilities will be combined with the prediction probabilities and misclassification costs to compute the classification probabilities during estimation (building of trees), and to compute the final classifications (see also, Technical Notes).
Estimated Select this option button to specify that the likelihood that a case or object will fall into one of the classes is proportional to the dependent variable class sizes (see example below).
Equal Select this option button to specify that the likelihood that a case or object will fall into one of the classes is the same for all dependent variable classes (see example below).
Example These two options are best explained with an example. In an educational study of high school dropouts, for instance, it may happen that, overall, there are fewer dropouts than there are students who stay in school (i.e., there are different base rates); thus, the a priori probability that a student drops out is lower than that a student remains in school. The a priori probabilities can greatly affect the classification of cases or objects. If differential base rates are not of interest for the study, or if you know that there are about an equal number of cases in each class, then you could set the a priori probabilities to be Equal. If the differential base rates are reflected in the class sizes (as they would be if the sample is a probability sample), set the a priori probabilities to Estimated.
User specified Note that this option is available only after you have selected the specific Response codes for the dependent variable on the Specifications - Quick tab. Select this option button if you have specific knowledge about the base rates (for example, based on previous research). Click the adjacent button to display the Enter values for the prior probabilities dialog box, in which you can specify the a priori probabilities for each class of the dependent variable. If the probabilities do not add up to 1.0, Statistica automatically adjusts them proportionately.

Options / C. See Common Options.

OK Click the OK button to accept all the specifications made in the dialog box and to close it. The analysis results will be placed in the Reporting Documents node after running (updating) the project.