Example 1: CHAID Classification Tree
This example illustrates an analysis of the Boston housing data (Harrison & Rubinfeld, 1978) that was reported by Lim, Loh, and Shih (1997). This data file is also used in Example 2: Discriminant-Based Univariate Splits for Categorical and Ordered Predictors in the Classification Trees Analysis module. Median prices of housing tracts are classified as Low, Medium, or High on the dependent variable Price. There is 1 categorical predictor, Cat1, and 12 ordered predictors, Ord1 through Ord12. A duplicate of the learning sample is used as a test sample. The sample identifier variable is Sample and contains codes of 1 for Learning and 2 for Test. The complete data set containing a total of 1,012 cases is available in the example data file Boston2.sta. Open this data file via the File - Open Examples menu; it is in the Datasets folder. Part of this data file is shown below.

Click on the Validation tab and select the V-fold cross-validation check box. Also, click the Test sample button to display the Cross-Validation dialog. Click the Sample identifier variable button and select variable Sample. Learning (the default value) is the Code for analysis sample; also set the Status to On.

Click OK on the Cross-Validation dialog to return to the General CHAID Models Quick specs dialog.

Leave all other defaults and click OK to begin the analysis, and then to display the General CHAID Models Results dialog.

As also described in General Computation Issues and Unique Solutions of Statistica GCHAID (see Reviewing Large Trees: Unique Analysis Management Tools in GC&RT Introductory Overview - Basic Ideas Part II), the most convenient way (and most standard way, from the user-interface point of view) to review information in trees is via the tree browser. Click the Tree browser button to review the final tree in the efficient Workbook Tree Browser.

As also described in the Workbook Tree Browser, it is easy to review large trees by clicking the nodes in the left pane and observing the changes in the distribution of the observations assigned to the respective nodes. In fact an "animation-like" effect can be created in this manner.

and the Data belonging to node button.

As you can see, the Low housing prices for observations in this node are associated with the pattern of predictor values shown in the parallel coordinate plot. This type of plot of the pattern of values for each observation over the predictor variables can provide valuable insights into overall "patterns" for observations classified into (or predicted to belong to) a particular node.
Of course, for predictive purposes, the sequence of if-then conditions (splits) that lead to the respective node of interest - as shown in the summary tree graph or the Workbook Tree Browser - is of greatest interest.