Stepwise Model Builder - Linear Regression
Ribbon bar. In Statistica, open a data set. Then, select the Statistics tab. In the Advanced/Multivariate group, click Advanced Models. On the Stepwise Model Builder submenu, select Linear Regression to display the Stepwise Model Builder - Linear Regression Startup Panel.
Or, select the Data Mining tab. In the Tools group, click Stepwise Model Builder and from the menu, select Linear Regression to display the Stepwise Model Builder - Linear Regression Startup Panel.
Classic menus. Open a data set. On the Statistics - Advanced Linear/Nonlinear Models - Stepwise Model Builder submenu, select Linear Regression to display the Stepwise Model Builder - Linear Regression Startup Panel.
Or, on the Data Mining - Stepwise Model Builder submenu, select Linear Regression to display the Stepwise Model Builder - Linear Regression Startup Panel.
Overview and Workflow
Use the options in the Stepwise Model Builder - Linear Regression Startup Panel to compute the marginal predictor statistics given a current model; specifically, the variables listed in the Marginal Results Table will be entered one at a time into the linear regression containing the predictors listed in the Model Results Table, so that analysts can evaluate the unique contribution of each predictor candidate not in the equation.
Next, select (highlight) the predictor candidates in the Marginal Analysis Variables pane, and click either the Full sample button or the Subsample button to compute the Marginal Results Table results.
Then select (highlight) the predictors in the Marginal Results Table that are to be entered into the full Model Results Table, and click the Add variable button to:
1) Estimate the parameters of the linear regression model including the selected predictors and any predictors previously entered into the regression equation
2) Re-estimate the results for all predictor candidates in the Marginal Results Table
The Model Results Table statistics are always computed for the entire data set.
This means that you cannot separate the set of discrete values available in a discrete predictor candidate, and all values will always be added or removed from models and selections in unison, even if only a single code is selected (highlighted) when the respective predictor is moved in/out of a results table.
When discrete predictors are removed from the prediction equation or the Marginal results table, the same logic applies.
Bootstrapping provides a way for analysts to assess the robustness of the parameter estimates and results in the Model Results Table from repeatedly drawn samples.
Option Descriptions
Select variables. Click this button to display a standard variable selection dialog box. Select a continuous variable as the Dependent or Y variable, and two or more Continuous and/or Categorical Predictors (predictor candidates). After exiting the variable selection dialog box (click the OK button), the respective variable names will be displayed in the Marginal Analysis Variables pane.
Marginal Analysis Variables. After selecting variables for the analyses, this pane will show the selected variable names, their type (Continuous or Categorical), and the variable number in the input file. You can select one or more predictors in this list by highlighting them; to run marginal analyses on the selected variables, click either the Full sample or Subsample button in the Run Marginal Analysis group box.
Set Linear Parameters.
Dependent (Y) Variable. As described in the Introductory Overview, Statistica will compute parameter estimates and other results for a linear regression. This box displays the dependent or Y variable.
Project. Use the Save project and Open project options to save work in progress and to retrieve previously saved projects to continue working with the same variables and model.
Model.
Run Marginal Analysis. Click either the Full sample or Subsample button to add the selected (highlighted) predictor candidates from the Marginal Analysis Variables pane to the Marginal Results Table, and to compute the respective marginal analysis results.
Marginal Results.
- Scatterplots of observed values vs raw residuals for continuous predictors
- Mean plot with errors for categorical predictors
Results will be displayed in standard results spreadsheets and graphs, shown by default in workbooks.
Model Analysis.
Replications. The number of bootstrap replications
% in holdout sample. The proportion of hold-out cases in each bootstrap replication; if this value is 0 (zero), predictive accuracy is computed from the (100%) training sample (used for estimating the parameters)
The program will then create k replications of the data via random sampling with replacement, and designate a proportion p cases in each replication as the hold-out or testing sample. Next, the respective model will be fit to all cases not in the hold-out sample (in the training sample) in the respective replication. The parameter estimates and R squared statistic in each replication are also recorded. Thus, you can then evaluate the distribution of the parameter estimates and fit over the replications.
Model Results. Use these options to compute various model statistics and summaries for the current model with predictors listed in the Model results table, and computed from the full sample.
Marginal Results Table. Displays the marginal analysis results for the currently selected predictor candidates. Right-click on any column header in the Marginal Results Table to display a shortcut menu containing check boxes adjacent to available statistics; when a check box is selected, the respective column is added to the table; when a check box is cleared, the respective column is hidden in the table.
R2 / Pr(f) / Pr(t). These columns display the results statistics.
df. Degrees of freedom for Wald statistic.
Notes:
Sorting the variable list. Click on the R2 column in the Marginal Results Table to sort the table by the respective column values in ascending or descending order.
Selecting predictor candidates in the Marginal Results Table. To select predictor candidates in the Marginal Results Table, click on the respective predictor candidate. Use CTRL+click or SHIFT+click to select specific predictor candidates or lists of contiguous predictor candidates, respectively.
Re-calculating marginal results. Click the Full sample or Subsample button to recalculate the marginal analysis results for the highlighted (selected) predictor variables.
Adding variables to the Model Results Table. Click the Add variable button to move selected (highlighted) predictors into the model and to update the Marginal Results Table.
Add/Remove Model Variables.
The results in the Mode Results Table are always computed for the full sample. After the model parameters are updated, the Marginal Results Table results are then recalculated for all predictor candidates currently not in the model.
Model Results Table. The Model Results Table shows the parameter estimates and summary statistics for the current model, that is, the model with the predictors listed in the pane and computed from the full sample. Right-click on any column header in the Model Results Table to display a shortcut menu containing check boxes adjacent to available statistics; when a check box is selected, the respective column is added to the table; when a check box is cleared, the respective column is hidden in the table.
Note:
Removing variables. To remove predictors from the current model, highlight the respective predictors and then click the Remove variable button. The predictors will be removed from the model, the model will be re-estimated with the remaining predictors, and the Marginal Results Table will be recalculated for all predictor candidates including those that were removed from the model.