Generalized Linear Model (GLM) Introductory Overview - Sigma-Restricted and Overparameterized Model
Unlike the Multiple Regression model, which is usually applied to cases where the X variables are continuous, the general linear model is frequently applied to analyze any ANOVA or MANOVA design with categorical predictor variables, any ANCOVA or MANCOVA design with both categorical and continuous predictor variables, and any multiple or multivariate regression design with continuous predictor variables. To illustrate, Gender is clearly a nominal level variable (anyone who attempts to rank order the sexes on any dimension does so at his or her own peril in today's world). There are two basic methods by which Gender can be coded into one or more (non-offensive) predictor variables, and analyzed using the general linear model.
As further illustration, consider an example where a model is specified that has 1 factor that contains 3 three levels A, B, and C. Under the sigma-restricted parameterization, the factor would be coded as follows:
Factor | Column A | Column B |
A | 1 | 0 |
B | 0 | 1 |
C | -1 | -1 |
This parameterization leads to the interpretation that each coefficient estimates the difference between each level and the average of the other 2 levels, i.e., the coefficient for A is the estimate of the difference between level A and the average of levels of B and C.
Overparameterized model (coding of categorical predictors). The second basic method for recoding categorical predictors is the indicator variable approach. In this method, a separate predictor variable is coded for each group identified by a categorical predictor variable. To illustrate, females might be assigned a value of 1 and males a value of 0 on a first predictor variable identifying membership in the female Gender group, and males would then be assigned a value of 1 and females a value of 0 on a second predictor variable identifying membership in the male Gender group. Note that this method of recoding categorical predictor variables will almost always lead to X'X matrices with redundant columns, and thus require a generalized inverse for solving the normal equations. As such, this method is often called the overparameterized model for representing categorical predictor variables, because it results in more columns in the X'X than are necessary for determining the relationships of categorical predictor variables to responses on the dependent variables.
True to its description as general, the general linear model can be used to perform analyses with categorical predictor variables that are coded using either of the two basic methods that have been described.
Other Generalized Linear Model (GLM) Introductory Overview Topics
A detailed discussion of univariate and multivariate ANOVA techniques can also be found in the Introductory Overview section of the ANOVA/MANOVA module; a discussion of multiple regression methods is provided in the Multiple Regression Overviews.