Reliability and Item Analysis Introductory Overview - Designing a Reliable Scale
After the discussion so far, it should be clear that, the more reliable a scale, the better (e.g., more valid) the scale. As mentioned earlier, one way to make a sum scale more valid is by adding items. Reliability and Item Analysis methods include options that allow you to compute how many items would have to be added in order to achieve a particular reliability, or how reliable the scale would be if a certain number of items were added. However, in practice, the number of items on a questionnaire is usually limited by various other factors (e.g., respondents get tired, overall space is limited, etc.). Let us return to our prejudice example, and outline the steps that one would generally follow in order to design the scale so that it will be reliable:
| Statistica
RELIABL. ANALYSIS | Summary for scale: Mean=46.1100 Std.Dv.=8.26444 Valid n:100
Cronbach alpha: .794313 Standardized alpha: .800491 Average inter-item corr.: .297818 | |||||
| variable | Mean if
deleted | Var. if
deleted | StDv. if
deleted | Itm-Totl
Correl. | Squared
Multp. R | Alpha if
deleted |
| ITEM1 | 41.61000 | 51.93790 | 7.206795 | .656298 | .507160 | .752243 |
| ITEM2 | 41.37000 | 53.79310 | 7.334378 | .666111 | .533015 | .754692 |
| ITEM3 | 41.41000 | 54.86190 | 7.406882 | .549226 | .363895 | .766778 |
| ITEM4 | 41.63000 | 56.57310 | 7.521509 | .470852 | .305573 | .776015 |
| ITEM5 | 41.52000 | 64.16961 | 8.010593 | .054609 | .057399 | .824907 |
| ITEM6 | 41.56000 | 62.68640 | 7.917474 | .118561 | .045653 | .817907 |
| ITEM7 | 41.46000 | 54.02840 | 7.350401 | .587637 | .443563 | .762033 |
| ITEM8 | 41.33000 | 53.32110 | 7.302130 | .609204 | .446298 | .758992 |
| ITEM9 | 41.44000 | 55.06640 | 7.420674 | .502529 | .328149 | .772013 |
| ITEM10 | 41.66000 | 53.78440 | 7.333785 | .572875 | .410561 | .763314 |
Shown above are the results for 10 items, that are discussed in greater detail in Examples. Of most interest to us are the three right-most columns in this spreadsheet. They show us the correlation between the respective item and the total sum score (without the respective item), the squared multiple correlation between the respective item and all others, and the internal consistency of the scale (coefficient Alpha) if the respective item would be deleted. Clearly, items 5 and 6 "stick out," in that they are not consistent with the rest of the scale. Their correlations with the sum scale are .05 and .12, respectively, while all other items correlate at .45 or better. In the right-most column, we can see that the reliability of the scale would be about .82 if either of the two items were to be deleted. Thus, we would probably delete the two items from this scale.