Normalization by Percentile 


Assume that the data contain n rows with seven variables: A, B, C, D, E, F, and G. Variable E serves as the example in the calculations below; the remaining variables are normalized in the same way.

Without rescaling (Baseline variable = None)

The normalized value of ei for variable E in the ith row is calculated as:

images/n_percentile_without.gif

where

qE,P% = the Pth percentile of variable E, that is, the value that P% of the values for variable E (among the selected rows) are less than or equal to

P = the percentile value that you specify when you normalize the data.
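
The formula itself is contained in the image referenced above and is not reproduced in the text. The following Python sketch assumes the form implied by the definitions, namely that each value is divided by the percentile: normalized ei = ei / qE,P%. The function names are hypothetical, and the nearest-rank rule used to compute the percentile is also an assumption; the application may interpolate between values instead.

```python
import math


def percentile_nearest_rank(values, p):
    """Return the smallest value that at least p% of `values` are <= to.

    Assumption: nearest-rank percentile; the product may use another rule.
    """
    if not values:
        raise ValueError("values must be non-empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")
    ordered = sorted(values)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


def normalize_by_percentile(values, p):
    """Divide every value by the p-th percentile (assumed formula)."""
    q = percentile_nearest_rank(values, p)
    if q == 0:
        raise ZeroDivisionError("the percentile is zero; cannot normalize")
    return [v / q for v in values]


# Variable E for five selected rows, normalized with P = 50.
e = [2.0, 4.0, 6.0, 8.0, 10.0]
q_e = percentile_nearest_rank(e, 50)      # 6.0
normalized_e = normalize_by_percentile(e, 50)
print(q_e, normalized_e[2])               # 6.0 1.0
```

Under this assumption, the row whose value equals the percentile is mapped to 1, and all other rows are expressed relative to it.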

Rescaling by a baseline variable

If we select variable A as the baseline variable, the normalized value of ei for variable E in the ith row is calculated as:

images/n_percentile_with.gif

where

qA,P% = the Pth percentile of variable A, that is, the value that P% of the values for variable A (among the selected rows) are less than or equal to

qE,P% = the Pth percentile of variable E, that is, the value that P% of the values for variable E (among the selected rows) are less than or equal to

P = the percentile value that you specify when you normalize the data.
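
As above, the formula is contained in the referenced image only. The sketch below assumes that rescaling multiplies each value by the ratio of the two percentiles: normalized ei = ei * qA,P% / qE,P%. The function names and the nearest-rank percentile rule are hypothetical choices made for illustration, not taken from the application.

```python
import math


def percentile_nearest_rank(values, p):
    """Return the smallest value that at least p% of `values` are <= to.

    Assumption: nearest-rank percentile; the product may use another rule.
    """
    if not values:
        raise ValueError("values must be non-empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")
    ordered = sorted(values)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


def normalize_with_baseline(values, baseline, p):
    """Rescale `values` so that their p-th percentile matches the baseline's.

    Assumed formula: value * q_baseline / q_values.
    """
    q_values = percentile_nearest_rank(values, p)
    q_baseline = percentile_nearest_rank(baseline, p)
    if q_values == 0:
        raise ZeroDivisionError("the percentile is zero; cannot normalize")
    return [v * q_baseline / q_values for v in values]


# Variable A is the baseline; variable E is rescaled to it with P = 50.
a = [10.0, 20.0, 30.0, 40.0, 50.0]
e = [2.0, 4.0, 6.0, 8.0, 10.0]
print(normalize_with_baseline(e, a, 50))
# [10.0, 20.0, 30.0, 40.0, 50.0]
```

With this assumed formula, the Pth percentile of the normalized variable E equals qA,P%, so variables normalized against the same baseline become directly comparable at that percentile.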

See also:

Normalizing Columns