Measures of central tendency locate only the center of a distribution of measures. Other measures often are needed to describe data.

For example, consider the two sets of numbers presented in Table 1.

The mean, the median, and the mode of each employee's daily earnings all equal $200. Yet, there is significant difference between the two sets of numbers. For example, the daily earnings of Employee A are much more consistent than those of Employee B, which show great variation. This example illustrates the need for **measures of variation** or spread.

Range

**range**. Range is defined as the difference between the largest and smallest values. The range is a single number. To say that the range is from $190 to $200, although informative, is not really a correct use of the term. The range for Employee A is $210 – $190

**=**$20; the range for Employee B is $400 – $0 = $400.

Deviation and variance

**variance**is defined as the sum of the squared deviations of

*n*measurements from their mean divided by (

*n*– 1).

So, from Table 1, the mean for Employee A is $200, and the deviations from the mean are as follows:

0, +10, –10, +1, –1, –5, +5, 0

The squared deviations from the mean, therefore, are the following:

0, 100, 100, 1, 1, 25, 25, 0

The sum of these squared deviations from the mean equals 252. Dividing by ( *n* – 1), or 8 – 1, yields

For Employee B, the mean is also $200, and the deviations from the mean are as follows:

0, –180, +200, –200, +190, –190, 0, +180

The squared deviations, therefore, are the following:

0; 32,400; 40,000; 40,000; 36,100; 36,100; 0; 32,400

The sum of these squared deviations equals 217,000. Dividing by ( *n* – 1) yields

Although they earned the same totals, there is significant difference in variance between the daily earnings of the two employees.

Standard deviation

**standard deviation**is defined as the positive square root of the variance; thus, the standard deviation of Employee A's daily earnings is the positive square root of 36, or 6. The standard deviation of Employee B's daily earnings is the positive square root of 31,000, or about 176.

*Notation
*

*s*

^{2}denotes the variance of a sample.

σ ^{2} denotes the variance of a population.

*s* denotes the standard deviation of a sample.

σ denotes the standard deviation of a population.

Empirical rule: The normal curve

- The interval from one standard deviation below the mean to one standard deviation above the mean contains approximately 68 percent of the measurements.

- The interval from two standard deviations below the mean to two standard deviations above the mean contains approximately 95 percent of the measurements.

- The interval from three standard deviations below the mean to three standard deviations above the mean contains approximately all the measurements.

These mound‐shaped curves usually are called **normal distributions** or normal curves (see Figures 1, 2, and 3).

Figure 1.The interval ±σ from the mean contains 68 percent of the measurements.

Figure 2.The interval ±2σ from the mean contains 95 percent of the measurements.

Figure 3.The interval ±3σ from the mean contains 99.7 percent of the measurements.

Shortcut formulae

Σ *x* = sum of the measures

Σ *x* ^{2}= sum of the squares of the measures

For example, using these six measures: 3, 9, 1, 2, 5, and 4:

The quantities are then substituted into the shortcut formula to find

The variance and standard deviation are now found as before:

Percentile

*N*th

**percentile**is defined as the value such that

*N*percent of the values lie below it. So, for example, a score of 5 percent from the top score would be the 95th percentile because it is above 95 percent of the other scores (see Figure 4).

Figure 4.Ninety‐five percent of the test scores are less than the value of the 95th percentile.

Quartiles and interquartile range

The **lower quartile** ( *Q* _{1}) is defined as the 25th percentile; thus, 75 percent of the measures are above the lower quartile. The **middle quartile** ( *Q* _{2}) is defined as the 50th percentile, which is, in fact, the median of all the measures. The **upper quartile** ( *Q* _{3}) is defined as the 75th percentile; thus, only 25 percent of the measures are above the upper quartile.

The **interquartile range** ( *IQR*) is the value for Q _{3} – Q _{1}. Like the range, it is a single value.

Figure 5 illustrates the locations of the median and the quartiles for a set of 20 test scores.

Figure 5.An illustration of the quartiles and the interquartile range.