When the ’s are normally distributed, so is for every sample size . The derivations in Example 5.21 and simulation experiment of Example 5.24 suggest that even when the population distribution is highly nonnormal, averaging produces a distribution more bell-shaped than the one being sampled. A reasonable conjecture is that

if is large, a suitable normal curve will approximate the actual distribution of .

The formal statement of this result is the most important theorem of probability.

The Central Limit Theorem (CLT)

Let be a random sample from a distribution with mean and variance . Then if is sufficiently large,

  • has approximately a normal distribution with
  • also has approximately a normal distribution with
    • ,
    • .

The larger the value of , the better the approximation.

Figure 5.16 illustrates the Central Limit Theorem. According to the CLT, when is large and we wish to calculate a probability such as , we need only “pretend” that is normal, standardize it, and use the normal table. The resulting answer will be approximately correct. The exact answer could be obtained only by first finding the distribution of , so the CLT provides a truly impressive shortcut. The proof of the theorem involves much advanced mathematics.

Figure 5.16 The Central Limit Theorem illustrated 0192609f-6f5c-74c9-8588-c1ef28b2184d_35_817_176_760_409_0.jpg

EXAMPLE 5.27

EXAMPLE 5.28

The CLT provides insight into why many random variables have probability distributions that are approximately normal. For example, the measurement error in a scientific experiment can be thought of as the sum of a number of underlying perturbations and errors of small magnitude.

A practical difficulty in applying the CLT is in knowing when is sufficiently large. The problem is that the accuracy of the approximation for a particular depends on the shape of the original underlying distribution being sampled. If the underlying distribution is close to a normal density curve, then the approximation will be good even for a small , whereas if it is far from being normal, then a large will be required.

Rule of Thumb

The Central Limit Theorem can generally be used if .

There are population distributions for which even an of 40 or 50 does not suffice, but such distributions are rarely encountered in practice. On the other hand, the rule of thumb is often conservative; for many population distributions, an much less than 30 would suffice. For example, in the case of a uniform population distribution, the CLT gives a good approximation for .

EXAMPLE 5.29