Recall that the mean value and standard deviation of a binomial random variable $X$ are $\mu_X = np$ and $\sigma_X = \sqrt{np(1-p)}$, respectively. Figure 4.25 displays a binomial probability histogram for one particular choice of $n$ and $p$, with $\mu$ and $\sigma$ computed from these formulas.
Figure 4.25
Binomial probability histogram with normal approximation curve superimposed

A normal curve with this $\mu$ and $\sigma$ has been superimposed on the probability histogram. Although the probability histogram is a bit skewed (because $p \neq .5$), the normal curve gives a very good approximation, especially in the middle part of the picture. The area of any rectangle (probability of any particular value) except those in the extreme tails can be accurately approximated by the corresponding normal curve area.
Example
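For example, the exact probability $P(X = 10)$ is the area of the rectangle centered at 10, whereas the area under the normal curve between 9.5 and 10.5 is $P\left(\frac{9.5 - \mu}{\sigma} \le Z \le \frac{10.5 - \mu}{\sigma}\right)$, where $Z$ is a standard normal rv. The two areas are nearly equal.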
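The comparison between a rectangle's area and the corresponding normal curve area can be sketched numerically. The values `n = 20` and `p = 0.6` below are illustrative assumptions, not necessarily those used in Figure 4.25, and the helper names are hypothetical. The standard normal cdf is written in terms of the error function from Python's standard library.

```python
from math import comb, erf, sqrt

def binom_pmf(x, n, p):
    """Exact binomial probability P(X = x)."""
    return comb(n, x) * p**x * (1 - p)**(n - x)

def std_normal_cdf(z):
    """Standard normal cdf, expressed through the error function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

def normal_area(a, b, mu, sigma):
    """Area under the normal curve with mean mu and sd sigma between a and b."""
    return std_normal_cdf((b - mu) / sigma) - std_normal_cdf((a - mu) / sigma)

# Illustrative values only (an assumption, not taken from the figure).
n, p, x = 20, 0.6, 10
mu = n * p
sigma = sqrt(n * p * (1 - p))

exact = binom_pmf(x, n, p)                          # area of the rectangle at x
approx = normal_area(x - 0.5, x + 0.5, mu, sigma)   # normal area over [9.5, 10.5]

print(f"exact P(X = {x}) = {exact:.4f}")
print(f"normal curve area = {approx:.4f}")
```

For these assumed values the two numbers agree to within about one hundredth, even though the distribution is somewhat skewed.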
More generally, as long as the binomial probability histogram is not too skewed, binomial probabilities can be well approximated by normal curve areas. It is then customary to say that $X$ has approximately a normal distribution.
Proposition
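Let $X$ be a binomial rv based on $n$ trials with success probability $p$. Then if the binomial probability histogram is not too skewed, $X$ has approximately a normal distribution with $\mu = np$ and $\sigma = \sqrt{np(1-p)}$. In particular, for $x$ a possible value of $X$,
$$P(X \le x) \approx \Phi\left(\frac{x + .5 - np}{\sqrt{np(1-p)}}\right),$$
where $\Phi$ denotes the standard normal cdf; that is, the probability is approximated by the area under the normal curve to the left of $x + .5$.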
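As a minimal sketch of the proposition, the following compares the exact cumulative binomial probability with the normal curve area to the left of $x + .5$ (the continuity correction). The function names and the values `n = 25`, `p = 0.6` are hypothetical choices made for illustration.

```python
from math import comb, erf, sqrt

def binom_cdf(x, n, p):
    """Exact P(X <= x) for a binomial rv, summed term by term."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(x + 1))

def normal_approx_cdf(x, n, p):
    """Normal approximation to P(X <= x) with continuity correction."""
    mu = n * p
    sigma = sqrt(n * p * (1 - p))
    z = (x + 0.5 - mu) / sigma
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))   # standard normal cdf at z

# Assumed illustrative parameters.
n, p = 25, 0.6
for x in (12, 15, 18):
    exact = binom_cdf(x, n, p)
    approx = normal_approx_cdf(x, n, p)
    print(f"x = {x:2d}  exact = {exact:.4f}  approx = {approx:.4f}")
```

Adding .5 to $x$ makes the normal area cover the whole rectangle centered at $x$, which is why the corrected approximation tracks the exact value more closely than using $x$ itself.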
In practice, the approximation is adequate provided that both $np \ge 10$ and $n(1-p) \ge 10$ (i.e., the expected number of successes and the expected number of failures are both at least 10), since there is then enough symmetry in the underlying binomial distribution.
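The effect of this rule of thumb can be checked numerically. The sketch below, with hypothetical helper names and assumed parameter pairs, tests the two conditions and measures the largest gap between the exact binomial cdf and its continuity-corrected normal approximation over all possible values.

```python
from math import comb, erf, sqrt

def rule_of_thumb_ok(n, p):
    """True when expected successes and expected failures are both at least 10."""
    return n * p >= 10 and n * (1 - p) >= 10

def max_cdf_error(n, p):
    """Largest |exact cdf - normal approximation| over x = 0, 1, ..., n."""
    mu = n * p
    sigma = sqrt(n * p * (1 - p))
    cumulative = 0.0
    worst = 0.0
    for x in range(n + 1):
        cumulative += comb(n, x) * p**x * (1 - p)**(n - x)   # exact P(X <= x)
        approx = 0.5 * (1.0 + erf((x + 0.5 - mu) / (sigma * sqrt(2.0))))
        worst = max(worst, abs(cumulative - approx))
    return worst

# Assumed cases: one satisfying the rule, one violating it (np = 2).
for n, p in ((100, 0.5), (20, 0.1)):
    print(n, p, rule_of_thumb_ok(n, p), round(max_cdf_error(n, p), 4))
```

With these assumed inputs, the case that violates the rule shows a noticeably larger maximum error, which is consistent with the skewness argument in the text.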
A direct proof of the approximation’s validity is quite difficult. In the next chapter we’ll see that it is a consequence of a more general result called the Central Limit Theorem. In all honesty, the approximation is not so important for probability calculation as it once was. This is because software can now calculate binomial probabilities exactly for quite large values of $n$.
When the objective of our investigation is to make an inference about a population proportion $p$, interest will focus on the sample proportion of successes $X/n$ rather than on $X$ itself. Because this proportion is just $X$ multiplied by the constant $1/n$, it will also have approximately a normal distribution (with mean $p$ and standard deviation $\sqrt{p(1-p)/n}$) provided that both $np \ge 10$ and $n(1-p) \ge 10$. This normal approximation is the basis for several inferential procedures to be discussed in later chapters.
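To illustrate the rescaling from $X$ to the sample proportion $X/n$, here is a small sketch that computes the approximate mean and standard deviation of the sample proportion and uses them to approximate the chance that it falls within a given margin of $p$. The function names and the values `n = 100`, `p = 0.5`, `margin = 0.1` are assumptions for illustration, and no continuity correction is applied.

```python
from math import erf, sqrt

def std_normal_cdf(z):
    """Standard normal cdf, expressed through the error function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

def sample_proportion_params(n, p):
    """Mean and standard deviation of X/n when X is binomial(n, p)."""
    return p, sqrt(p * (1 - p) / n)

def approx_prob_within(n, p, margin):
    """Normal approximation to P(|X/n - p| <= margin), no continuity correction."""
    _, sd = sample_proportion_params(n, p)
    z = margin / sd
    return std_normal_cdf(z) - std_normal_cdf(-z)

# Assumed illustrative values; here np = n(1 - p) = 50, so the rule of thumb holds.
n, p, margin = 100, 0.5, 0.1
mean, sd = sample_proportion_params(n, p)
print(f"mean = {mean}, sd = {sd:.3f}")
print(f"approx P(|X/n - p| <= {margin}) = {approx_prob_within(n, p, margin):.4f}")
```

Dividing $X$ by $n$ divides both its mean $np$ and its standard deviation $\sqrt{np(1-p)}$ by $n$, which gives the two parameters returned above.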