Content

Introduction

The normal distribution is the most important one in all of probability and statistics. Many numerical populations have distributions that can be fit very closely by an appropriate normal curve. Examples include heights, weights, and other physical characteristics (the famous 1903 Biometrika article “On the Laws of Inheritance in Man” discussed many examples of this sort), measurement errors in scientific experiments, anthropometric measurements on fossils, reaction times in psychological experiments, measurements of intelligence and aptitude, scores on various tests, and numerous economic measures and indicators. In addition, even when individual variables themselves are not normally distributed, sums and averages of the variables will under suitable conditions have approximately a normal distribution; this is the content of the Central Limit Theorem discussed in the next chapter.

normal distribution

A continuous rv is said to have a normal distribution with parameters and (or and ), where and , if the pdf of is

Again denotes the base of the natural logarithm system and equals approximately 2.71828, and represents the familiar mathematical constant with approximate value 3.14159. The statement that is normally distributed with parameters and is often abbreviated .

Clearly , but a somewhat complicated calculus argument must be used to verify that . It can be shown that and , so the parameters are the mean and the standard deviation of . Figure 4.13 presents graphs of for several different pairs. Each density curve is symmetric about and bell-shaped, so the center of the bell (point of symmetry) is both the mean of the distribution and the median. The mean is a location parameter, since changing its value rigidly shifts the density curve to one side or the other; is referred to as a scale parameter, because changing its value stretches or compresses the curve horizontally without changing the basic shape. The inflection points of a normal curve (points at which the curve changes from turning downward to turning upward) occur at and . Thus the value of can be visualized as the distance from the mean to these inflection points. A large value of corresponds to a density curve that is quite spread out about , whereas a small value yields a highly concentrated curve. The larger the value of , the more likely it is that a value of far from the mean may be observed.

Figure 4.13 (a) Two different normal density curves (b) Visualizing and for a normal distribution 01925166-48c0-7eca-9860-67f13d0848b1_16_549_1567_1207_565_0.jpg