Just as the most useful univariate distribution in statistical practice is the normal distribution, the most useful joint distribution for two rv’s and is the bivariate normal distribution. The pdf is somewhat complicated:
A graph of this pdf, the density surface, appears in Figure 5.6. It follows (after some tricky integration) that the marginal distribution of is normal with mean value and standard deviation , and similarly the marginal distribution of is normal with mean and standard deviation . The fifth parameter of the distribution is , which can be shown to be the correlation coefficient between and .
Figure 5.6 A graph of the bivariate normal pdf
It is not at all straightforward to integrate the bivariate normal pdf in order to calculate probabilities. Instead, selected software packages employ numerical integration techniques for this purpose.
It can also be shown that the conditional distribution of given that is normal. This can be seen geometrically by slicing the density surface with a plane perpendicular to the passing through the value on that axis; the result is a normal curve sketched out on the slicing plane. The conditional mean value is , a linear function of , and the conditional variance is . The closer the correlation coefficient is to 1 or -1, the less variability there is in the conditional distribution. Analogous results hold for the conditional distribution of given that .
The bivariate normal distribution can be generalized to the multivariate normal distribution. Its density function is quite complicated, and the only way to write it compactly is to employ matrix notation. If a collection of variables has this distribution, then the marginal distribution of any single variable is normal, the conditional distribution of any single variable given values of the other variables is normal, the joint marginal distribution of any pair of variables is bivariate normal, and the joint marginal distribution of any subset of three or more of the variables is again multivariate normal.