5.2.1 Covariance

When two random variables $X$ and $Y$ are not independent, it is frequently of interest to assess how strongly they are related to one another.

Definition

The covariance between two rv’s $X$ and $Y$ is
$Cov (X, Y) = E [(X - μ_{X}) (Y - μ_{Y})] = {\sum_{x} \sum_{y} (x - μ_{X}) (y - μ_{Y}) p (x, y) \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} (x - μ_{X}) (y - μ_{Y}) f (x, y) d x d y if X, Y are discrete if X, Y are continuous$

That is, since $X - μ_{X}$ and $Y - μ_{Y}$ are the deviations of the two variables from their respective mean values, the covariance is the expected product of deviations. Note that $Cov (X, X) = E [(X - μ_{X})^{2}] = V (X) .$

The rationale for the definition is as follows.

Suppose $X$ and $Y$ have a strong positive relationship to one another, by which we mean that large values of $X$ tend to occur with large values of $Y$ and small values of $X$ with small values of $Y$ .
Then most of the probability mass or density will be associated with $(x - μ_{X})$ and $(y - μ_{Y})$ , either both positive (both $X$ and $Y$ above their respective means) or both negative, so the product $(x - μ_{X}) (y - μ_{Y})$ will tend to be positive.
Thus for a strong positive relationship, $Cov (X, Y)$ should be quite positive.
For a strong negative relationship, the signs of $(x - μ_{X})$ and $(y - μ_{Y})$ will tend to be opposite, yielding a negative product.
- Thus for a strong negative relationship, $Cov (X, Y)$ should be quite negative.
If $X$ and $Y$ are not strongly related, positive and negative products will tend to cancel one another, yielding a covariance near 0.

Figure 5.4 illustrates the different possibilities. The covariance depends on both the set of possible pairs and the probabilities. In Figure 5.4, the probabilities could be changed without altering the set of possible pairs, and this could drastically change the value of $Cov (X, Y)$ .

Figure 5.4 $p (x, y) = 1/ 10$ for each of ten pairs corresponding to indicated points: (a) positive covariance; (b) negative covariance; (c) covariance near zero 0192609f-6f5c-74c9-8588-c1ef28b2184d_17_639_187_1109_348_0.jpg

EXAMPLE 5.15

The following shortcut formula for $Cov (X, Y)$ simplifies the computations.

Proposition

$Cov (X, Y) = E (X Y) - μ_{X} \cdot μ_{Y}$

According to this formula, no intermediate subtractions are necessary; only at the end of the computation is $μ_{X} \cdot μ_{Y}$ subtracted from $E (X Y)$ . The proof involves expanding $(X - μ_{X}) (Y - μ_{Y})$ and then carrying the summation or integration through to each individual term.

EXAMPLE 5.16 (Example 5.5 were continued)

It might appear that the relationship in the insurance example is quite strong since $Cov (X, Y) = 136, 875$ , whereas $Cov (X, Y) = - 2/ 75$ in the nut example would seem to imply quite a weak relationship. Unfortunately, the covariance has a serious defect that makes it impossible to interpret a computed value. In the insurance example, suppose we had expressed the deductible amount in cents rather than in dollars. Then $100 X$ would replace $X, 100 Y$ would replace $Y$ , and the resulting covariance would be $Cov (100 X, 100 Y) = (100) (100) Cov (X, Y) = 1, 368, 750, 000 .$ If, on the other hand, the deductible amount had been expressed in hundreds of dollars, the computed covariance would have been (.01)(.01)(136,875) $= 13.6875$ . The defect of covariance is that its computed value depends critically on the units of measurement. Ideally, the choice of units should have no effect on a measure of strength of relationship. This is achieved by scaling the covariance.

Youliang Zhong

Backlinks

Graph View

5.2.1 Covariance