Let and be discrete rv’s with joint pmf
The points that receive positive probability mass are identified on the coordinate system in Figure 5.5.
Figure 5.5 The population of pairs for Example 5.18
It is evident from the figure that the value of is completely determined by the value of and vice versa, so the two variables are completely dependent. However, by symmetry and The covariance is then and thus Although there is perfect dependence, there is also complete absence of any linear relationship! A value of near 1 does not necessarily imply that increasing the value of causes to increase. It implies only that large values are associated with large values.
For example, in the population of children,
- vocabulary size and number of cavities are quite positively correlated,
- but it is certainly not true that cavities cause vocabulary to grow.
- Instead, the values of both these variables tend to increase as the value of age, a third variable, increases.
- For children of a fixed age, there is probably a low correlation between number of cavities and vocabulary size.
- In summary, association (a high correlation) is not the same as causation.