The probability mass function (pmf) of a single discrete rv $X$ specifies how much probability mass is placed on each possible $X$ value. The joint pmf of two discrete rv’s $X$ and $Y$ describes how much probability mass is placed on each possible pair of values $(x, y)$.

joint probability mass function

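Let $X$ and $Y$ be two discrete rv’s defined on the sample space $\mathcal{S}$ of an experiment. The joint probability mass function $p(x, y)$ is defined for each pair of numbers $(x, y)$ by

$$p(x, y) = P(X = x \text{ and } Y = y)$$

It must be the case that $p(x, y) \ge 0$ and $\sum_{x} \sum_{y} p(x, y) = 1$.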

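Now let $A$ be any particular set consisting of pairs of $(x, y)$ values

  • e.g., $A = \{(x, y) : x + y = 5\}$ or $A = \{(x, y) : \max(x, y) \le 3\}$.

Then the probability $P[(X, Y) \in A]$ that the random pair $(X, Y)$ lies in the set $A$ is obtained by summing the joint pmf over pairs in $A$:

$$P[(X, Y) \in A] = \sum_{(x, y) \in A} p(x, y)$$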
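The two conditions on a joint pmf and the summation over a set of pairs can be sketched in Python. The table `joint_pmf` below is hypothetical (its values are made up for illustration and do not come from the text), and the helper names `is_valid_joint_pmf` and `event_probability` are likewise assumptions. Exact fractions are used so that the sum-to-one check is not affected by floating-point rounding.

```python
from fractions import Fraction

# Hypothetical joint pmf p(x, y), stored as {(x, y): probability}.
# These values are invented for illustration only.
joint_pmf = {
    (0, 0): Fraction(1, 10), (0, 1): Fraction(1, 10),
    (1, 0): Fraction(2, 10), (1, 1): Fraction(3, 10),
    (2, 0): Fraction(1, 10), (2, 1): Fraction(2, 10),
}

def is_valid_joint_pmf(pmf):
    """Check that every mass is nonnegative and that the masses sum to 1."""
    return all(p >= 0 for p in pmf.values()) and sum(pmf.values()) == 1

def event_probability(pmf, in_event):
    """Return P[(X, Y) in A], where in_event(x, y) is True exactly on A."""
    return sum(p for (x, y), p in pmf.items() if in_event(x, y))

assert is_valid_joint_pmf(joint_pmf)

# A = {(x, y): x + y = 2} contains the pairs (1, 1) and (2, 0).
prob_sum_is_two = event_probability(joint_pmf, lambda x, y: x + y == 2)
print(prob_sum_is_two)  # 2/5
```

Describing the set by a predicate rather than by listing its pairs mirrors the set-builder notation: only pairs with positive mass need to be stored, and any event is handled by the same function.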

EXAMPLE 5.1

Once the joint pmf of the two variables $X$ and $Y$ is available, it is in principle straightforward to obtain the distribution of just one of these variables.

Example

Let $X$ and $Y$ be the number of statistics and mathematics courses, respectively, currently being taken by a randomly selected statistics major. Suppose that we want the distribution of $X$, and that when $X = 2$, the only possible values of $Y$ are 0, 1, and 2. Then

$$p_X(2) = P(X = 2) = P[(X, Y) = (2, 0) \text{ or } (2, 1) \text{ or } (2, 2)] = p(2, 0) + p(2, 1) + p(2, 2)$$

That is, the joint pmf is summed over all pairs of the form $(2, y)$. More generally, for any possible value $x$ of $X$, the probability $p_X(x)$ results from holding $x$ fixed and summing the joint pmf $p(x, y)$ over all $y$ for which the pair $(x, y)$ has positive probability mass. The same strategy applies to obtaining the distribution of $Y$ by itself.

marginal probability mass function

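The marginal probability mass function of $X$, denoted by $p_X(x)$, is given by

$$p_X(x) = \sum_{y:\, p(x, y) > 0} p(x, y) \quad \text{for each possible value } x$$

Similarly, the marginal probability mass function of $Y$ is

$$p_Y(y) = \sum_{x:\, p(x, y) > 0} p(x, y) \quad \text{for each possible value } y$$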

The use of the word marginal here is a consequence of the fact that if the joint pmf is displayed in a rectangular table as in Example 5.1, then the row totals give the marginal pmf of $X$ and the column totals give the marginal pmf of $Y$. Once these marginal pmf’s are available, the probability of any event involving only $X$ or only $Y$ can be calculated.
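The row-total and column-total computation can be sketched in Python as follows. The joint table here is hypothetical (it is not the table of Example 5.1), and the function name `marginals` is an assumption made for illustration. Each first coordinate plays the role of a row and each second coordinate the role of a column.

```python
from collections import defaultdict
from fractions import Fraction

# Hypothetical joint pmf p(x, y), stored as {(x, y): probability}.
# These values are invented for illustration only.
joint_pmf = {
    (0, 0): Fraction(1, 10), (0, 1): Fraction(1, 10),
    (1, 0): Fraction(2, 10), (1, 1): Fraction(3, 10),
    (2, 0): Fraction(1, 10), (2, 1): Fraction(2, 10),
}

def marginals(pmf):
    """Return (p_X, p_Y): p_X sums over y for fixed x, p_Y sums over x for fixed y."""
    p_x = defaultdict(Fraction)  # missing keys start at Fraction(0)
    p_y = defaultdict(Fraction)
    for (x, y), p in pmf.items():
        p_x[x] += p  # accumulate the row total for this x
        p_y[y] += p  # accumulate the column total for this y
    return dict(p_x), dict(p_y)

p_x, p_y = marginals(joint_pmf)

# An event involving only X, computed from the marginal pmf alone.
prob_x_at_least_one = sum(p for x, p in p_x.items() if x >= 1)
print(prob_x_at_least_one)  # 4/5
```

Because only pairs with positive mass are stored in the dictionary, the loop sums over exactly the pairs named in the definition of the marginal pmf.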

EXAMPLE 5.2 (Example 5.1 continued)