Example discretizing depth of lake

probability distribution

Let be a continuous rv. Then a probability distribution or probability density function (pdf) of is a function such that for any two numbers and with ,

That is, the probability that takes on a value in the interval is the area above this interval and under the graph of the density function, as illustrated in Figure 4.2. The graph of is often referred to as the density curve.

Figure 4.2 the area under the density curve between and 01925166-48c0-7eca-9860-67f13d0848b1_2_885_1462_623_254_0.jpg

For to be a legitimate pdf, it must satisfy the following two conditions:

  1. for all
  2. \begin{align} &{\int}_{-\infty }^{\infty }f\left( x\right) {dx} \\ &= \text{area under the entire graph of } f\left( x\right) \\ &= 1 \end{align}

EX 4.4 probability of angle

Because whenever in EX 4.4 probability of angle, depends only on the width of the interval, is said to have a uniform distribution.

uniform distribution

A continuous rv is said to have a uniform distribution on the interval if the pdf of is

The graph of any uniform pdf looks like the graph in Figure 4.3 except that the interval of positive density is rather than .

  • In the discrete case, a probability mass function (pmf) tells us

how little “blobs” of probability mass of various magnitudes are distributed along the measurement axis.

  • In the continuous case, probability density is “smeared” in a continuous fashion along the interval of possible values. When density is smeared uniformly over the interval, a uniform pdf, as in Figure 4.3, results.

Probability of single point:

  • When is a discrete random variable, each possible value is assigned positive probability.
  • This is not true of a continuous random variable (that is, the second condition of the definition is satisfied) because the area under a density curve that lies above any single value is zero:

The fact that when is continuous has an important practical consequence:

The probability that lies in some interval between and does not depend on whether the lower limit or the upper limit is included in the probability calculation:

If is discrete and both and are possible values (e.g., is binomial with and ), then all four of the probabilities in (4.1) are different.

physical analog

The zero probability condition has a physical analog. Consider a solid circular rod with cross-sectional area . Place the rod alongside a measurement axis and suppose that the density of the rod at any point is given by the value of a density function. Then

  • if the rod is sliced at points and and this segment is removed, the amount of mass removed is ;
  • if the rod is sliced just at the point , no mass is removed. Mass is assigned to interval segments of the rod but not to individual points.

EX 4.5 time headway

Unlike discrete distributions such as the binomial, hypergeometric, and negative binomial, the distribution of any given continuous rv cannot usually be derived using simple probabilistic arguments. Instead, one must make a judicious choice of pdf based on prior knowledge and available data. Fortunately, there are some general families of pdf’s that have been found to be sensible candidates in a wide variety of experimental situations; several of these are discussed later in the chapter.

Just as in the discrete case, it is often helpful to think of the population of interest as consisting of values rather than individuals or objects. The pdf is then a model for the distribution of values in this numerical population, and from this model various population characteristics (such as the mean) can be calculated.