7 Statistical Intervals Based on a Single Sample

INTRODUCTION

A point estimate, because it is a single number, by itself provides no information about the precision and reliability of estimation. Consider, for example, using the statistic $\overset{ˉ}{X}$ to calculate a point estimate for the true average breaking strength (g) of paper towels of a certain brand, and suppose that $\overset{x}{ˉ} = 9322.7$ . Because of sampling variability, it is virtually never the case that $\overset{x}{ˉ} = μ$ . The point estimate says nothing about how close it might be to $μ$ . An alternative to reporting a single sensible value for the parameter being estimated is to calculate and report an entire interval of plausible values—an interval estimate or confidence interval (CI). A confidence interval is always calculated by first selecting a confidence level, which is a measure of the degree of reliability of the interval. A confidence interval with a $95 %$ confidence level for the true average breaking strength might have a lower limit of 9162.5 and an upper limit of 9482.9. Then at the 95% confidence level, any value of $μ$ between 9162.5 and 9482.9 is plausible. A confidence level of $95 %$ implies that $95 %$ of all samples would give an interval that includes $μ$ , or whatever other parameter is being estimated, and only $5%$ of all samples would yield an erroneous interval. The most frequently used confidence levels are $95 %, 99 %$ , and $90 %$ . The higher the confidence level, the more strongly we believe that the value of the parameter being estimated lies within the interval (an interpretation of any particular confidence level will be given shortly).

Information about the precision of an interval estimate is conveyed by the width of the interval. If the confidence level is high and the resulting interval is quite narrow, our knowledge of the value of the parameter is reasonably precise. A very wide confidence interval, however, gives the message that there is a great deal of uncertainty concerning the value of what we are estimating. Figure 7.1 shows $95 %$ confidence intervals for true average breaking strengths of two different brands of paper towels. One of these intervals suggests precise knowledge about $μ$ , whereas the other suggests a very wide range of plausible values.

01927a02-a1da-7086-ba87-2208e017bc0f_1_851_489_698_116_0.jpg

Figure 7.1 CIs indicating precise (brand 1) and imprecise (brand 2) information about $μ$

7.1 Basic Properties of Confidence Intervals

The basic concepts and properties of confidence intervals (CIs) are most easily introduced by first focusing on a simple, albeit somewhat unrealistic, problem situation. Suppose that the parameter of interest is a population mean $μ$ and that

The population distribution is normal
The value of the population standard deviation $σ$ is known

Normality of the population distribution is often a reasonable assumption. However, if the value of $μ$ is unknown, it is typically implausible that the value of $σ$ would be available (knowledge of a population’s center typically precedes information concerning spread). We’ll develop methods based on less restrictive assumptions in Sections 7.2 and 7.3.

EXAMPLE 7.1 Industrial engineers who specialize in ergonomics are concerned with designing workspace and worker-operated devices so as to achieve high productivity and comfort. The article “Studies on Ergonomically Designed Alphanumeric Keyboards” (Human Factors, 1985: 175-187) reports on a study of preferred height for an experimental keyboard with large forearm-wrist support. A sample of $n = 31$ trained typists was selected, and the preferred keyboard height was determined for each typist. The resulting sample average preferred height was $\overset{x}{ˉ} = 80.0 cm$ . Assuming that the preferred height is normally distributed with $σ = 2.0 cm$ (a value suggested by data in the article), obtain a confidence interval (interval of plausible values) for $μ$ , the true average preferred height for the population of all experienced typists.

The actual sample observations $x_{1}, x_{2}, \dots, x_{n}$ are assumed to be the result of a random sample $X_{1}, \dots, X_{n}$ from a normal distribution with mean value $μ$ and standard deviation $σ$ . The results described in Chapter 5 then imply that, irrespective of the sample size $n$ , the sample mean $\overset{ˉ}{X}$ is normally distributed with expected value $μ$ and standard deviation $σ / n$ . Standardizing $\overset{ˉ}{X}$ by first subtracting its expected value and then dividing by its standard deviation yields the standard normal variable

Z = \frac{X ˉ - μ}{σ / n} (7.1)

Because the area under the standard normal curve between -1.96 and 1.96 is .95 ,

P (- 1.96 < \frac{X ˉ - μ}{σ / n} < 1.96) = .95 (7.2)

Now let’s manipulate the inequalities inside the parentheses in (7.2) so that they appear in the equivalent form $l < μ < u$ , where the endpoints $l$ and $u$ involve $\overset{ˉ}{X}$ and $σ / n$ . This is achieved through the following sequence of operations, each yielding inequalities equivalent to the original ones.

Multiply through by $σ / n$ :

- 1.96 \cdot \frac{σ}{n} < \overset{ˉ}{X} - μ < 1.96 \cdot \frac{σ}{n}

Subtract $\overset{ˉ}{X}$ from each term:

- \overset{ˉ}{X} - 1.96 \cdot \frac{σ}{n} < - μ < - \overset{ˉ}{X} + 1.96 \cdot \frac{σ}{n}

Multiply through by -1 to eliminate the minus sign in front of $μ$ (which reverses the direction of each inequality):

\overset{ˉ}{X} + 1.96 \cdot \frac{σ}{n} > μ > \overset{ˉ}{X} - 1.96 \cdot \frac{σ}{n}

that is,

\overset{ˉ}{X} - 1.96 \cdot \frac{σ}{n} < μ < \overset{ˉ}{X} + 1.96 \cdot \frac{σ}{n}

The equivalence of each set of inequalities to the original set implies that

P (\overset{ˉ}{X} - 1.96 \frac{σ}{n} < μ < \overset{ˉ}{X} + 1.96 \frac{σ}{n}) = .95 (7.3)

The event inside the parentheses in (7.3) has a somewhat unfamiliar appearance; previously, the random quantity has appeared in the middle with constants on both ends, as in $a \leq Y \leq b$ . In (7.3) the random quantity appears on the two ends, whereas the unknown constant $μ$ appears in the middle. To interpret (7.3), think of a random interval having left endpoint $\overset{ˉ}{X} - 1.96 \cdot σ / n$ and right endpoint $\overset{ˉ}{X} + 1.96 \cdot σ / n$ . In interval notation, this becomes

(\overset{ˉ}{X} - 1.96 \cdot \frac{σ}{n}, \overset{ˉ}{X} + 1.96 \cdot \frac{σ}{n}) (7.4)

The interval (7.4) is random because the two endpoints of the interval involve a random variable. It is centered at the sample mean $\overset{ˉ}{X}$ and extends $1.96 σ / n$ to each side of $\overset{ˉ}{X}$ . Thus the interval’s width is $2 \cdot (1.96) \cdot σ / n$ , a fixed number; only the location of the interval (its midpoint $\overset{ˉ}{X}$ ) is random (Figure 7.2). Now (7.3) can be paraphrased as “the probability is .95 that the random interval (7.4) includes or covers the true value of $μ$ .” Before any data is gathered, it is quite likely that $μ$ will lie inside the interval (7.4).

01927a02-a1da-7086-ba87-2208e017bc0f_2_855_2018_502_150_0.jpg

Figure 7.2 The random interval (7.4) centered at $\overset{ˉ}{X}$

DEFINITION

If, after observing $X_{1} = x_{1}, X_{2} = x_{2}, \dots, X_{n} = x_{n}$ , we compute the observed sample mean $\overset{x}{ˉ}$ and then substitute $\overset{x}{ˉ}$ into (7.4) in place of $\overset{ˉ}{X}$ , the resulting fixed interval is called a $95 %$ confidence interval for $μ$ . This CI can be expressed either as

(\overset{x}{ˉ} - 1.96 \cdot \frac{σ}{n}, \overset{x}{ˉ} + 1.96 \cdot \frac{σ}{n}) is a 95 % CI for μ

or as

\overset{x}{ˉ} - 1.96 \cdot \frac{σ}{n} < μ < \overset{x}{ˉ} + 1.96 \cdot \frac{σ}{n} with 95 % confidence

A concise expression for the interval is $\overset{x}{ˉ} \pm 1.96 \cdot σ / n$ , where - gives the left endpoint (lower limit) and + gives the right endpoint (upper limit).

EXAMPLE 7.2 (Example 7.1 continued)

2 The quantities needed for computation of the $95 % CI$ for true average preferred height are $σ = 2.0, n = 31$ , and $\overset{x}{ˉ} = 80.0$ . The resulting interval is

\overset{x}{ˉ} \pm 1.96 \cdot \frac{σ}{n} = 80.0 \pm (1.96) \frac{2.0}{31} = 80.0 \pm .7 = (79.3, 80.7)

That is, we can be highly confident, at the $95 %$ confidence level, that $79.3 < μ < 80.7$ . This interval is relatively narrow, indicating that $μ$ has been rather precisely estimated.

Interpreting a Confidence Level

The confidence level $95 %$ for the interval just defined was inherited from the probability .95 for the random interval (7.4). Intervals having other levels of confidence will be introduced shortly. For now, though, consider how $95 %$ confidence can be interpreted.

We started with an event whose probability was .95 - that the random interval (7.4) would capture the true value of $μ$ -and then used the data in Example 7.1 to compute the CI $(79.3, 80.7)$ . It is therefore tempting to conclude that $μ$ is within this fixed interval with probability .95 . But by substituting $\overset{x}{ˉ} = 80.0$ for $\overset{ˉ}{X}$ , all randomness disappears; the interval $(79.3, 80.7)$ is not a random interval, and $μ$ is a constant (unfortunately unknown to us). Thus it is incorrect to write the statement $P (μ lies in (79.3, 80.7)) = .95$ .

A correct interpretation of “95% confidence” relies on the long-run relative frequency interpretation of probability: To say that an event $A$ has probability .95 is to say that if the experiment on which $A$ is defined is performed over and over again, in the long run $A$ will occur $95 %$ of the time. Suppose we obtain another sample of typists’ preferred heights and compute another $95 %$ interval. Now consider repeating this for a third sample, a fourth sample, a fifth sample, and so on. Let $A$ be the event that $\overset{ˉ}{X} - 1.96 \cdot σ / n < μ < \overset{ˉ}{X} + 1.96 \cdot σ / n$ . Since $P (A) = .95$ , in the long run $95 %$ of our computed CIs will contain $μ$ . This is illustrated in Figure 7.3, where the vertical line cuts the measurement axis at the true (but unknown) value of $μ$ . Notice that 7 of the 100 intervals shown fail to contain $μ$ . In the long run, only $5%$ of the intervals so constructed would fail to contain $μ$ .

According to this interpretation, the confidence level $95 %$ is not so much a statement about any particular interval such as $(79.3, 80.7)$ . Instead it pertains to what would happen if a very large number of like intervals were to be constructed using the same CI formula. Although this may seem unsatisfactory, the root of the difficulty lies with our interpretation of probability-it applies to a long sequence of replications of an experiment rather than just a single replication. There is another approach to the construction and interpretation of CIs that uses the notion of subjective probability and Bayes’ theorem, but the technical details are beyond the scope of this text; the book by DeGroot, et al. (see the Chapter 6 bibliography) is a good source. The interval presented here (as well as each interval presented subsequently) is called a “classical” CI because its interpretation rests on the classical notion of probability.

01927a02-a1da-7086-ba87-2208e017bc0f_4_736_179_685_1073_0.jpg

Figure 7.3 One hundred $95 %$ CIs (asterisks identify intervals that do not include $μ$ )

Other Levels of Confidence

The confidence level of $95 %$ was inherited from the probability .95 for the initial inequalities in (7.2). If a confidence level of $99 %$ is desired, the initial probability of $.95$ must be replaced by .99, which necessitates changing the $z$ critical value from 1.96 to 2.58 . A $99 %$ CI then results from using 2.58 in place of 1.96 in the formula for the $95 %$ CI.

In fact, any desired level of confidence can be achieved by replacing 1.96 or 2.58 with the appropriate standard normal critical value. Recall from Chapter 4 the notation for a $z$ critical value: $z_{α}$ is the number on the horizontal $z$ scale that captures upper tail area $α$ . As Figure 7.4 shows, a probability (i.e., central $z$ curve area) of $1 - α$ is achieved by using $z_{α /2}$ in place of 1.96 .

01927a02-a1da-7086-ba87-2208e017bc0f_5_933_185_530_263_0.jpg

Figure 7.4 $P (- z_{α /2} < Z < z_{α /2}) = 1 - α$

DEFINITION

A $100 (1 - α) %$ confidence interval for the mean $μ$ of a normal population when the value of $σ$ is known is given by

(\overset{x}{ˉ} - z_{α /2} \cdot \frac{σ}{n}, \overset{x}{ˉ} + z_{α /2} \cdot \frac{σ}{n}) (7.5)

or, equivalently, by $\overset{x}{ˉ} \pm z_{α /2} \cdot σ / n$ .

The formula (7.5) for the CI can also be expressed in words as

point estimate of $μ \pm$ ( $z$ critical value) (standard error of the mean).

EXAMPLE 7.3 The production process for engine control housing units of a particular type has recently been modified. Prior to this modification, historical data had suggested that the distribution of hole diameters for bushings on the housings was normal with a standard deviation of $.100 mm$ . It is believed that the modification has not affected the shape of the distribution or the standard deviation, but that the value of the mean diameter may have changed. A sample of 40 housing units is selected and hole diameter is determined for each one, resulting in a sample mean diameter of $5.426 mm$ . Let’s calculate a confidence interval for true average hole diameter using a confidence level of $90 %$ . This requires that $100 (1 - α) = 90$ , from which $α = .10$ and $z_{α /2} = z_{.05} = 1.645$ (corresponding to a cumulative $z$ -curve area of .9500). The desired interval is then

5.426 \pm (1.645) \frac{.100}{40} = 5.426 \pm .026 = (5.400, 5.452)

With a reasonably high degree of confidence, we can say that $5.400 < μ < 5.452$ . This interval is rather narrow because of the small amount of variability in hole diameter $(σ = .100)$ .

Confidence Level, Precision, and Sample Size

Why settle for a confidence level of $95 %$ when a level of $99 %$ is achievable? Because the price paid for the higher confidence level is a wider interval. Since the $95 %$ interval extends $1.96 \cdot σ / n$ to each side of $\overset{x}{ˉ}$ , the width of the interval is $2 (1.96) \cdot σ / n = 3.92 \cdot σ / n$ . Similarly, the width of the $99 %$ interval is $2 (2.58) \cdot σ / n = 5.16 \cdot σ / n$ . That is, we have more confidence in the $99 %$ interval precisely because it is wider. The higher the desired degree of confidence, the wider the resulting interval will be.

If we think of the width of the interval as specifying its precision or accuracy, then the confidence level (or reliability) of the interval is inversely related to its precision. A highly reliable interval estimate may be imprecise in that the endpoints of the interval may be far apart, whereas a precise interval may entail relatively low reliability. Thus it cannot be said unequivocally that a $99 %$ interval is to be preferred to a $95 %$ interval; the gain in reliability entails a loss in precision.

An appealing strategy is to specify both the desired confidence level and interval width and then determine the necessary sample size.

EXAMPLE 7.4 Extensive monitoring of a computer time-sharing system has suggested that response time to a particular editing command is normally distributed with standard deviation 25 millisec. A new operating system has been installed, and we wish to estimate the true average response time $μ$ for the new environment. Assuming that response times are still normally distributed with $σ = 25$ , what sample size is necessary to ensure that the resulting $95 % CI$ has a width of (at most) 10 ? The sample size $n$ must satisfy

10 = 2 \cdot (1.96) (25 / n)

Rearranging this equation gives

n = 2 \cdot (1.96) (25) / 10 = 9.80

n = (9.80)^{2} = 96.04

Since $n$ must be an integer, a sample size of 97 is required.

A general formula for the sample size $n$ necessary to ensure an interval width $w$ is obtained from equating $w$ to $2 \cdot z_{α /2} \cdot σ / n$ and solving for $n$ .

The sample size necessary for the CI (7.5) to have a width $w$ is

n = (2 z_{α /2} \cdot \frac{σ}{w})^{2}

The smaller the desired width $w$ , the larger $n$ must be. In addition, $n$ is an increasing function of $σ$ (more population variability necessitates a larger sample size) and of the confidence level $100 (1 - α) %$ (as $α$ decreases, $z_{α /2}$ increases).

The half-width $1.96 σ / n$ of the $95 % CI$ is sometimes called the bound on the error of estimation associated with a 95% confidence level. That is, with 95% confidence, the point estimate $\overset{x}{ˉ}$ will be no farther than this from $μ$ . Before obtaining data, an investigator may wish to determine a sample size for which a particular value of the bound is achieved. For example, with $μ$ representing the average fuel efficiency (mpg) for all cars of a certain type, the objective of an investigation may be to estimate $μ$ to within $1 mpg$ with $95 %$ confidence. More generally, if we wish to estimate $μ$ to within an amount $B$ (the specified bound on the error of estimation) with $100 (1 - α) %$ confidence, the necessary sample size results from replacing $2/ w$ by $1/ B$ in the formula in the preceding box.

Deriving a Confidence Interval

Let $X_{1}, X_{2}, \dots, X_{n}$ denote the sample on which the CI for a parameter $θ$ is to be based.

Suppose a random variable satisfying the following two properties can be found:

The variable is a function of both $X_{1}, \dots, X_{n}$ and $θ$ .
The probability distribution of the variable does not depend on $θ$ or on any other unknown parameters.

Let $h (X_{1}, X_{2}, \dots, X_{n}; θ)$ denote this random variable. For example, if the population distribution is normal with known $σ$ and $θ = μ$ , the variable $h (X_{1}, \dots, X_{n}; μ) = (\overset{ˉ}{X} - μ) / (σ / n)$ satisfies both properties; it clearly depends functionally on $μ$ , yet has the standard normal probability distribution irrespective of the value of $μ$ . In general, the form of the $h$ function is usually suggested by examining the distribution of an appropriate estimator $θ$ .

For any $α$ between 0 and 1, constants $a$ and $b$ can be found to satisfy

P (a < h (X_{1}, \dots, X_{n}; θ) < b) = 1 - α (7.6)

Because of the second property, $a$ and $b$ do not depend on $θ$ . In the normal example, $a = - z_{α /2}$ and $b = z_{α /2}$ . Now suppose that the inequalities in (7.6) can be manipulated to isolate $θ$ , giving the equivalent probability statement

P (l (X_{1}, X_{2}, \dots, X_{n}) < θ < u (X_{1}, X_{2}, \dots, X_{n})) = 1 - α

Then $l (x_{1}, x_{2}, \dots, x_{n})$ and $u (x_{1}, \dots, x_{n})$ are the lower and upper confidence limits, respectively, for a $100 (1 - α) % CI$ . In the normal example, we saw that $l (X_{1}, \dots, X_{n}) = \overset{ˉ}{X} - z_{α /2} \cdot σ / n$ and $u (X_{1}, \dots, X_{n}) = \overset{ˉ}{X} + z_{α /2} \cdot σ / n$ .

EXAMPLE 7.5 A theoretical model suggests that the time to breakdown of an insulating fluid between electrodes at a particular voltage has an exponential distribution with parameter $λ$ (see Section 4.4). A random sample of $n = 10$ breakdown times yields the following sample data (in min): $x_{1} = 41.53, x_{2} = 18.73, x_{3} = 2.99, x_{4} = 30.34$ , $x_{5} = 12.33, x_{6} = 117.52, x_{7} = 73.02, x_{8} = 223.63, x_{9} = 4.00, x_{10} = 26.78$ . A $95 %$ $CI$ for $λ$ and for the true average breakdown time are desired.

Let $h (X_{1}, X_{2}, \dots, X_{n}; λ) = 2 λ \sum X_{i}$ . It can be shown that this random variable has a probability distribution called a chi-squared distribution with $2 n$ degrees of freedom (df) $(ν = 2 n$ , where $ν$ is the parameter of a chi-squared distribution as mentioned in Section 4.4). Appendix Table A. 7 pictures a typical chi-squared density curve and tabulates critical values that capture specified tail areas. The relevant number of df here is $2 (10) = 20$ . The $ν = 20$ row of the table shows that 34.170 captures upper-tail area .025 and 9.591 captures lower-tail area .025 (upper-tail area .975). Thus for $n = 10$ ,

P (9.591 < 2 λ \sum X_{i} < 34.170) = .95

Division by $2 \sum X_{i}$ isolates $λ$ , yielding

P (9.591 / (2 \sum X_{i}) < λ < (34.170 / (2 \sum X_{i})) = .95

The lower limit of the $95 %$ CI for $λ$ is $9.591 / (2 \sum x_{i})$ , and the upper limit is $34.170 / (2 \sum x_{i})$ . For the given data, $\sum x_{i} = 550.87$ , giving the interval $(.00871$ , .03101).

The expected value of an exponential rv is $μ = 1/ λ$ . Since

P (2 \sum X_{i} / 34.170 < 1/ λ < 2 \sum X_{i} / 9.591) = .95

the $95 % CI$ for true average breakdown time is $(2 \sum x_{i} / 34.170, 2 \sum x_{i} / 9.591) =$ (32.24, 114.87). This interval is obviously quite wide, reflecting substantial variability in breakdown times and a small sample size.

In general, the upper and lower confidence limits result from replacing each $<$ in (7.6) by $=$ and solving for $θ$ . In the insulating fluid example just considered, $2 λ \sum x_{i} = 34.170$ gives $λ = 34.170 / (2 \sum x_{i})$ as the upper confidence limit, and the lower limit is obtained from the other equation. Notice that the two interval limits are not equidistant from the point estimate, since the interval is not of the form $θ \pm c$ .

Bootstrap Confidence Intervals

The bootstrap technique was introduced in Chapter 6 as a way of estimating $σ_{θ}$ . It can also be applied to obtain a CI for $θ$ . Consider again estimating the mean $μ$ of a normal distribution when $σ$ is known. Let’s replace $μ$ by $θ$ and use $\overset{ˉ}{θ} = \overset{ˉ}{X}$ as the point estimator. Notice that $1.96 σ n$ is the 97.5th percentile of the distribution of $θ - θ$ [that is, $P (\overset{ˉ}{X} - μ < 1.96 σ / n) = P (Z < 1.96) = .9750$ ]. Similarly, $- 1.96 σ / n$ is the 2.5th percentile, so

.95 = P (2.5 th percentile < θ - θ < 97.5 th percentile)

= P (θ - 2.5 th percentile > θ > θ - 97.5 th percentile)

That is, with

l = θ - 97.5 th percentile of θ - θ

u = θ - 2.5 th percentile of θ - θ (7.7)

the CI for $θ$ is $(l, u)$ . In many cases, the percentiles in (7.7) cannot be calculated, but they can be estimated from bootstrap samples. Suppose we obtain $B = 1000$ bootstrap samples and calculate $θ_{1}^{*}, \dots, θ_{1000}^{*}$ , and $\overset{ˉ}{θ}^{*}$ followed by the 1000 differences $θ_{1}^{*} - \overset{ˉ}{θ}^{*}, \dots, θ_{1000}^{*} - \overset{ˉ}{θ}^{*}$ . The 25th largest and 25th smallest of these differences are estimates of the unknown percentiles in (7.7). Consult the Devore and Berk or Efron books cited in Chapter 6 for more information.

EXERCISES Section 7.1 (1-11)

Consider a normal population distribution with the value of $σ$ known.

a. What is the confidence level for the interval $\overset{x}{ˉ} \pm$ $2.81 σ / n$ ?

b. What is the confidence level for the interval $\overset{x}{ˉ} \pm$ $1.44 σ / n$ ?

c. What value of $z_{α /2}$ in the CI formula (7.5) results in a confidence level of $99.7 %$ ?

d. Answer the question posed in part (c) for a confidence level of $75 %$ .

Each of the following is a confidence interval for $μ =$ true average (i.e., population mean) resonance frequency (Hz) for all tennis rackets of a certain type:

$(114.4, 115.6) (114.1, 115.9)$

a. What is the value of the sample mean resonance frequency?

b. Both intervals were calculated from the same sample data. The confidence level for one of these intervals is $90 %$ and for the other is $99 %$ . Which of the intervals has the $90 %$ confidence level, and why?

Suppose that a random sample of 50 bottles of a particular brand of cough syrup is selected and the alcohol content of each bottle is determined. Let $μ$ denote the average alcohol content for the population of all bottles of the brand under study. Suppose that the resulting $95 %$ confidence interval is $(7.8, 9.4)$ .

a. Would a $90 %$ confidence interval calculated from this same sample have been narrower or wider than the given interval? Explain your reasoning.

b. Consider the following statement: There is a $95 %$ chance that $μ$ is between 7.8 and 9.4. Is this statement correct? Why or why not?

c. Consider the following statement: We can be highly confident that $95 %$ of all bottles of this type of cough syrup have an alcohol content that is between 7.8 and 9.4. Is this statement correct? Why or why not?

d. Consider the following statement: If the process of selecting a sample of size 50 and then computing the corresponding $95 %$ interval is repeated 100 times,95 of the resulting intervals will include $μ$ . Is this statement correct? Why or why not?

A CI is desired for the true average stray-load loss $μ$ (watts) for a certain type of induction motor when the line current is held at 10 amps for a speed of $1500 rpm$ . Assume that stray-load loss is normally distributed with $σ = 3.0$ .

a. Compute a $95 % CI$ for $μ$ when $n = 25$ and $\overset{x}{ˉ} = 58.3$ .

b. Compute a $95 % CI$ for $μ$ when $n = 100$ and $\overset{x}{ˉ} =$ 58.3.

c. Compute a $99 %$ CI for $μ$ when $n = 100$ and $\overset{x}{ˉ} =$ 58.3.

d. Compute an $82 %$ CI for $μ$ when $n = 100$ and $\overset{x}{ˉ} =$ 58.3.

e. How large must $n$ be if the width of the $99 %$ interval for $μ$ is to be 1.0 ?

Assume that the helium porosity (in percentage) of coal samples taken from any particular seam is normally distributed with true standard deviation .75 .

a. Compute a $95 % CI$ for the true average porosity of a certain seam if the average porosity for 20 specimens from the seam was 4.85.

b. Compute a $98 % CI$ for true average porosity of another seam based on 16 specimens with a sample average porosity of 4.56 .

c. How large a sample size is necessary if the width of the $95 %$ interval is to be .40 ?

d. What sample size is necessary to estimate true average porosity to within $.2$ with $99 %$ confidence?

On the basis of extensive tests, the yield point of a particular type of mild steel-reinforcing bar is known to be normally distributed with $σ = 100$ . The composition of bars has been slightly modified, but the modification is not believed to have affected either the normality or the value of $σ$ .

a. Assuming this to be the case, if a sample of 25 modified bars resulted in a sample average yield point of $8439 lb$ , compute a $90 % CI$ for the true average yield point of the modified bar.

b. How would you modify the interval in part (a) to obtain a confidence level of $92 %$ ?

By how much must the sample size $n$ be increased if the width of the CI (7.5) is to be halved? If the sample size is increased by a factor of 25 , what effect will this have on the width of the interval? Justify your assertions.
Let $α_{1} > 0, α_{2} > 0$ , with $α_{1} + α_{2} = α$ . Then

P (- z_{α_{1}} < \frac{X ˉ - μ}{σ / n} < z_{α_{2}}) = 1 - α

a. Use this equation to derive a more general expression for a $100 (1 - α) %$ CI for $μ$ of which the interval (7.5) is a special case.

b. Let $α = .05$ and $α_{1} = α /4, α_{2} = 3 α /4$ . Does this result in a narrower or wider interval than the interval (7.5)?

a. Under the same conditions as those leading to the interval (7.5), $P [(\overset{ˉ}{X} - μ) / (σ / n) < 1.645] = .95$ . Use this to derive a one-sided interval for $μ$ that has infinite width and provides a lower confidence bound on $μ$ . What is this interval for the data in Exercise 5(a)?

b. Generalize the result of part (a) to obtain a lower bound with confidence level $100 (1 - α) %$ .

c. What is an analogous interval to that of part (b) that provides an upper bound on $μ$ ? Compute this $99 %$ interval for the data of Exercise 4(a).

A random sample of $n = 15$ heat pumps of a certain type yielded the following observations on lifetime (in years): $2.0 1.3 6.0 1.9 5.1 .4 1.0 5.3$ 15.7 .7 4.8 .9 12.2 5.3 .6

a. Assume that the lifetime distribution is exponential and use an argument parallel to that of Example 7.5 to obtain a $95 % CI$ for expected (true average) lifetime.

b. How should the interval of part (a) be altered to achieve a confidence level of $99 %$ ?

c. What is a $95 % CI$ for the standard deviation of the lifetime distribution? [Hint: What is the standard deviation of an exponential random variable?]

Consider the next $100095 %$ CIs for $μ$ that a statistical consultant will obtain for various clients. Suppose the data sets on which the intervals are based are selected independently of one another. How many of these 1000 intervals do you expect to capture the corresponding value of $μ$ ? What is the probability that between 940 and 960 of these intervals contain the corresponding value of $μ$ ? [Hint: Let $Y =$ the number among the 1000 intervals that contain $μ$ . What kind of random variable is $Y$ ?]

7.2 Large-Sample Confidence Intervals for a Population Mean and Proportion

The CI for $μ$ given in the previous section assumed that the population distribution is normal with the value of $σ$ known. We now present a large-sample CI whose validity does not require these assumptions. After showing how the argument leading to this interval generalizes to yield other large-sample intervals, we focus on an interval for a population proportion $p$ .

Copyright 2016 Cengage Learning. All Rights Reserved, May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Congage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it.

A Large-Sample Interval for $μ$

Let $X_{1}, X_{2}, \dots, X_{n}$ be a random sample from a population having a mean $μ$ and standard deviation $σ$ . Provided that $n$ is sufficiently large, the Central Limit Theorem (CLT) implies that $\overset{ˉ}{X}$ has approximately a normal distribution whatever the nature of the population distribution. It then follows that $Z = (\overset{ˉ}{X} - μ) / (σ / n)$ has approximately a standard normal distribution, so that

P (- z_{α /2} < \frac{X ˉ - μ}{σ / n} < z_{α /2}) \approx 1 - α

An argument parallel to that given in Section 7.1 yields $\overset{x}{ˉ} \pm z_{α /2} \cdot σ / n$ as a large-sample CI for $μ$ with a confidence level of approximately $100 (1 - α) %$ . That is, when $n$ is large, the CI for $μ$ given previously remains valid whatever the population distribution, provided that the qualifier “approximately” is inserted in front of the confidence level.

A practical difficulty with this development is that computation of the CI requires the value of $σ$ , which will rarely be known. Consider replacing the population standard deviation $σ$ in $Z$ by the sample standard deviation to obtain the standardized variable

\frac{X ˉ - μ}{S / n}

Previously, there was randomness only in the numerator of $Z$ by virtue of $\overset{ˉ}{X}$ . In the new standardized variable, both $\overset{ˉ}{X}$ and $S$ vary in value from one sample to another. So it might seem that the distribution of the new variable should be more spread out than the $z$ curve to reflect the extra variation in the denominator. This is indeed true when $n$ is small. However, for large $n$ the subsititution of $S$ for $σ$ adds little extra variability, so this variable also has approximately a standard normal distribution. Manipulation of the variable in a probability statement, as in the case of known $σ$ , gives a general large-sample CI for $μ$ .

PROPOSITION

If $n$ is sufficiently large, the standardized variable

Z = \frac{X ˉ - μ}{S / n}

has approximately a standard normal distribution. This implies that

\overset{x}{ˉ} \pm z_{α /2} \cdot \frac{s}{n} (7.8)

is a large-sample confidence interval for $μ$ with confidence level approximately $100 (1 - α) %$ . This formula is valid regardless of the shape of the population distribution.

In words, the CI (7.8) is

point estimate of $μ \pm$ ( $z$ critical value) (estimated standard error of the mean).

Generally speaking, $n > 40$ will be sufficient to justify the use of this interval. This is somewhat more conservative than the rule of thumb for the CLT because of the additional variability introduced by using $S$ in place of $σ$ .

AMPLE 7.6 Haven’t you always wanted to own a Porsche? The author thought maybe he could afford a Boxster, the cheapest model. So he went to www.cars.com on Nov. 18, 2009, and found a total of 1113 such cars listed. Asking prices ranged from $3499

to $\$ {130},{000} $(t h e l a tt er p r i ce w a so n eo f o n l y tw oe x cee d in g$ $ {70},{000}$ ). The prices depressed him, so he focused instead on odometer readings (miles). Here are reported readings for a sample of 50 of these Boxsters:

2948	2996	7197	8338	8500	8759	12710	12925
15767	20000	23247	24863	26000	26210	30552	30600
35700	36466	40316	40596	41021	41234	43000	44607
45000	45027	45442	46963	47978	49518	52000	53334
54208	56062	57000	57365	60020	60265	60803	62851
64404	72140	74594	79308	79500	80000	80000	84000
113000	118634

A boxplot of the data (Figure 7.5) shows that, except for the two outliers at the upper end, the distribution of values is reasonably symmetric (in fact, a normal probability plot exhibits a reasonably linear pattern, though the points corresponding to the two smallest and two largest observations are somewhat removed from a line fit through the remaining points).

01927a02-a1da-7086-ba87-2208e017bc0f_11_651_861_1098_291_0.jpg

Figure 7.5 A boxplot of the odometer reading data from Example 7.6

Summary quantities include $n = 50, \overset{x}{ˉ} = 45, 679.4, x = 45, 013.5, s = 26, 641.675$ , $f_{s} = 34, 265$ . The mean and median are reasonably close (if the two largest values were each reduced by 30,000 , the mean would fall to 44,479.4 , while the median would be unaffected). The boxplot and the magnitudes of $s$ and $f_{s}$ relative to the mean and median both indicate a substantial amount of variability. A confidence level of about $95 %$ requires $z_{.025} = 1.96$ , and the interval is

45, 679.4 \pm (1.96) (\frac{26 , 641.675}{50}) = 45, 679.4 \pm 7384.7

= (38, 294.7, 53, 064.1)

That is, $38, 294.7 < μ < 53, 064.1$ with $95 %$ confidence. This interval is rather wide because a sample size of 50 , even though large by our rule of thumb, is not large enough to overcome the substantial variability in the sample. We do not have a very precise estimate of the population mean odometer reading.

Is the interval we’ve calculated one of the $95 %$ that in the long run includes the parameter being estimated, or is it one of the “bad” $5%$ that does not do so? Without knowing the value of $μ$ , we cannot tell. Remember that the confidence level refers to the long run capture percentage when the formula is used repeatedly on various samples; it cannot be interpreted for a single sample and the resulting interval.

Unfortunately, the choice of sample size to yield a desired interval width is not as straightforward here as it was for the case of known $σ$ . This is because the width of (7.8) is $2 z_{α /2} s / n$ . Since the value of $s$ is not available before the data has been gathered, the width of the interval cannot be determined solely by the choice of $n$ . The only option for an investigator who wishes to specify a desired width is to make an educated guess as to what the value of $s$ might be. By being conservative and guessing a larger value of $s$ , an $n$ larger than necessary will be chosen. The investigator may be able to specify a reasonably accurate value of the population range (the difference between the largest and smallest values). Then if the population distribution is not too skewed, dividing the range by 4 gives a ballpark value of what $s$ might be.

EXAMPLE 7.7 The charge-to-tap time (min) for carbon steel in one type of open hearth furnace is to be determined for each heat in a sample of size $n$ . If the investigator believes that almost all times in the distribution are between 320 and 440 , what sample size would be appropriate for estimating the true average time to within $5 min$ . with a confidence level of $95 %$ ?

A reasonable value for $s$ is $(440 - 320) /4 = 30$ . Thus

n = [\frac{( 1.96 ) ( 30 )}{5}]^{2} = 138.3

Since the sample size must be an integer, $n = 139$ should be used. Note that estimating to within $5 min$ . with the specified confidence level is equivalent to a CI width of 10 min.

A General Large-Sample Confidence Interval

The large-sample intervals $\overset{x}{ˉ} \pm z_{α /2} \cdot σ / n$ and $\overset{x}{ˉ} \pm z_{α /2} \cdot s / n$ are special cases of a general large-sample CI for a parameter $θ$ . Suppose that $θ$ is an estimator satisfying the following properties: (1) It has approximately a normal distribution; (2) it is (at least approximately) unbiased; and (3) an expression for $σ_{θ}$ , the standard deviation (standard error) of $θ$ , is available. For example, in the case $θ = μ, μ = \overset{ˉ}{X}$ is an unbiased estimator whose distribution is approximately normal when $n$ is large and $σ_{μ} = σ_{\overset{ˉ}{X}} = σ / n$ . Standardizing $θ$ yields the rv $Z = (θ - θ) / σ_{θ}$ , which has approximately a standard normal distribution. This justifies the probability statement

P (- z_{α /2} < \frac{θ - θ}{σ _{θ}} < z_{α /2}) \approx 1 - α (7.9)

Assume first that $σ_{θ}$ does not involve any unknown parameters (e.g., known $σ$ in the case $θ = μ$ ). Then replacing each $<$ in (7.9) by $=$ results in $θ = θ \pm z_{α /2} \cdot σ_{θ}$ , so the lower and upper confidence limits are $θ - z_{α /2} \cdot σ_{θ}$ and $θ + z_{α /2} \cdot σ_{θ}$ , respectively. Now suppose that $σ_{θ}$ does not involve $θ$ but does involve at least one other unknown parameter. Let $s_{θ}$ be the estimate of $σ_{θ}$ obtained by using estimates in place of the unknown parameters (e.g., $s / n$ estimates $σ / n$ ). Under general conditions (essentially that $s_{θ}$ be close to $σ_{θ}$ for most samples), a valid CI is $θ \pm z_{α /2} \cdot s_{θ}$ . The large-sample interval $\overset{x}{ˉ} \pm z_{α /2} \cdot s / n$ is an example.

Finally, suppose that $σ_{θ}$ does involve the unknown $θ$ . For example, we shall see momentarily that this is the case when $θ = p$ , a population proportion. Then $(θ - θ) / σ_{θ} = z_{α /2}$ can be difficult to solve. An approximate solution can often be obtained by replacing $θ$ in $σ_{θ}$ by its estimate $θ$ . This results in an estimated standard deviation $s_{θ}$ , and the corresponding interval is again $θ \pm z_{α /2} \cdot s_{θ}$ .

In words, this CI is

point estimate of $θ \pm$ ( $z$ critical value)(estimated standard error of the estimator)

A Confidence Interval for a Population Proportion

Let $p$ denote the proportion of “successes” in a population, where success identifies an individual or object that has a specified property (e.g., individuals who graduated from college, computers that do not need warranty service, etc.). A random sample of $n$ individuals or objects is to be selected, and $X$ is the number of successes in the sample. Provided that $n$ is small compared to the population size, $X$ can be regarded as a binomial rv with $E (X) = n p$ and $σ_{X} = n p (1 - p)$ . Furthermore, if both $n p \geq 10$ and $n q \geq 10, (q = 1 - p), X$ has approximately a normal distribution.

The natural estimator of $p$ is $p = X / n$ , the sample fraction of successes. Since $p$ is just $X$ multiplied by the constant $1/ n, p$ also has approximately a normal distribution. As shown in Section 6.1, $E (p) = p$ (unbiasedness) and $σ_{p} = p (1 - p) / n$ . The standard deviation $σ_{p}$ involves the unknown parameter $p$ . Standardizing $p$ by subtracting $p$ and dividing by $σ_{p}$ then implies that

P (- z_{α /2} < \frac{p - p}{p ( 1 - p ) / n} < z_{α /2}) \approx 1 - α

Proceeding as suggested in the subsection “Deriving a Confidence Interval” (Section 7.1), the confidence limits result from replacing each $<$ by $=$ and solving the resulting equation for $p$ . But whereas the equations $(\overset{x}{ˉ} - μ) / (s / n) = \pm z_{α /2}$ employed in deriving the large-sample CI for $μ$ are linear in $μ$ , the equations here are quadratic $(p^{2}$ appears in the numerator when both sides of each equation are squared to eliminate the square root). The two roots are

p = \frac{p + z _{α /2}^{2} / 2 n}{1 + z _{α /2}^{2} / n} \pm z_{α /2} \frac{p ( 1 - p ) / n + z _{α /2}^{2} /4 n ^{2}}{1 + z _{α /2}^{2} / n}

= p \pm z_{α /2} \frac{p ( 1 - p ) / n + z _{α /2}^{2} /4 n ^{2}}{1 + z _{α /2}^{2} / n}

PROPOSITION

Let $p = [p + z_{α /2}^{2} / 2 n] / [1 + z_{α /2}^{2} / n]$ . Then a confidence interval for a population proportion $p$ with confidence level approximately $100 (1 - α)$ $%$ is

p \pm z_{α /2} \frac{p q / n + z _{α /2}^{2} /4 n ^{2}}{1 + z _{α /2}^{2} / n} (7.10)

where $q = 1 - p$ and, as before, the - in (7.10) corresponds to the lower confidence limit and the + to the upper confidence limit.

This is often referred to as the score $C I$ for $p$ .

If the sample size $n$ is very large, then $z^{2} / 2 n$ is generally quite negligible (small) compared to $p$ and $z^{2} / n$ is quite negligible compared to 1, from which $p \approx p$ . In this case $z^{2} /4 n^{2}$ is also negligible compared to $pq / n (n^{2}$ is a much larger divisor than is $n$ ). As a result, the dominant term in the $\pm$ expression is $z_{α /2} p q / n$ and the score interval is approximately

p \pm z_{α /2} p q / n (7.11)

This latter interval has the general form $θ \pm z_{α /2} σ_{θ}$ of a large-sample interval suggested in the last subsection. The approximate CI (7.11) is the one that for decades has appeared in introductory statistics textbooks. It clearly has a much simpler and more appealing form than the score CI. So why bother with the latter?

First of all, suppose we use $z_{.025} = 1.96$ in the traditional formula (7.11). Then our nominal confidence level (the one we think we’re buying by using that $z$ critical value) is approximately $95 %$ . So before a sample is selected, the probability that the random interval includes the actual value of $p$ (i.e., the coverage probability) should be about .95 . But as Figure 7.6 shows for the case $n = 100$ , the actual coverage probability for this interval can differ considerably from the nominal probability .95, particularly when $p$ is not close to . 5 (the graph of coverage probability versus $p$ is very jagged because the underlying binomial probability distribution is discrete rather than continuous). This is generally speaking a deficiency of the traditional interval-the actual confidence level can be quite different from the nominal level even for reasonably large sample sizes. Recent research has shown that the score interval rectifies this behavior-for virtually all sample sizes and values of $p$ , its actual confidence level will be quite close to the nominal level specified by the choice of $z_{α /2}$ . This is due largely to the fact that the score interval is shifted a bit toward .5 compared to the traditional interval. In particular, the midpoint $p$ of the score interval is always a bit closer to . 5 than is the midpoint $p$ of the traditional interval. This is especially important when $p$ is close to 0 or 1 .

01927a02-a1da-7086-ba87-2208e017bc0f_14_540_1047_1070_575_0.jpg

Figure 7.6 Actual coverage probability for the interval (7.11) for varying values of $p$ when $n = 100$

In addition, the score interval can be used with nearly all sample sizes and parameter values. It is thus not necessary to check the conditions $n p \geq 10$ and $n (1 - p) \geq 10$ that would be required were the traditional interval employed. So rather than asking when $n$ is large enough for (7.11) to yield a good approximation to (7.10), our recommendation is that the score CI should always be used. The slight additional tediousness of the computation is outweighed by the desirable properties of the interval.

AMPLE 7.8 The article “Repeatability and Reproducibility for Pass/Fail Data” (J. of Testing and Eval.,1997: 151-153) reported that in $n = 48$ trials in a particular laboratory, 16 resulted in ignition of a particular type of substrate by a lighted cigarette. Let $p$ denote the long-run proportion of all such trials that would result in ignition. A point estimate for $p$ is $p = 16 / 48 = .333$ . A confidence interval for $p$ with a confidence level of approximately $95 %$ is

\frac{.333 + ( 1.96 ) ^{2} / 96}{1 + ( 1.96 ) ^{2} / 48} \pm (1.96) \frac{( .333 ) ( .667 ) / 48 + ( 1.96 ) ^{2} / 9216}{1 + ( 1.96 ) ^{2} / 48}

= .345 \pm .129 = (.216, .474)

This interval is quite wide because a sample size of 48 is not at all large when estimating a proportion.

The traditional interval is

.333 \pm 1.96 (.333) (.667) / 48 = .333 \pm .133 = (.200, .466)

These two intervals would be in much closer agreement were the sample size substantially larger.

Equating the width of the CI for $p$ to a prespecified width $w$ gives a quadratic equation for the sample size $n$ necessary to give an interval with a desired degree of precision. Suppressing the subscript in $z_{α /2}$ , the solution is

n = \frac{2 z ^{2} p q - z ^{2} w ^{2} \pm 4 z ^{4} p q ( p q - w ^{2} ) + w ^{2} z ^{4}}{w ^{2}} (7.12)

Neglecting the terms in the numerator involving $w^{2}$ gives

n \approx \frac{4 z ^{2} p q}{w ^{2}}

This latter expression is what results from equating the width of the traditional interval to $w$ .

These formulas unfortunately involve the unknown $p$ . The most conservative approach is to take advantage of the fact that $p q [= p (1 - p)]$ is maximized at $p = .5$ . Thus if $p = q = .5$ is used in (7.12), the width will be at most $w$ regardless of what value of $p$ results from the sample. Alternatively, if the investigator believes strongly, based on prior information, that $p \leq p_{0} \leq .5$ , then $p_{0}$ can be used in place of $p$ . A similar comment applies when $p \geq p_{0} \geq .5$ .

9 The width of the $95 % CI$ in Example 7.8 is .258 . The value of $n$ necessary to ensure a width of .10 irrespective of the value of $p$ is

n = \frac{2 ( 1.96 ) ^{2} ( .25 ) - ( 1.96 ) ^{2} ( .01 ) \pm 4 ( 1.96 ) ^{4} ( .25 ) ( .25 - .01 ) + ( .01 ) ( 1.96 ) ^{4}}{.01} = 380.3

Thus a sample size of 381 should be used. The expression for $n$ based on the traditional CI gives a slightly larger value of 385.

One-Sided Confidence Intervals (Confidence Bounds)

The confidence intervals discussed thus far give both a lower confidence bound and an upper confidence bound for the parameter being estimated. In some circumstances, an investigator will want only one of these two types of bounds. For example, a psychologist may wish to calculate a $95 %$ upper confidence bound for true average reaction time to a particular stimulus, or a reliability engineer may want only a lower confidence bound for true average lifetime of components of a certain

type. Because the cumulative area under the standard normal curve to the left of 1.645 is .95 ,

P (\frac{X ˉ - μ}{S / n} < 1.645) \approx .95

Manipulating the inequality inside the parentheses to isolate $μ$ on one side and replacing rv’s by calculated values gives the inequality $μ > \overset{x}{ˉ} - 1.645 s / n$ ; the expression on the right is the desired lower confidence bound. Starting with $P (- 1.645 < Z) \approx .95$ and manipulating the inequality results in the upper confidence bound. A similar argument gives a one-sided bound associated with any other confidence level.

PROPOSITION

A large-sample upper confidence bound for $μ$ is

μ < \overset{x}{ˉ} + z_{α} \cdot \frac{s}{n}

and a large-sample lower confidence bound for $μ$ is

μ > \overset{x}{ˉ} - z_{α} \cdot \frac{s}{n}

A one-sided confidence bound for $p$ results from replacing $z_{α /2}$ by $z_{α}$ and $\pm$ by either + or - in the CI formula (7.10) for $p$ . In all cases the confidence level is approximately $100 (1 - α) %$ .

EXAMPLE 7.10 Titanium and its alloys have found increasing use in aerospace and automotive applications because of durability and high strength-to-weight ratios. However, machining can be difficult because of low thermal conductivity. The article “Modeling and Multi-Objective Optimization of Process Parameters of Wire Electrical Discharge Machining Using Non-Dominated Sorting Genetic Algorithm-II (J. of Engr. Manuf., 2012: 1186-2001) described an investigation into different settings that impact wire electrical discharge machining of titanium 6-2-4-2. One characteristic of interest was surface roughness $(μ g)$ of the metal after machining. A sample of 54 surface roughness observations resulted in a sample mean roughness of 1.9042 and a sample standard deviation of .1455 . An upper confidence bound for true average roughness $μ$ with confidence level $95 %$ requires $z_{.05} = 1.645$ (not the value $z_{.025} = 1.96$ needed for a two-sided CI). The bound is

1.9042 + (1.645) \cdot \frac{( .1455 )}{54} = 1.9042 + .0326 = 1.9368

Thus we estimate with a confidence level of roughly $95 %$ that $μ < 1.9368$ .

EXERCISES Section 7.2 (12-27)

The following observations are lifetimes (days) subse- $115181255418441461516739743789807$ quent to diagnosis for individuals suffering from blood $86592498310251062106311651191122212221251$ cancer (“A Goodness of Fit Approach to the Class of $12771290135713691408145514781519157815781599$ $1603160516961735179918151852189919251965$

Life Distributions with Unknown Age,” Quality and a. Can a confidence interval for true average lifetime be Reliability Engr. Intl., 2012: 761-766): calculated without assuming anything about the

Copyright 2016 Congage Learning, All Rights Reserved, May not be copied, scanned, or duplicated, in whole or in part. Due to electronic rights, some third party content may be suppressed from the eBook and/or eChapter(s). Editorial review has deemed that any suppressed content does not materially affect the overall learning experience. Congage Learning reserves the right to remove additional content at any time if subsequent rights restrictions require it. nature of the lifetime distribution? Explain your reasoning. [Note: A normal probability plot of the data exhibits a reasonably linear pattern.]

b. Calculate and interpret a confidence interval with a $99 %$ confidence level for true average lifetime. [Hint: $\overset{x}{ˉ} = 1191.6$ and $s = 506.6$ .]

The article “Gas Cooking, Kitchen Ventilation, and Exposure to Combustion Products” (Indoor Air, 2006: 65-73) reported that for a sample of 50 kitchens with gas cooking appliances monitored during a one-week period, the sample mean $CO_{2}$ level (ppm) was 654.16, and the sample standard deviation was 164.43.

a. Calculate and interpret a $95 %$ (two-sided) confidence interval for true average $CO_{2}$ level in the population of all homes from which the sample was selected.

b. Suppose the investigators had made a rough guess of 175 for the value of $s$ before collecting data. What sample size would be necessary to obtain an interval width of $50 ppm$ for a confidence level of $95 %$ ?

The negative effects of ambient air pollution on children’s lung function has been well established, but less research is available about the impact of indoor air pollution. The authors of “Indoor Air Pollution and Lung Function Growth Among Children in Four Chinese Cities” (Indoor Air, 2012: 3-11) investigated the relationship between indoor air-pollution metrics and lung function growth among children ages 6-13 years living in four Chinese cities. For each subject in the study, the authors measured an important lung-capacity index known as $FEV_{1}$ , the forced volume (in $ml$ ) of air that is exhaled in 1 second. Higher $FEV_{1}$ values are associated with greater lung capacity. Among the children in the study, 514 came from households that used coal for cooking or heating or both. Their $FEV_{1}$ mean was 1427 with a standard deviation of 325. (A complex statistical procedure was used to show that burning coal had a clear negative effect on mean $FEV_{1}$ levels.)

a. Calculate and interpret a $95 %$ (two-sided) confidence interval for true average $FEV_{1}$ level in the population of all children from which the sample was selected. Does it appear that the parameter of interest has been accurately estimated?

b. Suppose the investigators had made a rough guess of 320 for the value of $s$ before collecting data. What sample size would be necessary to obtain an interval width of $50 ml$ for a confidence level of $95 %$ ?

Determine the confidence level for each of the following large-sample one-sided confidence bounds:

a. Upper bound: $\overset{x}{ˉ} + .84 s / n$

b. Lower bound: $\overset{x}{ˉ} - 2.05 s / n$

c. Upper bound: $\overset{x}{ˉ} + .67 s / n$

The alternating current (AC) breakdown voltage of an insulating liquid indicates its dielectric strength. The article “Testing Practices for the AC Breakdown Voltage Testing of Insulation Liquids” (IEEE

Electrical Insulation Magazine, 1995: 21-26) gave the accompanying sample observations on breakdown voltage $(kV)$ of a particular circuit under certain conditions.

$62505357415355615964505364625068$

$54555750555056554655535452474755$

57 48 63 57 57 55 53 59 53 52 50 55 60 50 56 58

a. Construct a boxplot of the data and comment on interesting features.

b. Calculate and interpret a $95 % CI$ for true average breakdown voltage $μ$ . Does it appear that $μ$ has been precisely estimated? Explain.

c. Suppose the investigator believes that virtually all values of breakdown voltage are between 40 and 70 . What sample size would be appropriate for the $95 %$ $CI$ to have a width of $2 kV$ (so that $μ$ is estimated to within $1 kV$ with $95 %$ confidence)?

Exercise 1.13 gave a sample of ultimate tensile strength observations (ksi). Use the accompanying descriptive statistics output from Minitab to calculate a $99 %$ lower confidence bound for true average ultimate tensile strength, and interpret the result.

$\mathrm{N}$	Mean	Median	TrMean	StDev	Mean
153	135.39	135.40	135.41	4.59	0.37
Minimum		Maximum	Q1	Q3
	122.20	147.70	132.95	138.25

The U.S. Army commissioned a study to assess how deeply a bullet penetrates ceramic body armor (“Testing Body Armor Materials for Use by the U.S. Army-Phase III,” 2012). In the standard test, a cylindrical clay model is layered under the armor vest. A projectile is then fired, causing an indentation in the clay. The deepest impression in the clay is measured as an indication of survivability of someone wearing the armor. Here is data from one testing organization under particular experimental conditions; measurements (in $mm$ ) were made using a manually controlled digital caliper:

22.4	23.6	24.0	24.9	25.5	25.6
25.8	26.1	26.4	26.7	27.4	27.6
28.3	29.0	29.1	29.6	29.7	29.8
29.9	30.0	30.4	30.5	30.7	30.7
31.0	31.0	31.4	31.6	31.7	31.9
31.9	32.0	32.1	32.4	32.5	32.5
32.6	32.9	33.1	33.3	33.5	33.5
33.5	33.5	33.6	33.6	33.8	33.9
34.1	34.2	34.6	34.6	35.0	35.2
35.2	35.4	35.4	35.4	35.5	35.7
35.8	36.0	36.0	36.0	36.1	36.1
36.2	36.4	36.6	37.0	37.4	37.5
37.5	38.0	38.7	38.8	39.8	41.0
42.0	42.1	44.6	48.3	55.0

a. Construct a boxplot of the data and comment on interesting features.

b. Construct a normal probability plot. Is it plausible that impression depth is normally distributed? Is a normal distribution assumption needed in order to calculate a confidence interval or bound for the true average depth $μ$ using the foregoing data? Explain.

c. Use the accompanying Minitab output as a basis for calculating and interpreting an upper confidence bound for $μ$ with a confidence level of $99 %$ . Variable Count Mean SE Mean StDev $Bepth 83 33.370 0.578 5.268$ Q1 Median Q3 IQR $30.400 33.500 36.000 5.600$

The article “Limited Yield Estimation for Visual Defect Sources” (IEEE Trans. on Semiconductor Manuf., 1997: 17-23) reported that, in a study of a particular wafer inspection process, 356 dies were examined by an inspection probe and 201 of these passed the probe. Assuming a stable process, calculate a $95 %$ (two-sided) confidence interval for the proportion of all dies that pass the probe.
TV advertising agencies face increasing challenges in reaching audience members because viewing TV programs via digital streaming is gaining in popularity. The Harris poll reported on November 13, 2012, that 53% of 2343 American adults surveyed said they have watched digitally streamed TV programming on some type of device.

a. Calculate and interpret a confidence interval at the $99 %$ confidence level for the proportion of all adult Americans who watched streamed programming up to that point in time.

b. What sample size would be required for the width of a $99 % CI$ to be at most .05 irrespective of the value of $p$ ?

In a sample of 1000 randomly selected consumers who had opportunities to send in a rebate claim form after purchasing a product, 250 of these people said they never did so (“Rebates: Get What You Deserve,” Consumer Reports, May 2009: 7). Reasons cited for their behavior included too many steps in the process, amount too small, missed deadline, fear of being placed on a mailing list, lost receipt, and doubts about receiving the money. Calculate an upper confidence bound at the $95 %$ confidence level for the true proportion of such consumers who never apply for a rebate. Based on this bound, is there compelling evidence that the true proportion of such consumers is smaller than 1/3? Explain your reasoning.
The technology underlying hip replacements has changed as these operations have become more popular (over 250,000 in the United States in 2008). Starting in 2003, highly durable ceramic hips were marketed. Unfortunately, for too many patients the increased durability has been counterbalanced by an increased incidence of squeaking. The May 11, 2008, issue of the New York Times reported that in one study of 143 individuals who received ceramic hips between 2003 and 2005, 10 of the hips developed squeaking.

a. Calculate a lower confidence bound at the $95 %$ confidence level for the true proportion of such hips that develop squeaking.

b. Interpret the $95 %$ confidence level used in (a).

The Pew Forum on Religion and Public Life reported on Dec. 9, 2009, that in a survey of 2003 American adults, $25 %$ said they believed in astrology.

a. Calculate and interpret a confidence interval at the $99 %$ confidence level for the proportion of all adult Americans who believe in astrology.

b. What sample size would be required for the width of a $99 % CI$ to be at most .05 irrespective of the value of $p$ ?

A sample of 56 research cotton samples resulted in a sample average percentage elongation of 8.17 and a sample standard deviation of 1.42 (“An Apparent Relation Between the Spiral Angle $ϕ$ , the Percent Elongation $E_{1}$ , and the Dimensions of the Cotton Fiber,” Textile Research J., 1978: 407-410). Calculate a $95 %$ large-sample CI for the true average percentage elongation $μ$ . What assumptions are you making about the distribution of percentage elongation?
A state legislator wishes to survey residents of her district to see what proportion of the electorate is aware of her position on using state funds to pay for abortions.

a. What sample size is necessary if the $95 % CI$ for $p$ is to have a width of at most .10 irrespective of $p$ ?

b. If the legislator has strong reason to believe that at least $2/3$ of the electorate know of her position, how large a sample size would you recommend?

The superintendent of a large school district, having once had a course in probability and statistics, believes that the number of teachers absent on any given day has a Poisson distribution with parameter $μ$ . Use the accompanying data on absences for 50 days to obtain a large-sample CI for $μ$ . [Hint: The mean and variance of a Poisson variable both equal $μ$ , so

Z = \frac{X ˉ - μ}{μ / n}

has approximately a standard normal distribution. Now proceed as in the derivation of the interval for $p$ by making a probability statement (with probability $1 - α$ ) and solving the resulting inequalities for $μ$ - see the argument just after (7.10).]

Number of

absences	0	1	2	3	4	5	6	7	8	9	10
Frequency	1	4	8	10	8	7	5	3	2	1

Reconsider the CI (7.10) for $p$ , and focus on a confidence level of $95 %$ . Show that the confidence limits agree quite well with those of the traditional interval (7.11) once two successes and two failures have been appended to the sample [i.e.,(7.11) based on $x + 2 S$ ’s in $n + 4$ trials]. [Hint: $1.96 \approx 2$ . Note: Agresti and Coull showed that this adjustment of the traditional interval also has an actual confidence level close to the nominal level.]

7.3 Intervals Based on a Normal Population Distribution

The CI for $μ$ presented in Section 7.2 is valid provided that $n$ is large. The resulting interval can be used whatever the nature of the population distribution. The CLT cannot be invoked, however, when $n$ is small. In this case, one way to proceed is to make a specific assumption about the form of the population distribution and then derive a CI tailored to that assumption. For example, we could develop a CI for $μ$ when the population is described by a gamma distribution, another interval for the case of a Weibull distribution, and so on. Statisticians have indeed carried out this program for a number of different distributional families. Because the normal distribution is more frequently appropriate as a population model than is any other type of distribution, we will focus here on a CI for this situation.

ASSUMPTION The population of interest is normal, so that $X_{1}, \dots, X_{n}$ constitutes a random sample from a normal distribution with both $μ$ and $σ$ unknown.

The key result underlying the interval in Section 7.2 was that for large $n$ , the rv $Z = (\overset{ˉ}{X} - μ) / (S / n)$ has approximately a standard normal distribution. When $n$ is small, the additional variability in the denominator implies that the probability distribution of $(\overset{ˉ}{X} - μ) / (S / n)$ will be more spread out than the standard normal distribution. The result on which inferences are based introduces a new family of probability distributions called $t$ distributions.

THEOREM

When $\overset{ˉ}{X}$ is the mean of a random sample of size $n$ from a normal distribution with mean $μ$ , the rv

T = \frac{X ˉ - μ}{S / n} (7.13)

has a probability distribution called a $t$ distribution with $n - 1$ degrees of freedom (df).

Properties of $t$ Distributions

Before applying this theorem, a discussion of properties of $t$ distributions is in order. Although the variable of interest is still $(\overset{ˉ}{X} - μ) / (S / n)$ , we now denote it by $T$ to emphasize that it does not have a standard normal distribution when $n$ is small. Recall that a normal distribution is governed by two parameters; each different choice of $μ$ in combination with $σ$ gives a particular normal distribution. Any particular $t$ distribution results from specifying the value of a single parameter, called the number of degrees of freedom, abbreviated df. We’ll denote this parameter by the Greek letter $ν$ . Possible values of $ν$ are the positive integers 1, $2, 3, \dots$ So there is a $t$ distribution with $1 df$ , another with $2 df$ , yet another with $3 df$ , and so on.

For any fixed value of $ν$ , the density function that specifies the associated $t$ curve is even more complicated than the normal density function. Fortunately, we need concern ourselves only with several of the more important features of these curves.

Properties of $t$ Distributions

Let $t_{ν}$ denote the $t$ distribution with $ν$ df.

Each $t_{ν}$ curve is bell-shaped and centered at 0 .
Each $t_{ν}$ curve is more spread out than the standard normal $(z)$ curve.
As $ν$ increases, the spread of the corresponding $t_{ν}$ curve decreases.
As $ν \to \infty$ , the sequence of $t_{ν}$ curves approaches the standard normal curve (so the $z$ curve is often called the $t$ curve with $df = \infty$ ).

Figure 7.7 illustrates several of these properties for selected values of $ν$ .

01927a02-a1da-7086-ba87-2208e017bc0f_20_733_898_695_346_0.jpg

Figure $7.7 t_{ν}$ and $z$ curves

The number of df for $T$ in (7.13) is $n - 1$ because, although $S$ is based on the $n$ deviations $X_{1} - \overset{ˉ}{X}, \dots, X_{n} - \overset{ˉ}{X}, \sum (X_{i} - \overset{ˉ}{X}) = 0$ implies that only $n - 1$ of these are “freely determined.” The number of df for a $t$ variable is the number of freely determined deviations on which the estimated standard deviation in the denominator of $T$ is based.

The use of $t$ distribution in making inferences requires notation for capturing $t$ -curve tail areas analogous to $z_{α}$ for the $z$ curve. You might think that $t_{α}$ would do the trick. However, the desired value depends not only on the tail area captured but also on df.

NOTATION

Let $t_{α, ν} =$ the number on the measurement axis for which the area under the

$t$ curve with $ν$ df to the right of $t_{α, ν}$ is $α; t_{α, ν}$ is called a $t$ critical value.

For example, $t_{.05, 6}$ is the $t$ critical value that captures an upper-tail area of .05 under the $t$ curve with 6 df. The general notation is illustrated in Figure 7.8. Because $t$ curves are symmetric about zero, $- t_{α, ν}$ captures lower-tail area $α$ . Appendix Table A. 5 gives $t_{α, ν}$ for selected values of $α$ and $ν$ . This table also appears inside the back cover. The columns of the table correspond to different values of $α$ . To obtain $t_{.05, 15}$ , go to the $α = .05$ column, look down to the $ν = 15$ row, and read $t_{.05, 15} = 1.753$ . Similarly, $t_{.05, 22} = 1.717$ (.05 column, $ν = 22$ row), and $t_{.01, 22} = 2.508$ .

01927a02-a1da-7086-ba87-2208e017bc0f_21_967_188_450_220_0.jpg

Figure 7.8 Illustration of a $t$ critical value

The values of $t_{α, ν}$ exhibit regular behavior as we move across a row or down a column. For fixed $ν, t_{α, ν}$ increases as $α$ decreases, since we must move farther to the right of zero to capture area $α$ in the tail. For fixed $α$ , as $ν$ is increased (i.e., as we look down any particular column of the $t$ table) the value of $t_{α, ν}$ decreases. This is because a larger value of $ν$ implies a $t$ distribution with smaller spread, so it is not necessary to go so far from zero to capture tail area $α$ . Furthermore, $t_{α, ν}$ decreases more slowly as $ν$ increases. Consequently, the table values are shown in increments of 2 between 30 df and 40 df and then jump to $ν = 50, 60, 120$ , and finally $\infty$ . Because $t_{\infty}$ is the standard normal curve, the familiar $z_{α}$ values appear in the last row of the table. The rule of thumb suggested earlier for use of the large-sample CI (if $n > 40$ ) comes from the approximate equality of the standard normal and $t$ distributions for $ν \geq 40$ .

The One-Sample $t$ Confidence Interval

The standardized variable $T$ has a $t$ distribution with $n - 1 df$ , and the area under the corresponding $t$ density curve between $- t_{α /2, n - 1}$ and $t_{α /2, n - 1}$ is $1 - α$ (area $α /2$ lies in each tail), so

P (- t_{α /2, n - 1} < T < t_{α /2, n - 1}) = 1 - α (7.14)

Expression (7.14) differs from expressions in previous sections in that $T$ and $t_{α /2, n - 1}$ are used in place of $Z$ and $z_{α /2}$ , but it can be manipulated in the same manner to obtain a confidence interval for $μ$ .

PROPOSITION

Let $\overset{x}{ˉ}$ and $s$ be the sample mean and sample standard deviation computed from the results of a random sample from a normal population with mean $μ$ . Then a $100 (1 - α) %$ confidence interval for $μ$ is

(\overset{x}{ˉ} - t_{α /2, n - 1} \cdot \frac{s}{n}, \overset{x}{ˉ} + t_{α /2, n - 1} \cdot \frac{s}{n}) (7.15)

or, more compactly, $\overset{x}{ˉ} \pm t_{α /2, n - 1} \cdot s / n$ .

An upper confidence bound for $μ$ is

\overset{x}{ˉ} + t_{α, n - 1} \cdot \frac{s}{n}

and replacing + by - in this latter expression gives a lower confidence bound for $μ$ , both with confidence level $100 (1 - α) %$ .

EXAMPLE 7.11 Even as traditional markets for sweetgum lumber have declined, large section solid timbers traditionally used for construction bridges and mats have become increasingly scarce. The article “Development of Novel Industrial Laminated Planks from Sweetgum Lumber” (J. of Bridge Engr., 2008: 64-66) described the manufacturing and testing of composite beams designed to add value to low-grade sweetgum lumber.

Here is data on the modulus of rupture (psi; the article contained summary data expressed in $MPa$ ):

6807.99	7637.06	6663.28	6165.03	6991.41	6992.23
6981.46	7569.75	7437.88	6872.39	7663.18	6032.28
6906.04	6617.17	6984.12	7093.71	7659.50	7378.61
7295.54	6702.76	7440.17	8053.26	8284.75	7347.95
7422.69	7886.87	6316.67	7713.65	7503.33	7674.99

Figure 7.9 shows a normal probability plot from the R software. The straightness of the pattern in the plot provides strong support for assuming that the population distribution of MOR is at least approximately normal.

01927a02-a1da-7086-ba87-2208e017bc0f_22_729_681_712_620_0.jpg

Figure 7.9 A normal probability plot of the modulus of rupture data

The sample mean and sample standard deviation are 7203.191 and 543.5400, respectively (for anyone bent on doing hand calculation, the computational burden is eased a bit by subtracting 6000 from each $x$ value to obtain $y_{i} = x_{i} - 6000$ ; then $\sum y_{i} = 36, 095.72$ and $\sum y_{i}^{2} = 51, 997, 668.77$ , from which $\overset{y}{ˉ} = 1203.191$ and $s_{y} = s_{x}$ as given).

Let’s now calculate a confidence interval for true average MOR using a confidence level of $95 %$ . The CI is based on $n - 1 = 29$ degrees of freedom, so the necessary $t$ critical value is $t_{.025, 29} = 2.045$ . The interval estimate is now

\overset{x}{ˉ} \pm t_{.025, 29} \cdot \frac{s}{n} = 7203.191 \pm (2.045) \cdot \frac{543.5400}{30}

= 7203.191 \pm 202.938 = (7000.253, 7406.129)

We estimate that $7000.253 < μ < 7406.129$ with $95 %$ confidence. If we use the same formula on sample after sample, in the long run $95 %$ of the calculated intervals will contain $μ$ . Since the value of $μ$ is not available, we don’t know whether the calculated interval is one of the “good” 95% or the “bad” 5%. Even with the moderately large sample size, our interval is rather wide. This is a consequence of the substantial amount of sample variability in MOR values.

A lower $95 %$ confidence bound would result from retaining only the lower confidence limit (the one with -) and replacing 2.045 with $t_{.05, 29} = 1.699$ .

Unfortunately, it is not easy to select $n$ to control the width of the $t$ interval. This is because the width involves the unknown (before the data is collected) $s$ and because $n$ enters not only through $1/ n$ but also through $t_{α /2, n - 1}$ . As a result, an appropriate $n$ can be obtained only by trial and error.

In Chapter 15, we will discuss a small-sample CI for $μ$ that is valid provided only that the population distribution is symmetric, a weaker assumption than normality. However, when the population distribution is normal, the $t$ interval tends to be narrower than would be any other interval with the same confidence level.

A Prediction Interval for a Single Future Value

In many applications, the objective is to predict a single value of a variable to be observed at some future time, rather than to estimate the mean value of that variable.

EXAMPLE 7.12 Consider the following sample of fat content (in percentage) of $n = 10$ randomly selected hot dogs (“Sensory and Mechanical Assessment of the Quality of Frankfurters,” J. of Texture Studies, 1990: 395-409):

25.2 21.3 22.8 17.0 29.8 21.0 25.5 16.0 20.9 19.5 (19.5)

Assuming that these were selected from a normal population distribution, a $95 %$ CI for (interval estimate of) the population mean fat content is

\overset{x}{ˉ} \pm t_{.025, 9} \cdot \frac{s}{n} = 21.90 \pm 2.262 \cdot \frac{4.134}{10} = 21.90 \pm 2.96

= (18.94, 24.86)

Suppose, however, you are going to eat a single hot dog of this type and want a prediction for the resulting fat content. A point prediction, analogous to a point estimate, is just $\overset{x}{ˉ} = 21.90$ . This prediction unfortunately gives no information about reliability or precision.

The general setup is as follows: We have available a random sample $X_{1}, X_{2}, \dots, X_{n}$ from a normal population distribution, and wish to predict the value of $X_{n + 1}$ , a single future observation (e.g., the lifetime of a single lightbulb to be purchased or the fuel efficiency of a single vehicle to be rented). A point predictor is $\overset{ˉ}{X}$ , and the resulting prediction error is $\overset{ˉ}{X} - X_{n + 1}$ . The expected value of the prediction error is

E (\overset{ˉ}{X} - X_{n + 1}) = E (\overset{ˉ}{X}) - E (X_{n + 1}) = μ - μ = 0

Since $X_{n + 1}$ is independent of $X_{1}, \dots, X_{n}$ , it is independent of $\overset{ˉ}{X}$ , so the variance of the prediction error is

V (\overset{ˉ}{X} - X_{n + 1}) = V (\overset{ˉ}{X}) + V (X_{n + 1}) = \frac{σ ^{2}}{n} + σ^{2} = σ^{2} (1 + \frac{1}{n})

The prediction error is normally distributed because it is a linear combination of independent, normally distributed rv’s. Thus

Z = \frac{( X ˉ - X _{n + 1} ) - 0}{σ ^{2} ( 1 + \frac{1}{n} )} = \frac{X ˉ - X _{n + 1}}{σ ^{2} ( 1 + \frac{1}{n} )}

has a standard normal distribution. It can be shown that replacing $σ$ by the sample standard deviation $S$ (of $X_{1}, \dots, X_{n}$ ) results in

T = \frac{X ˉ - X _{n + 1}}{S 1 + \frac{1}{n}} \sim t distribution with n - 1 df

Manipulating this $T$ variable as $T = (\overset{ˉ}{X} - μ) / (S / n)$ was manipulated in the development of a CI gives the following result.

PROPOSITION

A prediction interval (PI) for a single observation to be selected from a normal population distribution is

\overset{x}{ˉ} \pm t_{α /2, n - 1} \cdot s 1 + \frac{1}{n} (7.16)

The prediction level is $100 (1 - α) %$ . A lower prediction bound results from replacing $t_{α /2}$ by $t_{α}$ and discarding the + part of (7.16); a similar modification gives an upper prediction bound.

The interpretation of a $95 %$ prediction level is similar to that of a $95 %$ confidence level. If the interval (7.16) is calculated for sample after sample and after each calculation $X_{n + 1}$ is observed, in the long run $95 %$ of these intervals will include the corresponding future values.

EXAMPLE 7.13

(Example 7.12 continued)

With $n = 10, \overset{x}{ˉ} = 21.90, s = 4.134$ , and $t_{.025, 9} = 2.262$ , a $95 %$ PI for the fat content of a single hot dog is

21.90 \pm (2.262) (4.134) 1 + \frac{1}{10} = 21.90 \pm 9.81

= (12.09, 31.71)

This interval is quite wide, indicating substantial uncertainty about fat content. Notice that the width of the PI is more than three times that of the CI.

The error of prediction is $\overset{ˉ}{X} - X_{n + 1}$ , a difference between two random variables, whereas the estimation error is $\overset{ˉ}{X} - μ$ , the difference between a random variable and a fixed (but unknown) value. The PI is wider than the CI because there is more variability in the prediction error (due to $X_{n + 1}$ ) than in the estimation error. In fact, as $n$ gets arbitrarily large, the CI shrinks to the single value $μ$ , and the PI approaches $μ \pm z_{α /2} \cdot σ$ . There is uncertainty about a single $X$ value even when there is no need to estimate.

Tolerance Intervals

Consider a population of automobiles of a certain type, and suppose that under specified conditions, fuel efficiency (mpg) has a normal distribution with $μ = 30$ and $σ = 2$ . Then since the interval from -1.645 to 1.645 captures $90 %$ of the area under the $z$ curve, $90 %$ of all these automobiles will have fuel efficiency values between $μ - 1.645 σ = 26.71$ and $μ + 1.645 σ = 33.29$ . But what if the values of $μ$ and $σ$ are not known? We can take a sample of size $n$ , determine the fuel efficiencies, $\overset{x}{ˉ}$ and $s$ , and form the interval whose lower limit is $\overset{x}{ˉ} - 1.645 s$ and whose upper limit is $\overset{x}{ˉ} + 1.645 s$ . However, because of sampling variability in the estimates of $μ$ and $σ$ , there is a good chance that the resulting interval will include less than $90 %$ of the population values. Intuitively, to have an a priori $95 %$ chance of the resulting interval including at least $90 %$ of the population values, when $\overset{x}{ˉ}$ and $s$ are used in place of $μ$ and $σ$ we should also replace 1.645 by some larger number. For example, when $n = 20$ , the value 2.310 is such that we can be $95 %$ confident that the interval $\overset{x}{ˉ} \pm 2.310 s$ will include at least $90 %$ of the fuel efficiency values in the population.

Let $k$ be a number between 0 and 100 . A tolerance interval for capturing at least $k %$ of the values in a normal population distribution with a confidence level $95 %$ has the form

\overset{x}{ˉ} \pm (tolerance critical value) \cdot s

Tolerance critical values for $k = 90, 95$ , and 99 in combination with various sample sizes are given in Appendix Table A.6. This table also includes critical values for a confidence level of $99 %$ (these values are larger than the corresponding $95 %$ values). Replacing $\pm$ by + gives an upper tolerance bound, and using - in place of $\pm$ results in a lower tolerance bound. Critical values for obtaining these one-sided bounds also appear in Appendix Table A.6.

EXAMPLE 7.14 As part of a larger project to study the behavior of stressed-skin panels, a structural component being used extensively in North America, the article “Time-Dependent Bending Properties of Lumber” (J. of Testing and Eval., 1996: 187-193) reported on various mechanical properties of Scotch pine lumber specimens. Consider the following observations on modulus of elasticity $(MPa)$ obtained 1 minute after loading in a certain configuration:

10,490 16,620 17,300 15,480 12,970 17,260 13,400 13,900

13,630 13,260 14,370 11,700 15,470 17,840 14,070 14,760

There is a pronounced linear pattern in a normal probability plot of the data. Relevant summary quantities are $n = 16, \overset{x}{ˉ} = 14, 532.5, s = 2055.67$ . For a confidence level of $95 %$ , a two-sided tolerance interval for capturing at least $95 %$ of the modulus of elasticity values for specimens of lumber in the population sampled uses the tolerance critical value of 2.903 . The resulting interval is

14, 532.5 \pm (2.903) (2055.67) = 14, 532.5 \pm 5967.6 = (8, 564.9, 20, 500.1)

We can be highly confident that at least $95 %$ of all lumber specimens have modulus of elasticity values between 8,564.9 and 20,500.1.

The $95 %$ CI for $μ$ is $(13, 437.3, 15, 627.7)$ , and the $95 %$ prediction interval for the modulus of elasticity of a single lumber specimen is $(10, 017.0, 19, 048.0)$ . Both the prediction interval and the tolerance interval are substantially wider than the confidence interval.

Intervals Based on Nonnormal Population Distributions

The one-sample $t CI$ for $μ$ is robust to small or even moderate departures from normality unless $n$ is quite small. By this we mean that if a critical value for $95 %$ confidence, for example, is used in calculating the interval, the actual confidence level will be reasonably close to the nominal $95 %$ level. If, however, $n$ is small and the population distribution is highly nonnormal, then the actual confidence level may be considerably different from the one you think you are using when you obtain a particular critical value from the $t$ table. It would certainly be distressing to believe that your confidence level is about $95 %$ when in fact it was really more like $88 %$ ! The bootstrap technique, introduced in Section 7.1, has been found to be quite successful at estimating parameters in a wide variety of nonnormal situations.

In contrast to the confidence interval, the validity of the prediction and tolerance intervals described in this section is closely tied to the normality assumption. These latter intervals should not be used in the absence of compelling evidence for normality. The excellent reference Statistical Intervals, cited in the bibliography at the end of this chapter, discusses alternative procedures of this sort for various other situations. EXERCISES Section 7.3 (28-41)

Determine the values of the following quantities:

a. $t_{.1, 15}$ b. $t_{.05, 15}$ c. $t_{.05, 25}$ d. $t_{.05, 40}$ e. $t_{.005, 40}$

Determine the $t$ critical value(s) that will capture the desired $t$ -curve area in each of the following cases:

a. Central area $= .95, df = 10$

b. Central area $= .95, df = 20$

c. Central area $= .99, df = 20$

d. Central area $= .99, df = 50$

e. Upper-tail area $= .01, df = 25$

f. Lower-tail area $= .025, df = 5$

Determine the $t$ critical value for a two-sided confidence interval in each of the following situations:

a. Confidence level $= 95 %, df = 10$

b. Confidence level $= 95 %, df = 15$

c. Confidence level $= 99 %, df = 15$

d. Confidence level $= 99 %, n = 5$

e. Confidence level $= 98 %, df = 24$

f. Confidence level $= 99 %, n = 38$

Determine the $t$ critical value for a lower or an upper confidence bound for each of the situations described in Exercise 30.
According to the article “Fatigue Testing of Condoms” (Polymer Testing, 2009: 567-571), “tests currently used for condoms are surrogates for the challenges they face in use,” including a test for holes, an inflation test, a package seal test, and tests of dimensions and lubricant quality (all fertile territory for the use of statistical methodology!). The investigators developed a new test that adds cyclic strain to a level well below breakage and determines the number of cycles to break. A sample of 20 condoms of one particular type resulted in a sample mean number of 1584 and a sample standard deviation of 607. Calculate and interpret a confidence interval at the $99 %$ confidence level for the true average number of cycles to break. [Note: The article presented the results of hypothesis tests based on the $t$ distribution; the validity of these depends on assuming normal population distributions.]
The article “Measuring and Understanding the Aging of Kraft Insulating Paper in Power Transformers” (IEEE Electrical Insul. Mag., 1996: 28-34) contained the following observations on degree of polymerization for paper specimens for which viscosity times concentration fell in a certain middle range: $418421421422425427431$ $434437439446447448453$ 454 463 465

a. Construct a boxplot of the data and comment on any interesting features.

b. Is it plausible that the given sample observations were selected from a normal distribution?

c. Calculate a two-sided $95 %$ confidence interval for true average degree of polymerization (as did the authors of the article). Does the interval suggest that 440 is a plausible value for true average degree of polymerization? What about 450 ?

A sample of 14 joint specimens of a particular type gave a sample mean proportional limit stress of $8.48 MPa$ and a sample standard deviation of $.79 MPa$ (“Characterization of Bearing Strength Factors in Pegged Timber Connections,” J. of Structural Engr., 1997: 326-332).

a. Calculate and interpret a $95 %$ lower confidence bound for the true average proportional limit stress of all such joints. What, if any, assumptions did you make about the distribution of proportional limit stress?

b. Calculate and interpret a $95 %$ lower prediction bound for the proportional limit stress of a single joint of this type.

Silicone implant augmentation rhinoplasty is used to correct congenital nose deformities. The success of the procedure depends on various biomechanical properties of the human nasal periosteum and fascia. The article “Biomechanics in Augmentation Rhinoplasty” (J. of Med. Engr. and Tech., 2005: 14-17) reported that for a sample of 15 (newly deceased) adults, the mean failure strain (%) was 25.0 , and the standard deviation was 3.5.

a. Assuming a normal distribution for failure strain, estimate true average strain in a way that conveys information about precision and reliability.

b. Predict the strain for a single adult in a way that conveys information about precision and reliability. How does the prediction compare to the estimate calculated in part (a)?

A normal probability plot of the $n = 26$ observations on escape time given in Exercise 36 of Chapter 1 shows a substantial linear pattern; the sample mean and sample standard deviation are 370.69 and 24.36 , respectively.

a. Calculate an upper confidence bound for population mean escape time using a confidence level of $95 %$ .

b. Calculate an upper prediction bound for the escape time of a single additional worker using a prediction level of $95 %$ . How does this bound compare with the confidence bound of part (a)?

c. Suppose that two additional workers will be chosen to participate in the simulated escape exercise. Denote their escape times by $X_{27}$ and $X_{28}$ , and let $\overset{ˉ}{X}_{new}$ denote the average of these two values. Modify the formula for a PI for a single $x$ value to obtain a PI for $\overset{ˉ}{X}_{new}$ , and calculate a $95 %$ two-sided interval based on the given escape data.

A study of the ability of individuals to walk in a straight line (“Can We Really Walk Straight?” Amer. J. of Physical Anthro., 1992: 19-27) reported the accompanying data on cadence (strides per second) for a sample of $n = 20$ randomly selected healthy men.

$.95 .85 .92 .95 .93 .86 1.00 .92 .85 .81$

$.78 .93 .93 1.05 .93 1.06 1.06 .96 .81 .96$

A normal probability plot gives substantial support to the assumption that the population distribution of cadence is approximately normal. A descriptive summary of the data from Minitab follows:

VariableN	Mean	Median	TrMean	StDev	SEMean
cadence 20	0.9255	0.9300	0.9261	0.0809	0.0181
Variable	Min	Max	Q1	Q3
cadence	0.7800	1.0600	0.8525	0.9600

a. Calculate and interpret a $95 %$ confidence interval for population mean cadence.

b. Calculate and interpret a $95 %$ prediction interval for the cadence of a single individual randomly selected from this population.

c. Calculate an interval that includes at least $99 %$ of the cadences in the population distribution using a confidence level of $95 %$ .

Ultra high performance concrete (UHPC) is a relatively new construction material that is characterized by strong adhesive properties with other materials. The article “Adhesive Power of Ultra High Performance Concrete from a Thermodynamic Point of View” (J. of Materials in Civil Engr., 2012: 1050-1058) described an investigation of the intermolecular forces for UHPC connected to various substrates. The following work of adhesion measurements (in $mJ / m^{2}$ ) for UHPC specimens adhered to steel appeared in the article:

107.1 109.5 107.4 106.8 1

a. Is it plausible that the given sample observations were selected from a normal distribution?

b. Calculate a two-sided $95 %$ confidence interval for the true average work of adhesion for UHPC adhered to steel. Does the interval suggest that 107 is a plausible value for the true average work of adhesion for UHPC adhered to steel? What about 110 ?

c. Predict the resulting work of adhesion value resulting from a single future replication of the experiment by calculating a $95 %$ prediction interval, and compare the width of this interval to the width of the CI from (b).

d. Calculate an interval for which you can have a high degree of confidence that at least $95 %$ of all UHPC specimens adhered to steel will have work of adhesion values between the limits of the interval.

Exercise 72 of Chapter 1 gave the following observations on a receptor binding measure (adjusted distribution volume) for a sample of 13 healthy individuals: 23, 39, $40, 41, 43, 47, 51, 58, 63, 66, 67, 69, 72$ .

a. Is it plausible that the population distribution from which this sample was selected is normal?

b. Calculate an interval for which you can be $95 %$ confident that at least $95 %$ of all healthy individuals in the population have adjusted distribution volumes lying between the limits of the interval.

c. Predict the adjusted distribution volume of a single healthy individual by calculating a $95 %$ prediction interval. How does this interval’s width compare to the width of the interval calculated in part (b)?

Exercise 13 of Chapter 1 presented a sample of $n = 153$ observations on ultimate tensile strength, and Exercise 17 of the previous section gave summary quantities and requested a large-sample confidence interval. Because the sample size is large, no assumptions about the population distribution are required for the validity of the CI.

a. Is any assumption about the tensile-strength distribution required prior to calculating a lower prediction bound for the tensile strength of the next specimen selected using the method described in this section? Explain.

b. Use a statistical software package to investigate the plausibility of a normal population distribution.

c. Calculate a lower prediction bound with a prediction level of $95 %$ for the ultimate tensile strength of the next specimen selected.

A more extensive tabulation of $t$ critical values than what appears in this book shows that for the $t$ distribution with

$20 df$ , the areas to the right of the values $.687, .860$ , and 1.064 are $.25, .20$ , and .15, respectively. What is the confidence level for each of the following three confidence intervals for the mean $μ$ of a normal population distribution? Which of the three intervals would you recommend be used, and why?

a. $(\overset{x}{ˉ} - .687 s / 21, \overset{x}{ˉ} + 1.725 s / 21)$

b. $(\overset{x}{ˉ} - .860 s / 21, \overset{x}{ˉ} + 1.325 s / 21)$

c. $(\overset{x}{ˉ} - 1.064 s / 21, \overset{x}{ˉ} + 1.064 s / 21)$

7.4 Confidence Intervals for the Variance and Standard Deviation of a Normal Population

Although inferences concerning a population variance $σ^{2}$ or standard deviation $σ$ are usually of less interest than those about a mean or proportion, there are occasions when such procedures are needed. In the case of a normal population distribution, inferences are based on the following result concerning the sample variance $S^{2}$ .

THEOREM

Let $X_{1}, X_{2}, \dots, X_{n}$ be a random sample from a normal distribution with parameters $μ$ and $σ^{2}$ . Then the rv

\frac{( n - 1 ) S ^{2}}{σ ^{2}} = \frac{\sum ( X _{i} - X ˉ ) ^{2}}{σ ^{2}}

has a chi-squared $(χ^{2})$ probability distribution with $n - 1 df$ .

As discussed in Sections 4.4 and 7.1, the chi-squared distribution is a continuous probability distribution with a single parameter $ν$ , called the number of degrees of freedom, with possible values $1, 2, 3, \dots$ . The graphs of several $χ^{2}$ probability density functions (pdf’s) are illustrated in Figure 7.10. Each pdf $f (x; ν)$ is positive only for $x > 0$ , and each has a positive skew (stretched out upper tail), though the distribution moves rightward and becomes more symmetric as $ν$ increases. To specify inferential procedures that use the chi-squared distribution, we need notation analogous to that for a $t$ critical value $t_{α, ν}$ .

01927a02-a1da-7086-ba87-2208e017bc0f_28_699_1701_736_237_0.jpg

Figure 7.10 Graphs of chi-squared density functions

NOTATION Let $χ_{α, ν}^{2}$ , called a chi-squared critical value, denote the number on the horizontal axis such that $α$ of the area under the chi-squared curve with $ν$ df lies to the right of $χ_{α, ν}^{2}$ .

Symmetry of $t$ distributions made it necessary to tabulate only upper-tailed $t$ critical values $(t_{α, ν}$ for small values of $α)$ . The chi-squared distribution is not symmetric, so Appendix Table A. 7 contains values of $χ_{α, ν}^{2}$ both for $α$ near 0 and near 1, as illustrated in Figure 7.11(b). For example, $χ_{.025, 14}^{2} = 26.119$ , and $χ_{.95, 20}^{2}$ (the 5th percentile) $= 10.851$ .

01927a02-a1da-7086-ba87-2208e017bc0f_29_754_413_886_418_0.jpg

Figure $7.11 χ_{α, ν}^{2}$ notation illustrated

The rv $(n - 1) S^{2} / σ^{2}$ satisfies the two properties on which the general method for obtaining a CI is based: It is a function of the parameter of interest $σ^{2}$ , yet its probability distribution (chi-squared) does not depend on this parameter. The area under a chi-squared curve with $ν$ df to the right of $χ_{α /2, ν}^{2}$ is $α /2$ , as is the area to the left of $χ_{1 - α /2, ν}^{2}$ . Thus the area captured between these two critical values is $1 - α$ . As a consequence of this and the theorem just stated,

P (χ_{1 - α /2, n - 1}^{2} < \frac{( n - 1 ) S ^{2}}{σ ^{2}} < χ_{α /2, n - 1}^{2}) = 1 - α (7.17)

The inequalities in (7.17) are equivalent to

\frac{( n - 1 ) S ^{2}}{χ _{α /2, n - 1}^{2}} < σ^{2} < \frac{( n - 1 ) S ^{2}}{χ _{1 - α /2, n - 1}^{2}}

Substituting the computed value $s^{2}$ into the limits gives a CI for $σ^{2}$ , and taking square roots gives an interval for $σ$ .

A $100 (1 - α) %$ confidence interval for the variance $σ^{2}$ of a normal population has lower limit

(n - 1) s^{2} / χ_{α /2, n - 1}^{2}

and upper limit

(n - 1) s^{2} / χ_{1 - α /2, n - 1}^{2}

A confidence interval for $σ$ has lower and upper limits that are the square roots of the corresponding limits in the interval for $σ^{2}$ . An upper or a lower confidence bound results from replacing $α /2$ with $α$ in the corresponding limit of the CI.

7.15 The accompanying data on breakdown voltage of electrically stressed circuits was read from a normal probability plot that appeared in the article “Damage of Flexible Printed Wiring Boards Associated with Lightning-Induced Voltage Surges”

(IEEE Transactions on Components, Hybrids, and Manuf. Tech., 1985: 214-220).

The straightness of the plot gave strong support to the assumption that breakdown voltage is approximately normally distributed.

14702200151022901690238017402390190024802000250020302580210027002190 (2190)

Let $σ^{2}$ denote the variance of the breakdown voltage distribution. The computed value of the sample variance is $s^{2} = 137, 324.3$ , the point estimate of $σ^{2}$ . With $df = n - 1 = 16$ , a $95 %$ CI requires $χ_{.975, 16}^{2} = 6.908$ and $χ_{.025, 16}^{2} = 28.845$ . The interval is

(\frac{16 ( 137 , 324.3 )}{28.845}, \frac{16 ( 137 , 324.3 )}{6.908}) = (76, 172.3, 318, 064.4)

Taking the square root of each endpoint yields (276.0,564.0) as the $95 % CI$ for $σ$ . These intervals are quite wide, reflecting substantial variability in breakdown voltage in combination with a small sample size.

CIs for $σ^{2}$ and $σ$ when the population distribution is not normal can be difficult to obtain. For such cases, consult a knowledgeable statistician.

EXERCISES Section 7.4 (42-46)

Determine the values of the following quantities:

a. $χ_{.1, 15}^{2}$ b. $χ_{.1, 25}^{2}$

c. $χ_{.01, 25}^{2}$ d. $χ_{.005, 25}^{2}$

e. $χ_{.99, 25}^{2}$ f. $χ_{.995, 25}^{2}$

Determine the following:

a. The 95th percentile of the chi-squared distribution with $v = 10$

b. The 5th percentile of the chi-squared distribution with $v = 10$

c. $P (10.98 \leq χ^{2} \leq 36.78)$ , where $χ^{2}$ is a chi-squared rv with $ν = 22$

d. $P (χ^{2} < 14.611$ or $χ^{2} > 37.652)$ , where $χ^{2}$ is a chi-squared rv with $ν = 25$

The amount of lateral expansion (mils) was determined for a sample of $n = 9$ pulsed-power gas metal arc welds used in LNG ship containment tanks. The resulting sample standard deviation was $s = 2.81$ mils. Assuming normality, derive a $95 % CI$ for $σ^{2}$ and for $σ$ .
Wire electrical-discharge machining (WEDM) is a process used to manufacture conductive hard metal components. It uses a continuously moving wire that serves as an electrode. Coating on the wire electrode allows for cooling of the wire electrode core and provides an improved cutting performance. The article “High-Performance Wire Electrodes for Wire Electrical-Discharge Machining-A Review” (J. of Engr. Manuf., 2012: 1757-1773) gave the following sample observations on total coating layer thickness (in $μ m$ ) of eight wire electrodes used for WEDM:

$2116293542242425$

Calculate a $99 % CI$ for the standard deviation of the coating layer thickness distribution. Is this interval valid whatever the nature of the distribution? Explain.

The article “Concrete Pressure on Formwork” (Mag. of Concrete Res., 2009: 407-417) gave the following observations on maximum concrete pressure $(kN / m^{2})$ : $33.2 41.8 37.3 40.2 36.7 39.1 36.2 41.8$ $36.0 35.2 36.7 38.9 35.8 35.2 40.1$

a. Is it plausible that this sample was selected from a normal population distribution?

b. Calculate an upper confidence bound with confidence level $95 %$ for the population standard deviation of maximum pressure.

Example 1.11 introduced the accompanying observations on bond strength.

11.5	12.1	9.9	9.3	7.8	6.2	6.6	7.0
13.4	17.1	9.3	5.6	5.7	5.4	5.2	5.1
4.9	10.7	15.2	8.5	4.2	4.0	3.9	3.8
3.6	3.4	20.6	25.5	13.8	12.6	13.1	8.9
8.2	10.7	14.2	7.6	5.2	5.5	5.1	5.0
5.2	4.8	4.1	3.8	3.7	3.6	3.6	3.6

a. Estimate true average bond strength in a way that conveys information about precision and reliability. [Hint: $\sum x_{i} = 387.8$ and $\sum x_{i}^{2} = 4247.08$ .]

b. Calculate a $95 % CI$ for the proportion of all such bonds whose strength values would exceed 10 .

The article “Distributions of Compressive Strength Obtained from Various Diameter Cores” (ACI Materials J., 2012: 597-606) described a study in which compressive strengths were determined for concrete specimens of various types, core diameters, and length-to-diameter ratios. For one particular type, diameter, and $l / d$ ratio, the 18 tested specimens resulted in a sample mean compressive strength of $64.41 MPa$ and a sample standard deviation of $10.32 MPa$ . Normality of the compressive strength distribution was judged to be quite plausible.

a. Calculate a confidence interval with confidence level $98 %$ for the true average compressive strength under these circumstances.

b. Calculate a $98 %$ lower prediction bound for the compressive strength of a single future specimen tested under the given circumstances. [Hint: $t_{.02, 17} =$ 2.224.]

For those of you who don’t already know, dragon boat racing is a competitive water sport that involves 20 paddlers propelling a boat across various race distances. It has become increasingly popular over the last few years. The article “Physiological and Physical Characteristics of Elite Dragon Boat Paddlers” (J. of Strength and Conditioning, 2013: 137-145) summarized an extensive statistical analysis of data obtained from a sample of 11 paddlers. It reported that a $95 %$ confidence interval for true average force $(N)$ during a simulated 200-m race was $(60.2, 70.6)$ . Obtain a $95 %$ prediction interval for the force of a single randomly selected dragon boat paddler undergoing the simulated race.
A journal article reports that a sample of size 5 was used as a basis for calculating a $95 % CI$ for the true average natural frequency $(Hz)$ of delaminated beams of a certain type. The resulting interval was (229.764, 233.504). You decide that a confidence level of $99 %$ is more appropriate than the $95 %$ level used. What are the limits of the $99 %$ interval? [Hint: Use the center of the interval and its width to determine $\overset{x}{ˉ}$ and $s$ .]
Unexplained respiratory symptoms reported by athletes are often incorrectly considered secondary to exercise-induced asthma. The article “High Prevalence of Exercise-Induced Laryngeal Obstruction in Athletes” (Medicine and Science in Sports and Exercise, 2013: 2030-2035) suggested that many such cases could instead be explained by obstruction of the larynx. In a sample of 88 athletes referred for an asthma workup, 31 were found to have the EILO condition.

a. Calculate and interpret a confidence interval using a $95 %$ confidence level for the true proportion of all athletes found to have the EILO condition under these circumstances.

b. What sample size is required if the desired width of the $95 % CI$ is to be at most .04, irrespective of the sample results?

c. Does the upper limit of the interval in (a) specify a $95 %$ upper confidence bound for the proportion being estimated? Explain.

High concentration of the toxic element arsenic is all too common in groundwater. The article “Evaluation of Treatment Systems for the Removal of Arsenic from Groundwater” (Practice Periodical of Hazardous, Toxic, and Radioactive Waste Mgmt., 2005: 152-157) reported that for a sample of $n = 5$ water specimens selected for treatment by coagulation, the sample mean arsenic concentration was $24.3 μ g / L$ , and the sample standard deviation was 4.1. The authors of the cited article used $t$ -based methods to analyze their data, so hopefully had reason to believe that the distribution of arsenic concentration was normal.

a. Calculate and interpret a $95 % CI$ for true average arsenic concentration in all such water specimens.

b. Calculate a $90 %$ upper confidence bound for the standard deviation of the arsenic concentration distribution.

c. Predict the arsenic concentration for a single water specimen in a way that conveys information about precision and reliability.

Aphid infestation of fruit trees can be controlled either by spraying with pesticide or by inundation with ladybugs. In a particular area, four different groves of fruit trees are selected for experimentation. The first three groves are sprayed with pesticides 1,2 , and 3 , respectively, and the

fourth is treated with ladybugs, with the following results on yield:

Treatment	${n}_{i} =$ Number of Trees	${\bar{x}}_{i}$ (Bushels/Tree)	${s}_{i}$
1	100	10.5	1.5
2	90	10.0	1.3
3	100	10.1	1.8
4	120	10.7	1.6

Let $μ_{i} =$ the true average yield (bushels/tree) after receiving the $i$ th treatment. Then

θ = \frac{1}{3} (μ_{1} + μ_{2} + μ_{3}) - μ_{4}

measures the difference in true average yields between treatment with pesticides and treatment with ladybugs. When $n_{1}, n_{2}, n_{3}$ , and $n_{4}$ are all large, the estimator $θ$ obtained by replacing each $μ_{i}$ by $\overset{ˉ}{X}_{i}$ is approximately normal. Use this to derive a large-sample $100 (1 - α) %$ CI for $θ$ , and compute the $95 %$ interval for the given data.

It is important that face masks used by firefighters be able to withstand high temperatures because firefighters commonly work in temperatures of $200 - 500^{\circ} F$ . In a test of one type of mask, 11 of 55 masks had lenses pop out at $250^{\circ}$ . Construct a $90 %$ upper confidence bound for the true proportion of masks of this type whose lenses would pop out at $250^{\circ}$ .
A manufacturer of college textbooks is interested in estimating the strength of the bindings produced by a particular binding machine. Strength can be measured by recording the force required to pull the pages from the binding. If this force is measured in pounds, how many books should be tested to estimate the average force required to break the binding to within $.1 lb$ with $95 %$ confidence? Assume that $σ$ is known to be .8 .
The accompanying data on crack initiation depth $(μ m)$ was read from a lognormal probability plot that appeared in the article “Incorporating Small Fatigue Crack Growth in Probabilistic Life Prediction: Effect of Stress Ratio in Ti-6Al-2Sn-6Mo” (Intl. J. of Fatigue, 2013: 83-95). Although the pattern in the plot was quite straight, a normal probability plot of the data also shows a reasonably linear pattern. And a boxplot indicates that the distribution is quite symmetric in the middle $50 %$ of the data and only mildly skewed overall. It is therefore reasonable to estimate and predict using $t$ intervals. $4.7 5.1 5.2 5.3 5.6 5.8 6.3 6.7$ $7.2 7.4 7.7 8.5 8.9 9.3 10.1 11.2$

a. Estimate the true average crack initiation depth with a $99 % CI$ and interpret the resulting interval.

b. Predict the value of a single crack initiation depth by constructing a $99 %$ PI.

c. Interpret in context the meaning of $99 %$ in (b).

In Example 6.8, we introduced the concept of a censored experiment in which $n$ components are put on test and the experiment terminates as soon as $r$ of the components have failed. Suppose component lifetimes are independent, each having an exponential distribution with parameter $λ$ . Let $Y_{1}$ denote the time at which the first failure occurs, $Y_{2}$ the time at which the second failure occurs, and so on, so that $T_{r} = Y_{1} + \dots + Y_{r} +$ $(n - r) Y_{r}$ is the total accumulated lifetime at termination. Then it can be shown that $2 λ T_{r}$ has a chi-squared distribution with $2 r df$ . Use this fact to develop a $100 (1 - α) %$ CI formula for true average lifetime $1/ λ$ . Compute a $95 %$ CI from the data in Example 6.8.
Let $X_{1}, X_{2}, \dots, X_{n}$ be a random sample from a continuous probability distribution having median $μ$ (so that $P (X_{i} \leq μ) = P (X_{i} \geq μ) = .5)$ .

a. Show that

P (min (X_{i}) < μ < max (X_{i})) = 1 - (\frac{1}{2})^{n - 1}

so that $(min (x_{i}), max (x_{i}))$ is a $100 (1 - α) %$ confidence interval for $μ$ with $α = (\frac{1}{2})^{n - 1}$ . [Hint: The complement of the event ${min (X_{i}) < μ < max (X_{i})}$ is ${max (X_{i}) \leq$ $μ} \cup {min (X_{i}) \geq μ}$ . But max $(X_{i}) \leq μ$ iff $X_{i} \leq μ$ for all $i$ .]

b. For each of six normal male infants, the amount of the amino acid alanine $(mg / 100 mL)$ was determined while the infants were on an isoleucine-free diet, resulting in the following data:2.70

Compute a $97 % CI$ for the true median amount of alanine for infants on such a diet (“The Essential Amino Acid Requirements of Infants,” Amer. J. of Nutrition, 1964: 322-330).

c. Let $x_{(2)}$ denote the second smallest of the $x_{i}$ ’s and $x_{(n - 1)}$ denote the second largest of the $x_{i}$ ’s. What is the confidence level of the interval $(x_{(2)}, x_{(n - 1)})$ for $μ$ ?

Let $X_{1}, X_{2}, \dots, X_{n}$ be a random sample from a uniform distribution on the interval $[0, θ]$ , so that

f (x) = {\frac{1}{θ} 0 0 \leq x \leq θ otherwise

Then if $Y = max (X_{i})$ , it can be shown that the rv $U = Y / θ$ has density function

f_{U} (u) = {n u^{n - 1} 0 0 \leq u \leq 1 otherwise

a. Use $f_{U} (u)$ to verify that

P ((α /2)^{1/ n} < \frac{Y}{θ} \leq (1 - α /2)^{1/ n}) = 1 - α

and use this to derive a $100 (1 - α) %$ CI for $θ$ .

b. Verify that $P (α^{1/ n} \leq Y / θ \leq 1) = 1 - α$ , and derive a $100 (1 - α) %$ CI for $θ$ based on this probability statement.

c. Which of the two intervals derived previously is shorter? If my waiting time for a morning bus is uniformly distributed and observed waiting times are $x_{1} = 4.2, x_{2} = 3.5, x_{3} = 1.7, x_{4} = 1.2$ , and $x_{5} = 2.4$ , derive a $95 % CI$ for $θ$ by using the shorter of the two intervals.

Let $0 \leq γ \leq α$ . Then a $100 (1 - α) %$ CI for $μ$ when $n$ is large is

(\overset{x}{ˉ} - z_{γ} \cdot \frac{s}{n}, \overset{x}{ˉ} + z_{α - γ} \cdot \frac{s}{n})

The choice $γ = α /2$ yields the usual interval derived in Section 7.2; if $γ \neq = α /2$ , this interval is not symmetric about $\overset{x}{ˉ}$ . The width of this interval is $w = s (z_{γ} + z_{α - γ}) / n$ . Show that $w$ is minimized for the choice $γ = α /2$ , so that the symmetric interval is the shortest. [Hints: (a) By definition of $z_{α}, Φ (z_{α}) = 1 - α$ , so that $z_{α} = Φ^{- 1} (1 - α)$ ; (b) the relationship between the derivative of a function $y = f (x)$ and the inverse function $x = f^{- 1} (y)$ is $(d / d y) f^{- 1} (y) = 1/ f^{'} (x) .]$

Suppose $x_{1}, x_{2}, \dots, x_{n}$ are observed values resulting from a random sample from a symmetric but possibly heavy-tailed distribution. Let $x$ and $f_{s}$ denote the sample median and fourth spread, respectively. Chapter 11 of Understanding Robust and Exploratory Data Analysis (see the bibliography in Chapter 6) suggests the following robust $95 % CI$ for the population mean (point of symmetry):

x \pm (\frac{conservative t critical value}{1.075}) \cdot \frac{f _{s}}{n}

The value of the quantity in parentheses is 2.10 for $n = 10, 1.94$ for $n = 20$ , and 1.91 for $n = 30$ . Compute this $CI$ for the data of Exercise 45, and compare to the $t CI$ appropriate for a normal population distribution.

a. Use the results of Example 7.5 to obtain a $95 %$ lower confidence bound for the parameter $λ$ of an exponential distribution, and calculate the bound based on the data given in the example.

b. If lifetime $X$ has an exponential distribution, the probability that lifetime exceeds $t$ is $P (X > t) = e^{- λ t}$ . Use the result of part (a) to obtain a $95 %$ lower confidence bound for the probability that breakdown time exceeds $100 min$ .

BIBLIOGRAPHY

DeGroot, Morris, and Mark Schervish, Probability and Statistics (4th ed.), Addison-Wesley, Upper Saddle River, $NJ, 2012$ . A very good exposition of the general principles of statistical inference.

Devore, Jay, and Kenneth Berk, Modern Mathematical Statistics with Applications, Springer, New York, 2012. The exposition is a bit more comprehensive and sophisticated

than that of the current book, and includes more material on bootstrapping.

Hahn, Gerald, and William Meeker, Statistical Intervals, Wiley, New York, 1991. Almost everything you ever wanted to know about statistical intervals (confidence, prediction, tolerance, and others).

Youliang Zhong

Table of Contents

Graph View

7 Statistical Intervals Based on a Single Sample

INTRODUCTION

7.1 Basic Properties of Confidence Intervals

Interpreting a Confidence Level

Other Levels of Confidence

DEFINITION

Confidence Level, Precision, and Sample Size

Deriving a Confidence Interval

Bootstrap Confidence Intervals

EXERCISES Section 7.1 (1-11)

7.2 Large-Sample Confidence Intervals for a Population Mean and Proportion

A Large-Sample Interval for $μ$

A General Large-Sample Confidence Interval

A Confidence Interval for a Population Proportion

One-Sided Confidence Intervals (Confidence Bounds)

EXERCISES Section 7.2 (12-27)

7.3 Intervals Based on a Normal Population Distribution

THEOREM

Properties of $t$ Distributions

Properties of $t$ Distributions

NOTATION

The One-Sample $t$ Confidence Interval

A Prediction Interval for a Single Future Value

Tolerance Intervals

Intervals Based on Nonnormal Population Distributions

7.4 Confidence Intervals for the Variance and Standard Deviation of a Normal Population

THEOREM

EXERCISES Section 7.4 (42-46)

BIBLIOGRAPHY

Youliang Zhong

Table of Contents

Graph View

7 Statistical Intervals Based on a Single Sample

INTRODUCTION

7.1 Basic Properties of Confidence Intervals

Interpreting a Confidence Level

Other Levels of Confidence

DEFINITION

Confidence Level, Precision, and Sample Size

Deriving a Confidence Interval

Bootstrap Confidence Intervals

EXERCISES Section 7.1 (1-11)

7.2 Large-Sample Confidence Intervals for a Population Mean and Proportion

A Large-Sample Interval for μ

A General Large-Sample Confidence Interval

A Confidence Interval for a Population Proportion

One-Sided Confidence Intervals (Confidence Bounds)

EXERCISES Section 7.2 (12-27)

7.3 Intervals Based on a Normal Population Distribution

THEOREM

Properties of t Distributions

Properties of t Distributions

NOTATION

The One-Sample t Confidence Interval

A Prediction Interval for a Single Future Value

Tolerance Intervals

Intervals Based on Nonnormal Population Distributions

7.4 Confidence Intervals for the Variance and Standard Deviation of a Normal Population

THEOREM

EXERCISES Section 7.4 (42-46)

BIBLIOGRAPHY

A Large-Sample Interval for $μ$

Properties of $t$ Distributions

Properties of $t$ Distributions

The One-Sample $t$ Confidence Interval