98
Let the time it takes a read/write head to locate a desired record on a computer disk memory device once the head has been positioned over the correct track. If the disks rotate once every 25 millisec, a reasonable assumption is that is uniformly distributed on the interval .
a. Compute .
b. Compute .
c. Obtain the cdf .
d. Compute and .
99
A 12-in. bar that is clamped at both ends is to be subjected to an increasing amount of stress until it snaps. Let the distance from the left end at which the break occurs. Suppose has pdf
Compute the following:
a. The cdf of , and graph it.
b. , and
c. , and
d. The probability that the break point occurs more than 2 in. from the expected break point.
e. The expected length of the shorter segment when the break occurs.
100
Let denote the time to failure (in years) of a certain hydraulic component. Suppose the pdf of is for .
a. Verify that is a legitimate pdf.
b. Determine the cdf.
c. Use the result of part (b) to calculate the probability that time to failure is between 2 and 5 years.
d. What is the expected time to failure?
e. If the component has a salvage value equal to when its time to failure is , what is the expected salvage value?
101
The completion time for a certain task has cdf given by
a. Obtain the and sketch its graph.
b. Compute .
c. Compute .
102
Let represent the number of individuals who respond to a particular online coupon offer. Suppose that has approximately a Weibull distribution with and . Calculate the best possible approximation to the probability that is between 15 and 20, inclusive.
103
The article “Computer Assisted Net Weight Control” (Quality Progress, 1983: 22-25) suggests a normal distribution with mean 137.2 oz and standard deviation 1.6 oz for the actual contents of jars of a certain type. The stated contents was .
a. What is the probability that a single jar contains more than the stated contents?
b. Among ten randomly selected jars, what is the probability that at least eight contain more than the stated contents?
c. Assuming that the mean remains at 137.2, to what value would the standard deviation have to be changed so that of all jars contain more than the stated contents?
104
When circuit boards used in the manufacture of compact disc players are tested, the long-run percentage of defectives is . Suppose that a batch of 250 boards has been received and that the condition of any particular board is independent of that of any other board.
a. What is the approximate probability that at least of the boards in the batch are defective?
b. What is the approximate probability that there are exactly 10 defectives in the batch?
105
Exercise 38 introduced two machines that produce wine corks, the first one having a normal diameter distribution with mean value and standard deviation , and the second having a normal diameter distribution with mean value and standard deviation . Acceptable corks have diameters between 2.9 and . If of all corks used come from the first machine and a randomly selected cork is found to be acceptable, what is the probability that it was produced by the first machine?
106
The reaction time (in seconds) to a certain stimulus is a continuous random variable with pdf
a. Obtain the cdf.
b. What is the probability that reaction time is at most 2.5 sec? Between 1.5 and 2.5 sec?
c. Compute the expected reaction time.
d. Compute the standard deviation of reaction time.
e. If an individual takes more than 1.5 sec to react, a light comes on and stays on either until one further second has elapsed or until the person reacts (whichever happens first). Determine the expected amount of time that the light remains lit.
Hint: Let the time that the light is on as a function of reaction time .
107
Let denote the temperature at which a certain chemical reaction takes place. Suppose that has pdf
a. Sketch the graph of .
b. Determine the cdf and sketch it.
c. Is 0 the median temperature at which the reaction takes place? If not, is the median temperature smaller or larger than 0 ?
d. Suppose this reaction is independently carried out once in each of ten different labs and that the pdf of reaction time in each lab is as given. Let the number among the ten labs at which the temperature exceeds 1. What kind of distribution does have? (Give the names and values of any parameters.)
108
An oocyte is a female germ cell involved in reproduction. Based on analyses of a large sample, the article “Reproductive Traits of Pioneer Gastropod Species Colonizing Deep-Sea Hydrothermal Vents After an Eruption” (Marine Biology, 2011: 181-192) proposed the following mixture of normal distributions as a model for the distribution of oocyte diameter :
where and are normal pdfs. Suggested parameter values were , and .
a. What is the expected (i.e. mean) value of oocyte diameter?
b. What is the probability that oocyte diameter is between and ?
- Hint: Write an expression for the corresponding integral, carry the integral operation through to the two components, and then use the fact that each component is a normal pdf.
c. What is the probability that oocyte diameter is smaller than its mean value? What does this imply about the shape of the density curve?
109
The article “The Prediction of Corrosion by Statistical Analysis of Corrosion Profiles” (Corrosion Science, 1985: 305-315) suggests the following cdf for the depth of the deepest pit in an experiment involving the exposure of carbon manganese steel to acidified seawater.
The authors propose the values and . Assume this to be the correct model.
a. What is the probability that the depth of the deepest pit is at most 150 ? At most 300 ? Between 150 and 300 ?
b. Below what value will the depth of the maximum pit be observed in of all such experiments?
c. What is the density function of ?
d. The density function can be shown to be unimodal (a single peak). Above what value on the measurement axis does this peak occur? (This value is the mode.)
e. It can be shown that . What is the mean for the given values of and , and how does it compare to the median and mode? Sketch the graph of the density function.
- Note: This is called the largest extreme value distribution.
110
Let the amount of sales tax a retailer owes the government for a certain period. The article “Statistical Sampling in Tax Audits” (Statistics and the Law, 2008: 320-343) proposes modeling the uncertainty in by regarding it as a normally distributed random variable with mean value and standard deviation (in the article, these two parameters are estimated from the results of a tax audit involving sampled transactions). If represents the amount the retailer is assessed, then an under-assessment results if and an over-assessment results if . The proposed penalty (i.e., loss) function for over- or under-assessment is if and if is suggested to incorporate the idea that over-assessment is more serious than under-assessment).
a. Show that is the value of that minimizes the expected loss, where is the inverse function of the standard normal cdf.
b. If (suggested in the article), \mu = \ {100},{000}\sigma = $ {10},{000}a$ , and what is the resulting probability of over-assessment?
111
The mode of a continuous distribution is the value that maximizes .
a. What is the mode of a normal distribution with parameters and ?
b. Does the uniform distribution with parameters and have a single mode? Why or why not?
c. What is the mode of an exponential distribution with parameter ? (Draw a picture.)
d. If has a gamma distribution with parameters and , and , find the mode.
- Hint: will be maximized iff is, and it may be simpler to take the derivative of .
e. What is the mode of a chi-squared distribution having degrees of freedom?
112
The article “Error Distribution in Navigation” (J. of the Institute of Navigation, 1971: 429-442) suggests that the frequency distribution of positive errors (magnitudes of errors) is well approximated by an exponential distribution. Let the lateral position error (nautical miles), which can be either negative or positive. Suppose the pdf of is
a. Sketch a graph of and verify that is a legitimate pdf (show that it integrates to 1).
b. Obtain the cdf of and sketch it.
c. Compute , and the probability that an error of more than 2 miles is made.
113
The article “Statistical Behavior Modeling for Driver-Adaptive Precrash Systems” (IEEE Trans. on Intelligent Transp. Systems, 2013: 1-9) proposed the following mixture of two exponential distributions for modeling the behavior of what the authors called “the criticality level of a situation” .
This is often called the hyperexponential or mixed exponential distribution. This distribution is also proposed as a model for rainfall amount in “Modeling Monsoon Affected Rainfall of Pakistan by Point Processes” (J. of Water Resources Planning and Mgmnt., 1992: 671-688).
a. Determine and . Hint: For distributed exponentially, and ; what does this imply about ?
b. Determine the cdf of .
c. If , and (values of the ’s suggested in the cited article), calculate .
d. For the parameter values given in (c), what is the probability that is within one standard deviation of its mean value?
e. The coefficient of variation of a random variable (or distribution) is . What is for an exponential rv? What can you say about the value of when has a hyperexponential distribution?
f. What is for an Erlang distribution with parameters and as defined in Exercise 68?
- Note: In applied work, the sample is used to decide which of the three distributions might be appropriate.
114
Suppose a particular state allows individuals filing tax returns to itemize deductions only if the total of all itemized deductions is at least \ {5000}XX$ has the pdf
a. Find the value of . What restriction on is necessary?
b. What is the cdf of ?
c. What is the expected total deduction on a randomly chosen form? What restriction on is necessary for to be finite?
d. Show that has an exponential distribution with parameter .
115
Let be the input current to a transistor and be the output current. Then the current gain is proportional to . Suppose the constant of proportionality is 1
(which amounts to choosing a particular unit of measurement), so that current gain . Assume is normally distributed with and .
a. What type of distribution does the ratio have?
b. What is the probability that the output current is more than twice the input current?
c. What are the expected value and variance of the ratio of output to input current?
116
The article “Response of Composites Under Static and Cyclic Loading-An Experimental and Statistical Analysis” (J. of Engr. Materials and Technology, 1997: 186-193) suggests that tensile strength (MPa) of composites under specified conditions can be modeled by a Weibull distribution with and .
a. Sketch a graph of the density function.
b. What is the probability that the strength of a randomly selected specimen will exceed 175 ? Will be between 150 and 175 ?
c. If two randomly selected specimens are chosen and their strengths are independent of one another, what is the probability that at least one has a strength between 150 and 175 ?
d. What strength value separates the weakest of all specimens from the remaining ?
117
Let have a standard normal distribution and define a new rv by . Show that has a normal distribution with parameters and .
- Hint: iff ? Use this to find the cdf of and then differentiate it with respect to .
118
a. Suppose the lifetime of a component, when measured in hours, has a gamma distribution with parameters and . Let the lifetime measured in minutes. Derive the pdf of .
- Hint: iff . Use this to obtain the cdf of and then differentiate to obtain the pdf.
b. If has a gamma distribution with parameters and , what is the probability distribution of ?
119
In Exercises 117 and 118, as well as many other situations, one has the of and wishes to know the pdf of . Assume that is an invertible function, so that can be solved for to yield . Then it can be shown that the pdf of is
a. If has a uniform distribution with and , derive the pdf of .
b. Work Exercise 117, using this result.
c. Work Exercise 118(b), using this result.
120
Based on data from a dart-throwing experiment, the article “Shooting Darts” (Chance, Summer 1997, 16-19) proposed that the horizontal and vertical errors from aiming at a point target should be independent of one another, each with a normal distribution having mean 0 and variance . It can then be shown that the pdf of the distance from the target to the landing point is
a. This pdf is a member of what family introduced in this chapter?
b. If (close to the value suggested in the paper), what is the probability that a dart will land within (roughly 1 in.) of the target?
121
The article “Three Sisters Give Birth on the Same Day” (Chance, Spring 2001, 23-25) used the fact that three Utah sisters had all given birth on March 11, 1998 as a basis for posing some interesting questions regarding birth coincidences.
a. Disregarding leap year and assuming that the other 365 days are equally likely, what is the probability that three randomly selected births all occur on March 11? Be sure to indicate what, if any, extra assumptions you are making.
b. With the assumptions used in part (a), what is the probability that three randomly selected births all occur on the same day?
c. The author suggested that, based on extensive data, the length of gestation (time between conception and birth) could be modeled as having a normal distribution with mean value 280 days and standard deviation 19.88 days. The due dates for the three Utah sisters were March 15, April 1, and April 4, respectively. Assuming that all three due dates are at the mean of the distribution, what is the probability that all births occurred on March 11?
- Hint: The deviation of birth date from due date is normally distributed with mean 0.
d. Explain how you would use the information in part (c) to calculate the probability of a common birth date.
122
Let denote the lifetime of a component, with and the pdf and cdf of . The probability that the component fails in the interval is approximately . The conditional probability that it fails in given that it has lasted at least is . Dividing this by produces the failure rate function:
An increasing failure rate function indicates that older components are increasingly likely to wear out, whereas a decreasing failure rate is evidence of increasing reliability with age. In practice, a “bathtub-shaped” failure is often assumed.
a. If is exponentially distributed, what is ?
b. If has a Weibull distribution with parameters and , what is ? For what parameter values will be increasing? For what parameter values will decrease with ?
c. Since . Suppose
so that if a component lasts hours, it will last forever (while seemingly unreasonable, this model can be used to study just “initial wearout”). What are the cdf and pdf of ?
123
Let have a uniform distribution on the interval . Then observed values having this distribution can be obtained from a computer’s random number generator. Let .
a. Show that has an exponential distribution with parameter .
- Hint: The cdf of is ; is equivalent to ?
b. How would you use part (a) and a random number generator to obtain observed values from an exponential distribution with parameter ?
124
Consider an with mean and standard deviation , and let be a specified function of . The first-order Taylor series approximation to in the neighborhood of is
The right-hand side of this equation is a linear function of . If the distribution of is concentrated in an interval over which is approximately linear
- e.g., is approximately linear in , then the equation yields approximations to and .
a. Give expressions for these approximations.
- Hint: Use rules of expected value and variance for a linear function .
b. If the voltage across a medium is fixed but current is random, then resistance will also be a random variable related to by . If and , calculate approximations to and .
125
A function is convex if the chord connecting any two points on the function’s graph lies above the graph. When is differentiable, an equivalent condition is that for every , the tangent line at lies entirely on or below the graph. (See the figure below.) How does compare to ?
- Hint: The equation of the tangent line at is . Use the condition of convexity, substitute for , and take expected values.
- Note: Unless is linear, the resulting inequality (usually called Jensen’s inequality) is strict ; it is valid for both continuous and discrete rv’s.
126
Let have a Weibull distribution with parameters and . Show that has a chi-squared distribution with .
- Hint: The cdf of is ; express this probability in the form , use the fact that has a cdf of the form in Expression (4.12), and differentiate with respect to to obtain the pdf of .
127
An individual’s credit score is a number calculated based on that person’s credit history that helps a lender determine how much he/she should be loaned or what credit limit should be established for a credit card. An article in the Los Angeles Times gave data which suggested that a beta distribution with parameters , would provide a reasonable approximation to the distribution of American credit scores.
- Note: credit scores are integer-valued
a. Let represent a randomly selected American credit score. What are the mean value and standard deviation of this random variable? What is the probability that is within 1 standard deviation of its mean value?
b. What is the approximate probability that a randomly selected score will exceed 750 (which lenders consider a very good score)?
128
Let denote rainfall volume and denote runoff volume (both in mm). According to the article “Runoff Quality Analysis of Urban Catchments with Analytical Probability Models” (J. of Water Resource Planning and Management, 2006: 4-14), the runoff volume will be 0 if and will be if . Here is the volume of depression storage (a constant), and (also a constant) is the runoff coefficient. The cited article proposes an exponential distribution with parameter for .
a. Obtain an expression for the cdf of .
- Note: is neither purely continuous nor purely discrete; instead it has a “mixed” distribution with a discrete component at 0 and is continuous for values .
b. What is the pdf of for ? Use this to obtain an expression for the expected value of runoff volume.