5.3.1 Random Samples

The probability distribution of any particular statistic depends not only on the population distribution (normal, uniform, etc.) and the sample size $n$ but also on the method of sampling. Consider selecting a sample of size $n = 2$ from a population consisting of just the three values 1, 5, and 10, and suppose that the statistic of interest is the sample variance. If sampling is done “with replacement,” then $S^{2} = 0$ will result if $X_{1} = X_{2}$ . However, $S^{2}$ cannot equal 0 if sampling is “without replacement.” So $P (S^{2} = 0) = 0$ for one sampling method, and this probability is positive for the other method. Our next definition describes a sampling method often encountered (at least approximately) in practice.

random sample

The rv’s $X_{1}, X_{2}, \dots, X_{n}$ are said to form a (simple) random sample of size $n$ if

The $X_{i}$ ’s are independent rv’s.

Every $X_{i}$ has the same probability distribution.

Conditions 1 and 2 can be paraphrased by saying that the $X_{i}$ ’s are independent and identically distributed (iid). If sampling is either with replacement or from an infinite (conceptual) population, Conditions 1 and 2 are satisfied exactly. These conditions will be approximately satisfied if sampling is without replacement, yet the sample size $n$ is much smaller than the population size $N$ . In practice, if $n / N \leq .05$ (at most $5%$ of the population is sampled), we can proceed as if the $X_{i}$ ’s form a random sample. The virtue of such random sampling is that the probability distribution of any statistic can be more easily obtained than for any other sampling procedure.

There are two general methods for obtaining information about a statistic’s sampling distribution. One method involves calculations based on probability rules, and the other involves carrying out a simulation experiment.

Youliang Zhong

Backlinks

Graph View

5.3.1 Random Samples