Sampling Without Replacement Revision Notes for VCE SSCE Mathematical Methods

Sampling Without Replacement

What is sampling without replacement?

When we sample without replacement, we select items from a group and don't put them back before making the next selection. This is different from sampling with replacement, where items are returned to the group after each selection.

The key feature of sampling without replacement is that the probability of selecting a particular item changes after each selection. This happens because the composition of the group changes as items are removed.

Note

In sampling with replacement, probabilities stay constant because the population doesn't change between selections, so successive selections are independent. In sampling without replacement, each probability depends on what was selected earlier, so successive selections are dependent events.

Understanding through an example

Let's explore this concept with a practical example.

Imagine a jar containing three mints and four toffees (seven lollies in total). Bob selects two lollies from the jar without looking, and without replacing the first one before selecting the second.

Let $X$ represent the number of mints Bob selects. The random variable $X$ can take the values $0$ , $1$ , or $2$ .

Initially, the probability that Bob selects a mint is $\frac{3}{7}$ , and the probability he selects a toffee is $\frac{4}{7}$ .

However, when Bob makes his second selection, only six lollies remain. The probability of selecting a mint or toffee on the second draw depends entirely on what he selected first. This dependency is the hallmark of sampling without replacement.
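The dependence described above can be made concrete with a short sketch. The helper name `second_draw_probs` and its parameters are illustrative assumptions rather than part of these notes; it uses Python's `fractions.Fraction` to keep the values exact.

```python
from fractions import Fraction

def second_draw_probs(mints, toffees, first):
    """Return (Pr(mint), Pr(toffee)) for the second draw, given the first.

    `first` is "mint" or "toffee". Hypothetical helper for illustration only.
    """
    if first == "mint":
        mints -= 1          # one mint has left the jar
    elif first == "toffee":
        toffees -= 1        # one toffee has left the jar
    else:
        raise ValueError("first must be 'mint' or 'toffee'")
    remaining = mints + toffees
    return Fraction(mints, remaining), Fraction(toffees, remaining)

# Bob's jar: 3 mints and 4 toffees
after_mint = second_draw_probs(3, 4, "mint")      # 2 mints, 4 toffees remain
after_toffee = second_draw_probs(3, 4, "toffee")  # 3 mints, 3 toffees remain
print(after_mint)
print(after_toffee)
```

`Fraction` reduces automatically, so $\frac{2}{6}$ and $\frac{4}{6}$ appear as $\frac{1}{3}$ and $\frac{2}{3}$ . The two results differ, which is exactly what it means for the second draw to depend on the first.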

Method 1: Using a tree diagram

We can visualize this problem using a tree diagram, which shows all possible outcomes across the two selections.

The tree diagram branches out from the first selection to show the second selection. Notice how the probabilities on the second set of branches change depending on the first selection:

If Bob selects a toffee first (probability $\frac{4}{7}$ ), then three toffees and three mints remain, giving probabilities $\frac{3}{6}$ for each on the second selection.
If Bob selects a mint first (probability $\frac{3}{7}$ ), then four toffees and two mints remain, giving probabilities $\frac{4}{6}$ and $\frac{2}{6}$ respectively.

Since this involves a sequence of two dependent trials, we use the multiplication rule to find the probability of each complete outcome. We multiply along the branches:

For $X = 0$ (no mints selected, i.e., two toffees):

$$\Pr(X = 0) = \frac{4}{7} \times \frac{3}{6} = \frac{12}{42} = \frac{2}{7}$$

For $X = 1$ (one mint selected):

This can happen in two ways: toffee then mint, or mint then toffee. We add these probabilities:

$$\Pr(X = 1) = \left(\frac{4}{7} \times \frac{3}{6}\right) + \left(\frac{3}{7} \times \frac{4}{6}\right) = \frac{2}{7} + \frac{2}{7} = \frac{4}{7}$$

For $X = 2$ (two mints selected):

$$\Pr(X = 2) = \frac{3}{7} \times \frac{2}{6} = \frac{6}{42} = \frac{1}{7}$$

Note

Tree diagram strategy:

Multiply along the branches to find the probability of a complete path
Add across different paths that lead to the same outcome
This approach works well for problems with a small number of selections
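The tree diagram strategy can also be sketched in code. This is one possible illustration, not a prescribed method: the function name `tree_distribution` is an assumption, and the recursion simply follows every branch, multiplying along each path and adding paths that give the same number of mints.

```python
from fractions import Fraction
from collections import defaultdict

def tree_distribution(mints, toffees, draws):
    """Walk every path of the tree and total the probability for each mint count."""
    dist = defaultdict(Fraction)  # missing keys start at Fraction(0)

    def walk(m, t, left, prob, mints_drawn):
        if left == 0:
            dist[mints_drawn] += prob  # add across paths with the same outcome
            return
        total = m + t
        if m > 0:  # branch: draw a mint, multiply along the branch
            walk(m - 1, t, left - 1, prob * Fraction(m, total), mints_drawn + 1)
        if t > 0:  # branch: draw a toffee, multiply along the branch
            walk(m, t - 1, left - 1, prob * Fraction(t, total), mints_drawn)

    walk(mints, toffees, draws, Fraction(1), 0)
    return dict(sorted(dist.items()))

jar = tree_distribution(3, 4, 2)
print(jar)
```

For Bob's jar this reproduces the values found by hand: $\frac{2}{7}$ , $\frac{4}{7}$ and $\frac{1}{7}$ for $X = 0$ , $1$ and $2$ . The number of paths grows quickly with the number of draws, which is why the tree approach suits only small problems.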

Method 2: Using combinations

For larger problems, drawing a complete tree diagram becomes impractical. Fortunately, we can calculate the same probabilities using combinations.

Important

Notation note: The binomial coefficient $\binom{n}{r}$ (read as "n choose r") represents the number of ways to select $r$ objects from $n$ distinct objects when the order of selection does not matter. This is the same as the $^n C_r$ notation you may have seen previously.

Because every possible selection is equally likely, the general approach is:

$$\Pr(X = x) = \frac{\text{Number of favourable outcomes}}{\text{Total number of possible outcomes}}$$

For $X = 0$ (no mints, so two toffees):

Number of ways to select $0$ mints from $3$ available: $\binom{3}{0} = 1$

Number of ways to select $2$ toffees from $4$ available: $\binom{4}{2} = 6$

Total ways to select $2$ lollies from $7$ : $\binom{7}{2} = 21$

Therefore:

$$\Pr(X = 0) = \frac{\binom{3}{0} \times \binom{4}{2}}{\binom{7}{2}} = \frac{1 \times 6}{21} = \frac{6}{21} = \frac{2}{7}$$

For $X = 1$ (one mint, one toffee):

$$\Pr(X = 1) = \frac{\binom{3}{1} \times \binom{4}{1}}{\binom{7}{2}} = \frac{3 \times 4}{21} = \frac{12}{21} = \frac{4}{7}$$

For $X = 2$ (two mints):

$$\Pr(X = 2) = \frac{\binom{3}{2} \times \binom{4}{0}}{\binom{7}{2}} = \frac{3 \times 1}{21} = \frac{3}{21} = \frac{1}{7}$$

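The complete probability distribution for $X$ is:

$$\begin{array}{c|ccc} x & 0 & 1 & 2 \\ \hline \Pr(X = x) & \frac{2}{7} & \frac{4}{7} & \frac{1}{7} \end{array}$$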

Important

Verification check: Notice that the probabilities must sum to $1$ :

$$\frac{2}{7} + \frac{4}{7} + \frac{1}{7} = \frac{7}{7} = 1$$

Always verify that your probability distribution sums to exactly 1. This check catches many arithmetic slips, although a correct total does not by itself guarantee that each individual probability is right.
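The combination method and the sum-to-1 check can be sketched together as follows. The function name `combination_distribution` is an illustrative assumption; `math.comb` is the standard-library binomial coefficient (Python 3.8 or later).

```python
from fractions import Fraction
from math import comb

def combination_distribution(mints, toffees, draws):
    """Return {x: Pr(X = x)} where X is the number of mints among `draws` lollies."""
    total_ways = comb(mints + toffees, draws)  # all possible selections
    return {
        x: Fraction(comb(mints, x) * comb(toffees, draws - x), total_ways)
        for x in range(0, min(mints, draws) + 1)
        if draws - x <= toffees  # skip values of x that would need too many toffees
    }

dist = combination_distribution(3, 4, 2)
assert sum(dist.values()) == 1  # verification check
print(dist)
```

Using exact fractions rather than decimals means the sum-to-1 check can be an exact comparison instead of a rounded one.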

The hypergeometric distribution

The type of probability distribution we've just explored is called the hypergeometric distribution. This distribution arises whenever we sample without replacement from a finite population containing two distinct types of items.

The hypergeometric distribution is useful wherever sampling without replacement occurs naturally, for example in quality control (counting defective items in a sample taken from a batch) and in ecological studies (counting tagged animals in a recaptured sample).

Note

The hypergeometric distribution is characterized by:

Sampling from a finite population
Two distinct categories or types within the population
Sampling without replacement
Interest in the number of items from one category in the sample
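In symbols, the pattern from the lolly example can be written generally. The letters below are a labelling choice made for this sketch: $a$ is the number of items of the type being counted, $b$ is the number of items of the other type, $N = a + b$ is the population size, and $n$ is the sample size.

```latex
\Pr(X = x) = \frac{\binom{a}{x} \binom{b}{n - x}}{\binom{N}{n}},
\qquad N = a + b,
\qquad \max(0,\, n - b) \le x \le \min(n,\, a)
```

For Bob's jar, $a = 3$ , $b = 4$ , $N = 7$ and $n = 2$ , so $x$ ranges over $0$ , $1$ and $2$ , matching the values calculated earlier.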

Worked example: tagged dolphins

Example

Worked Example: Marine Biology Study

Marine biologists are studying a group of dolphins in a small bay. They know there are $12$ dolphins in total. Four dolphins have been caught, tagged, and released back into the population.

The researchers return the following week and catch a sample of three dolphins. What is the probability that exactly two of these three dolphins are already tagged?

Solution:

Let $X$ represent the number of tagged dolphins in the sample of three.

We need to find $\Pr(X = 2)$ .

Step 1: Identify the groups

Total population: $12$ dolphins
Tagged dolphins: $4$
Non-tagged dolphins: $8$
Sample size: $3$ dolphins
We want: exactly $2$ tagged dolphins

Step 2: Set up the combination formula

We're selecting $2$ tagged dolphins from the $4$ available tagged dolphins, and $1$ non-tagged dolphin from the $8$ non-tagged dolphins:

$$\Pr(X = 2) = \frac{\binom{4}{2} \times \binom{8}{1}}{\binom{12}{3}}$$

Step 3: Calculate the combinations

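$$\binom{4}{2} = 6, \qquad \binom{8}{1} = 8, \qquad \binom{12}{3} = \frac{12 \times 11 \times 10}{3 \times 2 \times 1} = 220$$

$$\Pr(X = 2) = \frac{6 \times 8}{220}$$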

Step 4: Simplify

$$\Pr(X = 2) = \frac{48}{220} = \frac{12}{55}$$

Answer: The probability that exactly two of the three caught dolphins are already tagged is $\frac{12}{55}$ (approximately $0.218$ or $21.8\%$ ).
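As a cross-check, the same answer can be reached with the multiplication rule from the tree diagram method. In this sketch, T stands for a tagged dolphin and N for a non-tagged dolphin (labels introduced here for convenience). There are three orders in which exactly two tagged dolphins can be caught (T, T, N; T, N, T; N, T, T), and each order has the same probability because the same numerators and denominators appear in a different arrangement.

```latex
\begin{aligned}
\Pr(\text{T, T, N}) &= \frac{4}{12} \times \frac{3}{11} \times \frac{8}{10} = \frac{96}{1320} = \frac{4}{55} \\
\Pr(X = 2) &= 3 \times \frac{4}{55} = \frac{12}{55}
\end{aligned}
```

Both methods agree, which is a useful way to confirm a result when the sample is small enough to list the orders.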

Remember!

Summary

Key Points to Remember:

Sampling without replacement means items are not returned to the group after selection, causing probabilities to change with each draw.
Tree diagrams work well for small problems with few stages. Multiply along branches and add across different paths leading to the same outcome.
The combination method is more efficient for larger problems. Use $\frac{\binom{a}{x} \times \binom{b}{n-x}}{\binom{N}{n}}$ where the population of $N = a + b$ items contains $a$ items of the type being counted and $b$ items of the other type, $n$ is the sample size, and you are selecting $x$ items from the first group and $(n-x)$ items from the second.
The hypergeometric distribution describes the probability distribution when sampling without replacement from a population with two distinct types.
Always check that your probabilities sum to 1 as a verification of your calculations.
