Range, quartiles and interquartile range (Edexcel GCSE Statistics): Revision Notes
Range, quartiles and interquartile range
When working with grouped data, you cannot determine the exact individual values. This means you can only calculate estimates of the range, quartiles, and interquartile range. Understanding these measures of spread is crucial for analysing how scattered your data is.
Understanding the range for grouped data
The range tells us about the spread of data from the smallest to the largest value. For grouped data, we need to think about the possible minimum and maximum values within each class interval.
Formula:
Method for calculating range
- Identify the smallest possible value: Look at the lowest class interval and find the minimum value that could occur
- Identify the largest possible value: Look at the highest class interval and find the maximum value that could occur
- Calculate the difference: Subtract the smallest from the largest
Worked Example: Newspaper reading habits
Consider this frequency table showing numbers of newspapers people read:
| Number of newspapers | Frequency |
|---|---|
| 0-7 | 5 |
| 8-14 | 16 |
| 15-21 | 22 |
| 22-50 | 12 |
Step 1: The smallest possible value is 0 (from the 0-7 class)
Step 2: The largest possible value is 50 (from the 22-50 class)
Step 3: Range = 50 - 0 = 50 newspapers
When data is rounded (like weights to the nearest kg), be extra careful about minimum and maximum possible values. For example, if weight satisfies , the actual minimum could be 45.5kg and maximum could be 65.5kg.
Finding quartiles for grouped data
Quartiles divide your data into four equal parts. Since we're working with grouped data, we need to use cumulative frequency and interpolation techniques.
The three quartiles
- (first quartile): The value below which 25% of the data falls
- (second quartile/median): The value below which 50% of the data falls
- (third quartile): The value below which 75% of the data falls
Step-by-step method for finding quartiles
- Create a cumulative frequency column: Add up frequencies as you go down the table
- Find the positions:
- position = (where is the total frequency)
- position =
- position =
- Locate the class intervals: Find which class contains each quartile position
- Use linear interpolation: Calculate the exact value within the class
Worked Example: Travel distances
| Distance, x (km) | Frequency | Cumulative frequency |
|---|---|---|
| 0 < x ≤ 5 | 34 | 34 |
| 5 < x ≤ 10 | 38 | 72 |
| 10 < x ≤ 15 | 22 | 94 |
| 15 < x ≤ 20 | 16 | 110 |
| 20 < x ≤ 25 | 10 | 120 |
Total frequency () = 120
Finding :
- position = th value
- The 30th value falls in the 0 < x ≤ 5 class (cumulative frequency goes up to 34)
- Since we need the 30th value out of 34 in this class:
Finding :
- position = th value
- The 60th value falls in the 5 < x ≤ 10 class (cumulative frequency goes from 34 to 72)
Finding :
- position = th value
- The 90th value falls in the 10 < x ≤ 15 class
Linear interpolation formula
When using linear interpolation to find quartiles, use this formula:
Where:
- = lower boundary of the class containing the quartile
- position = the quartile position (, , or )
- = cumulative frequency before the quartile class
- = frequency of the quartile class
- = class width
Calculating the interquartile range
The interquartile range (IQR) measures the spread of the middle 50% of your data.
Formula:
This is particularly useful because it's not affected by extreme values (outliers), making it a robust measure of spread.
Using our travel distance example:
Common exam traps and tips
Trap 1: Forgetting that grouped data gives estimates, not exact values Solution: Always use words like "estimate" in your answers
Trap 2: Using the wrong quartile positions Solution: Remember - is , is , is
Trap 3: Incorrect interpolation when the quartile position falls exactly on a cumulative frequency boundary Solution: Check your cumulative frequency column carefully and ensure you're interpolating within the correct class
Trap 4: Mixing up class boundaries with class values Solution: Pay close attention to whether inequalities are strict (< or >) or inclusive (≤ or ≥)
Tip: Always show your working clearly, especially for the interpolation steps - this often earns method marks even if your final answer isn't perfect.
Key Points to Remember:
- Range for grouped data = largest possible value - smallest possible value (these are estimates)
- Quartiles divide data into quarters - (25%), (50%), (75%)
- Use cumulative frequency to locate which class contains each quartile
- Linear interpolation helps you find the exact quartile value within its class
- IQR = gives you the spread of the middle 50% of data