Averages (OCR GCSE Maths): Revision Notes
Averages and Measures of Spread
Introduction:
Averages and measures of spread help us to summarise large sets of data and compare different data sets. The most common types of averages are mean, median, and mode. Each type has its own strengths and is used in different situations.
1. The Mean
The mean is what most people commonly refer to as the "average." It is calculated by adding up all the values in a data set and then dividing by the total number of values.
How to Work Out the Mean:
- Add up all the values in the data set.
- Divide the total by the number of data values.
Example 1: Finding the Mean
Question: Find the mean of the following set of numbers:
Solution:
- Add up all the values:
- Divide the total by the number of values:
The mean is .
2. The Median
The median is the middle value in a data set when all the values are arranged in ascending order (from smallest to largest). If there is an even number of values, the median is the average of the two middle numbers.
How to Work Out the Median:
- Arrange the data values in ascending order.
- Find the middle value. If there are two middle values, find their mean.
Example 2: Finding the Median
Question: Find the median of the following set of numbers:
Solution:
-
Arrange the numbers in ascending order:
-
The middle value is 5, so the median is .
Note: If the data set has an even number of values, you find the mean of the two middle numbers.
Example 3: Median with Even Number of Values
Question: Find the median of the following set of numbers:
Solution:
-
Arrange the numbers in ascending order:
-
The middle values are and . Find the mean of these two values:
The median is .
3. The Mode
The mode is the value that appears the most frequently in a data set. A data set may have more than one mode if multiple values occur with the same highest frequency. If no value repeats, there is no mode.
How to Work Out the Mode:
- Identify the most frequent value(s) in the data set.
Example 4: Finding the Mode
Question: Find the mode of the following set of numbers: .
Solution:
- The number appears three times, more frequently than any other number. The mode is .
4. The Range
The range measures the spread of the data by finding the difference between the largest and smallest values. A large range indicates that the data varies greatly, while a small range suggests that the data is more consistent
Example: Finding the Range
Question: Find the range of the following set of numbers: .
Solution:
- The largest value is .
- The smallest value is .
- Subtract the smallest value from the largest value:
The range is .
Why Is the Range Important?
The range gives us an idea of how spread out the data is. A large range indicates that the values in the data set vary significantly, while a small range means the values are clustered closer together.
Worked Example (The Big Cricket Example):
Example
Andrew Flintoff and Michael Vaughan are comparing their cricket performances. Their scores are as follows:
- Andrew Flintoff's Scores:

- Michael Vaughan's Scores:

We can calculate the Mean for both cricketers.
1. The Mean
Step 1: Add up all their scores.
- Andrew Flintoff's Total Runs:
- Michael Vaughan's Total Runs:
Step 2: Divide the total by the number of games.
-
Andrew Flintoff's Games Played: games.
-
Michael Vaughan's Games Played: games.
What Does This Tell Us?
- On average, Michael Vaughan has scored runs per game, while Andrew Flintoff has scored runs.
- It looks like Michael Vaughan has had the better season, based on the mean alone.
2. The Median
How to Work Out the Median:
- Arrange the data values in ascending order.
- Identify the middle value. If there are two middle values (even number of data points), find the mean of those two values.
Example: Comparing Cricketers Using the Median
Let's find the median for each cricketer's scores:
Andrew Flintoff's Scores:
- The median is the middle value. Since there are scores, the median is the value:
Michael Vaughan's Scores:
- The median is the mean of the and values:
What does this tell us?
The median shows a different picture. Michael Vaughan's median score is , which is much lower than Andrew Flintoff's median of . This suggests that Michael Vaughan was much more inconsistent, scoring very low in most games but achieving a couple of exceptionally high scores.
- Good thing about the median: It is not affected by outliers, so it gives a better sense of typical performance.
- Bad thing about the median: It doesn't consider all the data points, only focusing on the middle, which can ignore important information about the rest of the data.
3. The Mode
How to Work Out the Mode:
- Identify the most common value in the data set.
- This value is the mode.
Example: Comparing Cricketers Using the Mode
Andrew Flintoff's Mode:
-
The most frequent score is 32, which appears multiple times in his data set.
-
Mode = Michael Vaughan's Mode:
-
The most frequent score is , which appears the most often in his data set.
-
Mode = What does this tell us?
Once again, using the mode suggests that Andrew Flintoff had the better season. His most frequent score was , which shows that he consistently scored decent runs. Michael Vaughan, however, had 0 as his mode, indicating that he often scored no runs in many games.
- Good thing about the mode: It's very quick and easy to find.
- Bad thing about the mode: The mode can sometimes be misleading, especially when data is skewed or has outliers. In Michael Vaughan's case, his mode is because of his many low scores, but this doesn't reflect the two very high scores of and .
4. The Range
How to Work Out the Range:
- Subtract the smallest value from the largest value.
Example: Comparing Cricketers Using the Range
Andrew Flintoff's Range:
- Largest score:
- Smallest score:
- Range:
Michael Vaughan's Range:
- Largest score:
- Smallest score:
- Range:
What does this tell us?
At first glance, someone might think that Michael Vaughan had the better season because he has the larger range of , but that's not entirely correct. A larger range means that his scores varied significantly, showing inconsistency. Andrew Flintoff, with a smaller range of , had a more consistent performance.
- Good thing about the range: It gives a quick idea of how spread out the data is.
- Bad thing about the range: The range is heavily affected by outliers. Michael Vaughan's very high scores of and make his range large, but they don't reflect his many low scores of .
Who Had the Better Season?
When comparing Andrew Flintoff and Michael Vaughan using these statistics, it becomes clear that:
- Andrew Flintoff had a more consistent season, with a higher mode of and a smaller range of . This suggests he performed well in most matches.
- Michael Vaughan, although he had some brilliant high scores (like and ), had many low scores, leading to an inconsistent performance. His mean was distorted by the outliers, and his range was very large, showing great variability in his results.
Worked Example: Estimating the Mean Attendance at Football Matches
Example:
The table below shows the attendance at Preston North End football matches over a season:
| Attendance Range | Frequency (Number of Matches) |
|---|---|
| 5 | |
Step-by-Step Guide:
- Work out the mid-point for each group.
- The mid-point is the average of the lower and upper boundaries of each class.
- For the first group the mid-point is:
- Continue for the remaining groups.
- Multiply the mid-point by the frequency for each group.
- This gives an estimate of the total attendance for each group of matches.
- Calculate the estimated mean using the formula:
Step-by-Step Solution:
| Attendance Range | Mid-point | Frequency | Mid-point × Frequency |
|---|---|---|---|
Totals:
- Total frequency (number of matches) =
- Total of mid-point × frequency = Now, calculate the estimated mean: