Summarised below are the distances, to the nearest mile, travelled to work by a random sample of 120 commuters - Edexcel - A-Level Maths Statistics - Question 4 - 2007 - Paper 1
Question 4
Summarised below are the distances, to the nearest mile, travelled to work by a random sample of 120 commuters.
Distance (to the nearest mile) Number of commu... show full transcript
Worked Solution & Example Answer:Summarised below are the distances, to the nearest mile, travelled to work by a random sample of 120 commuters - Edexcel - A-Level Maths Statistics - Question 4 - 2007 - Paper 1
Step 1
describe its shape
96%
114 rated
Only available for registered users.
Sign up now to view full answer, or log in if you already have an account!
Answer
The shape of the distribution is positively skewed. This is indicated by a longer tail on the right side, as there are more commuters traveling shorter distances compared to those traveling longer distances. The majority of data points cluster at the lower end of the distance range.
Step 2
use linear interpolation to estimate its median
99%
104 rated
Only available for registered users.
Sign up now to view full answer, or log in if you already have an account!
Answer
To estimate the median using linear interpolation, we first identify the cumulative frequency. The total number of commuters is 120, so the median is at the 60th position. By calculating the cumulative frequencies:
0-9: 10
10-19: 29 (10 + 19)
20-29: 72 (29 + 43)
The median falls in the 20-29 category. To interpolate:
y = L + \left(\frac{ \frac{N}{2}-F}{f} \right) \times c
Where:
L = lower boundary of median class = 19.5
N = total frequency = 120
F = cumulative frequency of the class before the median = 29
State whether or not the value of your coefficient is consistent with your description in part (a). Justify your answer.
97%
117 rated
Only available for registered users.
Sign up now to view full answer, or log in if you already have an account!
Answer
Yes, the value of the skewness coefficient is consistent with the description in part (a). The positive value of the skewness (0.520) indicates the distribution is positively skewed, aligning with the observation of more shorter commutes.
Step 6
State, with a reason, whether you should use the mean or the median to represent the data in this distribution.
97%
121 rated
Only available for registered users.
Sign up now to view full answer, or log in if you already have an account!
Answer
The median should be used to represent the data in this distribution because the data is positively skewed. The median is less affected by outliers and extreme values, providing a more accurate measure of central tendency.
Step 7
State the circumstance under which it would not matter whether you used the mean or the median to represent a set of data.
96%
114 rated
Only available for registered users.
Sign up now to view full answer, or log in if you already have an account!
Answer
It would not matter whether to use the mean or median if the data is symmetrical or uniformly distributed, as both measures of central tendency would provide a similar representation of the data.