The partially completed box plot in Figure 1 shows the distribution of daily mean air temperatures using the data from the large data set for Beijing in 2015 - Edexcel - A-Level Maths Statistics - Question 2 - 2019 - Paper 1
Question 2
The partially completed box plot in Figure 1 shows the distribution of daily mean air temperatures using the data from the large data set for Beijing in 2015.
An ou... show full transcript
Worked Solution & Example Answer:The partially completed box plot in Figure 1 shows the distribution of daily mean air temperatures using the data from the large data set for Beijing in 2015 - Edexcel - A-Level Maths Statistics - Question 2 - 2019 - Paper 1
Step 1
Complete the box plot in Figure 1 showing clearly any outliers.
96%
114 rated
Only available for registered users.
Sign up now to view full answer, or log in if you already have an account!
Answer
To complete the box plot, we must first calculate the IQR. From the provided values:
Find Q1 and Q3:
The three lowest values are 7.6°C, 8.1°C, and 9.1°C, thus:
Q1 can be taken as the median of the lower half of data, which will be a value greater than 7.6°C but less than 8.1°C.
For Q3, determined similarly, will be a value greater than 9.1°C.
Calculate IQR:
IQR = Q3−Q1.
Identify Outliers:
Lower Limit = Q1−1.5imesIQR.
Upper Limit = Q3+1.5imesIQR.
Any data point outside of these limits is considered an outlier.
Plotting:
The box will be drawn based on the quartiles, whiskers drawn to the nearest values within limits, and any outliers will be marked distinctly.
Step 2
Using your knowledge of the large data set, suggest from which month the two outliers are likely to have come.
99%
104 rated
Only available for registered users.
Sign up now to view full answer, or log in if you already have an account!
Answer
The two outliers are likely to come from October as it typically has the coldest temperatures between May and October in Beijing. Therefore, extreme cold temperatures are expected in this month.
Step 3
Show that, to 3 significant figures, the standard deviation is 5.19°C.
96%
101 rated
Only available for registered users.
Sign up now to view full answer, or log in if you already have an account!
Answer
To calculate the standard deviation, we use the formula:
S=n−1∑(xi−xˉ)2
Given:
Ss=4952.906, therefore:
s=nSs=1844952.906≈26.8
Consequently, standard deviation simplifies to:
18426.8=5.19 to 3 significant figures.
Step 4
Using Simon's model, calculate the 10th to 90th interpercentile range.
98%
120 rated
Only available for registered users.
Sign up now to view full answer, or log in if you already have an account!
Answer
The 10th and 90th percentiles for a normally distributed variable can be found using the Z-table:
For the 10th percentile, Z0.10≈−1.2816.
For the 90th percentile, Z0.90≈1.2816.
Using the formula:
P=μ+Z⋅σ
10th Percentile Calculation:
P10=22.6+(−1.2816⋅5.19)≈18.3.
90th Percentile Calculation:
P90=22.6+(1.2816⋅5.19)≈29.9.
Interpercentile Range:
Range = P90−P10=29.9−18.3≈11.6.
Step 5
State two variables from the large data set for Beijing that are not suitable to be modeled by a normal distribution. Give a reason for each answer.
97%
117 rated
Only available for registered users.
Sign up now to view full answer, or log in if you already have an account!
Answer
Rainfall: Rainfall data is typically not normally distributed as it is often skewed (not symmetric), with many days having zero rain, leading to a left skew.
Wind Speed: Wind speed data often follows a different statistical distribution (like Rayleigh distribution) since it usually has a lower bound of zero and does not exhibit symmetry, resulting in a distribution that is not normal.