Grouped Frequency Distributions (Junior Cert Mathematics): Revision Notes
Grouped Frequency Distributions
When dealing with large sets of data, it's sometimes easier to group the data into intervals or ranges instead of working with individual numbers. This is called a grouped frequency distribution. In this section, we'll explore why we use grouped frequency distributions, what mid-interval values are, and how to calculate the mean, median, and mode using a worked example.
Why Do We Use Grouped Frequency Distributions?
- Simplifies Large Data Sets: When you have a lot of data, listing every single number can be overwhelming. Grouped frequency distributions help by organising the data into intervals or ranges. For example, instead of listing the exact age of every person in a survey, you might group ages into ranges like 0-10, 10-20, 20-30, and so on. This makes it easier to work with and understand the data.
-
Makes Patterns Easier to Spot: Grouping data into intervals lets you see patterns and trends more clearly. For example, if you group people's ages, you might quickly see that most people are between 20 and 30 years old. This can be harder to spot when you're just looking at a long list of individual numbers.
-
Helps with Calculations: Grouping data also makes it easier to calculate important statistics like the mean (average), median, and mode. Instead of working with each individual number, you can work with groups, which simplifies the maths.
What is a Mid-Interval Value?
In a grouped frequency distribution, you don't have individual values. Instead, you have intervals, or ranges, of values. To calculate things like the mean, you need a single value to represent each interval. This is where the mid-interval value comes in.
Mid-Interval Value:
- The mid-interval value is the middle point of each interval.
- You find it by adding the lowest and highest values in the interval and then dividing by 2.
Example:
- If your interval is 0-10, the mid-interval value is .
- If your interval is 10-20, the mid-interval value is . These mid-interval values represent the entire group, making it easier to calculate averages and other statistics.
Example of a Grouped Frequency Distribution Let's look at an example where ages are grouped into intervals:
| Age Group | 0-10 | 10-20 | 20-30 | 30-40 |
|---|---|---|---|---|
| Frequency | 2 | 5 | 4 | 8 |
This table shows:
- 2 people are aged between 0 and 10 years,
- 5 people are aged between 10 and 20 years, and so on. Mid-Interval Values:
For each age group, the mid-interval values are:
| Age Group | 0-10 | 10-20 | 20-30 | 30-40 |
|---|---|---|---|---|
| Mid-Interval | 5 | 15 | 25 | 35 |
These mid-interval values (5, 15, 25, 35) will be used when calculating the mean, median, and mode.
Why Are Mid-Interval Values Important?
- Represents the Interval: Since we don't know the exact values within each interval, the mid-interval value acts as a representative for the entire group. It's like saying, "On average, people in this group are around this age."
- Makes Calculations Possible: To calculate the mean, median, and mode for grouped data, you need to work with single values. The mid-interval value simplifies this by giving you a single number to represent each interval.
Mean, Median, and Mode for Grouped Frequency Distributions (Worked Example)
Now that you have a grouped frequency distribution and mid-interval values, you can calculate the mean, median, and mode using a worked example.
Example: Using the same grouped frequency distribution table:
| Age Group | 0-10 | 10-20 | 20-30 | 30-40 |
|---|---|---|---|---|
| Frequency | 2 | 5 | 4 | 8 |
| Mid-Interval | 5 | 15 | 25 | 35 |
1. Mean
How to calculate it:
- Step 1: Multiply each mid-interval value by the frequency for that interval.
- Why? This gives us a weighted total for each group, taking into account how many people fall into each age range.
- Step 2**:** Add up all those results.
- Why? This gives us the total sum of ages (in terms of the mid-interval values) for all people surveyed.
- Step 3**:** Add up the total frequency (the total number of people).
- Why? We need to divide the total sum by the number of people to find the average.
- Step 4**:** Divide the total from Step 2 by the total frequency.
- Why? Dividing gives us the mean, or the average age of the people surveyed.
- Answer: The mean age is approximately 24.47 years.
2. Median
How to find it:
- Step 1: Find the cumulative frequency (add up frequencies as you go down the table).
- Why? The cumulative frequency helps us find the interval where the middle value (median) lies.
- Cumulative frequency for Cumulative frequency for Cumulative frequency for Cumulative frequency for
- Step 2**:** Identify the interval where the median lies (the middle value in the data).
- Why? The median is the middle value, so we find the interval where this value falls.
- Total frequency is 19, so the median is the 10th value.
- The 10th value falls in the 20-30 interval.
- Step 3: Since the median lies in the 20-30 age group, the median is 25 years (the mid-interval value of that group).
- Why? The median is represented by the mid-interval value of the group where the middle value falls. Answer: The median age is 25 years.
3. Mode
How to find it:
- Step 1**:** Identify the interval with the highest frequency (this is called the modal class).
- Why? The mode is the value that appears most often, so we look for the interval with the most data points.
- The highest frequency is 8, which occurs in the 30-40 age group.
- Step 2**:** The mode is the mid-interval value of that interval.
- Why? The mode is represented by the mid-interval value of the group with the highest frequency.
- The mid-interval value for 30-40 is 35. Answer: The mode is 35 years.
In Summary:
- Grouped frequency distributions simplify large sets of data by organising values into intervals.
- We use mid-interval values to represent each interval, which makes calculations easier.
- The mean is found by multiplying mid-interval values by their frequencies, adding the results, and dividing by the total frequency.
- The median is found by identifying the interval containing the middle value and using the mid-interval value of that group.
- The mode is found by identifying the interval with the highest frequency and using the mid-interval value of that group. By understanding these concepts and using the worked example, you'll be better equipped to work with grouped frequency distributions and perform the necessary calculations.