Arithmetic & Numerical Computation (AQA A-Level Biology): Revision Notes
Averages (Mean, Median, Mode)
Averages are measures of central tendency that help us understand the typical or representative value in a dataset. In biological studies, these statistical tools are essential for analysing experimental data and drawing meaningful conclusions from measurements.
Understanding central tendency
An average provides a single value that represents the centre or typical value of a dataset. While the arithmetic mean is the most commonly used measure, the median and mode can sometimes provide more appropriate representations of your data, particularly when dealing with biological measurements that may contain outliers or unusual distributions.
Different measures of central tendency can tell different stories about the same dataset. Understanding when to use each measure is crucial for accurate data interpretation in biological research.
The arithmetic mean
The mean (often simply called the average) is calculated by adding all observed values together and dividing by the total number of measurements. This measure of central tendency accounts for both the magnitude of each measurement and how frequently each occurs.
The mean is particularly valuable in biology because repeated measurements help account for natural variability in living systems. When you average multiple measurements, the calculated mean approaches the true value more accurately, which explains why replicating experiments is so important in biological research.
Formula and calculation
For a set of values x, the mean is calculated using:
Where:
- represents the mean value
- is the sum of all measured values
- is the total number of measurements
Worked Example: Calculating Mean Mass
Consider measuring the mass of five laboratory mice: 6.2g, 7.7g, 6.7g, 7.1g, and 6.3g.
Mean mass =
Precision and decimal places
When calculating means, maintain appropriate precision in your final answer. Your mean should typically have the same number of decimal places as your original measurements. Using more decimal places than your raw data suggests false precision - for example, if you measure masses to whole grammes, your mean should also be expressed as a whole number. This prevents implying that your averaged result is more precise than your individual measurements actually were.
The median
The median represents the middle value when all measurements are arranged in numerical order. This measure of central tendency is calculated by first ordering your data from smallest to largest, then identifying the central value.
Worked Example: Finding the Median
For the dataset 12, 15, 10, 17, 9, 13, 13, 19, 10, 11:
Step 1: Arrange in order: 9, 10, 10, 11, 12, 13, 13, 15, 17, 19
Step 2: Find the middle value. With 10 values, the median is the average of the 5th and 6th values.
Step 3: Median =
When to use the median
The median proves particularly useful when your dataset contains outliers - extreme values that could skew your mean. It also allows meaningful comparison between datasets that have similar means but different distributions, and provides a reliable measure when you have too few measurements to calculate a trustworthy mean.
Example: Median vs Mean with Outliers
In the dataset 1, 3, 3, 11, 12, 12, 12, 13, 14, 15:
- Median value: 12 (a sensible representation)
- Mean value: 9.6 (pulled downward by the low outliers)
The median provides a better representation of the typical value in this case.
The mode
The modal value is the most frequently occurring measurement in your dataset. This measure proves most useful when working with qualitative data or when your distribution shows multiple peaks (bimodal distribution).
Applications and limitations
Example: Identifying the Mode
In the dataset 9, 10, 11, 11, 12, 13, 13, 13, 14, 17, 18, 19, the mode is 13 because it appears three times.
Use caution with modal values in biological data. Small datasets can produce misleading modes, and sometimes no clear mode exists. For instance, in datasets where each value appears only once, every value could be considered modal, making this measure meaningless.
The mode finds useful applications when data is collected in categories, such as recording the colours of flowers or types of behaviour observed in animal studies.
Practical applications in biology
These measures of central tendency are frequently used throughout biological research:
- Mean calculations for determining average enzyme activity, growth rates, or population sizes
- Median values when analysing survival times or when dealing with highly variable biological measurements
- Modal analysis for identifying the most common phenotype or behaviour in population studies
Always consider which measure best represents your particular dataset and research question. Sometimes reporting multiple measures provides the most complete picture of your experimental results.
Key Points to Remember:
- Mean = sum of all values ÷ number of values; most common but affected by outliers
- Median = middle value when data is ordered; useful for skewed datasets or with outliers
- Mode = most frequent value; best for qualitative data or bimodal distributions
- Match decimal places in your mean to the precision of your original measurements
- Consider reporting multiple measures when they tell different stories about your data