Mean and Variance of a Distribution (HSC SSCE Mathematics Advanced): Revision Notes
Mean and Variance of a Distribution
Introduction: from discrete to continuous distributions
When working with continuous probability distributions, we use similar concepts to discrete distributions, but with an important difference in how we calculate them. Instead of adding up values using summation (), we integrate over the continuous interval using .
This makes sense because continuous distributions have infinitely many possible values, so we can't simply add them up one by one. Integration allows us to find the total effect across all possible values in a smooth, continuous way.
Interestingly, both the Greek letter sigma () for summation and the integral symbol (, which is an early form of the letter S) represent the same basic concept: adding things together. In continuous distributions, we're essentially adding up infinitely many infinitely small contributions.
Understanding the mean of a continuous distribution
The mean (also called the expected value) tells us the average value we would expect from a continuous probability distribution. It's the balancing point of the distribution.
For a discrete distribution, we calculate the mean by multiplying each value by its probability and adding them all up:
For a continuous distribution with probability density function on interval , we replace the sum with an integral:
This formula works by:
- Taking each value in the interval
- Weighting it by its probability density
- Integrating (summing continuously) across the entire interval
The result gives us the mean value , which represents the centre of the distribution.
Understanding variance and standard deviation
The variance measures how spread out the distribution is. It quantifies how much the values typically differ from the mean. A larger variance means the distribution is more spread out.
Two equivalent formulas for variance
Just as with discrete distributions, we have two ways to calculate variance for continuous distributions:
Method 1: Using squared deviations from the mean
This method directly measures the average squared distance from the mean:
This formula:
- Takes the deviation of each value from the mean:
- Squares it to make all deviations positive:
- Weights it by the probability density:
- Integrates across the entire interval
Method 2: Using the expected value of the square
This alternative formula is usually easier for calculations:
This formula:
- First finds the expected value of by integrating
- Then subtracts the square of the mean
Both methods give exactly the same answer, but Method 2 typically involves simpler algebra because you don't need to expand and deal with the resulting terms. This is the recommended approach for most calculations!
Standard deviation
The standard deviation is simply the square root of the variance:
The standard deviation is particularly useful because it has the same units as the original variable. For example, if measures distance in metres, then is also in metres, while variance would be in square metres.
Key formulas summary
Let be a probability density function on interval .
Mean or expected value:
Variance (Method 1 - using deviations):
Variance (Method 2 - usually easier):
Standard deviation:
Worked example 1: calculating mean and variance
Worked Example: Calculating Mean and Variance for a Linear PDF
Let's find the mean and variance for the probability density function on the interval .
Finding the mean:
We calculate the expected value by integrating :
So the mean is 4 metres.
Finding the variance - Method 1 (using squared deviations):
Using the formula with :
Finding the variance - Method 2 (using ):
Using the simpler formula:
Notice how Method 2 involved much simpler calculations! Both methods give us a variance of 2.
Finding the standard deviation:
The standard deviation has the same units as the original variable (metres).
Worked example 2: uniform distribution
Worked Example: Uniform Distribution
Find the mean and standard deviation for the probability density function on the interval .
This represents a uniform distribution where all values are equally likely.
Finding the mean:
The mean is 4, which makes sense as it's exactly halfway through the interval .
Finding the variance:
Using the easier formula :
Finding the standard deviation:
Worked example 3: linear probability density function
Worked Example: Linear Probability Density Function
Find the mean and standard deviation for on the interval .
Finding the mean:
Finding the variance:
Using :
Finding the standard deviation:
Key Points to Remember:
-
From sums to integrals: When moving from discrete to continuous distributions, replace summation () with integration ().
-
Two variance formulas: You can calculate variance using either or . The second formula is usually simpler to work with.
-
Mean formula: gives you the expected value by weighting each value by its probability density.
-
Units matter: Standard deviation has the same units as the original variable, while variance is in squared units. This makes standard deviation more interpretable.
-
Check your work: Both variance methods should give the same answer, so if you have time in an exam, you can verify your answer by calculating it both ways.