Continuous Distributions 1 (AQA A-Level Further Maths): Revision Notes
Continuous Distributions 1
Introduction to probability density functions
When working with discrete random variables, you can list all possible values and their associated probabilities in a probability distribution function. However, this approach does not work for continuous random variables because there are infinitely many possible values within any interval. Just as histograms replace bar charts for continuous data, probability density functions replace probability distribution functions for continuous variables.
A probability density function (pdf) is a mathematical function that describes the likelihood of a continuous random variable taking values within a specific range. Instead of calculating probabilities for exact values, you find the probability that the variable falls between two values by calculating the area under the pdf curve between those points. This area is found using integration.
Definition: If is a random variable with probability density function , then the probability that lies between values and is given by:
The area under the curve represents probability, which is similar to how area represents frequency in histograms. For continuous distributions, the probability of taking any single exact value is zero, so .
This is a fundamental difference from discrete distributions: in continuous distributions, we cannot find the probability of an exact value, only the probability of falling within a range of values.
Properties of probability density functions
For a function to be a valid probability density function, it must satisfy two essential conditions. These properties ensure that the function represents a valid probability model.
Key properties:
-
Non-negativity: The function must be non-negative for all values in its domain:
This means all probabilities are greater than or equal to zero.
-
Total probability equals one: The total area under the pdf curve must equal 1:
This represents the fact that the total probability of all possible outcomes is 1.
Worked example 1: verifying a valid pdf
Worked Example: Verifying a Valid PDF
A random variable has probability density function:
Part a: Sketch the curve and verify that is a valid pdf. Shade the area representing .
Part b: Find the value of .
Solution:
First, check the non-negativity condition. Since for , and this is always positive in the given range, the first property is satisfied.
Next, verify the total probability condition:
Since both properties are satisfied, is a valid pdf.
For part b:
Worked example 2: finding an unknown constant
Worked Example: Finding an Unknown Constant
A random variable has probability density function:
where is some positive constant. Find the value of .
Solution:
To find the constant , use the total probability property. The integral of over its entire domain must equal 1.
Expand the integrand:
Integrate:
Evaluate at the limits:
Median of a continuous random variable
For a continuous random variable, the median is the value that divides the total probability into two equal halves. This means that the probability of the variable being less than the median equals one-half.
Definition: If is a continuous random variable with probability density function , then the median value is given by:
In practice, the lower limit of the integral will be the smallest value of in the domain of the pdf, not negative infinity. The integral from the minimum value up to the median equals , which means exactly half the probability lies below the median.
Worked example 3: finding the median
Worked Example: Finding the Median
A random variable has probability density function:
Find the median of .
Solution:
Set up the equation for the median:
Integrate:
Evaluate:
Quartiles
You can also calculate the lower and upper quartiles from the probability density function. These values divide the distribution into quarters.
Definitions: The lower quartile and upper quartile satisfy:
This can be written as:
where is the pdf. Again, in practice, use the minimum value of the domain as the lower limit rather than negative infinity.
Mode of a continuous random variable
The mode of a continuous random variable is the value where the probability density function reaches its maximum. This represents the most likely region for the variable to take values.
The mode can be found in two ways:
- By sketching the probability density function and identifying the peak
- By using calculus and differentiation
Definition: For a continuous random variable with probability density function , the mode can be found by solving:
This finds the stationary points of the function. You must then verify that this point is indeed a maximum by checking the second derivative or examining the graph.
Worked example 4: finding the mode
Worked Example: Finding the Mode
A random variable has probability density function:
where is a constant.
Part a: Find the value of .
Part b: By differentiating , or otherwise, find the mode of .
Solution:
Part a: Use the total probability property:
Evaluating at the limits:
Part b: To find the mode, differentiate and set equal to zero:
The modal value can also be found from the graph of . The function is a downward-opening parabola that cuts the -axis at and . By symmetry, the modal value is midway at .
Expectation and variance
The expectation (or mean) and variance of continuous random variables are calculated using integration rather than summation. These formulas are the continuous equivalents of the discrete formulas you have learned previously.
Definitions: If is a random variable with mean and variance , then:
The continuous versions of and are found from their discrete equivalents by changing the summation operators () into integrals. Integration is the continuous equivalent of discrete summation.
For ease of calculation, the variance formula can be rewritten as:
This alternative form is often simpler to compute because it avoids working with .
Worked example 5: calculating mean and variance
Worked Example: Calculating Mean and Variance
A random variable has probability density function:
Find the mean and variance of .
Solution:
First, calculate the mean:
Next, calculate the variance using the alternative formula:
Expectation and variance of functions
The mean value and variance of a function of a random variable, , can be found by replacing with in the expectation formula.
Key formulas: If is a random variable with probability density function , then:
These formulas allow you to find the expected value and variance of any transformation of the random variable.
For the expectation of sums of functions, the linearity property applies:
Linearity of expectation: For a random variable , the mean value of is:
Worked example 6: expectation of functions
Worked Example: Expectation of Functions
A continuous random variable has probability density function for between 1 and 3.
Find the expected value of:
Part a:
Part b:
Solution:
Part a:
Part b:
Alternatively, you can use the linearity property:
Linear transformations
When you apply a linear transformation to a random variable, there are simple rules for finding the mean and variance of the transformed variable. If is a linear function of , specifically where and are constants, then:
Key formulas for linear transformations:
These results can also be written as:
Notice that adding a constant shifts the mean by but does not affect the variance. Multiplying by a constant scales the mean by and scales the variance by .
Worked example 7: linear transformations
Worked Example: Linear Transformations
A random variable has probability density function:
Part a: Find the value of and the mean and variance of .
The random variable is related to by the equation .
Part b: Find the mean and variance of .
Solution:
Part a: First find the constant :
This gives , so .
Calculate the mean:
Calculate the variance:
Part b: For :
Sums of independent random variables
When dealing with two independent continuous random variables, you can find the expected value and variance of their sum using simple addition rules.
Key formulas for independent variables:
For two independent continuous random variables and :
These formulas only apply when the variables are independent. Independence means that the outcome of one variable does not affect the probability distribution of the other.
Worked example 8: sums of independent variables
Worked Example: Sums of Independent Variables
Two independent random variables and have distributions:
where and are constants.
Part a: Find the values of and .
Part b: Find the expected value and variance of the function .
Solution:
Part a: Using the 'area equals 1' property:
Part b: Since and are independent:
Calculate each expected value separately and combine them to get the final answer of (2 dp).
For the variance:
Calculate each variance separately using the formulas and combine them to get (2 dp).
Strategy for solving problems
When working with continuous random variables, follow this systematic approach:
- Use the 'area equals 1' property to find any unknown constants in the probability density function.
- Use integration to find probabilities and calculate values like the median and quartiles.
- Calculate summary statistics for location (mean, median, mode) and spread (variance, standard deviation).
- Find mean and variance of functions of random variables using the appropriate formulas, including linear transformations and sums of independent variables.
Key Points to Remember:
-
A probability density function must satisfy two properties: and .
-
Probabilities are found by integration: .
-
For continuous variables, the median satisfies , and the mode is found by solving .
-
Expectation and variance are calculated using and .
-
For linear transformations : and .
-
For independent variables: and .