Inferential Testing (A Level Only) (AQA A-Level Psychology): Revision Notes
Tests of Correlation: Spearman's & Pearson's
Overview of correlation tests
Correlation tests help us determine whether there is a relationship between two variables and measure the strength of that relationship. Two key tests are used in psychology research: Spearman's rho and Pearson's r. The choice between these tests depends on the type of data you have and whether certain statistical assumptions are met.
Spearman's rho
When to use Spearman's rho
Spearman's rho is a non-parametric correlation test designed for use with ordinal data, though it can also be used with interval data when the assumptions for parametric tests are not met.
When Spearman's rho is most appropriate:
- One or both variables are measured at the ordinal level
- The data does not follow a normal distribution
- You're looking for monotonic relationships (relationships that consistently increase or decrease, but not necessarily in a straight line)
- The investigation is correlational rather than experimental
The test works by converting raw scores into ranks, then calculating the correlation between these ranks. This makes it less sensitive to extreme values compared to Pearson's r.
Calculating Spearman's rho
The calculation involves several systematic steps:
Step 1: Rank each set of scores separately for each variable, from lowest to highest. If scores are tied, calculate the mean of their combined ranks.
Step 2: Calculate the difference between each pair of ranks (d), then square these differences (d²).
Step 3: Apply the Spearman's formula:
Where (rho) is the correlation coefficient, is the sum of squared differences, and N is the number of participants.
Step 4: Compare the calculated value with the critical value from statistical tables to determine significance.
Worked Example: Physical Attractiveness Study
A study investigated the matching hypothesis by examining whether there was a correlation between partners' physical attractiveness ratings. Twenty-four couples were photographed and rated for attractiveness by independent judges.
Hypotheses:
- Alternative hypothesis: There is a positive correlation between ratings of physical attractiveness given to two partners in a relationship (directional, one-tailed)
- Null hypothesis: There is no correlation between ratings of physical attractiveness given to two partners in a relationship
Result: The calculation showed a correlation coefficient of -0.035, which was less than the critical value of 0.503 (for N=12, p≤0.05, one-tailed test). Therefore, the result was not significant, and the null hypothesis was accepted.
Pearson's r
When to use Pearson's r
Pearson's r is a parametric correlation test used when both variables are measured at the interval level. This test is more powerful than Spearman's rho when its assumptions are met, but these assumptions are more stringent.
Critical Assumptions for Pearson's r:
- Both variables must be measured at interval or ratio level
- The data should be normally distributed
- The relationship should be linear
- The data should be measured on continuous scales
If any of these assumptions are violated, consider using Spearman's rho instead.
Pearson's r measures the strength of linear relationships between variables, making it ideal for investigations where you expect variables to change proportionally.
Calculating Pearson's r
The calculation requires several preparatory steps before applying the main formula:
Step 1: Create a table with columns for each variable (x and y), their squares (x², y²), and their products (xy).
Step 2: Calculate the sums: , , , , and .
Step 3: Apply Pearson's formula:
Where r is the correlation coefficient and N is the number of participants.
Step 4: Compare the calculated value with the critical value to determine significance.
Worked Example: Biofeedback and Stress Study
Researchers investigated whether there was a relationship between the length of time participants had been using biofeedback and their reduction in resting heart rate. Ten participants experiencing chronic stress were studied.
Hypotheses:
- Alternative hypothesis: There is a positive correlation between the number of days participants have been using biofeedback and the reduction in their resting heart rate (directional, one-tailed)
- Null hypothesis: There is no correlation between the number of days participants have been using biofeedback and their reduction in resting heart rate
Result: The calculation yielded r = 0.740, which exceeded the critical value of 0.549 (for df = 8, p≤0.05, one-tailed test). The result was significant, allowing researchers to reject the null hypothesis and accept the alternative hypothesis.
Key differences between the tests
The fundamental distinction lies in the type of data each test can handle and their underlying assumptions.
Data Type Requirements:
- Spearman's rho: Designed for ordinal data but can accommodate interval data when parametric assumptions aren't met
- Pearson's r: Specifically for interval data that meets parametric test requirements
Spearman's rho converts all data to ranks, making it robust against outliers and suitable for non-normal distributions. Pearson's r works with raw data and is more sensitive to the actual values, making it more powerful when its assumptions are satisfied.
Understanding critical values and significance
Determining Statistical Significance
Both tests use critical value tables to determine whether results are statistically significant. The critical value depends on:
- Sample size (N for Spearman's, degrees of freedom for Pearson's)
- Significance level (typically 0.05)
- Whether the test is one-tailed or two-tailed
A result is significant when the calculated correlation coefficient equals or exceeds the critical value. This indicates that the observed relationship is unlikely to have occurred by chance alone.
Key Points to Remember:
-
Spearman's rho is used with ordinal data or when parametric assumptions aren't met - it ranks data before calculating correlation
-
Pearson's r is used with interval data that meets parametric assumptions - it's more powerful when conditions are right
-
Both tests measure the strength and direction of relationships between two variables, with values ranging from -1 to +1
-
Statistical significance is determined by comparing calculated values with critical values from statistical tables
-
The choice of one-tailed versus two-tailed tests depends on whether you have a directional hypothesis about the relationship