Research Methods (Edexcel A-Level Psychology): Revision Notes
Research Methods
Research methods form the foundation of psychological investigation. This topic encompasses the various approaches psychologists use to study behaviour, the procedures they follow, and the techniques they employ to analyse collected data. Understanding these methods allows you to evaluate research critically and design your own investigations effectively.
Types of data
Psychologists collect different types of data depending on their research aims. Understanding which type to use is essential for designing appropriate investigations.
Qualitative data consists of non-numerical information, typically in the form of words, descriptions, or images. This type of data provides rich, detailed insights into experiences, opinions, and behaviours. For example, interview transcripts or diary entries would constitute qualitative data. The depth of information obtained is a key strength, allowing researchers to explore complex phenomena and unexpected themes. However, qualitative data can be difficult to analyse objectively, and findings may be influenced by researcher interpretation. The time-consuming nature of analysis and challenges in generalising results to broader populations are notable limitations.
Quantitative data comprises numerical information that can be counted or measured. Examples include test scores, reaction times, or the number of words recalled in a memory experiment. This type of data can be easily compared and statistically analysed, making it simpler to identify patterns and relationships. Quantitative data also allows for objective measurement and easier replication of studies. Nevertheless, it may oversimplify complex behaviours and fail to capture the full richness of human experience.
Primary vs Secondary Data:
Primary data is collected first-hand by the researcher specifically for their investigation, ensuring the data directly addresses the research question. While this allows complete control over collection methods and ensures data is tailored to the study's aims, it can be time-consuming and expensive.
Secondary data already exists, having been collected by someone else for a different purpose (such as government statistics or previous research findings). This saves time and resources and may provide access to large datasets. However, the data may not perfectly match the researcher's specific needs, and the researcher has no control over how it was originally collected.
Sampling techniques
Selecting participants for psychological research requires systematic approaches to ensure the sample is appropriate for the study's aims. Different sampling methods have distinct advantages and limitations.
Random sampling involves selecting participants in a way that gives every member of the target population an equal chance of being chosen. This might be done by assigning numbers to everyone in the population and using a random number generator or pulling names from a container. The main strength of random sampling is that it minimises bias, as the researcher has no influence over who is selected. This increases the likelihood that the sample will be representative of the population, improving the generalisability of findings. However, this method can be time-consuming and impractical, particularly when the target population is large or difficult to access. Additionally, there is no guarantee that the sample will actually be representative, especially with smaller sample sizes.
Stratified sampling involves dividing the target population into subgroups (strata) based on relevant characteristics, such as age, gender, or ethnicity, and then selecting participants from each subgroup in proportion to their presence in the population. This method ensures that important characteristics are represented proportionally, making it more representative than random sampling. However, stratified sampling requires detailed knowledge of the population composition and can be complex and time-consuming to implement.
Sampling Bias Concerns:
Volunteer sampling (also called self-selected sampling) and opportunity sampling are convenient but carry significant risks:
Volunteer sampling - While convenient and providing engaged participants, it suffers from volunteer bias. People who volunteer may differ systematically from those who do not, potentially in terms of motivation, availability, or personality traits, limiting generalisability.
Opportunity sampling - Though quick, inexpensive, and practical, opportunity samples are likely to be biased and unrepresentative, as they only include people who happened to be in a particular place at a particular time. This significantly limits the extent to which findings can be generalised.
Experimental and research designs
The design of a study determines how participants are allocated to different conditions and how data is collected. Each design has specific applications and considerations.
Independent groups design involves using different participants in each experimental condition. Each participant only experiences one level of the independent variable. This design eliminates order effects (such as practice or fatigue) because participants only complete the task once. It also reduces the risk of demand characteristics, as participants are unaware of other conditions. However, individual differences between participants in different groups may confound results. This design also requires more participants than repeated measures, and there may be differences between groups that affect the dependent variable.
Repeated measures design uses the same participants in all experimental conditions. Each participant experiences every level of the independent variable. The main advantage is that participant variables are controlled, as the same people are tested in each condition. This design also requires fewer participants overall. However, order effects can be problematic: participants may perform better on later tasks due to practice, or worse due to fatigue or boredom. Demand characteristics may also be more likely, as participants experience all conditions and may guess the study's aims.
Matched pairs design involves matching participants in different groups on relevant characteristics (such as age, IQ, or personality traits) so that each person in one condition has an equivalent person in the other condition. This design controls for participant variables while avoiding order effects. However, it is time-consuming and difficult to match participants on all relevant variables, and perfect matching is practically impossible. This design also requires more participants than repeated measures.
Choosing the Right Design:
Select your experimental design based on the research question and practical constraints:
- Use independent groups when order effects would be problematic or when the manipulation permanently changes participants
- Use repeated measures when participant variables must be controlled and you have a smaller participant pool
- Use matched pairs when you need to control participant variables but order effects are a concern
Hypotheses
A hypothesis is a testable prediction about the relationship between variables or the outcome of a study. Different types of hypotheses serve different purposes in psychological research.
The null hypothesis predicts that there will be no significant effect or relationship between variables. Any difference or correlation observed will be due to chance alone. In experimental research, the null hypothesis typically states that the independent variable will have no effect on the dependent variable. Researchers test the null hypothesis to determine whether it can be rejected in favour of an alternative hypothesis.
An experimental hypothesis (also called an alternative hypothesis) predicts that there will be a significant effect or relationship between variables. This is what the researcher actually expects to find. The experimental hypothesis is accepted if the null hypothesis is rejected following statistical testing.
A directional hypothesis specifies the expected direction of the relationship or difference. For example, "participants will recall significantly more words in the organised list condition than in the random list condition." Directional hypotheses are appropriate when previous research or theory provides a clear basis for predicting the direction of an effect. These hypotheses are tested using one-tailed statistical tests.
A non-directional hypothesis predicts that there will be a difference or relationship but does not specify the direction. For example, "there will be a significant difference in the number of words recalled between the organised list condition and the random list condition." Non-directional hypotheses are used when there is insufficient evidence to predict the direction of an effect. These hypotheses are tested using two-tailed statistical tests.
Worked Example: Operationalising a Hypothesis
Vague hypothesis: "Stress affects memory"
Problems with this hypothesis:
- "Stress" is not defined or measurable
- "Affects" doesn't specify the direction
- "Memory" is too broad and not operationalised
Operationalised hypothesis: "Participants exposed to a stressful task (watching a 10-minute horror film clip) will recall fewer words from a list than participants exposed to a non-stressful task (watching a 10-minute nature documentary), as measured by the number of words correctly recalled from a standardised 20-word list presented for 2 minutes."
What makes this better:
- The IV (stress) is operationalised as specific viewing conditions
- The DV (memory) is operationalised as a specific, measurable outcome
- The direction of the effect is predicted
- All variables are defined precisely and measurably
Operationalisation is the process of defining variables in a way that is specific, measurable, and testable. This ensures that research can be replicated and that results can be interpreted objectively.
Questionnaires and interviews
Questionnaires and interviews are self-report methods that gather information directly from participants about their experiences, attitudes, or behaviours.
Questionnaires consist of written questions that participants answer in their own time. They can include different types of questions:
Open questions allow participants to respond in their own words, providing detailed, qualitative information. These questions generate rich data and may reveal unexpected insights, but responses can be difficult to analyse and compare objectively.
Closed questions provide predetermined response options, such as "yes/no" or multiple choice answers. These produce quantitative data that is easy to analyse and compare statistically. However, they may restrict participants' responses and fail to capture the full complexity of their views.
Ranked scale questions (such as Likert scales) ask participants to rate their agreement or frequency on a numerical scale, such as 1-5. These questions produce quantitative data while allowing some degree of nuance in responses.
Questionnaires can be structured (with predetermined questions in a fixed order), semi-structured (with core questions but flexibility to add probes), or relatively unstructured (with general topic areas but flexibility in specific questions asked).
Self-Report Biases:
Self-report data can be affected by two key biases:
Social desirability bias - Participants respond in ways they believe are socially acceptable rather than truthfully. This is particularly problematic when asking about sensitive topics or socially undesirable behaviours.
Demand characteristics - Participants may guess the study's aims and alter their responses accordingly, either to help the researcher or to deliberately perform differently.
Reducing these biases: Ensure anonymity, use indirect questions, reassure participants there are no "right" answers, and consider using deception about the study's true purpose (with appropriate debriefing).
Interviews involve direct interaction between researcher and participant, with questions asked verbally.
Structured interviews follow a fixed set of predetermined questions in a specific order. This approach ensures consistency across participants and produces quantitative data that is easy to analyse. However, it lacks flexibility and may not allow exploration of interesting points raised.
Semi-structured interviews use predetermined questions but allow the interviewer to probe further or add follow-up questions based on responses. This balances consistency with flexibility.
Unstructured interviews are more like guided conversations, with general topic areas but flexibility in the specific questions asked. These generate rich, detailed qualitative data but may lack consistency across participants and can be influenced by interviewer bias.
Experiments
Experiments are a key research method in psychology, allowing researchers to establish cause-and-effect relationships by manipulating variables under controlled conditions.
Laboratory experiments take place in controlled, artificial settings. The researcher manipulates the independent variable (IV) – the factor that is systematically changed – and measures the dependent variable (DV) – the factor that is measured to see if it has been affected. Laboratory experiments allow high levels of control over extraneous variables, making it easier to establish causation. They can be replicated precisely to test the reliability of findings. However, the artificial setting may lack ecological validity, meaning findings may not generalise to real-world situations. Participants may also respond to demand characteristics, altering their behaviour based on their perception of the study's purpose.
Field experiments are conducted in natural, real-world environments while the researcher still manipulates the IV. These experiments have higher ecological validity than laboratory experiments, as behaviour is studied in a realistic context. Participants may be unaware they are being studied, reducing demand characteristics. However, field experiments offer less control over extraneous variables, making it harder to establish clear cause-and-effect relationships. They can also raise ethical concerns if participants are unaware of their involvement. Replication may be difficult due to the natural setting.
Operationalisation in Experiments:
When conducting experiments, researchers must operationalise both the IV and DV, defining them precisely and measurably.
For example, in a study on the effect of noise on concentration:
- IV: Operationalised as "silence vs. 70 decibels of background music"
- DV: Operationalised as "the number of mathematical problems correctly solved in 10 minutes"
Poor operationalisation leads to unclear results that cannot be replicated or compared with other studies. Always ensure your variables are specific, measurable, and testable.
Observations
Observational methods involve systematically watching and recording behaviour as it occurs naturally or in controlled settings.
Naturalistic observations take place in the participant's natural environment without researcher interference. These observations provide high ecological validity and allow study of behaviours that cannot be manipulated ethically or practically. However, extraneous variables cannot be controlled, cause-and-effect relationships cannot be established, and observer bias may influence what is recorded.
Controlled observations occur in settings where the researcher has some control over the environment and possibly the situation. These offer more control than naturalistic observations while retaining some ecological validity.
Observations can vary in participant awareness:
Covert observations involve observing people without their knowledge. This reduces demand characteristics and observer effects (changes in behaviour due to being watched), but raises ethical concerns regarding consent and privacy.
Overt observations involve observing people with their knowledge and consent. This is ethically preferable but may lead to demand characteristics as participants may alter their behaviour when aware they are being watched.
The observer's role also varies:
Participant observation involves the observer joining the group being studied. This provides detailed insights and understanding of the group's perspective but risks observer bias and loss of objectivity.
Non-participant observation involves the observer remaining separate from the group. This maintains objectivity but may limit understanding of the group's perspective and experiences.
Recording Observations Systematically:
Recording observations requires systematic approaches to ensure data is reliable and objective:
Tallying - Counting the frequency of specific behaviours using predetermined categories. Produces quantitative data but may overlook important details.
Event sampling - Recording each time a specific behaviour occurs, noting relevant details about the behaviour and context.
Time sampling - Recording behaviour at regular intervals (e.g., every 30 seconds). Makes extensive observation periods more manageable but may miss behaviours between sampling points.
Behavioural categories must be developed before observation begins. These categories should be:
- Operationalised and objective
- Observable (not requiring inference)
- Mutually exclusive (behaviours fit into only one category)
- Cover all possible relevant behaviours
Additional research methods and techniques
Beyond experiments and observations, psychologists employ various specialised research methods.
Twin studies compare the similarity of monozygotic (identical) twins, who share 100% of their DNA, with dizygotic (non-identical) twins, who share approximately 50% of their DNA. If identical twins are more similar than non-identical twins for a particular trait, this suggests genetic influence. These studies help researchers understand the relative contributions of nature (genetics) and nurture (environment) to psychological characteristics and behaviours.
Adoption studies examine individuals who were adopted in infancy, comparing them to their biological parents (with whom they share genes but not environment) and adoptive parents (with whom they share environment but not genes). If adopted individuals resemble their biological parents more closely, this suggests genetic influence; if they resemble adoptive parents more closely, this suggests environmental influence.
Animal experiments use non-human animals as research subjects. These allow researchers to conduct studies that would be unethical with humans and to control variables precisely in ways not possible with human participants. However, findings may not generalise to humans due to physiological and psychological differences between species. Animal experiments raise ethical concerns about the welfare and rights of animal subjects.
Case studies involve in-depth investigation of a single individual, small group, or event. Multiple research methods are typically used to gather detailed information, including interviews, observations, tests, and examination of records. Case studies provide rich, detailed data and allow study of rare phenomena or unique cases. However, findings cannot be generalised to other people or situations, and researcher bias may influence interpretation. The uniqueness of the case means the study cannot be replicated.
Brain Scanning Techniques:
Brain scanning techniques allow researchers to study brain structure and function:
- CAT scans (Computerised Axial Tomography) use X-rays to create detailed images of brain structure
- PET scans (Positron Emission Tomography) show brain activity by tracking radioactive glucose as the brain uses energy
- fMRI (functional Magnetic Resonance Imaging) shows brain activity by detecting changes in blood flow to different brain areas
Each technique has different strengths: structural scans show anatomy, while functional scans show activity patterns during tasks.
Content analysis is a research method for systematically analysing qualitative material, such as media content, written documents, or interview transcripts. Researchers develop coding categories and count how often particular themes or items appear. This allows qualitative material to be converted into quantitative data for statistical analysis.
Correlational research examines the relationship between two or more variables without manipulation. The strength and direction of relationships are measured using correlation coefficients. While correlations can identify relationships, they cannot establish causation: a correlation between two variables does not mean that one causes the other.
Longitudinal research involves studying the same participants over an extended period, sometimes many years. This design is valuable for studying developmental changes and long-term effects. However, longitudinal studies are time-consuming, expensive, and suffer from participant attrition (dropout) over time.
Cross-sectional research compares different groups of people at the same point in time, often different age groups. This design is quicker and cheaper than longitudinal research but cannot show developmental changes within individuals, only differences between groups.
Cross-cultural research compares behaviour, attitudes, or psychological characteristics across different cultures. This helps identify universal aspects of human psychology and culture-specific variations.
Meta-analysis is a statistical technique that combines and analyses the results of multiple studies investigating the same research question. This produces more reliable conclusions than individual studies and can identify overall patterns across research. However, meta-analyses may include studies of varying quality, and the combination of different methodologies can be problematic.
Control issues
Controlling extraneous variables is essential for establishing clear cause-and-effect relationships in research. Various factors can threaten the validity of research findings if not properly managed.
Order Effects and Counterbalancing:
Counterbalancing is a technique used to control order effects in repeated measures designs. Participants are divided into groups, with each group experiencing the conditions in a different order. For example, in a study with two conditions (A and B), half the participants would complete condition A first then B, while the other half would complete B first then A. This ensures that any order effects are evenly distributed across conditions.
Order effects occur in repeated measures designs when participation in one condition affects performance in subsequent conditions:
- Practice effects may improve performance on later tasks as participants become more familiar
- Fatigue effects may impair performance on later tasks as participants become tired or bored
Experimenter effects occur when the researcher's expectations, behaviour, or characteristics unintentionally influence participants' behaviour or the study's results. This can include giving subtle cues about expected responses or treating participants in different conditions differently. Double-blind procedures, where neither participants nor researchers know which condition participants are in, help reduce experimenter effects.
Social desirability bias occurs when participants respond in ways they believe are socially acceptable or that present them in a favourable light, rather than responding truthfully. This is particularly problematic in self-report measures. Researchers can reduce social desirability bias by ensuring anonymity, using indirect questions, or employing deception about the study's true purpose.
Demand characteristics are cues in the research situation that may reveal the study's aims or expected behaviour to participants. When participants identify these cues, they may alter their behaviour to either help the researcher (please-U effect) or deliberately perform differently (screw-U effect). Single-blind procedures, where participants are unaware of the condition they are in or the study's aims, help reduce demand characteristics.
Participant variables are individual differences between participants that may affect the dependent variable, such as age, intelligence, personality, or prior experience. These are particularly problematic in independent groups designs. Researchers can control for participant variables through random allocation, matched pairs designs, or using repeated measures designs.
Situational variables are features of the research environment that may affect participants' behaviour, such as temperature, noise level, time of day, or lighting. Standardised procedures help control situational variables by ensuring all participants experience the same environmental conditions.
Distinguishing Extraneous and Confounding Variables:
Extraneous variables are any variables other than the independent variable that might affect the dependent variable. These must be controlled or eliminated to ensure that any effect on the DV can be attributed to the IV alone.
Confounding variables are a specific type of extraneous variable that varies systematically with the independent variable, making it impossible to determine which variable caused any observed effect on the dependent variable.
Example: If all participants in one condition were tested in the morning and all participants in another condition were tested in the afternoon, time of day would be a confounding variable – you couldn't tell if differences were due to the IV or time of day.
Careful operationalisation of variables is necessary to ensure they are defined precisely, measurably, and unambiguously. This allows replication and reduces subjective interpretation.
Descriptive statistics
Descriptive statistics summarise and describe data, making it easier to identify patterns and communicate findings.
Measures of central tendency indicate the typical or average value in a dataset:
The mean is calculated by adding all values together and dividing by the number of values:
where represents the sum of all values and is the number of values.
The mean uses all data points and is the most sensitive measure of central tendency. However, it can be distorted by extreme values (outliers).
The median is the middle value when data is arranged in order. If there is an even number of values, the median is the average of the two middle values. The median is not affected by extreme values, making it useful when data includes outliers. However, it does not use all values in the dataset and is less sensitive to changes in the data.
The mode is the most frequently occurring value in a dataset. A dataset can have more than one mode (bimodal or multimodal) or no mode if all values occur with equal frequency. The mode is the only appropriate measure of central tendency for nominal data (categories). However, it does not use all values in the dataset and may not be representative if several values occur with similar frequency.
Worked Example: Calculating Measures of Central Tendency
Dataset: Test scores: 12, 15, 15, 18, 20, 22, 25, 30
Mean:
Median:
- Arrange in order (already ordered)
- With 8 values (even number), median = average of 4th and 5th values
Mode:
- Most frequent value = 15 (appears twice)
Interpretation: The mean is slightly higher than the median, suggesting a slight positive skew from the higher values.
Measures of dispersion indicate the spread or variability of data:
The range is calculated by subtracting the lowest value from the highest value:
The range is easy to calculate and provides a quick indication of spread. However, it is heavily influenced by extreme values and does not indicate how data is distributed between the highest and lowest values.
The standard deviation measures the average distance of values from the mean. A small standard deviation indicates that data points cluster closely around the mean, while a large standard deviation indicates greater spread. Standard deviation uses all values in the dataset and provides precise information about variability. However, it is more complex to calculate than the range and, like the mean, can be affected by extreme values.
Frequency tables organise data by showing how often each value or category occurs. These make patterns in data easier to identify.
Graphs and charts provide visual representations of data:
Bar charts display categorical data using separate bars for each category. The height of each bar represents the frequency or value. Bars should not touch, as they represent discrete categories.
Histograms display continuous data. Unlike bar charts, bars touch because the data is continuous rather than categorical. The area of each bar represents frequency.
Scatter diagrams display the relationship between two continuous variables by plotting pairs of values as points on a graph. The pattern of points indicates the strength and direction of any relationship. A positive correlation shows an upward trend, a negative correlation shows a downward trend, and no correlation shows no clear pattern.
Understanding Data Distribution:
Normal distribution occurs when data is symmetrically distributed around the mean in a bell-shaped curve. In a normal distribution, the mean, median, and mode are the same value, and most data points cluster near the mean with fewer values at the extremes. Many psychological variables are normally distributed.
Skewed distribution occurs when data is not symmetrically distributed:
- Positively skewed: Most values cluster at the lower end with a tail extending towards higher values
- Negatively skewed: Most values cluster at the higher end with a tail extending towards lower values
Sense checking data involves examining data for errors or anomalies before analysis. Check for impossible values, extreme outliers, or data entry errors.
When interpreting descriptive statistics, researchers must select appropriate measures for their data type and research question. Comparing different datasets requires consideration of both central tendency and dispersion to draw meaningful conclusions.
Inferential statistics
Inferential statistics allow researchers to determine whether patterns observed in sample data are likely to reflect genuine effects in the population or whether they could have occurred by chance.
Levels of measurement classify data types and determine which statistical tests are appropriate:
Nominal data consists of categories with no inherent order, such as eye colour or gender. Frequency counts show how many observations fall into each category.
Ordinal data consists of ranked categories with a meaningful order, but the intervals between ranks are not equal. Examples include ranking preferences or satisfaction ratings. While we know one value is greater than another, we cannot quantify the difference precisely.
Interval data consists of measurements on a scale with equal intervals between values but no true zero point. Temperature in Celsius is an example: the difference between 10°C and 20°C is the same as between 20°C and 30°C, but 0°C does not represent the absence of temperature.
Ratio data has equal intervals and a true zero point, meaning zero represents the complete absence of the measured variable. Examples include reaction time, number of errors, or height.
Statistical Tests for Different Designs:
The specific statistical test used depends on the research design and level of measurement:
-
The Wilcoxon signed-rank test is used for repeated measures designs when looking for differences between two conditions. It requires ordinal or interval/ratio data.
-
Spearman's rank correlation coefficient examines whether there is a relationship between two variables. It requires ordinal or interval/ratio data.
-
The chi-squared test examines whether there is an association between two categorical variables. It requires nominal data presented as frequencies.
These are examples of tests; the specific test used depends on the research design and level of measurement.
Directional and non-directional testing relates to hypothesis type:
A one-tailed test is used when the hypothesis is directional, predicting the specific direction of a difference or relationship. This test only examines one end of the distribution of possible results.
A two-tailed test is used when the hypothesis is non-directional, predicting a difference or relationship but not its direction. This test examines both ends of the distribution of possible results.
Critical value tables are used to determine whether an observed result is statistically significant. The researcher compares their observed (calculated) value with the critical value for their chosen significance level and sample size. Whether the observed value needs to be equal to or greater than the critical value, or equal to or less than it, depends on the specific test used.
Significance Levels and Hypothesis Rejection:
Significance levels indicate the probability that results occurred by chance. The conventional significance level in psychology is p ≤ .05, meaning there is a 5% probability or less that the results occurred by chance. This is considered sufficiently unlikely for researchers to reject the null hypothesis and accept the alternative hypothesis. More stringent significance levels (such as p ≤ .01 or p ≤ .001) may be used when greater certainty is required.
Decision-making process:
- If probability significance level: Reject null hypothesis, accept alternative hypothesis (results are statistically significant)
- If probability significance level: Retain null hypothesis (results are not statistically significant)
Type I errors occur when the null hypothesis is true but the researcher incorrectly rejects it, concluding there is an effect when there is not (false positive). This is more likely when a lenient significance level is used.
Type II errors occur when the null hypothesis is false but the researcher incorrectly retains it, concluding there is no effect when there actually is one (false negative). This is more likely when a stringent significance level is used.
The Significance Level Trade-Off:
The relationship between significance levels and error types involves a trade-off:
- Using a more stringent significance level (such as p ≤ .01) reduces the risk of Type I errors but increases the risk of Type II errors
- Using a more lenient significance level (such as p ≤ .10) increases the risk of Type I errors but reduces the risk of Type II errors
Researchers must balance these risks based on the consequences of each type of error in their specific research context.
Observed and critical values are compared to determine statistical significance. The observed value is calculated from the sample data using the chosen statistical test. The critical value is found in statistical tables based on the significance level, whether the test is one- or two-tailed, and the sample size or degrees of freedom. The specific decision rule (whether observed must be greater than, or less than, critical) depends on the test used.
Probability is expressed as a value between 0 and 1, or as a percentage. In statistical testing, symbols indicate probability levels:
- p ≤ .05 means probability of 5% or less
- p > .05 means probability greater than 5%
- p ≤ .01 means probability of 1% or less
Understanding probability is essential for interpreting statistical conclusions and evaluating research claims.
Methodological issues
Evaluating research quality requires understanding key methodological concepts that affect the trustworthiness and applicability of findings.
Validity refers to whether a study measures what it claims to measure or whether the conclusions are legitimate:
Internal validity concerns whether the study actually measures what it intends to measure within the research situation itself. Poor operationalisation of variables, confounding variables, and demand characteristics all threaten internal validity.
External validity concerns whether findings can be generalised beyond the specific research situation:
- Ecological validity refers to whether findings generalise to real-world situations beyond the research setting. Laboratory experiments often have low ecological validity due to artificial conditions.
- Population validity refers to whether findings generalise to other groups beyond the sample studied.
- Temporal validity refers to whether findings generalise across time, remaining relevant in different time periods.
Predictive validity concerns whether a test or measure successfully predicts future behaviour or performance.
Understanding Reliability:
Reliability refers to the consistency of measurements:
Internal reliability concerns whether a test or measure is consistent within itself. For example, if a questionnaire measures anxiety, all items should produce similar results.
External reliability concerns whether a test or measure produces consistent results across different occasions, researchers, or situations. A reliable measure should produce the same or very similar results when repeated under the same conditions.
Assessing reliability:
- Test-retest method: Administering the same measure at different times
- Inter-observer reliability: Having multiple observers record the same behaviour independently and comparing their recordings
Generalisability refers to the extent to which findings from a study can be applied to other situations, populations, or time periods. Research with high generalisability has broad applicability. Factors that limit generalisability include biased samples, artificial laboratory settings, and cultural specificity of findings.
Objectivity refers to research that is free from researcher bias and personal interpretation. Objective measurements are observable and quantifiable, with different researchers able to obtain the same results. Quantitative data is typically more objective than qualitative data.
Subjectivity refers to research influenced by personal opinions, interpretations, or bias. Subjective measurements involve interpretation and judgment. Qualitative data is typically more subjective, though systematic analysis methods can increase objectivity.
Researcher bias occurs when the researcher's expectations, beliefs, or preferences influence the research process or findings. This can affect which behaviours are observed and recorded, how data is interpreted, or how participants are treated. Standardised procedures, operational definitions, and blind techniques help reduce researcher bias.
Credibility refers to whether research findings are believable and trustworthy. Credibility is enhanced by transparent methodology, appropriate research design, acknowledgement of limitations, and consistency with other research in the area.
Analysis of qualitative data
Qualitative data requires systematic analysis approaches to identify patterns and themes while maintaining rigour.
Thematic analysis is a method for identifying, analysing, and reporting patterns (themes) within qualitative data. The process typically involves:
- Familiarisation with the data through repeated reading
- Generating initial codes to label interesting features
- Searching for themes by grouping codes into potential themes
- Reviewing themes to ensure they work in relation to coded data and the entire dataset
- Defining and naming themes
- Producing the final analysis with examples from the data
Thematic analysis provides rich, detailed accounts of data and is flexible in terms of theoretical approach. However, the analysis is subjective and requires careful researcher judgment. Different researchers may identify different themes from the same data.
Grounded Theory Approach:
Grounded theory is an approach where theory emerges from the systematic analysis of data rather than being imposed beforehand. Researchers collect and analyse data simultaneously, with analysis guiding further data collection. Theories are "grounded" in the actual data rather than being predetermined.
This approach allows theories to be closely tied to the data and can generate new insights. However, it is time-intensive and requires substantial expertise to conduct properly. The researcher's own background and assumptions inevitably influence interpretation.
When analysing qualitative data, researchers must evaluate findings in terms of:
- Credibility (whether findings are believable)
- Transferability (whether findings transfer to other contexts)
- Dependability (consistency and reliability)
- Confirmability (whether findings can be confirmed by others)
Providing rich descriptions, examples from data, and transparency about the analytical process all enhance the quality of qualitative research.
Conventions of published psychological research
Psychological research follows standardised conventions for reporting and publication, ensuring findings are communicated clearly and can be evaluated critically.
Standard Research Report Structure:
Research reports typically follow this structure:
Abstract - A brief summary (usually 150-250 words) of the entire study, including aims, methods, key results, and conclusions. Allows readers to quickly determine relevance.
Introduction - Provides background information and theoretical context. Reviews relevant previous research, identifies gaps in existing knowledge, and clearly states the aims and hypotheses. Moves from general background to specific study focus.
Method - Provides detailed information about how the study was conducted, allowing replication. Subsections typically include:
- Design: The type of study and research design used
- Participants: Details about who took part and how they were selected
- Materials/apparatus: Equipment, tests, or materials used
- Procedure: Step-by-step account of what happened
- Ethics: How ethical issues were addressed
Results - Presents findings without interpretation. Includes descriptive statistics, tables or graphs displaying data, and inferential statistics with test results and significance levels. Raw data is not typically included.
Discussion - Interprets results, explaining what they mean in relation to aims and hypotheses. Considers how results relate to previous research and theory, acknowledges limitations, suggests implications, and proposes future research directions. Moves from specific findings to broader implications.
Publishing psychological reports involves a peer review process. Researchers submit their report to an academic journal, where it is reviewed by other experts in the field (peer reviewers). These reviewers evaluate the quality of the research, identify strengths and limitations, and recommend whether the report should be published, revised, or rejected. This process helps ensure published research meets quality standards and contributes meaningfully to knowledge. However, peer review can be time-consuming, and biases in the review process may occur. The peer review system strengthens the credibility of published findings but is not infallible.
Ethical issues in research using humans
Psychological research involving human participants must adhere to ethical guidelines that protect participants' welfare and rights.
The BPS Code of Ethics and Conduct (2009) provides ethical principles for psychological research in the UK. Key ethical principles include:
Core Ethical Principles:
Informed consent - Participants should be given sufficient information about the research to make an informed decision about whether to take part. This includes information about the purpose, what participation involves, potential risks, the right to withdraw, and how data will be used. Consent must be given freely without coercion. Special considerations apply for children (who require parental consent) and vulnerable adults (who may need additional safeguards).
Right to withdraw - Participants must be able to leave the study at any time without penalty or pressure to continue. They should also be able to withdraw their data after participation. This right protects participants from feeling obligated to continue if they become uncomfortable.
Protection from harm - Participants should not experience physical or psychological harm beyond what they would encounter in everyday life. Researchers must identify potential risks in advance and take steps to minimise them. If harm does occur, appropriate support should be provided.
Deception - Deliberately misleading participants or withholding information should be avoided wherever possible. Deception may only be justified when the research could not be conducted without it and when the scientific or social value outweighs the ethical concerns. When deception is used, participants must be fully debriefed afterwards.
Debriefing - After participation, researchers should explain the full purpose of the study and address any questions or concerns. Debriefing is particularly important when deception has been used or when the study may have caused distress. Participants should leave the research in the same state they entered it.
Confidentiality - Participants' data and identity should be protected. Personal information should be stored securely and anonymised where possible. Published reports should not allow identification of individual participants unless they have explicitly consented to being identified.
Privacy - Researchers should respect participants' privacy, particularly in observational research. Observation in public spaces where people expect to be seen by strangers is generally acceptable, but observation in situations where privacy is expected raises ethical concerns.
Ethical Decision-Making:
Risk assessment involves identifying potential physical, psychological, social, or economic risks to participants before the study begins and implementing measures to minimise these risks.
When ethical concerns cannot be completely eliminated, researchers must weigh the potential benefits of the research against the ethical costs. The decision should involve ethics committees rather than being made solely by the researcher.
Ethical issues in research using animals
Animal research in psychology is governed by legal and ethical frameworks designed to protect animal welfare.
The Scientific Procedures Act 1986 (UK legislation) regulates research using animals. This Act requires that:
- Animal research must be licensed by the Home Office
- Researchers must demonstrate that the research has potential benefits that outweigh animal suffering
- The research could not be conducted using non-animal alternatives
- The minimum number of animals necessary are used
- Suffering is minimised through appropriate housing, care, and procedures
The 3Rs Principle:
The 3Rs principle guides ethical animal research:
Replace - Use non-animal alternatives wherever possible, such as computer simulations or cell cultures.
Reduce - Use the minimum number of animals necessary to obtain reliable results. Careful experimental design and statistical planning help achieve this.
Refine - Modify procedures to minimise animal suffering, enhance welfare, and improve scientific outcomes. This includes appropriate housing conditions, pain relief, and humane endpoints.
Home Office regulations require detailed proposals demonstrating scientific justification, ethical justification (why animal use is necessary and appropriate), and welfare provisions. Regular inspections ensure compliance with regulations.
Ethical arguments regarding animal research involve a tension between potential scientific and medical benefits and concerns about animal welfare and rights. Those supporting animal research argue it has led to important discoveries and treatments that benefit humans and animals. Critics argue that animals have rights not to be harmed, that animals experience suffering similarly to humans, and that results may not generalise reliably from animals to humans.
When evaluating animal research, consider the species used (as different species have different capacities for suffering), the severity of procedures, the potential benefits of the research, and whether alternatives could have been used. The ethical acceptability of animal research remains a subject of debate.
Remember!
Key Points to Remember:
-
Data types matter - Choose between qualitative and quantitative data based on your research aims; each type has distinct strengths and limitations that affect what you can conclude.
-
Control is key - In experimental research, controlling extraneous variables through proper design, standardised procedures, and awareness of potential confounds is essential for establishing cause-and-effect relationships.
-
Statistics tell the story - Descriptive statistics summarise data, while inferential statistics determine whether patterns are statistically significant; understanding both is necessary for interpreting research findings correctly.
-
Validity and reliability together - Valid research measures what it claims to measure; reliable research produces consistent results; both are necessary for trustworthy findings that contribute meaningfully to psychological knowledge.
-
Ethics protect participants - All psychological research must follow ethical guidelines that protect participants' welfare, dignity, and rights; ethical considerations are not optional extras but fundamental requirements of responsible research.