Behaviourist Approach: Operant Conditioning & Skinner's Research Revision Notes for AQA A-Level Psychology

Behaviourist Approach: Operant Conditioning & Skinner's Research

What is operant conditioning?

Operant conditioning is a type of learning that occurs through the reinforcement or punishment of the consequences of an individual's actions. This learning principle suggests that behaviour is shaped by what happens after we act, rather than what happens before.

infoNote

The approach works on the principle that if a behaviour leads to pleasant consequences, we are more likely to repeat it. If it leads to unpleasant consequences, we are less likely to repeat it. This forms the foundation of how we learn many everyday behaviours.

Types of consequences in operant conditioning

There are three main ways that consequences can affect behaviour:

Positive reinforcement

Positive reinforcement occurs when a behaviour is strengthened because it leads to positive consequences. The behaviour becomes more likely to recur because something pleasant is added after the behaviour occurs.

Examples include:

Receiving praise from a teacher for good work
Getting paid for completing a job
Feeling satisfied after exercise

Negative reinforcement

Negative reinforcement occurs when a behaviour is strengthened because it helps avoid negative consequences. The behaviour becomes more likely to recur because something unpleasant is removed or avoided.

Examples include:

Taking paracetamol to remove a headache
Wearing a seatbelt to stop the annoying beeping sound
Studying hard to avoid failing an exam

chatImportant

Remember that negative reinforcement strengthens behaviour by removing something unpleasant - it's not the same as punishment! Many students confuse these concepts.

Punishment

Punishment occurs when a behaviour becomes less likely to recur due to negative consequences. Unlike reinforcement, punishment weakens behaviour rather than strengthening it.

Examples include:

Receiving detention for arriving late to school
Getting a speeding ticket for driving too fast
Being told off for talking in class

Skinner's research

B.F. Skinner conducted extensive research into operant conditioning using laboratory experiments with animals, particularly rats and pigeons. His work built upon earlier research by Thorndike, who formulated the 'Law of Effect' showing that behaviours followed by satisfying consequences are more likely to be repeated.

lightbulbExample

Worked Example: Skinner's Box Experiment

Participants: Laboratory rats

Aim: To investigate how consequences shape behaviour and test different types of reinforcement

Procedure: Skinner designed a special chamber (known as a 'Skinner box') containing:

A response lever that the rat could press
A food dispenser that released pellets when activated
An electrified floor grid that could deliver mild electric shocks
Lights and loudspeakers for additional stimuli

The rats were placed in the box and their behaviour was carefully observed. Initially, rats would explore randomly, but when they accidentally pressed the lever and received food (positive reinforcement), they quickly learned to press it repeatedly.

Findings:

Rats rapidly learned to press the lever when it resulted in food delivery
The behaviour became established through continuous reinforcement
When the electrified floor was activated, rats learned to press the lever to turn it off (negative reinforcement)
The timing and frequency of reinforcement significantly affected learning rates

Evaluation:

Strengths:

Controlled laboratory conditions allowed precise measurement of behaviour
Clear cause-and-effect relationships could be established
Results were highly replicable across different animals
Provided objective, scientific evidence for learning principles

Weaknesses:

Animal research may not generalise directly to human behaviour due to cognitive differences
Artificial laboratory setting lacks ecological validity
Ethical concerns about using animals in potentially stressful situations
Oversimplifies human learning by ignoring cognitive and social factors

Schedules of reinforcement

Skinner discovered that the pattern and timing of reinforcement significantly affects how quickly behaviour is learned and how resistant it is to extinction. He identified five main schedules of reinforcement:

Continuous reinforcement (CRF)

Every single response receives reinforcement. This produces steady but low response rates and behaviours learned this way extinguish quickly when reinforcement stops.

Example: Receiving a sweet every time you say 'please'

Fixed interval (FI)

Reinforcement is given after a fixed period of time, provided at least one response has occurred. Response rates tend to increase as the time for reinforcement approaches.

Example: Receiving a salary every month

Variable interval (VI)

Reinforcement is given after varying time periods. This creates steady response rates that are highly resistant to extinction.

Example: Checking social media for new messages (you never know when they'll arrive)

Fixed ratio (FR)

Reinforcement is given after a fixed number of responses. This creates high, steady response rates with brief pauses after each reinforcement.

Example: Buy 10 coffees, get the 11th free

Variable ratio (VR)

Reinforcement is given after an unpredictable number of responses. This is the most effective schedule, creating very high response rates that are extremely resistant to extinction.

Example: Gambling on slot machines (you never know which pull will win)

chatImportant

Variable ratio reinforcement is considered the most powerful because the unpredictability maintains motivation. This explains why gambling can become so addictive - people continue playing because they never know when the next win might occur.

The ABC model

infoNote

The ABC Model for Behaviour Analysis

Skinner developed the ABC model to analyse behaviour systematically:

Antecedents: What happens just before a behaviour occurs (the trigger or cue)
Behaviours: The actual observable actions performed (what the person does)
Consequences: What happens immediately after the behaviour (the result)

This model helps identify why behaviours occur and how they might be modified. For example, if a student disrupts class (behaviour) when given difficult work (antecedent) and receives attention from peers (consequence), the ABC analysis reveals how the attention reinforces the disruptive behaviour.

Real-world applications

Operant conditioning principles are widely used in various settings:

Education: Schools use reward systems like house points, certificates, and praise to encourage good behaviour and academic achievement. Sanctions like detentions serve as punishment for inappropriate behaviour.

Workplace: Businesses use commission structures, bonuses, and employee recognition programmes based on operant conditioning principles.

Therapy: Behaviour modification programmes help people overcome phobias, addictions, and other behavioural problems by systematically changing the consequences of behaviour.

Criminal justice: The legal system operates on operant conditioning principles, with laws designed to punish criminal behaviour and deter future offences.

Evaluation of operant conditioning

Strengths:

Scientific approach using controlled experiments provides reliable evidence
Practical applications have proven effective in education, therapy, and behaviour change
Principles work consistently across different species and situations
Offers clear, testable predictions about behaviour

Weaknesses:

Reductionist approach ignores cognitive factors like thoughts, expectations, and decision-making
Research primarily based on animal studies may not fully apply to complex human behaviour
Overlooks individual differences in personality, motivation, and learning styles
Cannot easily explain spontaneous or creative behaviours that occur without obvious reinforcement
Deterministic view suggests humans have limited free will over their actions

bookmarkSummary

Key Points to Remember:

Operant conditioning shapes behaviour through consequences - positive reinforcement strengthens behaviour by adding something pleasant, while negative reinforcement strengthens it by removing something unpleasant
Skinner's research provided scientific evidence that behaviour can be systematically modified using different reinforcement schedules in controlled laboratory conditions
Variable ratio reinforcement is most effective at maintaining behaviour because the unpredictable nature of rewards keeps motivation high
Real-world applications are extensive - from classroom management to workplace incentives, operant conditioning principles influence many aspects of daily life
The approach has limitations in explaining complex human behaviour as it focuses mainly on observable actions rather than internal cognitive processes

Behaviourist Approach: Operant Conditioning & Skinner's Research (AQA A-Level Psychology): Revision Notes