Behaviourist Approach: Operant Conditioning & Skinner's Research (AQA A-Level Psychology): Revision Notes
Behaviourist Approach: Operant Conditioning & Skinner's Research
What is operant conditioning?
Operant conditioning is a type of learning that occurs through the reinforcement or punishment of the consequences of an individual's actions. This learning principle suggests that behaviour is shaped by what happens after we act, rather than what happens before.
The approach works on the principle that if a behaviour leads to pleasant consequences, we are more likely to repeat it. If it leads to unpleasant consequences, we are less likely to repeat it. This forms the foundation of how we learn many everyday behaviours.
Types of consequences in operant conditioning
There are three main ways that consequences can affect behaviour:
Positive reinforcement
Positive reinforcement occurs when a behaviour is strengthened because it leads to positive consequences. The behaviour becomes more likely to recur because something pleasant is added after the behaviour occurs.
Examples include:
- Receiving praise from a teacher for good work
- Getting paid for completing a job
- Feeling satisfied after exercise
Negative reinforcement
Negative reinforcement occurs when a behaviour is strengthened because it helps avoid negative consequences. The behaviour becomes more likely to recur because something unpleasant is removed or avoided.
Examples include:
- Taking paracetamol to remove a headache
- Wearing a seatbelt to stop the annoying beeping sound
- Studying hard to avoid failing an exam
Remember that negative reinforcement strengthens behaviour by removing something unpleasant - it's not the same as punishment! Many students confuse these concepts.
Punishment
Punishment occurs when a behaviour becomes less likely to recur due to negative consequences. Unlike reinforcement, punishment weakens behaviour rather than strengthening it.
Examples include:
- Receiving detention for arriving late to school
- Getting a speeding ticket for driving too fast
- Being told off for talking in class
Skinner's research
B.F. Skinner conducted extensive research into operant conditioning using laboratory experiments with animals, particularly rats and pigeons. His work built upon earlier research by Thorndike, who formulated the 'Law of Effect' showing that behaviours followed by satisfying consequences are more likely to be repeated.
Worked Example: Skinner's Box Experiment
Participants: Laboratory rats
Aim: To investigate how consequences shape behaviour and test different types of reinforcement
Procedure: Skinner designed a special chamber (known as a 'Skinner box') containing:
- A response lever that the rat could press
- A food dispenser that released pellets when activated
- An electrified floor grid that could deliver mild electric shocks
- Lights and loudspeakers for additional stimuli
The rats were placed in the box and their behaviour was carefully observed. Initially, rats would explore randomly, but when they accidentally pressed the lever and received food (positive reinforcement), they quickly learned to press it repeatedly.
Findings:
- Rats rapidly learned to press the lever when it resulted in food delivery
- The behaviour became established through continuous reinforcement
- When the electrified floor was activated, rats learned to press the lever to turn it off (negative reinforcement)
- The timing and frequency of reinforcement significantly affected learning rates
Evaluation:
Strengths:
- Controlled laboratory conditions allowed precise measurement of behaviour
- Clear cause-and-effect relationships could be established
- Results were highly replicable across different animals
- Provided objective, scientific evidence for learning principles
Weaknesses:
- Animal research may not generalise directly to human behaviour due to cognitive differences
- Artificial laboratory setting lacks ecological validity
- Ethical concerns about using animals in potentially stressful situations
- Oversimplifies human learning by ignoring cognitive and social factors
Schedules of reinforcement
Skinner discovered that the pattern and timing of reinforcement significantly affects how quickly behaviour is learned and how resistant it is to extinction. He identified five main schedules of reinforcement:
Continuous reinforcement (CRF)
Every single response receives reinforcement. This produces steady but low response rates and behaviours learned this way extinguish quickly when reinforcement stops.
Example: Receiving a sweet every time you say 'please'
Fixed interval (FI)
Reinforcement is given after a fixed period of time, provided at least one response has occurred. Response rates tend to increase as the time for reinforcement approaches.
Example: Receiving a salary every month
Variable interval (VI)
Reinforcement is given after varying time periods. This creates steady response rates that are highly resistant to extinction.
Example: Checking social media for new messages (you never know when they'll arrive)
Fixed ratio (FR)
Reinforcement is given after a fixed number of responses. This creates high, steady response rates with brief pauses after each reinforcement.
Example: Buy 10 coffees, get the 11th free
Variable ratio (VR)
Reinforcement is given after an unpredictable number of responses. This is the most effective schedule, creating very high response rates that are extremely resistant to extinction.
Example: Gambling on slot machines (you never know which pull will win)
Variable ratio reinforcement is considered the most powerful because the unpredictability maintains motivation. This explains why gambling can become so addictive - people continue playing because they never know when the next win might occur.
The ABC model
The ABC Model for Behaviour Analysis
Skinner developed the ABC model to analyse behaviour systematically:
- Antecedents: What happens just before a behaviour occurs (the trigger or cue)
- Behaviours: The actual observable actions performed (what the person does)
- Consequences: What happens immediately after the behaviour (the result)
This model helps identify why behaviours occur and how they might be modified. For example, if a student disrupts class (behaviour) when given difficult work (antecedent) and receives attention from peers (consequence), the ABC analysis reveals how the attention reinforces the disruptive behaviour.
Real-world applications
Operant conditioning principles are widely used in various settings:
Education: Schools use reward systems like house points, certificates, and praise to encourage good behaviour and academic achievement. Sanctions like detentions serve as punishment for inappropriate behaviour.
Workplace: Businesses use commission structures, bonuses, and employee recognition programmes based on operant conditioning principles.
Therapy: Behaviour modification programmes help people overcome phobias, addictions, and other behavioural problems by systematically changing the consequences of behaviour.
Criminal justice: The legal system operates on operant conditioning principles, with laws designed to punish criminal behaviour and deter future offences.
Evaluation of operant conditioning
Strengths:
- Scientific approach using controlled experiments provides reliable evidence
- Practical applications have proven effective in education, therapy, and behaviour change
- Principles work consistently across different species and situations
- Offers clear, testable predictions about behaviour
Weaknesses:
- Reductionist approach ignores cognitive factors like thoughts, expectations, and decision-making
- Research primarily based on animal studies may not fully apply to complex human behaviour
- Overlooks individual differences in personality, motivation, and learning styles
- Cannot easily explain spontaneous or creative behaviours that occur without obvious reinforcement
- Deterministic view suggests humans have limited free will over their actions
Key Points to Remember:
- Operant conditioning shapes behaviour through consequences - positive reinforcement strengthens behaviour by adding something pleasant, while negative reinforcement strengthens it by removing something unpleasant
- Skinner's research provided scientific evidence that behaviour can be systematically modified using different reinforcement schedules in controlled laboratory conditions
- Variable ratio reinforcement is most effective at maintaining behaviour because the unpredictable nature of rewards keeps motivation high
- Real-world applications are extensive - from classroom management to workplace incentives, operant conditioning principles influence many aspects of daily life
- The approach has limitations in explaining complex human behaviour as it focuses mainly on observable actions rather than internal cognitive processes