Olds & Milner (1954) Positive Reinforcement Produced by Electrical Stimulation (Edexcel A-Level Psychology): Revision Notes
Olds & Milner (1954) Positive Reinforcement Produced by Electrical Stimulation
Background and context
The research conducted by James Olds and Peter Milner in 1954 provided early evidence for the existence of specific brain regions involved in processing reward and positive reinforcement. Originally, the study was designed to investigate the neural systems that regulate sleep. During their experiments, Olds and Milner were stimulating different areas of rats' brains to identify which systems controlled sleep patterns.
The breakthrough in this research came entirely by chance. While investigating sleep systems, the researchers made an unexpected observation when one rat consistently returned to the location in its cage where a particular electrical stimulation had been administered. This serendipitous discovery led to a complete shift in the research focus and ultimately revealed the existence of reward centres in the brain.
The rat appeared to find this stimulation pleasurable, prompting the researchers to investigate why this occurred and which brain regions were responsible for this rewarding effect.
Aim
To explore whether electrical brain stimulation acts as a positive reinforcement in rats.
Participants
The study used 15 male, hooded rats, each weighing approximately 250g at the start of the experiment.
Procedure
Electrode implantation
Electrodes were surgically implanted into the brains of the rats under anaesthesia. The electrodes were secured to the skull using screws and were inserted into different regions of the brain for each rat. This allowed the researchers to investigate which specific brain areas were associated with reward or reinforcement.
Following the surgery, rats were given three days to recover before testing began, ensuring they were in appropriate condition for the experimental procedures.
Apparatus
The electrodes were connected to an electrical lead suspended from the ceiling of the cage. This design ensured that the electrode connection had minimal interference with the rats' health or free movement within the testing environment.
Rats were placed in an operant conditioning chamber (commonly known as a Skinner-type box). When the rats pressed a lever in this box, an electric current was delivered directly to their brains through the implanted electrodes. The voltage used ranged from 0.5 to 5 volts, varying between individual rats, with just enough intensity to produce a noticeable effect on behaviour.
Testing conditions
Rats were subjected to two experimental conditions:
1. Acquisition testing: This phase lasted for a total of 6–12 hours for the whole experiment. The stimulator was turned on so that when rats pressed the lever, they received electrical stimulation to their brains. For some tests, a time delay switch was used that switched off the electric current after a set amount of time if the rat continued to hold down the lever.
2. Extinction testing: This phase lasted for a total of 1–2 hours for the whole experiment. The stimulator was turned off so that when rats pressed the lever, no electrical current was delivered.
Rats were tested over a period of two to four days. Each day involved three hours of acquisition testing, followed by 30 minutes of extinction testing. This alternating pattern allowed researchers to compare behaviour when reinforcement was available versus when it was absent.
Behavioural measurement
The amount of time each rat spent responding to the electrical stimulation during acquisition was compared to the time spent responding during extinction. A behaviour was classified as a 'response' if there was a clear behavioural reaction shown at least once in 30 seconds. If no behavioural reaction occurred within 30 seconds, this interval was recorded as a period of 'no response'.
Post-mortem analysis
All rats were killed after testing so that their brains could be examined under a microscope. This allowed the researchers to identify precisely which brain structures had been stimulated by the electrodes during the experiment.
Results
Because some rats had 12 hours of acquisition testing whilst others had only 6 hours (as they were tested for less time), only the results from the first 6 hours for all rats were included in the analysis to ensure fair comparison.
Response patterns
The time rats spent responding during acquisition and extinction phases was measured in 30-second intervals. Table 8.1 shows the percentage of time rats spent responding (i.e., the number of 30-second intervals in which a behavioural reaction was observed) during both phases, along with the brain area being stimulated.
| Number of rats | Location of electrode | Area of brain | Range of percentage of acquisition time spent responding | Range of percentage of extinction time spent responding |
|---|---|---|---|---|
| 4 | Septal | Forebrain | 75–92% | 6–21% |
| 1 | Corpus callosum | Forebrain | 6% | 3% |
| 1 | Hippocampus | Thalamic | 11% | 14% |
| 1 | Caudate | Forebrain | 4% | 4% |
| 2 | Cingulate | Thalamic | 36–37% | 9–10% |
| 1 | Medial lemniscus | Thalamic | 0% | 4% |
| 1 | Mammillothalamic tract | Thalamic | 71% | 9% |
| 2 | Medial geniculate | Midbrain | 0% | 21–31% |
| 2 | Tegmentum | Midbrain | 2–77% | 1–81% |
Key findings
The highest response rates were found in the central portion of the brain, specifically the septal area. Rats with electrodes in this region spent more than 75% of their acquisition time responding to the stimulation. During extinction, they spent less than 22% of their time responding. This pattern demonstrates that stimulation of the septal area acts as a primary reward, motivating the rats to press the lever more frequently when stimulation was available.
Conversely, rats with lower percentage scores during acquisition (but similar or higher scores during extinction) suggest that stimulation of those brain areas was less rewarding. Brain areas with a 0% response during acquisition suggest a punishing effect – stimulation of these regions may have been aversive, causing the rats to avoid lever pressing.
Conclusion
Certain areas of the brain, when stimulated, have a rewarding effect on behaviour, particularly the septal area. This suggests that there is a specific 'reward centre' within the brain that processes positive reinforcement.
Evaluation
Strengths
High internal validity: The study was conducted in a controlled laboratory environment where all rats were implanted with electrodes and tested in the same operant conditioning chamber. No reinforcement was used except the electrical stimulus, which allowed the researchers to establish a clear cause-and-effect relationship between brain stimulation and behaviour. External variables were minimised, increasing the validity of the research findings.
Methodological control: The use of the same apparatus and procedure for all rats, with systematic variation only in electrode placement, allowed for direct comparison between different brain regions. The alternation between acquisition and extinction phases provided a strong test of whether the brain stimulation was genuinely acting as a reinforcer.
Ethical consideration regarding duration: When experimenters realised there was little difference in results between rats tested for different durations, they reduced the testing time for later rats. This meant that rats tested later were subjected to the procedure for a shorter time, minimising potential harm.
Animal welfare monitoring: Researchers monitored whether rats showed signs of pain after electrode implantation and during testing. As no signs of pain were observed, this suggests the rats did not directly suffer as a result of the procedure.
Provision of basic needs: Rats were not deprived of food or water (except when in the Skinner box) and were allowed to eat normally. This protected the basic welfare needs of the animals involved in the research.
Weaknesses
Ecological validity concerns: Despite being lab-reared, the experimental setting and conditions were highly unnatural for rats. Having wires attached to their skulls and receiving electric shocks does not reflect normal rat behaviour or experience. This may have influenced their behaviour in ways that were reactions to the experimental conditions rather than purely reinforcement-driven responses. The findings may not generalise to natural settings.
Lack of applicability to humans: The study provides insight into why humans engage in behaviours they find rewarding, but there is no guarantee that the reward centres identified in rats' brains also exist in humans or function in the same way. Rats have a different brain structure from humans, limiting the extent to which these findings can be directly applied to human behaviour.
Ethical concerns regarding invasive procedures: The study involved highly invasive procedures, including surgical implantation of electrodes into the rats' brains under anaesthesia. This is a procedure with inherent risks and causes physical harm to the animals.
Terminal procedures: All rats were killed after the experiment so their brains could be examined under a microscope. This means the animals were unnecessarily harmed for research purposes. Whilst this was necessary for Olds and Milner to identify which brain structures had been stimulated, it raises ethical questions about the cost to animal welfare.
Inconsistent voltage administration: There were variations in the voltage of stimulation given to different rats. This introduced a potential inconsistency that may have affected the results, reducing the replicability of the study. Future experiments would need to determine the appropriate voltage amounts for rats before they respond consistently.
Small and unrepresentative sample: The number of rats with electrodes in each brain area varied considerably, with some brain regions being observed in only one rat. The same findings may not be found if a larger, more evenly distributed sample were investigated across all brain areas. The small sample size was insufficient to provide meaningful results or to draw firm conclusions from the experiment. This means that animals may have been unnecessarily harmed without generating robust scientific knowledge.
Sample size justification: It could be argued that the sample size was insufficient to justify the harm caused to the animals, particularly since rats were destroyed following the experiment solely to allow examination of brain structures.
Wider applications
Whilst this research is now over 60 years old, the study by Olds and Milner paved the way for extensive investigation into the effect of deep brain stimulation on behaviour and mood. Brain stimulation techniques have since been used as treatments for severe depression and to manage pain in extreme cases that are typically resistant to other medical interventions. This demonstrates the lasting impact of the study on both scientific understanding and clinical practice.
Remember!
Key Points to Remember:
- Olds & Milner (1954) demonstrated that electrical stimulation of certain brain regions, particularly the septal area, acts as a positive reinforcement in rats.
- Rats with electrodes in the septal area spent 75–92% of acquisition time responding, but only 6–21% of extinction time responding, showing the reinforcing effect of brain stimulation.
- The study provided early evidence for the existence of a 'reward centre' in the brain.
- Strengths include high internal validity due to controlled laboratory conditions, clear cause-and-effect relationships, and monitoring of animal welfare.
- Weaknesses include poor ecological validity, limited applicability to humans due to species differences, ethical concerns about invasive procedures, inconsistent voltage administration, and a small sample size that may have unnecessarily harmed animals without producing robust findings.