Two approaches: What motivates instrumental behaviour?
Three-term Contingency
• The three terms (S, R and O) allow for many different types of associations to be formed (what is it that the animal is learning? Why does it perform this behaviour? Is it because it sees the S, or because it is thinking about the O?)
S-R association
looking for a general outcome (I.e. going to class looking for a better grade) but the particulars are done without thinking (i.e. where you sit in class)
S-O Association and Expectancy
• Expectancy of reward (O) forms when (S) present
• Pavlovian conditioning may be involved in instrumental learning via S-O pairings
• Instrumental response motivated by two factors:
1. S-R association
2. S-O association
• Precursor to modern two-process theory
Two-Process Theory
Rescorla & Solomon, 1967
Assumes two distinct types of learning:
• Pavlovian Conditioning
• Instrumental Conditioning
S-O association activates an emotional state which motivates instrumental behaviour (R)
• Type of emotional state depends on types of reinforcer (O)
How do you test this idea….
• Pavlovian expectancy motivates instrumental
behaviour
• Presenting a Pavlovian CS should alter
instrumental behaviour
• Test with a transfer-of-control experiment
Pavlovian Instrumental Transfer Test
• Three steps
1. Instrumental conditioning*
2. Pavlovian conditioning*
3. Instrumental responding with presentations of Pavlovian CS
• Predictions
• If Pavlovian S-O motivates, then responding should increase when CS is presented
tests two process theory
• Positive and negative emotions can be elicited by CS
• Positive emotions increase instrumental responding
• Negative emotions decrease instrumental responding
• However, The CS can elicit more than just positive or negative emotional states
- The CS can elicit reward-specific expectancies
Transfer of Control Experiment
Lovibond, 1983
Trained rabbits in an operant task
Followed this by classical conditioning
• Exp. 1 & 2, CS+
• Exp. 3, CS+ and CS-
Tested the effects of the CSs on operant responding
rabbits pressed head against lever to get food
trained with particular stimulus turning on to get food (had no access to lever at this point)
trained with CS+ and CS- (two diff emotional states)
now if we present rabbit with the lever, will they response more if presence with the CS+ at the same time
Effects of Excitatory CS on Operant Responding
• Instrumental responding increased when CS+ was presented in the experimental group (paired)
• Instrumental responding stayed the same in the unpaired control group
Effects of an Inhibitory CS on Operant Responding
• Instrumental responding increased when CS+ was presented in the experimental group (paired)
• Instrumental responding did NOT decrease with CS- presentation
• BUT, presentation of CS+ and CS- simultaneous also did NOT increase instrumental responding
Reward specific expectancies – Transfer of control with 2 CSs
IF positive emotional state
• Increased responding to both CS’s (when signaling one or the other)
IF reward specific
• Increased responding for CS that matches current instrumental reinforcer
Two-process theory ignores ______ __________
How to test for R-O associations?
How to test for R-O association?
• Devalue the O
• Then test for R
• remember US devaluation?
(Tested for S-R versus S-S learning)
O devaluation • Rats reinforced for pushing a rod to the left or right • One direction = food • Other direction = sugar water • After training, devalue one reinforcer via LiCl injections • Test preference
IF S-O association:
• decrease in both responses because devaluation had effected the
properties of S (the lever, the chamber, etc.)
IF R-O association:
• One response (the one that results in the devalued outcome) to be
lower, suggesting that separate R-O associations were formed
• Difficult to explain different responses to different stimuli
IF S-R association:
• No change in responding as model does not include the reinforcer (O)
S(R-O) Hierarchical relations
Drive Reduction Theory
Hull (1943): Events are reinforcing if they reduce a physiological drive
• Obtain water if thirsty
• Acquire conversation if lonely
Establishing operation: Making a rat hungry so it will work for food
Drive Reduction Theory is not…
Not a comprehensive theory of reinforcement
• Rats will press a lever to obtain saccharin
• People will pay money for ‘frivolous’ goods
• Too many reinforcers that neither reduce
drives nor are associated with primary
reinforcers
Consummatory-Response Theory
Relative Value Theory
•With total freedom, different behaviours have
different probabilities of occurring
• e.g., watching Netflix → high probability, studying →low probability
•Premack Principle: H = high probability response L = low probability response •L→H, reinforces L •H→L, does not reinforce H
How to test Premack Principle
Premack Principle & Children
free-eating and pinball playing
Any high probability response can serve as a reinforcer for a lower probability response
Applications of Premack Principle
Applications of Premack Principle
• Clinical patients
• Find out what behaviour is reinforcing (high probability of occurring) for each individual
• Delayed Echolalia vs. Perseverative behaviours in children with autism
• Notion that reinforcers are responses, and not
stimuli
Premack Principle and BMod
• Data from 2 children with autism
• Individual preferred behaviour was a better
reinforcer than food
Any activity…
…could be a reinforcer…
… if it is more likely (“preferred”) than the operant response.
To determine what reinforcers to use for an individual, look at what they do
Response-Deprivation hypothesis (Disequilibrium Model)
Behaviour is reinforcing when the individual is prevented from engaging in the behaviour at its normal frequency
• a sated rat will not perform an operant response for food
Premack realized restricting access was important this, but it was not central to his theory
Response-Deprivation hypothesis (Disequilibrium Model)
Premack (1962)
rat study
water deprived rat
Unlimited water
Homeostasis of Behaviour Distribution
• Assumption that all animals have a preferred
distribution of activities
Behavioural Bliss Point
• Organisms thought to have a preferred distribution of behaviour (Bliss Point)