Pavlovian conditioning
Acquisition of new behavioural response to previously neutral stimulus due to experiencing a predictive relationship between it (CS) and a biologically-relevant stimulus (US)
ISSUES:
Neural basis of pavlovian conditioning
Appetitive conditioning based on working of dopamine system –> for reward + learning
What does dopamine do in pavlovian conditioning (3 theories)
- evidence for them
O’Doherty et al., (2002) - fMRI + 2 stimuli (one predicts salt, the other sugar)
Merenowicz + Shulz (1994) - single cell recordings in dopamine neurons - monkey
Necessary conditions for learning
Awareness of CS-US relationship (explicit or implicit knowledge)
Temporal contiguity: US + CS close together in time
- Shanks et al., (1989) - computer key + outcome –> increase delay, decreased judgement of causality (from 70% to chance from 0-16s)
- BUT: flavour-aversion learning (TC not nec):
Andrykowski + Otis (1990) - chemo patients feel nauseous after treatment: no relationship between food + nausea time delay and aversion development (<1 day)
- BUT: blocking (TC not sufficient):
Despite contiguity being same for CS1/CS2 and US, prior learning of CS1-US blocked new learning about CS2
Tobler et al., (2006) - fMRI - prior learning blocked new learning about new stimulus despite same amount of training
Salience: how much you notice/case
- easier to see, new, more important, biological preparedness (inbuilt bias)
Attention: more salient = pay more attention
- previous experience affects attention - latent inhibition (CS pre-exposed without US, retards learning of CS-US after)
- Nelson + Sanjuan (2006) - space-ship computer game, stop clicking when attacked:
pre-exposure: red sensor (meant to be informative of attack but wasn’t)
learning phase: 50% same spacescape, flashes now predictive
worse suppressing mouse clicking if pre-exposed (context specific - not case if new context)
Extinction and inhibition:
Extinction occurs when CS-US then CS-no US (pavlovian conditioning)
IAS - learn to anticipate absence of US given a particular CS:
Superlearning: after conditioned inhibition (CS2 having IAS)
Blocking –> previous association of CS1 + US then CS2 + CS1 –> same US - blocks learning about CS2 because same outcome
Why does pre-exposure retard learning? (latent inhibition/blocking)
- 2 theories of attention
Mackintosh 1975:
Pearce & Hall (1980):
- only need to pay attention when first learning –> so pay attention to unreliable stimulus as learning still needs to occur (until it reaches a stable asymptote)
Hogarth et al., (2008): distractor X:
Prediction error
PE:
Brain:
Rescorla-Wagner rule:
Increase in associative strength of CS results from extent to which current associative strength deviates from perfect learning (deviation = PE)
- assumes a limited amount of associative strength to go around
Explains:
BUT: can’t explain latent inhibition
Instrumental learning (operant conditioning)
Operant conditioning = change in behaviour caused by causal relationship between behaviour + biologically relevant stimulus (reinforcer)
Reinforcement (responding increase over trials):
- positive –> add something good
- negative –> remove something bad
Punishment (responding decrease over trials):
- positive –> add something bad
- negative –> remove something good
Schedules of reinforcement - relationship between how often you do behaviour + whether it has outcome:
Neural basis of operant conditioning
Ventral striatum = important for learning about reward
Dorsal striatum = important in stimulus-response learning
O’Doherty (2004):
Habit vs goal directed behaviour (in operant conditioning)
Habit:
stimulus -(outcome/reinforcement)-> response - outcome strengthens S-R relationship
- good because: quick reaction (if dangerous stimuli) + quick learning (for dealing with predictable outcomes)
Goal-directed:
stimulus -> response -> outcome - do behaviour to bring about outcome
- interaction between: representation of causal action-outcome relationship + representation of current incentive value of outcome
Klossek et al., 2008: 2 buttons - to 2 attractive video clips, children satiated on one:
BRAIN:
De Wit et al. (2009) - vmPFC = activated in goal-directed learning; dmPFC = activated in habit learning
Valentin et al. (2007):
Generalisations:
Learning needs to be general enough to be flexible BUT not so general it produces inappropriate behaviour
Normal distribution of stimulus property (x) x associative strength of response (y):
Two stimuli - what to do if as similar to both?
Wills + Mackintosh (1998) –> artificial dimension with uncategorisable icons
Categorisation
Humans group similar stimuli together in categories
Low distortion exemplars = fewer discrepancies to typical category members
- quicker to categorise
High distortion exemplars = fit into category but aren’t very typical
Prototype effect –> even if participant hasn’t seen prototype before, it will still be more accurately categorised than other novel exemplars
Typicality effect –> low-distortion exemplars will be more accurately classified than high distortion exemplars
Theories of categorisation:
EXEMPLAR THEORY:
PROTOTYPE THEORY:
Rule learning
Positive patterning: A-, B-, AB+
Negative patterning: A+, B+, AB-
Shanks + Barby (1998) –> allergies + patterning
BUT: Gambler’s fallacy - based on rule-based learning on false assumption; associative learners do better