Learning
the acquisition,
from experience, of new
knowledge, skills, or responses
that results in a relatively
permanent change in the state
of the learner
Classical Conditioning
learning by association
something that didn’t cause reaction before can now trigger a response automatically
over time this response will decrease without the conditioned stimulus and unconditioned being paired (bell and food)
Pavlov (classical conditioning)
Pavlov was a Russian physiologist who won
Nobel Prize for Medicine for his research on
the digestive system of dogs
* His major contribution to Psychology (theory
of classical conditioning) was entirely
serendipitous…
“Pavlov became aware of an odd and
annoying phenomenon: the dogs would
secrete gastric juices at times other than
feeding – for instance when they heard
their keeper shortly before mealtime”
(Hunt, 2007)
Unconditioned stimulus/response vs conditioned stimulus/response
-Unconditioned Stimulus: something that reliably
produces a naturally occurring reaction in an organism
-Unconditioned Response: a reflexive reaction that is
reliably produced by an unconditioned stimulus
-Conditioned Stimulus: A previously neutral stimulus
that produces a reliable response in an organism
-Conditioned Response: a reaction that resembles
an unconditioned response but is produced by a
conditioned stimulus
Second-order Conditioning
Second-order conditioning is when something new gets linked to a conditioned stimulus instead of the original thing.
Ex: Advertising– putting steph curry in a subway ad.
Outside of the ad:
steph curry: Conditioned stimulus
Positive experiences: Unconditioned stimulus
Positive emotions: Unconditioned response
In the ad:
steph curry: Conditioned stimulus
Positive experiences: conditioned stimulus
Positive emotions: conditioned response
You already feel good (positive emotions) when you think about fun or positive experiences. Now, because Steph Curry is in the ad, you start to connect him (and Subway) with those good feelings — so just seeing him in the ad makes you feel happy, even without the original experience.
Example: If a dog learns that a bell means food, and then a light always comes before the bell, the dog will eventually drool just at the light — even without hearing the bell or seeing the food.
Contingency Learning
Contingency learning is when you learn that one thing depends on or predicts another — like realizing that if one event happens, another will likely follow.
Example: If every time you hear thunder (event 1), lightning follows (event 2), you learn that thunder means lightning is coming.
Contingency vs Contiguity
contingency: 2 things happen close to each other in time
contiguity: one thing predicts the other
ex of Rescorla & Wagner study:
2 groups have same number of CS - US pairings (same amt of contiguity)
only 1 group CS - US pairing was more reliable (each time CS occurred, US also occurred) therefore more contingent
Finding: contiguity doesn’t matter as much as contingency, group that had more reliable pairing learned better
conditioning should be easier when the ____ is unfamiliar
conditioning should be easier when the conditioning stimulus is unfamiliar
generalization (Little Albert Experiment)
Generalization: the CR is observed even though the CS is
slightly different from the CS used during acquisition
Example: If a dog is trained to drool when it hears a bell, it might also drool when it hears a similar sound, like a phone chime — even though it’s not the exact same bell.
Can classical conditioning be a basis for treatments for phobias? explain the relationship between CS/US in acquisition of phobia and exposure therapy
Acquisition of phobia: Conditioned stimuli and unconditioned stimuli pairings
Exposure therapy: Extinguish fear to Conditioned Stimuli
via presentation of CS
without Unconditioned Stimuli
Acquisition of phobia: You develop a fear when something scary (unconditioned stimulus, like a dog bite) gets paired with something neutral (conditioned stimulus, like a dog). After that, the neutral thing alone makes you afraid.
Exposure therapy: To get rid of the fear, you face the conditioned stimulus (the dog) without the scary event happening. Over time, your brain learns that the CS is safe, and the fear goes away.
Operant/Instrumental Conditioning
type of learning where consequences of an organism’s behavior determine whether it will repeat that behavior in the future
-“learning what works and learning
what doesn’t work”
Operant conditioning vs classical conditioning
Classical:
“involuntary”/”reactive” behaviors
Operant:
“voluntary”
Origins of Operant Conditioning: Thorndike and the Law of Effect
Thorndike’s Law of Effect: Behaviors that are followed by a
‘satisfying state of affairs’ tend to be
repeated, and those that produce an
‘unpleasant state of affairs’ are less
likely to be repeated
Reinforcement of behavior:
the increase of behavior due to a satisfying state
Punishment of Behavior:
The decrease of a behavior due to an unsatisfying state
Positive vs negative reinforcement and punishment
Reinforcement
Positive: stimulus added, increases likelihood of behavior
Negative: stimulus removed, increases likelihood of behavior
Punishment
Positive: stimulus added, decreases likelihood of behavior
Negative: stimulus removed, decreases likelihood of behavior
Intermittent Reinforcement Effect
Behaviors that are rewarded sometimes (not every time) are harder to stop than behaviors that are rewarded every time.
Explain these:
Interval vs Ratio
Fixed vs Variable
Interval vs. Ratio Schedules:
* Interval: based on the time
intervals between reinforcements
* Ratio: based on the rate of
responses to reinforcements
Fixed vs. Variable Schedules
* Fixed: presented at
fixed/consistent interval / ratio
* Variable: presented on an
average interval / ratio
ex:
fixed-interval=every 2 minutes exactly
variable-interval=on average every 2 mins
fixed-ratio=every 5th behavior exactly
variable-ratio=on average every 5th behavior
Different combinations of these and their effects:
Interval vs Ratio
Fixed vs Variable
fixed-interval=every 2 minutes exactly
-low rate of responding
-low rate at beginning of interval, high rate as end nears
variable-interval=on average every 2 mins
-low rate responding
-steady rate of responding
-slow extinction of behavior
fixed-ratio=every 5th behavior exactly
-high rate responding
-higher rate as reward nears, low rate after reward
-fast extinction of behavior
-good for initial teaching of behavior
variable-ratio=on average every 5th behavior
-high rate responding
-steady rate responding
-very resistant to extinction
Tolman Studies: Latent Learning
Findings?
A lot of learning can happen
without reward / punishment
challenges the simplistic behaviorist view
Latent Learning
learning that
occurs without immediate
behavioral manifestation
The Garcia Effect
(Taste Aversion / Biological Preparedness)
&
John Garcia study
The study of taste/food aversion revealed unique patterns of learning that are unlike typical (classical) conditioning phenomena
John Garcia study:
Key idea: Some associations are easier to learn than others.
Example: Rats easily learned taste → sickness, but not sound → sickness.
Habituation vs sensitization
Habituation is a decrease in response to a repeated stimulus, while sensitization is an increase in responsiveness after exposure to a strong or noxious stimulus
Ex of Garcia Effect in Cancer Patients
Cancer patients undergoing chemotherapy often
develop an aversion to food they eat before treatment
* This aversion can be diminished by eating a weird-
flavored candy before treatment (aversion to the candy
is developed instead)
Challenges to Conditioning (Findings that highlighted limitations to conditioning)
Observational Learning and “Bobo Doll” Experiment
Learning by watching actions of others
“Bobo Doll”:
-Children who watched an adult act
aggressively were more likely later
to act aggressively
-Reward and punishment of adult
aggression would influence
children’s behavior
* ***Much of what we learn is not
from direct reward & punishment