What did Thorndike test for?
Do animals possess intelligence?
The study of consequence
The Law of Effect
4 Key Elements:
Any action has a consequece (can be positive or negative)
4 Key elements (cause and effect chain of events):
_ex:_ puzzle box, push lever, door opens –> cat more likely to push lever
What did Skinner say about behavior?
Said that there are 4 TYPES OF EXPERIENCES:
types of operant learning
2 that strengthen behavior: REINFORCEMENT
2 that weaken behavior: PUNISHMENT
what is reinforcement?
an increase in the srength of a behavior due to its consequences
to qualify as reinforcement:
positive reinforcement
a behavior causes appearnece or increase in intensity of a stimulus
stimulus: positive reinforcer (something animal seeks out)
“reward learning”
“add” something to increase behavior
negative reinforcement
behavior strengthened by the removal/ decrease in intensity of the stimulus
stimulus: _negative reinforcer _(usually something we want ot avoid)
“escape learning”
“escape avoidance learning”
how do we measure the strength of a behavior?
frequency
or
probability
of the behavior occurring
what is behavioral momentum?
behavior persists even AFTER punishment and other reinforcers
its hard to erase the learning
(“learning keeps building up”)
kinds of reinforcers
vs.
what is satiation?
(“say-she-ay-shun”)
when a reinforcer loses its ability to be effective due to changes in the environment, or the value of the reinforcer
ex: secondary reinforcers satiate more slowly because they tend to be weaker and aren’t as disruptive; provide instant gratification (money)
operant conditioning
behavior causes an effect (event contingent on behavior)
the behavior is voluntary/flexible (can be manipulated)
classical conditioning
events connected to behavior
(event 1 is contingent on event 2)
behavior is reflexive
what is contingency?
X and Y occur together or not at all
the amount of correlation between behavior and its consequence
(how reliably the reinforcer follows behavior)
*in terms of reinforcement: *increase the likelihood of behavior happening again
what is contiguity?
the time gap between behavior and its consequence
shorter gap = faster learning
after a delay between the behavior and the consequence, you may inadvertently reinforce other behaviors
*ex: *press lever, then cat chases tail, then door opens–> cat thinks chasing tail will open door
characteristics of reinforcers
size: larger is better
qualitative differences (individual differences can determine effectiveness)
ex: ice cream not a good reinforcer for someone who is lactose intolerant
behavior characteristics
some behaviors are easier to learn than others
what we’re trying to teach influences how quickly & easily it’s learned
what are motivation operations?
anything that changes the effectiveness of a consequence
2 types
neuromechanics of reinforcement
Olds & Milner–1950s
ESB (electrical stimulation of the brain)
shock rats brain
looked at reward pathway (pathway the deals w/ reinforcement; limbic system)
found that the reward pathway is dopamine rich w/ endorphins
stimulating dopamine receptors triggers a reward
THEORIES OF REINFORCEMENT:
positive
Drive reduction theory
drive: motivational states
reinforcers: events that reduce drives
THEORIES OF REINFORCEMENT:
positive
Relative value theory
reinforcers aren’t things–>they’re BEHAVIORS
ex: the reinforcer isn’t the food, it’s EATING the food
behaviors have different relative values (“rahter be doing X or Y?”)
something with a higher relative value will reinforce better that somwthing with a low relative value
**comparing differnet behaviors and how much you’d rahter be doing them**
pros: no need for internal “drives”
cons: doesn’t consider secondary reinforcers; sometimes low probability behavior will still be reinforcing under normal conditions
THEORIES OF REINFORCEMENT:
positive
Response Deprivation Theory
compare behaviors to themselves, **not to each other
_baseline: _ amount of time spent engaging in behavior under normal condtions
when does a behaviors become a reinforcer?
THEORIES OF REINFORCEMENT:
negative
Two-Process Theory
**2 processes occurring **
BOTH operant and Pavlovian conditoning occur
escape from an aversive stimulus to learn
when CS loses its aversiveness, avoidance persists; extinction fails to occur
THEORIES OF REINFORCEMENT
negative
One-process Theory
keep operant conditioning (because it can explain everything)
the reduction in the shock is reinforcing
stop avoidance behavior by forcing it to stop