Operant Behavior
• Pavlovian responses are passive; they happen to you (reflexes, autonomic reactions).
• Operant behavior is active; it consists of the things you do.
Pavlovian responses are elicited(by stimuli) Operant behavior is emitted(by an organism)
• B. F. Skinner coined the term “operant behavior” to characterized the behavior studied by his influential predecessor, Edward Thorndike.
• Many of Skinner’s early ideas were derived from Thorndike’s work, but Skinner developed his own terminology and philosophical perspective (radical behaviorism).
Thorndike: The Law of Effect
Skinner’s operant learning is not S-R (reflex) learning
• Early behaviorists like John Watson used the reflex terminology of stimulus-response (S-R) psychology to account for all behaviors.
• From this perspective, stimuli force responses much like meat in a dog’s mouth forces salivation.
• Using the reflexive, S-R approach leads to awkward, mechanistic explanations of voluntary, goal-oriented behaviors.
• Example on board of Clark Hull’s S-R chaining
– Every point in a maze is a stimulus that drives an organism reflexively to the reinforcement at the end of a maze.
Molecular vs Molar Behaviorism
Watson and Hull’s early Molecular approach:
All behavior, even operant behavior,is made up of S-R chains.Complex behavior = more complex chains.
Tolman & Skinner’s molar approach:
Behavior is made up of learned,whole, integrated acts that can be extended over time in complex ways. No need to describe or account for every twitch: just describe the final behavioral outcome—the rat runs to the end of the maze more rapidly than before.
S-R vs R-S learning
The Selection of Operant Behavior
Methodological Advances
• Discrete Trial Procedures: Behavior of an organism ends a trial (e.g., escape from box, or finding the end of a maze ends a trial; start again)
– Requires researchers to be present during task.
• Skinner introduced “Free Operant Procedures” : Ongoing operant behavior can be “freely” repeated any number of times; rates of responding automatically recorded by a cumulative recorder.
– Researchers can do other things while behavior is recorded automatically.
Operant Conditioning
3
• Any procedure through which an operant behavior becomes stronger or weaker depending on the consequences of that behavior.
–Strength of behavior refers to its frequency, rate, probability of occurrence, intensity, or persistence.
Contingency
• In operant conditioning, we say that the strength of an operant behavior is contingent upon its consequences.
–Recall that contingency implies an if-then relationship.
• In operant conditioning, if I do this, then a certain consequence follows.
• If behavior, then consequence.
Operant Conditioning Procedures
Environmental Consequences
• Consequences of a behavior can be described in terms of whether an environmental stimulus is presented or removed as a result of a behavior.
–Sometimes our actions result in something becoming present or available to us.
–Think (+) (addition)
–Sometimes our action result in something being removed or taken from us.
–Think (−) (subtraction)
Two types of Reinforcement
Positive Reinforcement: A response is strengthened by adding a stimulus or increasing its intensity
(e.g., + food, praise)
The stimulus is called a positive reinforcer.
Negative Reinforcement: A response is strengthened by subtracting a stimulus or decreasing its intensity
(e.g.,—shock, nagging)
The stimulus is called a negative reinforcer.
Positive reinforcers as rewards
Negative reinforcement and escape
Punishment
Any stimulus or outcome that decreases the strength of an operant behavior.
–Positive Punishment: Decreases the likelihood of a behavior by presenting (adding) a stimulus (usually aversive).
–Negative Punishment: Decreases the likelihood of a behavior by removing (subtracting) a stimulus (usually a desirable one).
Kinds of Reinforcers: Primary & Secondary
• Primary reinforcer: any reinforcer that is not dependent on another reinforcer for its reinforcing properties.
–Often biologically based; innately or naturally reinforcing for members of a species.
–Food, water, sexual stimulation, or relief from pain, cold, or other aversive conditions.
• Secondary Reinforcer: Any reinforcer that has acquired its reinforcing properties through its association with other reinforcers.
–Also called “conditioned reinforcers”
• e.g., pair a buzzer with food (classical conditioning)
• Then the buzzer can be used to reinforce operant behavior (such as pressing a bar).
Generalized Reinforcers
Kinds of Reinforcers
–Contrived (“extrinsic”) reinforcers: Any reinforcing event that has explicitly been designed to modify behavior.
• Almost all reinforcements in research and applied behavior analysis.
–Natural (“intrinscic”) reinforcers: Any reinforcing event that follows automatically, i.e., naturally from a behavior.
• Pleasure from succeeding on a task.
• Scratching an itch.
• Reading for intrinsic pleasure (natural) vs. reading to earn a star in class (contrived).
Social Reinforcers
• Social reinforcers are consequences that involve personal communication from others
– Praise, touch, compliments
– Many mammals respond to these social reinforcers( Limbic System): “good dog” in a happy, warm tone. ‘
• Social reinforcers are easy to deliver, can be applied immediately, and are not very disruptive
Shaping & Chaining
• By definition, reinforcement must come after a behavior is emitted (a reinforcer is a consequence of the behavior).
Shaping
• Shaping involves reinforcing successive approximations to the desired (target) behavior.
– Hot, warmer, cold
Chaining
Chaining Methods
– 1st Step: Task Analysis—identify each link in the chain and their order
– Forward chaining begins with the first link in a task; when a link is mastered, the next link is added as a new requirement for reinforcement. And so on.
• If any link does not readily occur, use shaping.
– Backward chaining begins with the last link and works backward toward the first link.
Prompting
Prompts are any stimuli that help an organism to initiate the desired behavior.
Some prompts are simply verbal or visual cues or reminders (sticky notes, or saying “Wash your hands.”)
Modeling (showing how) is another type of prompting.
Physical (or manual) prompting means physically guiding the organism to engage in the correct behavior.
e.g., If a dog simply will not spontaneously sit, push its rear end down, then reinforce.