Define operant conditioning.
A learning process in which the strength of a behaviour is modified by the consequences following the behaviour. This approach regards observable stimulus-response behaviours. Behaviours followed by a pleasant consequence is likely to be repeated, while those followed by an unpleasant consequence are less likely to be repeated.
In operant conditioning, what is a reinforcer?
Any stimulus (action or event) that strengthens or increases the likelihood of a response (behaviour) occurring again.
In operant conditioning, what is a punisher?
Any stimulus (action or event) that weakens or decreases the likelihood of a response (behaviour) occurring again.
What are the two types of reinforcement?
Positive and negative reinforcement.
Describe positive reinforcement.
The procedure of presenting a positive reinforcer after a response, thereby increasing the likelihood of the behaviour occurring again.
Describe negative reinforcement.
The removal of an aversive stimulus as a consequence of a response, which will increase the likelihood of that response.
What are the two types of punishment?
Aversive punishment and response cost.
Describe aversive punishment.
Occurs when aversive or unpleasant stimuli is given or applied after an undesired behaviour takes place, with the aim of making the behaviour less likely to occur in the future.
Describe response cost.
Occurs when certain responses result in the loss of something pleasurable/when a pleasant stimuli is removed, with the aim of making the behaviour less likely to occur in the future.
Explain whether it is better to use reinforcement or punishment.
Since reinforcement focuses on increasing a desired behaviour and punishment focuses on reducing an unwanted behaviour but does not teach a replacement, it is typically recommended to use positive reinforcement when trying to make a behaviour change.
List the advantages of reinforcement.
List the disadvantages of punishment.
How can punishment be made effective?
When it is brief, immediate and linked to the undesired behaviour in the mind of the person/animal, and a positive behaviour can be taught to replace the negative behaviour which can then be reinforced.
In operant conditioning, what is contiguity?
Refers to the timing during the learning phase, or how ideas, memories and experiences are linked closely together in time. The consequence needs to be provided as close as possible to the behaviour for the association to occur.
In operant conditioning, what is contingency?
Relationship between a response and a reinforcer/the punisher (dependent on each other).
As part of operant conditioning and contingency, what do consequences need to be in order to be effective?
Define shaping.
A desired behavior is taught by reinforcing successive approximations, or steps, toward the final goal. This process involves reinforcing each small action that moves closer to the target behavior until the desired action is achieved.
When is shaping used?
It is often used to teach complex behaviors that would not occur naturally by providing reinforcement for behaviors that are closer and closer to the ultimate goal, such as training animals or teaching children new skills.
Describe the differences between positive reinforcement and negative reinforcement.
Similarities: both types of reinforcement aim to increase the likelihood that the behaviour will occur again.
Differences: Positive reinforcement involves providing or receiving a pleasant stimulus (reward). Negative reinforcement involves removing or avoiding an unpleasant stimulus.
Describe the differences between aversive punishment and response cost in operant conditioning
Similarities: both types of punishment aim to decrease the likelihood that the behaviour will occur again.
Differences: the difference is with regard to whether the stimulus is pleasant or unpleasant, and whether the stimulus is added or removed. Aversive punishment involves providing or receiving an unpleasant stimulus, whereas response cost involves removing a pleasant stimulus.
Explain the importance of contiguity and contingency in operant conditioning
Contiguity and contingency both relate to the timing between the behaviour and the consequence in operant conditioning. They are important, otherwise an association between the behaviour and consequence is less likely to occur – learning is less likely to occur.
State the schedules of reinforcement.
Continuous reinforcement and partial reinforcement.
Describe continuous reinforcement.
When reinforcement is given after each and every demonstration or performance of the desired behaviour; learning happens quickly when this is used.
What are the drawbacks of continuous reinforcement?
If reinforcement is stopped, the behaviour will quickly slow down and eventually stop.