How Dogs Learn: Classical vs. Operant Conditioning
By LukEli K-9, Dog Training and Behavior Modification Professionals
The first book they had us read at Highland Canine Training’s
school for dog trainers was Excel-erated Learning by Pamela J. Reid. We had to
read a book every week or so and after tackling this book I thought my brain
might explode. It is, however, chock full of insight into how animals learn,
and I’ve borrowed heavily from it today to do my best to begin to explain how
B.F. Skinner caused dog trainers-in-training everywhere to curse his name as
they struggle to grasp the concept: operant conditioning. Before I can tell you
about operant conditioning, I should explain another type of learning to you -
classical conditioning.
CLASSICAL CONDITIONING
Classical conditioning, Pavlovian conditioning, associative
learning - these are all terms for what happens when an animal learns
associations among things. Learning associations means learning that things go
together: when one thing happens (you burn your dinner), another thing will
follow shortly (the smoke detector will go off). You say “Let’s go for a walk”,
and your dog jumps up excitedly because he has learned that this particular
phrase always signals going outside and chasing squirrels. Your cat runs to the
kitchen every time it hears the can opener because it has learned that this
sound precedes feeding time. In each case there is a predictable relationship
among the events and the animal learns to respond to the first event in anticipation of the second. That’s
what classical conditioning is all about: anticipation.
You’ve probably heard of Pavlov’s bell. Ivan Pavlov was
interested in how reflexes work and set out to study the salivary reflex in
dogs. In his experiments, he would ring a bell before presenting a dog with
food. The dog eventually began to salivate at the sound of the bell, even
though no food was present, because it had learned that the bell’s ring always
preceded feeding time. Pavlov’s experiments embody the basics of associative
learning; an initially neutral or meaningless stimulus (the bell) is presented
prior to the delivery of an unconditioned stimulus (the food). It’s called an
unconditioned stimulus because presentation of the UCS (food) always elicits an
unconditioned response(UCR)(salivating). For instance: Food always elicits salivation.
The dog doesn’t have to learn to salivate, it just happens. Once the animal has
learned the association between the neutral stimulus (bell) and the
unconditioned stimulus (food), the neutral stimulus (bell) itself elicits a
response which is called the conditioned response (CR).
Keep in mind there is nothing that requires the animal to perform
the conditioned response. The dog doesn’t need to salivate when the bell
sounds. The food is delivered no matter what the animal does. The CR
(salivating) just happens; it’s involuntary. When you attempt to classically
condition your dog, you are trying to get them to have an automatic,
involuntary reaction to a stimulus.
OPERANT CONDITIONING
Operant conditioning, Skinnerian conditioning, instrumental
learning - these are all terms for what happens when an animal learns that it’s
behavior has consequences. It’s called operant conditioning because the
behavior operates on, or has an effect on, the animal’s world. This applies to
us humans as well. We do all sorts of things every day that lead to
consequences that wouldn’t happen if we hadn’t done something. If you don’t
work, you don’t get paid. If you don’t cook, you don’t eat. If I ask my dog to
sit and he doesn’t sit, he doesn’t get a treat. Animals learn the relationships
between the things they do and the consequences of those actions just as people
do.
A popular way to remember the formula for the process of operant
conditioning is to think of it as the ABC’s of learning:
Antecedent(a cue like sit) --> Behavior (the animal sits)
--> Consequence (you give them a treat)
Operant conditioning is very different than classical
conditioning in that the animal must perform a behavior in order for the
stimulus to be presented. Aside from the antecedent, the events that occur are
under the animals control.
In the operant sequence there are four possible scenarios: two of
which increase the likelihood that a
behavior will occur again and two that decrease
the likelihood of the behavior reoccurring. All learning also involves stress,
and each of these induce a differing amount of stress while learning.
POSITIVE REINFORCEMENT
Positive reinforcement (P+) involves the presentation of a good
consequence when the behavior is performed. For instance, you say “sit”, your
dog sits, and you give him a treat. This serves to increase the likelihood that your dog will perform the behavior
again. Human behavior is also regulated by positive reinforcement. Child does
homework, child is awarded a good grade. Child does chores, child gets an
allowance. Person works hard for several months, person goes on vacation. You
get the idea. Much of dog training is based on ideas that incorporate R+. Quick
learning takes place with low stress when using this quadrant for learning.
NEGATIVE REINFORCEMENT
Negative reinforcement (R-) involves the removal of a bad
consequence when the behavior is performed. As with R+, the reinforcer must be
contingent on the behavior; you cease “prodding” once the behavior has been
performed. For example, you apply pressure with a choke collar and say “sit”,
your dog sits, and you release the pressure from the choke collar, removing the
unwanted action. This also serves to increase the likelihood of the response in
the future. Negative reinforcement is a very common method of controlling
behavior in people as well. Child does homework to avoid mom’s nagging. Person
goes to work to earn money to avoid being homeless. You say uncle to get your
brother to stop twisting your arm. You drive the speed limit to avoid tickets
and fines. Some examples for dogs: Applying pressure on the rear end of the dog
while saying sit until they sit and then you stop pushing down. “Escape
training” such as pinching the dog’s ear until they open their mouth to receive
a dumbbell for forced fetch retrieving is an often used R- method. Behavior
Adjustment Training utilizes this quadrant heavily for rehabilitating
aggressive dogs by applying a negative (whatever stimulus causes the dog to be
reactive, i. e. another dog) and then taking the dog away when they’ve given
the correct response (like a look away or paw raise) to the aversive. Negative
reinforcement can be a powerful training tool when utilized correctly and
humanely. When used inhumanely, R- can seem like nothing less than torture. Negative
reinforcement is the fastest way to learn and is also the most stressful.
POSITIVE PUNISHMENT
Positive punishment (P+) involves the presentation of a bad
consequence when the response is performed. For instance, you say “sit”, your
dog lies down instead, you jerk him back on to his feet with the leash. This
serves to decrease the likelihood of the behavior in the future. Positive
Punishment happens a lot: child hits brother, father spanks child. Person
drives drunk, person ends up in jail. Puppy pees on floor, puppy gets hit with
a rolled up newspaper. Dog continues to bark after being cued to stop, dog gets
squirted in the face with water to startle it. I’m pretty sure we’re all fairly
familiar with P+, though we may not have known that’s what it was. This
quadrant is probably the most widely used. Animals learn pretty quickly when
using this quadrant, but it’s also pretty stressful.
NEGATIVE PUNISHMENT
Negative punishment (P-) involves the removal of a bad
consequence when the response is performed. For instance, you say “sit”, your
dog lies down, so you eat the treat instead of giving it to your dog. This
serves to decrease the likelihood of the response in the future. Child calls
friend a booger head, friend goes home and won’t play anymore. Person drives
drunk, person loses license. Dog plays too rough, you stop playing with them.
Puppy jumps up on you, you turn your back to puppy. Animals learn fairly
quickly this way, and the stress level is pretty low.
All four of these scenarios will cause your dog to learn. You
must choose which combination of techniques you feel is right for you. This
knowledge will aid you when searching for a dog trainer as well. Remember not
to hesitate to ask questions about a trainers methods so you can decide if that
program is right for you. There are many books on the subject a well should you
decide to read more about learning theory and behaviorism.
Contact a trainer at LukEli K-9 today to learn more about the
methods we utilize and how we can help you reach your training goals today!
No comments:
Post a Comment