Sunday, January 19, 2014

Schedules of Reinforcement


Schedules of Reinforcement

By: LukEli K-9, Dog Training and Behavior Modification Professionals


Finally! Yes, this is the blog we keep referencing so I better make it worth your while, right? Well I shall do my best! Now, put your learning theory caps back on and I promise after today all the pieces over the past few blogs should start to fall into place.

Before I go into the detail of different kinds and rates of reinforcement, let me briefly describe again how it is dogs learn through the use of operant conditioning. One of the easiest ways to remember this is A-B-C, Antecedent-Behavior-Consequence. For example, you ask the dog to sit (antecedent), the dog does or does not sit (behavior), you reinforce or do not reinforce, depending on the behavior (consequence). No matter which quadrant of operant conditioning you are using (meaning positive and negative reinforcement, positive and negative punishment) the progression will always follow the ABC’s.

So, what is a reinforcement schedule?

A reinforcement schedule is a rule or program that determines when a response will be given a reward. There are three basic schedules to choose from; Continuous Reinforcement Schedule (CRS), Fixed Ratio Schedule (FRS), and Variable Ratio Schedule (VR). The type of schedule used for your dog will greatly depend on which stage of learning you are in. Remember two blogs ago when we discussed the importance of following the four stages of learning closely and only proceeding to the next stage when the dog was correctly responding 80% of the time? Hopefully you do; if not make sure to catch up before reading this one.

In order to effectively make it through to the last and final stage of learning, the maintenance stage, the dog should have learned the relationship between the response and the consequence; this is largely influenced by the reinforcement schedule used.

Continuous Reinforcement Schedule (CRS)

During the acquisition stage of learning the tunnel,
Izzy was on a continuous reinforcement schedule
On a continuous reinforcement schedule, each correct response is followed by a reward. In the first two stages of learning, acquisition and fluency, each time the dog responds with the correct behavior a primary reinforcer reward is given. This is a continuous reinforcement schedule.  

Lets go with an example of teaching your dog to shake. You are still progressing through the first two stages of learning; therefore a continuous reinforcement schedule is still in use. Each time the dog performs the desired behavior of raising his paw to your hand to “shake” he is given a reward, most likely in the form of food. Each correct behavior earns him a reward.

Fixed Ratio Schedule (FR)

Fixed ratio schedules are considered a sub category of partial or intermittent reinforcement schedule including variable ratio, random ratio, fixed interval and variable interval. For the sake of your brains, and mine for that matter, we are only going to be discussing fixed and variable ratios of reinforcement. 

A fixed ratio schedule of reinforcement means there are a fixed number of correct responses needed before the reward or reinforcement is given. A FR(fixed ratio)-3 means that the dog must correctly perform the behavior three times before receiving reinforcement. Similarly an FR-6 requires six responses a FR-9 requires nine correct responses, so on and so forth. The fixed ratio number is dependent upon both which stage of learning you are at and how difficult the behavior is for the dog to perform. During the very end stage of fluency it is ok to start a low number fixed ratio schedule as long as the dog is performing 80% very consistently. When transitioning from a continuous reinforcement schedule to a fixed ratio schedule, you should start at number no higher than five, also known as a FR-5. 

Using the same example of teaching your dog to shake, on a FR-3, the dog must correctly perform the behavior of “shaking paws” with you three times before the reward/reinforcer is given. Keep in mind that whenever we change the rules, in this case the number of correct behavior responses needed to get a reward, the dog may regress. That’s ok! He will very quickly pick up the new rules.

Variable Reinforcement Schedule (VR)

A variable reinforcement schedule is when the amount of times a correct behavior is rewarded varies each time. It is recommended that a variable reinforcement schedule only be used once the dog has reached the very end of the generalization stage of learning and should be continued throughout the maintenance stage.   

This schedule of reinforcement is the same way slot machines or playing the lottery works. There is an unknown amount of times needed playing before a reward, in this case a jackpot, is given. When you begin using a variable reinforcement schedule it is just as important as when beginning the fixed ratio schedule, to start very low in the number of times a response receives a reward.

Why are the changes of reinforcement schedules so important?

For starters, those of you worried about having to continue to treat your dog for the same responses over and over again; have no fear! As I have mentioned in past blogs, the road to not having to carry a treat pouch with you everywhere you go, is in reaching the variable ratio schedule of reinforcement. Once you have made it to that schedule of reinforcement, which should also be at the same time you have reached the maintenance stage of learning, your dog will soon learn that “playing the lottery” is rewarding.

Changing the rates of reinforcement when training your dog will also help to greatly increase the learning capabilities of your four legged friend. It is necessary to keep in mind that learning, if done correctly, is a game for dogs. By changing the reinforcement schedule we have made the game more interesting and stimulating than it previously was.

As one a closing statement, let me remind you about the importance of consistency when teaching your dog new behaviors. As you progress through the stages of learning and begin to change reinforcement schedules, consistency in between all members of your household, each time you interact with your dog will greatly affect the end product of both you and your dogs hard work in training.

If you are unsure which schedule to use with your dog or how to properly integrate the use of a different reinforcement schedule, seek out the help of dog training professional. Contact a trainer at LukEli K-9 today to begin the first step of your journey towards reaching your training goals with your pet.

No comments:

Post a Comment