Reactive Champion: keep going signals

Showing posts with label keep going signals. Show all posts

Monday, May 20, 2013

Shedd Animal Training Seminar: Other Advanced Concepts

Picture is unrelated. But awesome.

There are four more advanced concepts that Ken discussed. Each is interesting, but there’s not quite enough material to flesh out separate posts. So… here’s the quick run-down.

Recall

It surprised me that Ken put a recall (which he defined as a behavior that requests an animal to return to “station”) as an advanced concept. After all, every puppy or beginning class I’ve seen teaches “come!” I think the reason Ken classifies it this way is twofold. First, he considers it a safety behavior, either for the trainer or the animal. And second, he outlined a number of errors that novices often make when teaching a recall.

Really, the errors people make with the recall all boils down to one thing: reinforcement. Ken said that a recall should be reinforced often and well. He recommended practicing it daily with your animal. If you use a recall in an emergency situation, you should always reinforce the behavior, preferably with a high-value item. And finally, he noted that novice trainers often use the recall only when the animal has done something wrong, or when good things are going to end (like when you call your dog to come at the dog park and then leave).

Behavior Chains

A behavior chain happens when the completion of one behavior cues the start of the next. Each subsequent behavior reinforces the previous one. Ken discussed two types of behavior chains, technical chains, where the trainer gives ONE cue and the animal then performs a series of behaviors, or common chains, where the trainer gives a series of cues, but there’s only one reinforcer at the very end.

There are two ways to teach a behavior chain. With back-chaining (which is the preferred method) the last step of the chain is taught first, and then the trainer teaches backwards to the first step. This is more reinforcing for the animal because he is always moving towards something he knows well. Forward-chaining (teaching the first behavior, then the second, and so on) also works, but there are usually more errors.

It is best to teach each individual component of the chain successfully. Each should be maintained separately as well. Ken shared that the animals that are the most successful at learning chains are ones who have learned about reinforcement variety because they already know that a behavior can be reinforcing.

Keep Going Signals (KGS)

A KGS is a signal that tells the animal that he is on the right track. It’s encouraging feedback that literally says “keep doing this and I’ll reinforce you!” It’s technically a tertiary reinforcer- that is, it’s a signal that a secondary reinforcer is coming (and thus, a primary). It’s sort of the way you get a poker chip that can be traded in for money which can then be used to by an actual reinforcer, like food.

Most people don’t purposely train them; they tend to happen along the way naturally. That said, you can purposely teach them by introducing them as a secondary reinforcer and then approximating longer periods of time before giving the primary. Interestingly, Ken said he doesn’t use them; he just doesn’t need them.

End of Session Signals

An end of session signal tells the animal that the training session is done and there will be no further opportunity for reinforcement. While it might be helpful to let your pet dog to stop bothering you, Ken pointed out that he doesn’t particularly want a 700 pound animal realizing the (fun!) session is over and thus refusing to let the trainer leave. That can be dangerous. In fact, the staff at Shedd are pretty careful that they don’t create an accidental signal that might lead to chaos, like picking up a bucket of fish and walking away.

That doesn’t mean you should use them. Ken simply doesn’t think they are important enough to argue about. If what you’re doing is working, great.

Wednesday, March 10, 2010

Asking the Wrong Questions

I’ve spent a lot of time over the last week thinking about no reward markers and keep going signals. In fact, I’ve thought so much about it, that I just had to post about it again.

In the comments to my last post on this topic, the point came up that there is a huge difference between shaping and competition. This is absolutely true. Shaping is about teaching a new skill, while competition is about testing a skill which is, presumably, under stimulus control. This means that you can’t really compare how a dog interprets silence from the learning stage to the performing stage; they’re two completely different contexts.

More than that, though, I realized I was also asking the wrong question entirely. When it comes to shaping, the question should not be How does my dog interpret silence? Instead, the question should be Why is there silence at all?

Think about that for a moment.

Now think about your last shaping session with your dog. How much silence was there? And why was there that much silence? For my last session, there was about thirty seconds of silence. Why was there that much silence? Well, because Maisy didn’t meet my criteria, of course.

But is that true?

My job as a clicker trainer is two-fold: split the task down into many small steps, and give a high rate of reinforcement when my criteria is met. These two things are interrelated. If a task is properly broken down into small, achievable steps, your rate of reinforcement will naturally be quite high. Likewise, the inverse is true: if you lump the steps together by setting the criteria too high, it will take your dog longer to figure it out, and thus your rate of reinforcement will be lower.

So why was there that much silence? Because I failed to do my job as a trainer. I lumped when I should have split.

Clicker training is difficult to master. To be a truly efficient trainer, you need to not only be able to split the task up into small steps, but you also need to be able to analyze your dog’s response, assess whether that means your criteria is too high, too low, or just right, and then adjust that criteria… and you need to be able to do all of that in a matter of seconds!

Thankfully, clicker training is also easy to learn. Even if you never move beyond the basic "click the behavior you like and give your dog a treat" stage, your dog will still learn. That's what I love about clicker training: regardless of your skill level, it has something to offer to everyone.

Thursday, March 4, 2010

Clicker Theory: No Reward Markers and Keep Going Signals

I apologize in advance to any readers who are not familiar with clicker training, or who are just beginning to learn about the learning theory behind it, as today’s post concerns more sophisticated clicker concepts.

About a week ago, someone on a mailing list I belong to posed a very interesting question: If the click means “yes,” then what does no click mean?

The poster, a teacher, mentioned that when she has her students play the“clicker game,” in class, they initially learn faster if they receive feedback for both yes, that’s what I want you to do, and no, you’re going in the wrong direction. In other words, a no reward marker. She went on to say that once her students understood the game, they learned that the absence of a click or verbal marker was basically the same thing as being told no. Once they figured that out, they could figure out the task just as quickly with only the positive marker.

She wondered: do our dogs understand the absence of a click the same way? Do they interpret silence as “no”? If so, why do they keep working in trial settings, where they receive neither clicks nor encouraging verbal feedback? Wouldn’t the silence inherit in a trial tell our dogs that they are doing it wrong? If so, this would have dire consequences on our performances.

The general response was that silence should not- cannot- imply that the dog made an error. Instead, we must teach our dogs that silence is a keep going signal- that they are on the right track, and that if they keep up with what they are doing, they will earn reinforcement. That is the only way that our trial performances will hold up.

So, if silence means “keep going,” then how do we tell our dogs they’re going off track? As Clicker Trainers, we don’t use corrections (defined here as anything that causes the dog pain or stress) to tell the dog they’re wrong. The logical response would be the use of a no reward marker- an emotionally neutral way of saying no, try something different… but some people on the list argued that this would actually slow learning down.

I disagreed. I shared with the group that when Maisy begins to get off track during a shaping session, I tell her “Nope! Try Again!” in a cheery voice. I wrote that I felt my dog learns faster this way, but that even if she doesn’t, it helps me feel better to be giving the feedback.

Still, in light of the conversation, I decided that I would test my theory, so I sat down with Maisy to work on a shaping project. First, I just worked with her like normal, not really thinking about what I say or when I say it. Although I did say “Nope! Try Again!” perhaps three or four times in the course of five minutes, I found that I said it more as conversation and less as information. Interestingly, I discovered that I was saying it at times when we were in the midst of a long period of silence. That “nope!” served to fill the silence until she finally got the click for doing what I wanted.

Next, I worked with her, but remained silent. I didn’t speak; I simply clicked or didn’t. Maisy continued on, doing well until we hit one of those long periods of silence. She kept trying things, but after about thirty seconds of neither a click nor a “nope!”, she laid down and looked at me as if she wasn’t sure what she was supposed to do.

Finally, I tried using the no reward marker more regularly. We continued shaping, but I tried to think in terms of right and wrong. I clicked when she got it right, and said “Nope!” when she got it wrong. This led to rapid-fire clicks and “nopes,” and after she got three “nopes” in the space of about ten seconds, Maisy again laid down with her chin on the floor. This time, though, I had to encourage her quite a bit to re-engage with the shaping game. But when she again got several more “nopes,” she laid down and refused to play any more.

I began to feel frustrated; this is not how it’s supposed to work! She’s supposed to want to play! My frustration came out in my voice, and I began to tell her to get up with an edgy tone. When she didn’t, my feelings of frustration gave way to anger. Since I didn’t want to take that out on her, I ended the session to evaluate what had just happened.

The first thing that I decided was that I was wrong: Maisy does not learn faster with a no reward marker. In fact, she gave up so quickly, and was so difficult to persuade to re-engage with the task, that I believe she found it punishing. True, she also gave up when the silence went on too long in the second scenario, but she worked approximately three times longer, and was much more willing to re-engage when I asked. As a result, I think she found the lack of any feedback confusing, but not aversive.

Still, I concluded that the complete lack of any kind of feedback was also not the best way to help Maisy learn. Instead, her learning is most efficient when she gets lots of reinforcement over a short period of time. This means my job is to break the shaping task at hand down into as many pieces as possible so it is easier for her to progress through each step of the task. However, sometimes it is difficult to figure out how to break a task down any further. As a result, if I cannot figure out how to make the task easier, and if it’s been fifteen to twenty seconds without a click, I need to give Maisy a “gimme” click- reverting to the previous level of criteria for a few moments before trying the higher criteria again.

I also suspect that my initial use of “nope!” wasn’t actually serving as a no reward marker. Given the way Maisy responded, I think it actually served the purpose of a keep going signal for her. This means that for tasks that haven’t had a sufficient amount of duration built in yet, she depends on verbal encouragement to know that she’s doing what I want. (Interestingly, though completely off topic, I haven’t been very good at building duration past 30 seconds or so, which was Maisy’s threshold for silence during these tests. It makes me wonder if my inability to build more duration in her behaviors is due to her threshold, or if she’s developed that threshold because I have neglected to put in the work necessary to build more duration. On second though, I’m pretty sure I know the answer to that.)

Finally, and perhaps most importantly, I learned that I don’t like it when I have to tell Maisy she’s wrong. I became frustrated and then angry as she continued to fail, even though that “failure” was behaviorally no different than when we did silence only, or when I used the keep going signal. Maisy was going about the shaping task in the exact same way in each scenario. She wasn’t any more wrong when I told her she was than when I didn’t. In other words: focusing on the wrong behavior rather than the right one changed the way I viewed and felt about the training session, and it took all of the fun and joy out of playing the shaping game with Maisy.

In the end, doing this experiment not only taught me that my initial supposition was wrong, but it also reaffirmed my commitment to positive training. Focusing on what I want her to do helps Maisy learn faster, but it also makes us both feel better.