Reactive Champion

Monday, October 8, 2012

Shedd Animal Training Seminar: Outside the Session

How much do you train your dog? How often? I’ve asked many people this question. Some say that they train for an hour every evening. Others say they train for one minute out of every hour that they are awake. Personally, I train very little. When I’m working on a goal- preparing for a trial, maybe- I will train for 5 to 10 minutes, 3 to 4 days a week. In the last two months, I’ve done exactly one formal training session, and it lasted about 60 seconds.

And yet, I’m training all the time. Even on our thrice-daily walks I have a pocketful of treats so I can reinforce behaviors I like. It’s just that I usually don’t have a specific goal in mind. This is what Ken described as the difference between formal training and non-formal interactions. Even the most dedicated of trainers will spend more time interacting with their animals in a non-formal situation. After all, even at an hour a day of training, that still leaves 23 hours of unstructured time.

The truth is, every interaction we have an animal has some type of value- hopefully reinforcing- and animals learn outside of formal training sessions just as readily as inside them. I think most of us understand this at some level, but what I loved about the Shedd experience was learning how the staff use this to their advantage. They actually have what Ken calls passive training sessions, where the staff simply spend time with the animals in their care.

We got to see this demonstrated with their Magellanic Penguins. The trainers didn’t show off any behaviors, and they didn’t demonstrate the science behind training. Instead, they brought the penguins into our classroom solely to observe how the penguins acted when around strangers. Did they play with their toys? Look for food? Curiously inspect their surroundings? Not only were we excited to get up close and personal with these amazing animals, but the trainers were able to gain some valuable information.

More importantly, though, non-formal interactions are often when the most bonding occurs. Although training sessions can and do contribute to relationships, the time we spend interacting with our animals with no pressure and no expectations has the biggest impact. At the most basic level, relationships are simply about being together with no strings attached. For me, these informal moments are the most important part of my day. I cherish walking through the woods with my dog at my side, the way she flops down exhausted after a good play session, and the quiet moments late at night where she lies pressed up against my side.

What about you? What non-formal things do you do with your dog that makes you happy?

Friday, October 5, 2012

Shedd Animal Training Seminar: What to Do When Your Animal is Wrong

Ken is unabashedly in favor of positive reinforcement when training. I am too, and in fact, I think most people see the value in using positive reinforcement for training. But people often struggle with what they should do when the animal they are training is wrong, or does something they don’t like. Some people will argue that the only way to deal with incorrect responses is through the use of punishment- after all, it decreases behavior, right?

Ken Ramirez

Well... yeah. Punishment, aversives, negative reinforcement… it all works. Ken acknowledges this freely. However, he adds that using these approaches can be fraught with pitfalls and often have unintended consequences, especially when done by novice trainers.

To a certain degree, Ken agrees with the advice “reward what you like and ignore what you don’t.” He also acknowledges that there are behaviors that simply cannot be ignored- like aggression. Thankfully, there are many ways to deal with problem behavior. That said, if you’ve come across a behavior that can’t be ignored and you can’t figure out several possible solutions, the situation probably requires skills that are more advanced than you have. And that’s fine! Animal training is something that even the experts continue to learn about, and consulting with others is standard practice.

Ken’s go-to method for dealing with incorrect responses is something called the Least Reinforcing Stimulus or Scenario (LRS). An LRS is designed to deal with unwanted behavior without causing frustration for the animal. It is deceptively simple: to implement an LRS, you provide a completely neutral response for a short period of time, followed by an immediate opportunity for the animal to earn reinforcement. Of course, an LRS is a bit more complicated than that, so let’s take a look at it in more depth.

First, like everything in training, timing is important. The LRS must be implemented immediately so that the animal understands which behavior was incorrect. A poorly timed LRS will likely frustrate the animal, and what you’re looking for is a calm acceptance from the animal.

Next, the LRS should be brief. In general, when an LRS is used, it will probably last from 3 to 5 seconds. Of course, this length may vary. It needs to be long enough that the animal notices an interruption in reinforcement, but not so long that you upset the animal. For trainers and animals that have a relatively fast rhythm going in training, the LRS can be quite short. For slower trainers and animals, the LRS will need to be longer. We saw this playing out in the daily sessions we watched; the sea lions were quite quick, and their LRS might only be a second or two. On the other hand, the lizard we saw was much slower, and his LRS needed to be longer as a result. That said, do not be tempted to extend the length of the LRS beyond what is strictly needed. The LRS should be the same length despite how “bad” the mistake was. It is not a time-out procedure.

One of the most important features of the LRS is that it should be a completely neutral response. This means that you should refrain from having an emotional response (no scowling or grumbling, for example). However, you do not need to freeze. An LRS works because it is an interruption in the flow of reinforcement in which you simply don’t respond to the animal. If you’re looking at the animal, continue looking at it. If you’re looking away, continue to look away. The only exception to this is if what you’re doing is reinforcing.

Finally, you need to provide the animal with an immediate opportunity to earn reinforcement after the LRS. Because you don’t want to cause frustration, the best way to do this is by offering the animal a different, easy behavior. Ken tends to use targeting because that’s such a strong behavior for the animals and trainers at Shedd alike, but any behavior that the animal knows well will work. Ken will then do a few other behaviors before asking for the behavior that the animal failed at earlier.

Astute readers will have realized that the LRS is really meant for an animal with prior training (because there needs to be at least one fallback behavior), and with whom you have a relationship (so that you know how long the LRS should be). It will also only work when the animal is participating in the process. You can’t use an LRS with an animal that’s disengaged from the training process because it won’t notice that you’ve interrupted the reinforcement process.

The LRS, while simple on the surface, really does work. It’s been proven both practically and scientifically. It is, technically, the first step to extinction, so it can take a bit of time. Many positive reinforcement trainers do the LRS quite naturally; I know I’ve done them, but not because I knew that’s what I was doing. Having some knowledge about them definitely helps me understand how to implement them better.

What about you? Do you use an LRS, or something similar? This is an admittedly new concept to me, so I’d love to hear about others’ experience with it.

Wednesday, October 3, 2012

Wordless Wednesday: Happy 6th Birthday, Maisy!

Photo from early February 2007.

Monday, October 1, 2012

Shedd Animal Training Seminar: Getting Behavior

So far, we’ve learned that Ken prefers to use positive reinforcement when training behaviors, and that he finds the use of a bridging stimulus to be very helpful. But we haven’t talked about how he gets the behavior he wants to bridge and reinforce, so let’s do that today! The first thing you need to know is that while some trainers will use only one or two methods to the exclusion of all others, Ken uses what works. The method that he chooses will depend on the animal and the behavior.

As an interesting side note, while the concepts were familiar to me, Ken uses some different terminology than I’m used to. Indeed, the words that zoological trainers use differ from things we say in the dog world. As a lover of words, I actually found this fascinating. For example, he uses the word “shaping” differently than we do in the dog world; for Ken, “shaping” simply means training. Each of the techniques he discussed can be broken down into small increments, and he will reward each successive approximation regardless of what technique he’s using… which is shaping. Just not the way we dog people typically think of it.

Targeting

Targeting is probably the most common technique used in the zoological setting, mostly because it is incredibly useful for teaching husbandry tasks. (Ironically, it’s probably the one I use the least with my dog, although I have found it incredibly helpful when I’m trying to learn a physical skill myself.) Targeting involves teaching the animal to touch an object with a body part, either briefly or for an extended duration. For example, a dolphin may be taught to touch its nose to a colored shape, or a sea otter may be taught to grab a plastic buoy with its paws.

Scanning
Sometimes called capturing in non-zoological settings, scanning is where the trainer watches for the animal to perform a desired behavior. When the animal does- for example when a dolphin jumps in the air or a beluga whale spits water- the trainer reinforces the animal at the exact moment the behavior happens. As you can imagine, the use of a bridging stimulus is incredibly important here. Scanning is also sometimes called free-shaping, as the trainer will capture a first step toward the behavior and then gradually increase the criteria.

Baiting
Baiting is also known as luring, and it can be a controversial method. Baiting is done by using food to elicit the desired behavior, and if not done well, can create animals and trainers who are dependent on the use of food to complete the behavior. We actually saw some baiting in action when Ken was working to teach Tanner the sea lion to go in the water on cue. He would give the cue (swim!) and then toss a fish in the water. While it is very useful- especially for novice animals and trainers- Ken said it’s typically not his first choice for a lot of behaviors. Still, he rarely says “never” to a training technique.

Environmental Manipulation
I love environmental manipulation; it can make things so easy sometimes. This is simply the process of arranging things in the environment so that the desired behavior is the only or the most likely option for the animal. It obviously won’t work for every behavior you want, but when it does, it’s genius. What could be better than setting the animal up for success and giving the trainer the opportunity to reinforce the behavior?

Modeling
This technique, sometimes called molding, is more common in the dog world than zoological settings. Modeling is where you physically manipulate the animal’s body into the position you want (like when a trainer pushes a dog into a sit), and as you can imagine, tends to be either physically impossible or incredibly dangerous with zoo animals. Still, Ken refused to discount it entirely, citing Koko the gorilla as an example. Koko’s trainers taught her sign language by manipulating her fingers and hands into each word, something they could have never targeted, captured, or lured into happening.

Mimicry
This is not a common method of training as it requires a fairly sophisticated animal who understands that it should copy what another animal is doing. Although Ken has done some work around mimicry, it is not widely used. A similar concept is social facilitation, in which an animal learns to do something from another animal. This is typically accidental, such as when a dog learns to bark at passers-by from another dog.

Abstract Learning
Abstract learning is a complex method of learning. This is where two ideas are combined to create a new concept. It happens often with humans, such as when we attend lectures, but is not common in the animal world, although it does happen with cognitive researchers. Alex the Parrot is an excellent example.

This list is certainly not all inclusive. There are other ways of training animals, but these were presented as some of the most common methods. I initially learned through the use of baiting/luring, and appreciate it as a technique. I also do a fair amount of scanning (both capturing and free-shaping). What about you guys? Which technique do you use the most? Are there techniques you’ve used that aren’t listed here? I’d love to hear about your experiences.

Friday, September 28, 2012

Shedd Animal Training Seminar: A Bridge to Better Behavior

Ken Ramirez was clear: using reinforcement is the best way to train an animal, and reinforcement is most effective when it is inherently enjoyable and provided immediately. But providing reinforcement quickly can be challenging, especially when the animal is performing at a distance. How can you get a fish to a dolphin when it’s in the middle of arcing through the air or a piece of meat to a dog who is performing an agility obstacle 20 feet from you?

This Shedd trainer uses a whistle bridge for the Aracari.

Well… you can’t. But you can bridge the gap between the time the animal performs the behavior and the time it gets the reward. This is done through the use of a bridging stimulus, or bridge for short. This terminology, while occasionally used in the dog world (we usually say “marker” or even “click”), is widespread in zoological training programs, and you have to admit, it’s a descriptive word. A bridge is a signal that tells the animal, “I like what you just did, and I’ll give you a reinforcer as soon as I can.”

Using a bridge is not essential to animal training- learning will take place whether or not you use one- but it does have several advantages. It is obviously useful when you just can’t deliver that fish fast enough. It can assist with precision by helping the animal identify exactly what part of the behavior it just performed is being reinforced. Was it the height of the jump? The size of the splash? The way it turned its head while mid-flight? The bridge provides clarity. And bridges work neurologically because they’ve been classically conditioned to signify that a reinforcer is coming.

If you choose to use a bridge (and Ken would argue that you should, at least some of the time), there are three things you need to do. You need to choose an effective bridge, you need to teach it to your animal, and you need to be proficient in using it.

When you select something to use as a bridge, there are a few things to keep in mind. First, and possibly most importantly, you need to use a bridge that your animal can perceive. Obviously, an audible bridge is useless if the animal cannot hear it, but you also need to make sure that it is unique to the environment so that it can be distinguished from other sounds. For example, a bridge that sounds like a telephone, doorbell, or microwave beep may not be a good choice for our pets.

The bridge should be practical: if it’s too hard to use, you will either struggle with it (and impact your timing, observation, or other critical training skills), or you’ll stop using it entirely. Most zoological trainers use a whistle that they can hold in their mouths because it leaves their hands free to do other things. Similarly, the bridge should be easy to replicate so that every time it’s used it sounds the same (this is especially important if there are multiple trainers working with the same animal).

And finally, the animal should have no prior negative association to the bridge. Although you can desensitize an animal to a sound they dislike, your training will be better off in the long run if it you don’t need to go through this process, especially considering animals can sometimes have a spontaneous recovery of the negative association.

Teaching the bridge to an animal is a fairly straightforward process. Pairing the stimulus (a whistle, a click, a flash of light, etc.) with a reinforcer repeatedly will result in a Pavlovian type response: the animal perceives the bridge and automatically expects that the reinforcer will come next. We dog trainers do this when we “load the clicker” by doing the click-treat repetition over and over again. Teaching the bridge is usually a pretty quick process. Dogs tend to figure out the click-treat association within five minutes or so. The Shedd staff tend to be a bit more methodical about this introduction, but even so, the animals in their care readily pick up on the bridge.

If the animal you are working with doesn’t figure out that the bridging stimulus predicts a reinforcer is coming, you should look at why. Is the timing off? If too much time elapses between the bridge and the reinforcer, the animal may not be able to make a clear connection. Likewise, if the reinforcer comes at the same time as the bridge or even before, the connection will be tricky or even impossible for the animal to understand. Or maybe the item you are using isn’t truly a reinforcer. Perhaps your dog doesn’t like beef because it makes him feel sick. Or it’s possible that the bridging stimulus you’re using has a negative association you aren’t aware of.

Finally, you need to be proficient at using the bridge. Can you physically operate it, and do so without excessive fumbling? Personally, I find i-clicks easier to use than box clickers (and I have friends who find the reverse to be true). You also need to make an effort to practice your timing skills. Ken showed us a variety of training games: you can train a human friend to do a simple task. You can bounce a ball and click every time it hits the ground (or bounces off a wall). Or you can enlist a friend to play “hand games”- the friend holds up one or more fingers at a time, and you click when they hold up only one, or only when it’s their index finger. Improving your timing will improve your training.

Once you’ve chosen an effective bridge, taught it to the animal, and are satisfied that you can use it well, you’re all set to bring out better behavior in your animal. In my next post, I’ll tell you about the different ways the Shedd staff do this. But for now, I’d love to hear from you. Do you use a bridge (or maybe more than one)? Why or why not? If you do, what bridge(s) do you use? Please comment with your experiences!

Wednesday, September 26, 2012

(Mostly) Wordless Wednesday: Whee!

I hope you aren't tired of pictures from A Dog Spot, the boarding kennel Maisy stayed at while I was at the Shedd Animal Training seminar. It's just that this picture (and some more to come) make me so darn happy. Seeing Maisy playing like a normal dog is just amazing to me, especially considering how dog reactive she used to be.

She still can be, if I'm honest. I wouldn't feel comfortable with Maisy playing at a day care or another boarding kennel, but I'm confident in A Dog Spot's staff's ability to read dog body language and trust that they will intervene without the use of punishment or fear if one of the dogs is behaving inappropriately or is scared.

And the result is this: absolute joy. I'm so glad that she could enjoy her vacation as much as I enjoyed mine.

Monday, September 24, 2012

Shedd Animal Training Seminar: Basic Operant Conditioning

In the last hundred years or so, science has learned a lot about animal training. In fact, we have learned so much that Ken stated definitively that training is a technology. That is, the laws of learning are always true, no matter what species we are working with. In that sense, training can be compared to the laws of gravity: no matter what you drop, it will fall downward. Of course training, like gravity, can be influenced by outside factors. If you drop a pen during a tornado, it may fly sideways or appear to hover in the air, but that’s not because gravity has ceased to work. Likewise, the laws behind training are still at work, even when the results are unexpected.

The laws of learning have been broadly grouped into two main categories: operant conditioning and classical conditioning. Ken focused on operant conditioning not only because it is more easily observed and understood by beginners, but also because operant conditioning depends on the animal to think and make choices. Classical conditioning works on a much more instinctive level, and does not result in animals who are actively participating in the training process.

Now, if you’ve ever been to a basic training seminar- or even read a book on the topic- you’ve probably been exposed to the four quadrants. Despite this almost universal approach to explaining the basics of training, Ken didn’t even mention them. This was by design; not only are the quadrants a bit difficult to wrap your brain around at first, but it is also somewhat unnecessary.

What you really need to know about the laws of learning can be summed up in Thorndike’s Law of Effect: behaviors which result in a satisfactory outcome will be repeated, while those that result in discomfort will not. Or, to put it simply, behavior is a function of past consequences.

Consequences come in opposing pairs:
Reinforcing or Punishing
Positive or Negative
Unconditioned (inherent) or Conditioned (learned)
Proximate (immediate) or Distal (in the future)

Beluga whale receiving reinforcement. Photo by Kate Mornement.

Ken believes that the best consequences are the first of every pair; it is far better to have positive reinforcement which is immediate and inherently satisfying. And of all of those consequences, the most important is the use of reinforcement, no matter what form it takes. If you reinforce the behaviors you like, Thorndike’s law tells us that you will see more of those behaviors, which is the ultimate goal of training.

There are three main things to keep in mind when using reinforcement. First, you need to be sure that what you are not mixing up the idea of a reward with reinforcement. Rewards are things we provide that we believe will be an incentive to perform a behavior, but that may or may not actually be something the animal finds desirable. For example, most people consider chocolate to be a great reward, but it gives me headaches, so I would not change my behavior to get some. Second, while inherently reinforcing consequences like food are best, we can certainly teach animals to enjoy and even work for things like petting or praise. Finally, and perhaps most importantly, the timing of reinforcement is the key to successful training; reinforcers should be given as soon after the behavior as possible. Reinforcing in a timely manner with a low-value item will yield better results than poorly timed reinforcers, even if they are very highly desired.

In future posts, we will discuss some of the questions that arise when considering these pairs of consequences: How do you ensure that your timing is good? How do you teach an animal to enjoy something so much that it can be reinforcing? How do you elicit the behaviors you want… and what do you do when the animal doesn’t do what you want? And if you have specific questions about the basics of operant conditioning, please ask in the comments!