Term
| Factors that influence instrumental and operant conditioning |
|
Definition
1. Delay of reinforcement or reward 2. Magnitude of reinforcement 3. Past experiences and expectations |
|
|
Term
|
Definition
| Immediate reinforcement leads to faster learning than delayed reinforcement |
|
|
Term
| Magnitude or quality of reinforcer |
|
Definition
| Better rewards lead to faster, more frequent responses |
|
|
Term
| Past experiences and expectations |
|
Definition
| How responsive you will be to reinforcer at a moment depends on past experiences and what you expect |
|
|
Term
| Positive behavioral contrast effect |
|
Definition
| Low quality reinforcer to a high one. Suddenly you get more reinforcement for same behavior and behavior will shift. You will work more for it than someone who started out with hig level of reinforcement |
|
|
Term
| Negative behavioral contrast effect |
|
Definition
| High reinforcement to low one. Less behavior than someone who started out with that level of reinforcement |
|
|
Term
|
Definition
| Inferred based on behaviors that result when subject is trying to get reinforcer. Increase in behavior itself |
|
|
Term
|
Definition
| Other behaviors direct at objects in environment |
|
|
Term
|
Definition
| Can be inferred as an animal behavior. Looks similar to human behaviors during extinction |
|
|
Term
| Factors that affect extinction |
|
Definition
1. Reinforcement schedule used in training 2. Size of reinforcement 3. Delay of reinforcement |
|
|
Term
| Reinforcement schedule and extinction |
|
Definition
| Behavior trained and maintained on a continuous Rf schedule leads to extinction much faster than on any other schedule. Leads to more frustration and aggression |
|
|
Term
| Factors that influence instrumental and operant conditioning |
|
Definition
1. Delay of reinforcement or reward 2. Magnitude of reinforcement 3. Past experiences and expectations |
|
|
Term
| Partial reinforcement extinction effect |
|
Definition
| On a partial schedule that is unpredictable subject will learn to persist and will apply this persistence in the face of extinction. Behavior likely to continue for longer time |
|
|
Term
| Size of the reward - effect on extinction |
|
Definition
| Larger rewards extinguish faster and smaller rewards extinguish slower |
|
|
Term
| Delay of reinforcement - extinction |
|
Definition
| Immediate reward extinguishes faster and longer delays extinguish slower. If you are trained to have a delay you are trained to expect that you will eventually be reinforced |
|
|
Term
| Physiological homeostasis |
|
Definition
| Deprecation/reinforcement works specifically; only food will work as a reinforcer when in a state of hunger, only fluid will work in a state of thirst |
|
|
Term
|
Definition
| All deprivations are part of a general drive. We can reduce drive state and reinforce behaviors without targeting a specific need. Water may be reinforcing when thirsty and vice versa. Have multiple motivations |
|
|
Term
|
Definition
|
|
Term
|
Definition
| Motivation created by reinforcer itself. Learned motivation - need experience with reinforced reducing primary motivation for that reinforcer to create incentive motivation |
|
|
Term
|
Definition
| Concentrated on things we find reinforcing without physiological need. Proposed that reinforcement is based on sensory stimulation. |
|
|
Term
| Brain stimulation reinforcement |
|
Definition
| Reinforcement works by stimulating certain areas of the brain. Any stimulus that does so is a possible reinforcer |
|
|
Term
| Premack's theory of reinforcement |
|
Definition
| Experience of engaging in behavior is reinforcing. Focuses on opportunity to engage in behavior |
|
|
Term
|
Definition
| Change in responding related to changes in stimulus |
|
|
Term
| Stimulus generalization gradient |
|
Definition
| Slope of curve indicates how closely the behavior is controlled by the stimulus feature in question. |
|
|
Term
|
Definition
| The responding that is observed with one stimulus is also observed when a different stimulus is presented |
|
|
Term
|
Definition
| Changes in stimuli result in different levels of responding |
|
|
Term
|
Definition
| Subjects cannot detect a difference between stimuli if it is beyond their ability |
|
|
Term
|
Definition
| Stimulus has to be accessible and relevant to sense organ |
|
|
Term
|
Definition
| Relative Intensity of stimulus compared to background |
|
|
Term
|
Definition
| If subject is highly motivated to seek out reinforcement he is more likely to respond to stimuli that are changing |
|
|
Term
| Generalization and discrimination - past experiences |
|
Definition
| Depends on past experiences with two choice discrimination task. If you've never had to experience in past, more generalization occurs. More experienced, more focused on task you will be |
|
|
Term
| Instrumental conditioning |
|
Definition
| process in which the environment constrains opportunity for reward and a specific response will obtain that reward |
|
|
Term
|
Definition
| goal directed behavior. Once behavior occurs situation must be reset so behavior can be repeated |
|
|
Term
|
Definition
| anything with clear starting and stopping point; apparatuses typically involve mazes |
|
|
Term
|
Definition
| proposed that what is happening when developing S-R association is that behavior occurs for purpose of reaching goal |
|
|
Term
|
Definition
| chambers made of produce crates where Thorndike had a foot pedal inside a chamber. when cat stepped on pedal, pull out door and let cat out into maze. Want to determine if escape rewarded by bowl of food would lead to faster escapes. |
|
|
Term
|
Definition
| if a response in presence of stimulus (puzzle box) is followed by a satisfying event (bowl of food), association between stimulus and response is strengthened |
|
|
Term
|
Definition
| place rat in start box, end is goal box with food and water. Discrete because clear beginning and end. |
|
|
Term
|
Definition
| Often used to study choice behavior; each arm has a goal box. Only reinforce side trying to train. |
|
|
Term
|
Definition
| Reinforcers that have innate reinforcement properties. Reinforce from first exposure; no experience necessary. Ex. food, water, shade on a hot day |
|
|
Term
|
Definition
| reinforcers that have to develop reinforcer properties through association with primary reinforcers. Ex. stickers, money |
|
|
Term
|
Definition
| positive contingency between the instrumental response and the appetitive stimulus |
|
|
Term
|
Definition
| positive contingency between the instrumental response and the aversive stimulus |
|
|
Term
|
Definition
| negative contingency between the instrumental response and the aversive stimulus |
|
|
Term
|
Definition
| negative contingency between instrumental response and the appetitive stimulus |
|
|
Term
|
Definition
| process in which the environment does not constrain opportunity for reinforcement and a specific response will obtain that behavior |
|
|
Term
|
Definition
| any behavior that has the same effect on the environment "operates on the environment". Specific behavior in terms of what it does |
|
|
Term
|
Definition
| behavior freely engaged in repeatedly |
|
|
Term
|
Definition
| focused objectively on behavior;thought reinforcement driving S-R association; removes element of emotion and subjectivity |
|
|
Term
|
Definition
| reinforcing behaviors more and more similar to target behavior until subject performs target behavior |
|
|
Term
|
Definition
|
|
Term
|
Definition
|
|
Term
|
Definition
| relationship between response and environmental consequence |
|
|
Term
| Continuous Reinforcement Schedule |
|
Definition
| reinforcement provided every time behavior occurs |
|
|
Term
| schedule of reinforcement |
|
Definition
| program or rule that determines when and how occurrence of behavior will be followed by reinforcer |
|
|
Term
|
Definition
| Any schedule that is not continuous. As soon as behavior occurs once without reinforcement following it is a partial schedule. |
|
|
Term
|
Definition
| certain number of responses are required for reinforcement |
|
|
Term
|
Definition
| Same number of responses required each time to earn reinforcer. |
|
|
Term
|
Definition
| Unpredictable number of response required to earn reinforcer |
|
|
Term
|
Definition
| Amount of responses it takes to reach criteria |
|
|
Term
|
Definition
| subject quits responding during ratio run; happens when requirement changes dramatically |
|
|
Term
|
Definition
| subject pauses after receiving reinforcement; increases as response number increases |
|
|
Term
|
Definition
| time interval indicates when reinforcement will occur; still have to do behavior to get reinforcement |
|
|
Term
|
Definition
| amount of time between reinforcements is the same |
|
|
Term
|
Definition
| early in interval reinforcement won't be available. After reinforcement there is very little behavior. Escalates as it gets closer to the end of the interval. |
|
|
Term
|
Definition
| time interval changes; average amount of time that has to pass before reinforcement is available |
|
|
Term
|
Definition
| particular amount of time after interval ends that reinforcement will be available. Goal is to train subject to perform behavior soon after interval ends. Applied to FI or VI schedules. |
|
|
Term
| Differential reinforcement of high rates |
|
Definition
| responding during interval leads to resetting of that interval; benefit of subject to only respond when reinforcement available |
|
|
Term
| Differential Reinforcement of low rates |
|
Definition
| After interval ends, reinforcement given for 1st behavior. If you respond during the interval, not reinforced, interval reset. Eventually schedule is learned. |
|
|
Term
| Concurrent Schedule of Reinforcement |
|
Definition
| Two or more schedules operating at same time and subject has to make a choice between them. |
|
|
Term
| Relative Rate of Responding |
|
Definition
| Calculate for schedule to determine level of preference. Rate of response A/Response A + Response B |
|
|
Term
| Relative Rates of Reinforcement |
|
Definition
| Calculate how many reinforcers occur. Rf A/ RfA + RfB |
|
|
Term
|
Definition
| Relative rates of responding is equal to relative reinforcement rate. The reason we have more behavior with one schedule is because it produces more reinforcement. |
|
|
Term
|
Definition
| from experience, operant response is motivated by aversive situation present at time; response causes stimulus to be terminated |
|
|
Term
| factors that affect escape learning |
|
Definition
1. intensity of adversity 2. amount of negative reward (Rf) 3. delay of reward |
|
|
Term
| Intensity of adversity in escape learning |
|
Definition
| the more intense the aversive stimulus, the faster learning will be and more likely that behavior will be performed. |
|
|
Term
| amount of negative reward (Rf)in escape learning |
|
Definition
| how much reward matters for escape learning to occur? In a straight alley maze, those with the least amount of shock at the end ran the fastest; those with the most shock ran the slowest or refused to move. |
|
|
Term
| delay of reward in escape learning |
|
Definition
| time between behavior and relief or escape; shorter the delay, the faster the rats ran |
|
|
Term
| extinction of escape response |
|
Definition
| don't take stimulus away when response occurs |
|
|
Term
|
Definition
| cost analysis of helping project. More serious the event was perceived to be, the fewer people helped. |
|
|
Term
|
Definition
| response will prevent or postpone occurrence of an aversive stimulus |
|
|
Term
|
Definition
| do a behavior that is measurable to prevent aversive stimulus |
|
|
Term
|
Definition
| behaviors where you not doing anything as a way of avoiding; ex. holding still |
|
|
Term
| classical conditioning in avoidance learning |
|
Definition
| recognizing context becomes classically associated with stimulus (S-S association); more you escape, stronger association becomes |
|
|
Term
|
Definition
| basic mechanism of avoidance. No discrete signal gives you a heads up |
|
|
Term
|
Definition
| most of classical conditioning becomes attached to the signal that goes on and off |
|
|
Term
|
Definition
| occurs in one direction; dog goes in on one side and exits on the other |
|
|
Term
|
Definition
| either side can be the side where an aversive stimulus occurs and either side can be safe side |
|
|
Term
| two-process theory of avoidance |
|
Definition
avoidance learning involves two forms of conditioning: 1. pavlovian conditioning of fear to a stimulus that signals aversive stimulation 2. instrumental conditioning of the avoidance response by fear reduction |
|
|
Term
| factors that affect avoidance learning |
|
Definition
1. severity of aversive stimulus 2. complexity of aversive stimulus 3. length of time between CS and aversive event |
|
|
Term
|
Definition
| if you do behavior, it will be followed by an aversive stimulus |
|
|
Term
| omission training (negative punishment) |
|
Definition
| negative relationship between behavior and stimulus consequence |
|
|
Term
|
Definition
| period of time in which reinforcement (appetitive stimulus) is unavailable. when time out ends, reinforcement is available again |
|
|
Term
|
Definition
| once you remove appetitive stimulus, no return the stimulus, it cost you something you will not be able to get back |
|
|
Term
| Factors that affect punishment |
|
Definition
1. response contingency 2. contiguity 3. consistency 4. severity 5. minimize positive RF |
|
|
Term
| response contingency in punishment |
|
Definition
| in order for punishment to be effective, it must be contingent on behavior |
|
|
Term
| contiguity of punishment relative to behavior |
|
Definition
| to be effective, stimulus must be applied immediately after behavior |
|
|
Term
| consistency of punishment |
|
Definition
| punishment should be delivered every time behavior occurs to be effective |
|
|
Term
|
Definition
| punishment must be as intense as possible without hurting subject the 1st time it is presented |
|
|
Term
|
Definition
| forbidden toy experiment - louder noise causes children to fear a forbidden toy more than a soft noise does |
|
|
Term
| minimize positive reinforcement in punishment |
|
Definition
| punishment may end up being positively reinforcing - depends how subject perceives behavior |
|
|