What Is Operant Conditioning? Operant Conditioning In A Nutshell

Operant conditioning was first described by American psychologist and behaviorist B. F. Skinner in 1938. Skinner believed classical conditioning was too simplistic to adequately account for complex human behavior. Instead, he suggested the best way to explain and predict behavior was to analyze the external causes of an action and its consequences. Operant conditioning is a method of learning where the consequences of a response determine the probability of it being repeated. 

Understanding operant conditioning

Skinner called his approach operant conditioning and based the theory on Edward Thorndike’s 1898 law of effect.

Skinner used the term operant to describe “any active behavior that operates upon the environment to generate consequences.” 

Operant conditioning is based on a relatively simple premise.

Actions that are reinforced or rewarded will be strengthened and more likely to occur in the future.

Actions that are punished or lead to undesirable consequences are less likely to occur in the future.

Through these associations, the learner comes to connect a behavior with its consequences.

The phenomenon is perhaps best exemplified by describing a laboratory rat in a Skinner box, otherwise known as an operant conditioning chamber.

When the rat presses a lever while a green light is illuminated, it is rewarded with food.

When the rat presses the same lever under a red light, it is punished with a mild electric shock.

Over time, the rat learns to only press the lever when the green light is illuminated.
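The rat's learning can be sketched as a toy simulation. This is only an illustration under stated assumptions, not Skinner's own model: the starting press probability of 0.5, the `learning_rate` value, and the update rule are all hypothetical choices made for the example.

```python
import random


def simulate_skinner_box(trials=2000, learning_rate=0.1, seed=0):
    """Toy model: the rat keeps one lever-press probability per light color."""
    rng = random.Random(seed)
    # Assumed starting point: the rat presses half the time under either light.
    press_prob = {"green": 0.5, "red": 0.5}
    for _ in range(trials):
        light = rng.choice(["green", "red"])
        if rng.random() < press_prob[light]:  # the rat presses the lever
            # Food under green reinforces pressing (push toward 1.0);
            # a shock under red punishes it (push toward 0.0).
            target = 1.0 if light == "green" else 0.0
            press_prob[light] += learning_rate * (target - press_prob[light])
    return press_prob


if __name__ == "__main__":
    print(simulate_skinner_box())
```

In this sketch, the probability of pressing under the green light drifts toward 1 while the probability under the red light drifts toward 0, mirroring the rat that learns to press only when the green light is on.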

Skinner’s theory of operant conditioning is one of many stimulus-response behavioral theories.

Each theory assumes behavior manifests as a result of the interplay between stimulus and response.

That is, behavior cannot exist without a stimulus of some kind.

Operant conditioning components

There are four key components of operant conditioning. Let’s take a look at each below.


Reinforcers describe any factor that strengthens or increases the behavior it follows. 

There are two types:

Positive reinforcers

Favorable events or outcomes that present themselves after the behavior, such as praise or a direct reward.

A bonus given to an employee for exceeding their sales target is an example of a positive reinforcer.

Negative reinforcers

Here, unfavorable events or outcomes are removed after the display of certain behavior.

For example, when parents hand chocolate to a child who is misbehaving in public and the misbehavior stops, the removal of that unpleasant situation negatively reinforces the parents’ behavior of offering chocolate.


Punishment is defined as an adverse event or outcome that causes a decrease in the behavior it follows.

Here, there are also two types:

Positive punishment

Where an unfavorable event or outcome is presented to weaken the response it follows.

Returning to the previous example, parents who spank their children for misbehaving in public are using positive punishment.

This approach is sometimes referred to as punishment by application.

Negative punishment

Where a favorable event or outcome is removed after certain behavior takes place.

For example, parents who take away a child’s video game privileges after the child fails to complete assigned homework are using negative punishment.

This approach is sometimes called punishment by removal.

Key takeaways

  • Operant conditioning is a method of learning where the consequences of a response determine the probability of it being repeated. The learning method is a stimulus-response theory developed by B.F. Skinner in 1938, who drew inspiration from the work of Edward Thorndike.
  • Operant conditioning is based on a relatively simple premise. Actions that are reinforced will be strengthened and more likely to occur in the future. Actions that are punished are less likely to occur in the future.
  • Operant conditioning has four key components: positive reinforcers, negative reinforcers, positive punishment, and negative punishment. Each component differs according to how rewards and punishments are used to influence behavior.

Connected Learning Methods

Feynman Technique

The Feynman Technique is a mental model and strategy for learning something new and committing it to memory. It is often used in exam preparation and for understanding difficult concepts. Physicist Richard Feynman developed this method, and it is a powerful technique for explaining anything.

5 Whys Method

The 5 Whys method is an interrogative problem-solving technique that seeks to understand cause-and-effect relationships. At its core, the technique is used to identify the root cause of a problem by asking the question of why five times. This might unlock new ways to think about a problem and therefore devise a creative solution to solve it.


SMART Goals

A SMART goal is any goal with a carefully planned, concise, and trackable objective. To qualify, a goal needs to be specific, measurable, achievable, relevant, and time-based. Bringing structure and trackability to goal setting increases the chances goals will be achieved, and it helps align the organization around those goals.

Occam’s Razor

Occam’s Razor states that one should not increase (beyond reason) the number of entities required to explain anything. All things being equal, the simplest solution is often the best one. The principle is attributed to 14th-century English theologian William of Ockham.

Inverted Pyramid

The inverted pyramid style is a process used in journalism which inverts the logic of the way a story is told. Rather than starting from the story details, you start from a hook that gets the reader interested, giving them a quick payoff.

Active Listening

Active listening is the process of listening attentively while someone speaks and displaying understanding through verbal and non-verbal techniques. Active listening is a fundamental part of good communication, fostering a positive connection and building trust between individuals.

Active Recall

Active recall enables the practitioner to remember information by moving it from short-term to long-term memory, where it can be easily retrieved. The technique is also known as active retrieval or practice testing. Whereas conventional study tries to put information into the brain, active recall reverses the process: learning occurs when the student retrieves information from the brain.

Baptism by Fire

The phrase “baptism by fire” originates from the Bible in Matthew 3:11. In Christianity, the phrase was associated with personal trials and tribulations and was also used to describe the martyrdom of an individual. Many years later, it was associated with a soldier going to war for the first time. Here, the baptism was the battle itself.  “Baptism by fire” is a phrase used to describe the process of an employee learning something the hard way with great difficulty. 

Dreyfus Model

The Dreyfus model of skill acquisition was developed by brothers Hubert and Stuart Dreyfus at the University of California, Berkeley, in 1980. The Dreyfus model of skill acquisition is a learning progression framework. It argues that as one learns a new skill via external instruction, they pass through five stages of development: novice, advanced beginner, competent, proficient, and expert.

Kolb Learning Cycle

The Kolb reflective cycle was created by American educational theorist David Kolb. In 1984, Kolb created the Experiential Learning Theory (ELT) based on the premise that learning is facilitated by direct experience. In other words, the individual learns through action. The Kolb reflective cycle is a holistic learning and development process based on the reflection of active experiences.

Method of Loci

The Method of Loci is a mnemonic strategy for memorizing information. The Method of Loci gets its name from the word “loci”, which is the plural of locus, meaning location or place. It is a form of memorization where an individual places information they want to remember at points along an imaginary journey. By retracing the same route through the journey, the individual can recall the information in a specific order. For this reason, many consider this memory tool a location-based mnemonic.

Experience Curve

The Experience Curve argues that the more experience a business has in manufacturing a product, the more it can lower costs. As a company gains know-how, it also gains in labor efficiency, technology-driven learning, product efficiency, and shared experience, which reduces the cost per unit as the cumulative volume of production increases.
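The relationship between cumulative volume and unit cost is often written as a power law. The sketch below uses that common form as an assumption: the function name `unit_cost`, the parameter `progress_ratio`, and the 80% figure are illustrative and do not come from the text above.

```python
import math


def unit_cost(first_unit_cost, cumulative_units, progress_ratio=0.8):
    """Cost of the n-th unit, assuming cost falls to `progress_ratio` of its
    previous value every time cumulative production doubles."""
    exponent = math.log2(progress_ratio)  # negative when the ratio is below 1
    return first_unit_cost * cumulative_units ** exponent


if __name__ == "__main__":
    for n in (1, 2, 4, 8):
        print(n, round(unit_cost(100.0, n), 2))
```

With an assumed 80% ratio and a first-unit cost of 100, each doubling of cumulative volume multiplies the unit cost by 0.8.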

Learning Organization

Learning organizations are those that encourage adaptive and generative learning where employees are motivated to think outside the box to solve problems. While many definitions of a learning organization exist today, author Peter Senge first popularized the term in his book The Fifth Discipline: The Art & Practice of The Learning Organisation, published in 1990.

Forgetting Curve

The forgetting curve was first proposed in 1885 by Hermann Ebbinghaus, a German psychologist and pioneer of experimental research into memory. The forgetting curve illustrates the rate at which information is lost over time if the individual makes no effort to retain it.
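A common simplification models the curve as exponential decay. The sketch below assumes that simplified form rather than Ebbinghaus's original formula, and the names `retention` and `stability_days` are hypothetical.

```python
import math


def retention(days_elapsed, stability_days=2.0):
    """Fraction of material still remembered after `days_elapsed` days,
    assuming simple exponential decay with no review in between."""
    return math.exp(-days_elapsed / stability_days)


if __name__ == "__main__":
    for day in (0, 1, 2, 7):
        print(day, round(retention(day), 3))
```

In this model, retention starts at 1.0 and falls quickly at first, then more slowly; a larger `stability_days` value stands for a memory that fades more gradually.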

Instructor-Led Training

Instructor-led training is a more traditional, top-down, teacher-oriented approach to learning that occurs in online or offline classroom environments. The approach connects instructors with students to encourage discussion and interaction in a group or individual context, with many enjoying ILT over other methods as they can seek direct clarification on a topic from the source.  Instructor-led training (ILT), therefore, encompasses any form of training provided by an instructor in an online or offline classroom setting.

Single-Loop Learning

Single-loop learning was developed by Dr. Chris Argyris, a well-respected author and Harvard Business School professor in the area of metacognitive thinking. He defined single-loop learning as “learning that changes strategies of action (i.e. the how) in ways that leave the values of a theory of action unchanged (i.e. the why).”  Single-loop learning is a learning process where people, groups, or organizations modify their actions based on the difference between expected and actual outcomes.

Spaced Repetition

Spaced repetition is a technique where individuals review lessons at increasing intervals to memorize information. Spaced repetition is based on the premise that the brain learns more effectively when the individual “spaces out” the learning process. Thus, it can be used as a mnemonic technique to transform short-term memory into long-term memory.
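The idea of increasing intervals can be sketched as a simple schedule. The doubling rule, the one-day first interval, and the function name `review_schedule` are assumptions chosen for illustration; real spaced repetition systems typically adjust intervals based on how well each item is recalled.

```python
from datetime import date, timedelta


def review_schedule(start, reviews=5, first_interval_days=1, multiplier=2):
    """Return review dates whose gaps grow by `multiplier` after each review."""
    dates = []
    interval = first_interval_days
    current = start
    for _ in range(reviews):
        current = current + timedelta(days=interval)
        dates.append(current)
        interval *= multiplier  # space the next review further out
    return dates


if __name__ == "__main__":
    for review_date in review_schedule(date(2024, 1, 1)):
        print(review_date.isoformat())
```

Starting from 1 January 2024, this sketch schedules reviews 1, 2, 4, 8, and 16 days apart.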

Related Strategy Concepts: Mental Models, Biases, Bounded Rationality, Mandela Effect, Dunning-Kruger Effect, Lindy Effect, Crowding Out Effect, Bandwagon Effect, Decision-Making Matrix.
