44 min read

It's 11 PM on a Tuesday in October, and Marcus is sitting at his desk with Gray's Anatomy open to a full-page spread of the brachial plexus. He's read the same page three times. He can follow the sentences. He understands, in some loose sense, that...

Chapter 11: Dual Coding and Visualization: Why Two Representations Beat One

It's 11 PM on a Tuesday in October, and Marcus is sitting at his desk with Gray's Anatomy open to a full-page spread of the brachial plexus. He's read the same page three times. He can follow the sentences. He understands, in some loose sense, that nerves are coming from the spinal cord and going to the arm. He could pass a reading comprehension quiz right now.

But when he covers the page and tries to sketch the structure from memory — nothing. He has words. He doesn't have the thing.

On an impulse, he puts down the textbook and starts drawing. Not copying — guessing. He draws a column of five spinal levels down the left side: C5, C6, C7, C8, T1. Then he tries to remember how they connect. He draws tentative lines, erases, redraws. He gets the names of two terminal nerves wrong. He misremembers the order. He checks the book, corrects his errors, draws again from scratch.

The second attempt is better. The third is better still.

An hour later, he can close the book and reproduce the entire structure — the roots, the trunks, the divisions, the cords, the five terminal branches — without looking. Something that had defeated him through three read-throughs clicked through twenty minutes of drawing badly.

What happened in that hour was dual coding. And understanding why it worked will change how you study everything.


Why Your Brain Has Two Memory Systems

To understand dual coding, you need a working model of how the brain actually stores information. And the key insight is that it doesn't store everything the same way.

Allan Paivio was a Canadian psychologist who spent most of his career at the University of Western Ontario studying human memory. In 1971, he published his dual coding theory — a framework that proposed something genuinely radical for its time: that the brain doesn't have one unified memory system but two distinct representational systems that operate in parallel.

The first is the verbal system: the system that handles language. When you read a word, hear a sentence, or think in language, this system is doing the work. It stores linguistic information in sequential, symbolic form — strings of meaningful symbols that refer to things in the world.

The second is the imagistic system (or nonverbal system): the system that handles sensory and spatial information. When you visualize a face, mentally rotate an object, recall what your childhood kitchen smelled like, or navigate a familiar building in your mind, this system is working. It stores information in analogical form — representations that share structural properties with what they represent. A mental image of a triangle is, in some meaningful sense, triangular in its representational structure.

Paivio's key claim wasn't just that these two systems exist. It was that they can operate independently and store the same information separately, creating two independent retrieval pathways to the same knowledge.

Think about what this means practically. If you've only ever encountered "brachial plexus" as words in a textbook, you have one retrieval pathway: the verbal one. When you try to recall the brachial plexus, you're depending entirely on that single pathway. If it's degraded — if the memory is weak, if you're stressed during an exam, if another piece of information is interfering — you have no backup.

But if you've also drawn the brachial plexus, struggled to reconstruct it from memory, and built a mental image of its spatial structure, you have two retrieval pathways. The verbal pathway leads you to the names. The imagistic pathway leads you to the picture. Each can prompt the other. Two pathways means higher probability of retrieval. Two pathways means redundancy — and in memory, redundancy is strength. [Evidence: Strong]

The Research Foundation

The research since Paivio has repeatedly confirmed the basic prediction of dual coding theory. People who learn from words plus pictures consistently outperform people who learn from words alone across a wide range of subjects, ages, and test types. This is sometimes called the multimedia learning effect, a term associated with Richard Mayer at UC Santa Barbara, who ran dozens of experiments through the 1990s and 2000s testing exactly these questions.

Mayer's findings added crucial nuance to Paivio's framework. The multimedia effect isn't automatic — you don't get it just by putting any picture next to any words. You get it when:

  • The images are relevant and integrated, not decorative. A stock photo of a person looking thoughtful next to text about cognitive science does nothing. A well-constructed diagram of the cognitive architecture described by the text does a great deal.
  • The visual and verbal information appear close together in time and space. Split-attention — diagram on one page, text on the next, or narration separated from the relevant image by several seconds — reduces or eliminates the benefit.
  • When animations are used with narration, narration beats on-screen text. The reason: narration and visual information use different channels (auditory and visual), while both narration and on-screen text compete for the same visual processing channel. This is the split-attention effect in reverse. [Evidence: Moderate-Strong]

These nuances matter for how you study. Random images don't help. Relevant diagrams do. The picture needs to do something the words can't.


The Critical Distinction: Dual Coding Is NOT Learning Styles

This is important enough to give its own section, because the confusion between dual coding and learning styles theory derails the benefits of dual coding for a significant proportion of people who hear about it.

Learning styles theory says that people have dominant modes of learning — visual, auditory, kinesthetic, read/write — and that education should be tailored to match each person's preferred style. Visual learners should get diagrams. Auditory learners should hear information. And so on.

The research does not support this. Not in an "hmm, mixed results" way. In an "this has been tested systematically and it doesn't hold up" way. The theory's core prediction is the matching hypothesis: that matching instruction format to a learner's preferred style should improve outcomes relative to mismatching. Study after study has tested this prediction by identifying students as "visual learners" or "auditory learners" and then randomly assigning them to matched or mismatched instruction — and the matching doesn't produce better outcomes. The American Psychological Association, the Association for Psychological Science, and large-scale systematic reviews have all reached the same conclusion. [Evidence: Strong]

But here's where people go wrong. When they hear "visual learners don't exist," they sometimes conclude: "Oh, so visual information doesn't matter." That's exactly backward.

The difference is this:

Learning styles theory says: Some people benefit from visuals. Those people are visual learners. Other people don't benefit from visuals. Match the format to the person.

Dual coding theory says: Everybody benefits from combining verbal and visual information, because everyone has both systems. It's not about who you are — it's about how memory works.

Dual coding isn't advice for visual learners. It's a universal principle about how the brain encodes and retrieves information. When Marcus draws the brachial plexus, he's not succeeding because he happens to be a visual learner. He's succeeding because he's a human with both a verbal and an imagistic memory system, and he's activating both.

Here's a simple test you can run on yourself. Think of something you know deeply — something you could explain to anyone, answer questions about from any angle. Your childhood home. How to drive a car. A subject you've studied for years.

Now ask yourself: do you have verbal knowledge about it (can you describe it in words) or visual/spatial knowledge (can you picture it, navigate it mentally), or both?

Almost certainly, both. The things you know best are the things encoded in multiple modalities. The things you struggle to retrieve are usually things you've only ever encountered in one form.

That's dual coding. Not learning styles. The two ideas are fundamentally different, and conflating them costs you the benefit of one of the most powerful learning principles available.


Sketch-Noting: Dual Coding During Lectures

Here's the practical challenge with dual coding: most of the time when you're learning, you're in a lecture hall or sitting with a textbook, and you're not drawing. You're writing notes or highlighting text. Dual coding sounds great in theory, but how do you actually do it when information is coming at you continuously?

Sketch-noting is the answer.

Developed as a practice by visual designer Mike Rohde and studied by researchers including Elise Piscitelli, sketch-noting combines verbal note-taking with deliberate visual elements — simple drawings, icons, diagrams, spatial arrangements, arrows, boxes — to create notes that encode information through both systems simultaneously.

The key word is "simple." Sketch-noting doesn't require artistic talent. It requires willingness to represent ideas visually using whatever crude symbols come to hand. A circle with a label. An arrow indicating causation. A rough sketch of the shape you're trying to remember. A simple timeline with marks. That's sketch-noting. The visual element doesn't have to be accurate or beautiful — it has to be visual.

How to Do It During a Lecture

Start before the lecture. Write the topic at the top of your page. If you know what's coming, sketch a quick visual question — what does this concept look like? Even a rough sketch of your expectation primes the visual system for what's coming.

Use a landscape orientation. Sketch-noting naturally benefits from more horizontal space. Turning your notebook sideways gives you room to spread information spatially rather than squeezing it into a vertical column.

Capture the structure, not the content. When the professor says "there are three types of X," don't write "three types of X: 1, 2, 3." Sketch three boxes or three bubbles and label them. The spatial arrangement carries meaning that the numbered list loses.

Add a visual for every major concept. Not every sentence — every major concept. A quick stick figure, a simple icon, a rough diagram. If you can't think of a visual for an abstract concept, use a question mark with a symbol that represents the feeling of the idea. Just the act of pausing to think "what would this look like?" forces deeper processing than transcription.

Use arrows to show relationships. Causation, sequence, contrast, hierarchy — these relationships are natural arrow territory. An arrow from A to B saying "causes" encodes both the components and their relationship in a single visual unit.

Don't fall behind the lecture trying to perfect the drawing. The sweet spot is quick, rough, and complete. If you're pausing the lecture in your head to execute a careful illustration, you've gone too far. Three seconds per visual element is about right.

The Research on Sketch-Noting

Research on sketch-noting in lecture contexts finds consistent positive effects. Students who take sketch notes recall more content after a delay than students taking text-only notes, and they report higher engagement during the lecture itself. [Evidence: Moderate]

The mechanism appears to be twofold. First, the obvious dual coding effect: visual and verbal encoding together create stronger, more redundant memory traces. Second, and more subtly, the act of deciding "how do I represent this visually?" prevents the cognitive passivity that makes pure transcription so ineffective. You can write words without understanding them. It's much harder to draw a concept without having some model of what it actually is.

One particularly interesting finding: the benefit of sketch-noting doesn't come entirely from the notes themselves. A significant portion comes from the process of creating them — the active engagement, the decision-making about representation, the visual construction happening in real time. Sketch-noting changes what happens in the lecture, not just what you have afterward.

What Sketch-Noting Handles Well and Poorly

Works well: - Processes with steps and arrows (metabolic pathways, historical sequences, engineering workflows) - Comparisons and contrasts (visual side-by-side presentations make differences pop) - Hierarchical classifications (simple tree structures encode taxonomies naturally) - Spatial or physical information (anything with a shape or location — anatomy, geography, architecture) - Causal chains (A causes B causes C, represented visually)

Works less well: - Highly quantitative material (numbers and equations don't easily become sketches) - Very fast-paced lectures where you're already struggling to keep up at all - Material where verbatim wording is essential (legal definitions, quotations you'll need to cite precisely)

The practical guidance: you don't need to sketch-note everything. Pick the structural content — the concepts, the relationships, the processes — and use standard text notes for precise definitions and quantitative details. As you practice, you'll develop a natural sense of when to reach for the pen and when to stay verbal.


Mental Imagery: Building the Picture While You Read

Sketch-noting addresses lectures. But what about reading — the hours you spend with textbooks, papers, and online materials? You're not drawing during reading, at least not constantly. But dual coding is still possible, through mental imagery.

Mental imagery means deliberately constructing a visual representation in your mind as you read, rather than processing the words purely as abstract symbols. This is something skilled readers often do spontaneously, but it can be cultivated explicitly as a learning strategy with significant results.

Here's what it looks like in practice. You're reading about how the complement system works — the series of proteins in blood serum that can identify and destroy pathogens. A purely verbal reading processes those words as symbols. A mental imagery approach builds a movie: imagine millions of Y-shaped proteins (antibodies) floating through blood vessels, each one shaped to bind to a specific surface marker. When a bacterium with the right surface enters the bloodstream, the antibodies snap onto it like molecular lock-and-key. This binding triggers a cascade — the first complement protein bumps into the antibody-coated bacterium and gets activated, which activates the next protein, which activates the next, building a "membrane attack complex" that punches a hole in the bacterial membrane. The bacterium fills with water, swells, and bursts.

You've read a mechanistic definition. But now you also have a movie. Verbal pathway and imagistic pathway, both activated by the same reading event.

Imagery for Abstract Concepts

The technique works straightforwardly for concrete, physical content. But what about abstract concepts — ideas that don't have an obvious physical form? Justice. Entropy. Correlation. Narrative arc.

This is where the technique requires more creative work, but the approach is clear: you need an analogy that maps the abstract concept onto something concrete and visual. This isn't dumbing things down — it's creating the visual anchor that abstract language can't provide on its own.

Consider some examples:

Psychology — Cognitive Dissonance: The discomfort of holding two contradictory beliefs simultaneously. Mental image: a person trying to walk in two opposite directions at once, feet pulling apart, body tensed between two incompatible pulls. The discomfort of that position is the cognitive dissonance.

Economics — Equilibrium: The price point where supply equals demand. Mental image: two sloping surfaces meeting at a point — a valley bottom where a ball rolling down either slope would come to rest. The equilibrium is the stable point that both forces converge toward.

Statistics — Regression to the Mean: The tendency for extreme scores to be followed by less extreme ones. Mental image: a dart thrower who lands an unusually perfect throw in the bullseye. The next throw will probably not be quite as good — not because they've gotten worse, but because the first throw included some lucky variance that won't necessarily recur.

Physics — Entropy: The tendency of ordered systems to become disordered. Mental image: your bedroom. You can spend an hour putting everything away in the right place. Leave it alone for a week and it starts to drift. Leave it for a month and it's chaos. Restoring order requires deliberate work. Disorder accumulates on its own. That's entropy.

None of these images are precise physical models. They're memory handles — visual hooks that give the verbal information something to attach to. And for the purposes of remembering and retrieving, that's exactly what they need to be.

The practical habit: whenever you encounter an abstract concept that doesn't have an obvious visual form, pause and ask "what would this look like?" If the answer doesn't come immediately, try "what real thing in the world behaves like this?" Build the analogy. Let it be imperfect. Write it down. Use it.


The Method of Loci: The Ancient Art of the Memory Palace

If dual coding is powerful, the method of loci is its most dramatic application. It's over 2,000 years old. It works staggeringly well. And it's almost universally unused by the students who would benefit most from it.

A Brief History

The method of loci comes with a story, as reported by Cicero and others. The Greek poet Simonides of Ceos was at a banquet when he was called outside. While he was absent, the roof collapsed, killing all the guests. The bodies were so badly crushed that family members couldn't identify their relatives for burial.

Simonides, according to legend, was able to identify each guest by recalling where they had been seated at the table — using the spatial arrangement of the room as a retrieval cue for each person's identity.

Whether the story is precisely true is less important than what it demonstrates: spatial memory and episodic memory are deeply intertwined. We are extraordinarily good at remembering where things are and were. The method of loci systematically exploits this ancient capability.

Cicero described the technique in De Oratore (55 BC). Medieval scholars used it to memorize vast quantities of theological content. It was standard equipment for educated men in classical antiquity. And today, every competitive memory champion — the people who memorize shuffled decks of cards in under two minutes, or hundreds of digits of pi, or the order of all 52 cards across ten simultaneous shuffled decks — uses this technique. [Evidence: Strong]

How to Build Your First Memory Palace

Step 1: Choose your location.

Pick a familiar physical space. Your childhood home is ideal — you know it intimately, without having to think about it. Your current apartment works. The route you walk to class works. The building where your lectures happen works.

You need a space with a clear sequence of distinct locations that you can move through in a consistent order. Your childhood home might go: front door → entry hall → living room → dining room → kitchen → hallway → bathroom → your bedroom. That's eight distinct locations — eight available "slots" for information.

Step 2: Walk through it mentally and cement the route.

Before placing any information, walk through your palace in your mind. Really see it. Notice the specific objects in each location. The thing on the entry hall table. The couch in the living room, what color it is, where it sits. The window in the kitchen. Walk through several times until the route feels automatic.

Step 3: Create vivid images for what you want to remember.

For each piece of information you want to memorize, create a mental image. The more bizarre, emotionally charged, concrete, and sensory-rich the image, the better it will stick. Memory research consistently shows that unusual, vivid, emotionally evocative images are remembered far better than neutral, generic ones.

Say Marcus wants to memorize the five terminal branches of the brachial plexus in order: musculocutaneous, axillary, radial, median, ulnar.

He might create: - A muscular CUT (musculocutaneous) — a cartoon bodybuilder with a scalpel - An axe swinging (axillary) — a woodcutter with a huge axe - A radio blaring (radial) — an old-fashioned boombox turned up to maximum volume - A MEDIAN (medial strips) — yellow road dividing lines painted across a floor - An ululating person (ulnar) — someone singing in an eerie, ululating voice

Bizarre? Yes. Memorable? Enormously.

Step 4: Place each image at a specific location in your palace.

Walk through your palace in order and place each image at its corresponding location. The muscular bodybuilder is at your front door — maybe he's blocking it, using his scalpel to carve something into it. The woodcutter is in your entry hall, chopping at the hat rack. The radio is in the living room, blaring from the couch cushions. The yellow lines are painted across your dining room floor. The ululating singer is in the kitchen, singing at the refrigerator.

Place each image interacting with the location — not just sitting near it. The interaction makes the encoding richer.

Step 5: To recall, take a mental walk.

Whenever you need the information, mentally walk through your palace. You arrive at the front door and see the bodybuilder with his scalpel — musculocutaneous. You step into the entry hall and see the axe swinging — axillary. You walk into the living room and hear the radio blaring — radial. And so on.

The spatial structure of your familiar location serves as a retrieval cue. You don't have to remember the items directly — you remember where you are, and the where prompts the what.

Why This Works Neuroscientifically

Two mechanisms explain why the method of loci is so effective, and both connect back to dual coding.

First, humans have extraordinary spatial memory. The hippocampus and entorhinal cortex contain what are called "place cells" and "grid cells" — neurons that fire specifically when you're in particular locations in space, or when you're navigating through a spatial grid. These are some of the most studied neurons in neuroscience. We have dedicated neural machinery for building and navigating spatial cognitive maps, machinery that has been selected for over millions of years of primate evolution. Placing information inside a spatial location recruits this powerful, ancient system.

Second, the vivid, bizarre imagery maximizes imagistic encoding. Emotionally charged, unusual, concrete images are remembered far better than neutral, abstract ones — a finding that has been replicated consistently in memory research for decades. When Marcus pictures a bodybuilder carving his front door with a scalpel, he's activating the imagistic system at high intensity.

The method of loci is, in essence, the most extreme form of dual coding: it takes abstract verbal information (nerve names), converts them into maximally vivid images (bizarre visual scenes), and anchors those images to a spatial memory structure (the palace) that the brain navigates with native facility.

A meta-analysis of mnemonic techniques found consistently large effect sizes for method of loci compared to standard study techniques. The effects are not subtle. [Evidence: Strong]

When to Use It

Ideal use cases: - Sequences where order matters: Cranial nerves. Presidents in chronological order. Steps in a biochemical pathway. Stages of mitosis. - Lists of distinct items: Bone names. Drug classes. Taxonomic categories. Vocabulary words. - Presentations and speeches: Attach key points to rooms in your palace; walk through it as you speak. - Historical timelines: Map events spatially with each location representing a different era.

When NOT to use it: - For understanding conceptual relationships (the method of loci stores items, not the logic between them — use concept maps for that) - For highly quantitative material (numbers and equations resist vivid imagery) - For content where you need to understand deeply rather than retrieve reliably (it's a retrieval tool, not a comprehension tool)

On the Effort Involved

Building a memory palace for the first time feels slow and strange. You will likely think: this seems like more work than just reading the list again. It is more work. The first palace takes real effort.

What you get for that effort is remarkable and enduring. Students who commit to the method of loci for anatomy, chemistry, or law regularly report that it transforms their ability to reliably retrieve sequences and lists. Not "somewhat improves" — transforms. And once you've built several palaces and the technique is familiar, it speeds up substantially. Expert users of the method of loci can place items in their palace at a rate of a few seconds per item.

Start small. Take the seven techniques from Part II of this book — retrieval practice, spaced practice, interleaving, elaborative interrogation, dual coding, desirable difficulties, and one more of your choosing — and build a simple palace for them. Use a familiar space. Make the images absurd. Walk through it mentally several times. Then tomorrow, without looking, try to recall all seven from the walk.

You'll see.


When Dual Coding Is Harder (And Creative Approaches for Those Cases)

Not all content yields easily to visual representation, and it's worth being honest about where the technique requires more creative work.

Highly abstract content — philosophical arguments, legal analysis, literary theory — doesn't have an obvious visual form. You can't draw "epistemic closure" or "stare decisis" without first constructing an analogy. The extra step of analogy-construction is the creative work, and it's worth doing, but it's genuinely more effortful than drawing a cell membrane.

Heavily mathematical content presents a different challenge. Equations don't naturally become vivid images. But the meaning behind mathematical relationships often does. A derivative is the slope of a tangent line — which you can draw. An integral is the area under a curve — which you can shade. A probability distribution is the shape of likelihood across possible outcomes — which you can sketch as a bell curve or a skewed shape. Translating mathematical abstraction back to its geometric meaning is a form of dual coding, and mathematicians who do this habitually often develop much deeper intuition than those who work purely symbolically.

Very dense verbal content — technical writing, complex argument structures — can be mapped using concept diagrams that represent relationships rather than objects. The relationships themselves become visual through the spatial arrangement and directional arrows on the page. The argument can be mapped even when the content can't be drawn.

The general principle: always ask "what would this look like?" Even when the honest answer is "I have to construct an analogy," asking the question forces the kind of generative processing that deepens encoding. The question never fails. It always makes you think harder about the concept than passive reading would.


Dual Coding Across Domains

Let's make this concrete by following our four readers into their specific learning contexts.

Marcus: Medical Anatomy and Pharmacology

For Marcus, dual coding is almost too easy — anatomy is spatial by nature. But the key principle is this: draw from memory, not from the book. Looking at the textbook diagram activates the imagistic system mildly. Trying to reconstruct the diagram from memory activates it intensely and adds retrieval practice simultaneously.

After reading about any structure, Marcus puts the book down and draws. He doesn't try to replicate the diagram perfectly — he draws what he remembers, identifies gaps, checks the book, draws again. The drawing is a test as much as it is a study activity.

For pharmacology, he creates visual mnemonics: drug names paired with images that capture their mechanisms. Beta-blockers (they block adrenergic receptors) become a bouncer blocking the door of a club. Statins (HMG-CoA reductase inhibitors) become a statue blocking a pathway. The images are imprecise but memorable, and the verbal label "beta-blocker" activates the visual bouncer, which activates "blocks adrenergic receptors."

For signaling cascades and clotting pathways, he draws them as flowcharts — not as lists but as spatial process diagrams, with his own visual conventions for different types of steps. He's built a personal visual vocabulary for biochemical processes.

David: Machine Learning Architecture

David is learning ML from textbooks and courses. The content ranges from concrete (this is what a neural network architecture looks like) to abstract (this is why gradient descent converges).

For architecture, dual coding is straightforward: David sketches network diagrams by hand before looking at the course's provided diagrams. For the mathematics of gradient descent, he works to understand the geometric intuition — what does it mean to walk downhill in a high-dimensional loss landscape? He draws simplified 3D versions of loss surfaces, visualizes what a local minimum looks like, sketches the effect of different learning rates.

For the statistical foundations, he creates analogy images: overfitting is a meandering line trying to pass through every single data point rather than capturing the underlying trend. Bias-variance tradeoff is a target with arrows either clustered far from center (high bias) or scattered widely (high variance). These images stick when the equations drift.

Amara: Pre-Med Coursework

Amara's challenge is breadth — she's covering biology, chemistry, physics, and social science simultaneously. Dual coding gives her a way to encode across all of them without separate memorization systems.

For biology, she uses anatomical drawings and process diagrams. For chemistry, she builds mental models of molecular structures and reaction mechanisms. For history of medicine and social science, she creates timeline sketches and maps of causal relationships.

Her key practice is the "one-page visual summary": after each chapter or lecture, she draws a one-page map of the major concepts and their relationships from memory. The act of drawing tests her understanding (she quickly discovers what she doesn't actually know), creates a visual record, and consolidates the material through dual encoding.

Keiko: Competitive Swimming and Movement Learning

Keiko's learning is primarily procedural — she's developing physical skills. But dual coding applies here in a specific way: mental rehearsal is essentially imagistic encoding of motor patterns, and research on mental rehearsal in sports is well-established. [Evidence: Moderate-Strong]

When Keiko is learning a new turn technique, she doesn't just practice it physically. She watches expert video and creates a detailed mental image of the movement: the timing of the approach, the flip, the push, the streamline. Then she mentally rehearses that image — running the movie of the expert technique through her mind. Before every physical practice rep, she runs the mental movie first.

The dual coding principle: the mental image (imagistic encoding) and the physical performance (kinesthetic encoding) reinforce each other. Both store the pattern; each retrieval cue can activate the other.

Language Learning: The Keyword Method

One of the most research-supported applications of dual coding in any domain is the keyword method for vocabulary learning. [Evidence: Strong]

The principle: instead of learning a foreign word by pairing it with its translation, you create a mental image that visually bridges the foreign word's sound and its meaning.

The Spanish word mariposa means "butterfly." To memorize this with the keyword method: mariposa sounds like "Marry a Poser." Imagine a butterfly in a wedding dress, marrying someone who's posing dramatically for photographs. The bizarre image bridges the phonetic form (the sound of the word) and the meaning (butterfly). When you hear mariposa, the image activates; the image activates butterfly.

This technique is genuinely more effortful than just reading a vocabulary list. It's also substantially more effective, especially for long-term retention. The effort is the point.

History and Social Sciences

For historical content, dual coding takes the form of spatial-temporal maps. Instead of a list of dates, you create a genuine timeline: a spatial layout where the relative distances between events represent their actual temporal relationships. The French Revolution is much closer to Napoleon's rise than to the Renaissance, and the visual representation should show that.

Cause-and-effect relationships become visual arrow diagrams. The causes of World War I can be mapped as a chain: long-term industrial competition → arms race → alliance formation → the assassination event → automatic alliance triggering → rapid escalation. Seeing the causal chain spatially is more powerful than reading it as a list.

Key figures can be placed in visual relationship to events — not just named but sketched (even as simple icons) and positioned relative to each other.

Music Theory

Music theory is full of abstract structure that responds beautifully to visual representation. Intervals (the relationships between notes) can be visualized on a piano keyboard. Chord structures become shapes on the staff. Scales become patterns of whole and half steps that look like specific spatial layouts. Rhythm patterns become visual sequences.

For more advanced music theory, David (who also plays piano) uses visual harmonic maps — diagrams showing the relationships between keys, how closely related different chords are, the "circle of fifths" as a literal spatial map rather than an abstract formula.


Common Mistakes with Dual Coding

Drawing Without Thinking

The most common dual coding failure mode is producing visual content without engaging cognitively. This can happen easily: you copy a diagram from the textbook, you reproduce a professor's flowchart from the slide, you trace a figure from the appendix.

The problem: copying activates the hand and the visual system without activating the semantic system. You can reproduce a diagram without understanding it. The value of drawing comes from the reconstruction — from trying to produce the structure from memory and discovering what you know and don't know.

The fix: always draw from memory first. Check the source after. If your drawing was wrong, draw again. The error-and-correction cycle is exactly where the learning happens.

Over-Elaborate Diagrams

A second common mistake is treating diagram quality as a proxy for learning quality. Students who are artistic sometimes produce beautiful, detailed, extensively annotated diagrams that take thirty minutes each — and then wonder why they're not learning faster.

The cognitive load of creating highly detailed diagrams can exceed the benefit of dual encoding. Working memory is occupied by the artistic execution rather than the conceptual processing. A detailed technical illustration may look like learning but function like transcription.

The fix: simpler is almost always better. A rough sketch that captures the structure in two minutes is usually more valuable than a polished illustration that takes twenty. Speed and crudeness are features, not bugs.

Confusing Visual Complexity with Conceptual Clarity

Related to the above: some students create visually complex diagrams — dense, multi-colored, elaborately interconnected — and interpret the complexity as evidence of deep understanding. Often it isn't. A diagram with fifteen boxes connected by thirty arrows might represent genuine insight into complex relationships. It might also represent a failure to identify which relationships actually matter.

The test: can you explain what each element of your diagram means and why each connection is there? Dual coding works because visual representation encodes understanding, not just because visual representation exists.

Not Integrating Verbal and Visual

Dual coding requires both systems to be active. Students sometimes create excellent diagrams and then never connect them to the verbal knowledge — the technical terms, the precise definitions, the specific claims. Or they do extensive verbal study and treat dual coding as an optional extra.

The fix: every major concept should have both a verbal label and a visual form. The two representations should be linked — you should be able to start from the word and retrieve the image, and start from the image and retrieve the word. Both directions of retrieval should work.


Try This Right Now

Build a simple memory palace for the seven techniques covered in Part II of this book: retrieval practice, spaced practice, interleaving, elaborative interrogation, dual coding, desirable difficulties, and note-taking.

Step 1: Choose a familiar space with at least seven distinct locations. Your home, your commute, a building you know well.

Step 2: Create a vivid, bizarre image for each technique. Retrieval practice might be a person physically reaching into their own head and pulling out a glowing idea. Spaced practice might be an astronaut with a huge calendar floating through space. Interleaving might be a juggler keeping seven objects of different types in the air simultaneously. Make them strange. Make them vivid.

Step 3: Place each image at one location in your palace, in order. The first technique goes at the entrance, the second at the next location, and so on. Make each image interact with its location — the astronaut with the calendar has floated through the window and is stuck on the ceiling.

Step 4: Walk through your palace three times from memory.

Step 5: Tomorrow, before you look at any notes, walk through your palace and try to name all seven techniques.

Notice what happens. Notice which locations were vivid enough to hold their image and which ones faded. That's feedback about which of your images were sufficiently bizarre and sensory, and which need to be strengthened.


Dual Coding and Retrieval Practice: A Powerful Combination

One of the most productive insights you can take from this chapter is how naturally dual coding and retrieval practice work together. Each amplifies the other in ways that are greater than either alone.

When you practice retrieval by trying to draw a concept from memory — rather than verbally reciting it — you're doing retrieval practice and dual coding simultaneously. The drawing attempt tests what you remember about the visual structure of the concept, and the act of drawing reinforces the imagistic encoding. Marcus drawing the brachial plexus from memory isn't just using dual coding. It's using dual coding retrieval practice — the most powerful combination of the two techniques available.

This combination is sometimes called the draw-to-learn approach, and it has accumulated a specific research base of its own. Studies comparing students who learn from diagrams they study vs. students who reproduce diagrams from memory consistently find substantially better retention in the retrieval group. [Evidence: Moderate-Strong] The generation effort involved in drawing from memory combines the benefits of retrieval practice (strengthening storage through effortful retrieval) with the benefits of dual coding (encoding information visually and building the imagistic pathway).

For any content that has a visual structure — anatomy, biology, chemistry, geography, engineering, architecture, data structures — the draw-from-memory habit is probably the single most powerful revision technique available. Not looking at diagrams. Drawing them from memory.

How to Do Draw-to-Learn Effectively

After every reading session: Close the book. Take a blank page. Draw everything you can remember about the structure, process, or concept you just studied. Label every element. Draw the connections. Don't worry about accuracy — you're generating, not copying.

Then check. Open the source and compare your drawing to the original. Where were you right? Where wrong? Where did you have gaps — structures you couldn't even remember enough to attempt?

Draw again. With the errors identified, close the book and draw again. This second attempt is where the corrected information gets encoded in the imagistic system. The errors from the first attempt prime you to notice and retain the corrections.

Repeat on the following day. A spaced retrieval draw the next day — without looking at the source first — combines spaced practice with dual coding retrieval practice, creating three-way overlap between some of the most powerful learning techniques available.

The draw-to-learn cycle might feel redundant or overly effortful compared to reading the diagram and feeling like you understand it. It's not redundant. The feeling of understanding from looking at a diagram is recognition, not retrieval. The struggle of drawing from memory is genuine retrieval practice, and it reveals — precisely and without flattery — what you actually know vs. what you merely recognized.


Dual Coding for Numerical and Mathematical Content

We noted earlier that mathematical content is one of the harder domains for dual coding. Let's give it more attention, because the potential is greater than it might initially appear.

The challenge with mathematics is that the symbolic language of equations is already a representation system — a highly efficient one — and converting it to visual form might seem like going backward. A student who encounters f'(x) = lim(h→0) [(f(x+h) - f(x))/h] might reasonably wonder: what would this look like as a picture?

The answer: it would look exactly like what the derivative is, geometrically. The derivative of a function at a point is the slope of the tangent line to the function's graph at that point. The limit definition is a formal way of saying: take a secant line (a line crossing the curve at two nearby points), move the points closer together, and watch what line the secant line approaches. That line is the tangent. Its slope is the derivative.

Draw that. Draw the curve. Draw the two nearby points. Draw the secant line. Now mentally move the points closer. Watch the secant approach the tangent. That picture — held in the imagistic system — is far more powerful as a foundation for calculus intuition than the symbolic limit definition alone. When you encounter integral calculus (the inverse of differentiation), the geometric intuition carries: integration is the area under the curve, summed from infinitely many infinitely thin rectangles. Draw that too.

The dual coding strategy for mathematics:

  • For every formula or equation you study, ask: what does this represent geometrically?
  • When a formula has parameters (constants, variables), ask: what does changing each parameter do to the picture? Draw several cases.
  • For proofs and derivations, draw diagrams representing each major step. Follow the argument visually as well as symbolically.
  • For word problems, draw the scenario before writing a single equation. The drawing often reveals the mathematical structure.

David, learning machine learning, applies this directly to gradient descent. The symbolic update rule (θ := θ - α∇J(θ)) is a dense expression. But the picture is vivid: you're standing somewhere on a hilly landscape (the loss surface), and at each step you take a small step in the direction that's most steeply downhill. The parameter α (learning rate) determines how large each step is — too large and you overshoot, bouncing around the valley floor; too small and you converge too slowly. That picture can be held and manipulated in the imagistic system. The symbolic expression is the formal description of the picture.

The translation back and forth between symbolic and geometric is, for many mathematical domains, precisely what distinguishes deep understanding from surface familiarity. The experts in mathematics and physics don't just work symbolically. They maintain mental visual models that they're simultaneously reading off when they manipulate equations. Building that dual-channel processing is what the visual component of mathematics study develops.


Creating a Personal Visual Vocabulary

One of the subtle advantages of sustained dual coding practice is that you develop a personal visual vocabulary — a set of icons, symbols, spatial conventions, and representational patterns that you use consistently across your notes and drawings.

This personal vocabulary matters because visual representations gain retrieval power from familiarity. When you've always used a specific visual convention — a lightning bolt for "triggers," a gear for "mechanism," a staircase for "progression," a broken arrow for "inhibits" — those conventions become automatic retrieval cues. Seeing the convention instantly activates the associated meaning, and drawing it instantly encodes in both systems.

Some elements of visual vocabulary are domain-specific: - In biology: arrows for activates, broken lines for inhibits, boxes for genes or proteins - In chemistry: structural formulas, energy diagrams with hills and valleys for reaction energy - In history: timelines with arrow widths representing magnitude, geographic maps showing political boundaries - In programming: box-and-arrow diagrams for data structures, flowcharts for algorithms

Some are general-purpose: - Hierarchies: trees with root at top, branches below - Cause and effect: arrows labeled with mechanisms - Contrast: side-by-side columns or split-circle diagrams - Emphasis: boxes, circles, stars, varying line thickness - Uncertainty: dashed lines, question marks, partial drawings

Building your personal visual vocabulary doesn't require artistic training. It requires consistency — using the same conventions repeatedly so that you develop fluency with them. A stick figure with a graduation cap means "expert." An alarm clock means "time-sensitive." A magnifying glass means "look more closely at this." Once established, these icons take a second to draw and carry immediate meaning.

The practical project: spend twenty minutes reviewing your most recent notes. Identify ten recurring concepts that appear frequently in your subject. Assign a visual icon to each. Write them in a reference panel at the back of your notebook or the top of a notes document. Then use them consistently.


Dual Coding Across Learning Contexts: A Day in Marcus's Life

Let me make the day-to-day application of dual coding completely concrete by following Marcus through a single study day in his second year of medical school.

8 AM — Morning pre-lecture preparation: Marcus has read the assigned chapter on the coagulation cascade. Before lecture, he takes a blank page and draws the cascade from memory: the intrinsic pathway (beginning with Factor XII), the extrinsic pathway (beginning with tissue factor and Factor VII), their convergence at Factor X, the common pathway leading through prothrombin to thrombin, the fibrinogen-to-fibrin conversion, and the crosslinking of the fibrin clot. He draws it imperfectly, misses a step, checks the book, corrects it, draws the correction.

10 AM — During lecture: The professor is discussing clotting factor deficiencies. Marcus sketch-notes: a simple drawing of the cascade with specific factors highlighted where deficiencies produce clinical syndromes. A small sketch of a person with an X over one pathway represents hemophilia A. He's adding clinical context to the structural understanding he built at 8 AM.

1 PM — Self-testing: Marcus covers his morning drawing and the sketch notes. He takes a blank page and attempts to draw the entire cascade again — this time with the clinical deficiencies he learned about in lecture. This is retrieval practice and dual coding simultaneously. The gaps he finds are where he needs to focus this afternoon.

3 PM — Pharmacology: New material: anticoagulants (warfarin, heparin, direct oral anticoagulants). Marcus draws a fresh version of the coagulation cascade and marks where each drug acts: warfarin inhibiting Vitamin K-dependent factor synthesis (II, VII, IX, X — he creates a visual mnemonic: a WAR-fare scene where warfarin is a soldier attacking the "1972" factors — I = 1, VII = 7, IX = 9, X = 10... he refines this). Heparin activating antithrombin, marked with a visual of antithrombin being activated like a switch being flipped.

7 PM — Evening review: Marcus opens Anki. He has flashcards for the cascade, but tonight he replaces his text-only cards with image-occlusion cards: photographs of his own drawings with portions hidden, requiring him to label what's missing. The image-occlusion format is dual coding built into his spaced repetition system.

Each of these activities took five to twenty minutes. Accumulated over a study day, they've produced multiple retrieval attempts, dual encoding across multiple sessions, and spaced exposure with variation. The total time spent on the cascade is perhaps ninety minutes across the day. What Marcus knows about the coagulation cascade three months from now will reflect those ninety minutes of engaged dual coding work more than many hours of re-reading ever would.


The Evidence: What We Know Confidently and Where Uncertainty Remains

A responsible treatment of dual coding requires honesty about the strength and limits of the evidence.

Strong evidence: The multimedia learning effect — words plus relevant pictures outperforming words alone — is one of the most robustly replicated findings in educational psychology. Method of loci is well-supported by decades of research on mnemonic techniques across multiple research groups. The keyword method for vocabulary learning has strong experimental support. [Evidence: Strong]

Moderate evidence: Sketch-noting as a lecture strategy is supported by a smaller but growing body of research. The effects are consistently positive, but the research base is not yet as extensive as for retrieval practice. Mental imagery training for text comprehension has positive evidence but more variability across studies. [Evidence: Moderate]

Ongoing scientific debate: The specific cognitive mechanism Paivio proposed — literally two separate systems, verbal and imagistic — is contested in its details by contemporary cognitive scientists. Some researchers argue for a single flexible representational system that can encode information in different formats, rather than two distinct systems. The practical implications of this debate are limited: whether the mechanism is "two systems" or "one flexible system operating in different modes," the result is the same. Combining verbal and visual representations consistently produces better learning outcomes. The why is debated; the what is not.

Where dual coding can backfire: Text-heavy slides paired with a narrator reading the same text out loud produce worse learning than either slides or narration alone — because both verbal channels (visual text and auditory narration) compete for the same processing resources. Dual coding helps when the visual and verbal information are complementary. It hurts when they're redundant and competing.


The Progressive Project: Dual Coding Your Learning Goal

Whatever your Progressive Project is — the specific learning goal you've committed to — there are concrete ways to apply dual coding right now.

If your project is content-heavy (medicine, law, history, science): After every major study session, put down the materials and draw from memory what you just learned. Diagram it, map it, sketch it. The act of drawing from memory is retrieval practice and dual coding simultaneously — one action serving two purposes. Identify the five most important concepts in your current domain and create your own visual representation of each one. Not copied from a source — generated by you, in your own visual language.

If your project is skill-based (programming, music, language, athletics): Create a visual model of what mastery looks like. What does expert performance look like structurally? What components are there? How do they connect? For language learning, commit to the keyword method for the next fifty vocabulary words you encounter. For athletic skills, build a mental rehearsal practice around the movement patterns you're developing.

If your project is abstract (philosophy, literary analysis, theoretical frameworks): Develop analogies systematically. For each major abstract concept, generate a concrete visual analogy before moving on. Create visual relationship maps: how does Concept A relate to Concept B? Draw arrows, show hierarchies, identify contrasts. The abstractness of your domain is not a reason to skip dual coding — it's the reason the analogy-construction step is so important.

For everyone: Consider building a memory palace for the core vocabulary of your domain — the terms, names, and categories that form the basic vocabulary of the field you're learning. It takes an evening to build and a few minutes to walk through for review. The return on that investment, in reliable retrieval over months and years, is substantial.


Putting It All Together

Dual coding is the insight that you aren't choosing between words and images — you're using both, because together they create two retrieval pathways where one alone gives you only one.

It's not about learning styles. It's not about what you prefer. It's about how memory actually works, and specifically about the fact that humans have both verbal and imagistic memory systems, both of which can encode, store, and retrieve information, and both of which benefit from being activated.

Paivio's original insight has been refined and extended across fifty years of research into a practical toolkit: sketch-noting for lectures, mental imagery for reading, the method of loci for sequences and lists, the keyword method for vocabulary, mental rehearsal for motor skills, visual maps for conceptual relationships. The techniques vary. The principle is consistent: give information a visual form whenever you can, combine it with verbal encoding, and you've doubled your access routes to what you're trying to know.

For Marcus, drawing every structure he studies isn't extra work. It's the work. The drawing is where dual coding, retrieval practice, and generative learning all converge — a few minutes after a reading session that tests what he remembers, creates a visual record, and reinforces the structure in his imagistic memory.

Before you move on, take five minutes to draw — from memory — the main ideas of this chapter. A rough sketch of the dual coding framework, the two systems, the main techniques. Make it crude. Make it yours. That drawing, imperfect as it is, will outlast anything you could have remembered from passive reading.

Two systems. Double the pathways. One principle that changes everything about how you study.