Chapter 22: Text on Screen — Words That Grab in a Visual World

DataField.Dev

21 min read

> "Half your audience is watching with sound off. If your words aren't on screen, they're not hearing your content — they're guessing at it."

Prerequisites

chapter-02
chapter-16
chapter-19
chapter-20

Learning Objectives

Understand why text overlays improve retention and engagement across platforms
Apply typography fundamentals to create readable, aesthetically effective text
Implement captioning as both an accessibility tool and an engagement strategy
Evaluate when the 'subtitle style' is more effective than traditional voiceover
Use animated text to guide the viewer's eye and create emphasis
Design text-forward hooks that stop the scroll even with sound off

In This Chapter

Chapter Overview
22.1 Why Text Overlays Boost Retention (The Data)
22.2 Typography Basics: Font, Size, Contrast, and Readability
22.3 Captioning and Accessibility: Reaching Everyone
22.4 The "Subtitle Style": When Text Replaces Voiceover
22.5 Animated Text: Motion That Guides the Eye
22.6 Text as Hook: The Opening Caption Strategy
22.7 Chapter Summary
What's Next
Chapter 22 Exercises → exercises.md
Chapter 22 Quiz → quiz.md
Case Study: The Captioner Who Unlocked a New Audience → case-study-01.md
Case Study: Text vs. Voice — An A/B Testing Journey → case-study-02.md

Exercises Quiz Case Study 01 Case Study 02 Key Takeaways Further Reading

Chapter 22: Text on Screen — Words That Grab in a Visual World

"Half your audience is watching with sound off. If your words aren't on screen, they're not hearing your content — they're guessing at it."

Chapter Overview

Chapter 19 taught you what the eye sees in the frame. Chapter 20 taught you the rhythm of those frames. Chapter 21 taught you what the ear hears between frames. This chapter teaches you the layer that bridges all three: text on screen — the words that appear over, around, and sometimes instead of your visual and audio content.

Text on screen has become one of the most important production elements in modern creator content. Research consistently shows that adding text overlays to video increases retention by 15-25%, improves accessibility for millions of viewers, and can function as the primary hook strategy in an environment where the majority of initial video views happen with sound off.

In this chapter, you will learn to: - Understand why text overlays improve retention (the dual coding evidence) - Apply typography principles that make text readable, aesthetic, and effective - Implement captioning for accessibility and engagement - Evaluate when the "subtitle style" replaces voiceover effectively - Use animated text to guide attention and create emphasis - Design text-forward hooks that work with sound off

22.1 Why Text Overlays Boost Retention (The Data)

The Silent Scroll Problem

Here's a fact that reshapes how you think about video content: on most social media platforms, 50-85% of initial video views begin with sound off. The viewer is scrolling in public, in bed, in a meeting — anywhere that sound would be inappropriate or unwanted. They evaluate your content visually before deciding to unmute.

This means your visual content must carry the message independently — at least for the first few seconds. If your content relies entirely on audio (voiceover, dialogue, music), you're invisible to the majority of potential viewers.

Text on screen solves this problem by making the verbal content visible.

The Dual Coding Advantage

Dual coding theory (Paivio, 1971; first introduced in Ch. 2) explains why text overlays boost retention: the brain encodes information more effectively when it receives the same message through two channels (visual text + auditory speech) than through one channel alone.

When the viewer sees AND hears the same words: 1. Two memory traces are created — one verbal (from reading), one auditory (from hearing). Two traces are more durable than one. 2. Comprehension improves — if the viewer misses a word in the audio, the text provides a backup. If the text scrolls past too fast, the audio fills the gap. 3. Processing effort decreases — the brain doesn't have to work as hard to understand the content, reducing cognitive load (Ch. 2) and freeing resources for engagement.

The Retention Pattern

Creators who run their own before-and-after tests, and platforms that have studied text overlays, consistently report a retention lift — but the size of the lift depends heavily on content type. The table below is illustrative (the specific numbers model a realistic pattern, not a single published study), and it shows the shape worth remembering: the denser the information, the bigger the payoff from on-screen text.

Content Type	Without Text	With Text	Direction of Lift
Educational/tutorial	lower completion	higher completion	Large
Commentary/opinion	lower completion	higher completion	Large
Storytelling/narrative	lower completion	higher completion	Moderate
Comedy/entertainment	lower completion	higher completion	Small
Process/how-to	lower completion	higher completion	Large

The lift is largest for content types with high information density (educational, process, commentary) — because these are the types where comprehension matters most. For entertainment-focused content, the lift is smaller but still positive. The takeaway is the ranking, not a guaranteed percentage: test text overlays on your own content and measure your own completion rate before and after.

Why Text Works Beyond Retention

Text overlays do more than help viewers understand — they serve multiple psychological functions:

1. Attention anchoring. Text creates a fixed visual element the eye can lock onto. In a moving video, the stability of text gives the eye a resting point — reducing the visual fatigue that can cause scrolling.

2. Information chunking. Text naturally breaks information into readable chunks. A complex idea spoken over 15 seconds might be hard to follow — but the same idea displayed as three text lines is instantly parseable.

3. Emphasis signaling. Text tells the viewer what's important. When you put specific words on screen, you're saying: "This is the part that matters." The text functions as a visual highlighter for audio content.

4. Silent communication. For sound-off viewers, text IS the content. Without text overlays, these viewers see moving images without context — and make scrolling decisions based on visuals alone.

22.2 Typography Basics: Font, Size, Contrast, and Readability

Why Typography Matters for Video

Typography is the art and science of arranging text for readability and visual impact. In print, typography has centuries of refined practice. In video, typography is still evolving — but the fundamental principles remain: text must be readable first, aesthetic second, and meaningful always.

Bad typography kills the benefit of text overlays. If the viewer can't read the text quickly and effortlessly, the text creates cognitive load instead of reducing it — adding a problem rather than solving one.

The Five Rules of Video Typography

Rule 1: Readability Above All

The viewer has 1-3 seconds to read most text overlays. If reading requires effort — squinting, re-reading, deciphering — the text has failed.

Key readability factors: - Size: Text must be large enough to read on a phone screen (where most viewing happens). Minimum: 5% of frame height for body text, 8% for headlines. - Duration: Text must stay on screen long enough to read. Rule of thumb: display time (in seconds) = word count ÷ 3. A 6-word line needs 2 seconds minimum. - Simplicity: Short phrases outperform long sentences. Use 5-8 words per line maximum.

Rule 2: Contrast Is Non-Negotiable

Text must be visible against its background. The contrast ratio between text and background must be high enough to read effortlessly in any viewing condition (bright sun, dim room, small screen).

Method	Technique	When to Use
Text shadow	Dark outline or drop shadow behind light text	Works on most backgrounds
Background bar	Semi-transparent colored bar behind text	High-contrast guarantee; common on TikTok
Dedicated text zone	Solid colored area reserved for text	When text is structural (subtitles, captions)
High-contrast font color	White text on dark video, black text on light video	When background is consistent

The safest approach: white text with black outline (or shadow) — readable on virtually any background.

Rule 3: Font Choice Communicates Tone

Font Type	Association	Use For
Sans-serif (Helvetica, Arial, Montserrat)	Modern, clean, neutral	Most creator content; default choice
Bold sans-serif (Impact, Bebas Neue)	Urgent, powerful, attention-grabbing	Headlines, hooks, emphasis
Serif (Times, Georgia, Playfair)	Traditional, authoritative, literary	Educational, historical, formal
Handwritten/script	Personal, casual, creative	Aesthetic, artistic, personal stories
Monospace (Courier, Source Code Pro)	Technical, retro, precise	Tech content, coding, data

The two-font rule: Use maximum two fonts per video — one for headlines (bold, impactful) and one for body text (clean, readable). More than two fonts creates visual chaos.

Rule 4: Placement Respects the Frame

Text placement follows composition principles (Ch. 19):

+---------------------------+
| ← SAFE ZONE for text →   |
| ↑                     ↑   |
|                           |
|   [Subject in center]     |
|                           |
| ↓                     ↓   |
| ← Platform UI zone →     |
+---------------------------+

Top third: Hook text, topic labels, headlines
Center: Emphasis text, key phrases (use sparingly — competes with subject)
Bottom third: Subtitles, captions, descriptions, CTAs
Platform safe zones: Avoid placing text where platform UI elements (username, buttons, description) will cover it. Each platform has different safe zones.

Rule 5: Consistency Builds Brand

Use the same font family, colors, and placement style across all your videos. This creates visual brand recognition — viewers recognize your content partly by how your text looks.

Element	Choose Once, Use Always
Primary font	One headline font, one body font
Text color	One main color + one accent
Placement	Consistent zone for each text type
Animation style	One entry animation, one exit
Shadow/background style	Consistent contrast method

22.3 Captioning and Accessibility: Reaching Everyone

The Accessibility Imperative

Captions (also called subtitles) are text representations of spoken audio — displaying the words being said on screen, synchronized with speech. Captions serve two overlapping audiences:

1. Accessibility audience: More than 1.5 billion people worldwide live with some degree of hearing loss, and roughly 430 million have hearing loss significant enough to require rehabilitation (WHO, World Report on Hearing, 2021). Captions make video content accessible to deaf and hard-of-hearing viewers who cannot access audio-dependent content. This isn't optional kindness — it's inclusion.

2. Preference audience: Many hearing viewers prefer captions. They watch in sound-off environments, process information better with visual text support, or find captions helpful when accents, fast speech, or background noise make audio difficult to follow.

Together, these audiences are substantial — studies suggest that 80% of viewers who use captions are not deaf or hard of hearing. Captions have become a general-audience feature, not a niche accessibility tool.

Captions and Engagement

Beyond accessibility, captions tend to improve engagement on several fronts. The table below is illustrative — it names the metrics captions move and the direction they move them, rather than promising specific percentages:

Metric	Effect of Adding Captions
View time (sound-off viewers)	Substantially higher — the biggest single gain
Overall completion rate	Higher
Keyword discoverability	Higher (caption text can be indexed)
International reach	Wider (readers, not just language speakers)

The view-time effect for sound-off viewers is the one to internalize: without captions, a sound-off viewer sees moving images without context and typically scrolls away within seconds. With captions, the same viewer can follow the content and makes a more informed decision about whether to continue watching. As always, the reliable number is the one you measure on your own videos.

Caption Quality Standards

Not all captions are equal. Quality captions:

Element	Good	Bad
Accuracy	Word-for-word or close paraphrase	Garbled auto-generated text with errors
Timing	Synchronized with speech (±0.5 sec)	Delayed or early, misaligned with speech
Readability	1-2 lines, reasonable word count	Entire paragraphs, too much text at once
Formatting	Consistent placement, readable font	Jumping around the screen, tiny font
Completeness	All speech captioned	Missing segments, skipping content

Auto-generated vs. manual captions: Most platforms offer auto-generated captions (using speech-to-text AI). These have improved dramatically but still produce errors — especially with accents, technical terms, slang, and fast speech. Best practice: use auto-generation as a starting point, then review and correct.

Character: Marcus's Caption Strategy

Marcus's science videos were information-dense — packed with technical terms, statistics, and concepts. He'd avoided captions because they "cluttered" the frame. Then he analyzed his retention data by viewer segment:

Viewers who watched with sound on: 58% completion
Viewers who started with sound off: 22% completion

"I was losing more than half of my sound-off viewers instantly because my content was 100% voiceover with visual diagrams. Without the voiceover, the diagrams were meaningless."

Marcus added styled captions — white text with a subtle dark background bar, placed in the bottom third, synchronized to his narration. The results:

Metric	Without Captions	With Captions	Change
Overall completion	52%	64%	+23%
Sound-off completion	22%	51%	+132%
Save rate	5.4%	7.1%	+31%

"The captions didn't just help deaf viewers. They helped EVERY viewer — the ones in class, on the bus, in bed next to someone sleeping. I was gatekeeping my own content behind a sound-on requirement."

22.4 The "Subtitle Style": When Text Replaces Voiceover

What Is the Subtitle Style?

The subtitle style is a content format where on-screen text IS the primary verbal content — there is no voiceover. The creator films visual content (process, aesthetic, lifestyle) and adds text overlays to provide narration, commentary, or information that would traditionally be delivered through voice.

Traditional approach:
[Visual: Creator cooking]
[Audio: "First, chop the onions into small pieces"]

Subtitle style:
[Visual: Creator cooking]
[Text overlay: "chop these super small or it won't work 😤"]
[Audio: Just the sound of chopping — ASMR-adjacent]

Why the Subtitle Style Works

1. Sound-off optimization. The content works identically with sound on or off — there's no audio dependency.

2. Process sound preservation. Without voiceover, the natural sounds of the process (chopping, sizzling, painting, clicking) are audible — creating ASMR-adjacent engagement (Ch. 21).

3. Personality through writing. The text voice can be highly stylized — casual, funny, emotional — in ways that written text enables but spoken voice doesn't always. A text overlay saying "no because WHY did this happen" has a different energy than the same words spoken aloud.

4. Pacing control. Text appears and disappears at controlled rates. The creator decides exactly when the viewer reads each piece of information — more precise control than voiceover pacing.

5. Reduced production barrier. Creators who are uncomfortable speaking on camera or whose voice doesn't match their desired brand can create personality-driven content entirely through text.

When Subtitle Style Works Best

Content Type	Subtitle Style Fit	Why
Cooking/process	Excellent	Preserves process sounds; instructions via text
ASMR/aesthetic	Excellent	Text doesn't disrupt sensory experience
Get ready with me	Good	Text adds commentary to visual process
Day in my life	Good	Text narrates without formal voiceover feel
Commentary/analysis	Poor	Complex ideas need vocal nuance
Education/tutorial	Mixed	Simple tutorials work; complex explanations don't
Comedy (timing-dependent)	Mixed	Some comedy works in text; timing is harder

When Subtitle Style Doesn't Work

Complex explanations. When the content requires sustained, nuanced argument — the kind where vocal emphasis, pauses, and tonal shifts carry meaning — text overlays can't replicate the information density of skilled voiceover.

Emotional vulnerability. When the creator shares something personal and the emotional authenticity comes through voice quality (trembling, pausing, speaking softly), text can't carry the same emotional weight.

Rapid dialogue or reaction. When multiple people are speaking or when the content is response-based (reactions, commentary on clips), text overlays become too dense to read.

Breakdown: A Subtitle-Style Video, Beat by Beat

The principles get concrete fast when you watch the text do the narrating. Here is Luna's 40-second "painting a tiny galaxy" video — her highest-saved post of the month — with no voiceover at all, only ambient brush sounds and text:

Time	Visual	Text overlay	What the text is doing
0:00	Blank black canvas, brush dipping into white paint	"i'm going to ruin this or make it perfect, no in between"	Hook + stakes. A sound-off scroller gets a reason to stay (will it work?) without hearing a word.
0:06	First sponge dabs of color	"the trick nobody tells you: start ugly"	Curiosity payoff promise. Positions the messy middle as intentional, so the viewer doesn't bail when it looks rough.
0:15	Canvas looks muddy, almost bad	"(this is the part where i panic every time)"	Relatability + tension. The parenthetical voice carries personality the brush sounds can't.
0:24	White paint flicked for stars	"splatter = stars. yes that's the whole technique"	Practical value. A genuinely useful, repeatable tip — the save trigger (Ch. 9).
0:34	Finished galaxy revealed	"from 'what am i doing' to THIS in 40 sec"	Payoff + before/after framing. Names the transformation so the ending lands.

Three things make this work. First, every line could stand alone — a viewer who reads only one overlay still gets value. Second, the text carries the personality ("i panic every time") that Luna's quiet, whispered voiceover used to bury; the writing is funnier and more vulnerable than her spoken hooks. Third, the brush sounds are now audible, turning a silent disadvantage into ASMR-adjacent texture (Ch. 21). The same content with a voiceover would have buried the process sounds and added an audio dependency for the 70% of her audience who scroll with sound off.

The lesson: subtitle style isn't "captions instead of talking." It's a deliberate format where the writing becomes the performance — which means the text deserves as much drafting attention as a script would.

22.5 Animated Text: Motion That Guides the Eye

Why Text Moves

Static text on video creates a visual disconnect — the video moves but the text sits still. Animated text (text that enters, exits, or transforms with motion) bridges this disconnect by making text a dynamic element of the visual composition.

Animation serves three purposes:

1. Entry attention. When text animates onto the screen (pop, fade, slide, bounce), the motion triggers the orienting response (Ch. 1). The viewer's eye is drawn to the appearing text — guaranteeing it's noticed.

2. Emphasis. Text that shakes, scales, or flashes draws extra attention to specific words. This is the visual equivalent of vocal emphasis (Ch. 21) — using motion to say "this word matters."

3. Pacing. Animated text can control reading pace. Words appearing one at a time force sequential reading at the creator's chosen speed. All words appearing simultaneously allow the viewer to read at their own pace.

Animation Types and Their Effects

Animation	Motion	Effect	Best For
Pop/scale	Text appears instantly at full size	Impact, emphasis, comedy	Punchlines, key facts
Fade in	Text appears gradually	Gentle, elegant, sophisticated	Subtitles, gentle overlays
Slide in	Text enters from edge of frame	Directional energy, flow	Sequential information, lists
Bounce	Text appears with elastic overshoot	Playful, energetic, young	Comedy, lifestyle, fun
Typewriter	Letters appear one by one	Suspense, reading pace control	Reveals, dramatic statements
Shake/vibrate	Text trembles in place	Emotion, intensity, emphasis	Emphasis on specific words
Scale pulse	Text briefly grows then returns to size	"This is important" signal	Key words, numbers, names

Kinetic Typography

Kinetic typography is text animation where the motion of the words IS the content — the typography moves, transforms, and flows in ways that create meaning beyond the words themselves. Used heavily in lyric videos, educational explainers, and high-production creator content.

In its simplest form, kinetic typography means the way words move reflects their meaning: - The word "FAST" slides across the screen quickly - The word "slow" drifts across gradually - The word "BIG" appears in large scale - The word "tiny" appears small

More advanced kinetic typography creates visual stories through text movement — words entering from different directions, stacking, splitting apart, and transforming to illustrate concepts.

Character: Zara's Text Animation Evolution

Zara had been using the same text animation for every overlay: white Impact font, black outline, pop-in from center. It worked — but every text element looked identical, regardless of whether it was a punchline, a setup line, or a label.

"All my text looked the same, which meant none of it stood out," Zara said. "If everything is emphasized, nothing is emphasized."

Zara developed a three-tier text hierarchy:

Tier	Animation	Use	Example
Tier 1: Headline	Scale pop with bounce	Key punchlines, hooks	"I can't believe this worked"
Tier 2: Body	Gentle fade-in	Narration, context	"so I tried something different"
Tier 3: Label	Static (no animation)	Names, timestamps, locations	"Day 3"

The hierarchy meant that viewers' eyes were drawn to Tier 1 text first (animation = attention), then noticed Tier 2 (subtle but present), while Tier 3 stayed in the background. The punchlines popped. The narration flowed. The labels informed without competing.

22.6 Text as Hook: The Opening Caption Strategy

The Sound-Off Hook Problem

Chapter 16's hook toolbox focused on three categories: verbal, visual, and audio hooks. But there's a critical gap: for the 50-85% of viewers scrolling with sound off, verbal and audio hooks are invisible. They only experience the visual hook.

Text hooks bridge this gap — they deliver verbal hook content (curiosity, challenge, emotion, value) through visual text that works regardless of sound.

Five Text Hook Formats

1. The Question Hook A provocative question displayed as the opening text overlay.

+---------------------------+
|                           |
| "Why does your brain      |
|  want you to fail?"       |
|                           |
|   [Visual: person         |
|    looking thoughtful]    |
|                           |
+---------------------------+

Why it works: Questions activate the curiosity gap (Ch. 5) through visual text. The viewer doesn't need to hear the question — they read it, and the information gap drives continued watching.

2. The Statement Hook A bold, counterintuitive, or surprising statement.

+---------------------------+
| "Everything you know      |
|  about productivity       |
|  is wrong."               |
|                           |
|   [Visual: creator at     |
|    desk, casual setup]    |
|                           |
+---------------------------+

Why it works: Schema violation (Ch. 6) through text. The statement challenges what the viewer believes, creating a need to see whether the claim is justified.

3. The Preview Hook Text that tells the viewer what they're about to see.

+---------------------------+
| "I tried every viral      |
|  recipe so you don't      |
|  have to."                |
|                           |
|   [Visual: table full     |
|    of food experiments]   |
|                           |
+---------------------------+

Why it works: Value proposition (Ch. 16) delivered visually. The viewer knows exactly what they'll get — reducing the risk of investing time.

4. The Caption-as-Dialogue Hook Simulated conversation or internal monologue in text.

+---------------------------+
| her: "you should try      |
|       working out"        |
|                           |
| me:                       |
|   [Visual: lying on       |
|    couch, not moving]     |
|                           |
+---------------------------+

Why it works: Relatability (Ch. 14) through text-based setup/punchline. The text creates a micro-narrative that the visual completes — engaging both reading and visual processing simultaneously.

5. The List Hook Numbers or bullet points that promise structured content.

+---------------------------+
| "3 things I wish I        |
|  knew at 15"              |
|                           |
|   [Visual: creator        |
|    speaking directly      |
|    to camera]             |
|                           |
+---------------------------+

Why it works: The number creates a concrete promise (Ch. 16, value hook category). The viewer knows the content is structured, bounded, and efficient — reducing perceived time commitment.

The Sound-On Bonus

The best text hooks work on two levels: - Sound off: The text delivers the full hook. The viewer reads the question/statement and decides to keep watching. - Sound on: The text PLUS the vocal delivery create a dual-coded hook. The viewer reads AND hears, creating a stronger initial engagement.

This is why text hooks should complement (not duplicate) audio hooks. Ideal approach: the text conveys the core message, while the audio adds tone, emphasis, and personality.

Character: Luna's Text-First Discovery

Luna's art and ASMR content was beautiful — but her analytics showed that 70% of her potential audience scrolled past within 2 seconds because her hooks were audio-dependent (the sound of a brush, a whispered introduction). Without sound, her opening frames showed a canvas and supplies — visually interesting but lacking a reason to stay.

Luna started adding text hooks to her opening frames:

Before: [Visual: blank canvas, brush ready, no text, whispered "today we're painting..."] After: [Visual: same shot + text overlay: "painting my depression" or "this took 47 hours"]

The text gave sound-off viewers a reason to stay. The emotional or impressive text hook worked visually while the audio hook (brush sounds, whispered voice) worked for sound-on viewers.

Result: 2-second retention improved from 45% to 68%. Luna was reaching viewers who had always scrolled past because they never heard her audio hook.

"I was whispering my hooks to an audience that had their sound off. The text let them hear me with their eyes."

22.7 Chapter Summary

The Core Principles

Text bridges the sound-off gap. With 50-85% of initial views happening sound-off, text overlays make your content accessible to the majority of potential viewers.
Dual coding boosts retention. When viewers see AND hear the same information, comprehension improves and retention increases 15-25% — the largest gains for information-dense content.
Typography is readability first. Large enough to read on phones, high contrast against background, maximum two fonts, consistent placement. If viewers can't read it instantly, text hurts rather than helps.
Captions are for everyone. 80% of caption users are not deaf or hard of hearing. Captions serve sound-off viewers, noisy environments, second-language speakers, and comprehension preference — making them a universal engagement tool.
The subtitle style is a format choice. For process, aesthetic, and lifestyle content, text-as-narration works well. For complex analysis and emotional vulnerability, voiceover carries nuance that text can't replicate.
Animation draws the eye. Animated text triggers the orienting response, creates emphasis, and controls reading pace. But use a hierarchy — if everything animates equally, nothing stands out.
Text hooks work with sound off. Question hooks, statement hooks, preview hooks, dialogue hooks, and list hooks all deliver verbal hook content through visual text — capturing the sound-off audience that audio hooks miss.

The Character Updates

Marcus added styled captions to his science videos and saw sound-off completion jump from 22% to 51% — "I was gatekeeping my own content behind a sound-on requirement."
Zara developed a three-tier text hierarchy (headline pop, body fade, static label) so her punchlines stood out instead of competing with narration text.
Luna added text hooks to her art content and improved 2-second retention from 45% to 68% — "I was whispering my hooks to an audience that had their sound off."

What's Next

Chapter 23: Color, Light, and Mood explores how the visual warmth, brightness, and color palette of your content shape emotion before a single word is spoken or read. From basic color theory for creators to natural vs. artificial lighting, color grading and LUTs, high key vs. low key as storytelling tools, and practical lighting setups you can build for under $50 — Chapter 23 teaches you to paint emotion with your camera.

Prerequisites

Learning Objectives

In This Chapter

Chapter 22: Text on Screen — Words That Grab in a Visual World

Chapter Overview

22.1 Why Text Overlays Boost Retention (The Data)

The Silent Scroll Problem

The Dual Coding Advantage

The Retention Pattern

Why Text Works Beyond Retention

22.2 Typography Basics: Font, Size, Contrast, and Readability

Why Typography Matters for Video

The Five Rules of Video Typography

22.3 Captioning and Accessibility: Reaching Everyone

The Accessibility Imperative

Captions and Engagement

Caption Quality Standards

Character: Marcus's Caption Strategy

22.4 The "Subtitle Style": When Text Replaces Voiceover

What Is the Subtitle Style?

Why the Subtitle Style Works

When Subtitle Style Works Best

When Subtitle Style Doesn't Work

Breakdown: A Subtitle-Style Video, Beat by Beat

22.5 Animated Text: Motion That Guides the Eye

Why Text Moves

Animation Types and Their Effects

Kinetic Typography

Character: Zara's Text Animation Evolution

22.6 Text as Hook: The Opening Caption Strategy

The Sound-Off Hook Problem

Five Text Hook Formats

The Sound-On Bonus

Character: Luna's Text-First Discovery

22.7 Chapter Summary

The Core Principles

The Character Updates

What's Next

Chapter 22 Exercises → exercises.md

Chapter 22 Quiz → quiz.md

Case Study: The Captioner Who Unlocked a New Audience → case-study-01.md

Case Study: Text vs. Voice — An A/B Testing Journey → case-study-02.md

Chapter 22 Exercises → `exercises.md`

Chapter 22 Quiz → `quiz.md`

Case Study: The Captioner Who Unlocked a New Audience → `case-study-01.md`

Case Study: Text vs. Voice — An A/B Testing Journey → `case-study-02.md`