Quiz: Text on Screen
Test your understanding of text overlays, typography, captioning, and text hooks. Try to answer each question before revealing the answer.
Question 1: What percentage of initial video views on social media platforms typically begin with sound off?
Show Answer
**50-85%** of initial video views begin with sound off. Viewers scroll in public, in bed, in meetings, or other environments where sound would be inappropriate. This means the majority of potential viewers evaluate content visually before deciding to unmute — making text overlays essential for reaching this audience. (Section 22.1)Question 2: What is dual coding theory, and how does it explain why text overlays boost retention?
Show Answer
**Dual coding theory** (Paivio, 1971) explains that the brain encodes information more effectively when it receives the same message through two channels simultaneously — visual text AND auditory speech. When the viewer sees and hears the same words: 1. **Two memory traces are created** — one verbal (from reading), one auditory (from hearing), more durable than one alone 2. **Comprehension improves** — text and audio serve as mutual backups 3. **Processing effort decreases** — reduced cognitive load frees resources for engagement Result: text overlays improve retention by 15-25%, with the largest gains for information-dense content. (Section 22.1)Question 3: Name the five rules of video typography.
Show Answer
1. **Readability above all** — large enough for phone screens, displayed long enough to read, short phrases (5-8 words max per line) 2. **Contrast is non-negotiable** — text must be visible against any background (use shadows, outlines, or background bars) 3. **Font choice communicates tone** — sans-serif for modern, bold for impact, serif for authority, handwritten for personal 4. **Placement respects the frame** — follow composition principles; top third for hooks, bottom third for captions; avoid platform UI zones 5. **Consistency builds brand** — same fonts, colors, placement, and animation style across all videos (Section 22.2)Question 4: What is the safest contrast method for text on video, and why?
Show Answer
**White text with black outline (or shadow)** — readable on virtually any background. The black outline creates contrast against bright backgrounds, while the white text creates contrast against dark backgrounds. This dual-contrast approach works regardless of what's behind the text, making it the safest default choice. Other methods (background bars, dedicated text zones) also guarantee readability but are more visually prominent. The shadow/outline method is the least visually intrusive while maintaining readability. (Section 22.2)Question 5: According to the data, what percentage of people who use captions are NOT deaf or hard of hearing?
Show Answer
**80%** of viewers who use captions are not deaf or hard of hearing. They use captions because they: - Watch in sound-off environments - Process information better with visual text support - Find captions helpful with accents, fast speech, or background noise - Prefer reading along with audio This makes captions a **general-audience engagement feature**, not just an accessibility tool. (Section 22.3)Question 6: Marcus added captions to his science videos and saw sound-off completion jump dramatically. What were the before and after numbers?
Show Answer
- **Before captions:** Sound-off completion was **22%** - **After captions:** Sound-off completion was **51%** — a **+132% improvement** Overall completion also improved from 52% to 64% (+23%), and save rate increased from 5.4% to 7.1% (+31%). Marcus's insight: "I was gatekeeping my own content behind a sound-on requirement." (Section 22.3)Question 7: What is the "subtitle style," and name two content types where it works particularly well.
Show Answer
The **subtitle style** is a content format where on-screen text IS the primary verbal content — there is no voiceover. The creator films visual content and adds text overlays that function as narration, commentary, or information. Works particularly well for (any two): - **Cooking/process content** — text provides instructions while preserving process sounds (sizzling, chopping) - **ASMR/aesthetic content** — text doesn't disrupt the sensory experience - **Get ready with me** — text adds personality commentary to visual process - **Day in my life** — text narrates without formal voiceover feel (Section 22.4)Question 8: Name three reasons why the subtitle style works as a content format.
Show Answer
Any three of: 1. **Sound-off optimization** — content works identically with sound on or off 2. **Process sound preservation** — natural ASMR-adjacent sounds remain audible without voiceover competing 3. **Personality through writing** — text voice can be stylized in ways spoken voice can't always achieve 4. **Pacing control** — creator decides exactly when each piece of information is read 5. **Reduced production barrier** — accessible for creators uncomfortable speaking on camera (Section 22.4)Question 9: What are the three purposes animated text serves?
Show Answer
1. **Entry attention** — motion (pop, fade, slide, bounce) triggers the orienting response, guaranteeing the text is noticed 2. **Emphasis** — text that shakes, scales, or flashes draws extra attention to specific words, functioning as visual vocal emphasis 3. **Pacing** — animation controls reading speed; words appearing one at a time force sequential reading at the creator's pace; all at once allows viewer-controlled reading (Section 22.5)Question 10: Zara developed a three-tier text hierarchy. Name each tier and its animation style.
Show Answer
| Tier | Purpose | Animation | Example | |------|---------|-----------|---------| | **Tier 1: Headline** | Key punchlines, hooks | Scale pop with bounce | "I can't believe this worked" | | **Tier 2: Body** | Narration, context | Gentle fade-in | "so I tried something different" | | **Tier 3: Label** | Names, timestamps, locations | Static (no animation) | "Day 3" | The hierarchy ensures viewers' eyes are drawn to the most important text first (Tier 1's animation captures attention), then notice supporting text (Tier 2), while labels stay in the background (Tier 3). "If everything is emphasized, nothing is emphasized." (Section 22.5)Question 11: Name all five text hook formats from Section 22.6.
Show Answer
1. **Question hook** — A provocative question displayed as opening text (activates curiosity gap) 2. **Statement hook** — A bold or counterintuitive claim (triggers schema violation) 3. **Preview hook** — Text that tells the viewer what they'll see (delivers value proposition) 4. **Caption-as-dialogue hook** — Simulated conversation or internal monologue (creates relatability) 5. **List hook** — A numbered promise, e.g., "3 things..." (creates concrete, bounded expectation) All five work with sound off, delivering verbal hook content through visual text. (Section 22.6)Question 12: Luna added text hooks to her art content and improved 2-second retention from 45% to 68%. What was the specific problem her text hooks solved?
Show Answer
Luna's hooks were **audio-dependent** — the sound of a brush and a whispered introduction. With 70% of potential viewers scrolling with sound off, they saw only a canvas and art supplies in the opening frame. This was visually interesting but lacked a **reason to stay**. By adding text hooks (like "painting my depression" or "this took 47 hours"), Luna gave sound-off viewers a compelling reason to keep watching. The emotional or impressive text worked visually while the audio hook (brush sounds, whispered voice) still worked for sound-on viewers. Her insight: "I was whispering my hooks to an audience that had their sound off. The text let them hear me with their eyes." (Section 22.6)Question 13: What is the recommended formula for how long text should stay on screen?
Show Answer
**Display time (in seconds) = word count ÷ 3** A 6-word line needs approximately 2 seconds minimum on screen. This ensures the viewer has time to read the text without rushing. Combined with the recommendation to keep text to 5-8 words per line maximum, this means most text overlays should appear for 2-3 seconds. (Section 22.2)Question 14: What is kinetic typography, and how does it differ from standard text animation?