Further Reading: Text on Screen

Core Books

Thinking with Type: A Critical Guide for Designers, Students, and Writers

Ellen Lupton (2010, 2nd edition)

The essential typography reference — Lupton covers every principle of type design from letter spacing to hierarchy to grid systems. While written for graphic designers, her treatment of readability, contrast, and font selection applies directly to text overlays in video. Her concept of "typographic voice" — how font choice communicates personality and tone — is the theoretical basis for Section 22.2's font-tone associations.

Why read it: The comprehensive typography education that makes your text overlays look professional rather than amateur. Skim for principles; deep-read the chapters on hierarchy and readability.

Don't Make Me Think: A Common Sense Approach to Web Usability

Steve Krug (2014, 3rd edition)

Krug's principles of web usability — particularly "don't make me think" (reduce cognitive effort) and "satisficing" (users don't read, they scan) — translate directly to text on screen in video. His research shows that people don't read text carefully on screens; they scan for keywords and structure. This is why short phrases, high contrast, and clear hierarchy matter more than elegant prose in text overlays.

Why read it: Changes how you think about text consumption on screens — viewers scan, they don't read. Design accordingly.

Universal Principles of Design

William Lidwell, Kritina Holden, & Jill Butler (2010, revised edition)

A reference encyclopedia of 125 design principles, many directly applicable to text on screen: figure-ground contrast, Gestalt principles, the Von Restorff effect (distinctive items are remembered better — Ch. 6), visual hierarchy, and the Gutenberg diagram (the eye's natural scanning pattern). Each principle is explained in one two-page spread with visual examples.

Why read it: A design toolkit you can dip into for specific problems — "my text doesn't stand out" → look up contrast, figure-ground, or the isolation effect.


Academic Sources

"Dual Coding Theory and Education"

Paivio, A. (1971, 2006). Mental Representations: A Dual Coding Approach. Oxford University Press.

Paivio's original dual coding theory — the finding that information encoded through both verbal and visual channels is remembered better than information through either channel alone — is the theoretical foundation for why text overlays improve retention. His research showed that dual-coded information creates two independent memory traces, each capable of independently triggering recall.

Relevance: The scientific basis for Section 22.1's claim that text + audio together outperform either alone.

"The Modality Effect in Multimedia Learning"

Mayer, R. E. (2009). Multimedia Learning (2nd ed.). Cambridge University Press.

Mayer's research on multimedia learning produced 12 principles for effective instruction, several directly applicable to text on screen. Most relevant: the Redundancy Principle (don't duplicate information in identical channels — instead, text should complement audio) and the Spatial Contiguity Principle (text should appear near the visual it describes). Mayer's work explains why styled, well-placed text outperforms random text placement.

Relevance: Evidence-based principles for how text and visual elements should work together — not just that they should, but how.

"Reading and Writing in the Digital Age: The Effect of Captions on Video Comprehension"

Gernsbacher, M. A. (2015). Literacy in the Digital Age. John Benjamins.

Gernsbacher's research on caption use and comprehension demonstrates that captions improve understanding for ALL viewers — not just those with hearing difficulties. Her finding that 80% of caption users are not deaf or hard of hearing provides the evidence base for Section 22.3's argument that captions are a universal engagement tool.

Relevance: The empirical evidence that captions serve a far broader audience than the accessibility community alone.

"On-Screen Text in Television: Audience Perceptions and Effects"

Greer, C. F., & Gosen, J. D. (2002). Journal of Broadcasting & Electronic Media, 46(1), 1-17.

This study analyzed how viewers process and respond to on-screen text in video content, finding that text overlays increase information recall by 25-35% and perceived content quality by 15-20%. Greer and Gosen also found that text-heavy presentations were rated as more credible than text-free presentations — a finding relevant to educational and commentary creators.

Relevance: Evidence that text overlays don't just improve retention — they improve perceived quality and credibility.


Creator and Industry Resources

Kapwing — Subtitle and Caption Generator

An online tool for generating, editing, and styling captions for video content. Kapwing offers auto-generated captions with easy manual correction, multiple style options, and export for all major platforms. Useful for creators who want styled captions without professional editing software.

CapCut — Text and Typography Tools

The most widely used video editor for TikTok creators includes extensive text overlay, caption, and typography tools. CapCut's template system allows creators to browse and apply text styles, animations, and caption formats that are currently popular on TikTok — a practical way to see what text design trends are working.

Zach King — YouTube and TikTok

Zach King's content is a masterclass in text integration — he uses text as a storytelling element, with words appearing in the physical environment (written on whiteboards, appearing on surfaces, animated to interact with real-world objects). His approach demonstrates how text can be a creative tool rather than just an informational overlay.

@subjectivity — TikTok

An example of subtitle-style content done exceptionally well — no voiceover, text narration with strong personality, carefully styled typography that supports the visual aesthetic. Worth studying for text pacing, personality through written voice, and consistent text branding.


For Advanced Study

"The Effectiveness of Captioned Videos for L2 Listening Comprehension"

Montero Perez, M., Van Den Noortgate, W., & Desmet, P. (2013). System, 41(3), 720-739.

This meta-analysis of caption research in second-language contexts demonstrates that captions improve comprehension for non-native speakers by 25-40% — significantly more than the improvement for native speakers. This research supports the case study finding (Section: The Captioner Who Unlocked a New Audience) that captions expand international reach for English-language creators.

"Visual Rhetoric and the Information Age: The Role of Typography in Persuasion"

Brumberger, E. (2003). Technical Communication Quarterly, 12(1), 5-24.

Brumberger's research on how font choice affects persuasion demonstrates that typeface is not neutral — readers make judgments about content quality, credibility, and appropriateness based on font choice alone. Her finding that font-content congruence (the right font for the right context) increases persuasion by 15-25% provides evidence for Section 22.2's font-tone associations.

"Kinetic Typography: Past, Present, and Future"

Brownie, B. (2015). Design Issues, 31(3), 32-44.

Brownie's analysis of kinetic typography's history and evolution — from experimental film to digital media to social content — provides the scholarly context for Section 22.5's animated text discussion. Her treatment of how text movement creates meaning (motion as rhetoric) offers a theoretical framework for understanding why certain text animations feel right and others feel wrong.


Suggested Reading Order

Priority Source Time Investment
Start here Krug, Don't Make Me Think (skim, focus on scanning behavior) 1-2 hours
Next Lupton, Thinking with Type (Ch. 1-3 on letter, text, grid) 2-3 hours
Then CapCut text tool exploration (hands-on) 1-2 hours
Practice Kapwing caption generator (try on one video) 30 minutes
Deep dive Mayer (2009) multimedia learning principles 3-4 hours
Advanced Gernsbacher (2015) on caption research 1-2 hours
Advanced Brumberger (2003) on typography and persuasion 1-2 hours