Further Reading: Sound Design and Music
Core Books
Designing Sound: Audiovisual Aesthetics in 1970s American Cinema
Jay Beck (2016)
While focused on cinema history, Beck's analysis of how sound design emerged as a deliberate creative art provides the theoretical foundation for thinking about sound as a design discipline rather than an afterthought. His treatment of how sound creates immersion, emotion, and narrative meaning translates directly to creator content — where sound design is often the least intentional production element.
Why read it: Changes how you think about sound — from technical requirement to creative opportunity.
This Is Your Brain on Music: The Science of a Human Obsession
Daniel Levitin (2006)
Levitin, a neuroscientist and former record producer, explains how the brain processes music — from rhythm and melody to the emotional effects of tempo, key, and timbre. His treatment of why certain musical elements create predictable emotional responses provides the scientific foundation for the Music-Content Alignment Matrix in Section 21.3. Accessible and engaging, written for general audiences.
Why read it: The scientific basis for why music choices matter — and how to make them intentionally.
The Sound Effects Bible: How to Create and Record Hollywood-Style Sound Effects
Ric Viers (2008)
The most practical guide to creating and recording sound effects, including foley. Viers covers techniques for recording clean audio, enhancing sounds, and designing audio that serves a narrative purpose. While aimed at film and TV, the techniques apply directly to creator content — especially for process content, ASMR, and any format where real-world sounds enhance the experience.
Why read it: Immediately practical. If you want to improve your foley and sound effect work, this is the reference manual.
Academic Sources
"Music and Emotion: Theory and Research"
Juslin, P. N., & Sloboda, J. A. (Eds.) (2001). Oxford University Press.
The comprehensive academic reference on how music creates emotion. Juslin and Sloboda's collection covers tempo-arousal links, mode-valence associations, the role of timbre, and individual differences in emotional response to music. The research on "emotional contagion" through music — where the listener "catches" the emotion the music expresses — provides the theoretical basis for music's power in creator content.
Relevance: The empirical evidence that music doesn't just accompany emotion — it creates it.
"Why Do We Like to Listen Repeatedly to the Same Music?"
Margulis, E. H. (2014). On Repeat: How Music Plays the Mind. Oxford University Press.
Margulis's research on musical repetition explains why familiar sounds (trending audio, recurring intro music, signature sound effects) create stronger engagement than novel sounds. Her concept of "involuntary musical imagery" (earworms) connects to the mere exposure effect (Ch. 6) and explains why audio branding works — repeated sounds become mentally "owned" by the listener.
Relevance: The psychology behind audio branding and why familiar sounds create engagement.
"Faster Auditory Processing of the Emotional Content of Speech"
Paulmann, S., & Kotz, S. A. (2008). Brain Research, 1230, 151-163.
This study provides the empirical evidence for the claim in Section 21.1 that sound reaches the emotional processing system faster than visual information. Paulmann and Kotz demonstrate that the brain processes the emotional prosody (tone) of speech before processing the semantic content (meaning) — the voice tells you how to feel before you understand what's being said.
Relevance: Scientific evidence that vocal tone shapes emotional response before conscious comprehension.
"The Influence of Music on Advertising Effectiveness"
Kellaris, J. J., & Cox, A. D. (1989). Journal of Consumer Research, 16(4), 594-598.
While focused on advertising, Kellaris and Cox's research on how music tempo and modality (major vs. minor key) affect persuasion and attitude formation applies directly to creator content. Their finding that music-message congruence (matching music mood to message mood) increases persuasion by 22-35% provides the evidence base for the music-content alignment principle.
Relevance: Empirical evidence that music-content alignment increases effectiveness — not just aesthetically, but measurably.
Creator and Industry Resources
Epidemic Sound — Creator Music Platform
The most widely-used royalty-free music subscription for creators ($10-15/month). Epidemic Sound's library includes tens of thousands of tracks organized by mood, genre, tempo, and energy — making it easy to apply the Music-Content Alignment Matrix. Their blog includes guides on music for specific content types.
YouTube Audio Library (studio.youtube.com/channel/audio)
YouTube's free music and sound effects library, available to all YouTube creators. Tracks are organized by mood, genre, duration, and instrument. All tracks are cleared for YouTube use, and many are cleared for broader use. The sound effects collection includes foley and transition sounds.
Artlist — Music and Sound Effects
A subscription service ($10-17/month) offering royalty-free music and sound effects with a strong emphasis on cinematic and high-production-value tracks. Better than some alternatives for creators who need more polished or dramatic music options.
Freesound.org — Community Sound Library
A collaborative collection of sound effects, foley, and environmental sounds released under Creative Commons licenses. Excellent resource for specific sound effects (the "clap" of a book closing, the "whoosh" of a swinging arm) that platforms' built-in libraries may not include.
Every Frame a Painting — "The Marvel Symphonic Universe" (archived)
Tony Zhou's video essay analyzing why Marvel movie music is forgettable — despite billion-dollar budgets — demonstrates the consequences of treating music as background rather than design. The principles apply directly to creator content: music that "fills space" without communicating emotion is functionally invisible.
For Advanced Study
"Audio-Visual Correspondences and Cross-Modal Interactions"
Spence, C. (2011). In The Oxford Handbook of Multisensory Processing. Oxford University Press.
Spence's comprehensive review of how the brain combines audio and visual information provides the advanced neuroscience behind multisensory integration (Ch. 2) and beat editing (Ch. 20). His treatment of "cross-modal binding" — how the brain decides whether audio and visual stimuli belong to the same event — explains why synchronization (cutting on the beat, foley matching action) creates a stronger experience.
"Sound Design: The Expressive Power of Music, Voice, and Sound Effects in Cinema"
David Sonnenschein (2001). Michael Wiese Productions.
Sonnenschein's framework for designing sound in film — the "sound map" approach to planning audio across a project — provides advanced tools for creators who want to approach sound design systematically. His treatment of psychoacoustics (how the brain interprets sound) includes practical applications for volume, frequency, and spatial sound design.
"The Perception of Musical Tempo"
Quinn, S., & Watt, R. (2006). Perception, 35(2), 267-280.
Quinn and Watt's research on how humans perceive tempo reveals that perceived tempo doesn't always match actual BPM — context, musical complexity, and listener expectation all modulate how fast music "feels." This research has implications for music selection: a 100 BPM track with sparse instrumentation may feel slower than a 90 BPM track with complex rhythms. Understanding perceived tempo allows more precise music-content matching.
Suggested Reading Order
| Priority | Source | Time Investment |
|---|---|---|
| Start here | Levitin, This Is Your Brain on Music (Ch. 1-5) | 3-4 hours |
| Next | YouTube Audio Library (explore by mood/genre) | 1-2 hours |
| Then | Every Frame a Painting, "Marvel Symphonic Universe" | 15 minutes |
| Practice | Freesound.org (explore sound effects categories) | 1 hour |
| Deep dive | Juslin & Sloboda, Music and Emotion (Ch. 1, 3, 11) | 4-6 hours |
| Advanced | Paulmann & Kotz (2008) on emotional speech processing | 1-2 hours |
| Advanced | Sonnenschein, Sound Design | 6-8 hours |