Chapter 4 Exercises
This chapter's exercise set is the biggest Listening Lab in the book, and that's deliberate: Chapter 4 is where your ear training begins, and everything in Parts II through IX assumes you did this work. You'll need: your three reference tracks from the Project Checkpoint, your best available playback (used consistently β same system for every exercise), your listening journal, and a DAW with a stock spectrum analyzer for Part C. Write everything down. An observation you didn't write is an observation you'll lose, and case study 2 should have convinced you the receipts matter.
Selected answers and worked examples appear in the back-of-book Answers appendix. Exercises marked π§ require focused listening time; budget roughly 20 minutes for any full five-pass exercise.
A pacing suggestion that respects your own ear-fatigue rules: Part A in one sitting (it's reading and writing, not listening), then one or two Part B exercises per day across a week, Part C alongside whichever B exercise it supports, and Parts D and M at the end as consolidation. Spreading the listening work is not a compromise β spaced sessions train ears measurably better than a single heroic Saturday, and the heroic Saturday violates half the hygiene table anyway.
Part A β Conceptual
A1. Two modes. Define critical listening and passive listening in your own words, then describe one real moment from the past week when you were in each mode. Finally: explain why "passive listening is wrong" would be a misreading of this chapter.
A2. The map from memory. Without looking back, write out the six frequency bands: name, approximate range, and one sound that has its center of gravity in each. Then check yourself against the chapter and note which boundaries you misplaced. (Most people misplace the mids/high-mids border first β worth noticing which way your memory bent it.)
A3. The two traps. A friend mixes at whisper volume "to protect my ears" for six hours, then masters a comparison of two limiter settings by ear, with one version playing noticeably hotter. Using the FletcherβMunson curves, explain (a) what their quiet mixing is doing to their bass decisions, and (b) why their limiter comparison is rigged before it starts β and what one procedural rule would fix it.
A4. Masking, three ways. Explain perceptual masking to an imaginary reader who skipped this chapter, using: (1) one everyday non-music example, (2) one mix example involving the mud zone, and (3) the upward-spread property β including which instrument in a typical beat is the biggest masking aggressor and why.
A5. The three axes. Name the three soundstage dimensions and, for each, list the cue(s) a brain uses to construct it. Which axis is "honest" (physically real in stereo), which is constructed from learned distance cues, and which is mostly a frequency illusion?
A6. The threshold, defended. Write a paragraph (5β8 sentences) defending this claim to a skeptic: "You hear with your brain, not your ears β therefore hearing is trainable." Use at least two phenomena from the chapter (cocktail party, phonemic restoration, McGurk, missing fundamental, expectation bias) as evidence, and end with the practical consequence for a producer on a budget.
A7. Hygiene audit. Write your own listening-hygiene plan as five concrete commitments (session length, level discipline, break behavior, decision timing, hearing protection). Then identify which one you are most likely to break, and what it will cost you when you do.
A8. One question at a time. The five-pass method exists because of a specific limitation of human attention. Name it, describe the everyday phenomenon that demonstrates it, and explain what happens when a beginner ignores it and tries to "hear everything at once."
Part B β π§ Listening Lab
The heart of this chapter. Do these across several days, not one marathon β you now know why. Every exercise produces journal entries.
B1. The calibration listen. Run the complete five-pass method on Billie Eilish's "bad guy" before reading case study 1. Write full notes for all five passes. Then read the case study and compare its walkthrough against your notes, pass by pass. Score yourself: what did you catch? What did you miss entirely? What did you hear that the case study didn't mention? (That last category is not failure β it's your developing taste.) File the comparison in your journal as your calibration baseline.
B2. Genre contrast: the rock pass. Full five-pass listen on a classic rock-era mix β Fleetwood Mac's "Dreams" or Eagles' "Hotel California" are ideal candidates: famous, widely available, and mixed in an era with very different space and width philosophies. In your dynamics and space passes especially, note what this mix does that a modern pop mix wouldn't dare (and vice versa).
B3. Home turf. Full five-pass listen on one track from your reference playlist β the genre you actually produce. This one matters most: finish with a one-paragraph summary titled "What this mix does that mine doesn't." That paragraph is a personal curriculum; keep it where you can see it.
B4. The band-identification drill (your baseline test). Queue ten songs you know well, across genres. For each: play the first 30 seconds, pause, and name the band where the dominant energy sits (sub / lows / low-mids / mids / high-mids / air). Commit to an answer out loud or in writing before moving on β no hedging. Then verify with a spectrum analyzer (exercise C2 shows you how) and score yourself out of ten. Record the score and the date in your journal: this is the baseline you'll re-test against in Part II and beyond. Most untrained listeners land around chance on the middle bands and do better at the extremes; wherever you land, that number only moves one direction with practice.
B5. The masking hunt. Find three masking moments in dense professional mixes β places where an element you know is playing becomes inaudible when something louder enters its frequency neighborhood. Strong hunting grounds: a wall-of-sound rock chorus (The Killers' "Mr. Brightside" chorus), a Phil Spector-era production (The Ronettes' "Be My Baby"), or any maximalist EDM drop you know. For each: name the masked element, the masker, the band they're fighting over, and the moment (timestamp) the disappearance happens. Bonus: find one place where the arrangement deliberately removes a part to prevent masking β and note how you'd never notice the removal as a listener.
B6. The loudness-bias self-test. Pick one reference track. Listen to the same 60-second section three times at three distinct volumes: quiet (conversation underneath it would be easy), moderate (your standard), and loud (short exposure β one minute, no more). After each, write three sentences about the balance: how prominent do the bass, vocal, and cymbals feel? Don't re-read your earlier notes until done. Then compare the three descriptions and annotate them with what FletcherβMunson predicts. You will have described the same file three different ways. Keep this page forever β it's the proof you'll need on the day you're tempted to make a big decision at the wrong volume. Partner variant, strongly recommended: have a friend control the volume and the order while you look away, so you don't know which level is playing first β blind impressions are worth double, because they remove the expectation bias the chapter warned you your interpreter smuggles into everything.
B7. The whisper test. Play a professional mix at genuinely quiet volume β background-conversation level. List every element you can still clearly hear. That short list is the mix's true balance hierarchy, stripped of flattery by the equal-loudness curves. Repeat on a rough mix or demo of your own and compare lists. (If your vocal or lead element isn't on your track's whisper list, you've found your first balance problem β months before the mixing chapters.)
B8. Stage cartography. On headphones, eyes closed, map the stereo stage of three songs: your two favorite references plus one track famous for width (Tame Impala's "The Less I Know the Better" works beautifully). For each, draw the top-down soundstage grid from this chapter β center residents, wing residents, depth placement β and run the pointing test on at least three elements per song. Compare your three maps: which mix uses the narrowest total stage? Which saves width for specific moments?
B9. Depth ranking. Choose a sparse, spacious track β Lorde's "Royals," Bon Iver's "Skinny Love," or "bad guy" again β and list every element you can identify, then rank them nearest-to-farthest. For the nearest and farthest, write down which cues told you (level? brightness? dryness? transient detail?). Then do the same for one of your own tracks. Count the distinct distances in each. Flat mixes are a depth-count of one; don't be surprised, be motivated.
B10. The small-speaker translation listen. Play one bass-heavy reference on your best system, then immediately on earbuds or a phone speaker. Document five differences in band vocabulary β and pay special attention to the bass: can you still hear the bass notes on the small speaker? Can you still feel the weight? Explain the difference using the missing fundamental. This exercise is the seed of the translation testing protocol that Chapter 30 builds into a full diagnostic ritual.
Part C β ποΈ DAW Lab
Concepts β clicks. Any DAW works; Appendix E translates control names across all the major ones.
C1. The level-matched A/B station. Build the comparison setup you'll use for the rest of this book: import one reference track and your current work-in-progress into the same session on two tracks. Pull the reference's fader down until the two feel equally loud on a quick switch (your DAW's meters can sanity-check you afterward β expect the mastered reference to need a substantial cut). Practice switching between them with mutes or solos until the toggle takes under a second. Save this as part of your session template. Write down the rough offset you needed: that number is your first concrete measurement of the loudness gap between a master and a rough mix.
C2. Trust, then verify: the analyzer. Put your DAW's stock spectrum analyzer on the master bus (every major DAW ships one β Appendix E lists where it hides). Play five songs; for each, first call the dominant band by ear and write it down, then look at the analyzer and check yourself. Two rules: always guess before you look, and treat the analyzer as the feedback loop, not the perception. The analyzer is how guesses become skill; staring at it without guessing first trains your eyes and leaves your ears exactly where they started.
C3. The band-isolation tour. On a full professional mix, insert a stock EQ and use steep high-pass and low-pass filters together to isolate one band at a time β solo the sub, then the lows, then the mud zone, and on up to the air. (You're not mixing; you're using filters as a listening microscope. EQ as a mixing tool waits for Chapter 22.) Spend thirty seconds inside each band and write down what actually lives there β most people are shocked by two of them: how much musical information sits in the mids alone, and how little is genuinely in the sub. Repeat on one of your own tracks and compare notes, band by band.
C4. The mono fold-down discovery. Find your DAW's mono switch (master utility, or a stock plugin β Appendix E). Play a wide modern mix in stereo, then fold it to mono mid-chorus. Write down what changes: what gets quieter, what disappears, what's untouched. You don't yet have the tools to fix what you hear β those arrive in the mixing chapters β but from today, the mono switch is part of your listening vocabulary, and you'll never trust width you haven't folded.
C5. The silence inventory. With nothing playing, sit at your listening position for three full minutes and write down every sound you can identify: the computer fan, traffic, the refrigerator's cycle, your chair. This trains the same attentional muscle as the five-pass method β and the list itself matters: every sound on it is part of the noise floor under your monitoring chain (Chapter 3) and a preview of what your room contributes before treatment enters the picture in Part II.
Part D β Synthesis
D1. Design your program. Open Appendix F and build your personal week-one ear-training schedule: which drill on which day, at what time, for how many minutes, on which system. Make it survive your actual life (a ten-minute daily plan you keep beats a forty-minute plan you abandon by Thursday). Write the schedule down and put the first session on your calendar β tonight if possible.
D2. The listening autopsy. Run the complete five-pass method on your own current track, level-matched against your best reference (use your C1 station). Produce the difference list: every gap you can hear, organized by pass, each item written as a specific observation ("their kick has a click around 3 kHz that mine lacks") rather than a verdict ("mine is worse"). Rank the list by how much each difference bothers you. Congratulations: you've written your personal syllabus for Parts IIβVII. Date it and keep it β you'll cross items off for the rest of the book.
D3. Teach the bands. Write a short script (200β300 words) explaining the six frequency bands to a friend who has never produced, using only sounds from everyday life β no gear talk, no Hz numbers until the last line. (Teaching forces precision: if your explanation of the mud zone doesn't land without jargon, you don't own the concept yet.)
D4. Argue the theme. This book's first theme claims trained ears plus cheap tools beat untrained ears plus expensive tools. Build the case in a page, using at least three pieces of evidence from this chapter (candidates: the FletcherβMunson traps and their procedural fixes, masking detection, the difference list, the brain-not-ears threshold, the bedroom origins of a Grammy-winning album β the book's founding premise and case study 1's backstory). Then β this is the important half β steelman the opposing case: name two situations where the gear genuinely is the limit, and how you'd know you'd hit one. Honest tradeoffs beat slogans, even our own.
Part M β Mixed Practice
Interleaved review: each exercise reaches back into Chapters 1β3. Spaced retrieval is how this sticks.
M1. (Ch 1 + Ch 4) The phone-speaker paradox. A phone speaker physically reproduces almost nothing below roughly 300 Hz, yet you clearly hear the bassline of a track playing through one. Explain the full chain of what's happening, using harmonics from Chapter 1 and the missing fundamental from this chapter. Then state the production consequence: which band can you never judge on a phone, and what specifically can you still check there?
M2. (Ch 2 + Ch 4) The codec's secret. Chapter 2 told you lossy formats like MP3 discard data you'd "miss least." Using this chapter's masking section, explain what the encoder is actually computing and why low-bitrate files tend to fall apart most audibly in the air band β cymbals smearing and sizzle turning swirly β when the codec runs out of budget.
M3. (Ch 3 + Ch 4) The full monitoring chain, extended. Chapter 3 ended your monitoring chain at the speaker. Draw it again from the DAW's output to perception, adding this chapter's stages: room, outer ear, cochlea, interpreter (brain). Mark every stage that changes the sound before you experience it, and identify which stages you can buy improvements for and which you can only train. Then answer: in light of "fix it at the earliest stage," why is an untrained interpreter the most expensive fault in the whole chain?
M4. (Ch 1 + Ch 4) Anatomy of a kick, revisited. Chapter 1 dissected a kick drum into click, knock, and boom. Assign each component to its frequency band from this chapter's six. Now explain, using FletcherβMunson, why the click seems to vanish first when you turn the track down quietly β wait, check yourself: does the click vanish first, or the boom? Reason it out from the curves, then verify by ear with a real kick-heavy track at low volume, and write down what you actually heard.
M5. (Ch 2 + Ch 3 + Ch 4) The 96 kHz air myth. A forum post insists: "Record at 96 kHz β the air band sounds way more open because there's more frequency resolution up top." Evaluate the claim in three steps: what Chapter 2's Nyquist explanation says 44.1 kHz already captures (out to what frequency, versus where the air band lives); what Chapter 3 would tell you to check first in the chain before blaming the format; and what this chapter suggests about why the poster genuinely hears an improvement after switching (two candidate biases, both named in this chapter). Conclude with the level-matched, blind way you'd actually test the claim.
Done? Re-do the Productive Struggle exercise from the chapter: same song, same chorus, five new sentences. Put the two descriptions side by side, and notice that nobody upgraded your headphones in between.