Chapter 4 Quiz

Q: Critical listening differs from passive listening primarily in: the quality of equipment required where attention is aimed and whether observations are produced the volume at which you listen the genre of music involved

B. Critical listening aims attention at the sound's construction, one question at a time, and produces actionable (written) observations. It requires no special gear or volume — only a deliberate change of mode.

Q: The "mud zone" — the band where amateur mixes most commonly congest — is: below 60 Hz 60–200 Hz 200–500 Hz 2–6 kHz

C. 200–500 Hz, the low-mids. Nearly every instrument deposits secondary energy there, so the pileup is a committee crime with no single guilty track.

Q: Harshness and listening fatigue most often point to an excess in: the sub the lows the mids the high-mids (2–6 kHz)

D. The 2–6 kHz presence-and-harshness band, where the ear (thanks partly to ear-canal resonance around 3 kHz) is at its most sensitive — small excesses read as pain early.

Q: According to the Fletcher–Munson curves, listening at *quiet* volume makes a mix seem: bass-heavy and harsh bass-shy and mid-forward identical to loud playback wider than it is

B. At low levels, perception of the spectrum's edges — bass especially, extreme highs too — falls away first, leaving the midrange. Same file, different perceptual tone curve.

Q: Two versions of a mix are compared casually and version A is 1 dB louder. The most likely outcome is that listeners will report A as: louder better — richer, clearer, fuller indistinguishable from B more dynamic but thinner

B. Small level increases typically don't register consciously as "louder"; they arrive as Fletcher–Munson side effects — fuller lows, airier highs — which the brain files as quality. This is why every legitimate A/B must be level-matched.

Q: Masking spreads asymmetrically. The correct statement is: high sounds mask low sounds more than the reverse low sounds mask high sounds more than the reverse masking only occurs between identical frequencies masking only occurs between sounds on opposite sides of the stereo field

B. Masking spreads upward: loud low-frequency content casts a shadow over higher frequencies far more than the reverse, which is why the low end of a mix is always the potential aggressor.

Q: The five-pass reference listen plays one song five times because: ear fatigue requires it auditory attention is effectively single-lane, so each pass gets one question five plays are needed for the brain to memorize the song streaming services count five plays as engagement

B. You can aim attention precisely (cocktail-party effect) but not split it well. One question per play-through converts "listen harder" into five answerable questions.

Q: The correct order of the five passes as taught in this chapter is: frequency, balance, dynamics, space, width balance, frequency, space, width, dynamics width, space, balance, frequency, dynamics dynamics, balance, frequency, width, space

B. Balance (what's loudest), frequency (who owns each band), space (how far away), width (where left-right), dynamics (what moves).

Q: In a level-matched A/B between your rough mix and a commercial reference, the level adjustment should be: push your mix up until it matches the reference pull the reference down to your mix's ballpark push both up equally no adjustment — masters and mixes should be compared as-is

B. The mastered reference is hotter; pulling it down matches loudness safely. Pushing your mix up risks the 0 dBFS ceiling from [Chapter 2](../chapter-02-digital-audio/index.md), and matched — not maximum — loudness is all the comparison needs.

Q: Which set of cues signals "this sound is close to me" on the soundstage's depth axis? quiet, dark, wet, soft loud, bright, dry, sharp transients hard-panned, filtered, delayed centered, compressed, mono

B. Close sounds are louder, brighter (air doesn't absorb their highs over distance), drier, and transient-sharp. Distant sounds invert all four. Mixers fake these cues to build depth.

DataField.Dev

Chapter 4 Quiz

Closed book first pass — answer everything before opening any <details> block. The point isn't the score; it's finding out which concepts feel solid and which were an illusion of understanding (a distinction this chapter should have made you suspicious about in general). Scoring guidance is at the end.

Section 1 — Multiple Choice (15 questions, 1 point each)

1. Critical listening differs from passive listening primarily in: - A) the quality of equipment required - B) where attention is aimed and whether observations are produced - C) the volume at which you listen - D) the genre of music involved

Answer

**B.** Critical listening aims attention at the sound's construction, one question at a time, and produces actionable (written) observations. It requires no special gear or volume — only a deliberate change of mode.

2. The "mud zone" — the band where amateur mixes most commonly congest — is: - A) below 60 Hz - B) 60–200 Hz - C) 200–500 Hz - D) 2–6 kHz

Answer

**C.** 200–500 Hz, the low-mids. Nearly every instrument deposits secondary energy there, so the pileup is a committee crime with no single guilty track.

3. Harshness and listening fatigue most often point to an excess in: - A) the sub - B) the lows - C) the mids - D) the high-mids (2–6 kHz)

Answer

**D.** The 2–6 kHz presence-and-harshness band, where the ear (thanks partly to ear-canal resonance around 3 kHz) is at its most sensitive — small excesses read as pain early.

4. According to the Fletcher–Munson curves, listening at quiet volume makes a mix seem: - A) bass-heavy and harsh - B) bass-shy and mid-forward - C) identical to loud playback - D) wider than it is

Answer

**B.** At low levels, perception of the spectrum's edges — bass especially, extreme highs too — falls away first, leaving the midrange. Same file, different perceptual tone curve.

5. Two versions of a mix are compared casually and version A is 1 dB louder. The most likely outcome is that listeners will report A as: - A) louder - B) better — richer, clearer, fuller - C) indistinguishable from B - D) more dynamic but thinner

Answer

**B.** Small level increases typically don't register consciously as "louder"; they arrive as Fletcher–Munson side effects — fuller lows, airier highs — which the brain files as *quality*. This is why every legitimate A/B must be level-matched.

6. Masking spreads asymmetrically. The correct statement is: - A) high sounds mask low sounds more than the reverse - B) low sounds mask high sounds more than the reverse - C) masking only occurs between identical frequencies - D) masking only occurs between sounds on opposite sides of the stereo field

Answer

**B.** Masking spreads upward: loud low-frequency content casts a shadow over higher frequencies far more than the reverse, which is why the low end of a mix is always the potential aggressor.

7. The five-pass reference listen plays one song five times because: - A) ear fatigue requires it - B) auditory attention is effectively single-lane, so each pass gets one question - C) five plays are needed for the brain to memorize the song - D) streaming services count five plays as engagement

Answer

**B.** You can aim attention precisely (cocktail-party effect) but not split it well. One question per play-through converts "listen harder" into five answerable questions.

8. The correct order of the five passes as taught in this chapter is: - A) frequency, balance, dynamics, space, width - B) balance, frequency, space, width, dynamics - C) width, space, balance, frequency, dynamics - D) dynamics, balance, frequency, width, space

Answer

**B.** Balance (what's loudest), frequency (who owns each band), space (how far away), width (where left-right), dynamics (what moves).

9. In a level-matched A/B between your rough mix and a commercial reference, the level adjustment should be: - A) push your mix up until it matches the reference - B) pull the reference down to your mix's ballpark - C) push both up equally - D) no adjustment — masters and mixes should be compared as-is

Answer

**B.** The mastered reference is hotter; pulling it down matches loudness safely. Pushing your mix up risks the 0 dBFS ceiling from [Chapter 2](../chapter-02-digital-audio/index.md), and matched — not maximum — loudness is all the comparison needs.

10. Which set of cues signals "this sound is close to me" on the soundstage's depth axis? - A) quiet, dark, wet, soft - B) loud, bright, dry, sharp transients - C) hard-panned, filtered, delayed - D) centered, compressed, mono

Answer

**B.** Close sounds are louder, brighter (air doesn't absorb their highs over distance), drier, and transient-sharp. Distant sounds invert all four. Mixers fake these cues to build depth.

11. The precedence (Haas) effect says that two nearly identical arrivals within roughly 1–30 ms are perceived as: - A) an echo pair - B) one sound, located at the later arrival - C) one sound, located at the first arrival - D) two sounds an octave apart

Answer

**C.** They fuse into a single event whose direction is assigned to the first arrival; the copy adds loudness, fullness, and width rather than a separate identity. Past roughly 35–40 ms, fusion breaks and you hear an echo.

12. You hear a bassline's notes clearly on earbuds that physically cannot reproduce the bass's fundamental. The explanation is: - A) the earbuds boost the sub digitally - B) the missing fundamental — the brain infers pitch from harmonic spacing - C) bone conduction carries the sub - D) the recording engineer doubled the bass an octave up

Answer

**B.** Pitch is computed from the spacing of the harmonic stack, so the harmonics the small driver *can* reproduce imply the fundamental. You get the note; you don't get the weight.

13. The "whisper test" — playing a mix at very low volume — reveals: - A) the mix's true balance hierarchy - B) the noise floor of the recording - C) stereo width problems - D) clipping on the master

Answer

**A.** Equal-loudness behavior strips away everything but the genuinely loudest elements; whatever survives a whisper-level listen is the actual top of the balance hierarchy.

14. Lossy codecs like MP3 achieve their file-size reduction primarily by exploiting: - A) Nyquist's theorem - B) the precedence effect - C) perceptual masking — discarding what the louder content already hides - D) equal-loudness curves

Answer

**C.** Perceptual coding computes what masking renders inaudible and throws it away — [Chapter 2](../chapter-02-digital-audio/index.md)'s "what lossy loses," now with its mechanism attached.

15. A useful baseline rule from the listening-hygiene section is: - A) mix as loud as possible to hear every detail - B) take breaks by switching to a different playlist - C) work at a moderate consistent level, with sessions around 45–60 minutes followed by silence breaks - D) make final decisions only after midnight, when it's quiet

Answer

**C.** Moderate consistent level keeps the perceptual curve stable; regular silence breaks (not more audio) reset attention and sensitivity. These are habits, not laws — but they're cheap and they compound.

Section 2 — True / False (justify your answer, 2 points each)

16. True or false: People with professionally trained ears have measurably superior ear anatomy — that's what "golden ears" means.

Answer

**False.** The trainable difference is in the interpreter — categories, attention, and checked feedback built in the brain — not the transducer hardware. The threshold concept: you hear with your brain, not your ears. "Golden ears" are built, not born, which is precisely why ear training works.

17. True or false: There is one correct monitoring volume at which your ears hear the spectrum flat, and professional engineers mix at it.

Answer

**False.** Equal-loudness contours change shape at *every* level; no volume yields flat perception. Professionals respond with procedure, not a magic number: a moderate consistent working level, plus brief loud and quiet checks used as deliberate, different tests.

18. True or false: In a busy mix, a masked instrument is still technically audible — you could pick it out if you concentrated hard enough.

Answer

**False.** Genuine masking means the quieter sound is not delivered to conscious perception at all — it's perceptually absent, not merely difficult. That's why masking is detected indirectly (the mute test: listen for what *appears* when something else leaves) rather than by concentrating on the masked element.

19. True or false: The depth dimension of a stereo soundstage is constructed by the listener's brain from cues like loudness, brightness, dryness, and transient clarity — nothing is physically behind the speakers.

Answer

**True.** Depth is an inference from learned distance cues; mixers manufacture those cues (level, tone, reverb, transient softening) and the interpreter builds the theater. Width, by contrast, has a physically real basis in inter-channel differences.

20. True or false: Because this chapter's six band boundaries (60, 200, 500, 2k, 6k) are scientifically exact, a sound at 210 Hz behaves fundamentally differently from one at 190 Hz.

Answer

**False.** The boundaries are working conventions — a shared map for naming what you hear, consistent within this book — not physical walls. Nothing magical changes at the border numbers; what matters is owning *a* consistent vocabulary and using it until it becomes perception.

Section 3 — Short Answer (4 questions, 3 points each)

21. Explain the "loudness arms race" a producer can run against their own mix, naming the perceptual phenomenon that drives it and the band where it most often happens. End with the two-level defense hierarchy this book proposes (one fix from arrangement, one from a later mixing chapter).

Answer

Instruments sharing a band mask each other, so each feels too quiet in turn; the producer raises one, which masks the next, which gets raised too — every fader climbs and clarity never improves because the problem is overlap, not level. It most often happens in the low-mids (200–500 Hz), everyone's second home. Defenses: first, arrangement — fewer simultaneous occupants per band; second, EQ — carving complementary pockets, the business of Chapter 22.

22. Your friend says: "I A/B'd my mix against a reference and mine sounded weak, so I turned mine up until it sounded better and called it done." Identify two distinct procedural errors and describe the correct workflow.

Answer

Error one: comparing at unmatched levels — the mastered reference's extra loudness guarantees the rough mix "loses" for reasons that have nothing to do with quality. Error two: "turning mine up until it sounded better" treats louder-sounds-better bias as a fix and risks the 0 dBFS ceiling. Correct workflow: pull the *reference down* to the mix's loudness, then run structured passes (the five-pass method) and write a difference list — specific, actionable gaps — instead of a loudness contest.

23. Describe the mute test for masking: what you do, what you listen for, and why soloing the suspect track can't give you the same information.

Answer

Mute one track in the full mix and listen for what *appears* elsewhere — shimmer, air, warmth, detail that surfaces the moment the muted track leaves. Whatever appeared was being masked by the muted track. Soloing can't reveal this because masking is a relationship between sounds: a track in solo always sounds complete, and the listener never hears your mix in solo anyway.

24. A producer does all their critical listening on whichever playback is handy — earbuds today, living-room speakers tomorrow, the car on weekends — and their journal notes contradict each other constantly. Using Chapter 3's monitoring-chain idea and this chapter's threshold concept, explain what's going wrong and the single cheapest fix.

Answer

Every playback chain recolors the sound differently before it reaches the interpreter, and the interpreter calibrates to what it repeatedly hears. Rotating chains means the "instrument" being trained keeps changing, so observations aren't comparable and no stable calibration forms. Cheapest fix: consistency — pick one best-available system as the analysis chain and do all structured listening there, using other systems only as deliberate translation checks. Consistency beats quality.

Section 4 — Applied Scenario (1 question, 6 points)

25. Your collaborator sends a late-night message: "Mixed for seven hours straight tonight, mostly cranked because it slaps louder that way. Honestly it sounds incredible right now. I compared it to the Dua Lipa reference for two seconds (mine's quieter, but whatever, close enough) and I think mine's actually wider and brighter. Gonna bounce it and post it tonight."

Using at least four distinct concepts from this chapter, predict specifically what they'll hear tomorrow morning, explain why each prediction follows, and write the three-step protocol you'd recommend before anything gets posted.

Answer

Predictions and reasons (any four of these earn the points): (1) **Morning harshness** — seven hours loud means ear fatigue and temporary threshold shift dulled their high-mid sensitivity; they kept adding presence/air to compensate, so fresh ears will find it shrill. (2) **Bass surprises** — hours at high volume flatten the perceived curve (Fletcher–Munson), so the low end that felt rich will likely be thin or unbalanced at normal volume. (3) **The reference comparison was void** — two seconds at unmatched levels (theirs quieter) means the louder/quieter bias rigged every impression; "wider and brighter" judged against a louder master is exactly the illusion the chapter warns about, compounded by expectation bias toward their own work. (4) **"Sounds incredible right now" is the 2 a.m. trap** — fatigue plus volume creep flatters everything late in a session; the judgment was made by the least reliable version of their interpreter. Protocol: (1) Save and stop — no bounce, no post; sleep is part of the toolchain. (2) Tomorrow, fresh: listen once at moderate volume on the usual system *before* touching anything, then run the five-pass method against the reference, level-matched (reference pulled down), and write the difference list. (3) Make changes from the ranked list one at a time, with a whisper-test balance check, and only then bounce.

Scoring

Section	Points available
1 — Multiple choice	15
2 — True/false + justification	10
3 — Short answer	12
4 — Applied scenario	6
Total	43

Score	Reading
38–43	Solid. The vocabulary is becoming perception — keep the daily program running.
30–37	Good foundation; re-read the sections behind your misses (most students wobble on masking's upward spread and the equal-loudness traps first).
22–29	Re-do the Listening Lab's B4 and B6 with the chapter open — these ideas stick through the ears, not the eyes.
below 22	No verdict on you, only on the route: re-read with your three references playing, then retake in two days. This is a skill chapter; reps beat re-reading.

For T/F items, award the second point only if the justification names the mechanism (masking, equal-loudness behavior, interpreter vs. hardware), not merely the verdict.