Chapter 11 Exercises — Recording Vocals

These exercises are where the session protocol stops being a chapter and becomes muscle memory. You need a voice and any microphone — a USB mic counts, your interface mic counts, and several drills work with a phone. You do not need to be a "singer": every lab in Part C works with confident speaking, humming, or the bravest singing you've got, because the skills being drilled — position locking, gain margin, headphone-mix psychology, take logging — don't care about your range. If you have a willing friend with a voice, several exercises get better; none require one.

If time is short, work in this order: C2 and C3 (the setup checklist and the 4-take session — together they're the chapter in miniature), then B4 (alternate takes of famous songs — the fastest ear-opener in the set), then Part A to verify the concepts. Selected answers appear in the appendix "Answers to Selected Exercises"; exercises with a collapsible hint reward wrestling before peeking.


Part A — Conceptual

A1. The setup-serves-the-take audit. The chapter claims every technical decision in a vocal session exists to serve the performance. For each setup element below, write one sentence naming the performance failure it prevents (not the technical failure — the human one): (a) gain set with 6 dB of margin above the soundcheck maximum; (b) the pop filter used as a distance-keeper; (c) a touch of monitor-only reverb in the headphones; (d) the take log's coverage rule; (e) speaking immediately after every take.

A2. The closet trial. A friend is about to record their EP's lead vocals in a packed bedroom closet because "everyone online says closets are vocal booths." Write the honest two-sided brief: what the clothes genuinely do well (name the frequency region), what the closet's size does badly (name that region too, and the Chapter 10 mechanism behind it), and the five-minute test that settles it for their voice instead of the internet's.

A3. Complement, don't compound. Three singers: (1) a bright, cutting tenor whose esses already sparkle; (2) a warm, dark alto who gets called "muffled" on phone speakers; (3) a powerful belter tracking in an untreated bedroom. Assign each a first-call mic from the chapter's decision table and defend the choice in one sentence of mechanism — what the mic's fixed opinion adds or withholds, and why that's the missing ingredient rather than the same opinion twice.

Hint Start from what each voice has a surplus of, then pick the opinion that supplies the *deficit*. For singer 3, the room is part of the voice's situation — which family can't hear the room well, and why?

A4. The asymmetric cliff. Using Chapter 2's vocabulary, explain why the chapter tells you to track vocal peaks at -18 to -12 dBFS rather than "as hot as possible without clipping." Your answer must name what recording too low actually costs at 24-bit, what recording too hot costs, and why the shape of that tradeoff (not the mere existence of it) dictates the habit. Finish with the sentence about which take clipping always chooses to ruin, and why.

A5. The sharp singer, diagnosed three ways. A vocalist goes sharp on every chorus. List three different headphone-mix causes from the chapter that produce exactly this symptom, and the distinct fix for each. Then name the one question you'd ask the singer before touching anything.

A6. Comping's input contract. Comping doesn't happen until Chapter 15, but this chapter claims the comp is won or lost at tracking. List four variables that must match across takes for phrases to be interchangeable, and for each, name the tracking-day discipline that locks it. Why does a comp from mismatched takes read as a "ransom note"?

A7. Plosive vs. sibilance, mechanically. Both are consonant problems; they are nearly opposites physically. Contrast them in a short table of your own: what each is (moving air vs. concentrated high-frequency energy), where each lives on the spectrum, which setup choices make each worse, and the at-source defenses for each. Then explain why the off-axis tilt appears in both defense columns.

A8. The performance curve. Sketch (or describe) the arc of a vocal session — warmup slope, peak window, fatigue slope — and place these events where they belong: the gravy take, take 14 syndrome, the bank announcement, the targeted bridge punch-ins, the decision to stop. For two of those events, name the audible sign that tells you you've arrived there.


Part B — 🎧 Listening Lab

Listen on the best playback you own, then re-check one drill on earbuds. Keep the Chapter 4 listening journal going and write what you hear in band vocabulary (sub / lows / low-mids / mids / high-mids / air). Streaming versions are fine throughout. As always: these drills train trait recognition, not trivia — production, performance, and processing differ wildly between records, and your journal should say what you hear, not what the lore says you should hear.

B1. Distance hunting. Cue Phoebe Bridgers, "Garden Song" — a vocal that feels inches from your ear: breathy detail, proximity warmth, almost no audible room. Then cue Florence + The Machine, "Dog Days Are Over," where the voice rides bigger spaces and bigger distances as the arrangement grows. Journal: for each, name the band where the "closeness" or "distance" cue lives, and what you can hear between phrases (the room lives in the gaps). Predict what each vocal would lose if recorded at the other's distance.

B2. Dynamic grit vs. condenser air. Cue Michael Jackson, "Thriller" — engineer Bruce Swedien famously favored a broadcast-style dynamic for much of Jackson's vocal work in that era, and the lore is widely repeated. Then cue Adele, "Hello," a modern condenser-sheen lead. Chase traits, not trivia: compare the leading edges of consonants, the air band above 10 kHz, how breaths register, how "tough" versus "vivid" each voice feels. Journal three differences mapped to transducer properties from Chapter 7 — then write the honest disclaimer that performance, era, and processing are confounds, and that you're training your ears on tendencies.

B3. Sibilance safari. Pick three vocal-forward tracks: one whisper-close modern production (Billie Eilish, "when the party's over" — esses at point-blank range), one bright 2010s pop single of your choosing, and one famously smooth vocal (Sade, "Smooth Operator"). Hunt esses only: where does each record's s energy sit (high-mids or air), does it ever bite, and can you catch a moment where de-essing audibly dulls an s into a soft lisp? Journal which record you'd hand to a singer as the "this is how we want your esses to land" reference — that's a T4 move you'll reuse in your own soundchecks.

B4. Alternate takes of famous songs. This is the chapter's thesis made audible, using officially released material. Cue The Beatles, "While My Guitar Gently Weeps" — the Anthology 3 acoustic take — against the released album version; then any alternate take of Bob Dylan's "Like a Rolling Stone" from The Bootleg Series against the single. Journal, for each pair: which take is more polished, which is more alive, and — the hard question — which you'd have chosen as producer, defended in one sentence about the song rather than the singing. There is no wrong verdict; there is only a defended one.

B5. One take, no net. Cue any live, single-pass vocal performance you love — an NPR Tiny Desk version of a song you know in its studio form is ideal, or Aretha Franklin's Amazing Grace album for the towering example. A/B against the studio version. Journal: what does the live take have that polish can't add, what does the studio take have that one pass can't deliver, and where on the one-take/forty-take spectrum (case study 1) does your own taste apparently live?

B6. The breath census. Cue Billie Eilish, "everything i wanted," and Bon Iver, "re: stacks" — two productions where breaths are kept, audible, doing intimacy work. Then cue a maximalist modern pop chorus of your choosing and listen for breaths: mostly gone or tucked. Journal: when does a breath read as humanity and when as mess? Find one breath in any song you love that you're convinced was left in on purpose, and say what it does for the line that follows.

B7. Doubles and ad-libs as architecture. Cue Kendrick Lamar, "HUMBLE." — map where the voice is one Kendrick and where it becomes stacked doubles on emphasis words, plus the ad-lib layer answering the lead. Then cue any Migos-era trap track and chart the ad-libs as a second conversation. Journal: which words got doubled and why those words; what the ad-libs do rhythmically that the lead can't. Then re-listen to a chorus in your own genre and write down what its (subtler) double strategy seems to be — you're capturing exactly these layers in this chapter's checkpoint.


Part C — 🎛️ DAW Lab

Ground rules: record into your Chapter 6 template, name takes per your convention (vox_lead_tk3, not audio_047), and level-match before any comparison — the louder take wins by default and your education doesn't exempt you (Chapter 4). Work the labs at conversational monitoring level, and keep every take you record: deleting takes mid-session is the habit these labs exist to break. Phone users: capture in a voice-memo app and import; the lessons live in playback.

C1. Find the corner. Run the Chapter 10 toolkit specifically for a vocal spot: one-ear walk while speaking, clap test at three candidate positions, ten seconds of recorded speech at each. Pick the winner, then build the fort: duvet or blankets behind the singer position, rug underfoot. Record the same ten seconds again. Deliverable: a four-line journal verdict — where the spot is, what the treatment audibly changed (band vocabulary), and what residue remains (be honest; you'll meet it again at mix time).

C2. The setup checklist, executed. Run the chapter's full pre-take protocol on yourself or a friend: deadest spot; mic chosen by the voice with one sentence of why; position at 6–10 inches, tilted, pop filter (or improvised nylon-over-hanger filter) set as distance-keeper; tape on the floor; photo of the rig; gain set on the loudest passage at full energy with peaks landing -18 to -12 dBFS; headphone mix built from the recipe table. Then record a 60-second soundcheck verse and A/B it against one reference vocal at matched loudness (T4). Deliverable: the filled checklist plus a three-item gap list versus the reference, each item tagged fixable now (position/mic/room) or later stage (Chapters 15, 17, 22, 29).

C3. The 4-take session. The chapter, in miniature, and the lab that matters most. Using the C2 setup: warm up ten minutes (recorded — mic hot from the first hum), then perform four to six FULL takes of one verse-plus-chorus, no stopping for flubs, sixty seconds between takes. Keep a take log on paper: sections × takes, ✓/○/✗, one note per take. If working alone, leave one phone-notes talkback line after each pass — one focus only. Stop when the log shows coverage, then announce the bank out loud and record one gravy take. Deliverable: the completed log, plus a one-paragraph verdict written the next day with fresh ears: which take is the spine, which phrases from other takes are starred, and — critically — nothing comped. That's Chapter 15's job; your job was coverage.

C4. Plosive lab. Write one plosive-trap sentence ("Peter brought back the broken pump") and record it four ways at constant distance: (a) dead on-axis, no filter, at moderate level — protect your gain margin; (b) on-axis with the pop filter; (c) 15° off-axis, no filter; (d) off-axis with filter. Level-match and compare, watching the waveform for the low-frequency thump as well as listening. Deliverable: rank the four for plosive damage, then write the defense-in-depth sentence you'd actually use on a money line — which two defenses stack, and what (if anything) each cost the tone.

C5. Sibilance lab. Same design, different enemy. Record an ess-heavy sentence ("Cecily's silver bracelets sat in the seaside sun") three ways: on-axis; tilted 15–20°; and four inches farther back. If you own two mics with different top ends, repeat the on-axis pass on the darker one. Level-match. Deliverable: a short table — for each variant, where the esses sit (band vocabulary), how the overall tone changed, and which compromise you'd choose for a bright-voiced singer. Name what each fix billed you (T3): nothing in this lab is free, which is the lesson.

C6. The headphone-mix experiment. Record the same chorus three times, changing only the cans: (a) bone-dry monitoring, voice exposed; (b) the chapter's recipe — voice on top, tuning reference prominent, a touch of reverb; (c) everything loud, voice buried. Then one more pass with one ear off. Don't aim for a fair test of singing — aim to notice. Deliverable: rate each pass for pitch confidence, tension, and how long you'd want to work that way; write your personal headphone recipe card and keep it with the position photo. If recording a friend instead: change the cans between takes without saying what changed, and log what happens to their pitch and posture.

C7. The clipping inoculation. Once, deliberately, on a throwaway phrase — never on a take you care about: set gain so your loudest delivery slams past 0 dBFS and record three seconds. Zoom in on the flat-topped waveform; listen on headphones at modest level. Now re-set gain properly (loudest passage peaking ≈ -15 dBFS) and record the same phrase. Deliverable: one sentence describing what digital clipping sounds like in your own words, one sentence on what the flat tops are (Chapter 2), and your personal safety-margin number written into your session-notes template. Then delete the clipped file — the only deletion these labs endorse.

C8. The hot-mic harvest. Immediately after C3 (same session, same warm voice): record two double passes of your chorus, chasing the phrase ends; one harmony part worked out against your Chapter 9 pad (click off if it fights); and one free ad-lib pass over the outro. Label everything as it lands: vox_dbl_ch_tk1, vox_harm_hi_tk1, vox_adlib_tk1. Deliverable: your session's track list as evidence, plus two sentences: how closely the doubles' phrase-ends actually matched (listen and confess), and one moment from the ad-lib pass worth starring. If you skipped this and the session's cold — that's the lesson too; write down what re-matching the setup tomorrow will cost.


Part D — Synthesis

D1. The session design doc. A friend asks you to produce their first real vocal session: bright-voiced, nervous, untreated bedroom, one budget condenser and one broadcast-style dynamic available, three hours booked. Write a one-page plan: room prep, mic choice (defended by mechanism), position spec, gain plan, headphone recipe, warmup schedule, take protocol with log, talkback strategy for a nervous first-timer (including the playback question and the bone-conduction warning), and your stopping rule. Every line must cite a reason, not a rule.

D2. The talkback script. Here's a session log: take 1 — shaky start, strong chorus; take 2 — verse better, chorus held back, singer asked "was that okay?"; take 3 — best yet but the bridge breath collapsed; take 4 — careful everywhere, alive nowhere. Write your exact between-takes talkback lines after each take, one or two sentences each, following the etiquette: immediate, specific praise first, one focus, direction not adjectives. Then write the line you'd say instead of recording take 5, and defend it from the performance-curve section.

D3. The tradeoff ledger (T3). This chapter is a chain of trades. Pick four — dynamic versus condenser in your actual room; close-and-warm versus backed-off-and-natural; reverb-in-cans versus dry; one more take versus stopping — and write each as a ledger entry: what you bought, what you were billed, and the condition under which you'd reverse the call. The goal is to prove no decision in this chapter is "correct," only priced.

D4. The reference autopsy (T4). Choose THE vocal reference for your own track — the one professional lead whose capture you'd steal. Describe it in band vocabulary plus the chapter's terms: apparent distance, proximity warmth, air, room-in-the-gaps, sibilance behavior, breath policy. Then A/B your best C3 take against it at matched loudness and write the three biggest gaps, each tagged: position/mic/room (fixable now — go fix it and re-record one verse as proof), or editing/mix-stage (note the chapter where that gap's tool lives: 15, 17, 22, 24, or 29). The tag discipline is the exercise: knowing when a gap closes is what keeps reference comparisons motivating instead of crushing.


Part M — Mixed Practice

Interleaving review: each question reaches back to an earlier chapter and pulls it through this one.

M1. (Chapter 2) The resolution myth, retired. Your keeper vocal peaked at -19 dBFS and a forum commenter says you "threw away half your bits." Explain, with the noise-floor framing from Chapter 2, why the take is effectively as clean as one tracked 6 dB hotter — and name the one scenario in this chapter where conservative gain genuinely saves a take that hot gain destroys.

M2. (Chapter 3) The gain-stage census. List, in order, every gain stage between a singer's lips and the recorded WAV file in your own rig (Chapter 3's chain plus this chapter's settings). Mark the stage where this chapter says THE decision gets made, and name one stage where distortion could enter even while the DAW meter stays green — and how you'd catch it.

M3. (Chapter 7) The figure-8 booth problem. Your only good mic this week is a figure-8 ribbon. Design the vocal setup: where does the singer stand, what does the rear lobe hear in your room (be specific — go look), where do the pattern's side nulls point, and what does the family's fragility rule mean for your pop-filter and plosive plan? One paragraph, every choice justified by Chapter 7 mechanics.

M4. (Chapter 10) The flutter in the gap. Your C1 test recording sounds dry while you sing but has a faint metallic zing trailing each phrase ending. Name the phenomenon (Chapter 10), the two-surface geometry that causes it, the ten-second test that confirms it, and the placement-or-treatment fix that costs less than $30 — then explain why phrase endings exposed it when the phrases themselves didn't.

M5. (Chapters 4 + 9) The bed audit. A singer keeps delivering stiff, careful takes over your programmed bed, and your Chapter 4 critical listen of the headphone mix finds: velocity-flat pad, gridlocked timing, click loud on top. Using Chapter 9's humanization vocabulary, prescribe three bed changes you'd make before saying a word to the singer about performance — and explain why the order (fix the bed first, coach second) is this chapter's psychology applied.


Selected answers: see the appendix "Answers to Selected Exercises." The labs have no answer key — they have your ears, your take log, and tomorrow morning's verdict.