Glossary
Every term here earns its place by showing up in a real decision you'll make. The chapter in parentheses is where the book first defines the term at length — that's where to go for the full story, the diagram, and the ear training. The definitions below are the short version: what the thing is, and more often, what it does to what you hear. Cross-references in bold point to other entries on this page.
Read it cover to cover once and you'll have the vocabulary of the craft. Come back to it whenever a chapter throws a word you half-remember.
Numbers
808 (Ch. 13) — Originally the Roland TR-808 drum machine; today, mostly shorthand for the long, tuned sub-bass kick at the center of hip-hop and trap. An 808 is part kick, part bass note, and it lives so low (roughly 30–60 Hz fundamental) that most speakers only hint at it — which is why you saturate it so its harmonics imply the bottom. See sub bass, saturation.
2-bus — The stereo master bus where every channel sums before it leaves the mix. Sometimes written "stereo bus" or "mix bus." Everything you do here affects the whole record at once, so moves are small. See bus, mastering, headroom.
A
A/D–D/A (Ch. 3) — Analog-to-digital and digital-to-analog conversion. The A/D converter turns the voltage from your mic/preamp into numbers; the D/A turns the numbers back into voltage your speakers can move. Every recording crosses this bridge at least twice, and the quality of the crossing matters more than most marketing admits. See converter, audio interface.
Absorption (Ch. 10) — Treatment that removes acoustic energy by turning sound into a tiny bit of heat — foam, rockwool panels, thick blankets. Absorption tames first reflections and flutter echo; it does not, by itself, fix the low-end problems that need a bass trap. Contrast diffusion.
Aliasing (Ch. 2) — The ugly, inharmonic garbage that appears when a digital system tries to represent frequencies above the Nyquist limit and folds them back down into the audible band as out-of-tune ghosts. Good converters and oversampled plugins prevent it; cheap saturation and pitch-shifting can still cause it. See sample rate.
Ambisonics (Ch. 34) — A full-sphere surround format that captures or encodes sound as a 360° field rather than fixed channels, then decodes to whatever speaker or headphone setup you've got. You'll meet it mostly in VR and 360 video; a mention here so the word isn't a stranger. See binaural, Atmos.
Amplitude (Ch. 1) — The size of a sound wave's pressure swing — how far the air moves. More amplitude means louder; we measure it in decibels. Don't confuse amplitude (loudness) with frequency (pitch); they're the two independent axes of every sound. See dB SPL, waveform.
Arrangement (Ch. 16) — Which parts play, when, and how they're stacked across the song. Arrangement is the highest-leverage tool you own, because most "mix problems" are really arrangement problems wearing a disguise — too many parts fighting for one frequency band. See song sections, energy curve, density.
Atmos (Dolby Atmos) (Ch. 34) — An object-based immersive format: instead of mixing only to fixed channels (left, right, surround), you place sounds as objects in 3D space and let the playback system render them to whatever speakers or headphones are present. The streaming-era spatial format you're most likely to deliver. See binaural, ambisonics, stems vs multitracks.
Attack (compressor) (Ch. 23) — How fast a compressor clamps down once the signal crosses the threshold. Fast attack tames transients and can dull punch; slow attack lets the transient through and grabs the body behind it. Attack is half of why compression is a time tool, not just a loudness tool. See release, transient.
Audio interface (Ch. 8) — The box that gets sound in and out of your computer: mic preamps, A/D–D/A converters, headphone and monitor outputs, often phantom power. It's the heart of your signal chain and usually a better upgrade than another microphone. See converter, monitor.
Automation (Ch. 17, deep in Ch. 27) — Recorded, time-based moves of any parameter — volume, pan, a filter cutoff, an effect send — that play back automatically. Automation is how a static mix becomes a living performance: the vocal rides up a hair in the chorus, the filter opens into the drop. See rides, breakpoints, arrangement.
B
Balance (Ch. 20) — The relative loudness of every element in the mix. Balance is the single most important thing in mixing and the thing beginners under-rate; get the faders right from silence and half your problems vanish before you reach for a plugin. See static mix, gain staging.
Balanced / unbalanced (Ch. 3) — Two ways to wire an audio cable. Balanced connections (XLR, TRS) carry the signal twice with one copy flipped, so the gear can cancel noise picked up along the run — quieter over long distances. Unbalanced (TS, RCA) is fine for short runs but hums if you stretch it. See line/mic/instrument level.
Bass trap (Ch. 10) — Thick, dense absorption placed where low frequencies pile up — usually room corners. Bass traps tame the room modes that make your low end lie to you, and they're the treatment beginners skip and regret. Thin foam does almost nothing down here. See absorption, standing wave.
Binaural (Ch. 34) — Audio mixed or processed for headphones using the cues your two ears actually use to locate sound — level, timing, and the filtering your head and outer ears apply (the HRTF). Done well, it puts sounds outside your skull on ordinary headphones. See HRTF, Haas effect.
Bit depth (Ch. 2) — How many bits describe each sample's amplitude, which sets the dynamic range and noise floor of a digital recording. 24-bit gives you enormous headroom to record into without sweating levels; 16-bit is the CD delivery format. More bits is quieter, not "higher fidelity" in the way people assume. See dither, dBFS.
Breakpoints (Ch. 27) — The individual points you draw or drop on an automation lane that define where a value changes and how fast. Fewer, intentional breakpoints beat a jittery hand-drawn mess. See automation, rides.
Buffer size (Ch. 2) — How big a chunk of audio your computer processes at once, measured in samples. Small buffers mean low latency (good for tracking) but stress the CPU; large buffers are stable for mixing but feel laggy if you're playing live. You'll trade between the two constantly. See latency.
Bus (Ch. 6) — A channel that other channels feed into, so you can process or control a group together — a DRUMS bus, a VOX bus, the 2-bus. Buses are how a 34-track session becomes six things you can actually balance. See send/return, bus architecture.
Bus architecture (Ch. 28) — The deliberate routing plan of your whole session: which tracks feed which buses, in what order, with which processing where. Good architecture makes the mix controllable; bad architecture makes every change a wrestling match. See bus, gain staging.
C
Cardioid (Ch. 7) — The heart-shaped polar pattern that hears mostly what's in front and rejects the rear — the workhorse for vocals and close-miking because it ignores the room behind the singer. See omni, figure-8, proximity effect.
Clipping (Ch. 2) — What happens when a signal tries to exceed the maximum a system can represent and gets flat-topped instead. Digital clipping (over 0 dBFS) is harsh and unmusical; you avoid it with headroom. Analog clipping can be gentle and even desirable — that's saturation's cousin. See dBFS, headroom.
Comping (comp) (Ch. 11, executed Ch. 15) — Assembling one great composite take from several recorded takes, line by line or word by word. Comping is why a "perfect" vocal almost never happened in one pass; you build it. See takes, crossfade.
Compression — See compressor and its controls: threshold, ratio, attack, release, knee, makeup gain, gain reduction.
Compressor (Ch. 23) — A processor that automatically turns the signal down when it gets too loud, controlled by threshold and ratio and shaped over time by attack and release. Used to even out a performance, add punch, or glue a group together. The most misunderstood tool in the box — it manipulates time, not just level. See limiter, parallel compression.
Comping — See comping.
Condenser (Ch. 7) — A microphone type that uses a charged thin diaphragm and needs phantom power. Condensers are sensitive and detailed, great for capturing air and nuance, but they hear the whole room — including the bad parts. See dynamic, ribbon, polar pattern.
Console (Ch. 5) — The mixing desk of the analog studio: rows of channel strips summing to a master, where the verbs fader, bus, insert, and send were born. Your DAW's mixer is a console in software. See track, bus.
Converter (Ch. 8) — The A/D–D/A circuitry that translates between analog voltage and digital numbers. Modern converters are excellent even at modest prices; the old "digital sounds harsh" reputation was bad converters and bad gain staging, not digital itself. See audio interface, sample rate.
Copyright (composition vs master) (Ch. 36) — Two separate rights in every recorded song. The composition copyright covers the song itself — melody and lyrics — and belongs to the writers; the master copyright covers the specific recording and belongs to whoever owns it. Sampling needs clearance on both. See publishing, split sheet, sampling.
Correlation (Ch. 25) — A meter reading (roughly +1 to −1) that tells you how much your left and right channels agree. Near +1 is mono-safe; near −1 means the channels are fighting and may vanish in mono. See mono compatibility, mid-side, phase.
Critical listening (Ch. 4) — Listening for something specific — a frequency, a balance, a flaw — rather than just enjoying the music. It's a learnable skill, not a gift; you hear with your brain, and the brain can be trained. The whole book rests on this. See frequency bands, soundstage.
Crossfade (Ch. 15) — A short overlap between two audio regions where one fades out as the other fades in, hiding the edit so you don't hear a click or a jump. The editor's best friend when comping. See comping, region/clip.
D
dBFS (decibels full scale) (Ch. 2) — The digital loudness scale where 0 is the absolute ceiling and everything below is negative (−6, −18, etc.). Cross 0 dBFS and you clip. "Record around −18 dBFS average" is a habit that leaves you headroom, not a law. Contrast dB SPL, LUFS.
dB SPL (decibels sound pressure level) (Ch. 1) — The loudness of actual sound in the air, the thing your ears and an SPL meter measure. Used for monitoring levels, hearing safety, and the Fletcher–Munson story. Different beast from dBFS (a digital number). See amplitude.
De-esser (Ch. 29) — A frequency-targeted compressor that ducks only the harsh "ess" and "sh" sounds (sibilance, roughly 5–10 kHz) without dulling the whole vocal. Reach for it before you carve sibilance with broad EQ. See sibilance, compressor.
Decay (Ch. 24) — How long a reverb or sound takes to fade to silence — the reverb's length. Short decay reads as a small room; long decay as a hall or a cathedral. See reverb types, pre-delay, RT60.
Delay (Ch. 24) — An effect that repeats the signal one or more times after a set interval — a slap, an eighth-note, a dotted-eighth. Delay creates space and rhythm and, used in place of reverb, keeps a mix clearer. See feedback, Haas effect, throws.
Density (Ch. 16) — How much is happening at once — how full the arrangement is at a given moment. Managing density (when to add, when to strip down) is how you build contrast and keep the listener's ear fresh. See arrangement, contrast, energy curve.
DI (direct injection / direct input) (Ch. 12) — Recording an instrument's electrical signal straight into the interface, bypassing an amp and mic — common for bass and for capturing a clean guitar you can re-amp later. A DI is dry, controllable, and re-shapeable. See re-amping, line/mic/instrument level.
Diagnostic listening (Ch. 30) — Listening to a finished mix specifically to name what's wrong using a shared symptom vocabulary — muddy, harsh, thin, boxy, lifeless, small — so you can aim the fix. The skill that turns "something's off" into a to-do list. See critical listening, null test.
Diffusion (Ch. 10) — Treatment that scatters sound in many directions instead of absorbing it, keeping a room lively while killing harsh reflections — the opposite tool from absorption. Often overkill in a small bedroom studio. See first reflection, flutter echo.
Distortion (harmonic) (Ch. 26) — New frequencies (harmonics) added to a signal when it's pushed past linear behavior. Even harmonics tend to sound warm and full; odd harmonics sound edgy or aggressive. Controlled distortion is saturation; uncontrolled is noise. See saturation, drive.
Distributor (Ch. 35) — The service that delivers your finished master to streaming platforms and stores and collects the per-stream money on your behalf — DistroKid, TuneCore, CD Baby, and the like. The on-ramp from your hard drive to DSPs. See DSP, metadata, ISRC/UPC.
Dither (Ch. 2) — A tiny, carefully shaped noise added when you reduce bit depth (e.g., 24-bit down to 16-bit for CD) so the quantization error becomes smooth hiss instead of gritty distortion. Apply it once, last, on the final bounce. See bit depth, mastering.
Doubling (Ch. 17) — Recording (or duplicating) a part twice so the two copies thicken and widen it — classic on vocals, guitars, and synths. Two real performances double-tracked sound bigger and more natural than one cloned twice. See layering, stereo width.
Drive (Ch. 26) — The amount of signal you push into a saturator, amp, or distortion stage — more drive, more harmonics and grit. The input knob that decides how hard you're hitting the saturation. See distortion, gain staging.
DSP (digital service provider) (Ch. 35) — A streaming platform or download store — Spotify, Apple Music, YouTube Music, and the rest. (Note: in mixing contexts "DSP" can also mean digital signal processing; this book uses the streaming meaning unless stated.) See distributor, metadata.
Dynamic (microphone) (Ch. 7) — A rugged, moving-coil mic that needs no power, handles high SPL, and tends to reject room noise — the broadcast and stage workhorse (think SM57/SM58). Less detailed up top than a condenser, which is often exactly what loud sources want. See condenser, ribbon.
Dynamic range / PLR (Ch. 33) — The distance between the loudest and quietest parts of a track. PLR (peak-to-loudness ratio) compares the true peak to the average LUFS — a rough gauge of how "smashed" or "open" a master is. Streaming normalization made dynamic range a creative choice again. See LUFS, loudness war.
E
Ear candy (Ch. 17) — The small, often-buried details that reward repeat listens — a reversed cymbal, a whispered ad-lib, a one-bar filter sweep. Tasteful ear candy makes a track feel crafted without cluttering it. See arrangement, layering.
Energy curve (Ch. 16) — The rise and fall of intensity across a song — where it builds, drops, peaks. Mapping the energy curve keeps the arrangement from being flat (everything at 100%) or shapeless. See arrangement, density, contrast.
Envelope (ADSR) (Ch. 14) — The shape of how a synth sound (or any parameter) evolves over a single note: Attack (rise), Decay (fall to), Sustain (held level), Release (fade after note-off). ADSR is how you turn a static oscillator into a pluck, a pad, or a swell. See oscillator, LFO, filter.
EPK (electronic press kit) (Ch. 37) — A tidy package — bio, photos, music links, press quotes — you send to playlist curators, blogs, and venues so they don't have to dig. Your professional first impression in one link. See playlist pitching.
EQ (equalization) (Ch. 22) — Boosting or cutting specific frequency bands to fix or shape tone. The most-used tool in mixing, and the one where cutting usually beats boosting. See EQ types, Q, masking.
EQ types (Ch. 22) — The filter shapes an equalizer offers: a bell (boost/cut a band with adjustable Q), a shelf (lift or drop everything above/below a point), a high-pass (HP, removes lows below a point), a low-pass (LP, removes highs above a point). Knowing which shape to grab is half of EQ. See Q, EQ.
F
Feedback (delay) (Ch. 24) — How much of a delay's output is fed back into its input, setting how many repeats you hear before they die away. Low feedback is a slap; high feedback is a long, trailing echo. See delay, decay.
Figure-8 (bidirectional) (Ch. 7) — A polar pattern that hears equally from front and back but rejects the sides — the basis of ribbon mics and of mid-side recording. See cardioid, omni, mid-side.
Filter (synth/processing) (Ch. 14) — A circuit or plugin that removes or emphasizes part of the spectrum: a low-pass (LPF) removes highs, a high-pass (HPF) removes lows, and resonance boosts a peak right at the cutoff. The filter is where most of a subtractive synth's movement and character live. See subtractive, envelope, LFO.
First reflection (Ch. 10) — The first bounce of your speakers' sound off a nearby wall, ceiling, or desk before it reaches your ears — the reflection that smears your soundstage most. Treating first-reflection points (the "mirror trick") is the highest-value acoustic fix in a small room. See absorption, sweet spot.
Fletcher–Munson / equal-loudness (Ch. 4) — The fact that your ears' frequency balance changes with volume: quiet music sounds bass- and treble-shy, loud music sounds fuller. This is why you check mixes at multiple levels and why "louder always sounds better." See dB SPL, critical listening.
Flutter echo (Ch. 10) — The fast, metallic "brrrip" you hear clapping in an empty room with two parallel hard walls — sound bouncing rapidly back and forth. A dead giveaway of an untreated space. Broadband absorption or diffusion kills it. See first reflection.
FM (frequency modulation) synthesis (Ch. 14) — A synthesis method where one oscillator modulates another's frequency, producing complex, often metallic or bell-like tones from few parts — the sound of '80s electric pianos and basses. Contrast subtractive, wavetable. See oscillator.
Frequency (Ch. 1) — How many times a sound wave cycles per second, measured in hertz (Hz); it's what you perceive as pitch. Low frequency = low pitch, high frequency = high pitch. The independent partner of amplitude. See octave, harmonic.
Frequency bands (Ch. 4) — The six working zones the book divides the spectrum into — sub, low, low-mid, mid, high-mid, air — so you can name where a sound lives and where a problem hides. The shared language of every EQ decision. See EQ, masking, Appendix A.
Frequency response (Ch. 8) — How evenly a piece of gear — a mic, a speaker, a room — reproduces all frequencies. A "flat" response colors nothing; real monitors and rooms tilt the truth, which is why you learn your system's lies. See monitor, sweet spot.
G
Gain reduction (GR) (Ch. 23) — How many decibels a compressor or limiter is currently turning the signal down, shown on the GR meter. A few dB of GR is gentle leveling; lots is obvious squashing. The number you actually watch when compressing. See compressor, threshold.
Gain stage (Ch. 3) — Any point in the signal chain where level is set or changed — the preamp, a plugin's input, a fader. Every gain stage can add noise or distortion if mis-set; finding and managing them all is gain staging. See gain staging, unity gain.
Gain staging (Ch. 21) — Setting healthy levels at every gain stage so nothing is too hot (distorting) or too quiet (noisy), leaving consistent headroom through the whole chain. Levels are relationships, not absolutes — get them right early and everything downstream behaves. See headroom, unity gain.
Generative audio (Ch. 38) — AI systems that create new audio — melodies, drum fills, whole tracks — from prompts or examples rather than from a human performance. A fast-moving tool with real uses and real ethical and legal knots. See AI mastering, stem separation, training-data debates.
Glue (Ch. 23) — The cohesive feel a gentle bus compressor gives a group, making separately recorded parts sound like they're breathing together in one room. Usually just 2–3 dB of gain reduction on a drum or mix bus. See compressor, bus.
Groove (Ch. 13) — The feel of a rhythm — the tiny timing and velocity push-and-pull that makes a beat move instead of march. Quantizing to a rigid grid kills groove; humanizing restores it. See swing, humanization, quantize.
H
Haas effect (Ch. 25) — The psychoacoustic trick where a delay under ~30–40 ms is heard not as an echo but as direction and width — the basis of many stereo-widening tricks. Push it too far and the mono compatibility suffers. See delay, stereo width, correlation.
Harmonic / overtone (Ch. 1) — The higher frequencies that ring above a note's fundamental and give every instrument its timbre — the reason a guitar and a flute playing the same pitch sound different. Adding harmonics is what saturation does. See frequency, timbre.
Headphone mix (Ch. 11) — The custom monitor blend a performer hears in their headphones while tracking — often more of themselves, a click, and a little reverb for comfort. A good headphone mix gets a better take; a bad one makes singers push or drift. See takes, monitor.
Headroom (Ch. 21) — The safety margin between your normal signal level and the clipping ceiling. Leaving headroom (say, peaks around −6 dBFS before mastering) means transients, processing, and the master stage have room to work. See gain staging, dBFS, 2-bus.
HRTF (head-related transfer function) (Ch. 34) — The specific way your head, shoulders, and outer ears filter sound depending on where it comes from — the cues your brain decodes into "that's above and behind me." Binaural and Atmos headphone rendering simulate an HRTF. See binaural, ambisonics.
Humanization (Ch. 9) — Deliberately loosening programmed MIDI's perfect timing and uniform velocity so it feels played by a human rather than stamped by a machine. The cure for the lifeless, robotic grid. See velocity, quantize, groove.
I
Insert (Ch. 6) — A processor placed in line on a single channel, so 100% of that channel's signal passes through it — the normal home for EQ and compression. Contrast a send/return, where you tap a copy off to a shared effect. See send/return, bus.
ISRC (International Standard Recording Code) (Ch. 31, deep Ch. 35) — A unique code that identifies one specific recording worldwide, used to track streams, sales, and royalties. Every track you release should carry one; your distributor can assign it. Contrast UPC (which identifies a release). See metadata, distributor.
K
Knee (Ch. 23) — How abruptly a compressor transitions into compression as the signal approaches the threshold. A hard knee clamps suddenly (punchy, obvious); a soft knee eases in (gentle, transparent). See threshold, ratio.
L
Latency (Ch. 2) — The delay between an action and hearing the result — between singing and hearing yourself back through the computer. Driven mostly by buffer size; low latency matters when tracking, less when mixing. See buffer size.
Layering (Ch. 17) — Stacking multiple sounds to build one bigger sound — two kicks for weight and click, three synths for a wall of pad. Layering is powerful and dangerous; every layer is also more masking. See doubling, density.
LCR (left-center-right) (Ch. 25) — A panning philosophy that places elements hard left, dead center, or hard right rather than scattered in between — clean, wide, and translation-friendly. The starting grid for a tidy stereo image. See panning, pan law, stereo width.
LFO (low-frequency oscillator) (Ch. 14) — A slow oscillator used not to make sound but to move another parameter — wobbling a filter cutoff, adding vibrato, pulsing volume. The source of most rhythmic movement in synth patches. See oscillator, envelope, filter.
Limiter (Ch. 23, stages in Ch. 32) — A compressor with a very high ratio and fast action that acts as a brick wall, stopping the signal from exceeding a set ceiling — the last plugin in mastering, used to raise loudness and catch peaks. See compressor, true peak, ceiling.
Line/mic/instrument level (Ch. 3) — The three standard signal strengths: mic level is tiny (needs a preamp), instrument level (a guitar pickup) is small and high-impedance (needs a DI or instrument input), line level is the strong standard most gear passes around. Feed the wrong level to the wrong input and you get noise or distortion. See preamp, gain stage.
Linear-phase EQ (Ch. 32) — An EQ that adjusts tone without shifting the phase relationships between frequencies, at the cost of latency and possible pre-ringing. Prized in mastering and mid-side work; usually overkill on a single track. See EQ, multiband, mastering.
Loop (Ch. 13) — A short audio or MIDI phrase set to repeat seamlessly — a drum loop, a melodic loop. The building block of a lot of electronic and hip-hop production. Contrast one-shot. See sampler, groove.
LUFS (Loudness Units Full Scale) (Ch. 33) — The standardized measure of perceived loudness over time, the unit streaming platforms use to normalize playback. Integrated LUFS measures the whole track; short-term and momentary measure windows. Master to a target (often around −10 to −14 LUFS integrated) instead of chasing peaks. See true peak, normalization, dynamic range.
M
Makeup gain (Ch. 23) — The level you add back after a compressor has turned the loud parts down, to restore the overall volume. Without makeup gain, compression just makes things quieter. Match levels before you judge the move. See gain reduction, threshold.
Masking (Ch. 4, mix context Ch. 22) — When a louder sound hides a quieter one that shares its frequency band — the bass burying the kick, the pad swallowing the vocal. Most mixing is really un-masking: carving so every part has a clear pocket. See EQ, frequency bands.
Mastering (Ch. 31) — The final stage: quality control and translation that makes a mix competitive in loudness and consistent across systems and tracks. Mastering is polish and a safety net, not a rescue — it cannot fix a broken mix. See 2-bus, LUFS, limiter.
Master chain (Ch. 31, Ch. 32) — The ordered set of processors on the 2-bus during mastering — typically corrective EQ, then glue compression, then a limiter last. Order matters as much as the moves. See mastering, linear-phase EQ.
Mechanical / performance / sync royalty (Ch. 36) — Three of the ways music earns. A mechanical royalty is paid for reproducing a composition (a stream or sale); a performance royalty is paid when it's played publicly (radio, venues, streaming), collected by a PRO; a sync fee is paid to use music in film, TV, ads, or games. See publishing, PRO, copyright.
Metadata (Ch. 35) — The information attached to your release — track titles, artist names, ISRC/UPC codes, writer and producer credits, genre. Wrong or missing metadata means missing money and broken credits. See distributor, ISRC/UPC.
Mic bleed (Ch. 12) — Sound from one source leaking into a microphone aimed at another — the snare in the vocal mic, the room in everything. Manageable with placement and polar pattern; sometimes a feature, often a headache at mix time. See phase, close/room mic.
Mid-side (M/S) (Ch. 25, deep Ch. 28) — A way of handling stereo as a mid (center, mono-summed) channel and a side (the differences) channel, so you can process the center and the width independently. Powerful for widening, mono-compatible cleanup, and mastering. See stereo width, correlation, mono compatibility.
MIDI (Ch. 9) — The protocol that carries performance data — which note, how hard (velocity), when — rather than audio. MIDI drives virtual instruments and is endlessly editable: change the notes, the sound, the timing after the fact. See velocity, piano roll, quantize.
Mix (Ch. 20) — The craft (and the result) of combining all the recorded and programmed tracks into a balanced, finished stereo whole — levels, tone, space, width, and dynamics in service of the song. See balance, static mix, mastering.
Mono compatibility (Ch. 20, Ch. 25) — Whether your stereo mix still sounds right when summed to mono, as it will on phones, club systems, and smart speakers. Wide tricks that collapse or cancel in mono are a real-world failure, not a clever flex. See correlation, phase, Haas effect.
Monitor (Ch. 8) — A loudspeaker (usually a nearfield monitor) or headphone designed to reveal a mix accurately rather than flatter it. "Monitoring" is the whole act of listening back through your speakers, headphones, and room. See sweet spot, frequency response, open/closed-back.
Multiband (Ch. 32) — Processing (usually compression) that splits the signal into frequency bands and treats each separately — so you can tame a boomy low end without squashing the highs. Powerful and easy to overdo; a scalpel, not a default. See compressor, mastering, linear-phase EQ.
Multitrack (Ch. 5) — Recording each instrument or voice to its own separate track so they can be balanced and processed independently — the foundation of modern production. Contrast stems (grouped bounces). See track, overdub, console.
N
Non-destructive editing (Ch. 6) — Editing that changes how audio plays back without altering the original recorded file, so every cut, fade, and process is reversible. The freedom that lets you experiment fearlessly in a DAW. See region/clip, crossfade.
Normalization (Ch. 33) — Adjusting a track's level to hit a target. Peak normalization raises the loudest peak to a ceiling; loudness normalization (what streaming does) turns every track up or down to match a reference LUFS. The reason the loudness war ended. See LUFS, true peak, dynamic range.
Null test (Ch. 30, sidebar) — Flipping the phase of one signal against another and summing them; whatever's identical cancels to silence, leaving only the difference. A precise way to hear exactly what a process changed. See phase, diagnostic listening.
Nyquist (Ch. 2) — The principle that a digital system can perfectly capture any frequency up to half its sample rate (the Nyquist frequency); anything above that must be filtered out or it becomes aliasing. Why 44.1 kHz comfortably covers human hearing. See sample rate, aliasing.
O
Octave (Ch. 1) — The interval between a frequency and double (or half) that frequency — 220 Hz to 440 Hz is one octave. Notes an octave apart sound like "the same note, higher/lower." A core unit for thinking about pitch and frequency bands. See frequency, harmonic.
Off-axis coloration (Ch. 7) — How a microphone's tone changes for sound arriving from the side or rear rather than straight on. Good mics stay natural off-axis; cheap ones get honky, which makes mic bleed sound worse. See polar pattern, proximity effect.
Omni (omnidirectional) (Ch. 7) — A polar pattern that hears equally from all directions — natural and open, with no proximity effect, but it captures the whole room, good and bad. See cardioid, figure-8.
One-shot (Ch. 13) — A single, non-looping sample triggered once per hit — a single kick, snare, or vocal chop. The opposite of a loop, and the raw material of most programmed drums. See sampler, loop.
Open-back / closed-back (Ch. 8) — Two headphone designs. Open-back sounds more natural and spacious (great for mixing) but leaks sound and offers no isolation; closed-back isolates (great for tracking next to a live mic) but can color the low end. See monitor, headphone mix.
Oscillator (Ch. 14) — The part of a synth that generates the raw tone — a sine, square, saw, or other waveform — before the filter and envelope shape it. The sound source at the start of every synth voice. See filter, envelope, subtractive.
Overdub (Ch. 5) — Recording a new part while listening to already-recorded parts, building the arrangement layer by layer. The technique that turned recording from a live snapshot into a construction process. See multitrack, takes.
Overtone — See harmonic / overtone.
P
Pan law (Ch. 21 brief, deep Ch. 25) — The level compensation a DAW applies as you pan a sound toward center, so a centered element doesn't jump louder than a panned one. Different pan laws (−3, −4.5, −6 dB) explain why a mix's balance can shift when you change DAWs. See panning, LCR.
Panning (Ch. 25) — Placing a sound across the stereo field from left to right. Panning creates space, separation, and width — and good panning un-masks crowded mids by giving parts their own side. See LCR, stereo width, mono compatibility.
Parallel (New York) compression (Ch. 28) — Blending a heavily compressed copy of a signal under the untouched original, so you add density and punch without crushing the transients. The trick behind huge-but-natural drums and vocals. See compressor, bus.
Patch (Ch. 14) — A single saved synth or instrument sound — the full set of oscillator, filter, envelope, and modulation settings that make "the static bloom pad." Old modular synths were "patched" with cables; the name stuck. See virtual instrument, subtractive.
Phantom power (Ch. 3) — The 48-volt supply (labeled "48V" or "+48") that audio interfaces and consoles send up the mic cable to power condenser mics. Turn it on for condensers; leave it off for ribbons, which it can damage. See condenser, balanced/unbalanced.
Phase (Ch. 12, Ch. 25) — The timing relationship between waveforms. When two versions of a sound arrive slightly offset (two mics on one source, a wide stereo trick), some frequencies reinforce and others cancel — thin, hollow, or weird. Managing phase is central to multi-mic recording and stereo width. See mic bleed, correlation, null test.
Piano roll (Ch. 9) — The grid editor where MIDI notes appear as bars — pitch up the side, time across — that you draw, drag, and trim. Where most programmed music is actually written and edited. See MIDI, velocity, quantize.
Pitch correction (Ch. 15) — Software that nudges sung or played notes toward the intended pitch — from invisible tuning to the hard, robotic effect made famous on Cher's "Believe." A transparent tool when used with restraint, a sound in its own right when pushed. See editing grid, comping.
Plosive (Ch. 11) — The thump of air from "p" and "b" sounds hitting a mic capsule, producing a low-frequency pop. Tamed with a pop filter, mic angle, and distance — far easier to prevent than to fix. See pop filter, sibilance.
Plugin (Ch. 6) — Software that adds processing or instruments inside your DAW — an EQ, a compressor, a synth, a reverb. The modern studio is mostly plugins; knowing how a tool works beats owning more of them. See insert, virtual instrument.
PLR — See dynamic range / PLR.
Polar pattern (Ch. 7) — The shape of what a microphone hears from different directions — cardioid, omni, figure-8, and their cousins. Choosing the pattern is choosing how much room and bleed you let in. See cardioid, omni, off-axis coloration.
Pop filter (Ch. 11) — The mesh or foam screen between mouth and mic that diffuses the air blast from plosives before it pops the capsule. Cheap insurance on every vocal session. See plosive, condenser.
Pre-delay (Ch. 24) — The short gap between the dry sound and the start of its reverb tail. A little pre-delay keeps the reverb from smearing the source, preserving clarity while still adding space. See reverb types, decay.
Preamp (Ch. 3) — The circuit that boosts tiny mic-level signal up to usable line level, setting the first and most important gain stage. A clean preamp with the right gain matters more than its brand mystique. See gain stage, phantom power.
Pre-save (Ch. 35) — A release-campaign tool that lets fans commit before release day to automatically save your track to their library when it drops — building first-day streaming momentum. See distributor, release strategy.
PRO (performing rights organization) (Ch. 36) — A body like ASCAP, BMI, SESAC, or PRS that collects performance royalties on songwriters' and publishers' behalf when their songs are played publicly. Register your songs or you don't get paid for plays. See publishing, mechanical/performance/sync royalty.
Proximity effect (Ch. 7) — The bass boost a directional mic (cardioid, fig-8) produces as the source gets very close — the "radio voice" warmth, and the reason vocal distance is a tone control. Move in for body, back off for balance. See cardioid, off-axis coloration.
Publishing (Ch. 36) — The business of administering and monetizing the composition — collecting mechanical and performance money, licensing songs, registering works. A publisher (or a self-publishing setup) handles the songwriter's side of the income. See copyright, PRO, split sheet.
Pumping (Ch. 23) — The audible breathing of a compressor turning down and recovering in time with the music — often the kick ducking the whole mix. A flaw when accidental, a deliberate effect in a lot of dance music. See release, sidechain.
Q
Q (Ch. 22) — How wide or narrow an EQ band is. A high Q is surgical (narrow — good for killing a single resonance); a low Q is broad and musical (good for gentle tone shaping). The width control that decides whether you operate or sculpt. See EQ types, EQ.
Quantize (Ch. 9) — Snapping MIDI (or audio) notes to a rhythmic grid to tighten timing. Full quantize locks everything to perfect; partial quantize tightens while keeping feel. Over-quantizing is how a groove dies. See groove, swing, humanization.
R
Ratio (Ch. 23) — How hard a compressor clamps once past the threshold: 2:1 means for every 2 dB the input rises above threshold, the output rises only 1 dB. Low ratios level gently; high ratios (10:1 and up) approach limiting. See threshold, knee, limiter.
Re-amping (Ch. 12) — Sending a recorded clean DI back out through a real amplifier (or a sim) and re-recording it, so you choose the amp tone after the performance is safely captured. Maximum flexibility, zero pressure on the take. See DI, close/room mic.
Region / clip (Ch. 6) — A block of audio or MIDI on a track that you can move, trim, fade, copy, and process — your DAW's basic editing unit. (DAWs call it a "clip," "item," or "region"; same thing.) See track, non-destructive editing, crossfade.
Release (compressor) (Ch. 23) — How fast a compressor stops compressing once the signal falls back below threshold. Fast release can sound lively or pump; slow release is smooth and transparent. The other half of compression's time control. See attack, pumping.
Resonance (Ch. 14) — A boost in level right at a filter's cutoff frequency, emphasizing it — push it up and you get the squelchy, vocal, "acid" quality as you sweep the filter. The character knob next to cutoff. See filter, subtractive.
Reverb types (Ch. 24) — The flavors of artificial space: a room (short, intimate), a plate (smooth, dense, classic on vocals), a hall (big, lush, long), a spring (boingy, the guitar-amp sound). Choosing the right type sets the size and character of your space. See decay, pre-delay, wet/dry.
Ribbon (microphone) (Ch. 7) — A mic using a thin metal ribbon in a magnetic field, naturally figure-8, smooth and warm with a gentle top end — flattering on bright or harsh sources. Fragile, and usually killed by phantom power, so handle with care. See dynamic, condenser, figure-8.
Rides (Ch. 27) — Hands-on automation moves of a fader to keep a part — usually the vocal — sitting right phrase by phrase, word by word. A rode vocal sits in the mix in a way a compressor alone can't manage. See automation, breakpoints.
Riser (Ch. 14, Ch. 17) — A transition effect that builds tension into a drop or chorus — a rising pitch, swelling noise, or accelerating rhythm. The most common piece of ear candy in modern production. See ear candy, arrangement.
Room mode — See standing wave / room mode.
RT60 (Ch. 10) — The time it takes a sound in a room to decay by 60 dB — a single-number measure of how "live" or "dead" a space is. Big number = echoey hall; small number = treated, controlled room. Used here intuitively, not as homework. See decay, absorption, reverb types.
S
Sample rate (Ch. 2) — How many times per second a recording samples the audio — 44.1 kHz, 48 kHz, 96 kHz. It sets the highest frequency you can capture (the Nyquist limit). 48 kHz is the practical session standard in this book. See Nyquist, bit depth, aliasing.
Sampler (Ch. 9, Ch. 13) — An instrument that plays back recorded audio (a piano note, a vocal chop, a drum hit) mapped across a keyboard, pitching and shaping it. Distinct from a synth, which generates tone. See virtual instrument, one-shot, loop.
Sampling (technique) (Ch. 13) — Building new music from snippets of existing recordings — a drum break, a soul vocal, a film line. Creatively foundational to hip-hop and electronic music, and legally loaded: clearing a sample is a copyright matter. See copyright, one-shot, loop.
Saturation (Ch. 26) — Gentle, musical distortion that adds harmonics, warmth, and perceived loudness without sounding "broken" — the soul of tape, tube, and analog console color. Saturation makes a sound feel bigger and helps small speakers hear deep bass. See distortion, drive, tape saturation.
Send / return (Ch. 6) — A routing where you tap a copy of a channel's signal (the send) off to a shared effect on its own channel (the return), so several tracks can share one reverb or delay. Contrast an insert (100% in line, one channel). See insert, bus.
Session (Ch. 6) — The whole project file and its assets — tracks, regions, plugins, automation, settings — that make up your work in the DAW. (Called a "project," "session," or "song" depending on the DAW.) Good session hygiene saves real grief later. See track, non-destructive editing.
Sibilance (Ch. 11) — The harsh, hissy energy of "s," "sh," and "t" sounds (roughly 5–10 kHz) that spits on vocals, especially on earbuds. Tamed with a de-esser or careful automation, not a broad EQ cut that dulls the whole voice. See de-esser, plosive.
Sidechain (Ch. 28) — Triggering one processor from a different signal — most famously ducking a bass with a compressor keyed off the kick, so the kick punches through cleanly. The engine behind pumping dance mixes and clean kick/bass relationships. See compressor, pumping.
Signal chain (Ch. 3) — The complete path your sound travels — source → mic → preamp → converter → DAW → and back out to the monitors. Every link can help or hurt; understanding the chain is understanding where problems enter. See gain stage, A/D–D/A.
Sine / square / saw (Ch. 1) — Three basic waveforms. A sine is the pure tone, a single frequency with no harmonics; a square is hollow and buzzy (odd harmonics); a sawtooth is bright and rich (all harmonics) — the raw materials of subtractive synthesis. See oscillator, harmonic, timbre.
Slap (slapback) (Ch. 24) — A single, short delay (roughly 80–140 ms) with no repeats — the rockabilly vocal and guitar sound that adds life and a sense of room without a reverb tail. See delay, feedback.
Song sections (Ch. 16) — The labeled parts of a song's form — intro, verse, pre-chorus, chorus, bridge, outro — and how they're ordered. Thinking in sections is how you shape an arrangement and its energy curve. See arrangement, energy curve.
Soundstage (Ch. 4) — The illusion of a three-dimensional space — width, depth, and placement — created by a stereo mix and revealed by good monitoring. You build it with panning, reverb, and level; you lose it to first reflections in an untreated room. See stereo width, panning.
SPL handling (Ch. 7) — How loud a sound source a microphone can take before it distorts. Dynamic mics shrug off screaming amps and kick drums; sensitive condensers may need a pad. Match the mic to the source's loudness. See dynamic, proximity effect.
Standing wave / room mode (Ch. 10) — Low-frequency resonances that build up in a room wherever its dimensions reinforce certain wavelengths — making some bass notes boom and others disappear depending on where you sit. The reason your low end lies to you and why bass traps exist. See bass trap, RT60, sweet spot.
Static mix (Ch. 20) — A rough mix done with faders alone (and pans), set from silence, before any EQ, compression, or effects — the foundational balance the whole mix grows from. The most important and most-skipped 90 minutes of mixing. See balance, gain staging, mono compatibility.
Stem separation (Ch. 38) — AI processing that pulls a finished stereo mix apart into approximate instrument stems — vocals, drums, bass, other. Astonishing and imperfect; useful for remixing, sampling, and study. See stems vs multitracks, generative audio.
Stems vs multitracks (Ch. 19) — Two ways to hand off a session. Multitracks are every individual track exported separately (dozens of files); stems are grouped sub-mixes (drums, bass, vocals, music) bounced down to a handful. Know which one the recipient actually wants. See multitrack, bus.
Stereo width (Ch. 25) — How wide the mix feels across the speakers, from dead-center mono to fully spread. Width comes from panning, doubling, stereo effects, and mid-side tricks — and every bit of it must survive a mono compatibility check. See panning, LCR, Haas effect.
Subtractive synthesis (Ch. 14) — The classic synth method: start with a harmonically rich oscillator (saw or square) and subtract with a filter, shaped by an envelope. The most intuitive synthesis to learn and the one behind most analog-style sounds. Contrast FM, wavetable. See oscillator, filter.
Sub bass (Ch. 13) — The lowest octaves of bass (below roughly 60 Hz) — felt more than heard, carried by 808s and synth bass. Mismanaged, it eats all your headroom and vanishes on small speakers; managed (and saturated), it's the foundation. See 808, saturation, headroom.
Sweet spot (Ch. 8) — The listening position where your two monitors and your head form a tidy equilateral triangle and the soundstage snaps into focus. Mix from the sweet spot or you're mixing someone else's experience. See monitor, first reflection.
Swing (Ch. 13) — Shifting every other subdivision slightly late to create a rolling, human, "swung" feel instead of a straight grid. A few percent of swing is the difference between a beat that marches and one that grooves. See groove, quantize, humanization.
T
Takes (Ch. 11) — The individual recorded passes of a performance — take 1, take 2, take 7 — that you later comp into one composite best version. Record more takes than you think you need; editing them is cheaper than re-tracking. See comping, headphone mix.
Tape saturation (Ch. 5, Ch. 26) — The warm compression and harmonic color analog tape adds as it's pushed — rounded transients, fattened lows, a gentle high-end softening. A sound producers now chase deliberately with plugins. See saturation, console.
Throws (Ch. 29) — Momentary effect sends — a delay or reverb thrown onto the last word of a line — that decorate a vocal in motion without drowning it. Automation makes throws land exactly where they should. See delay, reverb types, rides.
Threshold (Ch. 23) — The level a signal must cross before a compressor, limiter, or gate starts acting. Lower the threshold and more of the signal gets processed. The first knob you set on any dynamics processor. See ratio, gain reduction.
Timbre (Ch. 1) — The tone color that distinguishes two sounds at the same pitch and loudness — why a violin and a trumpet on the same note sound different. Timbre comes mostly from the pattern of harmonics. See harmonic, waveform.
Track (Ch. 6) — A single lane in your session that holds one source's audio or MIDI — the vocal track, the kick track — with its own fader, pan, and processing. The basic unit of a multitrack recording. See region/clip, bus, session.
Training-data debates (Ch. 38) — The unresolved arguments over whether AI music tools were trained on copyrighted recordings without permission, and what that means for ethics, law, and royalties. Context you need before leaning on generative audio. See generative audio, voice cloning ethics.
Transient (Ch. 1) — The brief, loud spike at the start of a sound — the pick attack, the drum hit, the consonant. Transients carry punch and clarity, and how you treat them (with attack/release) shapes a mix's energy. See attack, compressor.
True peak (Ch. 33) — The real maximum level of a signal between samples, which can exceed the sample peaks and cause distortion when converted to analog or to a lossy format. Master to a true-peak ceiling (commonly −1 dBTP) for safety. See LUFS, limiter, normalization.
U
Unbalanced — See balanced / unbalanced.
Unity gain (Ch. 21) — A setting where a stage neither boosts nor cuts the signal — output level equals input level, the fader at "0." Keeping plugins at unity (matched input/output) lets you judge processing, not loudness. See gain staging, makeup gain.
UPC (Universal Product Code) (Ch. 35) — The code that identifies a whole release — a single, EP, or album — as a product, the way an ISRC identifies one recording. Your distributor typically assigns it. See ISRC, metadata.
V
Velocity (Ch. 9) — In MIDI, how hard a note was struck (0–127), which usually controls loudness and often timbre too. Varying velocity is the first and biggest step in making programmed parts feel human. See MIDI, humanization, piano roll.
Velocity layers (Ch. 13) — Different sampled recordings of an instrument triggered at different velocities, so a soft hit and a hard hit use genuinely different sounds — the realism behind good sampled drums and pianos. See velocity, sampler.
Version control (Ch. 19) — Keeping dated, numbered saves of a session (mix_v3, mix_v4_final) so you can go back when a "better" idea turns out worse. The discipline that saves you from your own revisions. See session, stems vs multitracks.
Virtual instrument (Ch. 9) — A software instrument — synth or sampler — played by MIDI inside your DAW, replacing a rack of hardware. The sound source for most programmed music. See MIDI, patch, plugin.
Voice cloning ethics (Ch. 38) — The questions raised by AI that can imitate a specific singer's voice without consent — questions of permission, likeness rights, and fraud. A live issue the moment you touch voice AI. See generative audio, training-data debates.
VU / average vs peak (Ch. 21) — Two ways of reading level. A VU meter (and "average"/RMS metering) tracks perceived loudness, sluggish and musical; a peak meter catches instantaneous maxima that decide clipping. You need both — they tell different truths about the same signal. See gain staging, headroom, true peak.
W
Warp (elastic audio) (Ch. 15) — Time-stretching audio to fit a tempo or fix timing without changing pitch — "warping" or "flex"/"elastic" editing. Powerful for tightening a performance; obvious and artifact-ridden when pushed too hard. See quantize, pitch correction, groove.
Waveform (Ch. 1) — The shape of a sound's pressure changes over time, the squiggle you see on a track. Its shape determines timbre; simple shapes (sine, square, saw) are the synth primitives. See amplitude, frequency, timbre.
Wavetable synthesis (Ch. 14) — Synthesis that scans through a set of stored single-cycle waveforms, morphing between them over time for evolving, modern digital tones. Contrast subtractive and FM. See oscillator, patch.
Wet / dry (Ch. 24) — The balance between the processed ("wet") signal and the untouched ("dry") original on an effect — how much reverb, delay, or color you're actually hearing. The mix knob that decides how present an effect is. See reverb types, send/return, parallel compression.
This glossary covers the core working vocabulary of the book. For the frequencies behind the words, see Appendix A (The Frequency Chart); for the decibel and loudness scales, Appendix B; and for what every control is called in your particular DAW, Appendix E (The DAW Translation Guide). When a term sends you to a chapter, that chapter is where the concept lives in full — with the diagram, the ear training, and the war story.