Chapter 31 Key Takeaways: The Physics of Recording — From Edison to Digital

Core Physical Principles

Recording preserves pattern, not energy. No recording technology captures the actual acoustic energy of a performance. Every medium captures a pattern isomorphic to the original sound wave, which can be used to recreate pressure variations later. The accuracy of this representation is limited by the physics of the storage medium.

Mechanical recording encodes frequency as spatial wavelength. On Edison's phonograph and later vinyl records, the frequency of a sound determines how rapidly the groove undulates. Higher frequencies produce shorter spatial wavelengths. The stylus geometry (tip radius) sets a physical upper limit on the highest frequency that can be accurately reproduced.

Magnetic recording relies on remanence and is complicated by hysteresis. Ferromagnetic tape retains magnetic orientation after the recording field is removed (remanence). But the nonlinear magnetization curve (hysteresis) causes distortion at low signal levels. AC bias — an ultrasonic oscillation added to the audio signal during recording — linearizes the process and reduces distortion dramatically.

Tape saturation generates even-order harmonics. When a signal is large enough to push the tape into magnetic saturation, the peaks of the waveform are rounded smoothly. This smooth nonlinearity generates primarily second-harmonic distortion — the octave above the fundamental. Second harmonics are consonant and perceived as "warmth." This is distinct from digital hard clipping, which generates harsh, high-order odd-harmonic distortion.

The three primary microphone types use different transduction mechanisms. Dynamic mics use electromagnetic induction (moving coil in magnetic field). Condenser mics use capacitance variation (charged diaphragm changing gap to backplate). Ribbon mics use electromagnetic induction of a conducting ribbon. Each type has characteristic frequency response, sensitivity, and appropriate applications.

Signal Chain and Studio Physics

The proximity effect is a physics phenomenon, not an imperfection. Directional microphones boost low frequencies when placed close to a sound source because the pressure gradient across the capsule is larger in near-field sound (curved wavefronts) than far-field sound. Engineers exploit this deliberately to add bass character to voices and instruments.

The inverse-square law governs the direct-to-reverberant ratio. As microphone distance from a source doubles, direct sound pressure falls by 6 dB, while reverberant field level remains approximately constant. This means microphone placement is fundamentally a choice about how much acoustic environment to include in the recording, not just about volume.

Gain staging is the management of signal-to-noise ratio through the signal chain. Each stage in the signal chain should receive the signal at a level that is well above the noise floor but below the clipping threshold. Poor gain staging allows noise to build up or creates distortion at intermediate stages before the final recording.

Stereo and Spatial Audio

Stereo is an illusion built on two binaural cues. The stereo illusion is constructed from Interaural Level Differences (ILD — level differences between ears, dominant at high frequencies) and Interaural Time Differences (ITD — arrival time differences between ears, dominant at low frequencies). Neither cue is physically accurate in loudspeaker stereo: the listener's ears each receive some signal from both speakers.

The stereo "sweet spot" is a physical consequence of geometry. The stereo illusion degrades outside the equilateral triangle sweet spot because the ILD and ITD cues become distorted by the asymmetric geometry. This is a fundamental limit of two-channel loudspeaker stereo, not a solvable engineering problem within the two-channel paradigm.

Technology and Music

The studio became an instrument. Multi-track recording, overdubbing, tape editing, and signal processing collectively transformed the recording studio from a document-capture facility into a compositional instrument. Works like Sgt. Pepper's Lonely Hearts Club Band could not have been created without the studio — they are compositions for the studio, not performances captured by the studio.

The Compact Disc achieves destructive interference at pit edges. CD pits are approximately one-quarter the wavelength of the reading laser deep. This causes destructive interference (180-degree phase shift in the round-trip optical path) at the pit edges, providing the optical contrast the photodetector uses to distinguish pits from lands.

Dynamic range compression can serve both aesthetic and commercial ends. Used skillfully, compression shapes the temporal envelope of sound in musically expressive ways. Used commercially to maximize loudness for competitive advantage, it can eliminate the dynamic contrast on which musical drama depends. LUFS normalization by streaming platforms has changed the commercial incentive structure, reducing (though not eliminating) the loudness war dynamic.

Theme Connections

Theme 4 (Technology as Mediator): Every recording technology — mechanical, magnetic, optical, digital — stands between the original sound and the preserved representation, and every technology makes choices about what to capture and what to discard. These choices are never neutral. The "warmth" of tape, the "precision" of digital, the "intimacy" of close-miking — all are consequences of specific technological mediations.

Theme 3 (Constraint as Creativity): The four-track tape limitation of 1960s studio technology generated the bounce-down technique. The physical limitations of stylus geometry drove the development of elliptical styli. The hysteresis problem of tape drove the invention of AC bias. Constraints have historically been among the most powerful generators of creative and technical innovation.