Case Study 10.2: The Yamaha DX7 and FM Synthesis — How One Algorithm Changed Pop Music
A Stanford Office, 1967
John Chowning sat in his office at Stanford University's Center for Computer Research in Music and Acoustics (CCRMA) and made the mistake — or rather, the discovery — that changed the sound of 1980s pop music. He was experimenting with a vibrato effect on a computer-generated tone, running the code on one of the facility's mainframes. He wanted to see what happened when the vibrato rate increased beyond the range of human perception: would the modulation simply disappear, or would something else happen?
Something else happened. As the modulation rate climbed into the audio range — 100 Hz, 200 Hz, 500 Hz — the tone he was modulating suddenly acquired a rich, complex, metallic quality. It no longer sounded like vibrato; it sounded like a bell. Or a clarinet. Or a plucked string. The quality of the sound changed with the rate and depth of modulation, following patterns Chowning would spend the next several years characterizing.
What he had discovered was FM synthesis — frequency modulation applied in the audio domain, generating complex spectra from simple inputs. Chowning published his findings in the Journal of the Audio Engineering Society in 1973. He then did something unusual for an academic: he patented the idea and licensed it to Yamaha Corporation of Japan for what reports describe as a very modest fee. Yamaha would make considerably more from the license than Stanford paid Chowning.
The Mathematics That Made the DX7
The DX7 synthesizer, released in 1983, implemented Chowning's FM algorithm with six "operators" — each operator being a combined oscillator and envelope generator. The six operators could be connected in 32 different "algorithm" configurations — different networks of carriers and modulators.
The mathematics underlying each configuration is the same FM formula:
x(t) = A · sin(2π·fc·t + I(t) · sin(2π·fm·t))
But when six operators are chained and cross-modulated in various configurations, the resulting complexity of the spectrum is orders of magnitude beyond what a two-operator system produces. A single carrier modulated by five modulators generates sidebands of sidebands of sidebands — a "fractal" of frequency modulation that can produce extraordinarily dense, evolving spectra from a set of mathematical rules no more complex than the original formula.
The DX7's 32 algorithms are essentially 32 different differential equation networks. Algorithm 1 has all six operators in a chain: operator 6 modulates operator 5, which modulates 4, which modulates 3, which modulates 2, which modulates operator 1 (the carrier). This creates a six-level nested FM structure — a computational depth that produces extremely complex timbres. Algorithm 32, by contrast, has all six operators as independent carriers with no modulation — additive synthesis of six simultaneous sine waves. The other 30 algorithms occupy the vast middle ground between these extremes.
The Physics of Specific DX7 Sounds
The Electric Piano The DX7's most famous sound — the electric piano preset, "E. PIANO 1" — became so ubiquitous in 1980s pop that it is nearly impossible to hear the chord from the opening of A-ha's "Take On Me" without triggering instant recognition. Physically, the DX7 electric piano sound works as follows:
- The carrier (operator 1) runs at the base pitch and produces the output tone
- Operator 2 modulates the carrier with a C:M ratio near 1:1 and a modulation index that starts high (bright, harmonically rich attack) and decays rapidly (smooth, mellow sustain)
- The decaying modulation index mimics the physics of the real Rhodes piano: when the hammer strikes the tine (a metal tine, not a string), the initial strike is bright (high harmonic content) and then decays to a purer, more sinusoidal tone as the tine vibrates more regularly
The key physics insight: a decaying modulation index produces a decaying spectrum complexity — the sound starts bright and becomes purer over time. This matches the observed behavior of struck-metal instruments. The DX7 was not designed with this physical model in mind; Chowning and the Yamaha programmers discovered that FM with a decaying envelope on the modulation index naturally produced this behavior.
The Bass Guitar DX7 bass sounds typically use a low fundamental (carrier frequency) with a high modulation index at attack that decays quickly. This produces the percussive "slap" of the fundamental plus harmonics at attack, followed by a clean, sinusoidal sustain — mimicking the physics of a plucked or slapped string whose initial disturbance is rich in harmonics but which quickly settles into its fundamental mode.
The Brass Brass-like FM sounds use a modulator at roughly twice the carrier frequency (C:M ≈ 1:2) with a modulation index that increases slightly with dynamics (velocity-sensitive index). This is acoustically meaningful: real brass instruments produce richer spectra when played loudly (overblowing causes increased harmonic content), and the DX7's velocity-sensitive modulation index approximates this physical relationship.
Cultural Impact: The Sound of a Decade
The DX7 sold over 200,000 units in its first year — an astonishing figure for a device that retailed for $2,000 in 1983 dollars. Within two years, it was the most common keyboard instrument in professional music production. Its presets appeared on records by Whitney Houston, Phil Collins, Madonna, Brian Eno, Peter Gabriel, Tina Turner, and hundreds of other artists. Songs that defined 1980s pop aesthetics — "Against All Odds," "Jump," "Owner of a Lonely Heart," "Take On Me," "Don't You (Forget About Me)," "Higher Love" — were built around DX7 sounds.
The DX7 changed pop music in several specific ways:
The "Suitcase Piano" Became the Default Keyboard Before the DX7, expensive Fender Rhodes electric pianos were the standard keyboard for sessions requiring that warm, electric piano timbre. After 1983, the DX7's FM approximation of the Rhodes became the de facto standard — partly because it was cheaper, partly because it was more portable, but also because its timbral characteristics (slightly brighter, slightly less "leaky" than the real Rhodes) suited the recording aesthetics of the era.
New Timbres Became Available FM synthesis can produce sounds that no subtractive synthesizer can easily achieve — the metallic, shimmering quality of certain FM spectra, the "hollow" quality of certain C:M ratios, the rapid timbral evolution possible with dynamic modulation indices. These new timbres became part of the 1980s pop palette not because producers made principled physics-based choices but because the DX7 was what was in the studio, and producers used it to make music.
The Physics of Nostalgia By the mid-1990s, the DX7's sounds had become sonic markers of a specific era — the 1980s. This is, in retrospect, an acoustically interesting phenomenon: why does the DX7 sound "dated" in a way that, say, a Steinway piano does not? Both are instruments of their era; both encode physics in their sounds. The difference is that the Steinway's physics (vibrating strings, resonant soundboard) are universal — they produce harmonic series and temporal envelopes that are found in nature. The DX7's physics (Bessel-function sidebands from specific C:M ratios) produce spectral patterns that are mathematically inevitable but rare in natural acoustic sounds. The "digitalness" of FM synthesis — the characteristic shimmer, the specific sideband structure — became associated with a particular technological moment rather than with universal acoustic physics.
By the 2010s, this same quality had become "retro" and "vintage" — the DX7 sounds that seemed cutting-edge in 1983 were nostalgic by 2015, and "FM synthesis" plugins (free and commercial) had proliferated for musicians seeking that specific aesthetic.
What FM Synthesis Teaches About Physics and Music
The story of FM synthesis and the DX7 illustrates several themes of this chapter with particular clarity:
Technology as Mediator: The DX7 mediated between Chowning's mathematical discovery (FM generates complex spectra via Bessel functions) and musical culture (the sound of 1980s pop). The technology didn't just implement physics — it translated a mathematical insight into a cultural moment.
Reductionism and Emergence: Two sine-wave oscillators (carrier and modulator) interact through a simple formula to produce spectra of extraordinary complexity. This is emergence: the mathematical simplicity of the FM formula gives no direct indication that it will produce bell sounds, electric piano sounds, and brass sounds — these properties emerge from the interaction.
Constraint and Creativity: The DX7's programming was notoriously difficult. The interface (six operators, 32 algorithms, each with multiple parameters) was so opaque that most users relied on the 32 factory presets rather than programming their own sounds. But this constraint — the difficulty of FM programming — paradoxically made certain sounds ubiquitous: the factory presets became the sonic vocabulary of an era because they were too hard to change. The constraint of the interface shaped the aesthetics of a decade.
Physics and Cultural Specificity: The DX7's FM sounds are neither culturally neutral nor acoustically universal. They are the product of a specific mathematical algorithm (Chowning's FM formula) implemented in a specific technological artifact (the DX7) at a specific cultural moment (1980s pop production). The physics of FM synthesis is universal; the cultural meaning of "that DX7 sound" is historically specific.
Chowning's Legacy
John Chowning received very little direct financial benefit from his discovery — Stanford's licensing income from the Yamaha agreement was substantial by academic standards, but modest by the standards of what the DX7 generated. Chowning has said, in interviews, that he is less interested in the commercial legacy than in the scientific one: that a simple mathematical formula, discovered by pushing a vibrato parameter beyond its expected range, turned out to implement Bessel function mathematics and generate acoustically valid, musically valuable timbres.
The lesson he draws from FM synthesis is one about physics and surprise: the deepest discoveries in synthesis have not come from trying to build something specific, but from pushing parameters beyond their expected ranges and listening to what happens. This is, in a sense, experimental physics applied to music: the experimental approach of probing a system and observing unexpected behaviors.
Discussion Questions
-
The DX7's factory presets defined the sound of 1980s pop partly because they were too difficult to program. This is the "constraint and creativity" paradox: a system that was hard to modify produced a consistent aesthetic that became culturally significant. Design a contemporary instrument or tool where difficulty of use might similarly produce a distinctive aesthetic. What are the physics, engineering, or interface constraints that would create this outcome?
-
FM synthesis sounds "dated" in a way that acoustic instruments do not. Propose a hypothesis for why this is the case, using the acoustic physics of FM spectra (Bessel function sidebands) vs. acoustic instrument spectra (harmonic series from physical vibration). Is it possible that in 200 years, FM synthesis sounds will seem as "timeless" as the sound of a harpsichord?
-
Chowning discovered FM synthesis by accident — by pushing a vibrato effect beyond its intended range. How does this mode of scientific and musical discovery (accidental discovery through exploration) compare to intentional design? What does it suggest about the value of "playing around" with physical systems beyond their intended parameters?
-
The DX7's six-operator FM architecture can be described as a network of coupled differential equations. When you play a note, the synthesizer solves these equations in real time. Is this different, in principle, from a physical instrument solving wave equations in real time through the physics of material vibration? What is the difference between a physical system "solving" its own equations and a computer solving those equations numerically?