Every musical system humans have ever devised — from the five-note pentatonic scales of ancient China to the forty-three-note just intonation scales of twentieth-century American composer Harry Partch, from the rhythmic cycles of South Indian...
In This Chapter
- The Architecture of Sound
- Opening: The Question Every Music Student Should Ask
- 11.1 What Is Pitch? — Frequency, Perception, and the Brain's Shortcuts
- 11.2 The Octave: Physics' Gift to Music — Why "Same Note, Different Register"
- 11.3 The Problem of Scale Construction — You Can't Just Pick Any 12 Notes
- 11.4 The Pentatonic Scale: Universal or Coincidence?
- 11.5 The Diatonic Scale: A Solution to the Consonance Problem
- 11.6 Why 12 Notes? The Physics and Math of 12-Tone Equal Temperament
- 11.7 Non-Western Scale Systems — Maqam, Raga, Pelog, and Microtonal Traditions
- 11.8 Theme 2 Deep Dive — Which Aspects of Scales Are Universal?
- 11.9 The Tritone: Music's Most Dangerous Interval
- 11.10 Melody vs. Harmony — What Scales Are Actually For
- 11.11 Thought Experiment: Designing a Scale for a Different Auditory Range
- 11.12 Summary and Bridge to Chapter 12
Part III Introduction: Musical Structure as Physics
The Architecture of Sound
Every musical system humans have ever devised — from the five-note pentatonic scales of ancient China to the forty-three-note just intonation scales of twentieth-century American composer Harry Partch, from the rhythmic cycles of South Indian Carnatic music that unfold over 108 beats to the four-bar loops of electronic dance music — shares a common foundation. That foundation is physics.
This might seem like a reductive claim. Music feels intensely human: emotional, cultural, expressive. It varies wildly across civilizations, eras, and traditions. How can physics be responsible for something so diverse?
The answer lies in the distinction between the laws that constrain musical systems and the choices made within those constraints. Physics does not dictate that humans must use twelve notes per octave, or that music must have a steady beat, or that a minor chord sounds sad while a major chord sounds happy. What physics does dictate is the behavior of vibrating strings and air columns, the mathematics of wave interference, the way the human auditory system extracts information from pressure waves, and the neurological rhythms that govern our sense of time.
Given those physical constraints, certain musical structures become nearly inevitable — not because they are "built into nature," but because they solve problems that every musical culture faces. Part III examines those structures: the scales that organize pitch, the tuning systems that make instruments compatible, and the rhythmic patterns that organize time. In each case, we find the same dialectic: universal physical constraints that shape musical possibility, and an enormous cultural creativity that operates within — and sometimes deliberately against — those constraints.
Chapter 11 begins with the most fundamental question: why these notes?
Chapter 11: Pitch, Frequency & Musical Scales — Why These Notes?
Opening: The Question Every Music Student Should Ask
You are sitting at a piano. There are 88 keys. Twelve of them repeat in a pattern called an octave. Western music uses these 12 notes almost exclusively, with occasional chromatic passages and rare excursions into microtonality. But why 12? Why not 10, or 17, or 23? Why does every culture independently arrive at scales that emphasize certain intervals — the octave, the fifth, the third? Is there something in the physics of sound that makes these intervals special, or is the entire enterprise of scale construction an arbitrary cultural agreement?
The answer, as we will discover, is "both — and that's what makes it interesting."
11.1 What Is Pitch? — Frequency, Perception, and the Brain's Shortcuts
The Physical Side: Frequency
When a guitar string vibrates at 440 oscillations per second, it creates a pressure wave that travels through the air at roughly 343 meters per second. When that wave reaches your eardrum, it pushes and pulls the membrane 440 times per second. Your cochlea — the snail-shaped organ inside your inner ear — performs a kind of mechanical frequency analysis, with different regions of the basilar membrane resonating at different frequencies. Hair cells at the resonating location send electrical signals to your auditory cortex. Your brain interprets the rate of vibration as pitch.
Pitch, then, is the perceptual correlate of frequency. The word "frequency" belongs to physics; "pitch" belongs to psychology.
💡 Key Insight: Pitch Is Not Frequency
Frequency is an objective, measurable property of a sound wave — cycles per second, measured in Hertz (Hz). Pitch is the subjective experience of that frequency in the mind of a listener. The relationship between them is usually predictable, but not always. Very loud sounds seem slightly higher in pitch at high frequencies and slightly lower at low frequencies. Pitch also depends on context: the same frequency can seem higher or lower depending on what preceded it. Physics provides the raw material; perception shapes the experience.
The Logarithmic Nature of Pitch Perception
One of the most important — and frequently misunderstood — aspects of pitch perception is that it is logarithmic, not linear.
When you go from 220 Hz to 440 Hz, you perceive an increase of one octave. When you go from 440 Hz to 880 Hz, you again perceive an increase of one octave — the same perceptual distance. But the first step added 220 Hz while the second added 440 Hz. Doubling the frequency always produces the same perceived interval, regardless of where you start. This means that equal steps on a musical scale correspond to equal ratios, not equal differences.
This is not a quirk of human psychology — it is a mathematical consequence of how the cochlea works and how the auditory cortex processes frequency information. Our brains evolved to hear ratios, not differences. A predator's growl at 100 Hz is "low" whether we're close (loud) or far (quiet); pitch constancy across intensity is survival-relevant, and the logarithmic encoding achieves it.
📊 Box 11.1: The Frequency of Musical Notes
| Note | Frequency (Hz) | Ratio to A4 |
|---|---|---|
| A3 | 220.00 | 1:2 |
| B3 | 246.94 | — |
| C4 (Middle C) | 261.63 | — |
| D4 | 293.66 | — |
| E4 | 329.63 | — |
| F4 | 349.23 | — |
| G4 | 392.00 | — |
| A4 | 440.00 | 1:1 (reference) |
| A5 | 880.00 | 2:1 |
| A6 | 1760.00 | 4:1 |
Note how each octave exactly doubles the frequency. The ratio between any two identical note names in adjacent octaves is always 2:1.
Categorical Perception: The Brain Bins Pitch
Perhaps the most remarkable feature of pitch perception is categorical perception. When people listen to a continuum of pitches between, say, A and B♭, they do not perceive a smooth gradient — they hear discrete categories. Pitches below a certain threshold are heard as "A"; pitches above it are heard as "B♭." The boundary between them is perceived as sharp, even though the physical transition is gradual.
This is exactly how we perceive speech sounds: the range of sounds between "ba" and "pa" is continuous, but English speakers hear one or the other. Categorical perception for pitch is partially culturally learned (musicians trained in Western 12-TET hear different boundaries than singers trained in Indian classical music) and partially neurologically universal. Both aspects matter for understanding why musical scales exist at all: scales are formalizations of categorical perception, drawing sharp lines through the continuous frequency spectrum.
⚠️ Common Misconception: Notes Are the Only "Real" Pitches
Equal temperament — the tuning system of modern pianos and guitars — divides the octave into 12 equal steps. But these 12 notes are not the only pitches in existence; they are simply the pitches a particular instrument has been tuned to. Jazz singers routinely sing "blue notes" that fall between the keys of the piano. Indian classical musicians use microtonal ornaments called gamakas that move between and through the standard scale degrees. The piano keyboard gives a misleading impression that only 12 pitches exist per octave. In reality, the frequency spectrum is continuous; scales are human impositions upon it.
11.2 The Octave: Physics' Gift to Music — Why "Same Note, Different Register"
The 2:1 Ratio
The octave has a special status in virtually every musical tradition on Earth. When you play A4 (440 Hz) and A5 (880 Hz) together, they blend so smoothly that many listeners perceive them as "the same note" — just higher or lower. In music theory, notes an octave apart are given the same name. This isn't an arbitrary convention; it reflects a real perceptual phenomenon grounded in physics.
Why does the octave sound like "the same note"? The answer lies in the harmonic series.
When any pitched instrument plays a note, it doesn't produce just one frequency. It produces a fundamental frequency f and simultaneously a cascade of overtones at 2f, 3f, 4f, 5f, and so on — the harmonic series. When you play A4 at 440 Hz, your instrument also produces energy at 880 Hz (the second harmonic), 1320 Hz (third), 1760 Hz (fourth), and beyond. The second harmonic — 880 Hz — is exactly A5.
This means that every time you play any note, you are already playing the octave above it (as a partial). When the octave is then played explicitly, its fundamental matches one of the already-present overtones of the lower note. The two sounds share so many frequency components that they fuse perceptually. This is why octave equivalence feels like recognition rather than contrast.
💡 Key Insight: The Octave Is Already Hidden in Every Note
Every pitched sound contains its own octave as an overtone. Octave equivalence is not a cultural convention — it is the auditory system recognizing a physical relationship that is present in the sound itself. Cultures that claim not to use octaves (a few exist) are still constrained by the octave in the physics of their instruments; they have simply chosen not to emphasize the relationship explicitly.
Octave Equivalence Across Cultures
The universality of octave equivalence is one of the strongest pieces of evidence that music has genuine physical foundations. Cross-cultural studies have found that even people with no musical training from cultures very different from Western Europe perceive octave-related pitches as more similar than other pitch pairs. Infants as young as six months respond to octave-transposed melodies as if they were the same melody. Non-human primates show some sensitivity to octave relationships, though the effect is much weaker than in humans.
The octave, then, is a genuine universal — one of the rare musical phenomena that seems to transcend culture and emerge from the physics of sound plus the biology of hearing.
🔗 Running Example: The Choir & the Particle Accelerator
In a particle accelerator, certain energy levels are stable — particles "prefer" them because they correspond to resonant states. Similarly, in a choir, the octave is the first stable resonance: when sopranos and basses sing the same note an octave apart, their sound waves reinforce each other so completely that the combined tone is richer than either alone. The choir, like the accelerator, gravitates toward these resonant states. The octave is the fundamental resonant state of pitch space.
11.3 The Problem of Scale Construction — You Can't Just Pick Any 12 Notes
Why Scales Are Necessary
Given an infinite continuum of pitches, you need a discrete set to build melody and harmony. You can't play "in between" notes (on most instruments) — you need to decide which pitches will exist. The selection of these pitches is scale construction, and it is harder than it looks.
The constraints are multiple and partially contradictory:
Constraint 1: Consonance. You want intervals that sound pleasing together, which means you want intervals whose frequencies stand in simple integer ratios (more on this in Chapter 12). The simplest ratios — 2:1, 3:2, 4:3, 5:4 — correspond to the most consonant intervals.
Constraint 2: Modularity. You want to be able to start the same scale pattern on different notes (play in different "keys") without retuning your instruments. This requires the scale to be transposable, which means the intervals between scale degrees must be consistent regardless of starting point.
Constraint 3: Economy. You don't want so many notes that instruments become unplayable or that musicians cannot keep track of the pitch space. The number of notes per octave should be manageable — probably between 5 and 24.
Constraint 4: Completeness. You want enough notes to allow for a rich variety of harmonic combinations and melodic directions.
Here is the problem: constraints 1 and 2 are mathematically irreconcilable. There is no set of pitches that satisfies both perfectly. This is not an engineering limitation that better technology can solve; it is a mathematical theorem. The formal proof involves the irrationality of log₂(3/2) — roughly, the reason that no whole number of fifths can equal a whole number of octaves. We will see this in detail in Chapter 12, but the essential point is: scale construction is a compromise, and different musical cultures have made different compromises.
⚠️ Common Misconception: There Is One "Natural" Scale
Some popular accounts of music theory suggest that the major scale is "built into nature" through the harmonic series. While the harmonic series does provide some justification for the major triad and the fifth, it does not straightforwardly generate the major scale or explain the choice of 12 pitches per octave. Multiple different scale systems can claim a basis in physics; the 12-tone equal temperament that dominates Western music is one solution to a problem that has many solutions.
11.4 The Pentatonic Scale: Universal or Coincidence?
The Five-Note Wonder
The pentatonic scale — a five-note scale — appears in musical traditions across every inhabited continent: Chinese classical music, Scottish folk songs, West African drumming, pre-Columbian Andean music, Japanese folk music, bluegrass, gospel, and rock guitar solos. When music educators introduce young children to improvisation, they often start on the pentatonic scale because it is nearly impossible to play a "wrong" note — every note harmonizes with every other.
Is this universality evidence of some deep physical truth? Or is it coincidence — perhaps the pentatonic scale is universal because it is simple, and simple things spread easily across cultures?
The answer involves both physics and social diffusion, in proportions that are still debated.
The Physics Argument
The most common pentatonic scale (in the key of C) uses the notes C, D, E, G, A — equivalent to the black keys on a piano starting from a particular note. These five notes are the first five distinct pitches that emerge from stacking perfect fifths (the ratio 3:2): C → G → D → A → E. The fifth is the most consonant interval after the octave, produced by the third harmonic of the harmonic series. If you start with any note and repeatedly multiply by 3/2 (folding back into the octave when necessary), you generate the pentatonic scale before you need to make any compromises.
In this sense, the pentatonic scale represents the "easiest" or most physically natural scale: it uses only intervals that arise early in the harmonic series, with the fewest compromises. It avoids the half-step (semitone) — the smallest interval in Western music — which is the first interval to require a compromise between pure ratios.
📊 Box 11.2: The Pentatonic Scale from Stacked Fifths
Starting from C, stacking perfect fifths (ratio 3:2, up to the nearest octave):
| Step | Raw Frequency Ratio | Fold into Octave | Note Name |
|---|---|---|---|
| Start | 1:1 | 1:1 | C |
| ×3/2 | 3:2 | 3:2 | G |
| ×3/2 | 9:4 | 9:8 | D |
| ×3/2 | 27:8 | 27:16 | A |
| ×3/2 | 81:16 | 81:64 | E |
These five ratios — 1, 9/8, 81/64, 3/2, 27/16 — give the major pentatonic scale. Notice that no compromise (rounding or adjustment) has been necessary yet.
The Cultural Argument
Not everyone finds the physics argument convincing. Some ethnomusicologists point out that the pentatonic scales used in different cultures are not all the same five-note selection. The Japanese in scale (pentatonic) has a very different character from the Chinese pentatonic or the West African pentatonic. What they share is the number five, not the specific intervals. And why five? Perhaps five notes is simply the optimal cognitive load — enough for variety, few enough to remember. Five is also the number of fingers on one hand, a counting convenience that might explain pentatonic scales with no reference to physics at all.
⚖️ Debate: Is the Pentatonic Scale "Built Into" Physics or "Built Into" Human Cognition?
The strongest claim — that the pentatonic scale is universal because of acoustic physics — overstates the evidence. The weaker claim — that it's purely cultural coincidence — understates the physical basis. The most defensible position is that the pentatonic scale represents a convergent solution to a shared problem: how do you select a small, memorable set of pitches that harmonize well? Different cultures solve this differently but often arrive at similar answers because the physics of consonance constrains the solution space. This is emergence, not determinism.
11.5 The Diatonic Scale: A Solution to the Consonance Problem
From Five to Seven
The major scale — also called the diatonic scale — adds two more notes to the pentatonic, creating seven distinct pitches per octave. These two additional notes are a perfect fourth (4:3 ratio) above the root and a leading tone (a half-step below the octave). The result is the scale most students learn first in Western music education: C, D, E, F, G, A, B.
Why add these two notes? Because they dramatically expand harmonic possibilities. With seven notes, you can construct a much richer variety of chords, allow for more complex melody, and set up the "leading tone" — the seventh scale degree, just below the octave — which creates intense tension and a strong pull back to the root. This tension-and-resolution dynamic is a cornerstone of Western tonal music.
Derivation from the Harmonic Series
The major triad — three notes that sound together — can be derived from the harmonic series in a particularly elegant way. If you take the fourth, fifth, and sixth harmonics of any tone, their frequencies stand in the ratio 4:5:6. This ratio produces a major third (5:4) and a perfect fifth (3:2). The major triad is literally built into the physics of any vibrating string or air column.
The full diatonic major scale can be constructed by choosing three overlapping major triads: the tonic (I), dominant (V), and subdominant (IV) triads. Together, these triads include every degree of the major scale exactly once. This is not a coincidence; it is a mathematical property of these three chords. The major scale is, in this sense, the "minimal" scale that contains all three of the most acoustically natural chords.
💡 Key Insight: The Major Scale as a Chord Network
Think of the major scale not as a set of notes but as a network of three interlocking major triads. In C major: - Tonic triad: C, E, G (frequencies in ratio 4:5:6) - Dominant triad: G, B, D (frequencies in ratio 4:5:6 from G) - Subdominant triad: F, A, C (frequencies in ratio 4:5:6 from F)
Every note in the C major scale appears in at least one of these three triads. The scale is the skeleton of a harmonic system, not just a melody tool.
11.6 Why 12 Notes? The Physics and Math of 12-Tone Equal Temperament
The Twelve-Tone Coincidence
Here is a mathematical near-miracle: twelve perfect fifths — twelve multiplications by 3/2, with octave reductions — nearly return to the starting note. Not exactly: twelve fifths overshoot seven octaves by about 23.5 cents (a cent is 1/100 of a semitone, so this overshoot is less than a quarter of a half-step). This near-coincidence means that twelve notes can provide something like "all the notes you need" with tolerable error in every interval.
The mathematical statement is: (3/2)¹² ≈ 2⁷, or 129.75 ≈ 128. The discrepancy — called the Pythagorean comma — is about 1.4%. This is small enough that twelve notes can serve as a practical scale with only modest compromises.
Why not seven notes? Seven is another option: seven fifths give a reasonable approximation to some useful intervals. But twelve provides much greater completeness — twelve notes can approximate every interval in the harmonic series up to the sixth partial with errors below 2%. Seven cannot.
Why not seventeen or nineteen? Both 17 and 19 also represent "good" numbers of equal divisions of the octave, with 19 particularly well-regarded. But 12 is the smallest number that provides a reasonably complete harmonic vocabulary while remaining cognitively manageable.
📊 Box 11.3: The 12-Note Equal Temperament Scale
Starting from A4 = 440 Hz, each note is 440 × 2^(n/12) Hz, where n is the number of semitones above A4.
| Note | Semitones from A4 | Frequency (Hz) | Closest Just Ratio | Error (cents) |
|---|---|---|---|---|
| A4 | 0 | 440.00 | 1:1 | 0.0 |
| A#4/B♭4 | 1 | 466.16 | 16:15 | −11.7 |
| B4 | 2 | 493.88 | 9:8 | +3.9 |
| C5 | 3 | 523.25 | 6:5 | +15.6 |
| C#5/D♭5 | 4 | 554.37 | 5:4 | −13.7 |
| D5 | 5 | 587.33 | 4:3 | −2.0 |
| D#5/E♭5 | 6 | 622.25 | 45:32 | −9.8 |
| E5 | 7 | 659.26 | 3:2 | +2.0 |
| F5 | 8 | 698.46 | 8:5 | +13.7 |
| F#5/G♭5 | 9 | 739.99 | 5:3 | −15.6 |
| G5 | 10 | 783.99 | 16:9 | −3.9 |
| G#5/A♭5 | 11 | 830.61 | 15:8 | +11.7 |
| A5 | 12 | 880.00 | 2:1 | 0.0 |
The most accurate intervals in 12-TET are the octave (perfect) and the fifth (off by only 2 cents — barely perceptible). The worst are the major third (off by nearly 14 cents — audible to trained listeners) and the minor third (off by 15.6 cents).
The Engineering Triumph of Equal Temperament
Twelve-tone equal temperament (12-TET) divides the octave into twelve equal semitones, each representing a frequency ratio of 2^(1/12) ≈ 1.0595. This means every semitone is the same size, and every key sounds identical. A piece in C major sounds exactly like the same piece in F# major — just higher. This is enormously convenient for instrument builders, composers, and performers.
The key word is "equal." Earlier temperaments (Pythagorean, meantone, well-temperament) created semitones of slightly different sizes, meaning some keys sounded "in tune" while others sounded harsh. In equal temperament, every key is equally in tune — or equally out of tune, depending on your perspective.
⚖️ Debate: Is Equal Temperament a Triumph of Engineering or a Betrayal of Physics?
The Case for Equal Temperament: Music needs practical instruments. A piano tuned to just intonation in one key will sound terrible in other keys. Equal temperament sacrifices a small amount of acoustic purity (the major third is 14 cents flat compared to the pure 5:4 ratio) in exchange for universal transposability. Every key is available without retuning. Orchestras, bands, and ensembles can play together without constantly negotiating tuning. Electronic music, which requires pitch-perfect synthesis, requires equal temperament as a foundation. The compromise is small and worthwhile.
The Case Against Equal Temperament: Every interval except the octave is wrong. The major third — one of the most important harmonic intervals — is consistently 14 cents flat, a deviation that choral singers, string players, and brass players instinctively correct by ear when they're not constrained by a keyboard. In just intonation, a major chord has a crystalline purity that equal temperament cannot match. J.S. Bach's Well-Tempered Clavier was written for well temperament, not equal temperament; in the keys Bach intended, the work sounds different — more complex, with each key having its own character. Equal temperament flattened that landscape into uniformity.
The Synthesis: Both arguments are correct because they are addressing different values. Equal temperament optimizes for flexibility and compatibility. Just intonation optimizes for acoustic purity in a fixed key. The choice between them is not a scientific question but a question of what music is for. For a traveling keyboard musician who plays in many keys, equal temperament is liberation. For a choir performing Renaissance polyphony in a single tonality, just intonation is transcendence.
11.7 Non-Western Scale Systems — Maqam, Raga, Pelog, and Microtonal Traditions
The World's Scale Systems
To a student trained entirely in Western music, the 12-TET major and minor scales might seem like the only options. In reality, they represent one particular solution to the scale construction problem — a solution that became globally dominant partly through acoustic merit and partly through the political and economic power of European colonialism. Other musical cultures developed radically different solutions.
Arab Maqam
Arab classical music uses a system of maqamat (singular: maqam) — melodic modes that specify not just a scale of pitches but a set of performance conventions: characteristic phrases, ornamentation styles, emotional associations, times of day for performance, and associations with physical and emotional states. The Arab system divides the octave into 24 quarter-tones — twice as many notes as 12-TET — and maqamat select specific subsets of these quarter-tones with characteristic raised and lowered degrees.
The most distinctive feature of many maqamat is the use of neutral thirds — intervals halfway between a major third and a minor third. In 12-TET, this pitch doesn't exist; it falls exactly between two piano keys. But to an Arab musician, the neutral third is a specific, named pitch with precise acoustic meaning. It corresponds to a frequency ratio of approximately 11:9, a ratio that doesn't appear in the standard Western harmonic vocabulary.
Indian Raga
The Indian classical tradition (both Hindustani in the north and Carnatic in the south) uses a system of ragas — scale-like structures that, like maqamat, specify much more than a set of pitches. A raga includes a scale (called the thaat in Hindustani music, the melakarta in Carnatic music), ascending and descending patterns (which may use different notes), characteristic phrases called pakad, ornaments (gamakas) specific to each note, a characteristic emotional quality (rasa), and in some traditions, appropriate times or seasons.
The Indian system recognizes 22 shrutis — microtonal positions within the octave — though any given raga uses only 5 to 7 of them. The shruti system is based on just intonation ratios, with each shruti corresponding to a specific small-integer frequency ratio. Many of these ratios — like 81:64 (the Pythagorean major third) or 11:8 (an 11th harmonic interval) — lie between the keys of a piano.
Indonesian Gamelan — Pelog and Slendro
The Indonesian gamelan — an ensemble of bronze percussion instruments — uses two scale systems that are particularly striking for Western ears: slendro (five notes roughly equally spaced across the octave) and pelog (seven notes unevenly spaced). What makes these remarkable is that different gamelan ensembles deliberately use different tunings — there is no standardized pitch, and indeed each gamelan's unique tuning is considered part of its identity.
Slendro roughly approximates 5-TET (five-tone equal temperament), but not exactly; the intervals vary from ensemble to ensemble. Pelog has intervals that don't correspond to any simple ratio system. The effect — particularly to Western ears — is haunting and otherworldly, with simultaneous instruments producing slight beating that is intentional and aesthetically prized.
🔵 Try It Yourself: Hear the Difference
Find recordings of the following scales on a streaming service and compare them deliberately: 1. C major scale played on piano (12-TET) 2. Raga Yaman (starts like Lydian mode but with specific ornaments) — Ravi Shankar recordings 3. Arab Maqam Bayati — any Egyptian classical recording 4. Japanese koto music (pentatonic) 5. Indonesian gamelan (Balinese style)
Notice how each system carves up the pitch space differently. None is "more natural" — each represents a culturally refined solution to the problem of scale construction.
The Blues Scale
The blues scale — a six-note scale used in blues, jazz, rock, and gospel — is particularly interesting because it was developed precisely to escape the confines of 12-TET. African American musicians, playing on instruments designed for 12-TET (like the guitar), developed a tradition of bending strings and sliding between notes to access pitches that didn't exist on the keyboard. The characteristic "blue note" — a pitch between the major third and minor third (or between the minor seventh and the octave) — is not a note in 12-TET; it is a deliberate microtonality, a subversion of the equal temperament grid.
The blues scale is discussed in depth in Case Study 11.2.
11.8 Theme 2 Deep Dive — Which Aspects of Scales Are Universal?
What the Evidence Shows
Cross-cultural music research has identified several aspects of scale systems that appear in virtually every musical culture:
Universal aspects: - The octave as a structural boundary. Nearly every musical culture organizes pitch space in units of octaves — notes an octave apart are treated as equivalent or closely related. - Small number of discrete scale degrees. No culture uses a continuous pitch space without discretization; all cultures select a small set (5–12) of specific pitches. - Unequal step sizes (in most traditions). Scales tend to use steps of two or three sizes, not all equal. Even pentatonic scales have larger and smaller intervals. - Emphasis on the fifth. Intervals near the 3:2 ratio appear prominently in scales across cultures, though not in all.
Culturally variable aspects: - Number of notes per octave. Five, seven, twelve, seventeen, twenty-two, and twenty-four are all attested in major musical traditions. - Specific interval sizes. Quarter-tones, neutral thirds, and other intervals not in 12-TET appear in many non-Western systems. - The role of the third. Western music places great weight on the major/minor distinction; many other systems treat the third as a variable, ornamental pitch rather than a structural one. - Scalar hierarchy. Western music has a strong sense of "tonic" — a home pitch — that other systems may structure differently.
💡 Key Insight: The Universal-Cultural Dialectic in Scales
The universal aspects of scale systems reflect physics (the harmonic series privileges certain ratios) and neuroscience (the brain prefers small integer ratios and categorical perception). The culturally variable aspects reflect choices about which physical constraints to prioritize, how many notes to include, and what emotional and social functions the music should serve. Understanding scales means understanding both levels.
The Universality of Dissonance?
One consistent finding across cultures is that intervals close to simple integer ratios are generally preferred as resting points, while intervals far from simple ratios create tension. This preference is partially physical (simple-ratio intervals produce less beating when played simultaneously — see Chapter 12) and partially neurological (the auditory cortex may be tuned to detect simple ratios). But the specific intervals treated as consonant versus dissonant vary across cultures and time periods. Medieval European music treated the third as dissonant; modern Western music treats it as consonant. Arabic music prizes the neutral third. Indonesian gamelan aesthetics prize deliberate beating that Western ears find harsh.
11.9 The Tritone: Music's Most Dangerous Interval
Diabolus in Musica
The tritone — an interval spanning exactly six semitones in 12-TET, or half an octave — has been called diabolus in musica (the devil in music) since at least the early medieval period. It was allegedly prohibited in sacred music (though historians debate how strictly this prohibition was actually enforced). In heavy metal, it is a cliché of menace. In jazz, it is the most important harmonic resource — the tritone substitution is a foundational technique. In Western tonal music, it is the interval of maximum tension, always demanding resolution.
Why?
The Physics of Maximal Dissonance
The tritone's ratio in just intonation is approximately 45:32 (augmented fourth) or 64:45 (diminished fifth) — highly complex ratios involving large numbers, far from the simple ratios that characterize consonant intervals. In 12-TET, the tritone is exactly 2^(6/12) = √2:1 — an irrational number, as far as possible from a simple integer ratio.
In the harmonic series, the tritone (or near-tritone) first appears between the 5th and 7th harmonics (at a 7:5 ratio) or the 7th and 10th harmonics (10:7). These are relatively high and complex harmonics, which means the tritone doesn't naturally arise in the lower, stronger partials of any tone. Its component frequencies share no common overtones in the lower harmonic series, so when two tritone-separated tones are played together, their overtones clash — creating the maximum acoustic roughness and beating.
🧪 Thought Experiment: A World Without Tritones
Imagine a musical culture that explicitly excluded the tritone from its scale — that arranged its pitch system so that no two commonly-used notes were exactly six semitones apart. Would such a culture lack harmonic tension? Or would it simply locate tension elsewhere? What interval would serve the function of the tritone — the "maximum dissonance" that demands resolution? This thought experiment suggests that dissonance is not about specific intervals but about contrast — whatever interval is furthest from the prevailing norm of consonance will carry the weight of tension, regardless of its acoustic properties.
The Tritone in Practice
In jazz, the tritone substitution works as follows: in a dominant seventh chord (which contains a tritone between its third and seventh), you can replace the chord with another dominant seventh chord whose root is a tritone away — because those two chords share the same tritone (just respelled enharmonically). This means the tritone literally defines the dominant seventh chord's identity and its pull toward resolution.
In film music, the tritone is ubiquitous in horror and thriller scoring precisely because its acoustic properties — maximum roughness, absence of common overtones, perceptual instability — evoke exactly the disorientation and threat the genre requires. The tritone is dissonance systematized, made functional.
11.10 Melody vs. Harmony — What Scales Are Actually For
Two Functions, One Scale
A musical scale serves two distinct but related purposes: it organizes melody (the sequence of single notes that constitutes a musical line) and harmony (the simultaneous combination of notes into chords). These two functions place somewhat different demands on scale construction.
For melody, the most important properties of a scale are: the intervals should be recognizable and memorable, the scale should have a clear sense of direction (higher vs. lower), and there should be some notes that feel stable (resting points) and some that feel unstable (motion points). The distinction between stable and unstable degrees is what creates melodic tension and resolution.
For harmony, the most important properties are: the intervals between simultaneous notes should be acoustically compatible (not produce excessive beating), there should be enough variety to allow for chord progressions, and the scale should allow for interesting voice leading (the movement of individual parts within a chord progression).
💡 Key Insight: Scales as Compressed Information
A scale is not just a set of notes — it is a compressed encoding of the harmonic and melodic grammar of a musical style. When you know that a piece is in C major, you know immediately that C, E, and G will be the most stable tones; that F# doesn't exist (unless marked as an accidental); that the chord built on G will tend to resolve to the chord built on C; and that a melody will probably end on C. All of this information is contained in the seven-note selection called the major scale. The scale is a theory of musical possibility, stored in a few notes.
The Minor Scale Problem
Western music uses not one but two primary diatonic scales — major and minor. The minor scale has a different pattern of whole and half steps and a characteristic quality that nearly all listeners describe as "darker" or "sadder" than the major. But here's the complication: there are actually three variants of the minor scale in common use — natural minor, harmonic minor (with a raised seventh), and melodic minor (which uses different notes ascending and descending). This proliferation suggests that the minor scale is not derived as naturally from acoustic physics as the major scale; it is a series of compromises, with each variant solving one problem and creating another.
11.11 Thought Experiment: Designing a Scale for a Different Auditory Range
🧪 Thought Experiment: The Cetacean Scale
Suppose you were designing a musical system for a species with the following auditory characteristics: - Hearing range: 200 Hz to 100,000 Hz (much wider than human range of ~20 Hz to 20,000 Hz) - Maximum frequency resolution: can distinguish frequencies differing by as little as 0.1% (humans manage about 0.3-1% at mid-range) - Temporal resolution: 10 times slower than humans (detects patterns at 1/10 our rate) - Social structure: communicates in groups of up to 100 individuals simultaneously
(These characteristics roughly approximate some cetaceans — whales and dolphins.)
Questions to consider:
1. What would octave equivalence mean? With a range of 200 Hz to 100,000 Hz, there are roughly 9 octaves available — three times more than a piano keyboard. Would this species have stronger octave equivalence (more octaves to explore) or weaker (so spread out that octave relationships are harder to perceive)?
2. How many notes per octave? With ten times the frequency resolution, this species could distinguish ten times as many pitches per octave. Would a 120-note octave be practical? Or would the brain still impose categorical perception in chunks of 12-24?
3. Would the harmonic series still matter? Yes — the physics of vibrating strings and air columns doesn't change. The harmonic series would still be 2:1, 3:2, 4:3, etc. But with 9 octaves of range, the species could hear many more harmonics simultaneously, potentially making very high-ratio intervals feel consonant.
4. How would slower temporal resolution change rhythm? A species that processes rhythmic events ten times more slowly would naturally use tempos ten times slower — perhaps 6 BPM instead of 60 BPM. Would a single "beat" feel like what we feel as a measure? What would "groove" mean at 6 BPM?
This thought experiment reveals what is truly universal about music (the harmonic series, the physics of vibration, wave interference) versus what is specific to human biology (our particular frequency range, our temporal resolution, our cochlear architecture).
11.12 Summary and Bridge to Chapter 12
What We've Learned
This chapter has examined pitch and scales from multiple angles — physical, mathematical, psychological, and cultural. The key conclusions are:
Pitch is perception, not just physics. Frequency is a physical property of sound waves; pitch is what the brain makes of that frequency. Categorical perception, logarithmic encoding, and octave equivalence are all properties of the auditory system, not of sound itself.
The octave is the only true universal. Of all musical intervals, the 2:1 octave has the strongest cross-cultural universality, grounded in the physics of the harmonic series. Everything else — the specific notes within the octave, the number of scale degrees, the sizes of intervals — is culturally variable.
Scale construction is constrained optimization. Building a musical scale means choosing a small set of pitches that maximize consonance, allow for transposition, and remain cognitively manageable. These constraints are partially irreconcilable — you can't have perfectly pure intervals in every key simultaneously. Different cultures have made different compromises.
12-TET is one solution, not the solution. The twelve-tone equal temperament system is practical, flexible, and globally dominant — but it sacrifices acoustic purity for universal transposability. Other systems (just intonation, meantone, maqam, shruti) make different trade-offs, achieving greater purity in restricted contexts.
The tritone is physics made dramatic. The interval most associated with tension, danger, and the uncanny in Western music corresponds to the interval with the most complex frequency ratios and maximum acoustic roughness. Physics and aesthetics align — but culture decides what to do with that alignment.
Bridge to Chapter 12
We have seen that scale construction requires compromise — you can't have all your intervals pure simultaneously. But we've only gestured at why this is so. Chapter 12 digs into the mathematics and physics of this irreconcilable tension: what the Pythagorean comma actually is, how different historical tuning systems have dealt with it, and why the story of tuning in Western music is essentially the story of choosing which notes to get wrong. The chapter culminates in the deep running example: how tuning systems in music parallel energy quantization in physics — with the Choir & the Particle Accelerator providing an extended analogy between harmonic resonance in choral singing and quantum energy levels in particle physics.
✅ Key Takeaways — Chapter 11
- Pitch is the brain's interpretation of frequency; the relationship is logarithmic, not linear.
- Octave equivalence (the 2:1 ratio) is the closest thing music has to a universal physical law.
- Pentatonic scales appear across cultures because they use the simplest frequency ratios (the first intervals in the harmonic series) without requiring compromise.
- The major scale can be derived from three overlapping major triads, each based on the 4:5:6 ratio of the 4th, 5th, and 6th harmonics.
- Twelve-tone equal temperament solves the transposability problem by making every interval slightly impure but every key identical.
- Non-Western scale systems — maqam, raga, gamelan — represent different solutions to the same constrained optimization problem, often using intervals that don't exist in 12-TET.
- The tritone (√2:1 ratio) is acoustically maximally dissonant because it shares the fewest overtones with its harmonic partners.
- Scales serve both melodic and harmonic functions simultaneously, acting as compressed encodings of a musical style's entire grammar.