Case Study 18-1: The Information Theory of the Beatles — Why Their Songwriting Was Informationally Optimal
The Most Analyzed Band in History
No popular music group has been studied more extensively from more angles than the Beatles. Musicologists have analyzed their harmonic progressions, psychologists have measured their emotional impact, economists have studied their commercial strategies, and cultural historians have traced their influence. In the twenty-first century, a new kind of analysis has been added: information theory.
The findings are not what you might expect. The Beatles' chord progressions are, by the standards of jazz or classical harmony, simple and predictable. And yet — information-theoretic analysis suggests that this simplicity is itself a kind of optimality: the Beatles found the harmonic sweet spot for maximizing certain measurable properties of musical engagement, at least within the framework of their target audience and era.
The Beatles' Harmonic Palette
Between 1963 and 1970, the Beatles produced approximately 200 recordings. During this period, their harmonic language evolved significantly, from the relatively simple diatonic harmony of early songs like "She Loves You" and "I Want to Hold Your Hand" to the more harmonically adventurous work of the Sgt. Pepper era and beyond. But even at their most complex, their harmonic language remained more accessible than jazz and far less predictable than the simplest pop.
Music researchers Tim Koozin and Trevor de Clercq have analyzed the chord progressions of large samples of Beatles songs using computational methods. Their findings align with what any experienced Beatles listener would suspect:
- The I chord (tonic, "home") accounts for approximately 28–35% of all chord durations across the catalog
- The V chord (dominant) accounts for approximately 15–20%
- The IV chord (subdominant) accounts for approximately 15–20%
- The vi chord (relative minor) accounts for approximately 8–12%
- Together, these four chords (I, IV, V, vi) account for approximately 65–80% of total chord time in most Beatles songs
This distribution is not uniform (which would maximize entropy) but is also not degenerate (one chord always). It occupies an intermediate region.
The I-V-vi-IV Analysis
The progression I-V-vi-IV has become so ubiquitous in popular music that it has been satirized in sketches by the Axis of Awesome (who demonstrated that hundreds of popular songs can be sung to the same four-chord pattern). In C major, this is C-G-Am-F. Many Beatles songs use this progression or closely related variants.
From an information-theoretic perspective, this progression has an interesting property: it satisfies strong functional harmony expectations (I to V is a standard motion; vi is a convincing substitute for I; IV prepares the return to I) while containing just enough variety to remain engaging.
The unigram entropy of a song using only I, V, vi, IV with probabilities approximately 0.35, 0.20, 0.25, 0.20 (roughly) is:
H ≈ -(0.35 log₂ 0.35 + 0.20 log₂ 0.20 + 0.25 log₂ 0.25 + 0.20 log₂ 0.20) ≈ -(0.35 × (-1.515) + 0.20 × (-2.322) + 0.25 × (-2) + 0.20 × (-2.322)) ≈ -(−0.530 − 0.464 − 0.500 − 0.464) ≈ 1.958 bits per chord
Compare to: complete four-chord uniformity (all equally likely) = log₂(4) = 2 bits. The tonic-weighted distribution gives about 1.96 bits — very close to but below the maximum. The slight tonic emphasis reduces entropy without dramatically reducing it.
The Bigram and Trigram Story
The more revealing analysis comes when we compute the conditional entropy of chord progressions — how much each chord tells us about what comes next.
In the I-V-vi-IV loop, the progression is deterministic once started: I is followed by V, V by vi, vi by IV, IV by I. Conditional entropy within a loop is zero — perfect prediction. But real Beatles songs are not pure four-chord loops; they have verses, choruses, bridges, and pre-choruses that create variation.
The Beatles' structural genius was in managing the tension between the predictable loop (low conditional entropy within sections) and the structural variation across sections (higher entropy at section boundaries). Within a verse or chorus, the listener can predict the next chord with high confidence. At the moment of structural transition — entering the bridge, returning from the bridge, approaching the climax — the next chord is less certain. These are the high-information moments, carefully placed.
Research on Beatles bridge harmonies shows that their bridges (the "B section" in AABA form) frequently introduce chords or progressions not used in the verse/chorus, deliberately elevating the harmonic entropy at the structural midpoint. The return to the chorus after the bridge is a resolution of this elevated entropy — a "harmonic homecoming" that is informationally satisfying in exactly the way a tonal cadence is.
Why This Is "Informationally Optimal"
The phrase "informationally optimal" requires careful qualification. It does not mean that the Beatles' music has the highest possible information content — it clearly does not. And it does not mean that their music is "the best" by any absolute measure. It means something more specific: that their music appears to be near-optimal for its specific function and audience.
Several converging lines of evidence support this:
Commercial success and longevity: The Beatles' catalog has remained popular across six decades and many cultural changes. While commercial success does not equal aesthetic quality, sustained popularity across diverse audiences over long periods provides some evidence that the music is satisfying a widespread cognitive need. Information-theoretically, this may mean that their music hits the entropy sweet spot for a very wide range of listeners.
The Goldilocks zone: Their chord progressions are not so simple (like a pure I-V-I loop) that they bore listeners, nor so complex (like bebop chord changes) that they overload listeners with unresolvable uncertainty. For an audience without specialized jazz training, the Beatles' harmonic language sits at approximately the right level of challenge: enough surprise to maintain interest, enough predictability to maintain comprehension.
Consistent prediction satisfaction: Studies of Beatles songs show that their deviations from the expected chord are usually "explained" within one or two additional chords — the harmonic resolution is usually nearby. This is consistent with a strategy of maximizing "tension-release cycles" within short time spans, generating many dopamine rewards per minute of listening without the sustained unresolved tension that characterizes more complex harmonic music.
Beyond Simple Information Theory: What the Beatles Had
Any honest analysis must acknowledge what information theory cannot explain about the Beatles' success.
The Axis of Awesome demonstrated that the I-V-vi-IV progression underlies hundreds of songs. The vast majority of those songs are not remotely as historically significant as "Let It Be" or "Yesterday." The chord progression is common; the songs are not interchangeable. What differentiates them?
Voice and melody: Lennon's and McCartney's vocal melodies are informationally rich in their pitch sequences even when the harmonic underpinning is simple. The melodies have irregular phrase lengths, unexpected melodic turns, and cross-rhythmic emphases that create local surprises within a globally predictable harmonic frame.
Lyrical-musical interaction: The relationship between lyric rhythm, melodic rhythm, and harmonic rhythm creates a multi-layered information texture that simple chord analysis cannot capture.
Production innovation: The Beatles' recording techniques (double-tracking, backward tape loops, orchestral arrangements) created sonic information — in timbre, texture, and spatialization — that is entirely orthogonal to harmonic entropy.
Cultural meaning: Songs like "Yesterday" and "Hey Jude" accumulate cultural significance through association, reference, and communal experience that cannot be predicted from any analysis of their information content.
The information-theoretic analysis is genuinely illuminating: it explains part of why Beatles music is accessible, memorable, and broadly appealing. But it does not explain why Beatles music is great rather than merely competent. For that, we need frameworks beyond information theory.
Discussion Questions
-
The I-V-vi-IV progression is used in both highly regarded pop songs and in forgettable commercial jingles. What factors determine whether a low-entropy harmonic framework becomes great music or merely adequate music? Does information theory have anything to say about this distinction?
-
The Beatles evolved from the simple three-chord rock of their early period to the more harmonically adventurous music of the White Album and Abbey Road. Does this trajectory make sense from an information-theoretic perspective? Were they responding to increasing listener sophistication, or to their own artistic ambitions, or both?
-
The Axis of Awesome showed that many pop songs use the same four chords. Does this mean these songs are "saying the same thing"? Or does it mean that the interesting information in these songs is located somewhere other than the chord progressions? If so, where?
-
How would you design a study to test whether Beatles-like harmonic entropy (moderate, with tonic-weighted distribution and low conditional entropy within sections) is genuinely near-optimal for Western pop listeners? What would "optimal" mean in this context, and how would you measure it?