> "Music is not a universal language. Every culture has its own musical language, and they are often mutually unintelligible."
In This Chapter
- 30.1 The Ethnomusicological Challenge
- 30.2 What All Music Has in Common — The Mehr et al. (2019) Study
- 30.3 The Universal Physics of Music
- 30.4 The Case for Cultural Construction
- 30.5 Indian Classical Music: Raga as Emotional Cosmology
- 30.6 West African Music: Rhythm as Primary Structure
- 30.7 Gamelan Music: Inharmonic Spectra and Alternative Consonance
- 30.8 Maqam: Microtonal Physics in Arabic and Turkish Music
- 30.9 Indigenous Music: Australia, Americas, Siberia
- 30.10 The Spotify Spectral Dataset: Cross-Cultural Spectral Comparison
- 30.11 Music and Language: Are They the Same System?
- 30.12 The Evolution of Music
- 30.13 Is Music a Cultural Universal? — The Synthesis Argument
- 30.14 Theme 2 Final Answer: The Physics Constrains, Culture Constructs Within Constraints
- 30.15 Part VI Synthesis: The Physics of Perception — What We Have Learned Across Chapters 26–30
- 30.16 Summary and Bridge to Part VII
Chapter 30: Music Across Cultures — Universal Physics, Diverse Structures
"Music is the universal language of mankind." — Henry Wadsworth Longfellow (1835)
"Music is not a universal language. Every culture has its own musical language, and they are often mutually unintelligible." — Bruno Nettl, ethnomusicologist (1983)
Both of these statements contain important truths. Both are also importantly wrong. The question of what, if anything, is universal in music has occupied ethnomusicologists, music psychologists, evolutionary biologists, and physicists for more than a century, and it is a question that — unlike many questions of comparable ambition — has begun to yield to rigorous empirical investigation over the past two decades.
The resolution, as this chapter will argue, is not a choice between the two positions but a more nuanced picture: physics constrains what is possible in musical structure, and those constraints are universal; culture constructs within those constraints, and those constructions are enormously diverse. Understanding where the constraints come from — from acoustic physics, from auditory system biology, from the mathematics of rhythm — is the key to distinguishing what is genuinely universal from what is merely widespread, and what is culturally contingent from what is acoustically impossible.
This chapter is the culmination of Theme 2 — the tension between universal structures and cultural specificity — that has run throughout Part VI. It draws on ethnomusicology, acoustic physics, evolutionary biology, and the cross-cultural data of the Spotify Spectral Dataset to build toward a synthesis: a principled account of which aspects of music are universal and why.
30.1 The Ethnomusicological Challenge
Before asking what is universal in music, we need to reckon with the field that has spent the longest time listening to music from around the world: ethnomusicology. And ethnomusicology's dominant stance, for most of the 20th century, was one of profound skepticism toward claims of musical universals.
The Comparative Musicology Tradition and Its Problems
The field now called ethnomusicology grew out of "comparative musicology" in the late 19th and early 20th centuries — a discipline characterized, in its early decades, by a troubling tendency to rank the world's musics hierarchically, with Western European art music at the pinnacle and the music of colonized peoples at the bottom. This was not fringe scholarship; it was the mainstream. The comparative musicologists collected recordings, analyzed scales and rhythms, and drew conclusions that were as much products of imperial ideology as of careful empirical reasoning.
The reaction — ethnomusicology's turn, in the mid-20th century, toward radical cultural relativism — was in many ways an appropriate correction. Bruno Nettl, Alan Merriam, and others insisted that music could only be understood in its cultural context, that Western musical categories could not be imposed on non-Western musical systems, and that the search for universals was likely a disguised search for Western features common to the musics that Western scholars most often heard.
This was good methodological hygiene. It produced several generations of deeply contextualized, culturally sensitive studies of specific musical traditions. It also, however, made the field largely resistant to any claim that might sound like a universal — resistant sometimes to the point of preemptively dismissing evidence.
Why the Skepticism Is Warranted — and Where It Overreaches
The ethnomusicological caution about universals is warranted for several reasons:
Sampling bias: Most claims about musical universals, prior to the 2000s, were based on music from a highly non-representative sample of the world's cultures — primarily the WEIRD populations (Western, Educated, Industrialized, Rich, Democratic) that dominate academic research. A "universal" found in European, American, and a handful of South American or East Asian musical traditions is not a universal; it is a regularity in a biased sample.
Category problems: The concepts musicologists use — "pitch," "scale," "rhythm," "melody," "harmony" — are largely developed within the Western tradition. Applying them cross-culturally may force non-Western music into ill-fitting analytical boxes, making it appear more like Western music than it actually is.
Publication bias: Cross-cultural studies that find universals are more likely to be published and cited than studies that find cross-cultural diversity, creating a distorted literature.
At the same time, the skepticism can overreach. The position that no feature of music is universal — that every dimension of music is entirely culturally contingent — makes a prediction about the cross-cultural data that, as we will see in Section 30.2, the data does not support. Cultural relativism is not the same as cultural solipsism: acknowledging that culture shapes music powerfully does not require claiming that biology and physics have no cross-cultural influence.
💡 Key Insight: The Right Question Is "Universal" vs. "Widespread" vs. "Culturally Specific"
Not all non-random cross-cultural patterns are universals in the strict sense. A useful three-way distinction: - Universals: Features found in every documented musical culture with no known exceptions (e.g., music exists; music involves pitch and rhythm together) - Near-universals or statistical universals: Features found in the overwhelming majority of cultures but with some exceptions (e.g., octave equivalence, preference for small-integer frequency ratios) - Widespread but culturally variable features: Features found in many cultures but varying substantially in how they are implemented (e.g., use of scales, use of meter, use of formal repetition)
Conflating these categories — declaring something "universal" when it is merely widespread — is as much a methodological error as dismissing universals entirely.
30.2 What All Music Has in Common — The Mehr et al. (2019) Study
The most ambitious empirical effort to identify musical universals to date was published in Science in 2019: Samuel Mehr, Manvir Singh, and colleagues' study of music across 60 human societies. The study represents a methodological advance over previous work in nearly every dimension — sample size, geographic breadth, control for sampling bias, and methodological sophistication — and its results have substantially shifted the debate.
The Study Design
The research team assembled recordings from 60 human societies, drawn from a database of ethnographic recordings (the HRAF collection) and supplemented by targeted field recordings. The 60 societies were selected to represent maximal geographic and cultural diversity, sampling from all inhabited continents and multiple linguistic families. The recordings spanned hunter-gatherer societies, small-scale agricultural communities, pastoral nomads, and larger traditional societies.
From these recordings, the team selected 118 songs representing four behavioral contexts that appear cross-culturally: lullabies, dance songs, healing/ceremonial songs, and love songs. Crucially, songs were selected for minimal Western contact, specifically to avoid the confound of Western musical influence spreading globally through colonialism and mass media.
Naive listeners from 30 countries — people with no knowledge of the cultures producing the songs — rated each song on dimensions including perceived behavioral purpose (is this a lullaby? a dance song?) and perceived emotional qualities. Would people with no training in a given musical tradition be able to identify its behavioral function from acoustic features alone?
Key Findings
Finding 1: Behavioral functions are acoustically identifiable across cultures. Naive listeners from 30 countries could identify lullabies, dance songs, healing songs, and love songs at rates substantially above chance, even when the songs came from cultures entirely unknown to them. Lullabies were the most consistently identifiable: slow tempo, descending melodic contour, narrow pitch range, and smooth rhythm seem to be universal acoustic signatures of music intended to soothe infants.
Finding 2: Certain acoustic features are consistent cross-culturally. The most cross-culturally consistent features were related to tempo (songs with similar functions tend to have similar tempos regardless of culture), melodic contour (lullabies tend to descend; dance songs tend to have more melodic variety), and rhythmic regularity (dance songs are more metrically regular). These are not subtle statistical tendencies; they are robust enough to enable above-chance identification by culturally naive listeners.
Finding 3: The emotional valence of musical features shows cross-cultural consistency in some dimensions. Faster tempos are perceived as more positive/happy and higher arousal across most cultures. Certain timbral features (roughness, breathiness) show cross-cultural consistency in their emotional associations. But the cross-cultural consistency of emotional valence has important limits — it is stronger for arousal than for valence, and it breaks down more at the level of specific emotions.
Finding 4: Cross-cultural variation remains large. The finding of some cross-cultural consistency does not mean cross-cultural uniformity. The study explicitly documents enormous variation in scale structure, tonal system, rhythmic organization, and formal structure. The universals it finds are at a fairly high level of abstraction (lullabies across cultures are slow and smooth; dance songs are fast and rhythmically regular) rather than at the level of specific acoustic parameters.
📊 Data Box: Mehr et al. (2019) at a Glance
| Feature | Finding |
|---|---|
| Societies sampled | 60, from all inhabited continents |
| Songs analyzed | 118, selected for minimal Western contact |
| Listener countries | 30 countries |
| Naive listeners | ~750 participants |
| Lullaby identification accuracy | ~70–75% (chance = 25%) |
| Dance song identification accuracy | ~65% (chance = 25%) |
| Strongest cross-cultural acoustic predictor | Tempo (fast for dance; slow for lullabies) |
| Strongest limits on universality | Scale structure, tuning, mode, harmonic language |
30.3 The Universal Physics of Music
The Mehr et al. findings beg a deeper question: why are some features of music consistent across cultures? The answer, for at least some features, lies in physics and the mathematics of sound.
Octave Equivalence: A Near-Universal With a Physical Basis
Virtually every documented musical culture treats pitches an octave apart (a 2:1 frequency ratio) as in some sense equivalent — giving them the same name, using them interchangeably in melodies, treating them as the same "note." This near-universal is explained by acoustic physics: a tone at 440 Hz (A4) and a tone at 880 Hz (A5) share all of their harmonic overtones — every partial of the higher tone is also a partial of the lower tone. The auditory system registers this overlap as a strong perceptual similarity. The near-universality of octave equivalence is thus grounded in the physics of harmonic spectra.
Small-Integer Frequency Ratios: Consonance Has a Physical Basis
The ratios that musical cultures most commonly use as the basis for scales and intervals are small-integer frequency ratios: 2:1 (octave), 3:2 (perfect fifth), 4:3 (perfect fourth), 5:4 (major third). As discussed in Chapter 26, intervals defined by small-integer ratios produce less beating (because the upper harmonics align) and are perceived as more consonant by most listeners.
The preference for small-integer ratios is not universal in the strict sense — many cultures use intervals that deviate substantially from simple ratios (quarter-tones in Arabic maqam, stretched octaves in gamelan). But it is a strong statistical tendency: when cross-cultural databases of scale intervals are analyzed, the distribution shows peaks near the simple-ratio intervals, with much less common use of complex-ratio intervals.
Scales With Seven or Fewer Notes Per Octave
Across a cross-cultural sample of scales, the distribution of pitches per octave is strikingly non-uniform. The most common scale sizes are 5, 6, and 7 notes per octave. Twelve-tone equal temperament's use of all 12 chromatic pitches is actually highly unusual cross-culturally; Western classical music is one of the few traditions that routinely uses all 12 pitch classes within a composition.
The clustering around 5–7 notes per octave may reflect the limits of working memory and short-term auditory categorization — a scale with 5–7 pitches produces intervals large enough to be reliably discriminated and held in working memory, while a scale with 11 or 12 pitches of similar sizes would be cognitively much more demanding. The "magic number" 7±2 meets musical scale structure.
Rhythm and Pulse: Temporal Universals
The use of pulse — a regular isochronous beat — as an organizing framework for rhythm appears to be a near-universal. While the specific metric patterns (duple, triple, asymmetric) and the tempo of the pulse vary dramatically, the principle of a regular temporal grid appears in virtually every documented musical tradition. This is likely driven by the motor system: the human body's natural locomotion rhythms (walking, heartbeat, breathing) provide a biological foundation for isochronous pulse, and music that aligns with motor rhythms promotes synchronized movement — the bodily basis of dance.
💡 Key Insight: Physics Constrains, It Does Not Determine
The physical regularities described in this section — octave equivalence, small-integer ratio preference, scales of 5–7 notes, isochronous pulse — define a space of musically viable options. They do not specify which option within that space any particular culture will choose. The physics explains why pentatonic scales are common across cultures (five pitches allow several small-integer-ratio intervals with minimal beating) without explaining why Japanese pentatonic scales have a different intervallic structure than West African pentatonic scales. The physics provides the floor; culture builds the house.
30.4 The Case for Cultural Construction
If the previous section traced the physical roots of musical universals, this section examines the dimensions along which music varies in ways that cannot be explained by physics alone — dimensions where cultural choice, historical contingency, and social function drive the construction of radically diverse musical systems.
Meter and Rhythmic Organization
The existence of pulse appears to be near-universal. The specific organization of that pulse into metric groups, the handling of rhythmic asymmetry, and the cognitive framework for experienced time are dramatically culturally variable.
Western classical music operates primarily in duple (2/4, 4/4) and triple (3/4) meters, with compound meter (6/8, 12/8) as additional options. These meters subdivide a measure into equal groups of 2 or 3 beats.
But many musical traditions organize rhythm in ways that Western metric theory struggles to describe. West African music, as we will see in Section 30.6, organizes rhythm through interlocking patterns rather than hierarchical metric groups. Bulgarian folk music and traditional Balkan music routinely use asymmetric meters: 5/8, 7/8, 9/8, 11/8, 13/8 — patterns that Western music theory can notate but that Western listeners without exposure typically cannot internalize as natural.
Mode and Scale Structure
Within the constraint of using 5–7 pitches per octave, the specific intervallic relationships that different cultures use vary enormously. Western music uses major and minor scales as its primary tonal resources. Indian classical music uses 72 parent scales (the melakarta system) as the basis for hundreds of ragas. Arabic and Turkish music uses the maqam system, which includes quarter-tone intervals absent from Western scales. Gamelan music uses scales that do not align with any simple-ratio framework, instead using the inharmonic spectra of the bronze instruments themselves as the organizing principle.
These are not minor variations on a common theme. They represent fundamentally different theories of what pitch relationships are meaningful and how they function in musical structure.
Tuning and Temperament
Even within Western music, the choice of how to tune intervals has been historically contested. Just intonation (tuning based on exact small-integer ratios) produces the "purest" consonances but requires different interval sizes depending on the key. Equal temperament (the current standard, dividing the octave into 12 equal semitones of 100 cents each) produces slightly "impure" intervals but allows free transposition. Meantone temperament, common in the Renaissance and Baroque, was a different compromise between these competing demands.
Cross-culturally, the divergence is even larger. Gamelan instruments are deliberately tuned in paired "beating" dyads; the tremulous quality of the beating is an aesthetic goal, not a failure of tuning accuracy. Arabic maqam uses intervals smaller than the Western semitone that simply do not exist in equal temperament.
Form, Function, and Social Context
The social function of music — who performs, who listens, in what contexts, for what purposes — varies as dramatically as any other musical parameter. In some traditions, music is inseparable from ritual and performed only in sacred contexts. In others, music is primarily social entertainment. In some, there is a sharp distinction between composers and performers; in others, composition and performance are unified. In some, collective participation is the norm; in others, specialist performers perform for passive audiences.
These social structures are not mere context for music — they shape the music itself. A musical tradition in which every community member is expected to participate in making music will look very different from one that valorizes specialist virtuosity. The music reflects and reinforces its social function.
30.5 Indian Classical Music: Raga as Emotional Cosmology
Indian classical music — the Hindustani tradition of the north and the Carnatic tradition of the south — represents one of the most elaborate and theoretically sophisticated musical systems ever developed. At its core is the concept of raga, which is simultaneously a scale, a melodic grammar, a set of mood associations, and a cosmological category.
What Is a Raga?
A raga is not simply a scale, though it contains a scale. It is better understood as a "melodic personality" — a set of rules specifying:
- The notes of the scale (which of the 12 chromatic pitches, in which octave positions, are available)
- Ascending and descending rules (the scale may include different pitches going up versus coming down — many ragas are asymmetric in this way)
- Characteristic phrases (specific melodic motifs that identify the raga and must appear in any improvisation within it)
- Emphasized pitches (certain scale degrees are stressed more than others, giving the raga its characteristic flavor)
- Ornaments (specific microtonal inflections, glides, and ornaments associated with the raga)
- Time of day (many ragas are associated with specific times — morning, evening, late night — and are traditionally performed only at those times)
- Season (some ragas are associated with specific seasons, particularly the monsoon)
- Emotional character (the rasa, or aesthetic emotion, that the raga evokes — including devotion, desire, pathos, heroism, and others)
The Physics of Raga
From a physics standpoint, ragas are structured in ways that map onto acoustic principles while extending far beyond them.
The asymmetric scales of many ragas (different pitches ascending versus descending) produce distinctive melodic shapes that are themselves acoustic signatures — a trained listener can identify certain ragas purely from the shape of a melodic phrase, even without hearing the full scale. This melodic shape encodes the raga as a contour pattern, the same kind of holistic pattern recognition that (as Chapter 29 noted) underlies musical chunking in general.
The microtonal inflections of raga performance go beyond the 12-note chromatic scale: a pitch may be inflected slightly sharp or flat in characteristic ways, and these inflections are acoustically detectable and musically essential. They are not imprecision; they are a refined expressive vocabulary that operates at the level of cents (hundredths of a semitone) rather than semitones.
The 72 Melakarta Parent Scales
The Carnatic tradition classifies ragas using a system of 72 parent scales (melakarta), derived by systematically varying the 12 chromatic pitches across the positions of the scale. This is a combinatorial system: given that a 7-note scale must include the tonic and the octave, and given 12 chromatic options for each of the remaining 5 positions, the melakarta system organizes all musically tractable 7-note scales into a comprehensive taxonomy.
This systematization is not merely theoretical. It serves as a pedagogical framework (students learn groups of ragas within each melakarta), a compositional resource (knowing the complete set of parent scales enables systematic exploration of melodic possibilities), and a theoretical achievement (it represents one of the most complete pre-20th-century efforts to mathematically enumerate a musical possibility space).
💡 Key Insight: The Raga System as Constrained Creative Space
The raga system is a perfect illustration of Theme 3 (the role of constraint in creativity). The elaborate rules of a raga — the prescribed phrases, the time-of-day associations, the ornaments — do not limit an improviser's creativity; they define the creative space within which improvisation is meaningful. A trained listener knows what to expect within a given raga, and the improviser's art lies in fulfilling and subverting those expectations in interesting ways. The constraint is the creativity's foundation.
30.6 West African Music: Rhythm as Primary Structure
If Indian classical music places melody at the center of its conceptual and aesthetic universe, West African music — the tradition from which virtually all African-American popular music descends — places rhythm there. This is not to say that melody is absent (it is not), but that the most elaborate structural work in much West African music is done by rhythm.
The Physics of Interlocking Rhythms
The organizing principle of much West African drumming ensemble music is polyrhythm: the simultaneous performance of two or more distinct rhythmic patterns that interlock to produce a composite rhythm more complex than any individual part. The physics of polyrhythm is the physics of temporal superposition — the same principle as acoustic interference, but applied in the time domain to rhythmic patterns.
The most studied polyrhythmic structure is the 3-against-2 pattern (three evenly spaced beats in the time of two), which is the simplest non-trivial polyrhythm. More elaborate patterns include 4-against-3, 5-against-4, 7-against-4, and the complex multi-layered patterns of traditional Ewe, Yoruba, and other West African drumming ensembles.
When multiple rhythmic patterns layer over each other, the result has emergent properties that no individual part contains. New accents appear at the coincidences of multiple patterns; gaps that feel like rests within one pattern are filled by other parts; the overall texture has rhythmic density and variety that would be impossible for a single performer. This is musical emergence — the composite is more than the sum of its parts.
Timeline Patterns: The Rhythmic Backbone
Many West African musical traditions use a fixed timeline pattern — often struck on a bell or high-pitched percussion instrument — as the rhythmic reference against which all other parts are organized. The timeline is not a simple metronome beat but an asymmetric repeating pattern that provides multiple reference points without imposing a simple grid.
The most famous timeline pattern is the 12-pulse pattern called the clave (from its use in Afro-Cuban music), which divides 12 equal time units as 3+3+2+2+2 or 2+3+2+2+3. This pattern is asymmetric — it does not divide evenly into equal halves or thirds — but it provides enough temporal landmarks that an experienced listener and performer can orient to it immediately.
The physical reason that such asymmetric timeline patterns are effective is that they provide multiple onset times spread across the rhythmic cycle, minimizing gaps between reference points and maximizing the information content of the pattern. A symmetric (completely regular) timeline provides redundant information — each beat says the same thing. An asymmetric timeline, optimally designed, says something slightly different with each stroke.
⚠️ Common Misconception: West African Music Is "All Rhythm, No Melody"
This stereotype, common in Western pop music discourse, is factually wrong. West African musical traditions include rich melodic and harmonic material — intricate call-and-response patterns, elaborate vocal harmonies in traditions like Ewe and Akan choral music, and carefully developed melodic instruments including the kora (a 21-string harp-lute), the balafon (a wooden xylophone), and numerous wind and string instruments. Reducing the richness of West African music to "just rhythm" is both analytically wrong and culturally reductive.
30.7 Gamelan Music: Inharmonic Spectra and Alternative Consonance
Balinese and Javanese gamelan music represents perhaps the most physically interesting case in the ethnomusicology of musical structure — because gamelan is a system in which the relationship between instrument physics and scale structure violates the standard Western model in a principled and deliberate way, and the result is a completely coherent, internally consistent acoustic system.
The Physics of Gamelan Instruments
Gamelan ensembles consist primarily of bronze percussion instruments: metallophones (like the gender and saron), gongs, and bells. Unlike a string or a wind column, a bronze key or gong does not vibrate in harmonic modes. Its vibrational modes — determined by its specific geometry and material properties — are inharmonic: the overtones are not integer multiples of the fundamental frequency.
For a standard bronze gamelan key, the first few partials might occur at approximately 1f, 1.6f, 2.0f, 2.8f, 4.1f... rather than the harmonic 1f, 2f, 3f, 4f, 5f... of a string or wind instrument. This inharmonic spectrum changes the rules of consonance: the patterns of partial overlap that make intervals consonant or dissonant in Western harmonic theory simply do not apply.
The Scale-Spectrum Match
Here is where gamelan becomes truly remarkable from a physics standpoint. Balinese gamelan tuning deliberately matches the scale structure to the inharmonic spectrum of its own instruments. The intervals of the pelog scale (the five- or seven-note scale of Balinese gamelan, with characteristic intervals of approximately 120, 270, 415, 540, and 675 cents — quite different from any Western scale) produce minimal beating when played on instruments with the inharmonic spectrum of the bronze keys used.
The gamelan is, in effect, a closed acoustic system: its scale and its instruments are co-designed (through centuries of practice and refinement, if not through explicit acoustic theory) such that the chosen intervals are the consonant intervals for those particular instruments. This is consonance — but consonance in a completely different physical framework than the one that generates Western harmonic theory.
This has a profound implication: consonance is not absolute. It is relative to the spectral structure of the instruments being played. Western consonance is what happens when the harmonic overtone series of strings and wind columns determines which intervals produce the least beating. Gamelan consonance is what happens when the inharmonic spectrum of bronze keys determines the same. Both are physically principled; neither is more "natural."
The Beating Pairs
One more remarkable feature of gamelan tuning: pairs of instruments that play the same pitch are deliberately tuned slightly apart — creating a gentle beating (amplitude modulation at frequencies of 5–7 Hz) that is an intended aesthetic feature, not a tuning error. This tremulous, shimmering quality is the characteristic sound of gamelan and is integral to its aesthetic. The Balinese term for this quality is ombak (wave). Far from being a failure to tune precisely, the beating is the instrument maker's deliberate choice — a feature of the design.
💡 Key Insight: Gamelan Proves That Consonance Is Relative to Spectrum
The gamelan case provides a concrete demonstration of the broader principle that physical constraints on music are real but not uniquely determined. The physics of overtones constrains which intervals are consonant — but "consonance" means minimal beating, and which intervals minimize beating depends on which overtones are present. Western music has built its harmonic system around the harmonic spectrum of strings and wind instruments. Gamelan has built an equally coherent system around the inharmonic spectrum of bronze percussion. There is no acoustic reason to prefer one system over the other; they are equally principled solutions to the problem of building a harmonically coherent music from available instruments.
30.8 Maqam: Microtonal Physics in Arabic and Turkish Music
The maqam system — the basis of classical Arabic, Turkish, Persian, and related musical traditions — is organized around melodic modes analogous in some ways to the raga and the Western mode, but with a critical acoustic difference: maqam includes intervals smaller than the Western semitone (100 cents), commonly quarter-tones (50 cents) and three-quarter-tones (150 cents).
Quarter-Tone Physics
A quarter-tone is an interval of 50 cents — exactly half a Western semitone. In equal temperament, there is no instrument pitch that falls between E and F or between B and C (the two semitone pairs in the Western scale). In maqam tuning, these "in-between" pitches are musically significant, with their own identities and functional roles.
The physics of quarter-tones is straightforward: they exist on the same frequency continuum as all other pitches. The question is whether the auditory system can distinguish them from the semitone pitches on either side — and the answer is clearly yes, for trained listeners. Just-noticeable differences for trained musicians are well below 25 cents; quarter-tone pitches are acoustically distinct.
What makes quarter-tone intervals musically functional — rather than merely acoustically distinct — is the maqam system's organization. Each maqam specifies not only a scale but a set of melodic rules: characteristic phrases, approach patterns, cadential formulas. The quarter-tone pitches are not "off-pitch versions" of nearby semitones; they are full members of the maqam's melodic grammar with their own functions.
Maqam and Melodic Grammar
The comparison to raga is instructive. Both maqam and raga are melodic modes with elaborate rules beyond mere scale content. Both include characteristic phrases that identify the mode. Both have emotional and expressive associations. And both operate in a pitch space that is not commensurable with the Western 12-semitone chromatic scale.
The key difference is that maqam is used in a tradition with a more extensively developed harmonic practice (particularly in Ottoman and contemporary Turkish music) than Indian classical music. Turkish music has developed a maqam-based polyphony and accompaniment practice that shows how microtonal melodic systems can coexist with at least some vertical harmonic organization.
⚠️ Common Misconception: Quarter-Tones Are "Out of Tune" Semitones
Western listeners often perceive quarter-tone intervals as "wrong" or "out of tune" — a slightly flat major third, a slightly sharp minor second. This perception is a product of calibration to the 12-semitone Western scale, not a perception of objective acoustic incorrectness. Quarter-tone intervals occupy their own positions in the frequency continuum with their own acoustic properties and their own functional roles in the musical systems that use them. Perceiving them as "out of tune" is the same categorical error as a gamelan musician hearing equal-tempered perfect fifths as slightly "off" compared to the slightly different intervals of pelog tuning — which they are, from that system's perspective.
30.9 Indigenous Music: Australia, Americas, Siberia
The diversity of music across the world's indigenous traditions challenges any remaining tendency toward a simplified model of musical universals and provides the strongest evidence for cultural construction within physical constraints.
Australian Aboriginal Music
Aboriginal Australian music is among the most distinctive in the world. The primary instrument, the didgeridoo (yidaki), is one of the world's oldest wind instruments — a wooden tube up to two meters long, played with a technique of circular breathing that allows continuous sound production. The didgeridoo produces a rich, harmonically complex drone with controllable formant resonances created by lip, tongue, and vocal tract manipulation.
Melodically, much Aboriginal Australian music operates in very narrow pitch ranges — sometimes spanning only two or three semitones — with pitch functioning less as a carrier of melodic structure and more as a carrier of speech content. Many Aboriginal Australian musical traditions are inseparable from language: songs carry specific linguistic content, often in sacred languages that are themselves musically organized.
The temporal organization of Aboriginal Australian music often follows the rhythms of the natural world — the footsteps of specific animals, the movements of specific weather patterns — rather than abstract metric grids. Music, in many traditions, is a form of environmental mapping rather than pure abstract sound structure.
Native American Music
The diversity within "Native American music" is vast enough to resist generalization — hundreds of distinct cultural traditions, each with its own musical system, social function, and acoustic properties. Some general features:
Melodic descent is common across many traditions — phrases typically begin at a higher pitch and descend over the course of the phrase. This descending contour is one of the features identified by the Mehr et al. study as cross-culturally common for certain musical functions.
Vocal timbre varies dramatically from tradition to tradition. Some traditions use a tense, nasally produced vocal quality that is entirely different from Western operatic production; others use a chest-resonant quality more similar to Western folk singing. Timbre is culturally specific even when melody is cross-culturally recognizable.
The drum, in many traditions, is not merely a rhythmic instrument but a sacred object with cosmological significance. The physics of the drum — its membrane vibration, its fundamental frequency, its decay characteristics — are inseparable from its spiritual meaning. The physics does not determine the meaning, but the meaning is always realized through specific physical properties.
Siberian Throat Singing (Tuvan and Related Traditions)
Tuvan khoomei (throat singing) produces a phenomenon that is acoustically remarkable: a single singer simultaneously producing what sounds like two or more distinct pitches. The mechanism involves the singer establishing a stable fundamental frequency with the vocal folds while simultaneously shaping the vocal tract to create a resonant cavity that strongly amplifies a specific overtone of the harmonic series — a frequency that may be 3 to 12 times the fundamental.
The physics is identical to the physics of vocal tract formants (discussed in Chapter 27), but taken to an extreme: by precisely controlling the shape of the lips, tongue, soft palate, and larynx, the Tuvan singer creates a narrow-band resonance that isolates a single overtone and makes it audible as a separate melodic voice. This is not an acoustic illusion; the separate frequency is physically present in the sound, simply selectively amplified.
Khoomei demonstrates that the human vocal tract has capacities that most singing traditions do not develop, and that culturally specific practice can unlock physical capacities latent in all human vocal anatomy.
🔵 Try It Yourself: Overtone Singing Basics
Find a quiet room and try this exercise. Sing a comfortable, sustained low pitch — something near the bottom of your comfortable range. While holding that pitch steady, slowly change the shape of your mouth from an "oo" vowel to an "ee" vowel. As you do, listen carefully to the quality of the sound. You should hear the timbre changing — and if you listen carefully, you may hear a faint higher frequency appearing and shifting as you change the vowel shape. That higher frequency is an overtone of your fundamental, being selectively amplified by the resonance of your changing vocal tract. This is the basic physical mechanism of overtone singing, accessible in a crude form to anyone with a normal voice.
30.10 The Spotify Spectral Dataset: Cross-Cultural Spectral Comparison
🔗 Running Example: The Spotify Spectral Dataset
The Spotify Spectral Dataset — 10,000 tracks, 12 genres, extracted acoustic features — provides a contemporary data window into the cross-cultural variation in musical sound. The 12 genres in the dataset span several cultural traditions and musical systems:
- Western art music (classical)
- Jazz (African-American, 20th century)
- Blues (African-American, Mississippi Delta tradition)
- Rock (Western pop/rock tradition)
- Electronic/EDM (Global, technology-mediated)
- Hip-hop/rap (African-American, global diaspora)
- Latin (multi-national: salsa, reggaeton, cumbia)
- Afrobeats (West African: Nigeria, Ghana)
- Indian classical (Hindustani and Carnatic traditions)
- Middle Eastern (Arabic/Turkish maqam)
- East Asian (including Chinese traditional, J-pop, K-pop)
- World/folk (a heterogeneous catch-all category)
What Spectral Features Show Cross-Genre Consistency
Despite enormous variation in other dimensions, certain spectral features show striking cross-genre consistency in the dataset:
Spectral centroid: The "center of gravity" of the frequency spectrum — loosely, how "bright" or "dark" a sound is — shows consistent genre-typical ranges. But across all genres, the centroid range for the most common vocal frequencies (roughly 300–3,000 Hz) is consistently represented. This may reflect the cross-cultural centrality of the human voice as a musical reference: all music, even instrumental music, is organized around the frequency range of human vocalization.
Fundamental frequency distribution: The distribution of fundamental frequencies used in melodies — across all 12 genres — shows that the most common melody frequencies cluster in the range of 200–800 Hz, corresponding to the middle-to-upper range of the human voice. This cross-genre consistency is striking, given that the 12 genres use radically different scales, tunings, and melodic grammars. The shared melodic frequency range appears to be a biological anchor point — the pitch range where the human auditory system is most sensitive and where the human voice naturally operates.
Temporal regularity (rhythm): The degree of temporal regularity — how close the onset timing of notes is to a perfectly regular grid — shows interesting cross-genre variation. Classical and jazz show moderate temporal regularity with systematic deviations (rubato and swing, respectively). Rock and electronic music show high temporal regularity (quantized to a grid). Hindustani and Carnatic classical music show a distinctive pattern: high regularity in the rhythmic cycle structure (tala) combined with elaborate local micro-timing variations in melodic improvisation.
What Spectral Features Vary Most Across Genres
Harmonic complexity: The degree to which multiple simultaneous frequencies (harmonics) contribute to the composite signal varies enormously. Electronic and hip-hop tracks show high spectral complexity (many synthesized harmonics); Indian classical solo recordings show less spectral complexity in many cases; gamelan-influenced tracks show a distinctive inharmonic spectral structure that is immediately distinguishable from harmonic spectra.
Timbre space: The combination of spectral centroid, spectral spread, and spectral flux (how rapidly the spectrum changes over time) defines a "timbre space" that differs substantially across genres. Jazz and blues use instruments (acoustic piano, trumpet, saxophone) with rich, time-varying harmonic spectra. Electronic music uses synthesized timbres that can span a range not available from acoustic instruments. Afrobeats and Latin genres show characteristic percussive spectral profiles from their instrumentation.
Dynamic range: Classical music shows the widest dynamic range (the ratio of loudest to quietest passages); electronic and pop genres show heavily compressed dynamics (the loudness war effect, discussed in Chapter 21); Indian classical music shows moderate dynamic range with characteristic crescendo structures within improvised passages.
📊 Key Finding: The Spectral Dataset Shows Both Universal Anchors and Genre-Specific Construction
The consistent cross-genre features (vocal frequency range for melody, broad temporal regularity) correspond closely to the physical and biological universals identified in Sections 30.3 — they are the acoustic fingerprints of constraints imposed by human auditory biology and the physics of harmonic sound. The highly variable features (timbre, harmonic complexity, dynamic structure) correspond to the culturally constructed dimensions of music — the dimensions where, within the physical possibility space, radically diverse choices have been made.
30.11 Music and Language: Are They the Same System?
The relationship between music and language is one of the most debated questions in cognitive science. Both involve structured sequences of sound, both are universal human behaviors, both develop in early childhood through exposure and practice, and both recruit overlapping neural systems. The question of whether they are fundamentally the same cognitive system — or two distinct systems that overlap in interesting ways — has generated substantial empirical and theoretical work.
The Chomskyan View and Hauser-Chomsky-Fitch (2002)
Noam Chomsky's generative grammar framework suggests that the core of human language — the syntactic capacity for hierarchical, recursive structure — is uniquely human and uniquely linguistic. Hauser, Chomsky, and Fitch's 2002 paper argued for "narrow language faculty" as distinct from "broad language faculty": the recursive syntactic capacity was proposed to be specifically linguistic, while other aspects of language (sound categorization, sequential learning, social pragmatics) are shared with music and other cognitive systems.
On this view, music uses some of the same cognitive machinery as language (rhythm, sequence, categorical perception of sound) but lacks the recursive hierarchical structure that Chomsky considers the defining feature of language. Music and language would be overlapping but non-identical systems.
The Dissenters: Shared Syntax Between Music and Language
Several music theorists and cognitive scientists have argued against the Chomskyan sharp distinction. Stefan Koelsch and colleagues have documented that music, like language, has a syntax — a set of structural rules that listeners track implicitly and that violate the neural systems that also process linguistic syntax when violated. Music-syntactic irregularities (unexpected chords, unresolved harmonic functions) produce ERP (event-related potential) components in EEG that are similar to the brain's response to syntactic violations in language.
Jay Schulkin and others have argued for "musical grammar" as a genuine instance of hierarchical, rule-governed structure — not identical to linguistic grammar, but not radically different either.
Current Neuroscience: Overlapping But Not Identical
The current state of the evidence supports a middle position: music and language share substantial neural resources (particularly in the left temporal and frontal regions associated with syntactic processing and sequential organization) but are not identical. The overlap is greatest for: - Sequential rule learning - Categorical sound discrimination - Metric/rhythmic processing - Working memory for sequences
The overlap is smallest for: - Lexical reference (words have meanings that point to the world; musical tones do not, in the same direct way) - Hierarchical constituent structure (the syntactic tree of a sentence has properties that musical phrase structure does not fully replicate)
The music-language relationship may be best understood through the lens of evolution: the two systems may share a common evolutionary ancestor (a domain-general sequential learning and sequencing system) that was subsequently elaborated in different directions for different communicative functions.
30.12 The Evolution of Music
Why does music exist at all? From an evolutionary perspective, the question requires explaining not just why music is pleasurable — many things are pleasurable without being adaptive — but why music-making capacity emerged in human evolutionary history and why it is maintained across all human cultures.
Four major hypotheses have been advanced, each capturing a different dimension of music's function:
Sexual Selection (Darwin's Hypothesis)
Darwin proposed, in The Descent of Man, that music evolved through sexual selection — that musical ability was a signal of mate quality, and that individuals with more impressive vocal performances had reproductive advantages. On this view, music is analogous to birdsong: a costly, elaborate signal whose production demonstrates the quality of the signaler.
Evidence: musical skill is sexually attractive; composers and performers often have higher reproductive success than non-musical peers; musical performance has many features of a costly signal (it is demanding to produce and easy for conspecifics to evaluate).
Counter-evidence: music is overwhelmingly cooperative in its most common forms; groups make music together far more often than individuals compete for mates through musical performance. A purely competitive signal account does not explain collective music-making.
Social Cohesion (The "Grooming Replacement" Hypothesis)
Robin Dunbar has argued that music — particularly collective vocal music — evolved as a replacement for social grooming in large human groups. Primate social cohesion is maintained through physical grooming (one individual removing parasites from another), which activates the opioid system and produces social bonding. But grooming can only be performed one-on-one, putting a ceiling on group size. Collective vocal music — singing, chanting, drumming together — also activates the opioid system (endorphin release through rhythmic movement and vocal production) and can be performed simultaneously by many individuals. It thus scales the bonding mechanism to large groups.
Evidence: group music-making elevates pain thresholds (an opioid measure), increases positive social feelings and cooperation, and is universally associated with social contexts (feasts, rituals, work, warfare). The cross-cultural use of music in religious and social bonding contexts is consistent with this hypothesis.
Parent-Infant Bonding
The Mehr group and others have documented that lullabies are the most universally identifiable cross-cultural musical form, suggesting that the parent-infant musical bond may be evolutionarily primary. Infant-directed speech (the exaggerated prosody, high pitch, and slow rate of "motherese") shares acoustic features with music, and infants respond differentially to music-like vocalizations from the earliest days of life.
On this view, music may have evolved as an extension of the parent-infant communicative channel — a way of maintaining proximity, communicating safety, and regulating infant arousal states at a distance. The evolution would then have been driven by selection on infant survival mediated by parental vocal investment.
Emotional Regulation
Daniel Levitin and others have proposed that music's primary evolutionary function is emotional regulation — modulating arousal, mood, and affective state. On this view, music is adaptive because individuals who can regulate their emotional states more effectively (through self-soothing vocal production, through music-induced arousal for activities requiring high energy, through music-induced relaxation for rest and recovery) have survival and reproductive advantages.
🔵 Try It Yourself: Music as Emotional Regulation
Over the next 48 hours, keep a simple log of every time you use music deliberately to change how you feel. Note: what you were feeling, what music you chose, and how you felt afterward. At the end, examine your log: How many instances involved increasing energy or arousal? How many involved soothing or calming? How many involved maintaining a mood? This simple self-study will give you a personal dataset on music's function in emotional regulation — the evolutionary function that many researchers consider most primary.
🧪 Thought Experiment: A World Without Music
Imagine a human population that somehow lacked the capacity for music — no singing, no drumming, no melodic vocalizations, no rhythm. What social functions would need to be performed by some other means? What developmental functions (particularly in parent-infant interaction) would be at risk? What consequences for social cohesion and group size would you predict? This thought experiment helps clarify which of music's functions are truly primary and which are secondary elaborations.
30.13 Is Music a Cultural Universal? — The Synthesis Argument
Drawing together the evidence from the preceding sections, the answer to "Is music a universal?" is: yes, in a limited and precisely defined sense that is more interesting and more defensible than either the naive Longfellow universalism or the radical Nettl relativism.
The case for universality: - Music — organized, intentional sound-making that is appreciated by an audience and functions to coordinate social emotion and behavior — exists in every documented human culture without exception. - Certain acoustic features of music (organized pitch and rhythm together, scales of 5–7 notes, preference for octave equivalence and small-integer-ratio intervals, isochronous pulse as a rhythmic reference) are near-universal, with physical and biological explanations. - Certain functional categories of music (music for infant soothing, music for coordinated group movement, music for ritual) are cross-culturally ubiquitous. - Naive listeners can identify the behavioral function of music from cultures entirely unknown to them, at rates substantially above chance — suggesting that some acoustic-functional relationships are cross-culturally shared.
The case for cultural specificity: - The specific implementation of musical structure — the scales, tunings, meters, timbres, formal structures, and social practices — varies dramatically across cultures in ways that are not biologically determined. - Consonance itself is not absolute; it depends on the spectral properties of available instruments, and different cultures have built different but equally coherent consonance systems. - The social functions, symbolic meanings, and cosmological frameworks within which music operates are entirely culturally constructed. - Naive listeners' ability to identify musical function breaks down at finer levels of detail — they can distinguish lullabies from dance songs but cannot identify the specific emotional or spiritual purposes of music within traditions they do not know.
30.14 Theme 2 Final Answer: The Physics Constrains, Culture Constructs Within Constraints
This is the thesis that the chapter has been building toward, and it is the final answer to the central question of Theme 2:
Universal structures in music arise from physics and biology — from the mathematics of overtone series, the limits of working memory, the neurological roots of pulse and rhythm in motor systems, and the evolutionary history of human social communication. These universal structures define a space of possibilities.
Cultural specificity in music arises from the choices made within that space — which intervals to use, how to organize rhythm, what instruments to build, what social functions music serves, what symbolic meanings it carries, what emotional associations it cultivates. These choices are not arbitrary; they are historically determined, socially embedded, and collectively maintained. But they are genuinely choices — free within the physical constraints, without a single "correct" answer.
The gamelan case is the clearest illustration: physics requires that consonant intervals minimize beating between partials. But which intervals minimize beating depends on what partials are present — and which partials are present depends on what instruments you build. Western music built string and wind instruments, discovered harmonic spectra, and constructed a tonal system around small-integer-ratio consonances. Bali built bronze percussion, discovered inharmonic spectra, and constructed a tonal system around the consonances of those spectra. Both are physically principled. Neither is more "natural."
The Indian raga system uses 5–7 notes per octave (a near-universal feature, for cognitive reasons) but constructs an elaborate set of melodic personalities within that space that encode cosmological time, emotional state, and aesthetic character — construction entirely beyond anything physics determines. West African polyrhythm uses isochronous pulse as a reference (a near-universal) but organizes rhythm through interlocking, asymmetric timeline patterns rather than hierarchical metric groups — a cultural choice with its own sophisticated aesthetic and social logic.
💡 Key Insight: This Is How to Think About "Nature vs. Nurture" in Music
The "physics constrains, culture constructs" principle is the correct framework for thinking about any "nature vs. nurture" question in music. Nature (physics, biology) constrains: it rules out certain intervals as acoustically unworkable, certain rhythmic patterns as cognitively inaccessible, certain pitch ranges as outside vocal capacity. But within the large remaining space of physical possibility, nurture (culture, history, social function) does the construction. Neither side "wins"; both are essential, and the interesting work is in specifying exactly where the constraints lie and exactly what choices are made within them.
30.15 Part VI Synthesis: The Physics of Perception — What We Have Learned Across Chapters 26–30
Part VI — "The Physics of Perception & Emotion" — has traced a path from the physical mechanisms of hearing (Chapters 26–27) through the psychological dynamics of expectation and emotion (Chapter 28) and the neuroscience of musical memory and ability (Chapter 29) to the cross-cultural synthesis of Chapter 30.
The central thread running through all five chapters is the interaction between universal mechanism and culturally shaped experience:
Chapter 26 showed that the cochlea performs a Fourier analysis that is universal across all human ears — but that the categorical experience of pitch is shaped by musical exposure and training.
Chapter 27 showed that the physics of resonance and vocal tract acoustics is universal — but that the specific timbres considered aesthetically valuable or expressive are culturally trained.
Chapter 28 showed that the dopaminergic reward system responds to musical expectation-and-resolution in ways that appear universal — but that what counts as "expected" or "surprising" is shaped by the specific tonal schemas of one's musical culture.
Chapter 29 showed that the neural architecture for pitch categorization is present in all human brains and subject to a universal critical period — but that whether that architecture gets calibrated to pitch labels depends on linguistic and musical cultural environment.
Chapter 30 has shown that physics defines the possibility space for musical structure, and culture constructs within it — producing the astonishing diversity of human music while explaining the deep structural regularities that underlie that diversity.
Together, these five chapters build a portrait of musical perception as a joint achievement of the universal (physics, evolution, neurobiology) and the particular (culture, language, history, individual experience). Neither dimension is dispensable. Neither fully explains what music is, what it does, or why it matters.
30.16 Summary and Bridge to Part VII
This chapter has examined music across cultures as a test case for the fundamental question of what is universal versus what is culturally specific in human music-making. The answer is a principled account grounded in physics and evolutionary biology: universal features of music arise from acoustic physics (the overtone series, the mathematics of consonance, the physics of rhythm) and from human biology (auditory anatomy, motor systems, cognitive working memory limits, the evolutionary history of social vocalization). Culturally variable features arise from the choices made within the space that physics and biology define — choices about scales, instruments, social function, formal structure, and aesthetic value.
The Mehr et al. (2019) study provides the most rigorous empirical evidence to date for musical universals — and for the limits of those universals. The Spotify Spectral Dataset provides a contemporary data window into the cross-genre acoustic variation that illustrates cultural construction. Indian classical music, West African drumming, gamelan, maqam, and indigenous traditions around the world demonstrate that the space within physics's constraints is vast, and human cultures have explored it with extraordinary creativity.
Bridge to Part VII: Part VI has focused on perception — how music is heard, processed, remembered, and felt by individual listeners and musicians. Part VII, "Music in Society and Technology," shifts the focus outward to the social, historical, and technological dimensions of music: how music institutions form and change, how music has been transformed by recording technology and now by artificial intelligence, how the economic structures of the music industry shape what music gets made, and what the future of music might look like. The physical principles do not change in Part VII — but their social and technological mediations take center stage.
⚖️ Debate/Discussion: Is Calling Something a "Music Universal" Just Imposing Western Categories on Diverse Practices?
The question of whether "music universals" research is genuine cross-cultural science or a disguised projection of Western categories is genuine and deserves serious engagement.
Position A — The skeptical view: The very concept of "music" is a Western analytical category. When Mehr et al. select recordings from 60 cultures and analyze them for "musical" features, they are already presupposing a definition of music that may not be shared by all the cultures sampled. Selecting "lullabies," "dance songs," "healing songs," and "love songs" as the four cross-cultural categories imposes a functional taxonomy that may carve the sonic world differently than any given culture's own taxonomy. The features found to be "universal" — tempo, melodic contour — are exactly the features that Western acoustic analysis is set up to find. The features that might differ most fundamentally — the cosmological meaning of specific pitches, the social relationships enacted through performance — are the features that don't show up in acoustic analysis.
Position B — The empiricist view: The cross-cultural consistency found in the Mehr et al. study was demonstrated not by Western music theorists analyzing recordings but by culturally naive listeners from 30 countries — including non-Western countries — identifying musical functions above chance. The test itself was designed to minimize the imposition of Western categories by using acoustic identification by diverse listeners rather than theoretical analysis. If listeners from 30 countries (including many from non-Western, non-WEIRD populations) can identify lullabies from 60 cultures without training, something real is being captured.
Position C — The pluralist view: The question of whether "music universals" research imposes Western categories is itself a question that must be answered empirically — by looking carefully at whether the categories used in the research correspond to categories that members of the sampled cultures would recognize and endorse, and by doing research in genuine collaboration with the cultures being studied, using their own analytical frameworks alongside Western acoustic analysis.
Discussion questions: 1. What would it mean for a musical universal to be "real" rather than a projected Western category? What kind of evidence would satisfy you? 2. The Mehr et al. study used "naive listeners" as judges — people who were not experts in the sampled cultures. Is this methodologically sound, or does it introduce a different kind of bias? 3. The gamelan case study (Case Study 30-1) shows that consonance is relative to instrument spectrum. Does this support the skeptical position (universals are relative) or the empiricist position (physics constrains, revealing real structure)?
✅ Key Takeaways
- Ethnomusicology's caution about musical universals is methodologically warranted but can overreach into a cultural relativism that the cross-cultural data does not support.
- The Mehr et al. (2019) study provides the most rigorous evidence to date for cross-cultural regularities in music: naive listeners from 30 countries can identify the behavioral function of music from 60 societies at rates substantially above chance.
- Physics explains the near-universals of music: octave equivalence and small-integer-ratio preferences arise from the mathematics of the harmonic series; the 5–7 note scale is a cognitive working-memory constraint; isochronous pulse is grounded in human motor biology.
- Cultural construction within these physical constraints explains the vast diversity of musical systems: different scales, tunings, rhythmic organizations, timbres, and social functions are equally valid, physically principled solutions to the problem of organizing sound.
- Indian raga, West African polyrhythm, gamelan, maqam, and indigenous musics from Australia to Siberia all demonstrate that the space within physics's constraints is vast — and that human cultures have explored it with astonishing creativity.
- Gamelan music specifically demonstrates that consonance is relative to instrument spectrum, not an acoustic absolute — the most direct proof that physical constraints allow multiple, equally coherent musical systems.
- The Spotify Spectral Dataset shows that cross-genre consistency clusters in the features corresponding to universal physical constraints (vocal frequency range for melody; broad temporal regularity), while the most variable features correspond to culturally constructed dimensions of musical sound.
- The correct framework for the "music universals" debate is: physics constrains the possibility space; culture constructs within that space. Both dimensions are essential; neither is dispensable.