Chapter 28 Exercises: Why Minor Sounds Sad — Cultural, Cognitive, and Physical Explanations
Part A: Physical and Acoustic Explanations
A1. The harmonic series is central to the physical explanation of consonance and dissonance. (a) Write out the first eight harmonics of a fundamental tone at 100 Hz. (b) Identify which intervals (expressed as frequency ratios) correspond to which positions in the series (e.g., harmonics 2:1 = octave, etc.). (c) The major third corresponds to the ratio 5:4 (harmonics 4 and 5); the minor third to 6:5 (harmonics 5 and 6). Calculate the frequencies of a just-intonation major third and minor third above a 264 Hz (C4) fundamental. (d) Now calculate the equal-tempered frequencies of the major and minor third above C4 (using the formula f = 264 × 2^(n/12)). How far off, in cents, is each equal-tempered interval from its just-intonation equivalent? (e) Based on your calculations, how large is the roughness difference between the equal-tempered major and minor third? Does this difference seem large enough to explain the strong emotional difference listeners report?
A2. Acoustic roughness is caused by amplitude modulations (beating) when two frequencies are close enough to interact in the auditory system. (a) The "critical bandwidth" of the auditory system — the frequency range within which two tones will produce beating and roughness — is approximately 10–15% of the center frequency and is never less than approximately 100 Hz. For a C4 fundamental (264 Hz), calculate the critical bandwidth. (b) A C minor chord contains C-E♭-G (264, 316.8, 396 Hz in just intonation). List all pairs of frequencies — including harmonics — that fall within each other's critical bandwidth and would therefore produce roughness. (c) Repeat for a C major chord (C-E-G: 264, 330, 396 Hz). Is there a measurable roughness difference? (d) What would the roughness difference be between major and minor if the fundamental were very low (e.g., 50 Hz) vs. very high (e.g., 2000 Hz)? What does this predict about the emotional salience of major/minor at extreme registers?
A3. Helmholtz's roughness theory predicts that maximum consonance should equal maximum pleasantness, and maximum dissonance should equal maximum unpleasantness. (a) Is this prediction consistently supported by psychological research? Identify at least one finding that complicates a simple roughness = unpleasantness mapping. (b) The tritone (diminished fifth, e.g., C to F#) is acoustically rougher than the minor third, yet in context (within a dominant seventh chord) it is experienced as "tense" rather than "unpleasant." How does context change the perception of roughness as an emotional signal? (c) Terhardt's "virtual pitch" theory and Parncutt's "tonal fusion" theory offer alternatives to Helmholtz's roughness account of consonance. Without looking these up, propose what such an alternative theory might focus on, based on what you know about pitch perception from Chapter 26. (d) If consonance perception were entirely explained by roughness, what would the theory predict about music played by bells (which have inharmonic overtone series)? Does this prediction hold?
A4. The major chord in just intonation corresponds to harmonics 4:5:6, a fact sometimes cited as evidence for its "natural" status. (a) Rameau (1722) argued that the major chord was "given by nature" through the harmonic series, and that this explained its privileged status in Western harmony. Schoenberg (1911) disputed this claim. What specific objections could be raised against Rameau's argument? (b) The minor chord in just intonation requires the ratios 10:12:15. Is there any sense in which these are "naturally" related, even if less simply than 4:5:6? (c) Some theorists have proposed that the minor chord can be derived from the harmonic series "from above" — from an undertone series rather than an overtone series. Explain this idea. Is it acoustically valid? (d) Even if the major chord's relationship to the harmonic series were granted as acoustically special, does this prove that the major chord should be perceived as pleasant? What additional psychological assumption is required?
A5. Equal temperament was adopted in Western music largely during the 17th–18th centuries. (a) Explain why equal temperament was adopted despite the acoustic impurities it introduces relative to just intonation. What practical musical problem does it solve? (b) Keyboard instruments tuned to mean-tone temperament (common before equal temperament) had some intervals that were considerably more pure and others (the "wolf" intervals) that were considerably more dissonant than their equal-tempered equivalents. How would this affect the emotional landscape of music written for mean-tone instruments? (c) Some contemporary composers and ensembles have returned to just intonation or other "alternative" tuning systems. If the physical roughness account of major/minor emotion is partially valid, would just-intonation performance strengthen the emotional major/minor distinction? Design an experiment to test this. (d) Barbershop quartet singing naturally drifts toward just intonation, especially for major chords. Singers and listeners report that these "locked" just-intonation chords have a distinctive "ringing" quality. What physical phenomenon (discussed in this chapter and in Chapter 26) produces this ringing, and how does it relate to the consonance account?
Part B: Cultural and Historical Explanations
B1. The cultural-historical account of the minor-sad association proposes that it was constructed through centuries of Western compositional practice. (a) Identify two specific historical periods or compositional developments that were particularly important in establishing the minor-sad convention in Western music. (b) Medieval Gregorian chant uses a modal system that does not clearly distinguish major and minor. Research one Gregorian chant mode (e.g., Dorian or Phrygian) and identify its structural relationship to what would later be called the minor scale. Was this mode used for sad liturgical texts, happy ones, or both? (c) In Renaissance polyphony (c. 1450–1600), the "rule of the octave" and the "clausula vera" (voice-leading conventions that became the dominant-tonic cadence) were being established. How might the development of these conventions have reinforced the minor-sad association? (d) By the Romantic period (c. 1820–1900), the minor-sad convention was fully established and extensively exploited. Identify one Romantic-period piece that uses the minor-happy or minor-triumphant association in a way that deliberately subverts the convention. What is the emotional effect of this subversion?
B2. Evaluate the claim: "The minor-sad association is merely a cultural convention and has no basis in the physics of sound." (a) What evidence would fully support this claim? (b) What evidence is inconsistent with this claim? (c) Is the claim falsifiable? What would count as evidence against it? (d) What is the strongest version of the physicalist's response to the claim? (e) Write a two-paragraph synthesis that fairly represents both the evidence for cultural construction and the evidence for physical grounding.
B3. The Western major/minor system consolidated during the Baroque period, eliminating the variety of modes used in Renaissance and medieval music. (a) What social, technological, and musical factors contributed to this consolidation? (b) If the Renaissance modal system had survived into the modern era, would we expect the same strong binary (major = happy, minor = sad) emotional coding? Or would emotional associations be more diffuse across many modes? (c) Jazz and blues music maintained some modal diversity through the 20th century (particularly the Dorian and Mixolydian modes). How does the emotional coding of these modes in jazz compare to their theoretical characterizations in music theory textbooks? (d) The development of recording technology in the 20th century dramatically expanded the range of music any individual listener was exposed to. How might global music exposure affect the cultural construction of mode-emotion associations in the 21st century?
B4. The minor-sad association is sometimes described as an "arbitrary" cultural convention. (a) What does it mean to call something "arbitrary" in cultural-historical analysis? Is the minor-sad association more or less arbitrary than, say, the cultural convention that red means stop and green means go? (b) Arbitrary conventions can be extremely robust — driving on the left or right side of the road, for example, is arbitrary but inflexible. Does the robustness of the minor-sad association in Western populations tell us anything about whether it is arbitrary? (c) Non-arbitrary conventions are those grounded in some feature of the world (e.g., the sun rising in the east). Is there any sense in which the minor-sad association is "non-arbitrary" — grounded in some feature of music, emotion, or human biology? (d) If the minor-sad association is in part arbitrary, what are the implications for music education — should we teach it as a rule or as a convention?
B5. Ethnomusicological research studies musical traditions across cultures. (a) Identify two musical traditions outside Western Europe in which the "minor-sad" association is demonstrably absent or reversed, and briefly describe the cultural context of minor-mode music in each tradition. (b) What methodological challenges arise when trying to measure emotional associations in cross-cultural music research? (c) The "Pygmy effect" — a tendency for Western researchers to project Western emotional categories onto non-Western music — is a documented bias in ethnomusicological fieldwork. How can researchers mitigate this bias when studying cross-cultural emotional associations in music? (d) When a researcher from Bangladesh hears Western minor-key music for the first time, what would you predict about their emotional response, based on the evidence in this chapter? What would you need to know about their musical background to make a more precise prediction?
Part C: Developmental and Cognitive Explanations
C1. The developmental trajectory of the minor-sad association provides evidence about whether it is innate or learned. (a) Summarize what is known about infants' (under 12 months) responses to major vs. minor music. What specific limitations of the infant research should be noted? (b) At approximately what age do Western children reliably associate minor-key music with "sad" and major-key music with "happy"? (c) Propose a longitudinal study that could trace the development of the minor-sad association from birth through age 10, including measures at multiple developmental stages. What controls would be necessary? (d) Japanese children show a somewhat weaker major-happy/minor-sad association than American children of comparable age, despite both groups being exposed to Western-style equal-tempered music. What does this finding tell us about the relative contributions of innate and cultural factors?
C2. The speech prosody hypothesis proposes that the minor third's emotional association derives from its acoustic similarity to the prosody of sad or distressed vocalizations. (a) What acoustic features of sad speech prosody would be expected to resemble the minor third? Be specific about frequency relationships. (b) The "calling pattern" (the "sol-mi" minor third used by children calling to parents) is sometimes cited as evidence for the prosody hypothesis. Critically evaluate this specific piece of evidence — what does it support, and what does it not support? (c) Design an experiment that could test whether the minor-sad association is mediated by the prosodic similarity hypothesis. What would you need to measure, and what results would support vs. disconfirm the hypothesis? (d) The prosody hypothesis would predict that tone-language speakers (e.g., Mandarin Chinese, Yoruba speakers) might show a weaker music-prosody transfer, because pitch in their language carries lexical rather than primarily emotional meaning. Is this prediction supported by available evidence? If you don't know, discuss what the prediction would imply if true.
C3. Cross-cultural studies of the minor-sad association have produced mixed results. Evaluate the following studies critically: (a) The Mafa study (Scheef et al., 2010) found that Cameroonian Mafa farmers with no prior Western music exposure rated Western major-key excerpts as more positive than minor-key excerpts, though with a smaller effect than Western participants. What specific conclusions can and cannot be drawn from this finding? (b) What acoustic features of the stimuli in cross-cultural studies might produce positive valence ratings independently of mode (major/minor)? (c) Design an improved cross-cultural study that controls for the confounds you identified in (b). What would you use as stimuli, and how would you control for non-modal acoustic features? (d) If a future well-controlled study found zero difference in the emotional ratings of major and minor music in a population with no Western music exposure, what would this prove about the physical grounding of the minor-sad association?
C4. Implicit vs. explicit processing can be dissociated in music cognition research. (a) Define the distinction between implicit and explicit processing in the context of music emotion research. (b) Is it possible to have an implicit emotional response to minor-key music (e.g., increased galvanic skin response, or faster reaction times to negative-valence words) while explicitly reporting no emotional response? What would this demonstrate? (c) Children below the age of reliable major-sad/minor-sad explicit association might still show implicit associations. Describe a paradigm that could measure implicit mode-emotion associations in preverbal or early-verbal children. (d) As adults age and accumulate more cultural music exposure, would you expect the minor-sad association to strengthen or weaken? Describe what type of study could test this prediction.
C5. Absolute pitch (AP) — the ability to name a note's pitch without an external reference — is present in approximately 1 in 10,000 adults in the general population but in approximately 1 in 1,500 among musicians who began training before age 6. (a) What does the rarity of absolute pitch tell us about the "naturalness" of musical pitch categories in the auditory system? (b) Individuals with absolute pitch perceive equal-tempered minor thirds as slightly "sharp" relative to just-intonation minor thirds. Does this more acute pitch sensitivity give AP possessors a stronger or weaker minor-sad association? Explain your prediction. (c) Perfect pitch is substantially more common in speakers of tone languages (Mandarin, Cantonese, Vietnamese). What does this imply about the relationship between linguistic and musical pitch processing, and about the role of early learning in establishing absolute pitch? (d) If absolute pitch is acquired through early learning rather than innate specification, what does this tell us about the broader question of innate vs. learned musical pitch associations?
Part D: Cross-Cultural and World Music Analysis
D1. Indian classical music uses the concept of "raga" to organize pitch and emotional content. (a) A raga is more than just a scale — it specifies ascending patterns, descending patterns, characteristic ornaments, and a specific "rasa" (emotional flavor). How does this rasa system compare to the Western mode-emotion association? Is it more specific or less specific? (b) Select one specific raga (e.g., Bhairav, Yaman, Bhairavi, Desh) and describe its interval structure. Does it contain any intervals that would be classified as "minor" in Western terminology? What rasa or emotional character is associated with it in Indian classical tradition? (c) The raga Bhairavi uses the "minor" second, third, sixth, and seventh — making it one of the most "minor-heavy" ragas in terms of Western interval classification. What emotional associations does Bhairavi carry in the Indian classical tradition? Do these align with the Western minor-sad association? (d) What methodological steps would be necessary to study whether Indian classical listeners associate Bhairavi with "sadness" in the same way Western listeners associate minor key music with sadness?
D2. Flamenco music (discussed in the case study) uses the Phrygian mode extensively in festive contexts. (a) What are the characteristic intervals of the Phrygian mode, and how do they relate to the Western minor scale? (b) The "Phrygian cadence" (a descent from the tonic to the seventh — e.g., Am to G to F to E in A Phrygian) is characteristic of flamenco and creates a distinctive sense of drive and energy, not sadness. Explain how the Phrygian cadence works harmonically and why it might feel energizing rather than sad in the flamenco context. (c) A Spanish listener trained in flamenco hears the Phrygian mode as "festive." A Western classical listener hears the same scale as "dark" or "exotic." Describe the process by which each listener's brain arrives at a different emotional response to the same acoustic stimulus. (d) If flamenco-trained Spanish listeners and classical-trained Western listeners were both tested in a cross-cultural music perception study, how might the inclusion of both groups affect the statistical results?
D3. Klezmer music uses the Freygish mode (Phrygian dominant scale) extensively. (a) The Freygish mode is built on the fifth degree of the harmonic minor scale. Write out the Freygish mode starting on D and identify its characteristic interval (the augmented second between scale degrees 2 and 3). (b) This augmented second — wider than a whole tone, narrower than a minor third — is used in several Middle Eastern and Eastern European musical traditions. What is the typical emotional association of the augmented second in Klezmer, in Arabic maqam, and in the Western classical tradition? (c) Fast, ornamented Klezmer dance music using the Freygish mode is associated with joy, celebration, and communal vitality. What specific musical features (beyond mode) make Klezmer sound celebratory despite its "minor" intervals? (d) The Freygish mode is built on a minor triad (D-F-A in D Freygish). A Western listener who has not encountered Klezmer would likely perceive the underlying harmony as "minor" and potentially "sad." Does the fact that context and style can completely override this acoustic signal support the cultural or the physical explanation?
D4. Research on the cross-cultural universals in music has identified some features that appear more consistently universal than others. (a) Based on the evidence in Chapters 26–28 and your broader knowledge, rank the following musical features from most cross-culturally universal to most culturally specific in their emotional associations: (i) fast tempo = high arousal, (ii) minor third = sadness, (iii) loud dynamics = high arousal, (iv) descending melodic contour = negative valence, (v) complex polyrhythm = high arousal. Justify your ranking. (b) The features you ranked as most universal tend to be those that map most directly onto acoustic properties of emotional vocalizations (prosody). Why would vocal prosody be a reliable cross-cultural anchor for music-emotion associations? (c) Samuel Mehr and colleagues (2019) analyzed 86 recordings from 60 societies and found that listeners from Western and non-Western cultures could reliably identify the context of use (dance, healing, love, lullaby) for unfamiliar music from other cultures. What does this finding imply about the universal vs. cultural nature of musical meaning? (d) Is the cross-cultural recognition of music's context of use the same thing as cross-cultural recognition of music's emotional valence? Why or why not?
D5. Comparative analysis: Choose three musical traditions from three different world regions and compare how each tradition uses scale/mode/interval structure to convey emotional meaning. For each tradition: (a) describe the fundamental organizing principle of pitch (scale, mode, raga, maqam, pentatonic framework, etc.); (b) describe how emotional expression is organized within that system (are there equivalents to major/minor distinction? How are they characterized?); (c) identify one specific musical example from that tradition that would challenge a Western listener's major/minor emotional associations; (d) based on your comparison, formulate a general hypothesis about the relationship between pitch structure and emotional expression in human music.
Part E: Research Design and Synthesis
E1. Critically evaluate the following research design: "We played major-key and minor-key versions of the same melody to 50 American college students and 50 participants from a remote Amazonian community with no prior Western music exposure. Both groups rated the major-key version as more positive and the minor-key version as more negative on a 7-point scale. We concluded that the major-happy/minor-sad association is universal." (a) Identify at least three specific methodological problems with this study. (b) What alternative explanation for the Amazonian participants' responses (other than a universal biological association) should be considered? (c) What control stimuli would be needed to confirm that mode (and not another acoustic feature that co-varies with mode) is driving the effect? (d) Propose a revised, better-controlled version of this study that would allow stronger conclusions.
E2. The speech prosody hypothesis, the acoustic roughness hypothesis, and the cultural learning hypothesis offer three distinct accounts of the minor-sad association. (a) For each hypothesis, identify one specific empirical prediction that could in principle falsify it. (b) Propose a single experimental paradigm that could test all three hypotheses simultaneously. What stimuli would you use, what measures would you take, and what pattern of results would support each hypothesis? (c) If all three hypotheses are partially true (and the evidence suggests they are), how would you design a study that estimates the relative contribution of each? (d) Is there a fourth hypothesis not mentioned in this chapter that might explain aspects of the minor-sad association not captured by the three discussed? Develop it briefly.
E3. The chapter argues that the minor-sad association is best understood as "acoustic physical seed, culturally amplified." (a) Construct a historical argument for how this amplification could have occurred over the period 1600–1800 in Western Europe, tracing the specific musical and cultural developments that strengthened the association. (b) Is there any modern musical development (a new genre, a new use of minor-key music in a positive context) that might be beginning to weaken the minor-sad association? (c) If the cultural amplification account is correct, would we expect future generations (exposed to more diverse world music through streaming services) to show a weaker minor-sad association than current adults? How would you design a longitudinal study to track this? (d) The "cultural amplification" account implies a feedback loop: cultural conventions shape emotional responses, which reinforce compositional conventions. Describe how this feedback loop might operate in the history of Western music.
E4. This chapter has examined the minor-sad association from multiple perspectives: physical, cultural, developmental, and prosodic. Write a 400–500 word synthesis that: (a) states the most defensible current answer to "Why does minor sound sad?" (b) fairly represents the strongest evidence for each of the four perspectives; (c) identifies the two most important remaining empirical questions; (d) explains what finding would most dramatically change your answer.
E5. Final synthesis question: "The minor-sad association is the most important single test case for the universal-vs-cultural theme in music science." (a) Do you agree with this claim? What features of the minor-sad association make it particularly illuminating as a test case? (b) What is the most important lesson from this case study for the broader question of musical universals? (c) Apply the lesson of this case study — "acoustic properties provide a seed; culture amplifies" — to one other musical emotional association (e.g., fast tempo = excitement, or high pitch = positive valence). Does the same two-component account apply? (d) What would it mean for human culture and human biology if the major/minor emotional distinction were found to be entirely culturally specific, with no physical grounding? Would this change anything about how we listen to or make music?