Chapter 1 Quiz: What Is Sound?
20 questions covering recall, application, and synthesis. Answers are hidden — attempt each question before revealing the answer.
Q1. Sound is classified as which type of wave?
a) Electromagnetic wave b) Transverse mechanical wave c) Longitudinal mechanical wave d) Gravitational wave
Answer
**c) Longitudinal mechanical wave.** Sound requires a material medium (mechanical) and the medium's oscillations are parallel to the direction of wave propagation (longitudinal). Compressions and rarefactions travel in the same direction as the energy transfer. Light is electromagnetic; water surface waves are transverse mechanical waves.Q2. A pure tone of 880 Hz is played in a room where the speed of sound is 340 m/s. What is the wavelength of this tone?
a) 0.23 m b) 0.39 m c) 2.59 m d) 5.76 m
Answer
**b) 0.39 m** Using c = fλ → λ = c/f = 340/880 ≈ 0.386 m ≈ 0.39 m. Note that 880 Hz is one octave above 440 Hz (concert A). The wavelength of 440 Hz is approximately 0.78 m, and 880 Hz has exactly half that wavelength — octaves double frequency and halve wavelength.Q3. Which of the following best describes what actually happens to air molecules when a sound wave passes through them?
a) The molecules flow from the sound source toward the listener. b) The molecules oscillate back and forth around fixed average positions. c) The molecules rotate in circular patterns. d) The molecules vibrate perpendicular to the direction of wave travel.
Answer
**b) The molecules oscillate back and forth around fixed average positions.** This is the key distinction between a wave and a flow. Sound energy propagates through the medium, but the medium itself undergoes no net displacement. Each air molecule returns to its equilibrium position after the wave passes. If molecules flowed from source to listener, you would feel a wind whenever someone nearby spoke.Q4. The speed of sound in air at 20°C is approximately 343 m/s. If the temperature drops to 0°C, what happens to the speed of sound?
a) It increases to about 380 m/s. b) It stays the same — speed of sound is independent of temperature. c) It decreases to about 331 m/s. d) It decreases to near zero.
Answer
**c) It decreases to about 331 m/s.** Using the approximation c ≈ 331 + 0.6T (where T is in Celsius): at T = 0°C, c ≈ 331 m/s. At T = 20°C, c ≈ 331 + 12 = 343 m/s. The speed of sound increases with temperature because warmer air molecules move faster and transmit disturbances more quickly. This is why cold wind instruments play flat until they warm up.Q5. What is the approximate frequency range of normal human hearing?
a) 2 Hz to 2,000 Hz b) 20 Hz to 20,000 Hz c) 200 Hz to 200,000 Hz d) 20 Hz to 200,000 Hz
Answer
**b) 20 Hz to 20,000 Hz (20 kHz)** This range spans three decades of frequency. The lower limit (20 Hz) corresponds to a wavelength of about 17 meters — the length of a large room. The upper limit (20,000 Hz) corresponds to a wavelength of about 1.7 cm. Both limits decline with age: most adults over 40 cannot hear much above 12,000–15,000 Hz.Q6. Which of the following correctly describes the function of the ossicles in the middle ear?
a) They amplify sound by vibrating at resonant frequencies. b) They filter out frequencies that are too loud to protect the cochlea. c) They convert fluid waves in the cochlea into nerve signals. d) They match the acoustic impedance of air to that of cochlear fluid, improving energy transfer.
Answer
**d) They match the acoustic impedance of air to that of cochlear fluid, improving energy transfer.** This is the impedance-matching function. Without the ossicles, a sound wave traveling through air would lose over 99.9% of its energy at the air-fluid interface (most would be reflected). The ossicle system uses area ratios (large eardrum to small oval window) and lever action to amplify pressure by a factor of 25–30, compensating for the impedance mismatch. The ossicles do not filter by loudness — that is a function of the stapedius reflex.Q7. The basilar membrane in the cochlea is narrow and stiff at the base and wide and flexible at the apex. What is the consequence of this graduated structure?
a) Low-frequency sounds cause maximum displacement at the base; high-frequency at the apex. b) High-frequency sounds cause maximum displacement at the base; low-frequency at the apex. c) All frequencies cause equal displacement throughout the membrane. d) The membrane responds only to sounds above 1,000 Hz.
Answer
**b) High-frequency sounds cause maximum displacement at the base; low-frequency sounds cause maximum displacement at the apex.** A stiffer structure has a higher natural resonant frequency. The stiff base resonates preferentially with high-frequency sounds; the flexible apex resonates with low frequencies. This tonotopic (frequency-place) organization allows the cochlea to act as a biological spectrum analyzer — similar to a Fourier transform — decomposing complex sounds into their frequency components before the signal even reaches the brain.Q8. A sound pressure level of 0 dB SPL corresponds to:
a) Absolute silence — no sound at all. b) The threshold of pain. c) The threshold of human hearing — the softest sound a healthy ear can detect. d) The level of a whispered conversation.
Answer
**c) The threshold of human hearing — the softest sound a healthy ear can detect.** 0 dB SPL is a reference level, not silence. It corresponds to a sound pressure of 20 micropascals — a pressure variation of about one ten-billionth of atmospheric pressure. The threshold of pain is around 130–140 dB SPL. True "silence" (0 pascals of pressure variation) would be expressed as negative infinity on the dB scale.Q9. A concert produces 90 dB at a distance of 10 meters from the stage. Using the inverse square law (intensity falls as 1/r²), what is the approximate sound level at 100 meters from the stage?
a) 9 dB b) 50 dB c) 70 dB d) 80 dB
Answer
**c) 70 dB** The distance increases by a factor of 10 (from 10 m to 100 m). Intensity scales as 1/r², so intensity drops by a factor of 10² = 100. In decibels: ΔL = 10 × log₁₀(100) = 10 × 2 = 20 dB. So the level drops from 90 dB to 90 - 20 = 70 dB. This illustrates the practical reality of outdoor concerts: even a 20-fold drop in perceived loudness requires a hundredfold increase in distance.Q10. Which statement about the distinction between "noise" and "music" is most accurate?
a) Noise consists of longitudinal waves; music consists of transverse waves. b) Noise is physically defined as aperiodic waveforms; music is physically defined as periodic waveforms — but this physical definition does not fully capture the cultural meaning of "music." c) Noise and music are physically identical; the distinction is entirely arbitrary. d) All music contains zero aperiodic components.
Answer
**b) Noise is physically defined as aperiodic waveforms; music is physically defined as periodic waveforms — but this physical definition does not fully capture the cultural meaning of "music."** The physical distinction is real and meaningful: periodic sounds have a discrete frequency spectrum and a perceptible pitch; aperiodic sounds have continuous spectra and no clear pitch. However, percussion instruments, noise music, and many extended instrumental techniques are plainly musical while being physically aperiodic. "Music" is a cultural category that physics can partially characterize but cannot fully determine.Q11. Why does sound travel approximately four times faster in water than in air?
a) Water molecules are much smaller than air molecules. b) Water has much higher density, which alone accounts for the speed difference. c) Water's greater elasticity (bulk modulus) more than compensates for its higher density. d) The speed of sound in water is determined by temperature alone.
Answer
**c) Water's greater elasticity (bulk modulus) more than compensates for its higher density.** The speed of sound ≈ √(bulk modulus / density). Water is much denser than air, which would slow sound down. But water is also far more elastically stiff — it resists compression much more strongly than air. This greater stiffness dominates, and the net effect is sound traveling at ~1,480 m/s in water versus ~343 m/s in air. Steel is denser still but also far stiffer, giving sound speeds around 5,120 m/s.Q12. Short Answer: Explain the Haas effect (precedence effect) and its significance for concert hall design. What is the approximate time window within which an early reflection is perceptually integrated with the direct sound?
Answer
The Haas effect (also called the precedence effect) is the phenomenon by which the human auditory system fuses sounds arriving within approximately 30–50 milliseconds of each other into a single perceived event, attributing the perceived location to the first-arriving sound. Reflections arriving within this window enhance the perceived loudness and richness of the sound without being heard as separate echoes. Reflections arriving after ~50 ms are perceived as distinct echoes. For concert hall design, this means architects try to ensure that all first reflections from walls and ceiling reach the audience within the 30-50 ms window. At 343 m/s, 30 ms corresponds to 10.3 meters of path length. A reflection from a side wall 5 meters away travels ~10 meters (to wall and back), arriving right at the integration boundary. Good hall design positions reflective surfaces so that early reflections are useful, while avoiding large, flat surfaces that could produce late echoes from distant walls.Q13. The inverse square law implies that if you double your distance from a sound source, the intensity of the sound:
a) Halves (drops by a factor of 2) b) Drops to one-quarter (factor of 4) c) Drops to one-eighth (factor of 8) d) Drops to one-sixteenth (factor of 16)
Answer
**b) Drops to one-quarter (factor of 4)** Intensity I ∝ 1/r². If r doubles (r → 2r), then I → 1/(2r)² = 1/4r² = (1/4) × (1/r²). The intensity drops to one-quarter. In decibels, this is a reduction of 10 × log₁₀(4) ≈ 6 dB — about a 6 dB drop for each doubling of distance, as long as the source behaves like a point source in a free field.Q14. Which part of the ear is responsible for the fact that humans are most sensitive to sounds around 3,000–4,000 Hz?
a) The pinna, which reflects sounds at this frequency preferentially. b) The ossicles, which resonate at 3,500 Hz. c) The ear canal, which acts as a resonant tube and amplifies frequencies around 3,500 Hz. d) The apex of the basilar membrane, which is most sensitive to this range.
Answer
**c) The ear canal, which acts as a resonant tube and amplifies frequencies around 3,500 Hz.** The ear canal is approximately 2.5 cm long. A tube closed at one end resonates at frequencies where the tube length equals one-quarter wavelength: f = c/(4L) = 343/(4 × 0.025) ≈ 3,430 Hz. This resonance amplifies sounds in the 2,000–5,000 Hz range, coinciding with the frequency range most important for understanding speech. This is why the ear's sensitivity curve peaks around 3,000–4,000 Hz and why hearing damage in this range is particularly disruptive to communication.Q15. What does "tonotopic organization" mean, and where does it appear in the auditory system?
Answer
"Tonotopic organization" (from Greek *tonos* = tone, *topos* = place) refers to the systematic mapping of sound frequencies to specific physical locations along a neural or anatomical structure. Different frequencies activate different places. It appears in the auditory system at multiple levels: 1. **Basilar membrane (cochlea):** High frequencies activate the base; low frequencies activate the apex. 2. **Auditory nerve:** Nerve fibers originating from different positions along the basilar membrane carry frequency-specific information. 3. **Auditory brainstem nuclei:** Frequency organization is preserved in the cochlear nucleus, inferior colliculus, and medial geniculate body. 4. **Auditory cortex:** Different regions of primary auditory cortex respond preferentially to different frequencies, organized in a tonotopic map. This spatial organization of frequency processing is what allows the brain to perform the perceptual equivalent of Fourier analysis — identifying which specific frequency components are present in a complex sound.Q16. A sound source emits 90 dB at 1 meter. A second identical source is placed next to the first, also producing 90 dB at 1 meter. What is the combined sound level at 1 meter?
a) 90 dB (they cancel each other) b) 93 dB c) 180 dB d) 100 dB
Answer
**b) 93 dB** When two equal incoherent sources combine, intensity doubles. ΔL = 10 × log₁₀(2) ≈ 3 dB. So 90 + 3 = 93 dB. The answer is not 180 dB (that would confuse dB values as if they were arithmetic numbers — a very common mistake). The answer is not 100 dB either. The logarithmic scale means that doubling intensity always adds exactly 3 dB regardless of the starting level. This is why adding more speakers or singers produces diminishing returns in perceived loudness.Q17. Bone conduction transmits sound to the cochlea by vibrating the skull bones. Why might your own voice sound different to you than it does on a recording?
Answer
When you speak, your vocal cords generate sound that reaches your own cochlea through TWO pathways simultaneously: 1. **Air conduction:** Sound radiates from your mouth, reflects off nearby surfaces, and enters your ear canal via the normal route. 2. **Bone conduction:** Vocal cord vibrations propagate through the skull bones directly to the cochlear fluid, bypassing the middle ear. The bone conduction pathway is particularly efficient at transmitting lower frequencies (it acts as a low-pass filter of sorts). So when you hear your own voice in real time, you hear a version enriched with more low-frequency content than the air-conducted sound alone would provide. When you hear a recording of your voice, ONLY the air-conducted signal was captured by the microphone — there is no bone conduction component. The voice sounds "thinner" and "higher" than you expect because you are accustomed to hearing your own voice with enhanced bass from bone conduction. Other people, who only ever hear your voice through air, hear what the recording captures and find it familiar.Q18. The chapter compares a choir to a particle accelerator as wave-interference systems. Which of the following is the BEST physical parallel drawn between the two?
a) Both systems use electromagnetic waves to produce music. b) In both systems, multiple slightly different wave sources interact through interference to produce organized resonance phenomena more complex than any single source alone. c) Both systems operate at exactly the same frequency range. d) Both systems were designed by physicists and musicians working together.
Answer
**b) In both systems, multiple slightly different wave sources interact through interference to produce organized resonance phenomena more complex than any single source alone.** In a choir: multiple voices at slightly different frequencies produce interference patterns (beats), resonance (formants), and emergent blend. In a particle accelerator: particles behave as quantum waves, and physicists identify particles as resonance states — energy levels where constructive interference creates stable, identifiable patterns. In both cases, the emergent organized phenomenon (choral blend; identified particle) is irreducible to any single source wave. The comparison is physical, not merely poetic.Q19. Short Answer: What is the difference between reverberation and echo, and why does the distinction matter for musical performance venues?
Answer
**Echo:** A distinct, perceptibly separate repetition of a sound, heard when a reflection arrives more than approximately 50 milliseconds after the direct sound. At 343 m/s, 50 ms corresponds to a path length difference of about 17 meters (round-trip 34 meters — i.e., a reflecting wall at least 17 meters away). Echoes in a concert hall are acoustically undesirable: they blur musical texture, make fast passages sound muddy, and in extreme cases (focusing echoes from concave surfaces) can make a single sound source appear to come from a different location. **Reverberation:** The smooth, gradually decaying persistence of sound in a space after the source stops, resulting from the superposition of many short-delayed reflections whose individual repetitions are below the threshold of distinct echo perception. Reverberation is musically desirable in most situations: it increases perceived warmth and blend, sustains tones naturally, and reduces the "dry" quality of anechoic spaces. **Musical implications:** - Symphony halls need reverberation times of 1.8–2.2 seconds for orchestral warmth. - Opera houses need shorter RT60 (~1.5 s) for vocal intelligibility. - Cathedrals (RT60 7–10 s) inspired the long sustained tones of plainchant. - Recording studios use controlled reverberation (real or digital) to add or remove perceived space.Q20. Synthesis question: In your own words, explain why it is not possible for physics alone to determine whether a given sound is "music." What does physics contribute to this question, and what does it leave open? Reference at least two concepts from this chapter.