Chapter 7 Further Reading: Timbre, Waveforms & Fourier's Revelation
Primary Sources and Mathematical Foundations
Fourier, Joseph. The Analytical Theory of Heat (1822; translated by Alexander Freeman, Cambridge University Press, 1878) The original source. Fourier's introduction, where he presents his decomposition theorem for the first time, is remarkably readable even by modern standards. The mathematical chapters are more demanding, but the conceptual introduction repays careful reading.
Cooley, James W., and John W. Tukey. "An Algorithm for the Machine Calculation of Complex Fourier Series." Mathematics of Computation 19:90 (1965): 297–301. The original FFT paper. Remarkably concise and historically significant. Available online through JSTOR. Reading the original 4-page paper gives a sense of how transformative a small mathematical insight can be.
Acoustics and Psychoacoustics
Roederer, Juan G. The Physics and Psychophysics of Music: An Introduction (4th ed., Springer, 2008) An essential reference for this chapter. Chapters on timbre, psychoacoustics of consonance, and the auditory system are directly relevant. The treatment of combination tones and critical bands is particularly clear.
Sethares, William A. Tuning, Timbre, Spectrum, Scale (2nd ed., Springer, 2005) An unconventional and fascinating book that explores how the relationship between harmonic spectra and consonance can be generalized: what would music sound like for beings who heard in different frequency ranges, or instruments with inharmonic spectra? Includes analysis of non-Western instruments and electronic music.
Bregman, Albert S. Auditory Scene Analysis: The Perceptual Organization of Sound (MIT Press, 1990) A landmark work in auditory cognitive science. Explains how the auditory system separates multiple simultaneous sound sources — directly relevant to understanding why choral music with many voices sounds coherent rather than a random noise cloud.
Zwicker, Eberhard, and Hugo Fastl. Psychoacoustics: Facts and Models (3rd ed., Springer, 2007) A comprehensive technical reference on psychoacoustics, including detailed coverage of masking, critical bands, spectral pitch, and timbre. More technical than Roederer but more comprehensive.
Digital Signal Processing and the FFT
Oppenheim, Alan V., and Ronald W. Schafer. Discrete-Time Signal Processing (3rd ed., Prentice Hall, 2009) The standard graduate textbook on digital signal processing. Chapters on the DFT and FFT are authoritative. More mathematical than this textbook assumes, but an essential reference for readers who want to go deeper into the algorithms.
Smith, Julius O. Mathematics of the Discrete Fourier Transform (DFT) with Audio Applications (W3K Publishing, 2nd ed., 2007) Freely available online at ccrma.stanford.edu. A pedagogically excellent treatment of the FFT specifically oriented toward audio applications. More accessible than Oppenheim and Schafer.
Müller, Meinard. Fundamentals of Music Processing: Audio, Analysis, Algorithms, Applications (Springer, 2015) An excellent graduate-level text covering music information retrieval, including spectral analysis, the CQT, chord recognition, and beat tracking. The treatment of the CQT and its advantages over FFT for music is particularly relevant to section 7.12.
Specific Topics
Wishart, Trevor. On Sonic Art (Routledge, revised ed., 1996) A theoretical framework for electronic and electroacoustic music, exploring how spectral composition differs from traditional note-based composition. Discusses the aesthetics of timbral composition using Fourier-based thinking.
Grey, John M. "Multidimensional Perceptual Scaling of Musical Timbres." Journal of the Acoustical Society of America 61:5 (1977): 1270–1277. A classic study of how listeners perceive timbral similarity and difference between instruments, showing that timbre is a multi-dimensional perceptual space rather than a single continuum.
McAdams, Stephen. "Spectral Fusion and the Creation of Auditory Images." In Music, Mind and Brain, ed. M. Clynes (Plenum Press, 1982) An important paper on how spectral components fuse into single perceived tones — directly relevant to understanding why a harmonic series is heard as one note rather than many.
Hermansky, Hynek. "Perceptual Linear Predictive (PLP) Analysis of Speech." Journal of the Acoustical Society of America 87:4 (1990): 1738–1752. On the specific spectral features used in modern speech recognition systems, including the relationship between human critical band hearing and spectral representation. Technical but relevant to section 7.9.
On Electronic Music and the Theremin
Glinsky, Albert. Theremin: Ether Music and Espionage (University of Illinois Press, 2000) The definitive biography of Lev Theremin and the instrument that bears his name. Covers the physics, the music, and the extraordinary life story of the inventor.
Holmes, Thom. Electronic and Experimental Music: Technology, Music, and Culture (5th ed., Routledge, 2016) A comprehensive history of electronic music that includes substantial coverage of how early electronic composers grappled with the aesthetic challenges of purely electronic timbres, including the problem of pure sine waves.
On Forensic Voice Analysis
National Research Council. Strengthening Forensic Science in the United States: A Path Forward (National Academies Press, 2009) The comprehensive NRC report that critically evaluated forensic science disciplines in the United States, including a detailed and skeptical assessment of forensic voice analysis. Available free online from the National Academies.
Nolan, Francis. The Phonetic Bases of Speaker Recognition (Cambridge University Press, 1983) A systematic examination of what acoustic features of the voice are stable (and therefore potentially useful for speaker recognition) and which are variable. Foundational for understanding the science behind forensic voice analysis.
Online Resources
Sonic Visualiser — https://sonicvisualiser.org Free, open-source software for analyzing audio files. Produces spectrograms and allows measurement of formant frequencies, fundamental frequency contours, and spectral features. Essential practical tool for exploring the concepts in this chapter.
Praat: Doing Phonetics by Computer — https://www.fon.hum.uva.nl/praat/ The standard tool used by linguists and speech scientists worldwide for acoustic phonetic analysis. Free to download. Its spectrogram and formant tracking capabilities are directly relevant to Case Study 7.1.
Stanford CCRMA: Center for Computer Research in Music and Acoustics — https://ccrma.stanford.edu A major research center at the intersection of music and technology. The CCRMA website includes free educational resources, software, and publications on digital signal processing and computer music.
Visual Music — National Museum of American History (Smithsonian) — Online exhibition covering the history of instruments that make music visually visible, including the spectrograph.