Case Study 2: Schrödinger's Equation in History — How a Vacation Led to the Most Important Equation in Physics

Contributors to Quantum Mechanics: From Wavefunctions to Qubits

Case Study 2: Schrödinger's Equation in History — How a Vacation Led to the Most Important Equation in Physics

The Central Question

Scientific breakthroughs are often presented as logical inevitabilities — each step following cleanly from the last. The reality is messier, more human, and more interesting. This case study traces the actual path to the Schrödinger equation: the wrong turns, the competing approaches, the personal rivalries, and the deep question of whether the equation was discovered or invented.

Part I: The State of Physics in Late 1925

The Crisis Was Clear; The Solution Was Not

By the end of 1925, the landscape of atomic physics was extraordinary. Two decades of accumulating evidence — from Planck's blackbody formula (1900) to de Broglie's matter waves (1924) — had demonstrated that classical physics could not describe the atomic world. The "old quantum theory" (Bohr-Sommerfeld quantization rules) could explain the hydrogen atom but failed for helium, could not handle intensities of spectral lines, and was riddled with ad hoc rules that everyone recognized as temporary scaffolding.

The question was not whether a new mechanics was needed, but what it would look like.

Heisenberg Strikes First

In June 1925, Werner Heisenberg — 23 years old, suffering from hay fever, and recuperating on the island of Helgoland — produced the first version of the new mechanics. His approach was radical: he refused to talk about electron orbits (which cannot be observed) and instead constructed a mathematical framework based entirely on observable quantities — the frequencies and intensities of spectral lines.

Heisenberg's formulation used arrays of numbers that, as Max Born and Pascual Jordan quickly recognized, were matrices. The resulting theory — matrix mechanics — was mathematically rigorous and gave correct predictions, but it was extraordinarily abstract. The algebra of infinite-dimensional matrices was unfamiliar to most physicists, and the theory offered no intuitive physical picture of what was happening inside the atom.

Most physicists found matrix mechanics baffling. This created an opening.

Part II: Schrödinger's Path

The Man

Erwin Schrödinger was 38 in late 1925 — significantly older than the young revolutionaries (Heisenberg was 23, Dirac was 23, Pauli was 25). He was an accomplished classical physicist with broad interests in general relativity, statistical mechanics, and philosophy. He was a professor at the University of Zurich, well-established but not at the center of the quantum revolution.

Schrödinger was deeply influenced by Louis de Broglie's 1924 thesis, which proposed that particles have wave-like properties with wavelength $\lambda = h/p$. Where others saw de Broglie's proposal as a curious mathematical trick, Schrödinger took it literally: if particles are waves, there must be a wave equation governing them.

🔵 Historical Note. Schrödinger's engagement with de Broglie's ideas was catalyzed by a seminar. In November 1925, Peter Debye — a colleague at Zurich — asked Schrödinger to present de Broglie's thesis to their seminar. After the presentation, Debye reportedly remarked: "You tell us about this de Broglie wave, but where is the wave equation?" This challenge stuck with Schrödinger.

The Christmas Vacation

In December 1925, Schrödinger left Zurich for the Alpine resort of Arosa, in eastern Switzerland. He brought with him a mysterious female companion (not his wife — Schrödinger's personal life was famously unconventional) and his notebooks.

During this vacation, he derived the time-independent Schrödinger equation for the hydrogen atom and showed that it gave the correct energy levels $E_n = -13.6\,\text{eV}/n^2$.

The exact sequence of his reasoning has been debated by historians, but the key steps appear to have been:

Start with the classical energy relation. For a particle in a potential: $E = \frac{p^2}{2m} + V(r)$.
Use de Broglie's relation $p = \hbar k$ and the wave equation $\nabla^2\psi + k^2\psi = 0$ to replace $p^2$ with $-\hbar^2\nabla^2$.
Arrive at $-\frac{\hbar^2}{2m}\nabla^2\psi + V\psi = E\psi$ — the time-independent Schrödinger equation.
Apply to hydrogen ($V = -e^2/(4\pi\epsilon_0 r)$) by separating variables in spherical coordinates. The requirement that solutions be finite and single-valued forces $E_n = -13.6\,\text{eV}/n^2$.

Schrödinger later said of this period: "At the moment I was struggling with a new atomic theory. If you knew what I have found, you would know that it is very beautiful."

The First Paper (January 1926)

Schrödinger submitted "Quantization as an Eigenvalue Problem" (Quantisierung als Eigenwertproblem) to Annalen der Physik on January 27, 1926. This paper solved the hydrogen atom using wave mechanics.

The paper was a sensation. Unlike Heisenberg's abstract matrices, Schrödinger's approach used the familiar tools of partial differential equations, eigenvalue problems, and classical analysis. Physicists who had been struggling with matrix mechanics could suddenly see what was going on — the electron was described by a wave function, and its allowed energies were determined by a boundary value problem.

Three More Papers (1926)

Schrödinger followed up with three more papers in rapid succession:

Second paper (February 1926): Extended the theory to the harmonic oscillator, rigid rotor, and diatomic molecules. Showed that quantized energy levels emerge naturally as eigenvalues.
Third paper (May 1926): Developed time-dependent perturbation theory and computed transition rates for the Stark effect (hydrogen in an electric field).
Fourth paper (June 1926): Introduced the time-dependent Schrödinger equation and proved the equivalence of wave mechanics and matrix mechanics — showing that Heisenberg's matrices could be derived from Schrödinger's wave functions and vice versa.

📊 By the Numbers. The four papers were published over a span of just six months. Together they constitute approximately 80 pages of dense mathematical physics. They established the complete framework of wave mechanics, solved multiple physical problems, and unified the competing matrix mechanics approach. By any measure, this was one of the most productive half-years in the history of science.

Part III: The Equivalence and the Rivalry

Wave Mechanics vs. Matrix Mechanics

Schrödinger proved in his fourth paper that wave mechanics and matrix mechanics are mathematically equivalent — they are different representations of the same underlying theory. The proof goes roughly as follows:

In matrix mechanics, observables are represented by matrices (operators) and states by column vectors.
In wave mechanics, observables are represented by differential operators and states by wave functions.
The matrix elements of Heisenberg's matrices are $H_{mn} = \int \phi_m^*(x)\hat{H}\phi_n(x)\,dx$ — the operator matrix elements in the basis of energy eigenfunctions.

Despite this mathematical equivalence, Schrödinger and Heisenberg personally despised each other's approaches:

Schrödinger on matrix mechanics: "I was discouraged, if not repelled, by what appeared to me a rather difficult method of transcendental algebra, defying any visualization."

Heisenberg on wave mechanics: "The more I ponder about the physical part of Schrödinger's theory, the more disgusting I find it. What Schrödinger writes about the visualizability of his theory... I consider to be garbage."

The acrimony ran deep. When Heisenberg learned that Schrödinger had solved the hydrogen atom with wave mechanics, he wrote to Pauli: "The more I think about the physical portion of the Schrödinger theory, the more repulsive I find it." Schrödinger, for his part, said he was "repelled" by matrix mechanics and its lack of physical picture.

The Dirac Synthesis

Paul Dirac, characteristically, cut through the rivalry. In his Principles of Quantum Mechanics (1930), Dirac developed a formulation — the abstract Hilbert space/bra-ket formulation — that encompassed both wave mechanics and matrix mechanics as special cases. In Dirac's framework, $\psi(x)$ and the matrix elements $c_n$ are just two different representations of the same abstract state vector $|\psi\rangle$.

🔗 Connection. We will develop Dirac's formulation in Chapter 8, which serves as the bridge between the wave mechanics of Parts I-II and the more abstract operator algebra of Parts III onward. The equivalence Schrödinger proved, and the synthesis Dirac provided, is one of the most beautiful structural results in all of physics.

Part IV: Alternative Derivations

Schrödinger's approach was not the only possible one. Several other paths to the same equation exist:

From Hamilton-Jacobi theory. Schrödinger was influenced by the optical-mechanical analogy: geometric optics is to wave optics as classical mechanics is to quantum mechanics. The Hamilton-Jacobi equation of classical mechanics, $\frac{\partial S}{\partial t} + H\left(x, \frac{\partial S}{\partial x}\right) = 0$, can be converted into the Schrödinger equation by the substitution $\psi = e^{iS/\hbar}$ and taking $\hbar$ to be small but nonzero. This is actually close to Schrödinger's original reasoning.

From the Feynman path integral. Richard Feynman showed in 1948 that the Schrödinger equation follows from summing the amplitudes $e^{iS[path]/\hbar}$ over all possible paths, where $S$ is the classical action. This approach, which we develop in Chapter 31, provides perhaps the deepest physical understanding of why the Schrödinger equation has the form it does.

From information-theoretic axioms. Modern reconstructions of quantum mechanics (Hardy 2001, Chiribella et al. 2011) derive the Schrödinger equation from abstract postulates about information processing, without ever mentioning waves or particles. These approaches suggest that quantum mechanics is not fundamentally about waves or particles but about the structure of information.

From symmetry principles. The Schrödinger equation can be derived by requiring that the laws of physics be invariant under Galilean transformations (rotations, translations, boosts, and time translations). The group-theoretic approach, developed by Wigner and Bargmann, shows that the TDSE is essentially the unique equation consistent with Galilean symmetry and the superposition principle.

Each of these derivations illuminates a different aspect of the equation's meaning. No single derivation is "the" derivation — the Schrödinger equation sits at the intersection of multiple deep principles.

Part V: What Schrödinger Got Wrong

The Charge Density Interpretation

Schrödinger initially interpreted $|\psi|^2$ as a literal charge density — the electron "smeared out" through space. He was deeply committed to this interpretation, which fit his classical sensibilities: the electron was a wave, and that was that.

This interpretation fails for several reasons:

Single-particle detection. Electrons are always detected at single points, not as spread-out blobs. If $|\psi|^2$ were a charge density, a detector should register a continuous distribution, not a point event.
Multi-particle systems. For two particles, the wave function is $\psi(x_1, x_2, t)$ — a function of six spatial coordinates (in 3D), not three. It cannot be interpreted as a density in ordinary three-dimensional space.
Spreading. A free-particle wave packet spreads over time. Under the charge density interpretation, the electron would literally expand. This contradicts the fact that electrons are always detected as point particles regardless of how much time has passed.

Max Born's statistical interpretation resolved all three problems. Schrödinger never fully accepted it — he famously said near the end of his life: "I don't like it, and I'm sorry I ever had anything to do with it."

The Time-Dependent Equation

Schrödinger actually derived the time-independent equation first and initially struggled with the time-dependent version. His original attempt involved a second-order-in-time equation (closer to the classical wave equation), which gave wrong results for time-dependent problems. It was only in his fourth paper that he wrote down the correct first-order-in-time equation with the crucial factor of $i$.

The necessity of the imaginary unit $i$ in the Schrödinger equation is deeply connected to the complex nature of quantum amplitudes and the distinction between propagating waves ($e^{i(kx-\omega t)}$, complex) and standing waves ($\sin(kx)\cos(\omega t)$, real). A real wave equation cannot correctly describe quantum interference.

Analysis Questions

Heisenberg's matrix mechanics and Schrödinger's wave mechanics are mathematically equivalent. Yet physicists overwhelmingly adopted wave mechanics in the years following 1926. Why? What does this tell us about the role of intuition and visualization in theoretical physics?
Schrödinger was motivated by the optical-mechanical analogy (geometric optics : wave optics :: classical mechanics : quantum mechanics). Trace this analogy in detail. Where does it work? Where does it break down?
The Schrödinger equation can be "derived" from de Broglie's relations, but de Broglie's relations are themselves postulates. In what sense, if any, is the Schrödinger equation "derived"? Compare this to the status of Newton's second law or Maxwell's equations.
Schrödinger and Heisenberg personally despised each other's approaches despite their mathematical equivalence. Find a modern example in physics (or another science) where two equivalent formulations of the same theory provoke strong preferences among practitioners. What drives these preferences?
Schrödinger's Christmas vacation in Arosa is one of the most famous episodes in the history of physics. Research the role of "retreats" and "isolation periods" in other major scientific breakthroughs (Newton's plague years, Einstein's patent office, Darwin's Downe House). What patterns, if any, do you observe?
The multiple independent derivations of the Schrödinger equation (from de Broglie waves, from Hamilton-Jacobi theory, from path integrals, from symmetry principles, from information theory) suggest something deep about the equation's status. What does the existence of multiple derivations tell us about the relationship between the equation and reality?