48 min read

> "If, in some cataclysm, all of scientific knowledge were to be destroyed, and only one sentence passed on to the next generation of creatures, what statement would contain the most information in the fewest words? I believe it is the atomic...

Chapter 5: Quantum Mechanics in Three Dimensions: The Hydrogen Atom

"If, in some cataclysm, all of scientific knowledge were to be destroyed, and only one sentence passed on to the next generation of creatures, what statement would contain the most information in the fewest words? I believe it is the atomic hypothesis..." — Richard Feynman

"The hydrogen atom is the Rosetta Stone of modern physics. Every advance in quantum theory is first tested against this single electron orbiting this single proton."

Everything we have built in Chapters 2 through 4 --- the Schrodinger equation, the Born interpretation, boundary conditions, quantized energies, the harmonic oscillator --- has been in one spatial dimension. Real atoms, of course, live in three dimensions. This chapter makes the leap from the toy models of one dimension to the single most important quantum system ever studied: the hydrogen atom.

The hydrogen atom is not merely another textbook problem. It is the system that made quantum mechanics. When Schrodinger's equation produced the correct hydrogen energy levels in 1926, the entire physics community understood that a revolution had occurred. No adjustable parameters. No hand-waving. One equation, one atom, and perfect agreement with 30 years of spectroscopic data that had baffled every classical theory.

We will derive everything from first principles. The mathematics is more involved than anything we have encountered so far --- separation of variables, special functions, multi-index quantum numbers --- but every step carries physical meaning. By the end, you will understand not just what the hydrogen atom looks like, but why it looks the way it does, and how this single system provides the template for understanding the entire periodic table.

The road ahead is demanding. We will transform a single partial differential equation in three variables into three ordinary differential equations, each of which yields a new quantum number. Along the way, we will encounter special functions (Legendre polynomials, spherical harmonics, Laguerre polynomials) that will become permanent members of our mathematical toolkit. Each of these functions is not a mere mathematical artifact --- it encodes specific physical information about the angular distribution, radial structure, and energy of the electron.

🏃 Fast Track: If you are comfortable with separation of variables in spherical coordinates and spherical harmonics from a mathematics course, you may skim Sections 5.2--5.4 and focus your attention on Sections 5.5--5.7, where the physics comes into sharpest focus. However, the derivation sections contain physical reasoning that is difficult to pick up elsewhere, so even experienced readers may find value in the full treatment.


5.1 From One Dimension to Three: The 3D Schrodinger Equation

In one dimension, the time-independent Schrodinger equation (TISE) reads:

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + V(x)\psi = E\psi$$

The generalization to three dimensions is straightforward in principle: replace the second derivative with the three-dimensional Laplacian operator $\nabla^2$:

$$-\frac{\hbar^2}{2m}\nabla^2\psi(\mathbf{r}) + V(\mathbf{r})\psi(\mathbf{r}) = E\psi(\mathbf{r})$$

where $\mathbf{r} = (x, y, z)$ and in Cartesian coordinates:

$$\nabla^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}$$

This looks like three copies of the one-dimensional kinetic energy operator, one for each spatial direction. For a free particle ($V = 0$), the solutions are plane waves $\psi \propto e^{i\mathbf{k}\cdot\mathbf{r}}$, and the energy is $E = \hbar^2 k^2 / 2m$ with $k^2 = k_x^2 + k_y^2 + k_z^2$. But already something interesting appears: the energy depends only on the magnitude of $\mathbf{k}$, not its direction. This means that for any given energy, there are infinitely many plane-wave solutions pointing in different directions --- our first encounter with the connection between symmetry and degeneracy that will dominate this chapter.

Why Spherical Coordinates?

For the hydrogen atom, the potential energy is the Coulomb potential:

$$V(r) = -\frac{e^2}{4\pi\epsilon_0}\frac{1}{r}$$

where $r = |\mathbf{r}| = \sqrt{x^2 + y^2 + z^2}$ is the distance from the proton to the electron. This potential depends only on $r$, not on the angular orientation --- it is spherically symmetric. This is the defining property of a central force.

When confronted with a central force, we face a strategic choice: work in Cartesian coordinates, where the Laplacian is simple but the potential is complicated ($V = -e^2/4\pi\epsilon_0\sqrt{x^2+y^2+z^2}$), or work in spherical coordinates, where the potential is simple ($V = -e^2/4\pi\epsilon_0 r$) but the Laplacian is more complex. The second option wins decisively, because the simplicity of the potential in spherical coordinates allows the equation to separate.

💡 Key Insight: When the potential has spherical symmetry, spherical coordinates $(r, \theta, \phi)$ are the natural choice. The reason is not aesthetic --- it is that spherical symmetry makes the angular part of the problem separable and universal. The angular solutions (spherical harmonics) are the same for any central potential. Only the radial equation knows about the specific form of $V(r)$.

The spherical coordinates relate to Cartesian ones by:

$$x = r\sin\theta\cos\phi, \quad y = r\sin\theta\sin\phi, \quad z = r\cos\theta$$

with $r \geq 0$, $0 \leq \theta \leq \pi$ (polar angle from the $z$-axis), and $0 \leq \phi < 2\pi$ (azimuthal angle in the $xy$-plane). The volume element transforms as:

$$d^3\mathbf{r} = dx\;dy\;dz = r^2\sin\theta\;dr\;d\theta\;d\phi$$

The factor $r^2\sin\theta$ --- the Jacobian of the coordinate transformation --- will appear throughout our calculations. It is responsible for the crucial difference between $|R(r)|^2$ and the radial probability density $r^2|R(r)|^2$ that we will encounter in Section 5.7.

The Laplacian in spherical coordinates takes the form:

$$\nabla^2 = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial \theta}\left(\sin\theta\frac{\partial}{\partial \theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2}{\partial \phi^2}$$

This expression looks intimidating, but it has a beautiful structure. Notice that the last two terms depend only on angles and share a common factor of $1/r^2$. We can write:

$$\nabla^2 = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) + \frac{1}{r^2}\hat{\Lambda}^2$$

where $\hat{\Lambda}^2$ is the angular Laplacian (sometimes called the Legendrian):

$$\hat{\Lambda}^2 = \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}$$

The split of $\nabla^2$ into a radial part and an angular part $\hat{\Lambda}^2/r^2$ is the mathematical expression of a deep physical separation: the radial kinetic energy (motion toward or away from the center) and the angular kinetic energy (motion around the center).

🔗 Connection: This angular operator is intimately related to the orbital angular momentum operator: $\hat{L}^2 = -\hbar^2\hat{\Lambda}^2$. This means the angular part of the kinetic energy is $\hat{L}^2/2m_e r^2$ --- exactly the classical expression for rotational kinetic energy. We will explore this connection deeply in Chapter 12. For now, we note it without proof: the angular part of the Laplacian controls the orbital angular momentum of the particle.

The Two-Body Problem and the Reduced Mass

Strictly speaking, the hydrogen atom is a two-body problem: both the proton and the electron move. However, because the proton is about 1,836 times heavier than the electron, the proton barely moves and we can, to excellent approximation, treat it as fixed. The correction is to replace the electron mass $m_e$ with the reduced mass:

$$\mu = \frac{m_e m_p}{m_e + m_p} = m_e\frac{m_p}{m_e + m_p} \approx 0.99946\;m_e$$

This changes the energy levels by only 0.054\%. We will use $m_e$ throughout the derivation for simplicity, noting where the reduced mass correction matters. (In Case Study 2, we will see systems where the reduced mass correction is not negligible at all --- muonic hydrogen being the most dramatic example.)

🔄 Check Your Understanding (Spaced Review --- Ch 2): Write down the time-independent Schrodinger equation in one dimension. What is the physical interpretation of each term? Now write the 3D version. What is the only structural change?


5.2 Separation of Variables in Spherical Coordinates

We now apply the most powerful technique in the theory of partial differential equations: separation of variables. The idea is to convert one hard equation in three variables into three simpler equations, each in one variable. We guess that the full wavefunction can be written as a product of a radial part and an angular part:

$$\psi(r, \theta, \phi) = R(r) \cdot Y(\theta, \phi)$$

This is not guaranteed to work for an arbitrary potential, but for a central potential $V(r)$, it succeeds because the potential depends only on $r$ and the angular Laplacian operates only on angles --- the two parts of the equation live in different "worlds."

Substituting into the 3D TISE with a central potential $V(r)$:

$$-\frac{\hbar^2}{2m}\left[\frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right)Y + \frac{R}{r^2}\hat{\Lambda}^2 Y\right] + V(r)RY = ERY$$

Divide through by $RY$ and multiply by $-2mr^2/\hbar^2$:

$$\underbrace{\frac{1}{R}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) - \frac{2mr^2}{\hbar^2}\left[V(r) - E\right]}_{\text{depends only on } r} = \underbrace{-\frac{1}{Y}\hat{\Lambda}^2 Y}_{\text{depends only on } \theta, \phi}$$

This is the critical step: the left side depends only on $r$, the right side depends only on $\theta$ and $\phi$. If you vary $r$ while holding $\theta$ and $\phi$ fixed, the right side does not change. If you vary the angles while holding $r$ fixed, the left side does not change. The only way a function of $r$ can equal a function of angles for all values of all three variables is if both sides equal the same constant. We call it $l(l+1)$, where $l$ is a non-negative integer (the reason for this notation will become clear when we solve the angular equation):

Radial equation: $$\frac{1}{R}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) - \frac{2mr^2}{\hbar^2}\left[V(r) - E\right] = l(l+1)$$

Angular equation: $$-\hat{\Lambda}^2 Y = l(l+1) Y$$

The angular equation is identical for all central-force problems --- it does not involve $V(r)$ at all. This universality is one of the most powerful consequences of spherical symmetry. Solve the angular equation once, and you have the angular wavefunctions for the hydrogen atom, the 3D harmonic oscillator, the deuteron, any spherically symmetric quantum dot, and any other central-force system.

Separating the Angular Equation

The angular part itself separates. Writing $Y(\theta, \phi) = \Theta(\theta)\Phi(\phi)$ and substituting into $-\hat{\Lambda}^2 Y = l(l+1)Y$:

$$\frac{\sin\theta}{\Theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta}{d\theta}\right) + l(l+1)\sin^2\theta = -\frac{1}{\Phi}\frac{d^2\Phi}{d\phi^2}$$

Again, the left side depends only on $\theta$, the right side only on $\phi$. Set both equal to a constant $m^2$ (we use $m$ for the magnetic quantum number, not to be confused with the particle mass):

Azimuthal equation:

$$\frac{d^2\Phi}{d\phi^2} = -m^2\Phi$$

Polar equation:

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta}{d\theta}\right) + \left[l(l+1) - \frac{m^2}{\sin^2\theta}\right]\Theta = 0$$

We have transformed one partial differential equation in three variables into three ordinary differential equations, each in a single variable. This is the power of separation of variables combined with spherical symmetry. Each equation produces one quantum number: the azimuthal equation gives $m$, the polar equation gives $l$, and the radial equation (for a specific $V(r)$) gives $n$.

💡 Key Insight: Separation of variables is not a mere mathematical trick --- it reflects the physical independence of radial and angular motion in a central force. The angular part describes orbital angular momentum, which is conserved (the potential exerts no torque). The radial part describes the "in-and-out" motion in the effective potential. The quantum number $l$ is the bridge between the two: it appears in both the angular and radial equations, linking the angular momentum to the radial dynamics.

⚠️ Common Misconception: Students sometimes believe that separation of variables "forces" the wavefunction to be a product $R(r)Y(\theta,\phi)$. This is not the case. Any linear combination $\psi = \sum_{l,m} c_{lm} R_l(r)Y_l^m(\theta,\phi)$ is also a valid wavefunction. Separation of variables finds the building blocks (the eigenstates), from which arbitrary states can be constructed by superposition.

🔄 Check Your Understanding (Spaced Review --- Ch 3): In the infinite square well, we separated time and space, obtaining the TISE. Here we separate $r$, $\theta$, and $\phi$. What is the key assumption that makes separation of variables work in both cases? What physical fact does each separation exploit?


5.3 The Angular Equation: Spherical Harmonics

The Azimuthal ($\phi$) Equation

The azimuthal equation $\Phi'' = -m^2\Phi$ is one we solved in Chapter 3: the solutions are complex exponentials:

$$\Phi_m(\phi) = \frac{1}{\sqrt{2\pi}}e^{im\phi}$$

with the normalization chosen so that $\int_0^{2\pi}|\Phi_m|^2 d\phi = 1$.

The single-valuedness requirement $\Phi(\phi + 2\pi) = \Phi(\phi)$ forces $m$ to be an integer:

$$m = 0, \pm 1, \pm 2, \pm 3, \ldots$$

This quantum number $m$ is the magnetic quantum number (so named because it determines how the energy levels split in an external magnetic field --- the Zeeman effect, which we will encounter in Chapter 18). Physically, it describes the $z$-component of orbital angular momentum: $L_z = m\hbar$.

The quantization of $m$ is remarkable. Simply demanding that the wavefunction be single-valued as we go around the $z$-axis forces the $z$-component of angular momentum to come in integer multiples of $\hbar$. This is the quantum-mechanical origin of the quantization that Bohr assumed in 1913.

The Polar ($\theta$) Equation: Associated Legendre Functions

The polar equation is considerably harder. It is the associated Legendre equation, and its solutions are the associated Legendre functions $P_l^m(\cos\theta)$. The substitution $u = \cos\theta$ transforms the equation into a more standard form:

$$(1 - u^2)\frac{d^2P}{du^2} - 2u\frac{dP}{du} + \left[l(l+1) - \frac{m^2}{1-u^2}\right]P = 0$$

This equation has regular singular points at $u = \pm 1$ (the poles $\theta = 0, \pi$). The requirement that solutions be finite at the poles --- physically, the wavefunction must be well-behaved everywhere on the sphere --- imposes two constraints:

  1. $l$ must be a non-negative integer: $l = 0, 1, 2, 3, \ldots$
  2. $|m| \leq l$, i.e., $m = -l, -l+1, \ldots, l-1, l$

The second constraint has a beautiful physical interpretation: the $z$-component of angular momentum ($m\hbar$) cannot exceed the total angular momentum magnitude ($\hbar\sqrt{l(l+1)}$, which is approximately $\hbar l$ for large $l$). You cannot have more angular momentum pointing along the $z$-axis than you have in total.

The solutions are the associated Legendre functions $P_l^m(u)$, defined in terms of the ordinary Legendre polynomials $P_l(u)$ by:

$$P_l^m(u) = (-1)^m (1 - u^2)^{m/2} \frac{d^m}{du^m} P_l(u)$$

where $P_l(u)$ is the Legendre polynomial:

$$P_l(u) = \frac{1}{2^l l!}\frac{d^l}{du^l}(u^2 - 1)^l$$

This is the Rodrigues formula, which generates all Legendre polynomials from the simple function $(u^2-1)^l$ by differentiation. The first few Legendre polynomials are:

$l$ $P_l(\cos\theta)$ Physical shape
0 $1$ Uniform (isotropic)
1 $\cos\theta$ One nodal plane ($\theta = \pi/2$)
2 $\frac{1}{2}(3\cos^2\theta - 1)$ Two nodal cones
3 $\frac{1}{2}(5\cos^3\theta - 3\cos\theta)$ Three nodal surfaces

Each successive Legendre polynomial has one more angular node, just as higher energy states in one dimension have more spatial nodes. This is not a coincidence --- it is the same underlying mathematics of Sturm-Liouville eigenvalue problems.

The Spherical Harmonics

Combining the $\theta$ and $\phi$ solutions with the appropriate normalization constants gives the spherical harmonics:

$$Y_l^m(\theta, \phi) = \epsilon \sqrt{\frac{(2l+1)}{4\pi}\frac{(l-|m|)!}{(l+|m|)!}} \; P_l^{|m|}(\cos\theta) \; e^{im\phi}$$

where $\epsilon = (-1)^m$ for $m \geq 0$ and $\epsilon = 1$ for $m < 0$ (the Condon-Shortley phase convention, which is the standard in physics).

The first several spherical harmonics, written out explicitly, are:

$$Y_0^0 = \frac{1}{\sqrt{4\pi}} \qquad \text{(spherically symmetric)}$$

$$Y_1^0 = \sqrt{\frac{3}{4\pi}}\cos\theta, \qquad Y_1^{\pm 1} = \mp\sqrt{\frac{3}{8\pi}}\sin\theta \; e^{\pm i\phi}$$

$$Y_2^0 = \sqrt{\frac{5}{16\pi}}(3\cos^2\theta - 1), \qquad Y_2^{\pm 1} = \mp\sqrt{\frac{15}{8\pi}}\sin\theta\cos\theta \; e^{\pm i\phi}$$

$$Y_2^{\pm 2} = \sqrt{\frac{15}{32\pi}}\sin^2\theta \; e^{\pm 2i\phi}$$

Notice the pattern: $Y_l^0$ depends only on $\theta$ (no $\phi$ dependence, hence cylindrical symmetry about the $z$-axis), while $Y_l^m$ with $m \neq 0$ has $\phi$-dependence through the factor $e^{im\phi}$. The probability density $|Y_l^m|^2$ is always independent of $\phi$ (because $|e^{im\phi}|^2 = 1$), reflecting the rotational symmetry of the underlying physics about the axis along which we measure $L_z$.

Orthogonality and Completeness

The spherical harmonics satisfy the orthogonality relation:

$$\int_0^{2\pi}\int_0^{\pi} Y_{l'}^{m'*}(\theta,\phi) \; Y_l^m(\theta,\phi) \; \sin\theta \; d\theta \; d\phi = \delta_{ll'}\delta_{mm'}$$

This means different spherical harmonics are orthogonal in the sense of the inner product on the unit sphere. The factor $\sin\theta$ in the integration measure is essential --- it is the angular part of the spherical volume element.

The spherical harmonics also form a complete set: any well-behaved function on the surface of a sphere can be expanded as a linear combination of spherical harmonics:

$$f(\theta, \phi) = \sum_{l=0}^{\infty}\sum_{m=-l}^{l} c_{lm} Y_l^m(\theta, \phi)$$

This is the spherical analog of a Fourier series, and it is used throughout physics --- from multipole expansions in electrostatics to the analysis of the cosmic microwave background radiation. The coefficients $c_{lm}$ are the "angular momentum content" of the function $f$.

⚠️ Common Misconception: The spherical harmonics $Y_l^m$ are not the wavefunctions of the hydrogen atom. They are the angular part of the wavefunctions for any central-force problem. The radial part $R(r)$ depends on the specific potential, and it is the combination $\psi_{nlm}(r,\theta,\phi) = R_{nl}(r) Y_l^m(\theta,\phi)$ that gives the complete hydrogen wavefunction.

Physical Interpretation: Angular Momentum

The quantum numbers $l$ and $m$ directly encode the orbital angular momentum:

  • $\hat{L}^2 Y_l^m = \hbar^2 l(l+1) Y_l^m$ --- the magnitude-squared of angular momentum is $\hbar^2 l(l+1)$
  • $\hat{L}_z Y_l^m = m\hbar Y_l^m$ --- the $z$-component of angular momentum is $m\hbar$

Notice something strange: $|\mathbf{L}| = \hbar\sqrt{l(l+1)} > \hbar l = \hbar |m|_{\max}$. The angular momentum vector can never be perfectly aligned with the $z$-axis, even when $m = l$ (its maximum value). The "tip" of the angular momentum vector traces out a cone around the $z$-axis. This is a direct consequence of the uncertainty principle for angular momentum components. Since $\hat{L}_x$ and $\hat{L}_y$ do not commute with $\hat{L}_z$:

$$[\hat{L}_x, \hat{L}_y] = i\hbar\hat{L}_z$$

If $L_z$ has a definite value, then $L_x$ and $L_y$ do not, and the angular momentum vector precesses around the $z$-axis. The "extra" angular momentum $\sqrt{l(l+1)} - l$ is, so to speak, "smeared out" in the $x$ and $y$ directions by the uncertainty principle.

For $l = 2$, the magnitude is $|\mathbf{L}| = \hbar\sqrt{6} \approx 2.449\hbar$, while the maximum $z$-component is $L_z = 2\hbar$. The angular momentum vector is tilted at least $\arccos(2/\sqrt{6}) \approx 35.3°$ from the $z$-axis even in the state of maximum alignment.

📜 Historical Context: The letter labels for angular momentum quantum numbers come from 19th-century spectroscopy: $l = 0$ (s, "sharp"), $l = 1$ (p, "principal"), $l = 2$ (d, "diffuse"), $l = 3$ (f, "fundamental"). These names described the appearance of spectral lines in alkali metals. After $f$, the labeling continues alphabetically: $g, h, i, \ldots$ (skipping $j$ to avoid confusion with the total angular momentum quantum number). These seemingly arbitrary names encode decades of experimental effort to classify spectral lines before quantum mechanics explained their origin. The fact that $l = 0$ is called "s" for "sharp" and $l = 2$ is called "d" for "diffuse" tells us something about line broadening mechanisms that depend on $l$ --- a story for Chapter 18.

🔄 Check Your Understanding: For $l = 2$, list all possible values of $m$. What is the magnitude of the orbital angular momentum? What is the minimum angle between $\mathbf{L}$ and the $z$-axis? Sketch the allowed orientations of the angular momentum vector.


5.4 The Radial Equation: The Hydrogen Atom

With the angular part solved for all central-force problems, we now turn to the radial equation specific to the hydrogen atom. Using the separation constant $l(l+1)$ and the Coulomb potential, the radial equation becomes:

$$-\frac{\hbar^2}{2m_e}\left[\frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) - \frac{l(l+1)}{r^2}R\right] - \frac{e^2}{4\pi\epsilon_0}\frac{R}{r} = ER$$

This equation looks complicated, but we can put it in a much more transparent form with a simple substitution.

The Effective Potential

Define $u(r) = rR(r)$. Then $R = u/r$ and after some algebra (which you should verify as an exercise), the radial equation becomes:

$$-\frac{\hbar^2}{2m_e}\frac{d^2u}{dr^2} + \underbrace{\left[-\frac{e^2}{4\pi\epsilon_0}\frac{1}{r} + \frac{\hbar^2 l(l+1)}{2m_e r^2}\right]}_{V_{\text{eff}}(r)}u = Eu$$

This is precisely a one-dimensional Schrodinger equation for $u(r)$ in the effective potential:

$$V_{\text{eff}}(r) = -\frac{e^2}{4\pi\epsilon_0 r} + \frac{\hbar^2 l(l+1)}{2m_e r^2}$$

The effective potential has two competing terms:

  1. Coulomb attraction ($\sim -1/r$): pulls the electron toward the nucleus, dominates at large $r$
  2. Centrifugal barrier ($\sim +1/r^2$): pushes the electron away from the nucleus for $l > 0$, dominates at small $r$

For $l = 0$ (s-states), there is no centrifugal barrier, and the electron can reach the nucleus --- indeed, s-state wavefunctions have a nonzero value at $r = 0$. For $l \geq 1$, the centrifugal term dominates at small $r$, creating an effective repulsion that keeps the electron away from the origin. The competition between attraction and repulsion creates a potential well at an intermediate distance, and it is in this well that the bound states live.

The minimum of $V_{\text{eff}}$ for $l \geq 1$ occurs at $r_{\min} = l(l+1)a_0$, where $a_0 = 4\pi\epsilon_0\hbar^2/(m_e e^2)$ is the Bohr radius. This gives us a first estimate of where the electron probability is concentrated for different values of $l$.

💡 Key Insight: The centrifugal barrier is not a new force --- it is the quantum-mechanical manifestation of conservation of angular momentum. A particle with angular momentum $l > 0$ simply cannot be found at $r = 0$, because that would require infinite angular speed. The barrier grows as $l(l+1)/r^2$, becoming impenetrable for large $l$. This is why high-$l$ orbitals are "donut-shaped" and stay far from the nucleus, while s-orbitals ($l = 0$) can penetrate to the nucleus.

The boundary condition $u(0) = 0$ (so that $R = u/r$ is finite at the origin) plays the same role as the boundary condition at the wall of an infinite square well: it eliminates one of the two linearly independent solutions and constrains the allowed energies.

Solving the Radial Equation

The solution of the radial equation for the Coulomb potential is one of the great achievements of mathematical physics. We introduce dimensionless variables to simplify the algebra. For bound states ($E < 0$), define:

$$\kappa = \frac{\sqrt{-2m_e E}}{\hbar}, \quad \rho = \kappa r, \quad \rho_0 = \frac{m_e e^2}{4\pi\epsilon_0\hbar^2\kappa}$$

The radial equation becomes:

$$\frac{d^2u}{d\rho^2} = \left[1 - \frac{\rho_0}{\rho} + \frac{l(l+1)}{\rho^2}\right]u$$

Asymptotic analysis guides us to the correct form of the solution. This technique --- examining the behavior of the equation in extreme limits to identify the dominant behavior --- is one of the most important tools in mathematical physics.

  • As $\rho \to \infty$: The terms $\rho_0/\rho$ and $l(l+1)/\rho^2$ become negligible compared to $1$, so $d^2u/d\rho^2 \approx u$. The solutions are $u \sim e^{\pm\rho}$, and we keep only the decaying exponential $u \sim e^{-\rho}$ for normalizability.
  • As $\rho \to 0$: The $l(l+1)/\rho^2$ term dominates, so $d^2u/d\rho^2 \approx [l(l+1)/\rho^2]u$. Trying $u \sim \rho^s$, we find $s(s-1) = l(l+1)$, giving $s = l+1$ (we reject $s = -l$ because it would make $R = u/r$ singular at the origin).

This motivates the substitution:

$$u(\rho) = \rho^{l+1} e^{-\rho} v(\rho)$$

which "factors out" the known asymptotic behavior, leaving $v(\rho)$ to encode the detailed structure. Substituting and simplifying yields the differential equation for $v$:

$$\rho\frac{d^2v}{d\rho^2} + 2(l+1-\rho)\frac{dv}{d\rho} + [\rho_0 - 2(l+1)]v = 0$$

We assume a power series solution $v(\rho) = \sum_{j=0}^\infty c_j \rho^j$ and derive a recursion relation for the coefficients $c_j$:

$$c_{j+1} = \frac{2(j + l + 1) - \rho_0}{(j+1)(j + 2l + 2)}c_j$$

Quantization: Why the Energy Levels Are Discrete

Here comes the crucial physics. For large $j$, the recursion relation gives $c_{j+1}/c_j \approx 2/j$, which means the series behaves like $e^{2\rho}$ at large $\rho$. Combined with the factor $\rho^{l+1}e^{-\rho}$ already present, this gives $u \sim \rho^{l+1}e^{+\rho}$, which diverges. The wavefunction would be non-normalizable.

The only escape is for the series to terminate: there must be some maximum value $j_{\max}$ beyond which all coefficients vanish. This requires the numerator of the recursion to vanish:

$$2(j_{\max} + l + 1) - \rho_0 = 0 \implies \rho_0 = 2n$$

where $n \equiv j_{\max} + l + 1$ is a positive integer. Since $j_{\max} \geq 0$ and $l \geq 0$, we have $n \geq l + 1$, or equivalently, $l \leq n - 1$.

This single condition --- the requirement that the series terminates to prevent the wavefunction from blowing up --- quantizes the energy. Substituting $\rho_0 = 2n$ back into the definition:

$$\frac{m_e e^2}{4\pi\epsilon_0\hbar^2\kappa} = 2n \implies \kappa = \frac{m_e e^2}{8\pi\epsilon_0\hbar^2 n}$$

And since $E = -\hbar^2\kappa^2/2m_e$:

$$\boxed{E_n = -\frac{m_e e^4}{2(4\pi\epsilon_0)^2\hbar^2}\cdot\frac{1}{n^2} = -\frac{13.6\;\text{eV}}{n^2}}$$

for $n = 1, 2, 3, \ldots$

This is one of the most important equations in all of physics. It gives the energy levels of the hydrogen atom from first principles, with no adjustable parameters. The constant $13.6\;\text{eV}$ is the ground state binding energy --- the Rydberg energy --- and $a_0 = 4\pi\epsilon_0\hbar^2/(m_e e^2) = 0.529\;\text{\AA}$ is the Bohr radius, the characteristic size of the hydrogen atom.

🧪 Experiment: The Rydberg constant $R_\infty = m_e e^4 / (8\epsilon_0^2 h^3 c) = 1.097\,373\,156\,816(11) \times 10^7\;\text{m}^{-1}$ is one of the most precisely measured constants in physics, known to better than 1 part in $10^{12}$. The agreement between theory and experiment for hydrogen energy levels is extraordinary --- one of the great triumphs of quantum mechanics.

💡 Key Insight (Theme: Mathematics and Physics Inseparable): The quantization of energy arises from a purely mathematical requirement: the power series must terminate to keep the wavefunction normalizable. There is no additional physical postulate needed. The mathematical structure of the differential equation, combined with the boundary conditions (finiteness at $r = 0$ and $r \to \infty$), forces the energy to be quantized. This is what we mean when we say that in quantum mechanics, the formalism is the physics.

The Radial Wavefunctions

The terminating polynomials $v(\rho)$ are the associated Laguerre polynomials $L_{n-l-1}^{2l+1}(2\rho/n)$. The normalized radial wavefunctions are:

$$R_{nl}(r) = -\sqrt{\left(\frac{2}{na_0}\right)^3\frac{(n-l-1)!}{2n[(n+l)!]^3}} \; e^{-r/na_0}\left(\frac{2r}{na_0}\right)^l L_{n-l-1}^{2l+1}\left(\frac{2r}{na_0}\right)$$

The first few radial wavefunctions, which you should commit to memory (or at least be comfortable deriving), are:

$n$ $l$ $R_{nl}(r)$
1 0 $2\left(\frac{1}{a_0}\right)^{3/2} e^{-r/a_0}$
2 0 $\frac{1}{2\sqrt{2}}\left(\frac{1}{a_0}\right)^{3/2}\left(2 - \frac{r}{a_0}\right)e^{-r/2a_0}$
2 1 $\frac{1}{2\sqrt{6}}\left(\frac{1}{a_0}\right)^{3/2}\frac{r}{a_0}e^{-r/2a_0}$
3 0 $\frac{2}{81\sqrt{3}}\left(\frac{1}{a_0}\right)^{3/2}\left(27 - 18\frac{r}{a_0} + 2\frac{r^2}{a_0^2}\right)e^{-r/3a_0}$

Examine these wavefunctions carefully. Notice: - All contain a decaying exponential $e^{-r/na_0}$ --- the wavefunction dies off at large $r$, with the decay length proportional to $n$. Higher-energy states are more spread out. - The $R_{20}$ wavefunction has a factor $(2 - r/a_0)$, which vanishes at $r = 2a_0$ --- a radial node. - The $R_{21}$ wavefunction has a factor $r/a_0$, which vanishes at $r = 0$ --- this is the centrifugal suppression near the nucleus for $l = 1$. - The ground state $R_{10}$ has no nodes (except the trivial ones at $r = 0$ for $l > 0$ and $r = \infty$).

⚠️ Common Misconception: Different textbooks use different conventions for Laguerre polynomials, leading to radial wavefunctions that look different but are mathematically identical. Griffiths, Shankar, and Sakurai all use slightly different normalizations. When looking up these functions or comparing with other sources, always check the convention being used. The physics is the same.

🔄 Check Your Understanding (Spaced Review --- Ch 4): In the harmonic oscillator, the solutions involved Hermite polynomials multiplied by a Gaussian $e^{-\xi^2/2}$. Here, we have Laguerre polynomials multiplied by a decaying exponential $e^{-r/na_0}$. What is the physical reason for the exponential decay in both cases? (Hint: what happens to the wavefunction in a classically forbidden region? What is the classically forbidden region for each problem?)


5.5 The Hydrogen Atom Spectrum

Energy Levels and Degeneracy

The energy levels $E_n = -13.6\;\text{eV}/n^2$ have a remarkable property: they depend only on $n$, not on $l$ or $m$. For a given $n$:

  • $l$ ranges from $0$ to $n-1$
  • For each $l$, $m$ ranges from $-l$ to $+l$ (that is, $2l + 1$ values)

The total number of states with energy $E_n$ is:

$$g_n = \sum_{l=0}^{n-1}(2l + 1) = n^2$$

🧩 Productive Struggle: Before reading on, try to compute $g_n$ yourself. Write out the sum explicitly for $n = 1, 2, 3, 4$:

  • $n = 1$: $l = 0$ only, so $g_1 = 1$.
  • $n = 2$: $l = 0$ gives 1 state, $l = 1$ gives 3 states, so $g_2 = 1 + 3 = 4$.
  • $n = 3$: $g_3 = 1 + 3 + 5 = 9$.
  • $n = 4$: $g_4 = 1 + 3 + 5 + 7 = 16$.

Can you see the pattern? The sum of the first $n$ odd numbers is $n^2$ (a fact known to the ancient Greeks, who proved it geometrically by arranging dots into L-shaped gnomons around a growing square). Can you prove this algebraically? (Hint: use the formula for an arithmetic series, or try induction.)

The complete energy level structure for the first four shells:

$n$ $E_n$ (eV) Allowed $l$ values Degeneracy $n^2$ Orbital labels
1 $-13.6$ 0 1 $1s$
2 $-3.40$ 0, 1 4 $2s, 2p$
3 $-1.51$ 0, 1, 2 9 $3s, 3p, 3d$
4 $-0.85$ 0, 1, 2, 3 16 $4s, 4p, 4d, 4f$

(Including spin, which we have not yet discussed, the degeneracy doubles to $2n^2$. See Section 5.6.)

The energy levels crowd together as $n$ increases, approaching the ionization threshold $E = 0$ in the limit $n \to \infty$. The spacing between adjacent levels is:

$$\Delta E = E_{n+1} - E_n = 13.6\;\text{eV}\left(\frac{1}{n^2} - \frac{1}{(n+1)^2}\right) \approx \frac{27.2\;\text{eV}}{n^3} \quad \text{(for large } n\text{)}$$

This decreasing spacing means that transitions between high-$n$ levels produce low-energy photons (radio waves for $n \sim 100$), while transitions to the ground state produce high-energy ultraviolet photons.

💡 Key Insight: The $n^2$ degeneracy of the hydrogen atom is anomalous. For a generic central potential, states with different $l$ but the same number of radial nodes would have different energies. The fact that $E$ depends only on $n$ --- and not on $l$ --- is a special property of the $1/r$ Coulomb potential. It arises from a "hidden" symmetry associated with the Laplace-Runge-Lenz vector, a conserved quantity unique to the inverse-square force law. In classical mechanics, this same hidden symmetry is why Kepler orbits close (they are non-precessing ellipses, not rosettes).

🔗 Connection: When we study multi-electron atoms in Chapter 16, this extra degeneracy is broken. In helium and beyond, the 2s and 2p levels are not degenerate --- the 2s electron "penetrates" closer to the nucleus and experiences less shielding, lowering its energy. This splitting is what makes the periodic table possible and is what makes chemistry interesting.

Spectral Series

When an electron transitions between energy levels, it emits or absorbs a photon with energy equal to the energy difference:

$$E_\gamma = |E_{n_i} - E_{n_f}| = 13.6\;\text{eV}\left|\frac{1}{n_f^2} - \frac{1}{n_i^2}\right|$$

The corresponding wavelength is:

$$\frac{1}{\lambda} = R_\infty\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right)$$

where $R_\infty = 1.097 \times 10^7\;\text{m}^{-1}$ is the Rydberg constant (strictly, we should use the reduced-mass-corrected value $R_H$, which differs from $R_\infty$ by about 0.054%).

The spectral lines are organized into series based on the final state $n_f$:

Series $n_f$ Transitions from Spectral region First line $\lambda$ Series limit
Lyman 1 $n_i = 2, 3, 4, \ldots$ Ultraviolet 121.6 nm 91.2 nm
Balmer 2 $n_i = 3, 4, 5, \ldots$ Visible 656.3 nm (red) 364.6 nm
Paschen 3 $n_i = 4, 5, 6, \ldots$ Near infrared 1875 nm 820.4 nm
Brackett 4 $n_i = 5, 6, 7, \ldots$ Infrared 4051 nm 1458 nm
Pfund 5 $n_i = 6, 7, 8, \ldots$ Far infrared 7460 nm 2279 nm

The series limit corresponds to $n_i \to \infty$, where the initial state approaches the ionization threshold. Photons with wavelengths shorter than the series limit correspond to ionization (the electron escapes entirely), forming a continuous spectrum rather than discrete lines.

📜 Historical Context: The Bohr Model --- Triumph and Limitation

In 1913, Niels Bohr proposed a model of the hydrogen atom that quantized the electron's orbital angular momentum in units of $\hbar$: $L = n\hbar$ for $n = 1, 2, 3, \ldots$ Combined with classical mechanics and electrostatics, this gave the correct energy levels $E_n = -13.6\;\text{eV}/n^2$ and explained all known hydrogen spectral series.

The Bohr model was a stunning success --- but it was also fundamentally wrong in several crucial ways:

  • Wrong angular momentum: Bohr gives $L = n\hbar$ for the $n$-th level. Quantum mechanics gives $L = \hbar\sqrt{l(l+1)}$, which is zero for the ground state ($l = 0$). The ground state of hydrogen has no orbital angular momentum at all --- the electron is not "orbiting."
  • No spatial structure: Bohr's model has circular orbits at specific radii $r_n = n^2 a_0$. Quantum mechanics gives probability clouds with complex node structures.
  • Cannot handle multi-electron atoms: Bohr's model has no mechanism for electron-electron interactions and gives completely wrong results for helium.
  • No fine structure: Bohr cannot explain the small splittings within each energy level.

The Bohr model is not "wrong in detail but right in spirit." It is a theory that happens to give the right answer for the one quantity (energy) that does not depend on the angular momentum quantum number $l$. It is a fortunate accident that Bohr's quantization condition $L = n\hbar$ happens to give the same energy formula as Schrodinger's equation, because the Coulomb potential's hidden symmetry makes $E$ independent of $l$. For any other central potential, the Bohr model would fail even for energy levels.

🔗 Connection (Spaced Review --- Ch 1): In Chapter 1, we introduced $E_n = -13.6\;\text{eV}/n^2$ as a mysterious empirical formula. Now we have derived it from first principles. The Bohr model captured the $n$-dependence (which is exact for hydrogen) but imposed the wrong angular momentum structure. The full quantum treatment gives us quantum numbers $n, l, m$ with $l = 0, 1, \ldots, n-1$ --- a far richer structure that the Bohr model could not imagine.

Selection Rules

Not all transitions between energy levels are equally probable. Electric dipole transitions --- the most common type --- obey selection rules:

$$\Delta l = \pm 1, \quad \Delta m = 0, \pm 1$$

These rules arise from the matrix element of the position operator between initial and final states: $\langle n'l'm'|\hat{\mathbf{r}}|nlm\rangle$. The angular integral involves three spherical harmonics (initial, final, and the one from $\hat{\mathbf{r}}$), and the orthogonality properties of spherical harmonics force the selection rules.

The physical reason is conservation of angular momentum: the emitted photon carries one unit of angular momentum ($l_\gamma = 1$), so the atom's angular momentum must change by exactly one unit. The selection rule on $m$ reflects the three possible polarizations of the photon: $\Delta m = 0$ corresponds to linear polarization along $z$, while $\Delta m = \pm 1$ corresponds to right- and left-circular polarization.

This means, for example, that a direct transition from $3d$ to $1s$ (where $\Delta l = 2$) is forbidden as an electric dipole transition. It can still occur via higher-order processes (magnetic dipole, electric quadrupole), but these are much less probable.

📊 By the Numbers: Electric dipole transitions in hydrogen have typical lifetimes of $\sim 10^{-9}$ seconds (nanoseconds). Forbidden transitions (magnetic dipole, electric quadrupole) can have lifetimes of milliseconds or longer. The metastable $2s$ state of hydrogen, which cannot decay to the $1s$ ground state by a single photon (since $\Delta l = 0$), has a lifetime of about $1/7$ of a second --- extraordinary for an atomic excited state. It decays instead by two-photon emission, a process we will study in the context of time-dependent perturbation theory in Chapter 21.

🔄 Check Your Understanding: The Balmer series produces visible light. What is the wavelength of the Balmer-$\alpha$ line ($n = 3 \to n = 2$)? Is the transition $3s \to 2s$ allowed? If the atom transitions from $3d$ to $2p$, what are the possible values of $\Delta m$?


5.6 Quantum Numbers and the State Space

Each stationary state of the hydrogen atom is characterized by three quantum numbers:

Quantum Number Symbol Range Physical Meaning
Principal $n$ $1, 2, 3, \ldots$ Determines energy: $E_n = -13.6/n^2$ eV. Also sets the overall size: $\langle r \rangle \sim n^2 a_0$.
Azimuthal (orbital) $l$ $0, 1, \ldots, n-1$ Orbital angular momentum magnitude: $L = \hbar\sqrt{l(l+1)}$. Determines orbital shape and centrifugal barrier.
Magnetic $m$ (or $m_l$) $-l, -l+1, \ldots, l$ $z$-component of angular momentum: $L_z = m\hbar$. Determines orientation relative to a chosen axis.

These three quantum numbers define a complete set of commuting observables: $\hat{H}$, $\hat{L}^2$, and $\hat{L}_z$. Every stationary state of the hydrogen atom is simultaneously an eigenstate of all three operators, and specifying $(n, l, m)$ uniquely determines the state (up to normalization and an overall phase).

The Fourth Quantum Number: Spin

When we study the Stern-Gerlach experiment in Chapter 6 and spin in detail in Chapter 13, we will discover that the electron possesses an intrinsic angular momentum called spin, characterized by the quantum number $s = 1/2$. The spin magnetic quantum number $m_s$ takes values $+1/2$ ("spin up", often denoted $\uparrow$) and $-1/2$ ("spin down", denoted $\downarrow$).

The complete specification of a hydrogen atom state then requires four quantum numbers: $(n, l, m_l, m_s)$. The degeneracy of the $n$-th level becomes $2n^2$:

$n$ Orbital states ($n^2$) Spin states Total ($2n^2$) Subshells
1 1 $\times 2$ 2 $1s$
2 4 $\times 2$ 8 $2s, 2p$
3 9 $\times 2$ 18 $3s, 3p, 3d$
4 16 $\times 2$ 32 $4s, 4p, 4d, 4f$

Why the Extra Degeneracy?

The degeneracy of the hydrogen atom has two distinct sources, and it is important to distinguish them:

The $m$-degeneracy (the fact that states with the same $n$ and $l$ but different $m$ have the same energy) is guaranteed by rotational symmetry. Rotating the coordinate system changes $m$ but not the physics. If the Hamiltonian commutes with $\hat{L}_z$ (and it does, because the Coulomb potential is spherically symmetric), then states with different $m$ must have the same energy. This is called essential or geometric degeneracy. Every central-force problem has this degeneracy, and it is $(2l + 1)$-fold.

The $l$-degeneracy (the fact that states with different $l$ but the same $n$ have the same energy) is not guaranteed by rotational symmetry. It is an accidental degeneracy specific to the $1/r$ Coulomb potential. The name "accidental" is misleading --- it is actually due to a hidden symmetry associated with the Laplace-Runge-Lenz vector $\hat{\mathbf{A}}$. In classical mechanics, this vector points along the major axis of the Kepler ellipse and is conserved for inverse-square force laws. Its quantum-mechanical counterpart generates a symmetry that enlarges the rotation group $SO(3)$ to the larger group $SO(4)$.

🔗 Connection: We will explore this hidden symmetry in Chapter 10 when we study symmetry and conservation laws systematically. For now, the key point is: the Coulomb potential is special, and this specialness has profound consequences. When we add perturbations --- the relativistic correction to the kinetic energy, the spin-orbit interaction, the Lamb shift --- the hidden symmetry is broken, the $l$-degeneracy lifts, and the hydrogen spectrum develops its famous fine structure. Each of these perturbations is a window into physics beyond the simple Coulomb potential. That story is told in Chapters 17 and 18.

💡 Key Insight (Theme: Symmetry): Every degeneracy in quantum mechanics signals a symmetry. The $(2l+1)$-fold $m$-degeneracy reflects rotational symmetry ($SO(3)$). The $l$-degeneracy unique to hydrogen reflects a higher symmetry ($SO(4)$). This is a deep principle: whenever you encounter an "unexpectedly large" degeneracy, look for a hidden symmetry. The converse is also powerful: when a perturbation breaks a symmetry, the corresponding degeneracy is lifted, and the spectrum splits. Symmetry breaking and degeneracy lifting are two sides of the same coin.

🔄 Check Your Understanding (Spaced Review --- Ch 3): In the 1D infinite square well, the energy levels $E_n \propto n^2$ are non-degenerate. In the 2D square well, they become degenerate (e.g., $E_{1,2} = E_{2,1}$). Why does moving to higher dimensions create degeneracy? What symmetry is responsible in the 2D case?


5.7 Visualizing Hydrogen Orbitals

The complete hydrogen wavefunctions are:

$$\psi_{nlm}(r, \theta, \phi) = R_{nl}(r) \cdot Y_l^m(\theta, \phi)$$

The probability of finding the electron in a volume element $d^3\mathbf{r} = r^2\sin\theta \; dr \; d\theta \; d\phi$ is:

$$|\psi_{nlm}|^2 \; d^3\mathbf{r} = |R_{nl}(r)|^2 |Y_l^m(\theta,\phi)|^2 \; r^2\sin\theta \; dr \; d\theta \; d\phi$$

Radial Probability Distribution

For many questions, we care only about the radial distribution: what is the probability of finding the electron at a distance between $r$ and $r + dr$ from the nucleus, regardless of direction? Integrating over angles and using the orthonormality of $Y_l^m$:

$$P(r)\;dr = |R_{nl}(r)|^2 r^2\;dr$$

The radial probability density is:

$$P_{nl}(r) = r^2 |R_{nl}(r)|^2$$

The factor of $r^2$ comes from the volume element in spherical coordinates --- it represents the fact that a thin spherical shell at radius $r$ has surface area $4\pi r^2$. This factor changes everything about the radial distribution:

  • $|R_{10}(r)|^2$ is maximum at $r = 0$ --- the wavefunction amplitude is largest at the nucleus.
  • But $P_{10}(r) = r^2 |R_{10}(r)|^2$ is zero at $r = 0$ (because $r^2 = 0$) and has its maximum at $r = a_0$.

The resolution of this apparent paradox is that while the probability per unit volume is highest at the nucleus, the volume of a thin shell at radius $r$ is proportional to $r^2$, which vanishes at $r = 0$. The probability of finding the electron in a thin shell around $r = 0$ is tiny because the shell has essentially zero volume.

🔴 Warning: The distinction between $|R(r)|^2$ (probability density per unit volume) and $r^2|R(r)|^2$ (probability per unit radius) is a persistent source of confusion. When someone says "where is the electron most likely to be found?" you must ask: most likely per unit volume, or most likely at a given distance from the nucleus? These give different answers. In this textbook, "most probable radius" always refers to the maximum of $P(r) = r^2|R|^2$.

Most Probable Radius vs. Expectation Value

Setting $dP_{nl}/dr = 0$ gives the most probable radius $r_{mp}$. For the ground state ($n = 1, l = 0$):

$$P_{10}(r) = 4r^2(1/a_0)^3 e^{-2r/a_0}$$

$$\frac{dP_{10}}{dr} = 4(1/a_0)^3\left[2r - \frac{2r^2}{a_0}\right]e^{-2r/a_0} = 0$$

This gives $r_{mp} = a_0 = 0.529\;\text{\AA}$ --- exactly the Bohr radius. This is a satisfying connection to the Bohr model: the most likely distance of the electron from the proton in the quantum ground state equals the orbital radius of the Bohr model's ground state.

The expectation value of $r$ is:

$$\langle r \rangle_{nl} = \int_0^\infty r \; P_{nl}(r)\;dr = \frac{a_0}{2}\left[3n^2 - l(l+1)\right]$$

For the ground state: $\langle r \rangle_{10} = \frac{3}{2}a_0 = 0.794\;\text{\AA}$.

Notice that $\langle r \rangle \neq r_{mp}$. The expectation value is larger because the radial distribution has a long exponential tail extending to large $r$, which pulls the average outward. This asymmetry (the distribution falls steeply to the left of the peak but slowly to the right) is characteristic of all the hydrogen radial distributions.

For higher excited states, the expectation value grows quadratically with $n$: $\langle r \rangle \sim \frac{3}{2}n^2 a_0$ for s-states. A hydrogen atom in the $n = 100$ state has $\langle r \rangle \approx 15{,}000\;a_0 \approx 0.8\;\mu\text{m}$ --- larger than a small bacterium!

📊 By the Numbers: Other useful expectation values for hydrogen:

$$\langle r^2 \rangle_{nl} = \frac{a_0^2 n^2}{2}\left[5n^2 + 1 - 3l(l+1)\right]$$

$$\left\langle\frac{1}{r}\right\rangle_{nl} = \frac{1}{n^2 a_0}$$

$$\left\langle\frac{1}{r^2}\right\rangle_{nl} = \frac{1}{n^3(l + 1/2)a_0^2}$$

The expectation value $\langle 1/r \rangle = 1/(n^2 a_0)$ is especially useful: it gives the average potential energy $\langle V \rangle = -e^2/(4\pi\epsilon_0)\langle 1/r \rangle = 2E_n$ (by the virial theorem). These expectation values will be essential when we compute perturbative corrections in Chapters 17 and 18 --- the fine structure corrections to hydrogen are proportional to $\langle 1/r^3 \rangle$, which depends on $l$.

Node Structure

The wavefunctions have a characteristic node structure that connects directly to the energy:

  • Radial nodes: The number of zeros of $R_{nl}(r)$ for $r > 0$ is $n - l - 1$.
  • Angular nodes: The number of angular nodes (surfaces where $Y_l^m = 0$) is $l$. For $Y_l^0$, these are cones at specific polar angles. For $Y_l^m$ with $m \neq 0$, there are both cone-shaped and planar nodes.
  • Total nodes: The total number of nodes is $(n - l - 1) + l = n - 1$.

This last result --- total nodes = $n - 1$ --- is consistent with the general principle established in Chapter 3: higher-energy states have more nodes. Just as the $n$-th energy state of the 1D infinite well has $n - 1$ nodes, the hydrogen state with principal quantum number $n$ has $n - 1$ total nodes. The novelty in 3D is that these nodes can be distributed between radial nodes and angular nodes in different ways for different values of $l$.

Orbital $n$ $l$ Radial nodes Angular nodes Total
$1s$ 1 0 0 0 0
$2s$ 2 0 1 0 1
$2p$ 2 1 0 1 1
$3s$ 3 0 2 0 2
$3p$ 3 1 1 1 2
$3d$ 3 2 0 2 2
$4s$ 4 0 3 0 3
$4f$ 4 3 0 3 3

The node structure provides a powerful way to identify orbitals visually: count the radial nodes (spherical shells where $\psi = 0$) and angular nodes (conical or planar surfaces where $\psi = 0$), add them up, and you get $n - 1$.

Orbital Shapes

The familiar orbital shapes from chemistry --- the spherical s-orbitals, dumbbell-shaped p-orbitals, cloverleaf d-orbitals --- are representations of $|Y_l^m(\theta,\phi)|^2$ or, more commonly, of the real linear combinations:

$$p_x \propto \frac{1}{\sqrt{2}}(Y_1^{-1} - Y_1^1) \propto \sin\theta\cos\phi \propto \frac{x}{r}$$

$$p_y \propto \frac{i}{\sqrt{2}}(Y_1^{-1} + Y_1^1) \propto \sin\theta\sin\phi \propto \frac{y}{r}$$

$$p_z \propto Y_1^0 \propto \cos\theta \propto \frac{z}{r}$$

These real combinations are used because they transform simply under reflections and are aligned with the coordinate axes. The complex spherical harmonics $Y_l^m$ are eigenstates of $\hat{L}_z$ (with definite $m$); the real combinations $p_x, p_y, p_z$ are not eigenstates of $\hat{L}_z$, but they are eigenstates of $\hat{L}^2$ (with $l = 1$). The choice between complex and real basis depends on the problem: for atoms in a magnetic field (which picks out the $z$-axis), use complex $Y_l^m$; for chemical bonding (where the axes are defined by neighboring atoms), use real combinations.

Similarly, the five $d$-orbitals ($l = 2$) have the familiar real combinations $d_{z^2}$, $d_{xz}$, $d_{yz}$, $d_{xy}$, and $d_{x^2-y^2}$. The $d_{z^2}$ orbital has a distinctive "double dumbbell plus donut" shape, while the others have four-lobed cloverleaf patterns oriented along different planes.

⚠️ Common Misconception: Orbital plots do NOT show the path of the electron. There is no "orbit." The electron does not travel around the nucleus on any trajectory. The orbital is a probability distribution --- a three-dimensional cloud of likelihood. The lobes of a p-orbital are regions where the electron is most likely to be found, but the electron is not bouncing back and forth between them. This point cannot be overemphasized. The word "orbital" was deliberately chosen (by Robert Mulliken in the 1930s) to be different from "orbit," precisely to prevent this confusion.

⚠️ Common Misconception: The boundary surfaces you see in textbook orbital plots are surfaces of constant probability density, not surfaces that "contain" the electron. Typically they are drawn to enclose some fraction (say 90%) of the total probability. The actual wavefunction extends to infinity --- there is always some probability of finding the electron at any distance from the nucleus, no matter how large.

🔄 Check Your Understanding: How many radial nodes does the $4d$ orbital have? How many angular nodes? The $2s$ wavefunction has $R_{20}(r) \propto (2 - r/a_0)e^{-r/2a_0}$. Where is the radial node? Compute $P_{20}(r)$ and find its maxima numerically or graphically. Which peak is higher?


5.8 The Periodic Table: Quantum Mechanics Meets Chemistry

The hydrogen atom is the foundation for understanding the entire periodic table. Although multi-electron atoms are vastly more complicated (a topic for Chapter 16), the basic organizational principle comes directly from the hydrogen quantum numbers.

Shell Structure

The quantum numbers define a natural hierarchy:

  • Shell: All states with the same $n$ (historically labeled $K, L, M, N, O, \ldots$ for $n = 1, 2, 3, 4, 5, \ldots$)
  • Subshell: All states with the same $n$ and $l$ (e.g., $3d$)
  • Orbital: A specific $(n, l, m)$ state
  • State: A specific $(n, l, m, m_s)$ state --- includes both spatial and spin information

Each subshell can hold $2(2l+1)$ electrons (the factor of 2 from spin, the factor $2l+1$ from the magnetic quantum number):

Subshell $l$ Orbitals ($2l+1$) Max electrons $2(2l+1)$
$s$ 0 1 2
$p$ 1 3 6
$d$ 2 5 10
$f$ 3 7 14

Each shell can hold $2n^2$ electrons total (the total degeneracy including spin). This immediately explains the structure of the periodic table: the first period has 2 elements (filling the $1s$ subshell), the second period has 8 elements (filling $2s$ and $2p$), and so on.

The Aufbau Principle and Its Exceptions

In multi-electron atoms, the electrons fill the available states from lowest energy to highest, subject to the Pauli exclusion principle (Chapter 15): no two electrons can have the same set of four quantum numbers $(n, l, m_l, m_s)$.

The crucial difference from hydrogen is that in multi-electron atoms, the energy does depend on $l$. The accidental degeneracy is broken by electron-electron repulsion, and the approximate filling order is:

$$1s < 2s < 2p < 3s < 3p < 4s < 3d < 4p < 5s < 4d < \cdots$$

The reason $4s$ fills before $3d$ (even though $3d$ has lower $n$) is a phenomenon called penetration and shielding. The $4s$ electron has a probability distribution that penetrates close to the nucleus (because of its radial nodes at small $r$), where it "sees" the full nuclear charge $Z$. The $3d$ electron, lacking this penetration, is shielded from the nucleus by inner electrons and feels a reduced effective charge. The net effect is that $4s$ is lower in energy than $3d$ for atoms with $Z \approx 19$--$29$ (potassium through copper).

This level ordering is approximate and has many exceptions. Chromium ($Z = 24$) has the configuration $[\text{Ar}]3d^5 4s^1$ rather than the expected $[\text{Ar}]3d^4 4s^2$, because the half-filled $3d$ subshell has extra stability from exchange energy. Copper ($Z = 29$) has $[\text{Ar}]3d^{10} 4s^1$ rather than $[\text{Ar}]3d^9 4s^2$ for a similar reason.

🔗 Connection: The full treatment of multi-electron atoms, including the Hartree-Fock self-consistent field method, is the subject of Chapter 16. The variational approach to helium (the simplest multi-electron atom) is treated in Chapter 19. These chapters will make quantitative what we are only sketching qualitatively here.

What the Hydrogen Atom Explains

From hydrogen's quantum numbers alone, we can understand the architecture of the periodic table:

  1. Why the periodic table has rows of length 2, 8, 8, 18, 18, 32: These are the capacities of the subshells being filled in each period, modified by the actual energy ordering.
  2. Why elements in the same column have similar chemistry: They have the same outer-electron (valence) configuration --- the same $l$ and number of electrons in the outermost subshell.
  3. Why transition metals exist: The fourth and fifth periods include elements filling the $3d$ and $4d$ subshells, which hold 10 electrons each.
  4. Why lanthanides and actinides form separate rows: They are filling the $4f$ and $5f$ subshells, which hold 14 electrons each.
  5. Why there are two elements in the first period, eight in the second: $n = 1$ holds $2(1)^2 = 2$ electrons; $n = 2$ holds $2s + 2p = 2 + 6 = 8$.

💡 Key Insight (Theme: Mathematics and Physics Inseparable): The structure of the periodic table --- arguably the most important organizing principle in all of chemistry --- is a direct consequence of the mathematics of the angular momentum eigenvalue problem in three dimensions. The integers $n, l, m$ and $m_s$, combined with the Pauli exclusion principle (Chapter 15), determine the architecture of matter. The fact that $l$ ranges from $0$ to $n - 1$, that $m$ ranges from $-l$ to $+l$, and that $m_s = \pm 1/2$ --- these mathematical constraints, emerging from the requirement that wavefunctions be single-valued, finite, and normalizable --- are the ultimate explanation for why matter has the structure it does. This is perhaps the most stunning example of the "unreasonable effectiveness of mathematics in the natural sciences" that Wigner famously described.

📊 By the Numbers: The hydrogen atom's energy levels explain spectral lines to about 1 part in $10^5$. Adding fine structure (relativistic corrections + spin-orbit coupling, Chapter 18) improves this to about 1 part in $10^7$. Adding quantum electrodynamic corrections (the Lamb shift, discovered in 1947) pushes agreement to 1 part in $10^{12}$. Hydrogen is the most precisely tested system in all of physics, and every improvement in precision has either confirmed the theory or pointed to new physics.

🔄 Check Your Understanding (Spaced Review --- Ch 4): The harmonic oscillator energy levels $E_n = (n + 1/2)\hbar\omega$ have equal spacing. The hydrogen energy levels $E_n = -13.6/n^2$ eV have unequal spacing that converges as $n \to \infty$. What are the physical consequences of this difference for the emission spectra? Why does the harmonic oscillator spectrum have only one frequency while hydrogen has infinitely many?


5.9 Summary and Project Checkpoint

What We Have Accomplished

In this chapter, we made the leap from one-dimensional quantum mechanics to the full three-dimensional treatment of the hydrogen atom. Let us retrace our steps:

  1. 3D Schrodinger equation in spherical coordinates, exploiting the spherical symmetry of the Coulomb potential to separate the equation into radial and angular parts.
  2. Separation of variables splitting the PDE into three coupled ODEs (radial, polar, azimuthal), each producing one quantum number.
  3. Spherical harmonics $Y_l^m(\theta,\phi)$ as the universal angular solutions for central forces, with quantum numbers $l$ (orbital angular momentum magnitude) and $m$ ($z$-component). These are the eigenfunctions of $\hat{L}^2$ and $\hat{L}_z$.
  4. Radial equation for the Coulomb potential, solved by associated Laguerre polynomials, yielding the energy eigenvalues $E_n = -13.6\;\text{eV}/n^2$ through the requirement that the series solution terminates.
  5. Spectral series (Lyman, Balmer, Paschen, ...) explained by transitions between energy levels, with selection rules $\Delta l = \pm 1$, $\Delta m = 0, \pm 1$ dictated by conservation of angular momentum.
  6. Quantum number structure: $(n, l, m)$ with $l = 0, 1, \ldots, n-1$ and $m = -l, \ldots, +l$, giving $n^2$-fold degeneracy (or $2n^2$ including spin). The $l$-degeneracy is "accidental," arising from the hidden $SO(4)$ symmetry of the Coulomb potential.
  7. Orbital visualization: Radial probability distributions $P(r) = r^2|R|^2$, node structure (radial + angular = $n - 1$), and orbital shapes from spherical harmonics.
  8. Periodic table foundations: Shell and subshell structure from hydrogen quantum numbers, with the Aufbau principle modified by penetration and shielding in multi-electron atoms.

Key Equations

Equation Significance
$E_n = -13.6\;\text{eV}/n^2$ Hydrogen energy levels --- the fundamental result
$a_0 = 0.529\;\text{\AA}$ Bohr radius --- the natural length scale
$\hat{L}^2 Y_l^m = \hbar^2 l(l+1) Y_l^m$ Angular momentum eigenvalues
$\hat{L}_z Y_l^m = m\hbar Y_l^m$ $z$-component of angular momentum
$P(r) = r^2|R_{nl}(r)|^2$ Radial probability density (includes $r^2$ factor!)
$\langle r \rangle_{nl} = \frac{a_0}{2}[3n^2 - l(l+1)]$ Expectation value of radius
$1/\lambda = R_\infty(1/n_f^2 - 1/n_i^2)$ Spectral line wavelengths (Rydberg formula)
Degeneracy $= n^2$ (or $2n^2$ with spin) The "$l$-degeneracy" unique to the Coulomb potential
$\Delta l = \pm 1$, $\Delta m = 0, \pm 1$ Electric dipole selection rules

Quantum Toolkit v0.5 --- Project Checkpoint

In this chapter's code, you will build:

  • hydrogen_Rnl(n, l, r) --- Compute normalized radial wavefunctions using associated Laguerre polynomials from scipy.special
  • hydrogen_energy(n) --- Return energy eigenvalue in eV (trivial but useful for automation)
  • radial_probability(n, l, r) --- Compute $P_{nl}(r) = r^2|R_{nl}(r)|^2$
  • plot_orbital_3d(n, l, m) --- Visualize 3D probability density via 2D cross-sections in the $xz$-plane
  • HydrogenAtom class --- Encapsulate all of the above with expectation value calculations, spectral line tables, and selection rule checking

These functions extend your growing toolkit alongside the 1D solver from Chapter 3 and the QHO functions from Chapter 4. In the code directory, example-01-hydrogen.py demonstrates the radial wavefunctions and spectral calculations, example-02-orbitals.py creates the orbital visualizations, and project-checkpoint.py packages everything into a reusable class.

Looking Ahead

The hydrogen atom is solved, but our journey with it has only begun. In Chapter 6, we will formalize the operator methods that underlie everything we have done, introducing the measurement postulate and the uncertainty principle in their full generality. In Chapters 12--14, we will develop the full algebraic theory of angular momentum using ladder operators, revealing why $l$ and $m$ must be integers (or half-integers, in the case of spin). In Chapter 16, we will see how hydrogen serves as the starting point for understanding all atoms in the periodic table. And in Chapters 17--18, we will add corrections --- the relativistic kinetic energy, spin-orbit coupling, the Darwin term, the Lamb shift --- that turn the hydrogen atom from a textbook exercise into the most precisely tested prediction in all of physics.

The hydrogen atom is the gift that keeps on giving. Every new tool we develop will be tested against it, and every time, it will reveal new layers of structure. The system you have just solved is a companion for the rest of this book. Welcome to the atom that made quantum mechanics.

🔗 Connection (Preview --- Ch 6): In Chapter 6, we will recast the hydrogen atom solution in the language of operators and commutators. The angular momentum operators $\hat{L}^2$ and $\hat{L}_z$ will be understood as part of a Lie algebra --- a mathematical structure that governs the quantum numbers $l$ and $m$ and explains why they must be integers. The formalism of Chapter 6 turns the specific calculations of this chapter into consequences of deep symmetry principles.


Key Vocabulary

Term Definition
Central force A force directed radially, $V = V(r)$, with spherical symmetry
Spherical coordinates $(r, \theta, \phi)$: radial distance, polar angle, azimuthal angle
Laplacian $\nabla^2$: the divergence of the gradient; kinetic energy operator in QM
Angular Laplacian $\hat{\Lambda}^2$: the angular part of $\nabla^2$; related to $\hat{L}^2$ by $\hat{L}^2 = -\hbar^2\hat{\Lambda}^2$
Separation of variables Technique decomposing a PDE into coupled ODEs by assuming product solutions
Spherical harmonics $Y_l^m(\theta,\phi)$: eigenfunctions of $\hat{L}^2$ and $\hat{L}_z$ on the sphere
Associated Legendre functions $P_l^m(\cos\theta)$: polar angle part of spherical harmonics
Legendre polynomials $P_l(\cos\theta)$: special case $m = 0$ of associated Legendre functions
Principal quantum number $n = 1, 2, 3, \ldots$: determines hydrogen energy level
Azimuthal quantum number $l = 0, 1, \ldots, n-1$: determines orbital angular momentum magnitude
Magnetic quantum number $m = -l, \ldots, +l$: determines $z$-component of orbital angular momentum
Spin quantum number $m_s = \pm 1/2$: intrinsic angular momentum projection
Bohr radius $a_0 = 0.529\;\text{\AA}$: characteristic size of hydrogen ground state
Rydberg energy $13.6\;\text{eV}$: ionization energy of hydrogen from ground state
Rydberg constant $R_\infty = 1.097 \times 10^7\;\text{m}^{-1}$: spectroscopic constant
Effective potential $V_{\text{eff}}(r) = V(r) + \hbar^2 l(l+1)/(2mr^2)$: includes centrifugal barrier
Centrifugal barrier $\hbar^2 l(l+1)/(2mr^2)$: angular momentum contribution to radial equation
Associated Laguerre polynomials $L_{n-l-1}^{2l+1}$: radial part of hydrogen wavefunctions
Degeneracy Number of distinct quantum states sharing the same energy
Essential degeneracy Degeneracy guaranteed by the obvious (rotational) symmetry of the Hamiltonian
Accidental degeneracy Degeneracy beyond what obvious symmetry requires; signals a hidden symmetry
Selection rules Constraints on allowed transitions: $\Delta l = \pm 1$, $\Delta m = 0, \pm 1$
Spectral series Groups of transitions sharing the same $n_f$ (Lyman, Balmer, Paschen, ...)
Shell All states with the same principal quantum number $n$
Subshell All states with the same $n$ and $l$
Radial probability density $P(r) = r^2|R_{nl}|^2$: probability per unit radius
Node Location where the wavefunction vanishes; radial nodes + angular nodes = $n - 1$