Chapter 19 Key Takeaways: The Variational Principle
Core Message
The variational principle provides rigorous upper bounds on ground state energies without requiring you to solve the Schrödinger equation. By encoding physical intuition into a trial wavefunction and minimizing the energy expectation value over adjustable parameters, you can achieve remarkable accuracy for systems that are analytically intractable. The variational method is the foundation of all modern computational quantum mechanics.
Key Concepts
1. The Variational Theorem
For any normalized trial state $|\psi\rangle$ in the Hilbert space of a quantum system with Hamiltonian $\hat{H}$ and ground state energy $E_0$:
$$\langle \psi | \hat{H} | \psi \rangle \geq E_0$$
Equality holds if and only if $|\psi\rangle$ is the exact ground state. The proof follows from expanding $|\psi\rangle$ in the energy eigenbasis and using $E_n \geq E_0$ for all $n$. This means every trial function gives a legitimate upper bound — you can never accidentally undershoot the true ground state energy.
2. Trial Wavefunctions and Physical Intuition
The quality of a variational calculation depends entirely on the choice of trial function. Good trial functions: (a) satisfy the correct boundary conditions, (b) capture the qualitative shape of the ground state, (c) incorporate known physics (cusp conditions, screening, symmetry), and (d) contain enough adjustable parameters to accommodate the unknown features. More parameters always improve (or maintain) the optimized energy.
3. Screening and the Helium Atom
The helium ground state cannot be solved exactly due to electron-electron repulsion. A one-parameter trial function $\psi \propto e^{-Z'(r_1+r_2)}$ with effective nuclear charge $Z'$ gives $Z'_{\text{opt}} = 27/16 \approx 1.69$ and energy $E = -2.848$ hartree — beating first-order perturbation theory ($-2.750$ hartree) and achieving 1.9% accuracy vs. experiment ($-2.904$ hartree).
4. Chemical Bonding in H₂⁺
The LCAO trial function $\psi = c_A\phi_A + c_B\phi_B$ for the hydrogen molecule ion reveals chemical bonding as a quantum interference effect. The exchange integral $K$ — with no classical analogue — creates the energy splitting between bonding (attractive) and antibonding (repulsive) molecular orbitals.
5. The Ritz Method (Linear Variational Method)
When the trial function is a linear combination of fixed basis functions, $|\psi\rangle = \sum c_i |\phi_i\rangle$, the variational optimization reduces to the generalized eigenvalue problem $\mathbf{H}\mathbf{c} = E\mathbf{S}\mathbf{c}$. This gives upper bounds on all eigenvalues simultaneously (Hylleraas-Undheim-MacDonald theorem) and is the foundation of computational quantum chemistry.
6. Variational Monte Carlo
For many-electron systems, energy integrals are high-dimensional and cannot be evaluated by standard quadrature. VMC uses Metropolis sampling from $|\psi|^2$ to estimate the average local energy $E_L = \hat{H}\psi/\psi$. The statistical error scales as $1/\sqrt{M}$ regardless of dimensionality. The zero-variance principle ensures that the statistical noise vanishes as the trial function approaches an eigenstate.
Key Equations
| Equation | Name | Meaning |
|---|---|---|
| $\langle \psi \| \hat{H} \| \psi \rangle \geq E_0$ | Variational theorem | Energy expectation value of any state bounds the ground state from above |
| $E[\psi] = \frac{\langle \psi \| \hat{H} \| \psi \rangle}{\langle \psi \| \psi \rangle}$ | Energy functional (Rayleigh quotient) | Maps trial states to energy values |
| $Z'_{\text{opt}} = Z - 5/16$ | Helium screening parameter | Optimal effective nuclear charge for simple helium trial function |
| $E_{\text{He}} = -(27/16)^2/2 \times 2 = -729/256$ hartree | Helium variational energy | $\approx -2.848$ hartree (1.9% error) |
| $\det(\mathbf{H} - E\mathbf{S}) = 0$ | Secular determinant | Condition for nontrivial solutions in the Ritz method |
| $E_L(\mathbf{r}) = \hat{H}\psi(\mathbf{r})/\psi(\mathbf{r})$ | Local energy | Key quantity in variational Monte Carlo |
| $E \approx \frac{1}{M}\sum_{i=1}^{M} E_L(\mathbf{r}_i)$ | VMC energy estimator | Monte Carlo estimate of energy from Metropolis samples |
Key Techniques
| Technique | When to Use | Key Feature |
|---|---|---|
| Nonlinear variational method | Strong physical intuition for ground state shape | Parameters appear nonlinearly (e.g., in exponents); requires numerical optimization |
| Ritz (linear variational) method | Good basis set available | Reduces to matrix eigenvalue problem; bounds all eigenvalues; systematically improvable |
| Variational Monte Carlo | High-dimensional systems ($N > 3$ electrons) | Statistical estimation via Metropolis sampling; $1/\sqrt{M}$ convergence independent of dimension |
Critical Distinctions
Variational Method vs. Perturbation Theory
| Aspect | Perturbation Theory | Variational Method |
|---|---|---|
| Starting point | Solvable nearby problem | Physical guess for ground state |
| Gives | Order-by-order corrections | Rigorous upper bound |
| Convergence | Not guaranteed (may be asymptotic) | Guaranteed (from above) |
| Excited states | Natural | Requires extra constraints |
| Best when | Perturbation is small | Ground state shape is known |
What the Variational Method Does and Does NOT Tell You
It tells you: - An upper bound on $E_0$ (guaranteed by the theorem) - An approximate ground state wavefunction - How the energy depends on physical parameters (screening, bond length, etc.)
It does NOT tell you: - How close your bound is to the true $E_0$ - A lower bound (you need additional methods for that) - Guaranteed accuracy for the wavefunction (energy is second-order insensitive to wavefunction errors)
Common Pitfalls
-
Confusing upper bound with underestimate. The variational method overestimates $E_0$, not underestimates. Students frequently get the direction of the inequality backward.
-
Assuming good energy means good wavefunction. The energy is a quadratic functional of $|\psi\rangle$, so $O(\epsilon)$ wavefunction errors produce $O(\epsilon^2)$ energy errors. A 0.1% energy error can mask a 3% wavefunction error.
-
Using trial functions that violate boundary conditions. While the theorem still holds, violating boundary conditions wastes variational freedom and gives unnecessarily poor bounds.
-
Forgetting that $\hat{H}$ in the expectation value is the TRUE Hamiltonian. In the helium calculation, the variational parameter $Z'$ appears in the trial function, but the Hamiltonian always has the physical nuclear charge $Z = 2$.
-
Neglecting the cusp conditions. Trial functions that do not satisfy the Kato cusp condition at nuclear or electron-electron coalescence points converge slowly.
Connections to Other Chapters
| Chapter | Connection |
|---|---|
| Ch 5 (Hydrogen) | Hydrogen's exact solution provides the building blocks for trial functions (1s, 2s, 2p orbitals) |
| Ch 8-9 (Dirac notation, Eigenvalues) | The variational theorem and Ritz method are expressed in Dirac notation; the Ritz method reduces to matrix eigenvalue problems |
| Ch 15-16 (Identical particles, Multi-electron atoms) | Pauli principle constrains trial functions; Slater determinants are the basis of Hartree-Fock |
| Ch 17 (Perturbation theory) | Complementary approximation method; comparison on helium shows variational superiority for large perturbations |
| Ch 20 (WKB) | Third approximation method; semiclassical approach complements variational for different regimes |
| Ch 26 (Condensed matter) | Band structure calculations use the Ritz method with Bloch functions |
| Ch 38 (Capstone: Hydrogen) | Full hydrogen simulation integrates variational and other methods |
One-Sentence Summary
The variational principle turns unsolvable quantum problems into optimization problems — any guess for the wavefunction gives a guaranteed upper bound on the ground state energy, and the better your physical intuition, the tighter the bound.