The Variational Method

You can write the Schrödinger equation for the helium atom in a single line — and then you are stuck. Two electrons repelling each other while both orbit the nucleus is a three-body problem with no closed-form solution, and helium is the easy hard case: real molecules, solids, and everything quantum chemistry cares about are worse. Diagonalising the Hamiltonian exactly is simply off the table.

And yet chemists routinely quote ground-state energies of huge molecules to astonishing accuracy. How? By clever guessing. The variational method turns "guess the answer" into a rigorous, systematic technique: propose a trial wavefunction, compute the average energy it would have, and — this is the magic — know for certain that this number can never be smaller than the true ground-state energy. Every guess gives an honest upper bound, so a better guess is simply one that gives a lower number. It is the single most important approximation tool in quantum mechanics, and the engine underneath density-functional theory, Hartree–Fock, and most of computational physics.

The variational principle

Fix a system with Hamiltonian \hat H whose true (and unknown) energy levels are E_0 \le E_1 \le E_2 \le \dots, with E_0 the ground-state energy we are after. Take any normalisable state |\psi\rangle at all — a wild guess, it does not have to solve anything — and form its average energy, the Rayleigh quotient:

E[\psi] = \frac{\langle\psi|\hat H|\psi\rangle}{\langle\psi|\psi\rangle}.

Read it slowly, because it is doing a lot of work. It says the expectation of energy over literally any guess sits at or above E_0 — never below. You cannot cheat your way under the true ground energy no matter how badly you guess. So the whole game becomes: make E[\psi] as small as you can, and you are squeezing down onto E_0 from above.

Why it is true — a two-line proof

The result looks almost too strong to be free, but the proof is short and worth doing once by hand. The energy eigenstates \{|n\rangle\} of \hat H form a complete orthonormal basis (Postulate 2), so expand the trial state in them:

|\psi\rangle = \sum_n c_n\,|n\rangle, \qquad \hat H|n\rangle = E_n|n\rangle, \qquad \langle\psi|\psi\rangle = \sum_n |c_n|^2.

Sandwich the Hamiltonian. Because the |n\rangle are orthonormal, all the cross terms vanish and we get a clean weighted sum of the energy levels:

\langle\psi|\hat H|\psi\rangle = \sum_n |c_n|^2\,E_n.

Now the punchline. Every E_n \ge E_0 by definition of the ground state, and every weight |c_n|^2 \ge 0, so replacing each E_n by the smallest one only shrinks the sum:

\langle\psi|\hat H|\psi\rangle = \sum_n |c_n|^2 E_n \;\ge\; \sum_n |c_n|^2 E_0 = E_0 \sum_n |c_n|^2 = E_0\,\langle\psi|\psi\rangle.

Divide through by \langle\psi|\psi\rangle > 0 and out drops E[\psi] \ge E_0. The inequality is saturated only when all the weight sits on the ground level — that is, when the guess is the ground state. That is the entire principle: a weighted average of numbers can never be less than the smallest number in the list.

The method: guess, then minimise

A single guess gives a single bound; the trick is to guess a whole family at once. Choose a trial function that carries one or more adjustable variational parameters \alpha — a "knob" you can turn — write down the trial energy as a function of that knob,

E(\alpha) = \frac{\langle\psi_\alpha|\hat H|\psi_\alpha\rangle}{\langle\psi_\alpha|\psi_\alpha\rangle},

and then minimise over the parameter. Since every value of \alpha gives a legitimate upper bound, the smallest one is the best (tightest, lowest) bound the family can offer. So you solve

\frac{dE}{d\alpha} = 0

for the optimal \alpha^\star, and report E(\alpha^\star) as your estimate of E_0. Enrich the family with more parameters and the bound can only get better — this is exactly how industrial electronic-structure codes work, optimising thousands of parameters to press the estimate down toward the truth. The graph below is this minimisation made visible.

Watching the bound tighten

Here is the variational search for the harmonic oscillator's ground state, using the Gaussian family we work out below. In tidy units the trial energy takes the shape E(\alpha) = A\,\alpha + B/\alpha — a kinetic piece that grows with \alpha and a potential piece that shrinks with it. Their tug-of-war makes a valley. Drag the \alpha slider and watch the trial energy (the coloured dot on the curve) slide down toward the dashed floor at E_0 — it kisses the floor at the bottom of the valley and, no matter how hard you push, never dips below it. That dashed line is the true ground energy the principle guarantees you cannot beat.

The bottom of the valley sits where dE/d\alpha = A - B/\alpha^2 = 0, i.e. \alpha^\star = \sqrt{B/A}, and the minimum value there works out to the neat E(\alpha^\star) = 2\sqrt{AB}. Keep that little formula in your pocket — the quiz uses it, and it is the answer to a surprising number of one-parameter variational problems.

Worked example 1 — a Gaussian nails the oscillator

Take the one-dimensional harmonic oscillator, \hat H = -\dfrac{\hbar^2}{2m}\dfrac{d^2}{dx^2} + \tfrac12 m\omega^2 x^2, and guess a normalised Gaussian trial function with width parameter \alpha > 0:

\psi_\alpha(x) = \left(\frac{2\alpha}{\pi}\right)^{1/4} e^{-\alpha x^2}.

Computing the two expectation values (standard Gaussian integrals) gives a kinetic average that grows with \alpha and a potential average that falls with it:

\langle T\rangle = \frac{\hbar^2\alpha}{2m}, \qquad \langle V\rangle = \frac{m\omega^2}{8\alpha}, \qquad E(\alpha) = \frac{\hbar^2\alpha}{2m} + \frac{m\omega^2}{8\alpha}.

This is exactly the A\alpha + B/\alpha shape from the graph, with A = \hbar^2/2m and B = m\omega^2/8. Minimise:

\frac{dE}{d\alpha} = \frac{\hbar^2}{2m} - \frac{m\omega^2}{8\alpha^2} = 0 \quad\Longrightarrow\quad \alpha^\star = \frac{m\omega}{2\hbar}.

Substitute back and the two terms become equal at

E(\alpha^\star) = 2\sqrt{AB} = 2\sqrt{\frac{\hbar^2}{2m}\cdot\frac{m\omega^2}{8}} = \tfrac12\hbar\omega.

That is not just a good bound — it is the exact ground-state energy, and \alpha^\star reproduces the exact ground-state wavefunction. Why so perfect? Because the true ground state of the oscillator genuinely is a Gaussian, so it lives inside our trial family, and by the "equality iff" clause the method finds it dead on. When your family contains the answer, the variational method returns the answer.

Worked example 2 — a guess that misses (but only just)

The oscillator is a flattering case. To see the method in its honest role — an estimate, not an oracle — try a trial function that is the wrong shape. Put a particle in the infinite square well of width L (walls at x = 0 and x = L) and, instead of the true sinusoidal ground state, guess a simple tent — a triangle rising to a peak at the centre and back down to the walls:

\psi(x) = \begin{cases} x, & 0 \le x \le L/2,\\[2pt] L - x, & L/2 \le x \le L.\end{cases}

The tent has the right qualitative features (it vanishes at both walls, it is humped in the middle) but the wrong curvature — it has a sharp kink at the peak where the true ground state is smoothly rounded. Evaluating the Rayleigh quotient for this shape gives

E_{\text{tent}} = \frac{\langle\psi|\hat H|\psi\rangle}{\langle\psi|\psi\rangle} = \frac{10\,\hbar^2}{2mL^2} \approx 1.013\,E_0, \qquad E_0 = \frac{\pi^2\hbar^2}{2mL^2}.

The tent overshoots the true ground energy by a mere 1.3% — above it, exactly as the principle promises, but strikingly close for such a crude guess. That is the everyday experience of the method: even a rough trial function, as long as it has the right gross features, lands you within a few percent of the truth, always on the high side. Round off the kink (add a parameter that smooths the peak) and the bound tightens further toward E_0.

What it can and cannot promise

The power of the variational method is its guarantee, but that guarantee is narrow, and knowing its edges keeps you honest:

In principle you should: enlarge the family and the optimised bound can only fall or stay put, never rise, because the smaller family is a special case of the bigger one. Push this to its limit — let the trial function be an arbitrary combination of a complete basis — and minimising the Rayleigh quotient becomes exactly diagonalising the Hamiltonian in that basis. This is the Rayleigh–Ritz method, and it is how quantum-chemistry codes actually run: expand in a finite "basis set," and the variational principle turns the whole problem into a matrix eigenvalue computation whose lowest eigenvalue is the best bound the basis allows.

The catch is cost. Every extra parameter is another dimension to optimise, and the effort explodes long before the family is "complete." The entire craft of computational chemistry is choosing a trial family that is small enough to compute yet rich enough to hug the true ground state — squeezing the most bound out of the fewest knobs.

The classic trap. Because E(\alpha) \ge E_0 always, the trial energy is a score you are trying to minimise, not maximise. If you tweak your trial function and the energy goes up, you did not "add energy" or find a more excited state — you simply made a worse approximation to the ground state. Between two guesses, the one with the lower energy is unconditionally the better one. Lower is always closer to the truth.

A second, subtler trap: do not expect the minimum to be the exact ground energy. It equals E_0 only when the true ground state actually lives inside your trial family (as the Gaussian does for the oscillator). For the tent-in-a-box, minimising still leaves you above E_0 — the best your family can do is not the same as the right answer. dE/d\alpha = 0 finds the bottom of your valley, not the bottom of nature's.