Scattering Theory (an introduction)
Almost everything we know about the subatomic world we learned by throwing particles at
things and watching how they bounce. Rutherford fired alpha particles at a sheet of gold
foil and, from the surprising few that ricocheted straight back, deduced that the atom hides a tiny
dense nucleus. Neutron beams diffract off crystals to reveal where the atoms sit; electrons scattered
off protons showed the proton is not a point but a fuzzy cloud of quarks; and every collider from
SLAC to the LHC is, at heart, the same experiment scaled up — smash things together, catch the
debris, and read the target's structure off the pattern of where the pieces fly.
Scattering theory is the language that turns "how often does it bounce, and which
way?" into "what does the target — the force V(r) — actually look like?"
It is the bridge between a number you can measure in a detector and the potential you cannot see. This
page builds that bridge in four planks: the scattering amplitude that carries all the
physics, the cross-section that a detector actually counts, the Born
approximation that ties the amplitude to the shape of the force, and partial waves
with their phase shifts, the natural language when the force is central.
The setup: a plane wave meets a potential
Idealise a beam as a plane
wave of definite momentum p = \hbar k travelling along the
z-axis, e^{ikz}. It flies in from far away, hits
a localised potential V(r) (one that dies off outside some
small region — a nucleus, an atom, a short-range force), and scatters. Far downstream, the
wavefunction settles into a clean and universal shape: the original plane wave, still ploughing ahead,
plus an outgoing spherical ripple spreading from the target.
\psi(\mathbf{r}) \;\xrightarrow{\;r \to \infty\;}\;
\underbrace{e^{ikz}}_{\text{incoming beam}}
\;+\; \underbrace{f(\theta)\,\frac{e^{ikr}}{r}}_{\text{scattered ripple}}.
The e^{ikr}/r is just an expanding sphere of probability — its
1/r falloff is exactly what keeps the total probability crossing each shell
constant (surface area grows as r^2, so |{\cdot}|^2
must fall as 1/r^2). Everything specific to this target — how much
scatters, and in which directions — is bundled into the single function out front,
f(\theta), the scattering amplitude. It has units of
length, it generally depends on the scattering angle \theta
(and energy), and it is the one object the whole theory is built to compute. Know
f(\theta) and you know the experiment.
Cross-sections: what a detector actually counts
A detector does not see amplitudes; it sees counts. Park a detector subtending a small solid
angle d\Omega in the direction (\theta,\phi) and
ask: what fraction of the incoming flux lands in it? That ratio, per unit solid angle, is the
differential cross-section, and it is simply the squared amplitude:
\frac{d\sigma}{d\Omega} = |f(\theta)|^2 .
The name is geometric. A cross-section has units of area — since
f is a length, |f|^2 is an area — and it behaves
like the effective "size" of the target as the beam sees it. A big
d\sigma/d\Omega in some direction means the target presents a large
effective face there, deflecting many particles that way. Integrate over all directions and you get
the total cross-section, the whole effective target area:
\sigma = \int |f(\theta)|^2\, d\Omega
= \int_0^{2\pi}\!\!\int_0^{\pi} |f(\theta)|^2 \sin\theta\, d\theta\, d\phi .
Nuclear cross-sections are so small that they are measured in barns,
1\,\text{barn} = 10^{-28}\,\text{m}^2 — roughly the geometric cross-section
of a heavy nucleus. (The name is a physicists' joke: to a beam a uranium nucleus is, by subatomic
standards, "as big as a barn.") A large cross-section means a fat, easy-to-hit target; a small one
means the beam mostly sails through untouched.
The classic slip. Because d\sigma/d\Omega = |f(\theta)|^2
looks exactly like the Born-rule "amplitude squared = probability" you have met everywhere else in
quantum mechanics, it is tempting to read the cross-section as a probability, a pure number between
0 and 1. It is not. f(\theta) has units of length, so
|f|^2 is an area (m², or barns), and the total
\sigma can be any positive size at all.
The way to keep it straight: a cross-section answers "how big a target does the interaction
present?", not "how likely is one event?". You only get an actual count rate by multiplying the
cross-section by the beam's luminosity (particles per area per second). Amplitude squared,
yes — but the amplitude here is a length, so its square is an area.
The Born approximation: the amplitude is the Fourier transform of the force
So how do we get f(\theta) from a given potential? Exactly is hard; but if
the potential is weak (or the energy high), first-order perturbation theory gives a
beautifully clean answer — the Born approximation. Treat the scattered wave as small
and keep only the leading term, and the amplitude becomes an integral over the potential:
-
For a weak, localised potential the scattering amplitude is (proportional to) the
Fourier transform of the potential,
f(\theta) \;\approx\; -\frac{m}{2\pi\hbar^2}
\int e^{\,i\mathbf{q}\cdot\mathbf{r}}\; V(\mathbf{r})\; d^3r ,
evaluated at the momentum transfer
\mathbf{q} = \mathbf{k} - \mathbf{k}'.
-
For elastic scattering the beam only changes direction, not speed, so
|\mathbf{k}| = |\mathbf{k}'| = k and the transfer magnitude is fixed by
the angle alone:
q = |\mathbf{q}| = 2k\sin\!\frac{\theta}{2}.
Forward scattering (\theta \to 0) probes small
q (long wavelengths, the gross shape); back-scattering
(\theta \to \pi) probes the largest q (fine
detail).
Read that slowly, because it is the punchline of the whole subject: measuring the angular
distribution of scattered particles measures the Fourier transform of the force. Different
angles \theta sample different spatial frequencies q
of V(\mathbf{r}); sweep the detector around and you build up the full
transform, then invert it to reconstruct the potential you could never touch. This is precisely how
form factors and the internal structure of nuclei and nucleons were mapped — the
charge distribution of the proton was read straight off the way electrons scattered from it. Scattering
is a microscope whose lens is a Fourier transform.
Because it is only the first term of a series. The Born approximation is first-order
perturbation theory: it assumes the scattered wave is a small correction to the incoming plane wave,
so it is trustworthy only for a weak potential or high energy,
where the particle barely notices the target on its way through. Turn the potential up — a deep
well, a strong resonance, a slow particle that lingers — and higher-order terms (the particle
scattering twice, three times, …) matter, and the neat single Fourier transform is no longer enough.
That is exactly when the partial-wave picture below takes over: it makes no
weak-potential assumption and is the tool of choice at low energy and strong coupling. The two
approaches are complementary — Born for weak-and-fast, partial waves for strong-and-slow.
Partial waves and phase shifts
When the potential is central — V(r) depends only on the
distance r, not the direction — angular momentum is conserved, and a
different decomposition is natural. Break the incoming plane wave into components of definite orbital
angular momentum \ell = 0, 1, 2, \dots (the familiar
s, p, d, … waves). A central force cannot change
\ell, so each partial wave scatters completely independently — and, being
elastic, all a partial wave can do is get shifted in phase relative to the free wave
it would have been. That shift is the phase shift
\delta_\ell, and it holds all the physics of the \ell-th
channel.
-
The full amplitude is a sum over channels, each weighted by
2\ell+1 (the degeneracy of that angular momentum) and a Legendre
polynomial P_\ell(\cos\theta):
f(\theta) = \frac{1}{k}\sum_{\ell=0}^{\infty}
(2\ell+1)\, e^{i\delta_\ell}\sin\delta_\ell\; P_\ell(\cos\theta).
-
The total cross-section collapses to a clean sum of squared sines (the cross terms integrate to
zero, since the P_\ell are orthogonal):
\sigma = \frac{4\pi}{k^2}\sum_{\ell=0}^{\infty}
(2\ell+1)\,\sin^2\delta_\ell .
The phase shift reads like a story. A positive \delta_\ell
means the potential pulled the wave inward — the mark of an attractive
force; a negative \delta_\ell means the wave was pushed
out, a repulsive force. When a phase shift races rapidly up through
\pi/2, its \sin^2\delta_\ell hits its maximum
value of 1 and that channel scatters as hard as quantum mechanics allows —
a resonance, the fingerprint of a temporary bound state (a short-lived particle
forming and decaying). And crucially, at low energy the higher partial waves are
frozen out (a slow particle carries little angular momentum, so it cannot reach past the target's edge
for \ell > 0): only \ell = 0, the
s-wave, survives, and scattering becomes beautifully simple and isotropic.
Watch the pattern form: s-wave and p-wave interference
You do not need the full infinite sum to feel how partial waves shape the angular pattern. Keep just
the first two — the isotropic s-wave (a constant, since
P_0 = 1) and the p-wave (P_1(\cos\theta) = \cos\theta)
— so the amplitude is f(\theta) \approx f_0 + f_1\cos\theta and the
differential cross-section is
\frac{d\sigma}{d\Omega} = \big|\,f_0 + f_1\cos\theta\,\big|^2 .
Drag the two weights below. With only the s-wave (f_1 = 0) the pattern is a
flat circle — scattering is equally likely in every direction, the hallmark of low-energy s-wave
physics. Turn up the p-wave and the constant and the \cos\theta begin to
interfere: they add in the forward direction (\theta = 0,
\cos\theta = 1) and fight in the backward direction
(\theta = 180^\circ, \cos\theta = -1), tilting
the whole distribution forward. Flip the sign of f_1 and the peak swings to
the back. The angular pattern is a direct readout of which partial waves are in play and how they
interfere.
This forward–backward asymmetry is not a curiosity; it is data. An experimenter who measures the
angular distribution and fits it to a sum of Legendre polynomials reads off the individual phase
shifts \delta_\ell — and from those, the shape and strength of the force.
Worked examples
Example 1 — isotropic (s-wave) total cross-section. At low energy only
\ell = 0 survives, so f(\theta) = f_0 is a constant
and d\sigma/d\Omega = |f_0|^2 is the same in every direction. The total
cross-section is then that constant times the whole solid angle
\int d\Omega = 4\pi:
\sigma = \int |f_0|^2\, d\Omega = 4\pi\,|f_0|^2 .
If, say, |f_0| = 2\,\text{fm}, then
\sigma = 4\pi(2)^2 = 16\pi \approx 50\,\text{fm}^2 = 0.5\,\text{barn}. An
isotropic amplitude of a few femtometres already gives a cross-section of order a barn.
Example 2 — momentum transfer at a given angle. A beam with wavenumber
k = 3\,\text{fm}^{-1} scatters through \theta = 60^\circ.
The momentum transfer is
q = 2k\sin\!\frac{\theta}{2} = 2(3)\sin 30^\circ = 6 \times \tfrac12
= 3\,\text{fm}^{-1}.
At exact back-scattering, \theta = 180^\circ,
\sin(\theta/2) = 1, the transfer is maximal:
q = 2k. That is why the sharpest structural detail lives in the
back-scattered particles — they carry the largest q, the highest spatial
frequency of the potential.
Example 3 — an s-wave cross-section from its phase shift. With only the s-wave
active, the total cross-section is
\sigma_0 = \dfrac{4\pi}{k^2}\sin^2\delta_0. Take
k = 1\,\text{fm}^{-1} and a phase shift racing through resonance,
\delta_0 = 90^\circ, so \sin^2\delta_0 = 1:
\sigma_0 = \frac{4\pi}{(1)^2}\times 1 = 4\pi \approx 12.6\,\text{fm}^2 .
This is the unitarity limit — with \sin^2\delta_0 = 1 the
s-wave scatters as strongly as it possibly can. A phase shift passing through
90^\circ is the very signature of a resonance.
A gorgeous piece of bookkeeping called the optical theorem:
\sigma_{\text{total}} = \frac{4\pi}{k}\,\text{Im}\,f(0).
The total amount of scattering in all directions is fixed by the
imaginary part of the forward amplitude alone. It sounds like magic, but the logic is
pure conservation: every particle knocked out of the forward beam must go somewhere, so the beam is
slightly dimmed straight ahead by exactly the amount scattered everywhere else — and that shadowing
shows up as the imaginary (absorptive) part of f(0). Probability in equals
probability out; the optical theorem is that ledger, written in one line.