The Maxwell–Boltzmann Distribution
Fill a room with air and every cubic centimetre holds some
2.5\times 10^{19} molecules, each rattling about, colliding billions of
times a second. It is tempting to picture them all skittering along at one shared "thermal speed" set
by the temperature. They do not. At any instant some molecules are almost stationary, a great many
cruise near a few hundred metres per second, and a rare few are screaming along at more than twice
that. There is a whole spread of speeds — and the exact shape of that spread is one
of the most consequential curves in physics.
That shape controls an astonishing amount of the everyday world. It is why a puddle
evaporates at room temperature even though "boiling" is far away: only the fastest
molecules in the tail have enough energy to break free, and picking them off cools what remains. It is
why chemical reaction rates rocket up when you heat things — reactions need molecules
above an activation energy, and warming the gas fattens the high-speed tail. And it is why the Earth
has held on to its \text{N}_2 and \text{O}_2 for
billions of years while its hydrogen and helium leaked away into space: light
molecules move faster, more of them sit in the tail above escape velocity, and over aeons the
atmosphere sieves itself. One curve, and it reaches from a drying pavement to the composition of
planets.
This page is about that one curve — the Maxwell–Boltzmann speed distribution. We
will build it from the Boltzmann
distribution on energy, pin down its three characteristic speeds, prove it is properly
normalised using the Gamma
function, and watch it slide and broaden as you turn up the heat.
From Boltzmann to speeds
The starting point is the Boltzmann factor: in thermal equilibrium at temperature
T, the probability of finding a molecule in a state of energy
E is weighted by e^{-E/kT}, where
k = 1.381\times 10^{-23}\ \text{J/K} is Boltzmann's constant. For a free
molecule the only energy is kinetic, E = \tfrac{1}{2}mv^2 = \tfrac{1}{2}m\,(v_x^2 + v_y^2 + v_z^2),
so each velocity component carries its own Gaussian weight:
P(v_x) \propto e^{-mv_x^2/2kT}, \qquad P(v_y) \propto e^{-mv_y^2/2kT}, \qquad P(v_z) \propto e^{-mv_z^2/2kT}.
Each of these is a bell curve centred on zero: a molecule is just as likely to drift
left as right, so the most probable value of any single component is v_x = 0.
Multiply the three together (the components are independent) and the probability of a particular
velocity vector \mathbf{v} is
P(\mathbf{v}) \propto e^{-m(v_x^2+v_y^2+v_z^2)/2kT} = e^{-mv^2/2kT},
which depends only on the magnitude v = |\mathbf{v}|. So far this
is the distribution of velocities. But we usually want the distribution of speeds —
how likely a molecule is to have speed near some value v, regardless of
direction. That is a different question, and answering it introduces the factor that gives the curve
its whole character.
Picture "velocity space", the abstract space whose axes are
v_x, v_y, v_z. Every molecule is a point in it, and the exponential above
gives the density of points. All the molecules with speed near v lie in a
thin spherical shell of radius v and thickness
dv. The volume of that shell is its surface area times its thickness,
4\pi v^2\,dv. There are simply more ways to have a large speed than
a small one — more directions to point — and that count grows as
4\pi v^2. Multiplying the per-velocity probability by this shell volume
gives the speed distribution:
-
The fraction of molecules with speed between v and
v + dv is f(v)\,dv, with
f(v) = 4\pi \left(\frac{m}{2\pi kT}\right)^{3/2} v^2\, e^{-mv^2/2kT}.
-
It is a product of two competing factors. The
v^2 (the density of states, from the spherical shell) rises
from zero — there are ever more directions at higher speed. The exponential
e^{-mv^2/2kT} falls — high kinetic energies are Boltzmann-
suppressed. Their product is zero at v = 0, climbs to a
peak, and decays away in a long tail.
This is the key structural fact, and it is worth saying plainly: f(0) = 0.
No molecule sits exactly still, not because rest is forbidden, but because a single point (zero speed)
has vanishing "shell volume" — there is no direction to a zero vector. The competition between the
rising v^2 and the falling exponential is what makes a peak at all.
Watch it move
The curve below is f(v) for a real gas. Slide the
temperature up and the peak marches to the right and the whole hump flattens and
spreads — because the area underneath is always exactly 1 (every molecule
has some speed), so widening the curve must lower it. Switch the gas between
helium, nitrogen and oxygen at fixed temperature and watch the opposite: heavier molecules sit slower
and more tightly bunched. Notice too that the curve always starts at the origin — that is the
v^2 factor forcing f(0)=0 — and that even when
the peak is modest there is a long, thin high-speed tail stretching far to the right.
Everything downstream — evaporation, reaction rates, atmospheric escape — lives in how this curve
moves. Warming the gas does not just shift the average; it disproportionately populates the tail,
because the tail's height depends exponentially on mv^2/2kT. A modest rise
in temperature can multiply the number of molecules above a high threshold speed many-fold, which is
exactly why chemistry speeds up so dramatically with heat.
Three speeds, three questions
A curve with a peak and a skewed tail does not have a single "the speed"; it has several natural ones,
and they answer subtly different questions. Three matter, and they always come in the same order.
1. The most probable speed v_p — the location of the peak,
the single commonest speed. Find it where df/dv = 0. Dropping constants,
we differentiate v^2 e^{-mv^2/2kT}:
\frac{d}{dv}\!\left(v^2 e^{-mv^2/2kT}\right) = \left(2v - \frac{mv^3}{kT}\right)e^{-mv^2/2kT} = 0 \;\Longrightarrow\; 2 = \frac{mv^2}{kT},
\boxed{\,v_p = \sqrt{\frac{2kT}{m}}\,}.
2. The mean speed \langle v\rangle — the ordinary
average, \int_0^\infty v\,f(v)\,dv. The skewed tail drags it above the
peak. As we compute below with the Gamma function,
\boxed{\,\langle v\rangle = \sqrt{\frac{8kT}{\pi m}} = \sqrt{\tfrac{8}{\pi}}\,\sqrt{\frac{kT}{m}}\,}.
3. The root-mean-square speed v_{\text{rms}} — the speed
of the "average kinetic energy" molecule, \sqrt{\langle v^2\rangle}. This
one you can get without any integral, straight from equipartition: each of the three
translational degrees of freedom carries \tfrac{1}{2}kT of energy, so
\tfrac{1}{2}m\langle v^2\rangle = \tfrac{3}{2}kT \;\Longrightarrow\; \langle v^2\rangle = \frac{3kT}{m} \;\Longrightarrow\; \boxed{\,v_{\text{rms}} = \sqrt{\frac{3kT}{m}}\,}.
All three are \sqrt{kT/m} times a pure number, so they keep a
fixed ratio no matter the gas or temperature:
v_p : \langle v\rangle : v_{\text{rms}} \;=\; \sqrt{2} : \sqrt{\tfrac{8}{\pi}} : \sqrt{3} \;\approx\; 1.414 : 1.596 : 1.732,
or, dividing through by the smallest,
v_p : \langle v\rangle : v_{\text{rms}} \;\approx\; 1 : 1.128 : 1.225.
The order v_p < \langle v\rangle < v_{\text{rms}} is permanent and worth
remembering: the peak is the most common speed, the mean is pulled up by the tail, and the
rms is highest of all because squaring weights the fast molecules most heavily. Two handy ratios fall
straight out:
v_{\text{rms}}/v_p = \sqrt{3/2} = \sqrt{1.5} \approx 1.225 and
\langle v\rangle / v_p = \sqrt{4/\pi} \approx 1.128.
Why the Gamma function: normalisation and moments
Every claim above — that the curve integrates to 1, that
\langle v\rangle = \sqrt{8kT/\pi m}, that
\langle v^2\rangle = 3kT/m — is an integral of the form
\int_0^\infty v^k e^{-av^2}\,dv, with
a = m/2kT. Substituting u = av^2 turns each into
the Gamma
function \Gamma(z) = \int_0^\infty t^{z-1}e^{-t}\,dt. That is
the precise reason the Gamma function is a prerequisite here — it is the machine that evaluates all
the moments of this distribution:
-
For a > 0 and integer or half-integer powers,
\int_0^\infty v^k\, e^{-a v^2}\,dv = \tfrac{1}{2}\, a^{-(k+1)/2}\, \Gamma\!\left(\tfrac{k+1}{2}\right).
-
The values we need are
\Gamma(\tfrac{1}{2}) = \sqrt{\pi},
\Gamma(\tfrac{3}{2}) = \tfrac{\sqrt{\pi}}{2},
\Gamma(2) = 1, and
\Gamma(\tfrac{5}{2}) = \tfrac{3\sqrt{\pi}}{4}.
Normalisation uses k = 2, so we need
\Gamma(3/2) = \sqrt{\pi}/2:
\int_0^\infty v^2 e^{-av^2}\,dv = \tfrac{1}{2}\,a^{-3/2}\,\Gamma(\tfrac{3}{2}) = \tfrac{1}{2}\,a^{-3/2}\cdot\frac{\sqrt{\pi}}{2} = \frac{\sqrt{\pi}}{4}\,a^{-3/2}.
Feed that into \int_0^\infty f(v)\,dv with the front constant
4\pi (a/\pi)^{3/2} (since m/2\pi kT = a/\pi):
\int_0^\infty f(v)\,dv = 4\pi\left(\frac{a}{\pi}\right)^{3/2}\cdot \frac{\sqrt{\pi}}{4}\,a^{-3/2} = \pi\,\pi^{-3/2}\,\sqrt{\pi} = \pi^{\,1 - 3/2 + 1/2} = \pi^0 = 1.\;\checkmark
The constant 4\pi(m/2\pi kT)^{3/2} out front is exactly what makes
the total come to one — it is not decoration, it is the normalisation demanded by
\Gamma(3/2). The mean speed uses
k = 3 and \Gamma(2) = 1:
\langle v\rangle = 4\pi\left(\frac{a}{\pi}\right)^{3/2}\!\int_0^\infty v^3 e^{-av^2}\,dv = 4\pi\left(\frac{a}{\pi}\right)^{3/2}\cdot \tfrac{1}{2}a^{-2}\,\Gamma(2) = \frac{2}{\sqrt{\pi a}} = \sqrt{\frac{8kT}{\pi m}},
recovering the mean speed quoted earlier. And the mean square uses
k = 4 and \Gamma(5/2) = 3\sqrt{\pi}/4:
\langle v^2\rangle = 4\pi\left(\frac{a}{\pi}\right)^{3/2}\!\int_0^\infty v^4 e^{-av^2}\,dv = 4\pi\left(\frac{a}{\pi}\right)^{3/2}\cdot \tfrac{1}{2}a^{-5/2}\cdot\frac{3\sqrt{\pi}}{4} = \frac{3}{2a} = \frac{3kT}{m},
so v_{\text{rms}} = \sqrt{3kT/m} — the same answer equipartition gave for
free, now confirmed by the integral. Three different questions, one family of Gamma integrals, and the
half-integer values \Gamma(3/2) and
\Gamma(5/2) doing the real work.
Worked example: nitrogen at room temperature
Take \text{N}_2 at T = 300\ \text{K}. One
molecule has mass
m = 28\times 1.66\times 10^{-27} = 4.65\times 10^{-26}\ \text{kg}. The
recurring quantity is
\frac{kT}{m} = \frac{(1.38\times 10^{-23})(300)}{4.65\times 10^{-26}} \approx 8.90\times 10^{4}\ \text{m}^2/\text{s}^2, \qquad \sqrt{\frac{kT}{m}} \approx 298\ \text{m/s}.
The three characteristic speeds are then just numerical multiples of that root:
- v_p = \sqrt{2}\,(298) = \sqrt{2kT/m} \approx 422\ \text{m/s};
- \langle v\rangle = \sqrt{8/\pi}\,(298) \approx 1.596\times 298 \approx 476\ \text{m/s};
- v_{\text{rms}} = \sqrt{3}\,(298) \approx 1.732\times 298 \approx 517\ \text{m/s}.
So a typical nitrogen molecule in this room is moving at roughly half a kilometre per
second — faster than a rifle bullet — and the three speeds sit in the promised
422 : 476 : 517 ordering, a ratio of about
1 : 1.13 : 1.22.
A ratio question. Suppose you heat this gas from
300\ \text{K} to 1200\ \text{K} — a factor of
four. Since every speed scales as \sqrt{T}, all of them grow by
\sqrt{4} = 2. The rms speed climbs from
517\ \text{m/s} to about 1034\ \text{m/s}. The
curve keeps its shape but stretches out along the speed axis and flattens, its area still exactly one.
This is the classic trap. There are two different curves and it is fatal to blur
them. The distribution of a single velocity component,
P(v_x) \propto e^{-mv_x^2/2kT}, is an ordinary Gaussian bell
peaked at zero — the commonest value of v_x really is
0, and it is symmetric (leftward is as likely as rightward). The
distribution of speed, f(v) \propto v^2 e^{-mv^2/2kT}, is
not that curve: the extra 4\pi v^2 shell factor kills it at
the origin, so f(0) = 0, and it is skewed with a one-sided tail (speed can't
be negative). A molecule most likely has each component near zero, yet almost never has
speed near zero, because the only way for the speed to be tiny is for all three components to
be tiny at once — vanishingly rare.
And do not confuse the three speeds. The peak of f(v) is
the most probable speed v_p, not the average. The mean
\langle v\rangle sits noticeably to the right of the peak because the tail
pulls it, and v_{\text{rms}} is further right still. Reading the peak as
"the average speed", or swapping v_{\text{rms}} for
\langle v\rangle, are the two most common slips — they differ by about
9\%, which matters.
Look at the high-speed tail. A molecule escapes a planet if its speed exceeds the escape speed
(about 11\ \text{km/s} for Earth). Almost no molecule reaches that in a
single leap — but the tail of f(v) is never exactly zero, and molecules at
the very top of the atmosphere that happen to be moving upward and fast enough simply leave, for good.
Over billions of years even a trickle drains a reservoir.
The rate depends ferociously on v_{\text{rms}}/v_{\text{esc}}, and since
v_{\text{rms}} = \sqrt{3kT/m}, lighter molecules are far more
exposed: hydrogen (m = 2) and helium
(m = 4) sit high in speed, so a much larger slice of their distribution
pokes above escape velocity than for nitrogen (m = 28). That is exactly
what we see — Earth has quietly leaked its hydrogen and helium to space while holding tightly onto its
heavy \text{N}_2, \text{O}_2 and
\text{CO}_2. The Moon (tiny escape speed) and Mars (small mass, warm
enough) lost almost everything; giant, cold Jupiter kept even its hydrogen. The composition of a
planet's atmosphere is, in large part, this one curve read against the local escape velocity.