The Lebesgue Outer Measure
The previous page,
measures and measure spaces,
set out the rules of the game: a measure is a countably additive set function on a σ-algebra, sending
\varnothing to 0. But it took the existence of such a
thing on \mathbb{R} — one that assigns [a,b] its length
b-a — entirely on faith. This page pays that debt. We are going to
build Lebesgue measure on the line from scratch, and the whole construction rests on a
single primitive fact you have known since childhood: the length of an open interval
(a,b) is b-a. Nothing else is assumed.
The plan has two movements. First we define a crude size — the outer measure
\mu^{*} — that is defined for every subset of
\mathbb{R} by wrapping it in intervals as tightly as possible. It is generous
and well-behaved, but it fails the one law a measure must obey: it is only
subadditive, not additive. Then Lebesgue's
successor Carathéodory supplies the master stroke — a razor that carves out exactly the sets on which
\mu^{*} is additive. On that family, and only there,
\mu^{*} becomes an honest, complete measure. That is Lebesgue measure.
The outer measure: wrap it in intervals
Take any set A \subseteq \mathbb{R} — literally any set, no assumptions. A
countable cover of A by open intervals is a sequence
(a_1,b_1), (a_2,b_2), \dots whose union contains
A. Every such cover has a total length
\sum_n (b_n - a_n), a number in [0,\infty]. The
Lebesgue outer measure of A is the cheapest cover you can
find — the infimum of that total length over all admissible covers:
\mu^{*}(A) \;=\; \inf\Bigl\{\, \sum_{n=1}^{\infty} (b_n - a_n) \;:\; A \subseteq \bigcup_{n=1}^{\infty} (a_n, b_n) \,\Bigr\}.
Two features are the whole point. First, this \inf always exists: the
set of achievable total lengths is a nonempty set of non-negative reals (nonempty because
\mathbb{R} = \bigcup_n (-n, n) is one cover, though a useless one for a bounded
set), and every nonempty set of reals bounded below has an infimum by the
completeness of \mathbb{R}.
There is no risk that "the best cover" fails to exist — completeness guarantees a greatest lower bound
even when no single cover attains it. Second, \mu^{*} is defined for
every subset of \mathbb{R}, no exceptions — the very thing a
genuine measure could not manage on all of \mathbb{R}. That universality is
bought at a price we will discover shortly.
What outer measure gets right
Straight from the definition, three properties fall out, and one crucial property does not.
Empty set and monotonicity
\mu^{*}(\varnothing) = 0: the single interval
(-\varepsilon, \varepsilon) covers \varnothing with
length 2\varepsilon, and \varepsilon is arbitrary, so
the infimum is 0. And \mu^{*} is
monotone: if A \subseteq B then every cover of
B is a cover of A, so the infimum for
A is taken over a larger set of covers and can only be smaller —
\mu^{*}(A) \le \mu^{*}(B).
Countable subadditivity — the \varepsilon/2^{n} trick
The workhorse. For any sequence of sets A_1, A_2, \dots (disjoint or
not),
\mu^{*}\!\Bigl(\bigcup_{n=1}^{\infty} A_n\Bigr) \;\le\; \sum_{n=1}^{\infty} \mu^{*}(A_n).
Proof. Fix \varepsilon > 0. By definition of the infimum, each
A_n has a cover by open intervals whose total length overshoots
\mu^{*}(A_n) by less than an amount we get to choose — and we choose the shrinking
budget \varepsilon / 2^{n}:
A_n \subseteq \bigcup_k (a_{n,k}, b_{n,k}), \qquad \sum_k (b_{n,k} - a_{n,k}) \;<\; \mu^{*}(A_n) + \frac{\varepsilon}{2^{n}}.
Now pool all these intervals into one countable family (a countable union of countable sets is
countable). It covers \bigcup_n A_n, so its total length bounds the outer
measure of the union:
\mu^{*}\!\Bigl(\bigcup_n A_n\Bigr) \;\le\; \sum_n \sum_k (b_{n,k} - a_{n,k}) \;<\; \sum_n \Bigl(\mu^{*}(A_n) + \frac{\varepsilon}{2^{n}}\Bigr) \;=\; \sum_n \mu^{*}(A_n) + \varepsilon.
The geometric budget \sum_n \varepsilon/2^{n} = \varepsilon is the entire
trick: infinitely many small errors sum to one small error. Since
\varepsilon was arbitrary, the inequality holds without it. This one lemma
underwrites everything below.
What outer measure gets WRONG
Here is the catch. \mu^{*} is not countably additive on all
subsets — for disjoint sets the inequality above can be strict. That is not a defect we will patch; it is a
theorem. The Vitali construction
exhibits disjoint sets whose outer measures sum to more than the outer measure of their union. So
\mu^{*} is an outer measure — monotone, subadditive, zero on
\varnothing — but not yet a measure. Turning subadditivity into additivity is the
job of Carathéodory's razor.
It really is length: \mu^{*}([a,b]) = b-a
A construction that failed to give an interval its own length would be worthless. It doesn't fail — but the
proof has a surprising asymmetry.
The easy direction, \mu^{*}([a,b]) \le b-a. The single open
interval (a-\varepsilon,\, b+\varepsilon) covers
[a,b] and has length b-a+2\varepsilon. So the infimum
is at most b-a+2\varepsilon for every
\varepsilon>0, hence at most b-a. One cover was
enough.
The hard direction, \mu^{*}([a,b]) \ge b-a, is where the real
content lives, and it needs compactness. Take any open cover
[a,b] \subseteq \bigcup_n (a_n,b_n). Because
[a,b] is closed and bounded, the
Heine–Borel theorem hands us a finite subcover.
A short induction on a finite list of intervals covering an interval shows their lengths must sum to at
least b-a (walk from a to
b, at each step the interval containing the current point reaches strictly past
it). So every cover — via its finite subcover — has total length
\ge b-a, and therefore so does the infimum. Compactness is what stops an
infinite cover from cheating with cancellation; without it the inequality is false.
And countable sets vanish: \mu^{*}(\mathbb{Q}) = 0
Enumerate a countable set as x_1, x_2, x_3, \dots (the rationals, say). Given
\varepsilon>0, wrap x_n in the open interval
\bigl(x_n - \tfrac{\varepsilon}{2^{n+1}},\, x_n + \tfrac{\varepsilon}{2^{n+1}}\bigr),
of length \varepsilon/2^{n}. These intervals cover the whole set and their
total length is
\sum_{n=1}^{\infty} \frac{\varepsilon}{2^{n}} = \varepsilon.
The cover has total length \varepsilon, arbitrarily small, so
\mu^{*}(\mathbb{Q}) = 0. Every countable set is
\mu^{*}-null. This is the rigorous version of the hand-wave on the
Riemann page: \mathbf{1}_{\mathbb{Q}} "ought to" integrate to
0 because \mathbb{Q} is negligible — and now
"negligible" has an exact meaning, outer measure 0. Notice the
\varepsilon/2^{n} budget again: the same geometric series that gave
subadditivity dissolves any countable set.
See the cover shrink
Drag the ε slider and watch the total length of the cover fall toward
0; drag k to add more points and see that even covering more of
the set costs almost nothing, because interval number n has width
\varepsilon/2^{n}. The readout is the exact finite sum
\sum_{n=1}^{k} \varepsilon/2^{n} = \varepsilon\,(1 - 2^{-k}), always strictly
below \varepsilon. Push ε down and the whole scaffold of intervals collapses to
the points — that limit, total length 0, is the outer measure of a countable set.
For a solid interval [a,b] the picture would refuse to collapse: no matter how
cleverly you cover it, compactness pins the total length at \ge b-a. Countable
dust shrinks to nothing; a continuum keeps its length. That contrast is the entire difference between
measure zero and positive measure.
Carathéodory's razor: which sets are measurable
We need a test that selects the sets on which \mu^{*} behaves additively — and
the test must use nothing but \mu^{*} itself, since that is all we have.
Carathéodory's idea is startlingly clean: a set is measurable if it splits every other set
additively.
A \text{ is (Lebesgue-)measurable} \iff \mu^{*}(E) = \mu^{*}(E \cap A) + \mu^{*}(E \cap A^{c}) \quad \text{for every } E \subseteq \mathbb{R}.
Read A as a knife. For any "test set"
E, the knife cuts it into the part inside
A and the part outside; A is measurable exactly when
those two pieces' outer measures always add back to the whole. By subadditivity the
\le direction is automatic (E = (E\cap A) \cup (E\cap A^{c})),
so the condition really only asks for the reverse inequality
\mu^{*}(E) \ge \mu^{*}(E\cap A) + \mu^{*}(E\cap A^{c}): the knife must not
lose any measure. Sets that fail this — like the Vitali set — leak: their two pieces overstate the
whole.
Let \mathcal{M} be the collection of Carathéodory-measurable subsets of
\mathbb{R}. Then:
-
\mu^{*} is an outer measure — monotone, countably
subadditive, and \mu^{*}(\varnothing)=0 — on all subsets.
-
\mathcal{M} is a σ-algebra: closed under complement and
countable unions, and containing \varnothing and
\mathbb{R}.
-
Restricted to \mathcal{M}, \mu^{*} is
countably additive — so \mu := \mu^{*}\!\restriction_{\mathcal{M}}
is a genuine measure, Lebesgue measure.
-
\mu is complete, and
\mathcal{M} contains every Borel set
(\mathcal{B} \subseteq \mathcal{M}) — hence every open set, closed set,
and every \mu^{*}-null set.
We state this without the full proof (the σ-algebra closure argument is a careful but elementary exercise
in applying the splitting condition to nested test sets). Two corollaries deserve a spotlight. Because
every open interval is measurable and \mathcal{M} is a σ-algebra,
\mathcal{M} swallows the whole Borel hierarchy —
\mathcal{B} \subseteq \mathcal{M}. And because
\mu is complete, every set of outer measure zero is
measurable: if \mu^{*}(A)=0 then for any test set
E, monotonicity forces
\mu^{*}(E\cap A)=0 and
\mu^{*}(E\cap A^{c}) \le \mu^{*}(E), so the splitting condition holds
trivially. Null sets pass Carathéodory's test for free.
It looks like it comes out of nowhere — why quantify over all test sets
E rather than just demand
\mu^{*}(A) + \mu^{*}(A^{c}) = \mu^{*}(\mathbb{R})? Because the strong,
universally-quantified form is exactly what makes the measurable sets close up into a σ-algebra and makes
\mu^{*} additive on them — the weaker "split only the whole line" condition is
too flabby to induct with. Carathéodory's real triumph is that his criterion is abstract:
it never mentions intervals, or \mathbb{R}, or length at all. Feed it
any outer measure on any set — outer measures built from coverings on a metric space, on
an abstract measurable space, on Hausdorff dimension — and the very same splitting condition promotes it to
a complete measure on its own σ-algebra of measurable sets. The construction you just saw for
\mathbb{R} is one instance of a universal machine for manufacturing measures out
of nothing but a notion of "cover cheaply".
Four traps that snare everyone the first time:
-
Outer measure is subadditive, not additive. For general disjoint sets you only get
\mu^{*}(A \cup B) \le \mu^{*}(A) + \mu^{*}(B) — writing
= is exactly the error the whole page exists to prevent. Additivity is a
privilege of measurable sets, earned via Carathéodory, not a property of
\mu^{*} itself.
-
Outer measure zero \Rightarrow measurable, but positive outer measure
alone tells you nothing. Null sets are always measurable (completeness). But a set can have
positive outer measure and still be non-measurable: the
Vitali set has
\mu^{*}(V) > 0 yet fails the splitting condition. Having a size is not the same
as being measurable.
-
The infimum need not be attained. There is generally no cheapest cover — only
covers of total length approaching \mu^{*}(A) from above. Every cover of
[0,1] by open intervals has length strictly greater than
1; the value 1 is the infimum, reached in the limit,
not by any one cover.
-
The cover is by OPEN intervals, and it is countable. Both matter. Open intervals make
the compactness/Heine–Borel argument work for the lower bound; countability is what lets the
\varepsilon/2^{n} budget converge. Swap in arbitrary (uncountable) covers or
drop openness and the theory changes.