The Lebesgue Integral
We have paid the entrance fee. A
measure
\mu now tells us how big a measurable set is, and a
measurable function
f is one whose level sets \{f \ge y\} are
things \mu can weigh. With those two tools in hand we can finally do the
thing the whole module was built for: define the integral
\int f \,d\mu.
And we define it in a way that will look strange to anyone raised on Riemann. There are no vertical
strips, no upper sums, no two-sided squeeze. Instead we build the integral from below,
in three clean stages — first for the simplest possible functions, then for every non-negative
measurable function by a single supremum, then for signed functions by splitting into a
positive and a negative part. Each stage is a few lines. That economy is the whole point: the hard
work went into measure theory, and now the payoff arrives almost for free.
Stage 1 — simple functions: integrate by counting values
A simple function is a measurable function that takes only finitely many values. Every
such non-negative \varphi can be written in standard form as a
finite sum of indicator functions of disjoint measurable sets:
\varphi = \sum_{k=1}^{n} c_k \, \mathbf{1}_{A_k}, \qquad c_k \ge 0, \quad A_k \text{ disjoint, measurable.}
Its integral is the one definition you would guess — value times size, summed:
\int \varphi \, d\mu \,=\, \sum_{k=1}^{n} c_k \, \mu(A_k).
We adopt the convention 0 \cdot \infty = 0: a height of zero over a set of
infinite measure contributes nothing (the graph has no area), and a positive height over a null set
contributes nothing either. This makes the arithmetic in [0, \infty] behave.
Well-defined. A simple function has many representations —
2\,\mathbf{1}_{[0,1]} equals
2\,\mathbf{1}_{[0,\frac12)} + 2\,\mathbf{1}_{[\frac12,1]} — so we must check the
value \sum c_k \mu(A_k) does not depend on which one we pick. It does not:
given two representations, pass to the common refinement formed from all the pairwise intersections
A_k \cap B_j; finite additivity of \mu splits each
term exactly in step, and the two sums collapse to the same number. From the same refinement trick,
integration of simple functions is linear
(\int(a\varphi + b\psi) = a\int\varphi + b\int\psi) and
monotone (\varphi \le \psi \Rightarrow \int\varphi \le \int\psi).
On [0,1] with ordinary length \mu, take
\varphi = 3\,\mathbf{1}_{[0,\,1/4)} + 5\,\mathbf{1}_{[1/4,\,1]}.
The two sets are disjoint and measurable, with
\mu([0,\tfrac14)) = \tfrac14 and
\mu([\tfrac14,1]) = \tfrac34. So
\int_{[0,1]} \varphi \, d\mu = 3\cdot\tfrac14 + 5\cdot\tfrac34 = \tfrac34 + \tfrac{15}{4} = \tfrac{18}{4} = \tfrac92.
Exactly the total area of the two boxes — no partitions, no limits. For a step function Riemann would
agree; the point is that we never referred to the domain's geometry, only to the measures of
the level sets. That indifference to the shape of the domain is what will let the same formula swallow
the Dirichlet function a moment from now.
Stage 2 — non-negative functions: the integral from below
Now let f \ge 0 be any measurable function — possibly wild, possibly
unbounded. We define its integral as the supremum of the integrals of all simple functions
that fit underneath it:
\int f \, d\mu \,=\, \sup\Bigl\{ \int \varphi \, d\mu \;:\; \varphi \text{ simple},\; 0 \le \varphi \le f \Bigr\} \;\in\; [0, \infty].
Read it slowly. We look at every simple staircase \varphi that stays below
f everywhere, compute its honest area \int\varphi
by Stage 1, and take the least upper bound over all of them. That single number is the
integral. It is always defined (the set of candidate values is non-empty — \varphi=0
qualifies — and bounded above in [0,\infty]), and for a simple
f it recovers Stage 1, since \varphi = f is the
largest admissible staircase.
Notice what is absent from this definition, compared with Riemann:
- There are no upper sums — no over-estimating staircases pressing down from above.
- There is no squeeze — we never demand that an approach from below meets an approach from above.
- There is only a supremum from below, and it is automatically the right answer.
Why is one-sided enough, when Riemann needed both jaws of a vice to close? Because the two theories
slice in different directions. Riemann slices the domain and must pin the height on each strip
between an inf and a sup — a genuinely two-sided problem, and the source of every failure. Lebesgue
slices the range: a simple \varphi \le f chops the values of
f into finitely many levels and asks only for the measure of each
super-level set. Refining the levels can only add area under f, never
overshoot, so pushing the staircases up toward f converges on the true value
with nothing needed from above. The supremum is the integral because the range-slicing can
never overstate the case.
Drag the slider to add levels. Each new level cuts a fresh horizontal band; for band at height
c_k we measure the set where f reaches that high,
and the simple integral \sum_k c_k \cdot \mu(A_k) — read out live — climbs
toward the area under f. It only ever rises, and it never crosses the curve.
That monotone climb from below is the entire content of Stage 2.
Stage 3 — signed functions: split into positive and negative parts
A general measurable f takes both signs. Break it into two non-negative
pieces, the positive and negative parts:
f^{+} = \max(f, 0), \qquad f^{-} = \max(-f, 0), \qquad\text{so}\qquad f = f^{+} - f^{-}, \quad |f| = f^{+} + f^{-}.
Both f^{+} and f^{-} are non-negative and
measurable, so Stage 2 gives each of them an integral in [0,\infty]. We call
f integrable (or summable) when
both are finite:
f \text{ integrable} \iff \int f^{+}\, d\mu < \infty \ \text{ and } \ \int f^{-}\, d\mu < \infty \iff \int |f|\, d\mu < \infty,
and then the integral is the difference — a subtraction of two finite numbers, so no
\infty - \infty ever appears:
\int f \, d\mu \,=\, \int f^{+}\, d\mu \,-\, \int f^{-}\, d\mu.
The equivalence with \int|f| < \infty is immediate: since
|f| = f^{+} + f^{-}, linearity on non-negative functions makes
\int|f| = \int f^{+} + \int f^{-}, and a sum of two non-negative numbers is
finite iff both are. This is why Lebesgue integrability is often stated in one breath as
"\int|f| < \infty" — the Lebesgue integral is an absolute
integral.
Finally, to integrate over a measurable subset E rather than the whole space,
multiply by that set's indicator:
\int_E f \, d\mu \,=\, \int f \cdot \mathbf{1}_E \, d\mu.
The triumph: \displaystyle\int_0^1 \mathbf{1}_{\mathbb{Q}} \, d\mu = 0
Here is the function that
Riemann could not touch
— the indicator of the rationals, whose upper and lower Darboux integrals were stuck forever at
1 and 0. Under Lebesgue it is not a pathology at
all; it is a simple function, and we integrate it by the Stage 1 formula in a single line.
On [0,1], write
\mathbf{1}_{\mathbb{Q}} = 1\cdot\mathbf{1}_{A} + 0\cdot\mathbf{1}_{B} where
A = \mathbb{Q}\cap[0,1] and B is the irrationals.
The rationals are countable, hence a null set: \mu(A) = 0. Therefore
\int_0^1 \mathbf{1}_{\mathbb{Q}} \, d\mu = 1\cdot\mu\bigl(\mathbb{Q}\cap[0,1]\bigr) + 0\cdot\mu(B) = 1\cdot 0 + 0\cdot 1 = 0.
No partitions to refine, no gap to close, no drama. Where the Riemann machine ground to a halt because
every subinterval held both a rational and an irrational, the Lebesgue machine simply weighs the
set where the function is 1, finds it has measure zero, and returns
0. This one line is the concrete cash value of the entire module — the
promise made on
"The Need for Measure",
now kept.
The basic properties, and how Riemann fits inside
From the three-stage construction the workhorse rules follow (each proved first for simple functions,
then lifted by the supremum). For integrable f, g and scalars
a, b:
- Linearity: \displaystyle\int (a f + b g)\, d\mu = a\int f\, d\mu + b\int g\, d\mu.
- Monotonicity: f \le g \Rightarrow \int f\, d\mu \le \int g\, d\mu.
- Triangle inequality: \bigl|\int f\, d\mu\bigr| \le \int |f|\, d\mu.
-
Vanishing:
\int |f|\, d\mu = 0 \iff f = 0 almost everywhere
(i.e. the set \{f \ne 0\} is null).
The last one is worth dwelling on: the integral is blind to what happens on a null set.
Change f on the rationals, on any countable set, on any measure-zero set at
all, and every integral of f is unchanged. "Almost everywhere" becomes the
native dialect of the theory.
Built in three stages, for a measure space (X, \mathcal{M}, \mu):
- Simple \varphi = \sum_k c_k \mathbf{1}_{A_k} (disjoint A_k): \int \varphi\, d\mu = \sum_k c_k\, \mu(A_k), with 0\cdot\infty = 0.
- Non-negative f \ge 0: \int f\, d\mu = \sup\{\int \varphi\, d\mu : 0 \le \varphi \le f,\ \varphi \text{ simple}\} — a supremum from below only.
- General f = f^{+} - f^{-}: \int f\, d\mu = \int f^{+}\, d\mu - \int f^{-}\, d\mu, defined when f is integrable (\int|f|\, d\mu < \infty).
- Agreement: every Riemann-integrable f on [a,b] is Lebesgue-integrable there, with the same value — the Lebesgue integral extends the Riemann integral.
That last bullet is the reassurance that nothing was lost: for every honest area you already know how to
compute, the two integrals return the identical number. The Lebesgue integral is a strict superset — it
agrees with Riemann where Riemann is defined, and keeps going where Riemann quits.
Lebesgue liked to explain his integral as a way of counting money. Riemann totals a pile of coins in
the order they lie on the counter; Lebesgue first sorts by denomination — all the pennies,
all the dimes — counts each heap, and multiplies by its value. Sorting the range
instead of marching along the domain is exactly the "supremum over simple
\varphi \le f" definition.
And that one-sided, from-below definition has a beautiful dividend. Because the integral is a supremum
over staircases underneath f, if a sequence
0 \le f_1 \le f_2 \le \cdots \uparrow f climbs monotonically up to
f, the integrals climb up to \int f as well:
f_n \uparrow f \implies \int f_n \, d\mu \uparrow \int f \, d\mu.
This is the Monotone Convergence Theorem, and it falls out almost for free precisely
because we defined \int f as a sup from below — there is no upper-sum
machinery fighting the limit. The very swap of limit and integral that Riemann could not perform is,
for Lebesgue, nearly a definition. (Its siblings, Fatou's lemma and Dominated Convergence, are the
subject of the next pages.)
Four traps in the definition of the integral:
-
"Integrable" means \int|f| < \infty — both parts finite.
If \int f^{+} = \int f^{-} = \infty, the function is not
integrable, even though you might be tempted to write "\int f = \infty - \infty".
The difference is simply undefined. A non-negative f always has a
well-defined integral in [0,\infty], but a signed one needs both parts
tamed.
-
Improper Riemann integrals are not automatically Lebesgue integrals. The classic
\int_0^\infty \frac{\sin x}{x}\, dx = \frac{\pi}{2} exists as an
improper (conditionally convergent) Riemann integral, but it is not
Lebesgue integrable, because \int_0^\infty \bigl|\tfrac{\sin x}{x}\bigr|\, dx = \infty.
The Lebesgue integral is absolute; it refuses conditionally convergent cancellation.
-
\int f = 0 does NOT mean f = 0.
It means f = 0 almost everywhere. The function
\mathbf{1}_{\mathbb{Q}} integrates to 0 yet is
nonzero on a dense infinite set. For a non-negative f, though,
\int f = 0 does force f = 0 a.e.
-
The sup is over simple functions strictly BELOW f only.
Do not also take an infimum from above and check they meet — that is the Riemann reflex, and it is
not part of this definition. Range-slicing needs one side; adding the other buys nothing and
reintroduces exactly the failure mode we escaped.