The Lebesgue Integral

We have paid the entrance fee. A measure \mu now tells us how big a measurable set is, and a measurable function f is one whose level sets \{f \ge y\} are things \mu can weigh. With those two tools in hand we can finally do the thing the whole module was built for: define the integral

\int f \,d\mu.

And we define it in a way that will look strange to anyone raised on Riemann. There are no vertical strips, no upper sums, no two-sided squeeze. Instead we build the integral from below, in three clean stages — first for the simplest possible functions, then for every non-negative measurable function by a single supremum, then for signed functions by splitting into a positive and a negative part. Each stage is a few lines. That economy is the whole point: the hard work went into measure theory, and now the payoff arrives almost for free.

Stage 1 — simple functions: integrate by counting values

A simple function is a measurable function that takes only finitely many values. Every such non-negative \varphi can be written in standard form as a finite sum of indicator functions of disjoint measurable sets:

\varphi = \sum_{k=1}^{n} c_k \, \mathbf{1}_{A_k}, \qquad c_k \ge 0, \quad A_k \text{ disjoint, measurable.}

Its integral is the one definition you would guess — value times size, summed:

\int \varphi \, d\mu \,=\, \sum_{k=1}^{n} c_k \, \mu(A_k).

We adopt the convention 0 \cdot \infty = 0: a height of zero over a set of infinite measure contributes nothing (the graph has no area), and a positive height over a null set contributes nothing either. This makes the arithmetic in [0, \infty] behave.

Well-defined. A simple function has many representations — 2\,\mathbf{1}_{[0,1]} equals 2\,\mathbf{1}_{[0,\frac12)} + 2\,\mathbf{1}_{[\frac12,1]} — so we must check the value \sum c_k \mu(A_k) does not depend on which one we pick. It does not: given two representations, pass to the common refinement formed from all the pairwise intersections A_k \cap B_j; finite additivity of \mu splits each term exactly in step, and the two sums collapse to the same number. From the same refinement trick, integration of simple functions is linear (\int(a\varphi + b\psi) = a\int\varphi + b\int\psi) and monotone (\varphi \le \psi \Rightarrow \int\varphi \le \int\psi).

On [0,1] with ordinary length \mu, take

\varphi = 3\,\mathbf{1}_{[0,\,1/4)} + 5\,\mathbf{1}_{[1/4,\,1]}.

The two sets are disjoint and measurable, with \mu([0,\tfrac14)) = \tfrac14 and \mu([\tfrac14,1]) = \tfrac34. So

\int_{[0,1]} \varphi \, d\mu = 3\cdot\tfrac14 + 5\cdot\tfrac34 = \tfrac34 + \tfrac{15}{4} = \tfrac{18}{4} = \tfrac92.

Exactly the total area of the two boxes — no partitions, no limits. For a step function Riemann would agree; the point is that we never referred to the domain's geometry, only to the measures of the level sets. That indifference to the shape of the domain is what will let the same formula swallow the Dirichlet function a moment from now.

Stage 2 — non-negative functions: the integral from below

Now let f \ge 0 be any measurable function — possibly wild, possibly unbounded. We define its integral as the supremum of the integrals of all simple functions that fit underneath it:

\int f \, d\mu \,=\, \sup\Bigl\{ \int \varphi \, d\mu \;:\; \varphi \text{ simple},\; 0 \le \varphi \le f \Bigr\} \;\in\; [0, \infty].

Read it slowly. We look at every simple staircase \varphi that stays below f everywhere, compute its honest area \int\varphi by Stage 1, and take the least upper bound over all of them. That single number is the integral. It is always defined (the set of candidate values is non-empty — \varphi=0 qualifies — and bounded above in [0,\infty]), and for a simple f it recovers Stage 1, since \varphi = f is the largest admissible staircase.

Notice what is absent from this definition, compared with Riemann:

There are no upper sums — no over-estimating staircases pressing down from above.
There is no squeeze — we never demand that an approach from below meets an approach from above.
There is only a supremum from below, and it is automatically the right answer.

Why is one-sided enough, when Riemann needed both jaws of a vice to close? Because the two theories slice in different directions. Riemann slices the domain and must pin the height on each strip between an inf and a sup — a genuinely two-sided problem, and the source of every failure. Lebesgue slices the range: a simple \varphi \le f chops the values of f into finitely many levels and asks only for the measure of each super-level set. Refining the levels can only add area under f, never overshoot, so pushing the staircases up toward f converges on the true value with nothing needed from above. The supremum is the integral because the range-slicing can never overstate the case.

Drag the slider to add levels. Each new level cuts a fresh horizontal band; for band at height c_k we measure the set where f reaches that high, and the simple integral \sum_k c_k \cdot \mu(A_k) — read out live — climbs toward the area under f. It only ever rises, and it never crosses the curve. That monotone climb from below is the entire content of Stage 2.

Stage 3 — signed functions: split into positive and negative parts

A general measurable f takes both signs. Break it into two non-negative pieces, the positive and negative parts:

f^{+} = \max(f, 0), \qquad f^{-} = \max(-f, 0), \qquad\text{so}\qquad f = f^{+} - f^{-}, \quad |f| = f^{+} + f^{-}.

Both f^{+} and f^{-} are non-negative and measurable, so Stage 2 gives each of them an integral in [0,\infty]. We call f integrable (or summable) when both are finite:

f \text{ integrable} \iff \int f^{+}\, d\mu < \infty \ \text{ and } \ \int f^{-}\, d\mu < \infty \iff \int |f|\, d\mu < \infty,

and then the integral is the difference — a subtraction of two finite numbers, so no \infty - \infty ever appears:

\int f \, d\mu \,=\, \int f^{+}\, d\mu \,-\, \int f^{-}\, d\mu.

The equivalence with \int|f| < \infty is immediate: since |f| = f^{+} + f^{-}, linearity on non-negative functions makes \int|f| = \int f^{+} + \int f^{-}, and a sum of two non-negative numbers is finite iff both are. This is why Lebesgue integrability is often stated in one breath as "\int|f| < \infty" — the Lebesgue integral is an absolute integral.

Finally, to integrate over a measurable subset E rather than the whole space, multiply by that set's indicator:

\int_E f \, d\mu \,=\, \int f \cdot \mathbf{1}_E \, d\mu.

The triumph: \displaystyle\int_0^1 \mathbf{1}_{\mathbb{Q}} \, d\mu = 0

Here is the function that Riemann could not touch — the indicator of the rationals, whose upper and lower Darboux integrals were stuck forever at 1 and 0. Under Lebesgue it is not a pathology at all; it is a simple function, and we integrate it by the Stage 1 formula in a single line.

On [0,1], write \mathbf{1}_{\mathbb{Q}} = 1\cdot\mathbf{1}_{A} + 0\cdot\mathbf{1}_{B} where A = \mathbb{Q}\cap[0,1] and B is the irrationals. The rationals are countable, hence a null set: \mu(A) = 0. Therefore

\int_0^1 \mathbf{1}_{\mathbb{Q}} \, d\mu = 1\cdot\mu\bigl(\mathbb{Q}\cap[0,1]\bigr) + 0\cdot\mu(B) = 1\cdot 0 + 0\cdot 1 = 0.

No partitions to refine, no gap to close, no drama. Where the Riemann machine ground to a halt because every subinterval held both a rational and an irrational, the Lebesgue machine simply weighs the set where the function is 1, finds it has measure zero, and returns 0. This one line is the concrete cash value of the entire module — the promise made on "The Need for Measure", now kept.

The basic properties, and how Riemann fits inside

From the three-stage construction the workhorse rules follow (each proved first for simple functions, then lifted by the supremum). For integrable f, g and scalars a, b:

Linearity: \displaystyle\int (a f + b g)\, d\mu = a\int f\, d\mu + b\int g\, d\mu.
Monotonicity: f \le g \Rightarrow \int f\, d\mu \le \int g\, d\mu.
Triangle inequality: \bigl|\int f\, d\mu\bigr| \le \int |f|\, d\mu.
Vanishing: \int |f|\, d\mu = 0 \iff f = 0 almost everywhere (i.e. the set \{f \ne 0\} is null).

The last one is worth dwelling on: the integral is blind to what happens on a null set. Change f on the rationals, on any countable set, on any measure-zero set at all, and every integral of f is unchanged. "Almost everywhere" becomes the native dialect of the theory.

Built in three stages, for a measure space (X, \mathcal{M}, \mu):

Simple \varphi = \sum_k c_k \mathbf{1}_{A_k} (disjoint A_k): \int \varphi\, d\mu = \sum_k c_k\, \mu(A_k), with 0\cdot\infty = 0.
Non-negative f \ge 0: \int f\, d\mu = \sup\{\int \varphi\, d\mu : 0 \le \varphi \le f,\ \varphi \text{ simple}\} — a supremum from below only.
General f = f^{+} - f^{-}: \int f\, d\mu = \int f^{+}\, d\mu - \int f^{-}\, d\mu, defined when f is integrable (\int|f|\, d\mu < \infty).
Agreement: every Riemann-integrable f on [a,b] is Lebesgue-integrable there, with the same value — the Lebesgue integral extends the Riemann integral.

That last bullet is the reassurance that nothing was lost: for every honest area you already know how to compute, the two integrals return the identical number. The Lebesgue integral is a strict superset — it agrees with Riemann where Riemann is defined, and keeps going where Riemann quits.

Lebesgue liked to explain his integral as a way of counting money. Riemann totals a pile of coins in the order they lie on the counter; Lebesgue first sorts by denomination — all the pennies, all the dimes — counts each heap, and multiplies by its value. Sorting the range instead of marching along the domain is exactly the "supremum over simple \varphi \le f" definition.

And that one-sided, from-below definition has a beautiful dividend. Because the integral is a supremum over staircases underneath f, if a sequence 0 \le f_1 \le f_2 \le \cdots \uparrow f climbs monotonically up to f, the integrals climb up to \int f as well:

f_n \uparrow f \implies \int f_n \, d\mu \uparrow \int f \, d\mu.

This is the Monotone Convergence Theorem, and it falls out almost for free precisely because we defined \int f as a sup from below — there is no upper-sum machinery fighting the limit. The very swap of limit and integral that Riemann could not perform is, for Lebesgue, nearly a definition. (Its siblings, Fatou's lemma and Dominated Convergence, are the subject of the next pages.)

Four traps in the definition of the integral:

"Integrable" means \int|f| < \infty — both parts finite. If \int f^{+} = \int f^{-} = \infty, the function is not integrable, even though you might be tempted to write "\int f = \infty - \infty". The difference is simply undefined. A non-negative f always has a well-defined integral in [0,\infty], but a signed one needs both parts tamed.
Improper Riemann integrals are not automatically Lebesgue integrals. The classic \int_0^\infty \frac{\sin x}{x}\, dx = \frac{\pi}{2} exists as an improper (conditionally convergent) Riemann integral, but it is not Lebesgue integrable, because \int_0^\infty \bigl|\tfrac{\sin x}{x}\bigr|\, dx = \infty. The Lebesgue integral is absolute; it refuses conditionally convergent cancellation.
\int f = 0 does NOT mean f = 0. It means f = 0 almost everywhere. The function \mathbf{1}_{\mathbb{Q}} integrates to 0 yet is nonzero on a dense infinite set. For a non-negative f, though, \int f = 0 does force f = 0 a.e.
The sup is over simple functions strictly BELOW f only. Do not also take an infimum from above and check they meet — that is the Riemann reflex, and it is not part of this definition. Range-slicing needs one side; adding the other buys nothing and reintroduces exactly the failure mode we escaped.