The Baire Category Theorem
Here is a fact that ought to be impossible. Pick a continuous function on
[0,1] "at random" — whatever that should mean. It is almost certainly
nowhere differentiable: a jagged, fractal monster with no tangent line at a single point.
The smooth, well-behaved functions you drew all through calculus are not the norm — they are a
vanishingly rare exception. Nobody proves this by exhibiting the typical monster.
Instead one shows that the set of "nice" functions is topologically negligible, and the
engine that makes "negligible" a rigorous, usable idea is a single theorem about complete metric
spaces: the Baire category theorem.
That same theorem is the quiet workhorse under the three cornerstone results of linear analysis on
Banach spaces — the
uniform boundedness principle, the open mapping theorem, and the
closed graph theorem. Each looks like a statement about linear operators; each is,
at heart, the observation that a complete space is too "big" to be built out of countably many
thin pieces. Learn Baire once and all three fall out of the same move. This page teaches that
one idea — what it says, why completeness is the whole game, and how it powers functional analysis.
We work throughout in a metric space (X, d); the deep applications need
it to be complete
(every Cauchy sequence converges in X). For the operator theorems
X and Y will be Banach spaces — complete
normed vector spaces.
Step 1 — nowhere dense: sets with no room inside
The first building block measures a very strong kind of smallness. A subset
A \subseteq X is nowhere dense if its closure has empty
interior:
A \text{ is nowhere dense} \iff \operatorname{int}\big(\overline{A}\big) = \varnothing.
Unpack it. Taking the closure \overline{A} first is essential — it fills
in all the limit points, so we cannot cheat by pointing at a "gappy" set that is secretly dense.
Empty interior then says: no open ball fits inside \overline{A}.
Equivalently, the complement X \setminus \overline{A} is a
dense open set — around every point, arbitrarily close, there are points that miss
A entirely. A nowhere-dense set is one you can always dodge.
-
A single point in \mathbb{R} is nowhere dense — its
closure is itself, and no interval fits in a point.
-
A line inside \mathbb{R}^2, or the integers
\mathbb{Z} inside \mathbb{R}: closed already,
and containing no ball. Nowhere dense.
-
The middle-thirds Cantor set C \subseteq [0,1]: closed
(it is an intersection of closed sets) and containing no interval (at stage
n it is covered by 2^n intervals of total
length (2/3)^n \to 0). Nowhere dense — yet uncountable. Small
in this topological sense has nothing to do with cardinality.
-
The rationals \mathbb{Q} \subseteq \mathbb{R} are
not nowhere dense: \overline{\mathbb{Q}} = \mathbb{R}, whose
interior is all of \mathbb{R}. But — crucially —
\mathbb{Q} is a countable union of the nowhere-dense
singletons \{q\}. That distinction is the next idea.
Step 2 — meagre vs comeagre: a topological notion of "size"
Nowhere-dense is the atom of smallness. Close it under countable unions and you get
the working notion. A set is meagre (older name: of the first
category) if it is a countable union of nowhere-dense sets:
M \text{ meagre} \iff M = \bigcup_{n=1}^{\infty} A_n, \quad \text{each } A_n \text{ nowhere dense.}
Anything not meagre is of the second category; the complement of a meagre set is
comeagre (or residual). Think of meagre as "topologically
negligible" and comeagre as "topologically almost everything." So
\mathbb{Q}, a countable union of points, is meagre in
\mathbb{R}, and the irrationals
\mathbb{R} \setminus \mathbb{Q} are comeagre.
Why insist on countable unions? Two reasons. First, meagreness is meant to be a
\sigma-ideal — closed under countable unions, so that "a countable list of
negligible things is still negligible" (a monotone-class-style closure that mirrors the
measure-zero sets). Second, an arbitrary union of nowhere-dense sets is useless:
every point \{x\} is nowhere dense, and their uncountable union is all of
X. The line between countable and uncountable is exactly where the content
lives — and Baire's theorem is the statement that, in a complete space, that line cannot be crossed.
No — and the mismatch is spectacular. Both meagre sets and Lebesgue-null sets are notions of
"negligible," and you might hope they agree. They do not. One can split the real line as
\mathbb{R} = M \cup N where M is
meagre (topologically tiny) and N has
Lebesgue measure zero (metrically tiny) — so every real number is
negligible in at least one of the two senses, yet the whole line is the union. A set can be fat for
measure and thin for category, or vice versa. The "fat Cantor set" (a nowhere-dense set of positive
measure) is meagre but not null; a comeagre set can have measure zero. Keep the two ideas in
separate mental drawers.
Step 3 — the theorem
Everything above is definitions; here is the one theorem, in the three interchangeable forms you will
meet in the wild. All three say the same thing about a complete metric space.
Let (X, d) be a complete metric space (or, more
generally, a locally compact Hausdorff space). Then the following equivalent statements hold.
-
(Intersection form) A countable intersection of dense open sets is dense: if
each G_n is open and dense, then
\bigcap_{n} G_n is dense in X.
-
(Union form) A countable union of nowhere-dense sets has empty interior;
equivalently, X is not meagre in itself.
-
(Category form) If X = \bigcup_{n} F_n with each
F_n closed, then at least one F_n has
nonempty interior — it contains an open ball.
The three forms are De Morgan duals: complement the dense open G_n to get
nowhere-dense closed F_n = X \setminus G_n, and "the intersection stays
dense" becomes "the union misses points," i.e. cannot be all of X. The
category form is the one that fuels functional analysis: write your complete space as a countable
union of closed pieces, and one piece must be fat enough to contain a ball.
Why completeness does all the work: the nested-ball proof
Take the intersection form. Let G_1, G_2, \dots be dense open, and let
B_0 be any open ball; we must show
B_0 \cap \bigcap_n G_n \ne \varnothing. Because
G_1 is dense and open, it meets B_0 in a
nonempty open set, so we can find a closed ball
\overline{B_1} \subseteq B_0 \cap G_1, \qquad \operatorname{radius}(B_1) < \tfrac{1}{2}.
Now repeat inside B_1 using G_2: density and
openness give a closed ball \overline{B_2} \subseteq B_1 \cap G_2 of radius
< \tfrac{1}{4}. Continuing, we get a nested sequence
\overline{B_1} \supseteq \overline{B_2} \supseteq \overline{B_3} \supseteq \cdots, \qquad \operatorname{radius}(B_n) < 2^{-n},
with \overline{B_n} \subseteq G_n. The centres
(x_n) form a Cauchy sequence — for
m, n \ge N both centres lie in \overline{B_N},
so d(x_m, x_n) < 2^{-N+1}. Here, and only here, we spend
completeness: the limit x = \lim x_n exists in
X. Since the balls are closed and nested, x \in
\overline{B_n} for every n, hence
x \in \bigcap_n G_n, and also x \in B_0. Done.
Remove completeness and the sequence of shrinking demands can converge to a "hole" that isn't a point
of the space — the whole argument leaks out. That is not a technicality; it is the theorem's entire
content. Watch it happen in the figure below.
Seeing Baire: a dense open set you can shrink to nothing
Enumerate the rationals in [0,1] as
q_1, q_2, q_3, \dots and, given a budget
\varepsilon > 0, put an open interval of length
\varepsilon \, 2^{-k} around q_k. Their union
G_\varepsilon = \bigcup_{k=1}^{\infty} \Big(q_k - \tfrac{\varepsilon}{2^{k+1}},\; q_k + \tfrac{\varepsilon}{2^{k+1}}\Big)
is open and dense (it contains every rational, and the rationals
are dense), yet its total length is at most
\sum_k \varepsilon\,2^{-k} = \varepsilon — as small as we please. Step
through the construction, and slide \varepsilon toward
0: the covered set thins out but never stops being dense. Now Baire says
that intersecting countably many such dense open sets
\bigcap_m G_{1/m} still leaves a dense residual — a comeagre set
of measure zero on which "most" of the interesting behaviour lives.
The engine: one move, three theorems
Every one of the three big theorems runs the same play. You have a Banach space
X — complete, hence non-meagre in itself. You write it as a countable
union of closed sets X = \bigcup_n F_n built from the operators in
question. Baire's category form forces one F_N to contain an open ball. A
dose of linearity (scaling and translating that ball, using symmetry and convexity)
then spreads the local control to the whole space. That is the entire strategy; only the choice of
F_n changes. Keep it in mind as the three theorems go by.
Theorem A — the Uniform Boundedness Principle (Banach–Steinhaus)
Let X be a Banach space, Y a normed space,
and \{T_i\}_{i \in I} a family of
bounded linear
operators T_i : X \to Y. If the family is
pointwise bounded, then it is uniformly bounded:
-
Hypothesis: for each x \in X,
\sup_{i} \|T_i x\| < \infty.
-
Conclusion: \sup_{i} \|T_i\| < \infty — a single
finite bound on all the operator norms at once.
How Baire drives it. For n \in \mathbb{N} set
F_n = \big\{\, x \in X : \sup_{i} \|T_i x\| \le n \,\big\} = \bigcap_{i} \big\{\, x : \|T_i x\| \le n \,\big\}.
Each \{x : \|T_i x\| \le n\} is closed (preimage of a closed set under the
continuous map x \mapsto \|T_i x\|), so F_n is
closed. Pointwise boundedness says every x lands in some
F_n, i.e. X = \bigcup_n F_n. By Baire, some
F_N contains a ball B(x_0, r). For
\|z\| \le r we then have x_0, x_0 + z \in F_N, so
by the triangle inequality \|T_i z\| \le \|T_i(x_0+z)\| + \|T_i x_0\| \le 2N
for every i. Rescaling, \|T_i\| \le 2N/r
for all i. The pointwise data has become a uniform bound — that is the
Baire miracle in three lines.
Uniform boundedness settles a question Fourier could not: must the Fourier series of a
continuous function converge? No. There exists a continuous
2\pi-periodic f whose Fourier partial sums
S_N f(0) diverge. The proof is pure Banach–Steinhaus and
exhibits no function at all. On the Banach space C(\mathbb{T}) with the
sup norm, evaluation-of-the-partial-sum at 0 is a bounded functional
\Lambda_N(f) = S_N f(0) = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(t)\,D_N(t)\,dt, \qquad \|\Lambda_N\| = \frac{1}{2\pi}\int_{-\pi}^{\pi} |D_N(t)|\,dt = L_N,
where D_N is the Dirichlet kernel and L_N the
Lebesgue constant. A classical estimate gives
L_N \sim \tfrac{4}{\pi^2}\ln N \to \infty, so the norms
\|\Lambda_N\| are not uniformly bounded. By the contrapositive
of Banach–Steinhaus, the family cannot be pointwise bounded — there must be some
f \in C(\mathbb{T}) with \sup_N |\Lambda_N(f)| =
\infty, i.e. S_N f(0) diverges. In fact the set of such
"bad" f is comeagre: divergence is the topologically
typical behaviour, convergence the exception. Baire strikes again.
Theorem B — the Open Mapping Theorem (and bounded inverse)
Let X, Y be Banach spaces and
T : X \to Y a bounded linear surjection. Then:
-
Openness: T is an open map — it sends open
sets to open sets.
-
Bounded inverse theorem (corollary): if moreover
T is a bijection, then T^{-1} is
automatically bounded. A continuous linear bijection between Banach spaces is a
(bi-continuous) isomorphism.
How Baire drives it. It suffices to show the image
T\big(B_X(0,1)\big) of the open unit ball contains a ball around
0 in Y. Since T is
onto and X = \bigcup_n B_X(0,n),
Y = T(X) = \bigcup_{n=1}^{\infty} \overline{T\big(B_X(0,n)\big)}.
Y is complete, so by Baire one of these closed sets has nonempty interior;
by scaling, \overline{T(B_X(0,1))} contains a ball. A second step — a
successive-approximation argument that uses completeness of
X to sum a rapidly converging series of preimages — upgrades the closure
\overline{T(B_X(0,1))} to the genuine image
T(B_X(0,2)), removing the bar. Both completenesses appear: Baire needs
Y complete, the series needs X complete.
Theorem C — the Closed Graph Theorem
Let X, Y be Banach spaces and
T : X \to Y linear. Then T is
bounded (continuous) if and only if its graph is closed:
-
\Gamma(T) = \{ (x, Tx) : x \in X \} \subseteq X \times Y is closed
in the product.
-
Concretely: whenever x_n \to x and
Tx_n \to y, one already has y = Tx. You get
to assume the limit y exists — you need only check it is the
right point.
How Baire drives it (via B). The "only if" is easy. For "if", equip the product
X \times Y with the norm \|(x,y)\| = \|x\| + \|y\|
— it is a Banach space. The graph \Gamma(T), being a closed
subspace of a Banach space, is itself Banach. The projection
\pi_X : \Gamma(T) \to X, (x, Tx) \mapsto x, is a
bounded linear bijection between Banach spaces, so by the bounded inverse
theorem its inverse x \mapsto (x, Tx) is bounded — whence
\|Tx\| \le \|(x,Tx)\| \le C\|x\| and T is
bounded. So the closed graph theorem is the open mapping theorem in disguise, and both are Baire.
The practical payoff: to prove an operator continuous you no longer have to produce the limit of
Tx_n from thin air — the hardest half of continuity is granted for free.
You merely verify that if the two limits exist they agree, which is usually a one-line computation.
The one hypothesis you must never drop
All three theorems carry the word Banach for a reason. Baire is a theorem about
complete spaces, so every result it powers inherits completeness as a load-bearing wall — of
both spaces. Kick it out and each theorem fails, often dramatically.
The single most common exam mistake is applying uniform boundedness, the open mapping theorem, or
the closed graph theorem on a normed space that is not complete. They are false there.
Concrete counterexamples, one per theorem:
-
Uniform boundedness fails. Let
c_{00} be the space of finitely-supported sequences with the sup norm
\|x\|_\infty — a normed space that is not complete. Define
functionals T_n(x) = n\,x_n. Each x \in c_{00}
has only finitely many nonzero terms, so T_n x = 0 for large
n and \sup_n |T_n x| < \infty — the family is
pointwise bounded. But \|T_n\| = n \to \infty, so it is
not uniformly bounded. The only missing hypothesis is completeness of the domain, and its
absence is fatal. (The proof breaks exactly at Baire: the incomplete
c_{00} is meagre in itself.)
-
Closed graph fails. The differentiation operator
D = \tfrac{d}{dx} from
\big(C^1[0,1], \|\cdot\|_\infty\big) to
\big(C[0,1], \|\cdot\|_\infty\big) has a closed graph
(if f_n \to f uniformly and f_n' \to g
uniformly, then f' = g) but is unbounded
(take f_n(x) = \sin(nx): \|f_n\|_\infty = 1
but \|f_n'\|_\infty = n). No contradiction with the theorem: the domain
\big(C^1, \|\cdot\|_\infty\big) is not complete in the sup
norm.
-
Bounded inverse fails. On an incomplete space the identity-style trick collapses:
the formal identity from \big(C[0,1], \|\cdot\|_\infty\big) to the same
space under the weaker L^1 norm is a continuous bijection whose inverse
is unbounded — precisely because \big(C[0,1], \|\cdot\|_1\big)
is not complete.
Moral: before you invoke any of the three, check that both the source and the target are
Banach. It is not decoration — it is the hypothesis that summons Baire.
René-Louis Baire proved his theorem in his 1899 doctoral thesis, aged 25, while classifying which
functions are pointwise limits of continuous ones (the "Baire classes"). For decades it looked like
a curiosity of real analysis. Then in 1927 Stefan Banach and Hugo Steinhaus turned
it into the uniform boundedness principle, and the open mapping and closed graph theorems followed —
suddenly a lemma about complete metric spaces was the backbone of an entire subject. There is even a
rival, Baire-free proof of uniform boundedness (the "gliding hump," which builds a bad
x by hand), but the Baire route is shorter, slicker, and reveals the
common structure behind all three theorems — which is why it is the one everyone teaches.