Lᵖ Completeness & Convergence

The Riesz-Fischer Theorem

In the previous chapter, \(L^p\) Spaces — Construction & Inequalities, we constructed \(L^p(\Omega, \mathcal{F}, \mu)\) as a normed vector space: we passed from raw measurable functions to equivalence classes modulo a.e. equality, defined the \(p\)-norm via the Lebesgue integral, and proved Hölder's inequality and Minkowski's inequality. This established the algebraic and metric structure of \(L^p\).

This chapter completes the picture in two stages. First, we prove the Riesz-Fischer theorem — that \(L^p\) is complete and hence a Banach space — after first developing a self-contained toolkit of convergence theorems for the Lebesgue integral (MCT, Fatou's lemma, and DCT). Second, we map the broader landscape of convergence modes for sequences of measurable functions — convergence in \(L^p\), almost everywhere, in measure, and uniformly almost everywhere — establishing their logical relationships and the canonical counterexamples (the traveling bump, the typewriter sequence) that prevent further implications. We close with applications to probability, Fourier analysis, and quantum mechanics.

We have established that \(L^p\) is a normed space. Completeness — the property that every Cauchy sequence converges within the space — is what elevates a normed space to a Banach space. In Completeness, we studied this property for general metric spaces. Now we prove it concretely for \(L^p\).

The proof relies on three fundamental convergence theorems from Lebesgue integration. We state them precisely here for reference, as they are the essential tools of the argument.

Toolkit from Lebesgue Integration

Conventions. Throughout this chapter, we work on a measure space \((\Omega, \mathcal{F}, \mu)\), and we inherit the convention from the previous chapter: \(\mathbb{F}\) denotes the scalar field \(\mathbb{R}\) or \(\mathbb{C}\), and measurable functions \(f : \Omega \to \mathbb{F}\) are \((\mathcal{F}, \mathcal{B}(\mathbb{F}))\)-measurable. A null set means an \(\mathcal{F}\)-measurable set of \(\mu\)-measure zero. No \(\sigma\)-finiteness or completeness of \(\mu\) is assumed unless stated.

The following three theorems govern the interchange of limits and integrals. They were introduced conceptually in Lebesgue Integration; we now state and prove them in the precise form needed for the completeness proof. The MCT proof uses only the supremum-of-simple-functions definition of the integral together with the continuity of measure from below; Fatou and DCT follow from MCT via the chain below.

Theorem: Monotone Convergence Theorem (MCT)

Let \((g_n)\) be a sequence of measurable functions satisfying \(0 \leq g_1(x) \leq g_2(x) \leq \cdots\) for a.e. \(x \in \Omega\). Set \(g(x) := \sup_n g_n(x)\), which is measurable as a pointwise supremum of measurable functions and satisfies \(g(x) = \lim_{n \to \infty} g_n(x)\) for a.e. \(x\) (namely, on the set where monotonicity holds). Then \[ \int_\Omega g \, d\mu \;=\; \lim_{n \to \infty} \int_\Omega g_n \, d\mu. \] In words: for nonnegative increasing sequences, the integral of the limit equals the limit of the integrals.

Proof:

Modifying each \(g_n\) on a null set does not change any integral, so we may assume the monotonicity \(0 \leq g_1 \leq g_2 \leq \cdots\) holds everywhere. Then \(g(x) = \lim_n g_n(x) = \sup_n g_n(x)\) is measurable as the pointwise supremum of measurable functions.

Easy direction (\(\leq\)): Since \(g_n \leq g\) pointwise, every simple function \(q \in S(g_n)\) also satisfies \(q \leq g\), i.e., \(S(g_n) \subseteq S(g)\). Hence \[ \int g_n \, d\mu \;=\; \sup_{q \in S(g_n)} \int q \, d\mu \;\leq\; \sup_{q \in S(g)} \int q \, d\mu \;=\; \int g \, d\mu \] for every \(n\). The sequence \(\bigl(\int g_n \, d\mu\bigr)\) is non-decreasing (same argument applied to \(g_n \leq g_{n+1}\)), so its limit exists in \([0, \infty]\) and \[ \lim_{n \to \infty} \int g_n \, d\mu \;\leq\; \int g \, d\mu. \]

Hard direction (\(\geq\)): We show \(\int q \, d\mu \leq \lim_n \int g_n \, d\mu\) for every simple \(q \in S(g)\); taking the supremum over such \(q\) then yields the desired inequality, by definition of \(\int g \, d\mu\).

Fix \(q = \sum_{i=1}^k a_i \, \chi_{A_i} \in S(g)\), with \(a_i \geq 0\) and \(\{A_i\}\) disjoint measurable. Fix \(\alpha \in (0, 1)\) and define \[ E_n \;=\; \{x \in \Omega : g_n(x) \geq \alpha \, q(x)\}. \] Each \(E_n\) is measurable: on \(\bigcup_i A_i\), \(q\) takes the constant value \(a_i\) on each \(A_i\), so \(E_n \cap A_i = \{g_n \geq \alpha a_i\} \cap A_i\); on the complement \(\Omega \setminus \bigcup_i A_i\), \(q\) vanishes and \(E_n\) contains this complement entirely (since \(g_n \geq 0\)). Hence \[ E_n \;=\; \Bigl(\bigcup_{i=1}^k \bigl(\{g_n \geq \alpha a_i\} \cap A_i\bigr)\Bigr) \cup \Bigl(\Omega \setminus \bigcup_{i=1}^k A_i\Bigr), \] a finite combination of measurable sets, hence measurable. The sequence is increasing because \(g_n\) is. We claim \(\bigcup_n E_n = \Omega\). Indeed, fix \(x \in \Omega\):

If \(q(x) = 0\), then \(g_n(x) \geq 0 = \alpha q(x)\) for every \(n\), so \(x \in E_1 \subseteq \bigcup E_n\).
If \(q(x) > 0\), then \(g(x) \geq q(x) > \alpha q(x)\) (strict, since \(\alpha < 1\)), and \(g_n(x) \uparrow g(x)\), so eventually \(g_n(x) \geq \alpha q(x)\), placing \(x\) in some \(E_n\).

On \(E_n\) we have \(g_n \geq \alpha q\), so \[ \int g_n \, d\mu \;\geq\; \int_{E_n} g_n \, d\mu \;\geq\; \alpha \int_{E_n} q \, d\mu \;=\; \alpha \sum_{i=1}^k a_i \, \mu(A_i \cap E_n). \] For each \(i\), the sets \(A_i \cap E_n\) increase to \(A_i\) (since \(E_n \uparrow \Omega\)), so by continuity from below, \(\mu(A_i \cap E_n) \uparrow \mu(A_i)\). The sum has finitely many terms, so we may pass to the limit term-by-term: \[ \lim_{n \to \infty} \int g_n \, d\mu \;\geq\; \alpha \sum_{i=1}^k a_i \, \mu(A_i) \;=\; \alpha \int q \, d\mu. \] This holds for every \(\alpha \in (0, 1)\); letting \(\alpha \to 1^-\) gives \(\lim_n \int g_n \, d\mu \geq \int q \, d\mu\). Taking the supremum over \(q \in S(g)\) yields \(\lim_n \int g_n \, d\mu \geq \int g \, d\mu\), completing the proof.

Theorem: Fatou's Lemma

Let \((g_n)\) be a sequence of measurable functions satisfying \(g_n(x) \geq 0\) for a.e. \(x\). Then \(\liminf_n g_n\) is measurable (as a pointwise countable sup of countable infs of measurable functions), and \[ \int_\Omega \liminf_{n \to \infty} g_n \, d\mu \;\leq\; \liminf_{n \to \infty} \int_\Omega g_n \, d\mu. \] In words: the integral of the \(\liminf\) is bounded above by the \(\liminf\) of the integrals. The inequality can be strict — passing limits through integrals can "lose mass."

Proof:

As in MCT, modifying each \(g_n\) on the null set \(N_n = \{g_n < 0\}\) does not affect any integral; on the complement of \(N = \bigcup_n N_n\) (itself null), all \(g_n\) are non-negative. Working modulo \(N\), we may assume \(g_n \geq 0\) everywhere. For each \(n\), set \(h_n(x) = \inf_{k \geq n} g_k(x)\). Then \(0 \leq h_1 \leq h_2 \leq \cdots\) (the infimum over a smaller index set is larger), \(h_n\) is measurable as the pointwise infimum of countably many measurable functions, and by definition \[ \lim_{n \to \infty} h_n(x) \;=\; \sup_n \inf_{k \geq n} g_k(x) \;=\; \liminf_{n \to \infty} g_n(x). \] Applying MCT to \((h_n)\): \[ \int \liminf_{n \to \infty} g_n \, d\mu \;=\; \lim_{n \to \infty} \int h_n \, d\mu. \] Since \(h_n \leq g_n\) pointwise (the infimum is at most each member), monotonicity of the integral gives \(\int h_n \, d\mu \leq \int g_n \, d\mu\) for every \(n\). Taking \(\liminf\) on both sides and using \(\lim \int h_n \, d\mu = \liminf \int h_n \, d\mu\) (the limit exists), \[ \int \liminf_{n \to \infty} g_n \, d\mu \;=\; \liminf_{n \to \infty} \int h_n \, d\mu \;\leq\; \liminf_{n \to \infty} \int g_n \, d\mu. \]

Theorem: Dominated Convergence Theorem (DCT)

Let \((f_n)\) be a sequence of measurable functions such that \(f_n(x) \to f(x)\) for a.e. \(x\). Suppose there exists a dominating function \(h \in L^1(\mu)\) with \(|f_n(x)| \leq h(x)\) for a.e. \(x\) and all \(n\). Then \(f \in L^1(\mu)\) and \[ \lim_{n \to \infty} \int_\Omega f_n \, d\mu \;=\; \int_\Omega f \, d\mu. \] In words: under pointwise convergence with a uniform integrable bound, limits and integrals commute.

Proof:

Modifying on a null set, assume the convergence \(f_n \to f\) and the bound \(|f_n| \leq h\) hold everywhere. Then \(f\) is measurable as the pointwise limit of measurable functions, and \(|f| \leq h\) everywhere gives \(\int |f| \, d\mu \leq \int h \, d\mu < \infty\), hence \(f \in L^1\). For brevity we treat the real-valued case; the complex case follows by splitting \(f_n = \mathrm{Re}(f_n) + i\,\mathrm{Im}(f_n)\) and applying the real result to each part, noting that \(|\mathrm{Re}(f_n)|, |\mathrm{Im}(f_n)| \leq |f_n| \leq h\) and both parts converge pointwise.

The sequences \(h + f_n\) and \(h - f_n\) are non-negative (since \(|f_n| \leq h\)) and converge pointwise to \(h + f\) and \(h - f\) respectively. Note that \(|f_n| \leq h \in L^1\) gives \(\bigl|\int f_n \, d\mu\bigr| \leq \int h \, d\mu < \infty\), so the real sequence \(\bigl(\int f_n \, d\mu\bigr)\) is bounded, and both \(\liminf\) and \(\limsup\) are finite real numbers — in particular, the identity \(\liminf(-a_n) = -\limsup(a_n)\) applies without \(\infty - \infty\) ambiguity. Apply Fatou's lemma to each: \[ \int (h + f) \, d\mu \;=\; \int \liminf_{n} (h + f_n) \, d\mu \;\leq\; \liminf_{n} \int (h + f_n) \, d\mu \;=\; \int h \, d\mu + \liminf_{n} \int f_n \, d\mu, \] \[ \int (h - f) \, d\mu \;=\; \int \liminf_{n} (h - f_n) \, d\mu \;\leq\; \liminf_{n} \int (h - f_n) \, d\mu \;=\; \int h \, d\mu - \limsup_{n} \int f_n \, d\mu, \] where the last equality uses \(\liminf(-a_n) = -\limsup(a_n)\). On each left-hand side, expand \(\int (h \pm f) \, d\mu = \int h \, d\mu \pm \int f \, d\mu\) (integral linearity, valid since \(h, f \in L^1\)) and subtract \(\int h \, d\mu\) (finite, hence cancellable): \[ \int f \, d\mu \;\leq\; \liminf_{n} \int f_n \, d\mu, \qquad -\int f \, d\mu \;\leq\; -\limsup_{n} \int f_n \, d\mu. \] The second rearranges to \(\limsup_n \int f_n \, d\mu \leq \int f \, d\mu\). Combining with the first, \[ \limsup_{n} \int f_n \, d\mu \;\leq\; \int f \, d\mu \;\leq\; \liminf_{n} \int f_n \, d\mu. \] Since \(\liminf \leq \limsup\) always, all three quantities coincide; the common value is \(\lim_n \int f_n \, d\mu\), which equals \(\int f \, d\mu\).

The MCT requires monotonicity but imposes no integrability bound — it even allows the limit to be infinite. Fatou's lemma relaxes monotonicity to mere nonnegativity, at the cost of an inequality rather than equality. The DCT trades the nonnegativity assumption for a dominating function, recovering full equality. Together, these three tools form the backbone of measure-theoretic analysis, and the chain of derivations above (MCT \(\Rightarrow\) Fatou \(\Rightarrow\) DCT) shows that once MCT is established from the integral definition and the continuity of measure, the remaining two follow.

The Theorem

Theorem: Riesz-Fischer

For \(1 \leq p \leq \infty\), the space \(L^p(\Omega, \mathcal{F}, \mu)\) is complete. That is, \(L^p\) is a Banach space.

The proof splits into two cases of quite different character. For \(1 \leq p < \infty\), we develop the argument in five clearly delineated steps, each carrying its own role: extracting a fast subsequence, building a dominating function via MCT, obtaining a pointwise limit, upgrading to \(L^p\) convergence via DCT, and lifting from the subsequence to the full sequence. This same five-step pattern reappears in virtually every completeness proof in modern analysis (Sobolev spaces, Besov spaces, Hardy spaces), making it one of the most important proof patterns to internalize. For \(p = \infty\), the convergence theorems of Lebesgue integration are not needed: a simpler uniform-convergence argument suffices. We treat the two cases in turn.

Proof for \(1 \leq p < \infty\)

The challenge: We are given a Cauchy sequence \((f_n)\) in \(L^p\) and must produce a limit function \(f \in L^p\) with \(\|f_n - f\|_p \to 0\). The difficulty is that \(L^p\) convergence is an integral condition — it says nothing directly about pointwise behavior. We need to bridge from integral estimates to pointwise convergence and back.

Reduction to an equivalent criterion. Rather than working directly with an arbitrary Cauchy sequence, we use the following reformulation of completeness for normed spaces.

Lemma: Absolute Summability Criterion

A normed space \((\mathcal{X}, \|\cdot\|)\) is complete if and only if every absolutely summable series converges — that is, \(\sum_{k=1}^{\infty} \|h_k\| < \infty\) implies that the partial sums \(\sum_{k=1}^{N} h_k\) converge in norm to some element of \(\mathcal{X}\).

Proof:

(\(\Rightarrow\)) Assume \(\mathcal{X}\) is complete and \(\sum \|h_k\| < \infty\). For \(M > N\), the partial sums satisfy \(\bigl\|\sum_{k=1}^{M} h_k - \sum_{k=1}^{N} h_k\bigr\| = \bigl\|\sum_{k=N+1}^{M} h_k\bigr\| \leq \sum_{k=N+1}^{M} \|h_k\| \to 0\) as \(N, M \to \infty\), since the tail of a convergent series tends to zero. Hence the partial sums form a Cauchy sequence and converge by completeness.

(\(\Leftarrow\)) Assume every absolutely summable series converges. Let \((x_n)\) be a Cauchy sequence in \(\mathcal{X}\). Extract a fast subsequence \(x_{n_1}, x_{n_2}, \ldots\) with \(\|x_{n_{k+1}} - x_{n_k}\| < 2^{-k}\). Setting \(h_k = x_{n_{k+1}} - x_{n_k}\), we have \(\sum \|h_k\| < \sum 2^{-k} = 1 < \infty\), so by hypothesis the series \(\sum h_k\) converges. Its partial sums telescope to \(x_{n_{K+1}} - x_{n_1}\), so the subsequence \((x_{n_k})\) converges to some \(x\). To upgrade convergence of the subsequence to convergence of the full sequence, we use the standard \(\epsilon/2\) argument: given \(\epsilon > 0\), choose \(N_0\) so that \(\|x_m - x_n\| < \epsilon/2\) for \(m, n \geq N_0\), and \(K\) so that \(n_K \geq N_0\) and \(\|x_{n_K} - x\| < \epsilon/2\); then for all \(n \geq N_0\), \(\|x_n - x\| \leq \|x_n - x_{n_K}\| + \|x_{n_K} - x\| < \epsilon\). Hence \(x_n \to x\). (This same lifting argument reappears as Step 5 of the Riesz-Fischer proof below.)

This criterion is often easier to work with than the Cauchy sequence definition directly, because summability conditions mesh naturally with the MCT. The proof below is the concrete implementation of this criterion for \(L^p\): Step 1 reduces to an absolutely summable sequence of differences; Steps 2-4 establish its convergence; Step 5 lifts back to the full sequence.

Proof:

Step 1 — Extract a fast subsequence.
Since \((f_n)\) is Cauchy, for each \(k \geq 1\) there exists a threshold index \(N_k\) such that \(\|f_m - f_n\|_p < 2^{-k}\) for all \(m, n \geq N_k\); replacing \(N_k\) by \(\max(N_1, \dots, N_k)\) if necessary, we may assume \(N_1 \leq N_2 \leq \cdots\). Construct \((n_k)\) inductively: pick \(n_1 \geq N_1\), and given \(n_k\), pick \(n_{k+1} > n_k\) with \(n_{k+1} \geq N_{k+1}\) (always possible since the Cauchy thresholds impose only a lower bound on indices). Since \(n_k, n_{k+1} \geq N_k\) (using the monotonicity of \((N_k)\)), the resulting subsequence \((f_{n_k})\) is strictly index-increasing and satisfies \[ \|f_{n_{k+1}} - f_{n_k}\|_p \;<\; 2^{-k} \quad \text{for all } k \geq 1. \] In particular, the "differences" \(h_k = f_{n_{k+1}} - f_{n_k}\) satisfy \(\sum_{k=1}^{\infty} \|h_k\|_p < \sum_{k=1}^{\infty} 2^{-k} = 1 < \infty\).

Step 2 — Construct a dominating function via MCT.
Define the partial sums of absolute values: \[ G_N(x) \;=\; |f_{n_1}(x)| + \sum_{k=1}^{N} |f_{n_{k+1}}(x) - f_{n_k}(x)|. \] The sequence \((G_N)\) is nonnegative, pointwise increasing, and measurable (as finite sums of measurable functions). Since \(t \mapsto t^p\) is continuous and nondecreasing on \([0, \infty)\) for any \(p > 0\) (a fortiori for \(p \geq 1\)), the sequence \((G_N^p)\) is also nonnegative, pointwise increasing, and measurable. Applying the Monotone Convergence Theorem to \((G_N^p)\): \[ \int_\Omega G^p \, d\mu \;=\; \lim_{N \to \infty} \int_\Omega G_N^p \, d\mu, \] where \(G(x) = \lim_{N \to \infty} G_N(x)\). Taking \(p\)-th roots (the continuous map \(t \mapsto t^{1/p}\) on \([0, \infty]\) preserves limits), \(\|G\|_p = \lim_{N \to \infty} \|G_N\|_p\). Each \(|h_k|\) lies in \(L^p\) (since \(h_k \in L^p\) and \(\||h_k|\|_p = \|h_k\|_p\)), so applying Minkowski's inequality finitely many times gives \[ \|G_N\|_p \;\leq\; \|f_{n_1}\|_p + \sum_{k=1}^{N} \|h_k\|_p \;\leq\; \|f_{n_1}\|_p + 1. \] Therefore \(\|G\|_p \leq \|f_{n_1}\|_p + 1 < \infty\), which means \(G \in L^p\). In particular, since \(\int G^p \, d\mu < \infty\) and \(G \geq 0\), we have \(G(x) < \infty\) for a.e. \(x\).

Step 3 — Obtain a pointwise limit.
Fix any \(x\) with \(G(x) < \infty\) (which holds for a.e. \(x\), as established in Step 2). From the definition of \(G\) and \(G_N\), \[ \sum_{k=1}^{\infty} |f_{n_{k+1}}(x) - f_{n_k}(x)| \;=\; G(x) - |f_{n_1}(x)| \;<\; \infty, \] so the series \(\sum_{k=1}^{\infty} (f_{n_{k+1}}(x) - f_{n_k}(x))\) is absolutely convergent in \(\mathbb{F}\), hence convergent (since \(\mathbb{F}\) is complete). By telescoping, \(\sum_{k=1}^{K} h_k(x) = f_{n_{K+1}}(x) - f_{n_1}(x)\), so \(f_{n_{K+1}}(x) = f_{n_1}(x) + \sum_{k=1}^{K} h_k(x)\) converges as \(K \to \infty\); reindexing, the subsequence \((f_{n_K}(x))\) itself converges. Define \[ f(x) \;=\; \lim_{K \to \infty} f_{n_K}(x). \] (On the measure-zero set where \(G(x) = \infty\), we set \(f(x) = 0\), say; since this set is null, the resulting a.e.-equivalence class is independent of the choice.)

We show \(|f_{n_K}(x)| \leq G(x)\) for every \(K\) and every \(x\) with \(G(x) < \infty\). Indeed, by the same telescoping plus the triangle inequality, \[ |f_{n_K}(x)| \;=\; \biggl| f_{n_1}(x) + \sum_{k=1}^{K-1} h_k(x) \biggr| \;\leq\; |f_{n_1}(x)| + \sum_{k=1}^{K-1} |h_k(x)| \;\leq\; |f_{n_1}(x)| + \sum_{k=1}^{\infty} |h_k(x)| \;=\; G(x). \] Passing to the limit \(K \to \infty\) — using \(f_{n_K}(x) \to f(x)\) and continuity of \(|\cdot|\) — gives \(|f(x)| \leq G(x)\) for a.e. \(x\) (namely, on \(\{G < \infty\}\)). The function \(f\) is measurable as the a.e.-pointwise limit of measurable functions (extended by \(0\) on the null set where \(G = \infty\)), and \(G \in L^p\) together with \(|f|^p \leq G^p\) a.e. yields, by monotonicity of the integral, \(\int |f|^p \, d\mu \leq \int G^p \, d\mu < \infty\), so \(f \in L^p\).

Step 4 — Prove \(L^p\) convergence of the subsequence.
From Step 3, \(f_{n_K}(x) \to f(x)\) for a.e. \(x\); since \(t \mapsto |t|^p\) is continuous on \(\mathbb{F}\), this gives \(|f_{n_K} - f|^p \to 0\) a.e. Furthermore, using \(|f_{n_K}|, |f| \leq G\) a.e. (Step 3), \[ |f_{n_K}(x) - f(x)|^p \;\leq\; \bigl(|f_{n_K}(x)| + |f(x)|\bigr)^p \;\leq\; (G(x) + G(x))^p \;=\; (2G(x))^p, \] and \(\int (2G)^p \, d\mu = 2^p \int G^p \, d\mu < \infty\), so \((2G)^p \in L^1\) is an admissible dominator. By the Dominated Convergence Theorem: \[ \|f_{n_K} - f\|_p^p \;=\; \int |f_{n_K} - f|^p \, d\mu \;\to\; 0 \quad \text{as } K \to \infty. \] Since \(t \mapsto t^{1/p}\) is continuous on \([0, \infty)\) with \(t^{1/p} = 0 \iff t = 0\), this gives \(\|f_{n_K} - f\|_p \to 0\).

Step 5 — Lift from the subsequence to the full sequence.
(This is the same lifting argument as in Lemma: Absolute Summability Criterion (\(\Leftarrow\)), applied concretely to \((f_n)\).) We now know \(f_{n_K} \to f\) in \(L^p\). To show \(f_n \to f\) in \(L^p\), we use the fact that \((f_n)\) is Cauchy. Fix \(\epsilon > 0\) and choose \(N_0\) such that \(\|f_m - f_n\|_p < \epsilon/2\) for all \(m, n \geq N_0\). Since \(n_K \to \infty\) and \(\|f_{n_K} - f\|_p \to 0\), we can pick a single \(K\) satisfying both \(n_K \geq N_0\) and \(\|f_{n_K} - f\|_p < \epsilon/2\). Then for all \(n \geq N_0\), the Cauchy condition applies to the pair \((n, n_K)\), giving \[ \|f_n - f\|_p \;\leq\; \|f_n - f_{n_K}\|_p + \|f_{n_K} - f\|_p \;<\; \frac{\epsilon}{2} + \frac{\epsilon}{2} \;=\; \epsilon. \] Hence \(f_n \to f\) in \(L^p\), completing the proof for \(1 \leq p < \infty\).

The Case \(p = \infty\)

For \(L^\infty\), the argument is simpler and does not require the MCT. Since \(L^\infty\) is closed under subtraction (as a vector space), each difference \(f_m - f_n\) lies in \(L^\infty\), so by Lemma: Essential Supremum Is Attained applied to \(f_m - f_n\), for each pair of indices \(m, n\), the inequality \(|f_m(x) - f_n(x)| \leq \|f_m - f_n\|_\infty\) holds for a.e. \(x\), outside an exceptional null set \(E_{m,n}\); likewise, for each \(n\), the inequality \(|f_n(x)| \leq \|f_n\|_\infty\) holds outside a null set \(F_n\). Taking the countable union \(E = \bigl(\bigcup_{m,n \in \mathbb{N}} E_{m,n}\bigr) \cup \bigl(\bigcup_n F_n\bigr)\) (still a null set, as a countable union of null sets), we have \[ |f_m(x) - f_n(x)| \;\leq\; \|f_m - f_n\|_\infty \quad \text{and} \quad |f_n(x)| \;\leq\; \|f_n\|_\infty \quad \text{for all } x \notin E \text{ and all } m, n. \] If \((f_n)\) is Cauchy in \(L^\infty\), the right side tends to zero as \(m, n \to \infty\), so \((f_n(x))\) is a Cauchy sequence in \(\mathbb{F}\) for every \(x \notin E\). Since \(\mathbb{F}\) is complete, \(f_n(x) \to f(x)\) pointwise on \(\Omega \setminus E\). Defining \(f(x) = 0\) on \(E\), the function \(f\) is measurable as the pointwise limit of measurable functions on \(\Omega \setminus E\), extended by zero on a null set.

Moreover, the convergence is uniform outside \(E\): for any \(\epsilon > 0\), choose \(N\) such that \(\|f_m - f_n\|_\infty < \epsilon\) for \(m, n \geq N\); then for \(x \notin E\) and \(m \geq N\), letting \(n \to \infty\) in \(|f_m(x) - f_n(x)| \leq \|f_m - f_n\|_\infty < \epsilon\) gives \(|f_m(x) - f(x)| \leq \epsilon\). This shows two things at once. First, \(f \in L^\infty\): by the triangle inequality, for \(x \notin E\), \[ |f(x)| \;\leq\; |f_N(x)| + |f(x) - f_N(x)| \;\leq\; \|f_N\|_\infty + \epsilon, \] where we used \(F_N \subseteq E\) for the first term (giving \(|f_N(x)| \leq \|f_N\|_\infty\) on \(\Omega \setminus E\)) and the uniform bound \(|f - f_N| \leq \epsilon\) on \(\Omega \setminus E\) (established above) for the second. This is an admissible essential bound for \(f\), giving \(\|f\|_\infty \leq \|f_N\|_\infty + \epsilon < \infty\), so \(f \in L^\infty\). Second, the bound \(|f_m - f| \leq \epsilon\) on \(\Omega \setminus E\) (a null-set complement) makes \(\epsilon\) an admissible essential bound for \(|f_m - f|\), so \(\|f_m - f\|_\infty \leq \epsilon\) for all \(m \geq N\). Since \(\epsilon\) was arbitrary, \(\|f_n - f\|_\infty \to 0\), completing the proof for \(p = \infty\).

Why the Proof Architecture Matters

The five-step pattern above — extract a fast subsequence, build a dominating function, obtain pointwise convergence, apply DCT, lift to the full sequence — is the standard template for proving completeness of function spaces throughout analysis. Spaces built over \(L^p\) — notably Sobolev spaces \(W^{k,p}\), which arise in the study of partial differential equations and physics-informed neural networks — inherit completeness by applying the Riesz-Fischer argument to each derivative component and reassembling. Recognizing this architecture once equips you to deploy it, directly or as a building block, wherever function space completeness is needed.

An Important Corollary

The Riesz-Fischer proof yields more than just completeness. Step 4 produced a subsequence \((f_{n_K})\) that converges to \(f\) both in \(L^p\) and pointwise a.e. This is worth recording as an independent result:

Corollary: Subsequence with Pointwise Convergence

If \(f_n \to f\) in \(L^p\) (\(1 \leq p \leq \infty\)), then there exists a subsequence \((f_{n_k})\) such that \(f_{n_k}(x) \to f(x)\) for a.e. \(x\).

Proof:

Case \(1 \leq p < \infty\): Since \((f_n)\) converges in \(L^p\), it is Cauchy. Apply Steps 1-3 of the Riesz-Fischer proof to extract a subsequence \((f_{n_k})\) and produce an a.e. pointwise limit \(\tilde f \in L^p\) with \(f_{n_k}(x) \to \tilde f(x)\) for a.e. \(x\); Step 4 further gives \(f_{n_k} \to \tilde f\) in \(L^p\). On the other hand, by hypothesis \(f_n \to f\) in \(L^p\), so the subsequence also converges in \(L^p\) to \(f\). The \(L^p\) limit is unique (if \(g_n \to g\) and \(g_n \to g'\) in \(L^p\), then \(\|g - g'\|_p \leq \|g - g_n\|_p + \|g_n - g'\|_p \to 0\), so \(\|g - g'\|_p = 0\), i.e., \(g = g'\) a.e.), so \(\tilde f = f\) a.e. Therefore \(f_{n_k}(x) \to f(x)\) for a.e. \(x\), as claimed.

Case \(p = \infty\): The argument in the Case \(p = \infty\) section above (applied to \((f_n)\), which is Cauchy because it converges) directly produces a function \(\tilde f\) with \(f_n(x) \to \tilde f(x)\) for every \(x \notin E\), where \(E\) is a null set. The same uniqueness argument as above identifies \(\tilde f = f\) a.e. Hence the full sequence — not merely a subsequence — converges a.e. to \(f\), and the corollary holds trivially.

This corollary connects \(L^p\) convergence (an integral condition) back to pointwise behavior (a condition on individual points). As we will see in the next section, the converse does not hold: pointwise a.e. convergence alone does not imply \(L^p\) convergence, and \(L^p\) convergence does not imply full pointwise a.e. convergence (only a subsequence is guaranteed).

Convergence in \(L^p\) — A Hierarchy of Modes

With \(L^p\) established as a Banach space, we can study convergence within it. But \(L^p\) convergence is only one of several natural notions of convergence for sequences of measurable functions. Understanding how these notions relate to one another is essential for working effectively with function spaces — and for bridging to probability theory, where the same hierarchy reappears under different names.

Four Notions of Convergence

Let \((f_n)\) be a sequence of measurable \(\mathbb{F}\)-valued functions on \((\Omega, \mathcal{F}, \mu)\), and let \(f\) be a measurable \(\mathbb{F}\)-valued function. We consider four modes of convergence.

Definition: \(L^p\) Convergence

For \(1 \leq p \leq \infty\), we say \(f_n \to f\) in \(L^p\) if \(\|f_n - f\|_p \to 0\) as \(n \to \infty\).

Definition: Pointwise Almost-Everywhere Convergence

We say \(f_n \to f\) almost everywhere (a.e.) if there exists a measurable null set \(E\) (i.e., \(\mu(E) = 0\)) such that \(f_n(x) \to f(x)\) for every \(x \notin E\).

Definition: Convergence in Measure

We say \(f_n \to f\) in measure if, for every \(\epsilon > 0\), \[ \mu\bigl(\{x \in \Omega : |f_n(x) - f(x)| > \epsilon\}\bigr) \;\to\; 0 \quad \text{as } n \to \infty. \] The set \(\{|f_n - f| > \epsilon\}\) is measurable as the preimage of \((\epsilon, \infty)\) under the measurable function \(|f_n - f|\), so the measure on the left is well-defined.

Definition: Uniform Almost-Everywhere Convergence

We say \(f_n \to f\) uniformly almost everywhere if there exists a null set \(E\) such that \(\sup_{x \notin E} |f_n(x) - f(x)| \to 0\).

This definition coincides with \(L^\infty\) convergence: \(f_n \to f\) uniformly a.e. if and only if \(\|f_n - f\|_\infty \to 0\).

Proof of equivalence:

The forward direction is immediate, since \(\sup_{x \notin E}|f_n - f|\) is an admissible essential bound for \(|f_n - f|\), so \(\|f_n - f\|_\infty \leq \sup_{x \notin E}|f_n - f| \to 0\). The reverse uses Lemma: Essential Supremum Is Attained: for each \(n\), \(|f_n - f| \leq \|f_n - f\|_\infty\) outside a null set \(E_n\); setting \(E = \bigcup_n E_n\) (still null as a countable union), the inequality \(|f_n(x) - f(x)| \leq \|f_n - f\|_\infty\) holds for every \(x \notin E\) and every \(n\), giving \(\sup_{x \notin E}|f_n - f| \leq \|f_n - f\|_\infty \to 0\).

Among these, uniform a.e. convergence is the strongest (it is equivalent to \(L^\infty\) convergence). Convergence in measure sits at the bottom of the hierarchy in a qualified sense: it is implied by \(L^p\) convergence for any \(p\), and by pointwise a.e. convergence on finite measure spaces, but a.e. convergence can fail to imply it on infinite measure spaces (see the counterexamples below). The relation between \(L^p\) convergence (\(1 \leq p < \infty\)) and pointwise a.e. convergence is more subtle — neither implies the other in general — and is the focus of the implication map below.

The Implication Map

Theorem: Relations Between Modes of Convergence

The following implications hold:

For \(1 \leq p \leq \infty\), \(L^p\) convergence \(\Rightarrow\) convergence in measure. (Proof below.)
For \(1 \leq p \leq \infty\), \(L^p\) convergence \(\Rightarrow\) some subsequence converges a.e. (This is the corollary Subsequence with Pointwise Convergence established during the Riesz-Fischer proof.)
For \(1 \leq p < \infty\), pointwise a.e. convergence + domination by \(h \in L^p\) \(\Rightarrow\) \(L^p\) convergence. (This is the \(L^p\)-Dominated Convergence Theorem, stated and proved below.)
Convergence in measure \(\Rightarrow\) some subsequence converges a.e. (Proof below.)

No other implications hold in general. Concretely:

\(L^p\) convergence does not imply pointwise a.e. convergence — the traveling bump counterexample below exhibits a sequence with \(\|f_n\|_p \to 0\) yet \(f_n(x)\) divergent for every \(x\).
Pointwise a.e. convergence does not imply \(L^p\) convergence (without domination). Example: \(f_n = n \chi_{[0, 1/n]}\) on \([0,1]\) satisfies \(f_n(x) \to 0\) a.e. but \(\|f_n\|_1 = 1\) for all \(n\).
Pointwise a.e. convergence does not imply convergence in measure on infinite measure spaces. Example: on \(\mathbb{R}\) with Lebesgue measure, \(f_n = \chi_{[n, n+1]}\) satisfies \(f_n(x) \to 0\) for every \(x\), but \(\mu(\{|f_n| > 1/2\}) = 1\) for all \(n\).

On finite measure spaces (such as probability spaces), the picture tightens: the third counterexample above is ruled out, and we recover an additional implication.

Proposition: A.E. Convergence Implies Convergence in Measure on Finite Measure Spaces

If \(\mu(\Omega) < \infty\) and \(f_n \to f\) a.e., then \(f_n \to f\) in measure.

Proof:

Let \(N\) be the null set outside which \(f_n(x) \to f(x)\). Fix \(\epsilon > 0\) and define \[ B_n \;=\; \bigcup_{k \geq n} \{|f_k - f| > \epsilon\}. \] The sequence \((B_n)\) is decreasing (\(B_{n+1} \subseteq B_n\)). We claim \(\bigcap_n B_n \subseteq N\): for \(x \notin N\), there exists \(n_0 = n_0(x, \epsilon)\) such that \(|f_k(x) - f(x)| \leq \epsilon\) for all \(k \geq n_0\), so \(x \notin B_{n_0}\) and hence \(x \notin \bigcap_n B_n\). Therefore \(\mu\bigl(\bigcap_n B_n\bigr) \leq \mu(N) = 0\). Since \(\mu(B_1) \leq \mu(\Omega) < \infty\), continuity of measure from above applies, giving \(\mu(B_n) \to 0\). Since \(\{|f_n - f| > \epsilon\} \subseteq B_n\), we conclude \(\mu(\{|f_n - f| > \epsilon\}) \to 0\), i.e., \(f_n \to f\) in measure.

Proof of (1) — \(1 \leq p < \infty\) case:

This follows from the Chebyshev-Markov inequality, which we briefly justify here for completeness: for any non-negative measurable \(g\) and any \(t > 0\), \[ \int_\Omega g \, d\mu \;\geq\; \int_{\{g > t\}} g \, d\mu \;\geq\; t \cdot \mu(\{g > t\}), \] so \(\mu(\{g > t\}) \leq t^{-1} \int g \, d\mu\). Applying this with \(g = |f_n - f|^p\) and \(t = \epsilon^p\), \[ \mu\bigl(\{|f_n - f| > \epsilon\}\bigr) \;=\; \mu\bigl(\{|f_n - f|^p > \epsilon^p\}\bigr) \;\leq\; \frac{1}{\epsilon^p} \int_\Omega |f_n - f|^p \, d\mu \;=\; \frac{\|f_n - f\|_p^p}{\epsilon^p}. \] If \(\|f_n - f\|_p \to 0\), the right side tends to zero, so \(f_n \to f\) in measure.

The case \(p = \infty\): If \(\|f_n - f\|_\infty \to 0\), then by Lemma: Essential Supremum Is Attained, for any \(\epsilon > 0\), once \(n\) is large enough that \(\|f_n - f\|_\infty < \epsilon\), the set \(\{|f_n - f| > \epsilon\}\) is contained in the null exceptional set of the lemma. Hence \(\mu(\{|f_n - f| > \epsilon\}) = 0\) for all sufficiently large \(n\), which is even stronger than required.

Proof of (4) — convergence in measure \(\Rightarrow\) subsequence converges a.e.:

Suppose \(f_n \to f\) in measure. For each \(k \geq 1\), \(\mu(\{|f_n - f| > 2^{-k}\}) \to 0\) as \(n \to \infty\), so there exists \(N_k\) with \(\mu(\{|f_n - f| > 2^{-k}\}) < 2^{-k}\) for all \(n \geq N_k\); replacing \(N_k\) by \(\max(N_1, \dots, N_k)\) if necessary, we may assume \(N_1 \leq N_2 \leq \cdots\). Pick \(n_1 \geq N_1\), and given \(n_k\), pick \(n_{k+1} > n_k\) with \(n_{k+1} \geq N_{k+1}\). Then \(n_k \geq N_k\) for every \(k\), so \[ \mu(A_k) \;<\; 2^{-k}, \qquad \text{where } A_k = \{|f_{n_k} - f| > 2^{-k}\}. \] Define the "tail" sets \(B_m = \bigcup_{k \geq m} A_k\). Writing \(B_m\) as a disjoint union \(B_m = \bigcup_{k \geq m} \bigl(A_k \setminus \bigcup_{j < k} A_j\bigr)\) (each piece a subset of \(A_k\)) and applying countable additivity with monotonicity yields the \(\sigma\)-subadditivity bound \(\mu(B_m) \leq \sum_{k \geq m} \mu(A_k) < \sum_{k \geq m} 2^{-k} = 2^{-m+1}\), which tends to \(0\) as \(m \to \infty\). Set \(B = \bigcap_{m=1}^\infty B_m\); since \(\mu(B) \leq \mu(B_m)\) for every \(m\), we have \(\mu(B) = 0\), so \(B\) is null.

For \(x \notin B\), there exists \(m_0\) (depending on \(x\)) with \(x \notin B_{m_0}\), i.e., \(x \notin A_k\) for every \(k \geq m_0\); equivalently, \(|f_{n_k}(x) - f(x)| \leq 2^{-k}\) for every \(k \geq m_0\), so \(f_{n_k}(x) \to f(x)\). This holds for every \(x \notin B\), giving \(f_{n_k} \to f\) a.e.

The Traveling Bump: Why \(L^p\) Does Not Imply A.E.

Counterexample (\(1 \leq p < \infty\)):

Consider the interval \([0, 1]\) with Lebesgue measure. We construct a sequence of indicator functions that converges to zero in \(L^p\) but does not converge at any point.

Enumerate the dyadic-style intervals in blocks: the \(j\)-th block consists of the \(j\) intervals \([0, 1/j], [1/j, 2/j], \ldots, [(j-1)/j, 1]\), each of width \(1/j\). (The intervals within a block share endpoints at the points \(k/j\), \(k = 1, \ldots, j-1\); this overlap occurs on a set of measure zero and does not affect any of the \(L^p\) computations below.) Listing the blocks consecutively for \(j = 1, 2, 3, \ldots\) gives the sequence \[ \underbrace{[0,1]}_{j=1},\;\; \underbrace{[0, \tfrac{1}{2}],\, [\tfrac{1}{2}, 1]}_{j=2},\;\; \underbrace{[0, \tfrac{1}{3}],\, [\tfrac{1}{3}, \tfrac{2}{3}],\, [\tfrac{2}{3}, 1]}_{j=3},\;\; \underbrace{[0, \tfrac{1}{4}], \ldots}_{j=4},\;\; \ldots \] Let \(f_n = \chi_{I_n}\) where \(I_n\) is the \(n\)-th interval. The \(j\)-th block ends at index \(j(j+1)/2\), so an index \(n\) belongs to block \(j\) precisely when \(j(j-1)/2 < n \leq j(j+1)/2\); in particular, \(j \to \infty\) as \(n \to \infty\), and the corresponding interval has width \(|I_n| = 1/j\). Therefore \(\|f_n\|_p = |I_n|^{1/p} = j^{-1/p} \to 0\), so \(f_n \to 0\) in \(L^p\).

However, fix any \(x \in [0, 1]\). Within the \(j\)-th block, \(x\) lies in at least one of the \(j\) intervals (the block tiles \([0,1]\), with at most two intervals overlapping at the rational endpoints \(k/j\)), so \(f_n(x) = 1\) for at least one \(n\) in the block. For \(j \geq 2\), \(x\) lies in at most two intervals of the block, so \(f_n(x) = 0\) for at least \(j - 2 \geq 0\) values of \(n\) in the block — and \(j - 2 \geq 1\) for \(j \geq 3\). Letting \(j \to \infty\), the value \(1\) is attained infinitely often and the value \(0\) is attained infinitely often. Therefore \(f_n(x)\) fails to converge at every \(x \in [0, 1]\); the sequence does not converge a.e. — in fact, it does not converge at any point at all.

This counterexample shows that \(L^p\) convergence is fundamentally an average condition: it says the integrated \(p\)-th power of the difference is small, but it does not control the pointwise behavior at any given point. The Riesz-Fischer corollary guarantees only that a subsequence converges pointwise a.e. — the full sequence may oscillate wildly at each point.

The \(L^p\) Dominated Convergence Theorem

The standard DCT gives conditions under which pointwise convergence implies \(L^1\) convergence. We now state the natural \(L^p\) generalization, whose proof applies the standard DCT to the sequence \(|f_n - f|^p\) with a dominator constructed from \(h\).

Theorem: \(L^p\)-Dominated Convergence

Let \(1 \leq p < \infty\). Suppose \(f_n \to f\) a.e., and there exists \(h \in L^p\) such that \(|f_n(x)| \leq h(x)\) for a.e. \(x\) and all \(n\). Then \(f \in L^p\) and \[ \|f_n - f\|_p \;\to\; 0 \quad \text{as } n \to \infty. \]

Proof:

Modifying on a null set, both \(|f_n| \leq h\) and \(f_n \to f\) hold outside a single null set (the union of the individual null sets is null as a countable union). Define \(f\) to be the a.e. pointwise limit on the good set, extended by zero on the exceptional null set; then \(f\) is measurable as the pointwise limit of measurable functions on the good set. On the good set, \(|f(x)| = \lim_n |f_n(x)| \leq h(x)\), so \(|f| \leq h\) a.e. Since \(p \geq 1\) and \(t \mapsto t^p\) is non-decreasing on \([0, \infty)\), this gives \(|f|^p \leq h^p\) a.e. By monotonicity of the integral, \(\int |f|^p \, d\mu \leq \int h^p \, d\mu < \infty\), so \(f \in L^p\).

Now consider \(|f_n - f|^p \leq (|f_n| + |f|)^p \leq (h + h)^p = (2h)^p\) a.e., and \(\int (2h)^p \, d\mu = 2^p \int h^p \, d\mu < \infty\), so \((2h)^p \in L^1\) is an admissible dominator. Since \(t \mapsto |t|^p\) is continuous on \(\mathbb{F}\) and \(f_n \to f\) a.e., we have \(|f_n - f|^p \to 0\) a.e. Applying the standard Dominated Convergence Theorem to the sequence \(|f_n - f|^p\) with dominator \((2h)^p\) yields \(\int |f_n - f|^p \, d\mu \to 0\), and since \(t \mapsto t^{1/p}\) is continuous on \([0, \infty)\) with \(t^{1/p} = 0 \iff t = 0\), this gives \(\|f_n - f\|_p \to 0\).

Looking Ahead: Probability and Convergence

The hierarchy of convergence modes we have just developed has a direct parallel in probability theory. When the measure space is a probability space \((\Omega, \mathcal{F}, \mathbb{P})\) and the functions are random variables:

A.e. convergence becomes almost sure (a.s.) convergence.
Convergence in measure becomes convergence in probability.
\(L^p\) convergence becomes \(L^p\) convergence of random variables, i.e., \(\mathbb{E}[|X_n - X|^p] \to 0\).

A fourth mode — convergence in distribution — operates at a different level: it concerns not the random variables \(X_n\) themselves (as functions on \(\Omega\)) but the induced probability measures on \(\mathbb{R}\) (or \(\mathbb{R}^d\)). It has no direct pointwise or \(L^p\) analogue in the function-space setting; from a functional-analytic perspective it corresponds to weak-* convergence of measures, a topic taken up in the probability chapters.

The three function-space relations — \(L^p \Rightarrow\) in measure, \(L^p \Rightarrow\) a.e. subsequence, and in measure \(\Rightarrow\) a.e. subsequence — carry over directly to probability theory under the renaming above; and because probability spaces are finite (indeed unit-mass), the additional implication a.s. \(\Rightarrow\) in probability also holds without qualification. The full picture, including how these modes relate to one another and their role in the law of large numbers and central limit theorem, is developed in our chapters on measure-theoretic probability and its limit theorems.

Why Complete Function Spaces Are Essential

We have now proven that \(L^p\) is a Banach space: a normed vector space in which every Cauchy sequence converges. In Completeness, we motivated this property for general metric spaces as the absence of "holes." But for function spaces, completeness carries a far more concrete significance: it guarantees that the result of a limiting operation is still a legitimate object in the space — a function with finite energy, a probability distribution with finite moments, or a physically meaningful quantum state.

We close this chapter by examining three domains where completeness of \(L^p\) is not a mathematical luxury but an absolute necessity.

Probability Theory: Finite Moments and Estimation

In probability, a random variable \(X\) on a probability space \((\Omega, \mathcal{F}, \mathbb{P})\) is simply a measurable function \(X : \Omega \to \mathbb{F}\). Saying \(X \in L^p(\Omega, \mathbb{P})\) means precisely that the \(p\)-th moment is finite: \[ \mathbb{E}\bigl[|X|^p\bigr] \;=\; \int_\Omega |X|^p \, d\mathbb{P} \;<\; \infty. \] The case \(p = 2\) is especially important: \(X \in L^2\) means that both the mean and the variance are finite, and \(L^2(\Omega, \mathbb{P})\) is a Hilbert space with inner product \(\langle X, Y \rangle = \mathbb{E}[X \overline{Y}]\) (reducing to \(\mathbb{E}[XY]\) over \(\mathbb{R}\)).

Completeness of \(L^2\) guarantees that the orthogonal projection onto any closed subspace exists. This is the mathematical foundation of least-squares estimation: the conditional expectation \(\mathbb{E}[X \mid \mathcal{G}]\) is the \(L^2\)-projection of \(X\) onto the subspace of \(\mathcal{G}\)-measurable random variables. Without completeness, the projection might not land inside the space — the "best estimate" might not exist as a random variable with finite variance.

Hölder's inequality also takes on a probabilistic reading: for conjugate exponents \(p, q\) (with \(1/p + 1/q = 1\)) and random variables \(X \in L^p\), \(Y \in L^q\), \[ \mathbb{E}[|XY|] \;\leq\; \bigl(\mathbb{E}[|X|^p]\bigr)^{1/p} \, \bigl(\mathbb{E}[|Y|^q]\bigr)^{1/q}. \] This bounds the expectation of a product in terms of individual moment conditions — a tool used constantly in proving concentration inequalities, convergence theorems, and the convergence rates of estimators.

Signal Processing: Finite Energy and Fourier Reconstruction

In signal processing, a signal \(f : \mathbb{R} \to \mathbb{C}\) has finite energy if \[ \|f\|_2^2 \;=\; \int_{-\infty}^{\infty} |f(t)|^2 \, dt \;<\; \infty. \] The space of finite-energy signals is exactly \(L^2(\mathbb{R})\). Plancherel's theorem states that the Fourier transform preserves this energy: adopting the convention \(\hat{f}(\xi) = \int_{-\infty}^{\infty} f(t) e^{-it\xi} \, dt\), \[ \|f\|_{L^2}^2 \;=\; \frac{1}{2\pi}\|\hat{f}\|_{L^2}^2. \] In other words, the rescaled transform \(f \mapsto (2\pi)^{-1/2} \hat{f}\) becomes a unitary operator on \(L^2(\mathbb{R})\): an isometry that maps \(L^2\) onto itself.

But unitarity is only meaningful if the space is complete. If \(L^2\) had "holes," the Fourier transform of a finite-energy signal might land outside the space — there would be frequency representations that correspond to no legitimate time-domain signal, or vice versa. Completeness ensures that the Fourier transform is a bijection on \(L^2\), that every finite-energy spectrum reconstructs a finite-energy signal, and that Parseval's identity holds with exact equality. The entire mathematical framework of spectral analysis rests on the Riesz-Fischer theorem.

Quantum Mechanics: Wave Functions and Unitary Evolution

In quantum mechanics, the state of a particle is described by a wave function \(\psi \in L^2(\mathbb{R}^3)\) satisfying the normalization condition \(\|\psi\|_2 = 1\). The physical interpretation is Born's rule: \(|\psi(x)|^2\) is the probability density for finding the particle at position \(x\). The \(L^2\) norm being \(1\) ensures that probabilities sum to \(1\).

Time evolution is governed by the Schrödinger equation, whose solution is a one-parameter family of unitary operators \(U(t) = e^{-iHt/\hbar}\) acting on \(L^2(\mathbb{R}^3)\). Unitarity means \(\|U(t)\psi\|_2 = \|\psi\|_2 = 1\) for all \(t\) — probability is conserved under time evolution.

If \(L^2\) were not complete, the limiting operations that pervade quantum theory — spectral decompositions of observables, the construction of stationary states as eigenfunctions of \(H\), and the infinite series expansions of states in energy eigenbases — could yield objects outside the space, with infinite energy or failing to be square-integrable, making the probability interpretation collapse. Completeness guarantees that these limits stay within the space of physical states, and that the spectral decomposition of observables (which extends the compact self-adjoint case to bounded and, via the theory of spectral measures, unbounded self-adjoint operators) produces well-defined measurement outcomes. In this sense, the Riesz-Fischer theorem is not merely a mathematical convenience — it is a precondition for the logical consistency of quantum theory.

The Common Thread

Across all three domains, the pattern is the same. Each field relies on limiting operations — expectations of infinite sums, inverse Fourier transforms, time evolution of differential equations — and completeness is the guarantee that these limits remain within the space of objects that have physical or mathematical meaning. An estimator with finite variance. A signal with finite energy. A quantum state with total probability one.

In Completeness, we described a complete metric space as one "without holes." Here we see what that metaphor means concretely for function spaces: a "hole" in \(L^p\) would be a sequence of perfectly legitimate functions — each with finite \(p\)-th integral — whose limit escapes to something infinite, undefined, or physically meaningless. The Riesz-Fischer theorem seals every such hole.

Looking Ahead

This chapter has established \(L^p\) as a Banach space, settling the \(L^p\) completeness proof invoked (and explicitly deferred) in Intro to Functional Analysis, and thereby securing the Banach-space foundation tacitly used in Dual Spaces. The road ahead branches in two complementary directions:

Fourier analysis in Hilbert spaces will take the special case \(p = 2\) and develop its Hilbert space structure in full — the inner product, Plancherel's theorem as a unitary equivalence, and the Heisenberg uncertainty principle as a theorem about noncommuting operators on \(L^2\).
Measure-theoretic probability will reinterpret the convergence theorems (MCT, DCT, Fatou) and the \(L^p\) hierarchy in the language of random variables and expectations, closing the gap between the measure-theoretic foundations of Measure Theory / Lebesgue Integration and the probabilistic reasoning used throughout Section III.

Both paths build directly on the completeness of \(L^p\) proven here — the first by specializing to the richest structure (\(L^2\) as a Hilbert space), the second by specializing to the richest interpretation (\(L^p\) of random variables on a probability space).

\(L^p\) Completeness & Convergence

Loading...

The Riesz-Fischer Theorem

Toolkit from Lebesgue Integration

The Theorem

Proof for \(1 \leq p < \infty\)

The Case \(p = \infty\)

Why the Proof Architecture Matters

An Important Corollary

Convergence in \(L^p\) — A Hierarchy of Modes

Four Notions of Convergence

The Implication Map

The Traveling Bump: Why \(L^p\) Does Not Imply A.E.

The \(L^p\) Dominated Convergence Theorem

Looking Ahead: Probability and Convergence

Why Complete Function Spaces Are Essential

Probability Theory: Finite Moments and Estimation

Signal Processing: Finite Energy and Fourier Reconstruction

Quantum Mechanics: Wave Functions and Unitary Evolution

The Common Thread

Looking Ahead