The Escape Lemma
The
fundamental theorem on flows
attached to every smooth vector field a maximal flow whose domain
\(\mathcal{D} \subseteq \mathbb{R} \times M\) is open and contains \(\{0\} \times M\),
and which restricts at each point \(p\) to a
maximal integral curve
\(\theta^{(p)} : \mathcal{D}^{(p)} \to M\). When the vector field is not complete,
some of these maximal intervals are bounded; a trajectory then exists only on a finite
open interval, and the natural question is what forces its termination. The escape
lemma gives the answer in geometric terms: a trajectory cannot terminate at finite
time while remaining in a compact subset of the manifold.
Lemma (Escape Lemma)
Let \(V\) be a smooth vector field on a smooth manifold \(M\), and let
\(\gamma : J \to M\) be a maximal integral curve of \(V\). If the domain \(J\)
has a finite least upper bound \(b\), then for every \(t_0 \in J\) the image
\(\gamma\bigl([t_0, b)\bigr)\) is not contained in any compact subset of \(M\).
Proof:
Suppose for contradiction that there exists \(t_0 \in J\) and a compact subset
\(K \subseteq M\) with \(\gamma\bigl([t_0, b)\bigr) \subseteq K\). Choose any
sequence \(t_k \in [t_0, b)\) with \(t_k \to b\). The points \(\gamma(t_k)\) lie
in the compact set \(K\), so a subsequence — still denoted \(\gamma(t_k)\) for
simplicity — converges to some point \(q \in K\).
Applying the
local existence theorem for integral curves
at \(q\), we obtain an integral curve \(\sigma : (-\varepsilon, \varepsilon) \to M\)
of \(V\) with \(\sigma(0) = q\) for some \(\varepsilon > 0\). The openness of the
flow domain \(\mathcal{D}\) at the point \((0, q)\), which lies in \(\mathcal{D}\)
because \(\mathcal{D}\) contains \(\{0\} \times M\), gives an open product
neighborhood \((-\eta, \eta) \times U \subseteq \mathcal{D}\) for some
\(\eta \in (0, \varepsilon)\) and some open neighborhood \(U\) of \(q\) in \(M\).
This is the geometric content of smooth dependence on initial conditions: for
every \(r \in U\), the maximal interval \(\mathcal{D}^{(r)}\) contains
\((-\eta, \eta)\), so all integral curves starting in \(U\) are simultaneously
defined on the interval \((-\eta, \eta)\).
Fix \(k\) large enough that \(\gamma(t_k) \in U\) and \(b - t_k < \eta\), and let
\(\sigma_k : (-\eta, \eta) \to M\) be the maximal integral curve of \(V\) starting
at \(\gamma(t_k)\). The
translation lemma
identifies the curve \(\tilde\gamma : (t_k - \eta, \, t_k + \eta) \to M\) defined
by \(\tilde\gamma(t) = \sigma_k(t - t_k)\) as an integral curve of \(V\); setting
\(t = t_k\) gives \(\tilde\gamma(t_k) = \sigma_k(0) = \gamma(t_k)\). Both
\(\tilde\gamma\) and \(\gamma\) are integral curves of \(V\) that pass through the
same point at time \(t_k\), so on the overlap of their domains
\(J \cap (t_k - \eta, t_k + \eta)\) — which is nonempty because it contains
\(t_k\) and is open — the uniqueness clause of the fundamental theorem forces
\(\tilde\gamma = \gamma\).
Define a curve \(\hat\gamma\) on the open interval
\(J \cup (t_k - \eta, \, t_k + \eta)\) by taking \(\gamma\) on \(J\) and
\(\tilde\gamma\) on \((t_k - \eta, t_k + \eta)\). The two pieces agree on the
overlap by the previous paragraph, so \(\hat\gamma\) is well-defined, and each
piece is an integral curve of \(V\) on its own domain — hence \(\hat\gamma\) is
an integral curve of \(V\) on the union. The choice \(b - t_k < \eta\) makes
\(t_k + \eta > b\), so the union strictly contains \(J\), and \(\hat\gamma\) is
an integral curve of \(V\) extending \(\gamma\) to a strictly larger open
interval. This contradicts the maximality of \(\gamma\), so the assumption that
\(\gamma\bigl([t_0, b)\bigr)\) lies in a compact set is untenable.
The contrapositive form of the lemma is the form most often invoked in practice. If
every trajectory of \(V\) remains in a compact set for all time at which it is defined,
the maximal existence interval of each trajectory cannot have a finite upper bound,
and the same argument applied to the time-reversed flow rules out a finite lower
bound; the vector field is therefore complete. This is the geometric content behind
the completeness statements of the previous development: compactness of the manifold,
or compactness of the support of \(V\), suffices precisely because it confines every
trajectory inside a compact set.
The Flowout Theorem
Flows provide the basic apparatus for many geometric constructions on manifolds, and
most of those constructions rest on a single structural result describing how a flow
behaves near a submanifold transverse to the generating vector field. The picture is
easy to state informally: starting from an embedded submanifold \(S \subseteq M\) and
moving each point of \(S\) along the integral curve of \(V\) through it, one sweeps
out a higher-dimensional submanifold of \(M\). The theorem below makes this sweep
precise — it is, locally near \(S\), the image of an injective immersion that turns
\(\partial/\partial t\) on the parameter space into \(V\) on the image. When \(S\) has
codimension one, the image is open in \(M\) and the parametrization becomes a
diffeomorphism.
Theorem (Flowout Theorem)
Let \(M\) be a smooth manifold, let \(S \subseteq M\) be an embedded
\(k\)-dimensional
submanifold,
and let \(V \in \mathfrak{X}(M)\) be a smooth vector field that is nowhere tangent
to \(S\). Let \(\theta : \mathcal{D} \to M\) be the flow of \(V\), set
\(\mathcal{O} = (\mathbb{R} \times S) \cap \mathcal{D}\), and let
\(\Phi = \theta|_{\mathcal{O}}\).
- (a) \(\Phi : \mathcal{O} \to M\) is a smooth
immersion.
- (b) The vector field \(\partial/\partial t \in
\mathfrak{X}(\mathcal{O})\) is \(\Phi\)-related to \(V\).
- (c) There exists a smooth positive function
\(\delta : S \to \mathbb{R}\) such that the restriction
\(\Phi|_{\mathcal{O}_\delta}\) is injective, where
\[
\mathcal{O}_\delta = \bigl\{ (t, p) \in \mathcal{O} : |t| < \delta(p) \bigr\} .
\]
- (d) If \(S\) has codimension one in \(M\), then
\(\Phi|_{\mathcal{O}_\delta}\) is a diffeomorphism onto an open submanifold
of \(M\).
The image \(\Phi(\mathcal{O}_\delta) \subseteq M\) is called the
flowout from \(S\) along \(V\).
The codification of the image as a flowout — definition placed alongside the theorem —
will be used as a self-standing term in later developments.
Definition: Flowout
With notation as in the flowout theorem, the image
\(\Phi(\mathcal{O}_\delta) \subseteq M\) is the flowout from \(S\)
along \(V\): the subset obtained by flowing each point of \(S\) along
\(V\) for a time interval whose length is permitted to depend smoothly on the
point.
The proof of the theorem proceeds through the four statements in a non-standard order,
because the relation in (b) is the cleanest consequence of the integral-curve
equation, and the immersion property in (a) is then read off from a basis count at
points of \(S\), with the rest of \(\mathcal{O}\) handled by translation along the
flow.
Proof of (b): The \(\Phi\)-Relation
Proof of (b):
Fix \(p \in S\), and let \(\sigma : \mathcal{D}^{(p)} \to \mathbb{R} \times S\)
be the curve \(\sigma(t) = (t, p)\). Its image lies in \(\mathcal{O}\) by the
definition of \(\mathcal{O}\). The composition
\(\Phi \circ \sigma : t \mapsto \theta(t, p) = \theta^{(p)}(t)\) is the maximal
integral curve of \(V\) through \(p\), so for every \(t_0 \in \mathcal{D}^{(p)}\),
\[
d\Phi_{(t_0, p)}\!\left( \frac{\partial}{\partial t}\bigg|_{(t_0, p)} \right)
= (\Phi \circ \sigma)'(t_0)
= V_{\Phi(t_0, p)} .
\]
The first equality is the chain rule, and the second is the integral-curve
equation \(\theta^{(p)\prime}(t_0) = V_{\theta(t_0, p)}\). The identity is exactly
the condition that \(\partial/\partial t\) and \(V\) are
\(\Phi\)-related.
Proof of (a): The Immersion Property
Proof of (a):
We first check that \(d\Phi_{(0, p)}\) is injective for every \(p \in S\), then
propagate the conclusion to general points \((t_0, p_0) \in \mathcal{O}\) by a
translation argument.
At the point \((0, p)\), the restriction of \(\Phi\) to the slice
\(\{0\} \times S \subseteq \mathcal{O}\) is the composition of the canonical
identification \(\{0\} \times S \cong S\) with the inclusion
\(S \hookrightarrow M\); both are embeddings, so the restriction
\(\Phi|_{\{0\} \times S}\) is an embedding too. The differential
\(d\Phi_{(0, p)}\) therefore carries the subspace
\(T_{(0, p)}(\{0\} \times S) \cong T_p S\) injectively into \(T_p M\), and in
fact takes it to the subspace \(T_p S \subseteq T_p M\) by the inclusion
identification.
Pick a basis \((E_1, \ldots, E_k)\) of \(T_p S\). Then
\(\bigl(\partial/\partial t|_{(0, p)}, E_1, \ldots, E_k\bigr)\) is a basis of
\(T_{(0, p)}\mathcal{O} \cong T_0\mathbb{R} \oplus T_p S\). Applying
\(d\Phi_{(0, p)}\) and using the formula just derived together with the
inclusion of \(T_p S\) into \(T_p M\), this basis maps to
\((V_p, E_1, \ldots, E_k)\) inside \(T_p M\). The hypothesis that \(V\) is
nowhere tangent to \(S\) gives \(V_p \notin T_p S\); the resulting
\((k + 1)\)-tuple is therefore linearly independent, and
\(d\Phi_{(0, p)}\) is injective.
Now fix an arbitrary \((t_0, p_0) \in \mathcal{O}\). The translation map
\(\tau_{t_0} : (t, p) \mapsto (t + t_0, p)\) is defined on the open set
\(\mathcal{O} - (t_0, 0) = \{(t, p) : (t + t_0, p) \in \mathcal{O}\}\), which
contains \((0, p_0)\) because \((t_0, p_0) \in \mathcal{O}\); shrinking to a
small open neighborhood of \((0, p_0)\) inside this set if necessary, both
\(\tau_{t_0}\) and its inverse \(\tau_{-t_0}\) are smooth and bijective onto
their images in \(\mathcal{O}\). The time-\(t_0\) flow map \(\theta_{t_0}\),
defined on the open set \(M_{t_0} = \{p \in M : (t_0, p) \in \mathcal{D}\}\),
is a diffeomorphism onto its image by the
diffeomorphism invariance of flows.
On a neighborhood of \((0, p_0)\) where all four maps below are simultaneously
defined, the flow group law gives the commutative diagram
\[
\begin{array}{ccc}
\mathcal{O} & \xrightarrow{\tau_{t_0}} & \mathcal{O} \\\\
\Phi \downarrow & & \downarrow \Phi \\\\
M & \xrightarrow{\theta_{t_0}} & M ,
\end{array}
\]
the verification being
\(\theta_{t_0}(\Phi(t, p)) = \theta_{t_0}(\theta(t, p)) = \theta(t + t_0, p)
= \Phi(t + t_0, p) = \Phi(\tau_{t_0}(t, p))\). Taking differentials at
\((0, p_0)\), the horizontal differentials are isomorphisms, so the two
vertical maps \(d\Phi_{(0, p_0)}\) and \(d\Phi_{(t_0, p_0)}\) have the same
rank. We have already shown that \(d\Phi_{(0, p_0)}\) is injective, so
\(d\Phi_{(t_0, p_0)}\) is too. This is the immersion property.
Proof of (c): Injectivity on a Sub-Flow Domain
The image \(\Phi(\mathcal{O})\) may overlap with itself: the same point of \(M\) can
be reached at different times along different trajectories. Restricting the time
parameter through a smooth positive function on \(S\) eliminates this overlap. The
construction proceeds in two steps — a slice-chart argument that controls overlaps
point by point, and a partition-of-unity argument that glues these local controls into
a single global function \(\delta : S \to \mathbb{R}\).
Proof of (c) (sketch):
Fix \(p_0 \in S\). Choose a
slice chart
\((U, (x^i))\) for \(S\) in \(M\) centered at \(p_0\), so that
\(U \cap S = \{x^{k+1} = \cdots = x^n = 0\}\) where \(n = \dim M\). Because \(V\)
is not tangent to \(S\), some component \(V^j(p_0)\) with \(j > k\) is nonzero;
by continuity, \(V^j\) does not vanish on a neighborhood of \(p_0\) and
therefore retains a single sign there. Shrinking \(U\) to such a neighborhood,
there exists a constant \(c > 0\) such that
\[
V^j(p) \geq c
\qquad \text{or} \qquad
V^j(p) \leq -c
\qquad \text{for all } p \in U ,
\]
and the sign is the same as the sign of \(V^j(p_0)\) throughout \(U\). The
component \(\Phi^j(t, p)\) of \(\Phi\) in this chart satisfies
\(\partial \Phi^j / \partial t = V^j \circ \Phi\) — the coordinate reading of
the \(\Phi\)-relation in (b) — and \(\Phi^j(0, p) = 0\) for \(p \in S\) because
\(\Phi(0, p) = p\) lies on the slice. Choose a product neighborhood
\((-\varepsilon_{p_0}, \varepsilon_{p_0}) \times W_{p_0}\) of \((0, p_0)\) in
\(\mathcal{O}\) so that \(\Phi\) maps it into \(U\); the sign-preserving lower
bound for \(V^j\) on \(U\) gives a corresponding lower bound for
\(\partial \Phi^j / \partial t\) on the product neighborhood, and the fundamental
theorem of calculus yields
\[
\Phi^j(t, p)
= \int_0^t \frac{\partial \Phi^j}{\partial s}(s, p) \, ds
= \int_0^t V^j\bigl( \Phi(s, p) \bigr) \, ds .
\]
The integrand has constant sign and absolute value bounded below by \(c\), so
\(|\Phi^j(t, p)| \geq c |t|\) on the product neighborhood. The \(j\)-th
coordinate vanishes on \(S \cap U\), so this lower bound implies
\(\Phi(t, p) \in S \iff t = 0\) on the product neighborhood. For a second
application, suppose \(\Phi(t, p) = \Phi(t', p)\) for the same \(p \in W_{p_0}\)
and \(t, t' \in (-\varepsilon_{p_0}, \varepsilon_{p_0})\); then \(t \mapsto
\Phi(t, p)\) is the integral curve \(\theta^{(p)}\) of \(V\) restricted to a
small interval on which \(\Phi^j\) is strictly monotone in \(t\) — a strictly
monotone function is injective, forcing \(t = t'\).
These constructions give, for each \(p \in S\), an open neighborhood
\(W_p \subseteq S\) and a positive number \(\varepsilon_p\) such that the
restriction of \(\Phi\) to \((-\varepsilon_p, \varepsilon_p) \times W_p\) is
injective and avoids \(S\) for nonzero times. The cover \(\{W_p : p \in S\}\) of
\(S\) admits a
partition of unity
\(\{\psi_p\}\) subordinate to it, and the function
\[
f(q) = \sum_{p \in S} \varepsilon_p \psi_p(q)
\]
is smooth and positive on \(S\). The defining property of the partition of unity
gives a useful bound: for each \(q \in S\), the values \(\psi_p(q)\) sum to one,
and the sum involves only finitely many \(p\) with \(\psi_p(q) > 0\); choosing
among these the index \(p_0\) with the largest \(\varepsilon_{p_0}\) yields
\(f(q) \leq \varepsilon_{p_0}\).
Set \(\delta = f / 2\) and suppose \(\Phi(t, q) = \Phi(t', q')\) for two points
\((t, q)\) and \((t', q')\) of \(\mathcal{O}_\delta\). Without loss of
generality \(f(q') \leq f(q)\). Since the values \(\psi_p(q)\) sum to one in
the partition of unity, choose an index \(p_0\) with \(\psi_{p_0}(q) > 0\) and
\(\varepsilon_{p_0}\) maximal among such indices; then \(q \in W_{p_0}\) by the
support condition on \(\psi_{p_0}\), and \(f(q) \leq \varepsilon_{p_0}\) by the
bound established above. Apply the flow group law to the equation
\(\theta_t(q) = \theta_{t'}(q')\): on the side of \(q\), this rewrites as
\(\theta_{-t'}(\theta_t(q)) = q'\), and the group law identifies the left side
with \(\theta_{t - t'}(q) = \Phi(t - t', q)\). Therefore
\[
\Phi(t - t', q) = q' \in S .
\]
The triangle inequality combined with \(|t| < \delta(q) = f(q) / 2\) and
\(|t'| < \delta(q') = f(q') / 2 \leq f(q) / 2\) gives
\[
|t - t'| \leq |t| + |t'| < f(q) \leq \varepsilon_{p_0} ,
\]
so \((t - t', q)\) lies in the product neighborhood
\((-\varepsilon_{p_0}, \varepsilon_{p_0}) \times W_{p_0}\) on which the
Step 1 conclusion \(\Phi(s, q) \in S \iff s = 0\) holds. From
\(\Phi(t - t', q) \in S\), this forces \(t - t' = 0\), that is, \(t = t'\).
Substituting back into the original equation gives
\(\Phi(t, q) = \Phi(t, q')\), with both \(q\) and \(q'\) in \(S\) and
\(\Phi(0, q) = q\), \(\Phi(0, q') = q'\). At \(t = 0\) the equation reduces to
\(q = q'\). For \(t \neq 0\), applying the flow group law in the same way as
before reduces the equation \(\Phi(t, q) = \Phi(t, q')\) to
\(\Phi(0, q) = \Phi(0, q')\), again giving \(q = q'\). The restriction
\(\Phi|_{\mathcal{O}_\delta}\) is therefore injective.
Proof of (d): The Codimension-One Case
Proof of (d):
Suppose \(S\) has codimension one in \(M\). The parameter space \(\mathcal{O}\)
has dimension \(1 + k = 1 + (n - 1) = n = \dim M\). The map
\(\Phi|_{\mathcal{O}_\delta}\) is by (a) a smooth immersion and by (c) injective,
and its source and target have the same dimension. An injective smooth immersion
between manifolds of equal dimension is open by the
local-diffeomorphism characterization
and hence — being also injective —
a diffeomorphism onto its image, which is an open submanifold of \(M\).
Where the Flowout Appears Next
The flowout theorem is the structural input for many later constructions on
manifolds. A
tubular neighborhood
of an embedded submanifold can be presented as the flowout from the submanifold
along the geodesic spray of a Riemannian metric, or — equivalently in our present
setting — as the image of a flowout-type construction starting from the normal
bundle. Foliation theory uses a closely analogous picture to convert a nowhere-tangent
vector field into a transverse parameter along leaves. In the next section we will
use the flowout theorem itself in the form most relevant for the local structure
of vector fields: with \(S\) chosen as an embedded hypersurface, the
codimension-one case (d) provides the coordinate change that puts \(V\) into a
canonical form.
Regular Points, Singular Points, and Equilibrium
The vanishing locus of a vector field is the first object to isolate before describing
the local behavior of its flow. At points where the vector field is zero, the integral
curve is constant in time and contributes nothing dynamical; at all other points, the
integral curve moves with a nonzero
velocity
at the starting instant, and the picture is locally that of a flow line carrying a
point along a direction prescribed by \(V\). This dichotomy is sharper than it looks:
the propositions below show that the regular case is preserved along the entire
trajectory in the sense that the integral curve is an immersion, while the singular
case forces the trajectory to be the constant curve.
Definition: Singular Point
Let \(V\) be a
vector field
on a smooth manifold \(M\). A point \(p \in M\) is called a singular point
of \(V\) if \(V_p = 0\).
Definition: Regular Point
A point \(p \in M\) is called a regular point of \(V\) if
\(V_p \neq 0\). Every point of \(M\) is therefore either a singular or a regular
point of \(V\), and the two classes partition \(M\).
The dichotomy in qualitative behavior of integral curves at the two kinds of points
is the content of the next proposition.
Proposition (Integral Curves at Regular and Singular Points)
Let \(V \in \mathfrak{X}(M)\), let \(\theta : \mathcal{D} \to M\) be its
maximal flow,
and let \(p \in M\).
- If \(p\) is a singular point of \(V\), then \(\mathcal{D}^{(p)} = \mathbb{R}\)
and \(\theta^{(p)}\) is the constant curve \(\theta^{(p)}(t) \equiv p\).
- If \(p\) is a regular point of \(V\), then \(\theta^{(p)} : \mathcal{D}^{(p)}
\to M\) is a smooth
immersion.
Proof:
Suppose first that \(V_p = 0\). The constant curve \(\gamma : \mathbb{R} \to M\)
with \(\gamma(t) \equiv p\) has zero velocity at every point, and the constant
zero coincides with \(V_{\gamma(t)} = V_p = 0\) at every \(t\). Hence \(\gamma\)
is an integral curve of \(V\) starting at \(p\), defined on all of
\(\mathbb{R}\). By the maximality of \(\theta^{(p)}\), the domain of every
integral curve of \(V\) starting at \(p\) is contained in
\(\mathcal{D}^{(p)}\); applied to \(\gamma\), this gives
\(\mathbb{R} \subseteq \mathcal{D}^{(p)}\), and since
\(\mathcal{D}^{(p)} \subseteq \mathbb{R}\) trivially, equality holds:
\(\mathcal{D}^{(p)} = \mathbb{R}\). The uniqueness clause of the fundamental
theorem then forces \(\theta^{(p)} = \gamma\) on \(\mathcal{D}^{(p)}\), so the
trajectory \(\theta^{(p)}\) is the constant curve at \(p\).
Now suppose \(p\) is regular, and argue the contrapositive of the immersion
conclusion: assume \(\theta^{(p)}\) fails to be an immersion at some
\(s \in \mathcal{D}^{(p)}\), so that
\((\theta^{(p)})'(s) = 0\). Let \(q = \theta^{(p)}(s) = \theta_s(p)\). The
integral-curve equation gives
\[
V_q = V_{\theta^{(p)}(s)} = (\theta^{(p)})'(s) = 0 ,
\]
so \(q\) is a singular point of \(V\). By the singular case just proved,
\(\mathcal{D}^{(q)} = \mathbb{R}\) and \(\theta^{(q)}(t) \equiv q\). The
hypotheses for the group law of the flow are now in place — \(s \in
\mathcal{D}^{(p)}\) by assumption, and \(t - s \in \mathcal{D}^{(q)} =
\mathbb{R}\) automatically — so for every \(t \in \mathcal{D}^{(p)}\),
\[
\theta^{(p)}(t) = \theta_t(p) = \theta_{t - s} \bigl( \theta_s(p) \bigr)
= \theta_{t - s}(q) = q .
\]
In particular \(\theta^{(p)}\) is constant on \(\mathcal{D}^{(p)}\), so
\(V_p = (\theta^{(p)})'(0) = 0\) — contradicting the assumption that \(p\) is
regular. The trajectory \(\theta^{(p)}\) is therefore an immersion at every
point of \(\mathcal{D}^{(p)}\), and is smooth as the composition of the smooth
embedding \(\iota_p : \mathcal{D}^{(p)} \to \mathcal{D}\) given by
\(\iota_p(t) = (t, p)\) with the smooth flow \(\theta\).
A name for the flow-level counterpart of a singular point of the generating vector
field is convenient for later use.
Definition: Equilibrium Point
Let \(\theta : \mathcal{D} \to M\) be a smooth flow. A point \(p \in M\) is called
an equilibrium point of \(\theta\) if \(\theta(t, p) = p\) for
every \(t \in \mathcal{D}^{(p)}\).
The two halves of the proposition above translate at the flow level into a simple
identification. The equilibrium points of a flow are precisely the singular points
of its infinitesimal generator: singular points produce constant trajectories, and
conversely, a maximal integral curve starting at a regular point cannot reach a
singular point in finite time — were it to do so, the immersion property of the
regular case would fail at the moment of arrival. This is the link that lets one
read off the equilibrium structure of a flow directly from the vanishing locus of
the underlying vector field.
Canonical Form Near a Regular Point
The flowout theorem in its codimension-one case gives a diffeomorphism between an
open subset of \(\mathbb{R} \times S\) and an open subset of \(M\), under which the
coordinate vector field \(\partial/\partial t\) on the parameter space corresponds
to \(V\) on the image. Read this in coordinates and an immediate consequence
follows: in some smooth chart near every regular point of \(V\), the vector field
has the same coordinate representation \(\partial/\partial s^1\) — independent of
the specifics of \(V\), the manifold, or the chosen point. This is the canonical
form theorem, and it is the structural statement that classifies the local behavior
of a smooth vector field up to coordinate change.
Proof:
We reduce to a configuration in which the flowout theorem applies, then read off
the desired coordinates from the inverse of the resulting diffeomorphism.
If no hypersurface \(S\) is supplied with the regular point, construct one. Choose
any smooth chart \((U, (x^i))\) centered at \(p\). Since \(V_p \neq 0\), some
component \(V^j(p) \neq 0\); fix such an index \(j\) and let
\[
S = \bigl\{ q \in U : x^j(q) = 0 \bigr\} .
\]
This is an embedded hypersurface containing \(p\) (the
level-set criterion for embedded submanifolds
identifies it as such, since the differential of the coordinate function \(x^j\)
is everywhere nonzero on \(U\)). The condition \(V^j(p) \neq 0\) is exactly the
statement that \(V_p\) is not tangent to \(\{x^j = 0\}\) at \(p\). If a
hypersurface \(S\) was supplied, we use it directly.
In either case, the hypothesis \(V_p \notin T_p S\) is open: \(V\) is nowhere
tangent to \(S\) on some neighborhood of \(p\) in \(S\). Shrinking \(S\) to this
neighborhood if necessary, the
flowout theorem
applies, and the codimension-one case (d) provides a sub-flow domain
\(\mathcal{O}_\delta \subseteq \mathbb{R} \times S\) and a diffeomorphism
\(\Phi : \mathcal{O}_\delta \to W\) onto an open submanifold \(W \subseteq M\)
containing \(S\), with \(\Phi_*(\partial/\partial t) = V\) on \(W\).
Choose a smooth local parametrization \(X : \Omega \to S\) of \(S\), where
\(\Omega \subseteq \mathbb{R}^{n - 1}\) is open and \(X\) is a diffeomorphism onto
an open subset \(W_0 \subseteq S\) containing \(p\); write the coordinates on
\(\Omega\) as \((s^2, \ldots, s^n)\). After shrinking \(\Omega\) and \(\delta\)
if necessary so that
\(\bigl((-\varepsilon, \varepsilon) \times \Omega\bigr) \subseteq \mathcal{O}_\delta\)
under the embedding \((t, s^2, \ldots, s^n) \mapsto (t, X(s^2, \ldots, s^n))\),
define
\[
\Psi : (-\varepsilon, \varepsilon) \times \Omega \to M , \qquad
\Psi(t, s^2, \ldots, s^n) = \Phi \bigl( t, X(s^2, \ldots, s^n) \bigr) .
\]
Both \(\Phi\) and the auxiliary map \((t, s) \mapsto (t, X(s))\) are
diffeomorphisms onto their images — the former by codimension-one case (d) of
the flowout theorem, the latter as the product of the identity with the
diffeomorphism \(X\). The composition \(\Psi\) is therefore a diffeomorphism
onto an open neighborhood of \(p\) in \(M\).
It remains to compute the pushforward of \(\partial/\partial t\) under \(\Psi\).
Fix a point \((t_0, s_0) \in (-\varepsilon, \varepsilon) \times \Omega\) and let
\(q_0 = X(s_0)\). The chain rule applied to the composition
\(\Psi = \Phi \circ (\mathrm{id} \times X)\) gives
\[
d\Psi_{(t_0, s_0)}\!\left( \frac{\partial}{\partial t}\bigg|_{(t_0, s_0)} \right)
= d\Phi_{(t_0, q_0)} \!\circ d(\mathrm{id} \times X)_{(t_0, s_0)}\!
\left( \frac{\partial}{\partial t}\bigg|_{(t_0, s_0)} \right) .
\]
The product map \(\mathrm{id} \times X\) acts as the identity on the first
factor of \((-\varepsilon, \varepsilon) \times \Omega\), so its differential
sends \(\partial/\partial t|_{(t_0, s_0)}\) to
\(\partial/\partial t|_{(t_0, q_0)}\): the time direction is preserved verbatim,
and the spatial directions in \(\Omega\) are sent to spatial directions in
\(S\) by \(dX\). The above chain-rule expression therefore reduces to
\[
d\Psi_{(t_0, s_0)}\!\left( \frac{\partial}{\partial t}\bigg|_{(t_0, s_0)} \right)
= d\Phi_{(t_0, q_0)}\!\left( \frac{\partial}{\partial t}\bigg|_{(t_0, q_0)} \right) .
\]
The flowout-theorem identity
\(d\Phi_{(t_0, q_0)}(\partial/\partial t) = V_{\Phi(t_0, q_0)}\) then yields
\(d\Psi_{(t_0, s_0)}(\partial/\partial t) = V_{\Psi(t_0, s_0)}\), so on the
image of \(\Psi\) we have the pushforward identity
\(\Psi_*(\partial/\partial t) = V\). The inverse
\(\Psi^{-1}\) is therefore a smooth chart, and renaming the time coordinate
\(t\) to \(s^1\), it provides
coordinate vectors
\((s^1, \ldots, s^n)\) in which \(V = \partial/\partial s^1\). The set \(S\)
corresponds in these coordinates to \(\{s^1 = 0\}\), so \(s^1\) is a local
defining function for \(S\).
The Local Linearization Principle
The canonical form theorem is the local structure statement for smooth vector
fields: up to a coordinate change near any regular point, \(V\) looks exactly
like the constant vector field \(\partial/\partial s^1\) on \(\mathbb{R}^n\) —
all of the interesting local behavior of the flow is therefore concentrated at
the singular points, the equilibrium points of the flow. The phenomenon has a
recurring analogue: for a Lie group, the
exponential map is a local diffeomorphism near the origin
of the Lie algebra, and the group structure on a neighborhood of the identity is
recovered from the linear data of the Lie algebra. In both settings, a single
coordinate change transports a nontrivial geometric object onto a flat model in
a neighborhood of a distinguished point, and the genuine local geometry is
pushed onto the locus where that linearization fails: the singular set in the
vector-field case, and — for the exponential map — the locus where the
differential of \(\exp\) ceases to be invertible, beyond which the global
behavior of \(\exp\) on the Lie algebra is no longer captured by the flat
identification near the origin.
Polar Coordinates from a Rotation Flow
The canonical form theorem is constructive when the integral curves of \(V\) can be
written down in closed form: pick a hypersurface \(S\) transverse to \(V\) and a
parametrization \(X\) of \(S\), form the composite \(\Psi(t, s) = \theta_t(X(s))\),
and invert. The simplest nontrivial instance recovers a coordinate system that the
reader already knows by another name.
Take \(W = x\,\partial/\partial y - y\,\partial/\partial x\) on \(\mathbb{R}^2\),
the rotation generator whose flow is
\(\theta_t(a, b) = (a \cos t - b \sin t, \, a \sin t + b \cos t)\). The point
\((1, 0)\) is a regular point of \(W\), since
\(W_{(1, 0)} = \partial/\partial y|_{(1, 0)} \neq 0\). Because the \(y\)-component
of \(W\) is nonzero at \((1, 0)\), the \(x\)-axis is a hypersurface to which \(W\)
is not tangent there; parametrize it by \(X(s) = (s, 0)\). The map
\(\Psi : \mathbb{R}^2 \to \mathbb{R}^2\) of the canonical form construction is
\[
\Psi(t, s) = \theta_t(s, 0) = (s \cos t, \, s \sin t) .
\]
Inverting locally near \((1, 0)\) expresses \((t, s)\) in terms of \((x, y)\):
\[
(t, s) = \Psi^{-1}(x, y) = \bigl( \tan^{-1}(y / x), \, \sqrt{x^2 + y^2} \bigr) .
\]
In these coordinates, \(W = \partial/\partial t\) — the rotation field is the
coordinate vector along the angular direction. The coordinates \((t, s)\) of the
canonical form are, in this example, the angular and radial coordinates
respectively: \(t\) is the angle and \(s\) is the distance from the origin. The
pair \((t, s)\) is, of course, the system of polar coordinates, recovered here as
the canonical form coordinates for the rotation vector field rather than introduced
ad hoc.
From Local Canonical Form to Global Dynamics
The canonical form theorem describes the local geometry of a flow only away from
equilibrium points, and the qualitative behavior near an equilibrium — closed
orbits, basins of attraction, saddle structure, spiraling — falls outside the
scope of any single coordinate change. The systematic study of these phenomena,
including their global and long-time aspects, is smooth dynamical systems theory:
the analysis of trajectories on a manifold over arbitrary time scales, the
classification of invariant sets, and the interaction between flow geometry and
manifold topology. Machine learning provides one of the more recent settings in
which such a flow is engineered rather than encountered: the
flow-matching
construction in generative modeling
learns a vector field whose time-\(1\) flow transports a base distribution to a
target, and the quality of the generative procedure is governed by the global
behavior of the trajectories of the learned field. The trajectories of a smooth
vector field on a manifold, locally pinned down by the canonical form theorem
and globally controlled by the completeness and naturality results of the
preceding sections, are the technical apparatus shared by both classical
dynamical systems and these modern generative constructions.