Lie Groups: Matrix Groups and Continuous Symmetry

From Discrete to Continuous Symmetry Matrix Lie Groups Looking Ahead

From Discrete to Continuous Symmetry

In Geometry of Symmetry, we explored the dihedral group \(D_n\) — the finite group of symmetries of a regular \(n\)-gon — and previewed two continuous groups, \(SO(3)\) and \(SE(3)\), that govern rotation and rigid body motion in three-dimensional space. There, we observed these continuous groups visually and intuitively: \(SO(3)\) allows rotations by any angle, and \(SE(3)\) combines such rotations with arbitrary translations. We noted that the passage from \(D_n\) to \(SO(3)\) is a shift from discrete to continuous symmetry — but we did not make this precise.

The present page fulfills this task. We formalize continuous symmetry groups as matrix Lie groups and develop the matrix exponential, the fundamental tool that connects infinitesimal symmetries to finite transformations. The key question driving this development is: what does it mean for a group to be "continuous"?

Topological Groups

The answer lies in combining two structures we have studied separately: the algebraic structure of a group and the topological structure of a space with a notion of continuity. A group whose operations respect topology is called a topological group.

Definition: Topological Group

A topological group is a group \(G\) equipped with a topology such that the following two maps are continuous:

  1. Multiplication: \(\mu : G \times G \to G\), \(\mu(g, h) = gh\), where \(G \times G\) carries the product topology.
  2. Inversion: \(\iota : G \to G\), \(\iota(g) = g^{-1}\).

The continuity of multiplication means that if \(g\) and \(h\) are "close" to \(g_0\) and \(h_0\) respectively, then \(gh\) is close to \(g_0 h_0\). Continuity of inversion ensures that nearby elements have nearby inverses. Together, they guarantee that the algebraic operations are compatible with the topology — the group structure does not "tear" the underlying space.

Examples:

(a) \((\mathbb{R}, +)\) with the standard (Euclidean) topology is a topological group: addition \((x, y) \mapsto x + y\) and negation \(x \mapsto -x\) are both continuous.

(b) \((\mathbb{R} \setminus \{0\}, \cdot)\) with the subspace topology inherited from \(\mathbb{R}\) is a topological group: multiplication and the map \(x \mapsto 1/x\) are continuous on \(\mathbb{R} \setminus \{0\}\).

(c) The circle group \(S^1 = \{z \in \mathbb{C} : |z| = 1\}\) with the subspace topology from \(\mathbb{C}\) is a topological group under complex multiplication. Multiplication and inversion \(z \mapsto \bar{z}\) are continuous. This is the simplest example of a compact, connected topological group.

Toward Lie Groups

A Lie group is a group that is simultaneously a smooth manifold, with smooth (infinitely differentiable) group operations. Since smooth manifolds have not yet been formally defined in this curriculum — they are the subject of a future page on Smooth Manifolds — we take a concrete approach that is both fully rigorous and historically prior to the abstract definition.

Our strategy is to define matrix Lie groups as closed subgroups of the general linear group \(GL(n, \mathbb{C})\). A deep theorem — Cartan's Closed Subgroup Theorem, stated in the next section — guarantees that every such closed subgroup is automatically a smooth manifold with smooth group operations. In other words, closedness alone buys us all the smoothness we need.

This approach mirrors a recurring pattern in our curriculum: just as Natural Gradient Descent previewed Riemannian geometry before manifolds were formally available, we now study the most important Lie groups concretely before the general definition arrives. Every matrix Lie group we define here will be a Lie group in the abstract sense — we lose nothing by starting with matrices, and we gain the ability to compute.

Matrix Lie Groups

We now define the arena in which all our groups will live. The space \(M_n(\mathbb{F})\) of \(n \times n\) matrices over \(\mathbb{F}\) (where \(\mathbb{F} = \mathbb{R}\) or \(\mathbb{C}\)) is a finite-dimensional vector space isomorphic to \(\mathbb{F}^{n^2}\), and we equip it with the topology induced by any norm (all norms on a finite-dimensional space are equivalent). Within this space sits the group of invertible matrices.

Definition: General Linear Group

The general linear group is \[ GL(n, \mathbb{F}) = \{ A \in M_n(\mathbb{F}) : \det(A) \neq 0 \}. \] Since the determinant \(\det : M_n(\mathbb{F}) \to \mathbb{F}\) is a polynomial in the matrix entries (hence continuous), and \(\{0\}\) is closed in \(\mathbb{F}\), the complement \(GL(n, \mathbb{F}) = \det^{-1}(\mathbb{F} \setminus \{0\})\) is an open subset of \(M_n(\mathbb{F})\).

Matrix multiplication is polynomial in the entries, hence continuous. Matrix inversion is given by Cramer's rule as a rational function of the entries (with denominator \(\det(A) \neq 0\)), hence continuous on \(GL(n, \mathbb{F})\). Therefore \(GL(n, \mathbb{F})\) is a topological group.

The general linear group is the "universe" of matrix groups. Every matrix Lie group will be a subgroup of \(GL(n, \mathbb{C})\) (or \(GL(n, \mathbb{R})\)) satisfying one additional condition: closedness.

Definition: Matrix Lie Group

A matrix Lie group is a subgroup \(G \leq GL(n, \mathbb{C})\) that is a closed subset in the topology of \(M_n(\mathbb{C})\). That is, \(G\) is closed under the group operations and under taking limits of convergent sequences: if \(A_1, A_2, A_3, \dots \in G\) and \(A_k \to A\) in \(M_n(\mathbb{C})\), then either \(A \in G\) or \(A \notin GL(n, \mathbb{C})\).

The closedness condition is remarkably mild — it is the only topological condition we impose. Yet it has profound consequences, as we will see in Cartan's theorem below. Intuitively, closedness prevents the group from having "holes" or "missing boundary points" that would destroy its manifold structure.

The Classical Groups

We now introduce the classical matrix Lie groups — the groups that appear throughout mathematics, physics, and computer science. For each group, we verify that it is indeed a matrix Lie group by checking that it is a closed subgroup of \(GL(n, \mathbb{C})\).

Definition: Special Linear Group

The special linear group is \[ SL(n, \mathbb{R}) = \{ A \in GL(n, \mathbb{R}) : \det(A) = 1 \}. \] This is the group of volume-preserving linear transformations.

Proof that \(SL(n, \mathbb{R})\) is a matrix Lie group:

The determinant map \(\det : GL(n, \mathbb{R}) \to \mathbb{R}^*\) is a continuous group homomorphism (with \(\mathbb{R}^*\) under multiplication). Its kernel is \(SL(n, \mathbb{R}) = \det^{-1}(\{1\})\). Since \(\{1\}\) is closed in \(\mathbb{R}^*\) and \(\det\) is continuous, \(SL(n, \mathbb{R})\) is a closed subgroup of \(GL(n, \mathbb{R})\). The dimension of \(SL(n, \mathbb{R})\) is \(n^2 - 1\) (the single constraint \(\det(A) = 1\) removes one degree of freedom).

Definition: Orthogonal Group

The orthogonal group is \[ O(n) = \{ A \in GL(n, \mathbb{R}) : A^\top A = I \}. \] Equivalently, \(O(n)\) consists of the linear transformations that preserve the Euclidean inner product: \(\langle Ax, Ay \rangle = \langle x, y \rangle\) for all \(x, y \in \mathbb{R}^n\).

Remember, these are precisely the orthogonal matrices \(U\) satisfying \(U^\top U = I\) — the norm-preserving transformations studied earlier in the curriculum.

Proof that \(O(n)\) is a closed subgroup of \(GL(n, \mathbb{R})\):

Consider the map \(\Phi : M_n(\mathbb{R}) \to M_n(\mathbb{R})\) defined by \(\Phi(A) = A^\top A\). This map is continuous (it is polynomial in the entries), and \(O(n) = \Phi^{-1}(\{I\})\). Since \(\{I\}\) is a closed set in \(M_n(\mathbb{R})\), the preimage \(O(n)\) is closed. Furthermore, if \(A^\top A = I\), then \(\det(A)^2 = \det(A^\top A) = 1\), so \(\det(A) = \pm 1 \neq 0\), confirming \(O(n) \subset GL(n, \mathbb{R})\).

Proof that \(O(n)\) is compact:

Bounded: For any \(A \in O(n)\), the Frobenius norm satisfies \(\|A\|_F^2 = \mathrm{tr}(A^\top A) = \mathrm{tr}(I) = n\). Hence every element of \(O(n)\) has the same Frobenius norm \(\sqrt{n}\), so \(O(n)\) is bounded in \(M_n(\mathbb{R}) \cong \mathbb{R}^{n^2}\).

Closed: We proved this above.

By the Heine-Borel theorem, a subset of \(\mathbb{R}^{n^2}\) is compact if and only if it is closed and bounded. Therefore \(O(n)\) is compact.

The condition \(\det(A) = \pm 1\) for \(A \in O(n)\) shows that \(O(n)\) has two connected components: the matrices with \(\det(A) = +1\) (proper rotations) and those with \(\det(A) = -1\) (improper rotations, i.e., rotations composed with a reflection). The component containing the identity is the special orthogonal group.

Definition: Special Orthogonal Group

The special orthogonal group is \[ SO(n) = O(n) \cap SL(n, \mathbb{R}) = \{ A \in GL(n, \mathbb{R}) : A^\top A = I,\; \det(A) = 1 \}. \] This is the group of rotations of \(\mathbb{R}^n\). It is a closed subgroup of \(GL(n, \mathbb{R})\) (as the intersection of two closed subgroups), and it is compact (as a closed subset of the compact set \(O(n)\)). Its dimension is \(n(n-1)/2\).

Proof that \(SO(n)\) is connected:

We show that \(SO(n)\) is path-connected, which implies connectedness. Let \(A \in SO(n)\). We must find a continuous path from \(I\) to \(A\) within \(SO(n)\). By the real Schur decomposition, every element of \(SO(n)\) is orthogonally similar to a block-diagonal matrix consisting of \(2 \times 2\) rotation blocks \(\begin{pmatrix} \cos\theta_k & -\sin\theta_k \\ \sin\theta_k & \cos\theta_k \end{pmatrix}\), and when \(n\) is odd, a single \(1 \times 1\) block. Since \(\det A = 1\), any eigenvalue \(-1\) must occur with even multiplicity, so each pair of \(-1\) eigenvalues is absorbed into a \(2 \times 2\) rotation block with angle \(\pi\); the remaining \(1 \times 1\) block is necessarily \(+1\). Each such block can be continuously rotated to the identity by replacing \(\theta_k\) with \(t\theta_k\) for \(t \in [0, 1]\). Since conjugation by the orthogonal change-of-basis matrix is a homeomorphism of \(SO(n)\), this gives a continuous path from \(I\) to \(A\).

The dimension formula \(\dim SO(n) = n(n-1)/2\) reflects the number of independent parameters in a skew-symmetric matrix (equivalently, the number of independent constraints in \(A^\top A = I\): the symmetric matrix equation gives \(n(n+1)/2\) equations on \(n^2\) entries, leaving \(n^2 - n(n+1)/2 = n(n-1)/2\) degrees of freedom). For the cases of greatest importance: \(\dim SO(2) = 1\) (one angle of rotation) and \(\dim SO(3) = 3\) (three Euler angles, or equivalently, a rotation axis and an angle).

Definition: Unitary Group

The unitary group is \[ U(n) = \{ A \in GL(n, \mathbb{C}) : A^* A = I \} \] where \(A^* = \overline{A}^\top\) denotes the conjugate transpose. This is the group of linear transformations preserving the standard Hermitian inner product on \(\mathbb{C}^n\). By the same argument as for \(O(n)\), \(U(n)\) is a closed, compact subgroup of \(GL(n, \mathbb{C})\). It is connected, and its dimension is \(n^2\) (as a real manifold).

Definition: Special Unitary Group

The special unitary group is \[ SU(n) = U(n) \cap SL(n, \mathbb{C}) = \{ A \in GL(n, \mathbb{C}) : A^* A = I,\; \det(A) = 1 \}. \] It is a closed, compact, connected subgroup of \(GL(n, \mathbb{C})\) of dimension \(n^2 - 1\). The group \(SU(2)\) plays a central role in quantum mechanics and, as we shall see in the next page, is intimately related to \(SO(3)\) via a 2:1 covering map.

Definition: Special Euclidean Group

The special Euclidean group \(SE(3)\) is the group of rigid body motions (rotations and translations) of \(\mathbb{R}^3\). It is realized as a matrix Lie group via the embedding into \(GL(4, \mathbb{R})\): \[ SE(3) = \left\{ \begin{pmatrix} R & \mathbf{t} \\ \mathbf{0}^\top & 1 \end{pmatrix} : R \in SO(3),\; \mathbf{t} \in \mathbb{R}^3 \right\} \subset GL(4, \mathbb{R}). \] The group operation corresponds to composition of rigid body motions: \[ \begin{pmatrix} R_1 & \mathbf{t}_1 \\ \mathbf{0}^\top & 1 \end{pmatrix} \begin{pmatrix} R_2 & \mathbf{t}_2 \\ \mathbf{0}^\top & 1 \end{pmatrix} = \begin{pmatrix} R_1 R_2 & R_1 \mathbf{t}_2 + \mathbf{t}_1 \\ \mathbf{0}^\top & 1 \end{pmatrix}. \] The group \(SE(3)\) is a closed, connected subgroup of \(GL(4, \mathbb{R})\), but it is not compact (the translation component \(\mathbf{t}\) is unbounded). Its dimension is \(6\) (\(3\) for rotation + \(3\) for translation).

The following table summarizes the classical matrix Lie groups:

Group Defining Condition Dimension Connected Compact
\(GL(n, \mathbb{R})\) \(\det(A) \neq 0\) \(n^2\) No (2 components) No
\(SL(n, \mathbb{R})\) \(\det(A) = 1\) \(n^2 - 1\) Yes No
\(O(n)\) \(A^\top A = I\) \(n(n-1)/2\) No (2 components) Yes
\(SO(n)\) \(A^\top A = I,\; \det(A) = 1\) \(n(n-1)/2\) Yes Yes
\(U(n)\) \(A^* A = I\) \(n^2\) Yes Yes
\(SU(n)\) \(A^* A = I,\; \det(A) = 1\) \(n^2 - 1\) Yes Yes
\(SE(3)\) \(\begin{pmatrix} R & \mathbf{t} \\ \mathbf{0}^\top & 1 \end{pmatrix},\; R \in SO(3)\) 6 Yes No

Cartan's Closed Subgroup Theorem

Why is closedness the only condition we need? The following theorem, due to Élie Cartan, justifies our entire approach.

Theorem: Cartan's Closed Subgroup Theorem

Every closed subgroup of \(GL(n, \mathbb{C})\) is a smooth submanifold of \(M_n(\mathbb{C})\), and the group operations (multiplication and inversion) are smooth maps with respect to this manifold structure.

The proof requires the inverse function theorem on manifolds, which lies beyond our current tools (it will become available after smooth manifolds are formally introduced). We state it here as the foundational result that justifies our definition: calling a closed subgroup of \(GL(n, \mathbb{C})\) a "matrix Lie group" is not merely a convention — Cartan's theorem guarantees that it is genuinely a Lie group in the abstract sense.

It is worth noting the parallel with quotient structures. In Topological Spaces, we saw that closedness of a subgroup \(H\) is the condition ensuring the quotient space \(G/H\) is Hausdorff. Here, closedness of the subgroup \(G\) in \(GL(n)\) is the condition ensuring \(G\) is a smooth manifold. In both cases, closedness is the gatekeeper to good geometric behavior.

Quotient Spaces

If \(H \leq G\) is a closed subgroup of a matrix Lie group \(G\), then the quotient space \(G/H\) carries a natural smooth manifold structure (by a generalization of Cartan's theorem). Here, \(G/H\) denotes the set of left cosets \(\{gH : g \in G\}\) equipped with the quotient topology.

An important distinction must be made. When \(H\) is a normal subgroup, the quotient \(G/H\) is both a smooth manifold and a group (a factor group). When \(H\) is merely closed (not necessarily normal), \(G/H\) is still a smooth manifold — called a homogeneous space — but it does not carry a group structure.

Example: The 2-Sphere as a Homogeneous Space

The group \(SO(3)\) acts transitively on the unit sphere \(S^2 \subset \mathbb{R}^3\) (any unit vector can be rotated to any other). The stabilizer of the north pole \(\mathbf{e}_3 = (0, 0, 1)^\top\) is the subgroup of rotations about the \(z\)-axis, which is isomorphic to \(SO(2)\). Therefore: \[ SO(3) / SO(2) \cong S^2. \] The 2-sphere is a homogeneous space — it is a smooth manifold, but not a group (there is no natural way to "multiply" two points on a sphere).

Looking Ahead

We have established the classical matrix Lie groups — \(GL\), \(SL\), \(O\), \(SO\), \(U\), \(SU\), and \(SE(3)\) — as closed subgroups of the general linear group, and Cartan's theorem assures us that each one is a smooth manifold with smooth group operations. The summary table in the previous section makes a recurring pattern visible: each group is carved out of \(GL(n)\) by a set of nonlinear algebraic equations (\(A^\top A = I\), \(\det(A) = 1\), etc.).

A natural question arises: is there a systematic way to move between these nonlinear group-level constraints and simpler, linear conditions? The answer is yes, and the tool is the matrix exponential — a power series that converts matrices satisfying linear conditions (e.g., skew-symmetry \(A^\top = -A\)) into group elements satisfying the corresponding nonlinear ones (e.g., orthogonality \(e^A (e^A)^\top = I\)). In the next page, we define this exponential map, prove its fundamental properties, and use it to derive explicit formulas for rotations — including Rodrigues' rotation formula, the computational backbone of 3D rotation in robotics and computer graphics.

The Road to Lie Algebras

The matrix exponential will reveal that each Lie group \(G\) has an associated Lie algebra \(\mathfrak{g}\) — a vector space of "infinitesimal generators" equipped with an operation called the Lie bracket that encodes the group's non-commutativity at the linear level. This linearization is the key to making Lie groups computationally tractable: instead of working with nonlinear group elements, we work with their linear Lie algebra counterparts and exponentiate back when needed. The full development of this theory spans the next three pages:

The Matrix Exponential (next) — the bridge from linear to nonlinear.
Lie Algebras and the Lie Bracket — the tangent space at the identity and its algebraic structure.
The Lie Correspondence — how the algebra determines the group, the Baker-Campbell-Hausdorff formula, and the adjoint representations.