Algebraic Extensions

Introduction

In the previous chapter, we stood at a crossroads. Having established that we can always expand a field to find the roots of a polynomial, we must now ask: how do we measure the "size" or the "degrees of freedom" of this new, extended universe?

We now take the path toward continuous symmetry and geometry. Before we can understand the smooth, continuous dimensions of Lie groups and manifolds used in modern ML, we must first understand dimension in its purest, discrete algebraic form. When we adjoin a new element to a field, we are effectively adding a new "dimension" to a vector space.

Connection to Geometric Deep Learning (GDL)

In robotics and 3D computer vision, we constantly track degrees of freedom (for instance, a rigid body in \(SE(3)\) has 6 degrees of freedom). In abstract algebra, the "degree of an extension" tracks the exact dimensionality of a new field over its base. The rules governing how these discrete dimensions stack and multiply - specifically the Tower Rule - are the rigorous mathematical ancestors of dimension counting in the smooth manifolds and Lie groups we will study next.

Characterization of Extensions

When we expand a base field \(F\) by adding a new element \(a\) to create an extension \(E\), the structure of this new universe depends entirely on the nature of \(a\). Specifically, we must ask if \(a\) is bound by the algebraic rules of \(F\), or if it is entirely independent.

Definition: Algebraic & Transcendental Extensions

Let \(E\) be an extension field of a field \(F\). Then \(a \in E\) is called:

algebraic over \(F\) if \(a\) is the zero of some nonzero polynomial in \(F[x]\).
transcendental over \(F\) if \(a\) is not algebraic over \(F\).

An extension \(E\) of \(F\) is called an algebraic extension of \(F\) if every element of \(E\) is algebraic over \(F\). If \(E\) is not an algebraic extension of \(F\), it is called a transcendental extension of \(F\). Also, an extension of \(F\) of the form \(F(a)\) is called a simple extension of \(F\).

If an element is algebraic (like \(\sqrt{2}\) over \(\mathbb{Q}\)), it brings a finite, measurable amount of new information to the field. If it is transcendental (like \(\pi\) or \(e\)), it brings an infinite number of linear independent powers, blowing the dimension of the field up to infinity. Because we are building toward the study of finite-dimensional continuous groups, we care deeply about the algebraic case.

Theorem: Characterization of Extensions

Let \(E\) be an extension of the field \(F\) and let \(a \in E\). If \(a\) is transcendental over \(F\), then \(F(a) \cong F(x)\).

If \(a\) is algebraic over \(F\), then \(F(a) \cong F[x]/\langle p(x) \rangle\), where \(p(x)\) is the unique monic irreducible polynomial in \(F[x]\) of minimal degree such that \(p(a) = 0\).

Proof:

We treat the two cases separately.

Algebraic case. Since \(a\) is algebraic, the set \(I = \{f(x) \in F[x] \mid f(a) = 0\}\) is a nonzero ideal of \(F[x]\). As \(F[x]\) is a principal ideal domain, \(I = \langle p(x) \rangle\) for a unique monic polynomial \(p(x)\) of minimal degree (uniqueness: any two such generators differ by a unit, and the monic condition pins down the unit). This \(p(x)\) is irreducible — if \(p = gh\) with both factors of lower degree, then \(g(a)h(a) = 0\) in the field \(F(a)\) forces one factor to vanish at \(a\), contradicting the minimality of \(\deg p\). The isomorphism \(F(a) \cong F[x] / \langle p(x) \rangle\) then follows from the structure theorem for \(F(a)\) proved in the previous chapter.

Transcendental case. Consider the evaluation homomorphism \(\phi : F[x] \to F(a),\ f(x) \mapsto f(a)\). Since \(a\) is transcendental, \(\ker \phi = \{0\}\), so \(\phi\) is injective and \(F[x]\) embeds into \(F(a)\). The image \(\phi(F[x])\) is an integral domain inside the field \(F(a)\), so its field of fractions \(\operatorname{Frac}(\phi(F[x])) \cong F(x)\) embeds into \(F(a)\) as well. By minimality of \(F(a)\) as the smallest field containing \(F\) and \(a\), we conclude \(F(a) \cong F(x)\).

As established in the previous chapter, this isomorphism gives us a concrete data structure. But more importantly, it implies a strict limit on the size of the new field. This brings us to the most crucial property of algebraic extensions: their vector space dimension.

Finite Extensions & The Tower Rule

Here, abstract algebra beautifully intersects with Linear Algebra. Any extension field \(E\) can simply be viewed as a vector space over its base field \(F\). For example, the complex numbers \(\mathbb{C}\) form a 2-dimensional vector space over the real numbers \(\mathbb{R}\), with the basis \(\{1, i\}\).

Definition: Degree of an Extension

Let \(E\) be an extension field of a field \(F\). If \(E\) has dimension \(n\) as a vector space over \(F\), \(E\) has degree \(n\) over \(F\) and write \[ [E:F] = n. \] If \([E:F]\) is finite, \(E\) is called a finite extension of \(F\), and if not, \(E\) is called an infinite extension of \(F\).

A powerful realization is that restricting the vector space dimension strictly forces all elements inside the field to be algebraic. There is simply not enough "room" in a finite-dimensional space for an element to be transcendental.

Theorem: Finite Implies Algebraic

If \(E\) is a finite extension of the field \(F\), then \(E\) is an algebraic extension of \(F\).

Intuitively: if the vector space dimension is \(n\), then any sequence of \(n+1\) elements must be linearly dependent. For any element \(a \in E\), the \(n+1\) powers \(\{1, a, a^2, \dots, a^n\}\) must have some linear dependence relation \(c_0 + c_1a + \dots + c_na^n = 0\). This equation is exactly a polynomial over \(F\) that \(a\) satisfies, proving \(a\) is algebraic.

Proof:

Suppose that \(E\) is a finite extension of \(F\) with dimension \([E:F] = n\), and \(a \in E\). Consider the set of \(n+1\) elements: \(\{1, a, \ldots, a^n\}\). Since \(E\) is an \(n\)-dimensional vector space over \(F\), any set of more than \(n\) elements must be linearly dependent.

Thus, there exist elements \(c_0, c_1, \ldots, c_n \in F\) (not all zero) such that \[ c_n a^n + c_{n-1} a^{n-1} + \cdots + c_1 a + c_0 = 0. \] Clearly, then, \(a\) is a root of the nonzero polynomial \[ f(x) = c_n x^n + c_{n-1} x^{n-1} + \cdots + c_1 x + c_0. \] Therefore, by definition, \(a\) is algebraic over \(F\). Since this applies to any \(a \in E\), the entire extension \(E\) is algebraic.

Remark: The Converse Fails

The converse is false: an algebraic extension need not be finite. The standard counterexample is the field of all algebraic numbers \(\overline{\mathbb{Q}}\), consisting of every \(\alpha \in \mathbb{C}\) algebraic over \(\mathbb{Q}\). It is algebraic over \(\mathbb{Q}\) by construction, yet \([\overline{\mathbb{Q}} : \mathbb{Q}] = \infty\) — the elements \(\sqrt[n]{2}\) for \(n = 1, 2, 3, \ldots\) generate subfields of degrees \(1, 2, 3, \ldots\), so no finite \(\mathbb{Q}\)-basis can exist.

When building complex architectures - such as stacking layers in a deep neural network or composing transformations in a robotic arm - we need to know how the total dimensionality behaves. In field theory, we stack extensions. The Tower Rule tells us exactly how these discrete dimensions multiply.

Theorem: Tower Rule

If \(K\) is a finite extension of the field \(E\), and \(E\) is a finite extension of the field \(F\), then \(K\) is a finite extension of \(F\), and their degrees multiply: \[ [K:F] = [K:E][E:F]. \]

Proof:

Let \(X = \{x_1, x_2, \ldots, x_n\}\) be a basis for \(K\) over \(E\), and let \(Y = \{y_1, y_2, \ldots, y_m\}\) be a basis for \(E\) over \(F\). We claim that the set \[ YX = \{y_j x_i \mid 1 \leq j \leq m, \, 1 \leq i \leq n\} \] forms a basis for \(K\) over \(F\). To prove this, we must show that \(YX\) spans \(K\) over \(F\), and that \(YX\) is linearly independent over \(F\).

First, let \(a \in K\). Then there are elements \(b_1, b_2, \ldots, b_n \in E\) such that \[ a = b_1 x_1 + b_2 x_2 + \cdots + b_n x_n \] and, for each \(i = 1, 2, \ldots, n\), there are elements \(c_{i1}, c_{i2}, \ldots, c_{im} \in F\) such that \[ b_i = c_{i1} y_{1} + c_{i2} y_2 + \cdots + c_{im} y_{m}. \] Thus, \[ \begin{align*} a &= \sum_{i=1}^n b_i x_i \\\\ &= \sum_{i=1}^n \left(\sum_{j=1}^m c_{ij} y_j \right) x_i \\\\ &= \sum_{i, j} c_{ij} (y_j x_i). \end{align*} \] Thus, \(YX\) spans \(K\) over \(F\).

Next, suppose there are elements \(c_{ij} \in F\) such that \[ \begin{align*} 0 &= \sum_{i, j} c_{ij} (y_j x_i) \\\\ &= \sum_i \left(\sum_j (c_{ij} y_j)\right) x_i. \end{align*} \] Then, since each \(\sum_j c_{ij} y_j \in E\) and \(X\) is a basis for \(K\) over \(E\), for each \(i\), we have \[ \sum_j c_{ij} y_j = 0. \] However, each \(c_{ij} \in F\) and \(Y\) is a basis for \(E\) over \(F\), so each \(c_{ij} = 0\). Therefore, the set \(YX\) is linearly independent over \(F\).

The \(mn\) products \(y_j x_i\) are pairwise distinct: if \(y_{j_1} x_{i_1} = y_{j_2} x_{i_2}\) with \((j_1, i_1) \neq (j_2, i_2)\), the difference would give a nontrivial linear dependence among the elements of \(YX\), contradicting the independence just proved. Therefore \(YX\) is a basis for \(K\) over \(F\) of cardinality \(mn\), and we conclude \[ [K:F] = mn = [K:E][E:F]. \]

If we extend a field with multiple algebraic elements, say \(F(a, b)\), do we really need to track both independent dimensions? In many cases, no. The Primitive Element Theorem acts as a form of algebraic dimensionality reduction, proving that we can often "compress" multiple extensions into a single generated extension.

Theorem: Primitive Element Theorem

If \(F\) is a field of characteristic \(0\), and \(a\) and \(b\) are algebraic over \(F\), then there exists a single element \(c \in F(a, b)\) such that \(F(a, b) = F(c)\).

Proof Sketch:

Let \(p(x)\) and \(q(x)\) be the minimal polynomials of \(a\) and \(b\) over \(F\), with distinct zeros \(a_1 = a, a_2, \ldots, a_m\) and \(b_1 = b, b_2, \ldots, b_n\) in some extension \(K\) of \(F\). Since \(F\) has characteristic \(0\), \(F\) is infinite, so we can choose \(d \in F\) avoiding the finitely many values \((a_i - a)/(b - b_j)\) for \(i \geq 1, j > 1\). Set \(c = a + db\); the choice of \(d\) ensures that \(a_i \neq a + d(b - b_j)\) for all \(i \geq 1, j > 1\).

We claim \(F(a, b) = F(c)\). The inclusion \(F(c) \subseteq F(a, b)\) is immediate from \(c = a + db\). For the reverse, it suffices to show \(b \in F(c)\), since then \(a = c - db \in F(c)\) as well. Consider the polynomials \(q(x)\) and \(r(x) = p(c - dx)\) over \(F(c)\). Both vanish at \(x = b\), so their gcd in \(F(c)[x]\) is a polynomial of positive degree dividing \(q(x)\). The choice of \(d\) forces \(b\) to be the only common root of \(q\) and \(r\) in \(K\), so \(\gcd(q, r) = x - b\) up to a unit. Since gcds are computable in \(F(c)[x]\), we conclude \(b \in F(c)\), and therefore \(F(a, b) \subseteq F(c)\).

Example: Generator Compression and Data Packing

To see how the Primitive Element Theorem works as "algebraic data compression," let's look at a concrete numerical example. Consider the field \(E = \mathbb{Q}(\sqrt{2}, \sqrt{3})\), which is formed by adjoining both \(\sqrt{2}\) and \(\sqrt{3}\) to the rational numbers \(\mathbb{Q}\).

Multidimensional Structure:
By the Tower Rule applied to \(\mathbb{Q} \subset \mathbb{Q}(\sqrt{2}) \subset \mathbb{Q}(\sqrt{2}, \sqrt{3})\), \[ [\mathbb{Q}(\sqrt{2}, \sqrt{3}) : \mathbb{Q}] = [\mathbb{Q}(\sqrt{2}, \sqrt{3}) : \mathbb{Q}(\sqrt{2})] \cdot [\mathbb{Q}(\sqrt{2}) : \mathbb{Q}] = 2 \cdot 2 = 4. \] The basis is \(\{1, \sqrt{2}, \sqrt{3}, \sqrt{6}\}\), meaning it behaves like a 4-dimensional vector space over \(\mathbb{Q}\).
Compression via a "Primitive Element":
Surprisingly, we do not need to keep track of two separate generators. We can define a single, combined element \(c = \sqrt{2} + \sqrt{3}\). This single element is sufficient to generate the entire field: \[ \mathbb{Q}(\sqrt{2}, \sqrt{3}) = \mathbb{Q}(\sqrt{2} + \sqrt{3}). \]
Decoding the Information:
We can see this as a "lossless" encoding by showing that the original generators can be recovered entirely through polynomial operations on \(c\). Let's calculate the cube of \(c\): \[ \begin{align*} c^3 &= (\sqrt{2} + \sqrt{3})^3 = (\sqrt{2})^3 + 3(\sqrt{2})^2\sqrt{3} + 3\sqrt{2}(\sqrt{3})^2 + (\sqrt{3})^3 \\\\ &= 2\sqrt{2} + 6\sqrt{3} + 9\sqrt{2} + 3\sqrt{3} \\\\ &= 11\sqrt{2} + 9\sqrt{3}. \end{align*} \] Now, if we subtract \(9c\) from \(c^3\), the \(\sqrt{3}\) terms beautifully cancel out: \[ c^3 - 9c = (11\sqrt{2} + 9\sqrt{3}) - 9(\sqrt{2} + \sqrt{3}) = 2\sqrt{2}. \] Therefore, we can perfectly decode \(\sqrt{2}\) using the formula \(\sqrt{2} = \frac{1}{2}(c^3 - 9c)\). Once we have \(\sqrt{2}\), we easily get \(\sqrt{3} = c - \sqrt{2}\).

The CS Perspective:
This is perfectly analogous to lossless data packing. Instead of maintaining two separate variables (or allocating two separate memory addresses for \(\sqrt{2}\) and \(\sqrt{3}\)), we can apply a specific weighting to sum them into a single variable (\(c\)) without any loss of structural information. The Primitive Element Theorem guarantees that such a lossless flattening is always possible for finite extensions (in characteristic 0).

Properties of Algebraic Extensions

In algorithm design and mathematical modeling, closure is a critical property. It ensures that an operation applied to valid inputs always produces a valid output. Before we can transition to studying smooth manifolds and continuous groups, we must guarantee that our algebraic building blocks are perfectly stable.

Theorem: Subfield of Algebraic Elements

Let \(E\) be an extension field of the field \(F\). Then the set of all elements in \(E\) that are algebraic over \(F\) is a subfield of \(E\).

This means that if you take two algebraic numbers and add, subtract, multiply, or divide them, the result is guaranteed to also be an algebraic number. Finally, this closure applies not just to elements within a single extension, but to the extensions themselves.

Proof:

Suppose that \(a, b \in E\) are algebraic over \(F\) and \(b \neq 0\). To show they form a subfield, we must show that all \(a+b\), \(a-b\), \(ab\) and \(a/b\) are algebraic over \(F\). To do this, it suffices to show that the extension degree \([F(a, b):F]\) is finite, because each of these four elements is contained within \(F(a, b)\).

Since \(b\) is algebraic over \(F\), the simple extension \(F(b)\) has finite degree over \(F\): by the Characterization of Extensions and the structure theorem for \(F(b)\), \([F(b) : F]\) equals the degree of the minimal polynomial of \(b\). Since \(a\) is algebraic over \(F\), it satisfies the same nonzero polynomial when viewed over the larger field \(F(b)\), so \(a\) is algebraic over \(F(b)\) as well. Applying the same argument to the simple extension \(F(b)(a) = F(a, b)\), we see that \([F(a, b) : F(b)]\) is finite.

Here, by the Tower Rule, we know that \[ [F(a, b):F] = [F(a, b):F(b)][F(b):F]. \] Since both terms on the right are finite, \([F(a,b) : F]\) is finite. Because Finite Implies Algebraic, every element inside \(F(a,b)\) must be algebraic over \(F\).

Since field operations are closed, \(a+b\), \(a-b\), \(ab\), and \(a/b\) are all in \(F(a,b)\). Therefore, they are all guaranteed to be algebraic over \(F\). The algebraic elements form a subfield of \(E\).

Theorem: Algebraic over Algebraic is Algebraic

If \(K\) is an algebraic extension of \(E\), and \(E\) is an algebraic extension of \(F\), then \(K\) is an algebraic extension of \(F\).

Proof:

Let \(\alpha \in K\). Since \(K\) is algebraic over \(E\), \(\alpha\) satisfies a nonzero polynomial \(f(x) = e_n x^n + e_{n-1} x^{n-1} + \cdots + e_0\) with coefficients \(e_0, e_1, \ldots, e_n \in E\). Consider the intermediate field \[ E_0 = F(e_0, e_1, \ldots, e_n). \] Each \(e_i\) is algebraic over \(F\) (because \(E\) is algebraic over \(F\)). Adjoining the \(e_i\) one at a time and applying the previous theorem at each step, the chain \[ F \subset F(e_0) \subset F(e_0, e_1) \subset \cdots \subset F(e_0, e_1, \ldots, e_n) = E_0 \] consists of finite extensions; hence \([E_0 : F]\) is finite by the Tower Rule.

Now \(\alpha\) is algebraic over \(E_0\), since \(f(x)\) lies in \(E_0[x]\) by construction; hence \([E_0(\alpha) : E_0]\) is also finite. Applying the Tower Rule, \[ [E_0(\alpha) : F] = [E_0(\alpha) : E_0] \cdot [E_0 : F] < \infty. \] Since \(\alpha \in E_0(\alpha)\) and \([E_0(\alpha) : F]\) is finite, Finite Implies Algebraic yields that \(\alpha\) is algebraic over \(F\). As \(\alpha \in K\) was arbitrary, \(K\) is algebraic over \(F\).

These theorems prove that the "algebraic universe" is closed. We can stack algebraic extensions on top of each other safely without accidentally falling into infinite-dimensional (transcendental) chaos. With this secure, discrete foundation of dimensions and extensions established, we are now ready to transition to continuous geometry, starting with the discrete symmetries of polygons and evolving into the continuous Lie Groups of 3D space: \(SO(3)\) and \(SE(3)\).

Loading...

Introduction

Connection to Geometric Deep Learning (GDL)

Characterization of Extensions

Finite Extensions & The Tower Rule

Remark: The Converse Fails

Example: Generator Compression and Data Packing

Properties of Algebraic Extensions