Rings
In the previous pages, we developed the theory of groups - algebraic structures with a single
binary operation satisfying closure, associativity, identity, and inverse properties. Groups
capture symmetry and have wide applications from cryptography to physics. However, when we
consider our most familiar number systems - the integers \(\mathbb{Z}\), the rationals
\(\mathbb{Q}\), or the reals \(\mathbb{R}\) - we notice that they come equipped with
two operations: addition and multiplication. A group structure alone cannot
capture this richer algebraic behavior. This observation motivates the study of rings:
algebraic structures that combine an Abelian group under addition with a second operation (multiplication)
that interacts with addition through the distributive laws.
In the context of computer science, ring theory is directly relevant to several applications:
-
Modular arithmetic: The ring \(\mathbb{Z}_n\) underlies cryptographic
systems like RSA, hash functions, and pseudorandom number generators. Understanding when
\(\mathbb{Z}_n\) has zero divisors (when \(n\) is composite) versus when it is a field
(when \(n\) is prime) is essential for cryptographic security.
-
Error-correcting codes: Cyclic redundancy checks (CRC), Reed-Solomon
codes, and BCH codes are all built on polynomial rings over finite fields. The algebraic
structure enables efficient encoding and decoding algorithms.
-
Computer algebra systems: Software like Mathematica, Maple, and SageMath
implement algorithms for polynomial factorization, GCD computation, and solving systems
of polynomial equations - all of which rely on ring-theoretic foundations.
Definition: Ring
A
ring \(R\) is a set with two binary operations, addition(\(a + b\)) and multiplication(\(ab\)),
such that for all \(a, b, c\) in \(R\):
- Commutativity of addition: \(a + b = b + a\).
- Associativity of addition: \((a + b) + c = a + (b + c)\).
- Additive identity: There exists an element \(0 \in R\) such that \(a + 0 = a\).
- Additive inverses: There exists an element \(-a \in R\) such that \(a + (-a) = 0\).
- Associativity of multiplication: \(a(bc) = (ab)c\).
- Distributive laws: \(a(b + c) = ab + ac\) and \((b + c)a = ba + ca\).
Thus, a ring is an Abelian group under addition, coupled
with an associative multiplication that is left and right distributive over addition. Note that multiplication
need not be commutative.
Furthermore, a multiplicative identity is not required by the general definition. If a ring
possesses a multiplicative identity \(1\) such that \(1a = a1 = a\) for all \(a \in R\), we call the ring a
ring with unity. It is important to note that the existence of a
multiplicative identity is independent of commutativity. A ring can have a unity without being commutative.
For example, in the ring \(M_2(\mathbb{Z})\), the identity matrix \(I_2\) exists, but \(AB \neq BA\) for
most matrices \(A, B \in M_2(\mathbb{Z})\).
Even in a commutative ring with unity, a nonzero element is not required
to have a multiplicative inverse. An element \(a\) that possesses an inverse \(a^{-1}\)
(satisfying \(aa^{-1} = a^{-1}a = 1\)) is called a unit.
Note that in many rings, such as \(\mathbb{Z}\),
most elements fail to be units.
Remark (Uniqueness of unity and of multiplicative inverses)
In any ring with unity, the unity element \(1\) is unique: if \(1\) and \(1'\) are both multiplicative
identities, then \(1 = 1 \cdot 1' = 1'\).
Likewise, if \(a \in R\) is a unit, its multiplicative inverse is unique: if \(b\) and \(b'\) both
satisfy \(ab = ba = 1\) and \(ab' = b'a = 1\), then
\(b = b \cdot 1 = b(ab') = (ba)b' = 1 \cdot b' = b'\).
While units allow us to perform division-like operations via multiplicative
inverses, we often need to describe the relationship between elements even when an inverse
does not exist. This leads us to the concept of divisibility in a ring.
Let \(R\) be a commutative ring, and let \(a, b \in R\) where \(a \neq 0\). We say that
\(a\) divides \(b\) (written \(a \mid b\)) if there exists an element
\(c \in R\) such that \(b = ac\). If no such \(c\) exists, we write \(a \nmid b\).
Remark (\(U(n)\) as the unit group of \(\mathbb{Z}_n\))
Previously, we defined the group of units modulo \(n\),
denoted as \(U(n)\), as the set of integers:
\[
\{a \in \mathbb{Z} \mid 0 < a < n, \gcd(a, n) = 1\}.
\]
In the language of ring theory, this is exactly the group of units of the ring
\(\mathbb{Z}_n\). That is:
\[
U(\mathbb{Z}_n) = U(n).
\]
Let's observe some simple examples for rings.
- \(\mathbb{Z}\): the set of integers under ordinary addition and multiplication forms
a commutative ring with unity 1.
Note that the only units are \(1\) and \(-1\): (\(U(\mathbb{Z}) = \{-1, 1\}\).)
By contrast, as we know, matrix multiplication is not commutative. Thus,
- \(M_2(\mathbb{Z})\): the set of \(2 \times 2\) matrices with integer entries forms a noncommutative ring with unity
\[
I_2 = \begin{bmatrix} 1 & 0 \\ 0 & 1 \\ \end{bmatrix}.
\]
Moreover, the group of units of \(M_n(\mathbb{Z})\) is denoted
\[
U(M_n(\mathbb{Z})) = GL(n, \mathbb{Z})
\]
and is called the general linear group of degree \(n\) over \(\mathbb{Z}\).
As a concrete illustration of the unit concept in a noncommutative ring, the following theorem
characterizes its elements explicitly.
Theorem: Units of \(M_n(\mathbb{Z})\)
A matrix \(A \in M_n(\mathbb{Z})\) is a unit if and only if \(\det A = \pm 1\).
Proof:
First, suppose \(\det A = \pm 1\). Since \(\det A \neq 0\), the matrix \(A\), viewed as a matrix over \(\mathbb{Q}\), is invertible,
and its inverse over \(\mathbb{Q}\) is given by the adjugate formula
\[
A^{-1} = \frac{1}{\det A} \operatorname{adj}(A),
\]
where \(\operatorname{adj}(A)\) is the
adjugate matrix (the transpose of the cofactor matrix).
Since \(A\) has integer entries, each cofactor is an integer, so \(\operatorname{adj}(A) \in M_n(\mathbb{Z})\).
Because \(\det A = \pm 1\), we have \(\frac{1}{\det A} = \pm 1 \in \mathbb{Z}\), and thus
the right-hand side lies in \(M_n(\mathbb{Z})\). This integer matrix satisfies
\(A \cdot A^{-1} = A^{-1} \cdot A = I_n\) (the identity already holds over \(\mathbb{Q}\) and
is preserved by inclusion \(M_n(\mathbb{Z}) \hookrightarrow M_n(\mathbb{Q})\)). Therefore, \(A\) is a unit in \(M_n(\mathbb{Z})\).
Next, suppose \(A\) is a unit in \(M_n(\mathbb{Z})\), so there exists \(B \in M_n(\mathbb{Z})\)
with \(AB = I_n\). Taking determinants of both sides:
\[
\det(AB) = \det(I_n) \implies (\det A)(\det B) = 1.
\]
Since \(\det A\) and \(\det B\) are both integers and their product is \(1\), the only possibilities
are \(\det A = \det B = 1\) or \(\det A = \det B = -1\). In either case, \(\det A = \pm 1\).
Theorem: Rules of Multiplication
Let \(a\), \(b\), and \(c\) belong to a ring \(R\). Then:
- \(a0 = 0a = 0\).
- \( a(-b) = (-a)b = -(ab)\).
- \((-a)(-b) = ab\).
- \(a(b - c) = ab - ac\) and \((b - c)a = ba - ca\).
Furthermore, if \(R\) has a unity element \(1\), then:
- \((-1)a = -a\).
- \((-1)(-1) = 1\).
Proof:
Each identity follows from the ring axioms by short manipulations. We sketch the first; the rest are analogous.
(\(a0 = 0\)). Since \(0 + 0 = 0\), the distributive law gives \(a0 = a(0 + 0) = a0 + a0\).
Adding \(-(a0)\) to both sides yields \(0 = a0\). The identity \(0a = 0\) follows by the symmetric argument
using right distributivity.
(\(a(-b) = -(ab)\)). By distributivity and the previous identity,
\(0 = a \cdot 0 = a(b + (-b)) = ab + a(-b)\), so \(a(-b)\) is the additive inverse of \(ab\);
that is, \(a(-b) = -(ab)\). Similarly, \((-a)b = -(ab)\).
(\((-a)(-b) = ab\)). Apply the previous identity twice:
\((-a)(-b) = -(a(-b)) = -(-(ab)) = ab\), the last step using that the additive inverse of an additive inverse
returns the original element.
(\(a(b - c) = ab - ac\)). By definition \(b - c = b + (-c)\), so
\(a(b - c) = a(b + (-c)) = ab + a(-c) = ab - ac\). The right-distributive version is symmetric.
(Unity case: \((-1)a = -a\)). By the second identity with \(b = 1\),
\((-1)a = -(1 \cdot a) = -a\). Setting \(a = -1\) in this gives \((-1)(-1) = -(-1) = 1\).
Just as subgroups play a critical role in group theory, subrings are equally
important in ring theory.
Definition: Subring
A subset \(S\) of a ring \(R\) is a subring of \(R\) if \(S\) is itself
a ring under the same operations as \(R\).
Just as we have the One-Step Subgroup Test in group theory, there is an analogous test for subrings.
Convention. Following Gallian-style abstract algebra, we adopt the general
definition of a ring (without requiring unity). Consequently, a subring need not contain the unity
of the parent ring; closure under subtraction and multiplication suffices. (We discuss the alternative
"ring with unity" convention after the test.)
Theorem: Subring Test
A nonempty subset \(S\) of a ring \(R\) is a subring if \(S\) is closed under
subtraction and multiplication, that is,
\[
\forall a, b \in S, \quad (a - b) \in S \text{ and } ab \in S.
\]
Proof:
Suppose \(S \neq \emptyset\) is closed under subtraction and multiplication.
We must verify all ring axioms.
Additive structure (Abelian group): Since \(S \neq \emptyset\), there exists
some \(s_0 \in S\). By closure under subtraction, \(s_0 - s_0 = 0 \in S\), establishing the additive
identity. For any \(a \in S\), we have \(0 - a = -a \in S\), establishing additive inverses
(here \(-a\) denotes the additive inverse in \(R\), which we have just shown lies in \(S\)). For any
\(a, b \in S\), since \(-b \in S\), we have \(a - (-b) = a + b \in S\), establishing closure
under addition. Associativity and commutativity of addition are inherited from \(R\).
Multiplicative structure: Closure under multiplication is given by hypothesis,
and associativity of multiplication is inherited from \(R\).
Distributivity: Since \(S \subseteq R\), the distributive laws applied to elements of \(S\)
hold automatically as instances of the distributive laws in \(R\).
Therefore, \(S\) is a subring of \(R\).
The elegance of this test lies in how closure under subtraction encapsulates three requirements
simultaneously: existence of the additive identity (via \(a - a = 0\)), existence of additive
inverses (via \(0 - a = -a\)), and closure under addition (via \(a - (-b) = a + b\)).
Recall from the definition that a ring is an Abelian group under addition. Thus, the Subring
Test consists of two parts: the One-Step Subgroup Test
for the additive structure, and closure under multiplication.
Example: \(\mathbb{Z}\) is a subring of \(\mathbb{Q}\)
First, we recognize that the rational numbers \(\mathbb{Q}\) form a commutative ring with unity \(1\)
under standard addition and multiplication. Now, let's see if the subset \(\mathbb{Z} \subset \mathbb{Q}\) inherits this status:
- Nonempty: \(0 \in \mathbb{Z}\), so \(\mathbb{Z} \neq \emptyset\).
- Closed under subtraction: For any \(a, b \in \mathbb{Z}\), \(a - b \in \mathbb{Z}\).
- Closed under multiplication: For any \(a, b \in \mathbb{Z}\), \(ab \in \mathbb{Z}\).
By the Subring Test, \(\mathbb{Z}\) is a subring of \(\mathbb{Q}\).
Note on \(2\mathbb{Z}\). Under our convention (general ring, no unity required),
the set of even integers \(2\mathbb{Z}\) is a subring of \(\mathbb{Z}\).
Some textbooks adopt the alternative convention that all rings must have unity, in which case a subring is
required to share the parent's unity \(1\); under that stricter convention, \(2\mathbb{Z}\) would fail because \(1 \notin 2\mathbb{Z}\).
For most systems in Computer Science and Cryptography, we primarily work with rings with unity,
but adopting the looser definition here keeps the Subring Test maximally applicable.
Example: \(\mathbb{Q}\) is a subring of \(\mathbb{R}\)
Next, we consider the set of real numbers \(\mathbb{R}\), which is a commutative ring with unity \(1\).
We apply the Subring Test to verify that the set of rational numbers \(\mathbb{Q} \subset \mathbb{R}\) is a subring.
-
Nonempty: \(0 \in \mathbb{Q}\), so \(\mathbb{Q} \neq \emptyset\).
(Note: \(1 \in \mathbb{Q}\) also confirms it's nonempty and preserves the unity.)
-
Closed under subtraction: For any \(\frac{a}{b}, \frac{c}{d} \in \mathbb{Q}\) where \(a, b, c, d \in \mathbb{Z}\) and \(b, d \neq 0\):
\[
\frac{a}{b} - \frac{c}{d} = \frac{ad - bc}{bd} \in \mathbb{Q}
\]
Since \(ad - bc \in \mathbb{Z}\) and \(bd \in \mathbb{Z} \setminus \{0\}\), the result is a rational number.
-
Closed under multiplication: For any \(\frac{a}{b}, \frac{c}{d} \in \mathbb{Q}\):
\[
\frac{a}{b} \cdot \frac{c}{d} = \frac{ac}{bd} \in \mathbb{Q}
\]
Since \(ac \in \mathbb{Z}\) and \(bd \in \mathbb{Z} \setminus \{0\}\), the product is a rational number.
By the Subring Test, \(\mathbb{Q}\) is a subring of \(\mathbb{R}\).
Integral Domains
In the hierarchy of algebraic structures, integral domains represent a crucial step
between general rings and fields, designed to capture the reliable arithmetic properties of the integers.
Definition: Zero-Divisors
A zero-divisor is a nonzero element \(a\) of a commutative ring \(R\) such that
there exists a nonzero element \(b \in R\) with \(ab = 0\).
Intuitively, a zero-divisor is a nonzero element that "collapses" another nonzero element to zero
through multiplication — pathological behavior absent in \(\mathbb{Z}\). Banning zero-divisors
entirely yields the next structural refinement:
Definition: Integral Domain
An integral domain is a commutative ring with unity and no zero-divisors.
The standard ring of integers \(\mathbb{Z}\) is the prototypical integral domain.
In contrast, \(\mathbb{Z}_6\) is not an integral domain: the elements \(2\) and \(3\)
are both nonzero, yet
\[
2 \cdot 3 = 6 \equiv 0 \pmod{6}.
\]
Thus \(2\) and \(3\) are zero-divisors in \(\mathbb{Z}_6\), exactly as foreshadowed in the introduction:
\(\mathbb{Z}_n\) acquires zero-divisors precisely when \(n\) is composite. (The matrix ring
\(M_2(\mathbb{Z})\) also exhibits the analogous "collapsing" phenomenon, but our formal definition
above is restricted to commutative rings; we revisit the noncommutative case in later chapters.)
The absence of zero-divisors grants integral domains a powerful property that general rings lack:
the ability to cancel common factors.
Theorem: Cancellation Law
Let \(a, b, c\) be elements of an integral domain. If \(a \neq 0\) and \(ab = ac\), then \(b = c\).
Proof:
From \(ab = ac\) we obtain \(ab - ac = 0\), and by the distributive law (see
Rules of Multiplication)
this gives
\[
a(b - c) = 0.
\]
Since \(a \neq 0\) and the integral domain has no
zero-divisors,
the equation \(a(b - c) = 0\) forces \(b - c = 0\), i.e., \(b = c\).
This theorem reveals the primary motivation behind the definition: the integral-domain axioms are
engineered to isolate the core properties that make \(\mathbb{Z}\) so reliable. By excluding
zero-divisors, integral domains ensure that multiplication never results in an irreversible loss
of information. This structure provides the necessary foundation for algebra before moving toward
even more restricted systems, such as Fields, where division is fully defined.
In the next section, the cancellation law itself will play a decisive role: combined with finiteness,
it forces an integral domain to already be a field.
Fields
We now arrive at the most strictly defined algebraic structure in this hierarchy.
While integral domains ensure that cancellation is valid, fields go one step further
by guaranteeing that every nonzero element has a multiplicative inverse (i.e., unit).
This creates a system where division is fully defined for all nonzero elements.
Definition: Field
A field is a commutative ring with unity where every nonzero element is a unit.
Theorem: A Field is an Integral Domain
Every field is an integral domain. In particular, a field has no zero-divisors.
Proof:
Let \(F\) be a field. By definition of a field,
\(F\) is a commutative ring with unity, so we need only verify the absence of zero-divisors.
Suppose \(a, b \in F\) with \(a \neq 0\) and \(ab = 0\). Since every nonzero element of \(F\) is a unit,
\(a^{-1}\) exists. Multiplying both sides of \(ab = 0\) on the left by \(a^{-1}\) and applying
associativity together with the unity property gives
\[
b = (a^{-1}a)\, b = a^{-1}(ab) = a^{-1} \cdot 0 = 0,
\]
where the last equality uses Rules of Multiplication.
Hence whenever a product is zero in a field, at least one factor must be zero — there are no zero-divisors.
Combined with commutativity and unity, this confirms that \(F\) is an integral domain.
This theorem reveals the structural elegance of fields: alongside the additive abelian group \((F, +)\)
inherited from the ring axioms, the absence of zero-divisors ensures that multiplication restricts to a
well-defined operation on the nonzero elements \(F \setminus \{0\}\), where every element is a unit.
Thus a field carries two intertwined group structures, connected by the distributive law.
The most familiar examples of fields are the number systems we use in calculus and analysis.
In these sets, division by any nonzero number is always possible.
- The Rational Numbers (\(\mathbb{Q}\)):
Includes all fractions \(p/q\) where \(q \neq 0\). If \(a \in \mathbb{Q}\) is nonzero, its inverse \(1/a\) is also in \(\mathbb{Q}\).
- The Real Numbers (\(\mathbb{R}\)):
The standard field for continuous mathematics. Every nonzero real number \(x\) has a multiplicative inverse \(1/x\).
- The Complex Numbers (\(\mathbb{C}\)):
Even though \(\mathbb{C}\) contains "imaginary" numbers, it is a field because every nonzero complex number \(a + bi\) has an inverse \(\frac{a - bi}{a^2 + b^2}\) within the set.
Note that the integers \(\mathbb{Z}\) do not form a field. While \(\mathbb{Z}\) is an integral
domain (no zero-divisors), the only elements with multiplicative inverses are \(1\) and \(-1\). For instance,
\(2 \in \mathbb{Z}\), but its inverse \(1/2 \notin \mathbb{Z}\).
A natural question arises: do we always need to check for multiplicative inverses explicitly?
For infinite rings like \(\mathbb{Z}\), we know that the absence of zero-divisors does not guarantee
the existence of inverses. However, when the ring is finite, the situation changes
dramatically. Finiteness prevents elements from "escaping" to infinity without cycling back, forcing
the structure to be more rigid.
Theorem: A Finite Integral Domain is a Field
A finite integral domain is a field.
Proof:
Let \(D\) be a finite integral domain with unity \(1\). To show that \(D\) is a field, we must show
that every nonzero element \(a \in D\) has a multiplicative inverse (i.e., unit).
If \(a = 1\), it is its own inverse, so we may henceforth assume \(a \neq 1\).
(This assumption will be used below to rule out a degenerate case.)
Consider the sequence of powers of \(a \in D\):
\[
a, a^2, a^3, a^4, \dots
\]
Since \(D\) is finite, the Pigeonhole Principle implies that the sequence
\(a, a^2, a^3, \dots\) cannot consist of distinct elements indefinitely.
Thus, there must be two positive integers \(i\) and \(j\) such that:
\[
i > j \text{ and } a^i = a^j.
\]
(This equality tells us that after a certain number of steps, the sequence has returned to a previously visited state.)
We can rewrite this as:
\[
\begin{align*}
a^i &= a^j \\\\
a^i - a^j &= 0 \\\\
a^j(a^{i-j} - 1) &= 0,
\end{align*}
\]
where the last step uses the distributive law (see Rules of Multiplication).
We claim \(a^j \neq 0\). This follows by induction on \(j\): the base case \(j = 1\) gives
\(a^1 = a \neq 0\) by hypothesis; for the inductive step, assume \(a^{j-1} \neq 0\), and observe
that if \(a^j = a^{j-1} \cdot a\) were zero, then with both factors nonzero one of them would be a
zero-divisor, contradicting the integral-domain hypothesis. The
Cancellation Law applied to \(a^j(a^{i-j} - 1) = 0\) then yields
\[
a^{i-j} - 1 = 0, \quad \text{i.e.,} \quad a^{i-j} = 1.
\]
This result shows that in a finite integral domain, the powers of any element always cycle back to the unity \(1\).
Since we assumed \(a \neq 1\), we cannot have \(i - j = 1\) (which would force \(a^1 = a = 1\),
contradicting our assumption). Combined with \(i > j\), this gives \(i - j \geq 2\).
We then factor out one \(a\):
\[
a \cdot a^{i-j-1} = 1
\]
Therefore, \(a^{i-j-1}\) is the inverse of \(a\).
This theorem is the theoretical bedrock of public-key cryptography. It guarantees that as long as we work with
integers modulo a prime \(p\) (denoted as \(\mathbb{Z}_p\)), we are working in a field.
In practical terms, this means:
- Guaranteed Inverses: Every nonzero element \(a \in \mathbb{Z}_p\) has a unique modular inverse \(a^{-1}\).
- Solvability: We can solve linear equations like \(ax \equiv b \pmod p\) simply by computing \(x \equiv a^{-1}b \pmod p\).
- Algorithmic Reliability: Without these properties, algorithms like RSA (which relies on the multiplicative group of units) and Elliptic Curve Cryptography (defined over finite fields) would not function.
By ensuring the modulus is prime, we transform a simple set of remainders into a fully functional Field,
providing the mathematical precision required for modern security.
Theorem: \(\mathbb{Z}_p\) is a Field
For any prime \(p\), the ring \(\mathbb{Z}_p\) of integers modulo \(p\) is a field.
Proof:
The set \(\mathbb{Z}_p = \{0, 1, \dots, p-1\}\) contains exactly \(p\) elements, so
\(\mathbb{Z}_p\) is a finite commutative ring with unity. It therefore
suffices to show that \(\mathbb{Z}_p\) is an integral domain — that is, has no zero-divisors.
Suppose \(a, b \in \mathbb{Z}_p\) with \(ab \equiv 0 \pmod{p}\), so \(p \mid ab\).
Since \(p\) is prime, Euclid's Lemma (any prime dividing a product divides
at least one of its factors) yields \(p \mid a\) or \(p \mid b\), i.e., \(a = 0\) or
\(b = 0\) in \(\mathbb{Z}_p\). Hence \(\mathbb{Z}_p\) has no zero-divisors and is an
integral domain.
Being a finite integral domain, \(\mathbb{Z}_p\) is a field by the
A Finite Integral Domain is a Field
theorem.
Let's verify the field properties of \(\mathbb{Z}_5 = \{0, 1, 2, 3, 4\}\).
According to our theorem, since \(5\) is prime, every non-zero element must have a multiplicative inverse.
Multiplicative Inverses in \(\mathbb{Z}_5\)
| Element (\(a\)) |
Calculation (\(a \cdot x \equiv 1 \pmod 5\)) |
Inverse (\(a^{-1}\)) |
| 1 |
\(1 \cdot 1 = 1\) |
1 |
| 2 |
\(2 \cdot 3 = 6 \equiv 1\) |
3 |
| 3 |
\(3 \cdot 2 = 6 \equiv 1\) |
2 |
| 4 |
\(4 \cdot 4 = 16 \equiv 1\) |
4 |
Because we found a unique inverse for every non-zero element, \(\mathbb{Z}_5\) behaves as a field.
In a computer program, this means we can perform "division" by \(3\) simply by
multiplying by \(2\).
In a standard program, dividing \(4\) by \(3\) would result in a floating-point number (\(1.333...\)), introducing potential rounding errors.
However, in the field \(\mathbb{Z}_5\), "dividing by \(3\)" is equivalent to "multiplying by the inverse of \(3\)," which is \(2\).
Thus, we calculate:
\[
4 \div 3 \equiv 4 \cdot 3^{-1} \equiv 4 \cdot 2 = 8 \equiv 3 \pmod 5
\]
This allows algorithms to remain purely integer-based and 100% precise, which is the fundamental reason why finite fields are used in
cryptography and error-correction codes.
Characteristic of a Ring
We have seen that in standard number systems like \(\mathbb{Z}\) or \(\mathbb{R}\), adding the unity element \(1\)
to itself repeatedly (\(1 + 1 + 1 + \dots\)) never results in zero. However, in modular arithmetic rings
like \(\mathbb{Z}_n\), repeated addition eventually cycles back to zero (e.g., in \(\mathbb{Z}_5\), \(1+1+1+1+1 = 5 \equiv 0\)).
This distinction - whether a ring behaves like "integers" (infinite growth) or "clock arithmetic" (cyclic)
- is captured by the concept of the characteristic.
Convention. Throughout this section, for a positive integer \(n\) and a ring element \(x\),
the expression \(n \cdot x\) denotes the integer multiple
\[
n \cdot x = \underbrace{x + x + \cdots + x}_{n \text{ times}},
\]
the additive analogue of the multiplicative power \(x^n\) (cf. the additive-notation conventions
introduced in our earlier groups material). We extend this to \(0 \cdot x = 0\) and
\((-n) \cdot x = n \cdot (-x)\). The interaction with ring multiplication satisfies the
distributive identity
\[
(s \cdot a)(t \cdot b) = (st) \cdot (ab) \qquad \text{for all positive integers } s, t \text{ and all } a, b \in R,
\]
obtained by iterated application of the ring's distributive law.
Definition: Characteristic of a Ring
The characteristic of a ring \(R\) is the least positive integer \(n\) such that
\[
\forall x \in R, \, n \cdot x = 0.
\]
If no such integer exists, we say that \(R\) has characteristic \(0\). The characteristic
of \(R\) is denoted by \(\operatorname{char} R\).
For example, \(\operatorname{char} \mathbb{Z} = 0\) and \(\operatorname{char} \mathbb{Z}_n = n\). In general, we have following theorems.
Theorem: Characteristic of a Ring with Unity
Let \(R\) be a ring with unity \(1\). If \(1\) has infinite order under addition, then the characteristic of \(R\) is \(0\).
If \(1\) has order \(n\) under addition, then the characteristic of \(R\) is \(n\).
Proof:
Let \(R\) be a ring with unity \(1\).
Case 1: \(1\) has infinite order under addition.
By definition, if \(1\) has infinite order, there is no positive integer \(n\) such that \(n \cdot 1 = 0\).
Since the characteristic is defined as the least such positive integer \(n\) that satisfies \(n \cdot x = 0\) for all \(x\),
and no such \(n\) exists even for the element \(1\), it follows that \(\operatorname{char} R = 0\).
Case 2: \(1\) has additive order \(n\).
Suppose \(1\) has additive order \(n\). Then \(n\) is the least positive integer such that \(n \cdot 1 = 0\).
For any arbitrary element \(x \in R\), we have
\[
\begin{align*}
n \cdot x &= \underbrace{x + x + \cdots + x}_{n \text{ times}} \\\\
&= \underbrace{(1 \cdot x) + (1 \cdot x) + \cdots + (1 \cdot x)}_{n \text{ times}} \\\\
&= (\underbrace{1 + 1 + \cdots + 1}_{n \text{ times}}) \cdot x \\\\
&= (n \cdot 1) \cdot x \\\\
&= 0 \cdot x \\\\
&= 0.
\end{align*}
\]
This demonstrates that \(n \cdot x = 0\) for all \(x \in R\), so by the definition of the
characteristic, \(\operatorname{char} R \leq n\). Conversely, any positive integer \(m\) with
\(m \cdot x = 0\) for all \(x \in R\) must in particular satisfy \(m \cdot 1 = 0\); since \(n\)
is the least such positive integer for \(x = 1\) (the additive order of \(1\)),
we have \(m \geq n\), giving \(\operatorname{char} R \geq n\). Combining both inequalities,
\(\operatorname{char} R = n\).
Theorem: Characteristic of an Integral Domain
The characteristic of an integral domain is either \(0\) or prime.
Proof:
Let \(D\) be an integral domain with unity \(1\). If \(1\) has infinite additive order, then
\(\operatorname{char} D = 0\) by the Characteristic of a Ring with Unity theorem.
Now, suppose \(\operatorname{char} D = n\), where \(n > 0\). By the convention that an integral
domain contains at least two distinct elements (so \(0 \neq 1\); the trivial ring \(\{0\}\) is
excluded), we have \(1 \cdot 1 = 1 \neq 0\), so \(n \neq 1\) and hence \(n \geq 2\).
We will prove that \(n\) must be prime by contradiction.
Assume \(n\) is composite such that \(n = st\), where \(s\) and \(t\) are integers such that \(1 < s < n\) and \(1 < t < n\).
Recall the property of scalar multiplication in rings: \((s \cdot a)(t \cdot b) = (st) \cdot (ab)\) for any \(a, b \in D\).
By setting both \(a\) and \(b\) to the unity element \(1\), we can evaluate the product:
\[
0 = n \cdot 1 = (st) \cdot 1 = (s \cdot 1)(t \cdot 1).
\]
Since \(D\) is an integral domain, it has no zero-divisors. Therefore, the equation
\((s \cdot 1)(t \cdot 1) = 0\) implies that:
\[
s \cdot 1 = 0 \quad \text{or} \quad t \cdot 1 = 0.
\]
However, this contradicts the definition of the characteristic \(n\) as the least positive integer such that
\(n \cdot 1 = 0\). Since both \(s\) and \(t\) are strictly less than \(n\), neither \( s \cdot 1 \) nor \(t \cdot 1\)
can be zero. Therefore, \(n\) cannot be composite and must be prime.
In digital systems, fields of characteristic 2 (such as \(GF(2^n)\), the finite field with \(2^n\) elements,
which we will construct systematically in the upcoming chapters on extension fields and finite fields) are important.
They are the backbone of modern technologies like AES encryption, QR code error correction,
and RAID-6 storage. The reason lies in how perfectly their arithmetic aligns with "binary" logic.
In a field of characteristic 2, the definition \(1 + 1 = 0\) leads to a unique property. For any element \(a\) in the field:
\[
a + a = a(1 + 1) = a \cdot 0 = 0
\]
Since \(a + a = 0\), the additive inverse of \(a\) (the element you add to \(a\) to get \(0\)) is simply \(a\) itself.
Mathematically, this means:
\[
a = -a.
\]
This identity leads to two dramatic simplifications in computing:
-
Unification of Addition and Subtraction:
Since \(a = -a\), there is no distinction between adding and subtracting.
Calculating \(A - B\) is identical to \(A + B\). This means we do not need to implement separate algorithms for subtraction.
-
Hardware Efficiency (XOR Logic):
In these fields, addition does not involve "carrying" digits. Because there are no carries, the operation is strictly bitwise.
In computer architecture, addition and subtraction in \(GF(2^n)\) are performed using XOR (Exclusive OR) gates.
For a developer, this means that in characteristic 2 fields, A + B and A - B are both implemented
as A ^ B in code. This allows cryptographic algorithms to run at extremely high speeds by replacing heavy arithmetic
with simple bitwise logic.
Perspective: The Crossroads of Discrete and Continuous
The study of fields marks a pivotal moment in our curriculum. The characteristic
of a field acts as a structural switch between two complementary mathematical worlds — and this
very dichotomy organizes the entire site.
The Discrete / Continuous Dichotomy
Fields of characteristic \(p\) (such as \(\mathbb{Z}_p\) and \(GF(2^n)\))
are exact, periodic, and reversible — they belong to The Discrete World
of the site, where Discrete Mathematics and the algebraic side of
Probability live, ultimately powering cryptography and error correction.
The standard fields of characteristic \(0\) — most importantly \(\mathbb{R}\)
and \(\mathbb{C}\) — carry rich analytic structures: completeness and
limits. (The rationals \(\mathbb{Q}\), though also of characteristic \(0\),
lack completeness: this is precisely why \(\mathbb{R}\) was constructed.) These fields
belong to The Continuous World, where Calculus & Analysis,
continuous probability, and the gradient-based methods of Machine Learning
live. Without a complete real field, the convergence theory underlying gradient descent
could not even be formulated.
Modern computation lives in the tension between these worlds: hardware operates in
characteristic \(p\), while the algorithms we deploy assume characteristic \(0\).
Translating between the two is one of the recurring themes of applied mathematics — and
of this site.
To fully grasp the construction of these fields, we must first master the structural mechanics
of rings. In the next chapter, we introduce Ideals and Factor Rings — the essential
blueprints used to partition algebraic structures and forge the fields that power our digital world.