Extreme Points
The separation theory of the previous development expressed every closed convex set as an intersection of closed half-spaces — a description from the outside, by the functionals that bound the set. There is a dual question, from the inside: which points of a convex set are irreducible, in the sense that they cannot be manufactured by averaging other points of the set? A vertex of a polygon is such a point; a point on the interior of an edge is not, since it is the midpoint of two of its neighbors. The theorem of Krein and Milman, the goal of this page, will show that for a compact convex set in a locally convex space these irreducible points always exist and suffice to recover the entire set as their closed convex hull.
We work over a scalar field \(\mathbb{F}\), either \(\mathbb{R}\) or \(\mathbb{C}\). The definition of an extreme point is purely linear-algebraic: it refers only to the vector-space structure of the ambient space, not to any topology. Recall that an open line segment between two points \(x_1\) and \(x_2\) of a vector space is the set \((x_1, x_2) = \{\, t x_2 + (1 - t) x_1 : 0 \lt t \lt 1 \,\}\), and that such a segment is proper when its endpoints are distinct, \(x_1 \neq x_2\).
Let \(K\) be a convex subset of a vector space \(\mathcal{X}\). A point \(a \in K\) is an extreme point of \(K\) if there is no proper open line segment that contains \(a\) and lies entirely in \(K\). The set of all extreme points of \(K\) is denoted \(\operatorname{ext} K\).
Equivalently, \(a\) is extreme when it admits no representation \(a = t x_2 + (1 - t) x_1\) with \(x_1, x_2 \in K\), \(0 \lt t \lt 1\), and \(x_1 \neq x_2\): the only way to write \(a\) as an interior point of a segment in \(K\) is the trivial one, in which both endpoints coincide with \(a\). For the closed unit disc \(\{(x, y) : x^2 + y^2 \leq 1\}\) in \(\mathbb{R}^2\), the extreme points are exactly the boundary circle \(\{x^2 + y^2 = 1\}\): no point of the open disc is extreme, since it sits inside a small segment, while no boundary point lies in the interior of any segment contained in the disc. For a closed convex polygon, the extreme points are precisely its vertices. A closed half-plane, a line, and the whole space have no extreme points at all: every point lies in the interior of some segment.
Faces and the Removal Criterion
The proof of the Krein-Milman theorem relies on a reformulation of extremality: \(a\) is extreme exactly when deleting it leaves a convex set. This recasting lets the topological machinery of the theorem engage with the algebraic definition. We collect it with the other standard equivalent forms.
Let \(K\) be a convex subset of a vector space \(\mathcal{X}\) and let \(a \in K\). The following statements are equivalent.
(a) \(a \in \operatorname{ext} K\).
(b) If \(x_1, x_2 \in \mathcal{X}\) and \(a = \tfrac{1}{2}(x_1 + x_2)\), then either
\(x_1 \notin K\), or \(x_2 \notin K\), or \(x_1 = x_2 = a\).
(c) If \(x_1, x_2 \in \mathcal{X}\), \(0 \lt t \lt 1\), and \(a = t x_1 + (1 - t) x_2\), then
either \(x_1 \notin K\), or \(x_2 \notin K\), or \(x_1 = x_2 = a\).
(d) If \(x_1, \dots, x_n \in K\) and \(a \in \operatorname{co}\{x_1, \dots, x_n\}\), then
\(a = x_k\) for some \(k\).
(e) \(K \setminus \{a\}\) is a convex set.
We prove (c) \(\Leftrightarrow\) (a), (c) \(\Rightarrow\) (b) \(\Rightarrow\) (c), (c) \(\Rightarrow\) (d) \(\Rightarrow\) (b), and (a) \(\Leftrightarrow\) (e).
(c) \(\Leftrightarrow\) (a).
A proper open line segment in \(K\) containing \(a\) is precisely a representation
\(a = t x_1 + (1 - t) x_2\) with \(x_1, x_2 \in K\), \(0 \lt t \lt 1\), and \(x_1 \neq x_2\). Statement (a)
denies the existence of such a representation; statement (c) says that any representation
\(a = t x_1 + (1 - t) x_2\) with \(0 \lt t \lt 1\) and both \(x_1, x_2 \in K\) forces \(x_1 = x_2\), in which
case \(x_1 = x_2 = a\). The two assertions are contrapositives of one another, hence equivalent.
(c) \(\Rightarrow\) (b).
Statement (b) is the special case \(t = \tfrac{1}{2}\) of (c).
(b) \(\Rightarrow\) (c).
Suppose (b) holds and let \(a = t x_1 + (1 - t) x_2\) with \(x_1, x_2 \in K\) and \(0 \lt t \lt 1\); we show
\(x_1 = x_2 = a\). Assume \(x_1 \neq x_2\), seeking a contradiction. The point \(a\) lies strictly between
\(x_1\) and \(x_2\) on the line through them; parametrize that line as
\(z(s) = x_2 + s(x_1 - x_2)\), so \(z(0) = x_2\), \(z(1) = x_1\), and \(a = z(t)\) with \(0 \lt t \lt 1\).
Choose \(\delta \gt 0\) small enough that \(0 \lt t - \delta\) and \(t + \delta \lt 1\), and put
\(u = z(t - \delta)\), \(v = z(t + \delta)\). Both \(u\) and \(v\) are convex combinations of \(x_1, x_2 \in K\),
so \(u, v \in K\) by convexity, and \(\tfrac{1}{2}(u + v) = z(t) = a\) with \(u \neq v\) (as
\(x_1 \neq x_2\) and \(\delta \gt 0\)). This is a representation of \(a\) as a midpoint of two distinct points
of \(K\), contradicting (b). Hence \(x_1 = x_2\), and then \(a = t x_1 + (1 - t) x_1 = x_1\), giving
\(x_1 = x_2 = a\).
(c) \(\Rightarrow\) (d).
We induct on \(n\). For \(n = 1\), \(a \in \operatorname{co}\{x_1\} = \{x_1\}\) gives \(a = x_1\). Suppose the
implication holds for \(n - 1\) points, and let \(a = \sum_{k=1}^{n} \lambda_k x_k\) with \(x_k \in K\),
\(\lambda_k \geq 0\), and \(\sum_k \lambda_k = 1\). If some \(\lambda_k = 0\) the representation uses at most
\(n - 1\) points and the inductive hypothesis applies. If some \(\lambda_k = 1\) then \(a = x_k\) and we are
done. Otherwise every \(\lambda_k \in (0, 1)\). Write
\[
a = \lambda_n x_n + (1 - \lambda_n)\, w,
\qquad
w = \sum_{k=1}^{n-1} \frac{\lambda_k}{1 - \lambda_n}\, x_k ,
\]
where the coefficients of \(w\) are nonnegative and sum to \(1\), so \(w \in \operatorname{co}\{x_1, \dots,
x_{n-1}\} \subseteq K\) by convexity. Now \(a = \lambda_n x_n + (1 - \lambda_n) w\) with
\(0 \lt \lambda_n \lt 1\) and \(x_n, w \in K\); by (c), \(x_n = w = a\). In particular \(a = x_n\).
(d) \(\Rightarrow\) (b).
If \(a = \tfrac{1}{2}(x_1 + x_2)\) with \(x_1, x_2 \in K\), then \(a \in \operatorname{co}\{x_1, x_2\}\), so by
(d) \(a = x_1\) or \(a = x_2\); either way \(\tfrac{1}{2}(x_1 + x_2) = a\) forces \(x_1 = x_2 = a\). Thus the
hypothesis of (b) with \(x_1, x_2 \in K\) yields \(x_1 = x_2 = a\), which is the conclusion of (b) in the
remaining case.
(a) \(\Rightarrow\) (e).
Suppose \(a\) is extreme; we show \(K \setminus \{a\}\) is convex. Let \(y_1, y_2 \in K \setminus \{a\}\) and
\(0 \lt t \lt 1\), and set \(z = t y_1 + (1 - t) y_2\). Since \(K\) is convex, \(z \in K\). Suppose, for
contradiction, that \(z = a\). Then \(a = t y_1 + (1 - t) y_2\) with \(y_1, y_2 \in K\) and
\(0 \lt t \lt 1\); extremality (via the equivalent form (c)) forces \(y_1 = y_2 = a\), contradicting
\(y_1 \neq a\). Hence \(z \neq a\), so \(z \in K \setminus \{a\}\), and \(K \setminus \{a\}\) is convex.
(e) \(\Rightarrow\) (a).
Suppose \(K \setminus \{a\}\) is convex but, for contradiction, \(a\) is not extreme. Then there exist
\(x_1, x_2 \in K\) with \(x_1 \neq x_2\), \(0 \lt t \lt 1\), and \(a = t x_1 + (1 - t) x_2\). Since
\(x_1 \neq x_2\), at most one of them equals \(a\); we claim neither does. If \(x_1 = a\), then
\(a = t a + (1 - t) x_2\) gives \((1 - t) a = (1 - t) x_2\), so \(x_2 = a\) (as \(1 - t \neq 0\)),
contradicting \(x_1 \neq x_2\); symmetrically \(x_2 \neq a\). Thus \(x_1, x_2 \in K \setminus \{a\}\). By
convexity of \(K \setminus \{a\}\), the point \(a = t x_1 + (1 - t) x_2\) lies in \(K \setminus \{a\}\), i.e.
\(a \neq a\) — absurd. Hence \(a\) is extreme.
The equivalence of (a) and (e) is the form we will exploit: \(a\) is extreme precisely when \(K \setminus \{a\}\) is convex. Searching for an extreme point therefore becomes a search for a maximal proper relatively open convex subset of \(K\), whose complement, as we will see, is a single extreme point — an object Zorn's lemma produces once compactness keeps the union of a chain of such sets proper.