Section 5.2 The Hahn-Banach Theorem
Once we have a linear space \(X\text{,}\) it is natural to consider linear equations. A linear equation writes as
\begin{equation*}
\eta(v)=c,
\end{equation*}
where \(\eta:X\to\bR\) (resp. \(\eta:X\to\bC\) in case of complex scalars) is a linear map (also called covector) and \(c\in\bR\) (resp. \(c\in\bC\)). Recall that the set of all covectors is denoted by \(X^*\text{.}\) In finite dimension, given a basis \(e_1,\dots,e_n\) of \(X\text{,}\) \(v = v^1 e_1 + \dots v^n e_n\) and the equation writes
\begin{equation*}
\eta_1 v^1 + \dots + \eta_n v^n = c,
\end{equation*}
where we set \(\eta_i=\eta(e_i)\) for all \(i=1,2,\dots\text{.}\) Of course such equation is always solvable. Things get more interesting considering more than one equation. Consider a collection \(\eta^{(i)}, i\in I\) of covectors and a corresponding collection \(c^{(i)}, i\in I\) of scalars. Is the system
\begin{equation*}
\eta^{(i)}(v) = c^{(i)},\qquad i\in I
\end{equation*}
solvable? Now, recall that the elements of \(X\) can be considered as linear maps on \(X^*\text{,}\) namely that each \(v\in X\) determines uniquely a map \(v:X^*\to\bR\) defined by \(v(\eta) = \eta(v)\text{.}\) Then, after switching the roles of vectors and covectors, when \(X\) is reflexive, one can reformulate the system above as follows: given a collection of vectors \(v^{(i)}\text{,}\) find a covector \(\eta\in X^*\) such that
\begin{equation*}
\eta(v^{(i)}) = c^{(i)},\qquad i\in I.
\end{equation*}
There is an obvious obstruction to the existence of \(\eta\text{,}\) namely the \(c^{(i)}\) must be compatible with the fact that \(\eta\) is a function and therefore cannot have more than one value on any vector. Hence, given two distinct real numbers \(c,d\text{,}\) the pair of equations \(\eta(v)=c, \eta(v)=d\) has no solution. The Hahn-Banach theorem shows that the one above is the only obstruction to the existence of a solution. The proof consists in a finite-dimensional inductive step, that therefore solves the problem for any Banach space with a Schauder basis. For all other cases, the claim is proved by a standard application of the Zorn Lemma, that will discuss in some length below. Definition 5.2.1.
Given a real vector space \(X\text{,}\) a sublinear function on \(X\) is a function \(p:X\to\bR\) such that:- \(p(\lambda v)=\lambda p(v)\) for every \(v\in X\) and \(\lambda\geq0\text{;}\)
- \(p(v+u)\leq p(v)+p(u)\) for every \(v,u\in X\text{.}\)
Lemma 5.2.2. One-dimensional dominated extension theorem.
Let \(X\) be a real vector space, \(p\) a sublinear function on \(X\text{,}\) \(M\) a proper linear subspace of \(X\text{.}\) Assume that \(\eta\in M^*\) is dominated by \(p\text{.}\) Then, for any \(v\in X\setminus M\text{,}\) there is a \(p\)-dominated extension \(\eta'\in (M\oplus\bR v)^*\) of \(\eta\text{.}\)Proof.
\begin{equation*}
\eta_b(m\oplus \lambda v) = \eta(m) + \lambda b
\end{equation*}
is an extension of \(\eta\) on \((M\oplus\bR v)^*\text{.}\) We need to prove that there are \(b\) such that \(\eta_b\) is dominated by \(p\text{.}\) We use the fact that
\begin{equation*}
\eta(m) - \eta(n) = \eta(m-n) \leq p(m-n) = p(m+v-v-n)\leq p(m+v) + p(-v-n).
\end{equation*}
Indeed, then
\begin{equation*}
-p(-n-v)-\eta(n)\leq p(m+v) - \eta(m)\text{ for all }m,n\in M
\end{equation*}
and so
\begin{equation*}
\sup_{n\in M} -p(-n-v)-\eta(n) \leq \inf_{m\in M} p(m+v) - \eta(m).
\end{equation*}
We claim that, for any \(b\) such that
\begin{equation*}
\sup_{n\in M} -p(-n-v)-\eta(n) \leq b \leq \inf_{m\in M} p(m+v) - \eta(m),
\end{equation*}
\(\eta_b\) is dominated by \(p\text{.}\) Indeed, for any \(\lambda\neq0\) and \(m\in M\text{,}\)
\begin{equation*}
-p(-\frac{m}{\lambda}-v)-\eta(\frac{m}{\lambda})\leq b\leq p(\frac{m}{\lambda}+v) - \eta(\frac{m}{\lambda}).
\end{equation*}
Assume \(\lambda>0\text{.}\) Then
\begin{equation*}
p(\frac{m}{\lambda}+v) - \eta(\frac{m}{\lambda}) = \frac{1}{\lambda}\left(p(m+\lambda v) - \eta(m)\right),
\end{equation*}
so that
\begin{equation*}
\lambda b\leq p(m+\lambda v) - \eta(m)
\end{equation*}
and similarly for \(\lambda<0\text{.}\)Definition 5.2.3.
Let \(X\) be a partially ordered set. A subset \(C\) of \(P\) is a chain if any two elements of \(C\) are comparable. An upper bound of a subset \(Q\) of \(P\) is an element \(\bar q\in P\) such that \(q\leq \bar q\) for all \(q\in Q\text{.}\)Proposition 5.2.4. Zorn's Lemma.
Let \(P\) be a non-empty partially ordered set such that all chains in \(P\) have an upper bound. Then \(P\) has a maximal element.Proposition 5.2.5. Hausdorff maximal principle.
Let \(P\) be a partially ordered set. Then every chain of \(P\) is contained in a maximal chain (i.e. a chain that is not strictly contained in some other chain).Proposition 5.2.6. Axiom of Choice.
For any collection \(X\) of non-empty sets, there exists a function on \(X\) that maps each set in \(X\) to an element of that set.Theorem 5.2.7. Hahn-Banach.
Let \(X\) be a real vector space, \(p\) a sublinear function on \(X\text{,}\) \(M\) a proper linear subspace of \(X\text{.}\) Assume that \(\eta\in M^*\) is dominated by \(p\text{.}\) Then, for any \(v\in X\setminus M\text{,}\) there is a \(p\)-dominated extension \(\eta'\in X^*\) of \(\eta\text{.}\)Proof.
\begin{equation*}
\eta(v) = \begin{pmatrix}\eta_1\amp\dots\amp\eta_n\end{pmatrix} \begin{pmatrix}v^1\\\dots\\v^n\\\end{pmatrix} = \eta_1v^1+\dots+\eta_nv^n.
\end{equation*}
In particular, this shows that, for each
\begin{equation*}
v=\begin{pmatrix}v^1\\\dots\\v^n\\\end{pmatrix},
\end{equation*}
there is a covector
\begin{equation*}
\eps_v=\begin{pmatrix}v^1\amp\dots\amp v^n\end{pmatrix}
\end{equation*}
with the following properties:
\begin{equation*}
\eps_v(v)=\|v\|^2,\qquad\|\eps_v\|=\|v\|.
\end{equation*}
Hence, by setting \(\eta_v=\eps_v/\|v\|\text{,}\) we see that
\begin{equation*}
\eta_v(v)=\|v\|,\qquad\|\eta_v\|=1.
\end{equation*}
Next corollary shows that a linear functional with the same properties is in the dual of every topological vector space. Corollary 5.2.8.
Let \(X\) be a real vector space. Then, for any non-zero \(v\in X\text{,}\) there is a \(\eta\in X^*\) with \(\|\eta\|=1\) such that \(\eta(v)=\|v\|\text{.}\)Proof.
\begin{equation*}
\eps(\lambda v) = \lambda \|v\| \leq |\lambda|\cdot\|v\| = \|\lambda v\|,
\end{equation*}
namely \(\eps\) is dominated by the norm of \(X\text{,}\) which is sublinear. Hence, by Hahn-Banach theorem, there is an \(\eta\in X^*\) that extends \(\eps\) and is dominated by the norm of \(X\) as well. This means that
\begin{equation*}
\eta(v)=\|v\|\text{ and }\eta(v)\leq\|v\|\text{ for all }v\in X.
\end{equation*}
Thus,
\begin{equation*}
\|\eta(v)\| = \sup_{v\in X, v\neq0}\frac{|\eta(v)|}{\|v\|} = \frac{|\eta(v)|}{\|v\|} = 1.
\end{equation*}
Corollary 5.2.9.
Let \(X\) be a real normed space. Then, for every \(x\in X\text{,}\)
\begin{equation*}
\|x\|_X = \max_{\|\eta\|_{X^*}}|\eta(x)|.
\end{equation*}
Corollary 5.2.10.
Let \(X\) be a real vector space, \(M\) a proper linear subspace of \(X\text{.}\) Assume that \(\eta\in M^*\) is bounded. Then there is a linear functional \(\bar\eta\in X^*\) such that \(\|\bar\eta\|_{X^*} = \|\eta\|_{M^*}\text{.}\)Corollary 5.2.11. Dual separating theorem.
Let \(x,y\in X\text{.}\) If \(\eta(x) = \eta(y)\) for all \(\eta\in X^*\) then \(x=y\text{.}\)Proof.
\begin{equation*}
p(\alpha x +\beta y) = \|\alpha x\|
\end{equation*}
is a sublinear function and
\begin{equation*}
\eps(\alpha x +\beta y) = \alpha \|x\| \leq p(\alpha x +\beta y).
\end{equation*}
\(\eps(x) = \|x\|\neq \eps(y) =0.\) Hence, by the Hahn-Banach Theorem, we can extend \(\eps\) to a linear form \(\eta\in X^*\text{.}\) Since \(\eps(x) = \|x\|\neq \eps(y) =0\text{,}\) then \(\eta(x)\neq \eta(y)\text{,}\) contradicting the assumption. Hence \(x=y\text{.}\)Theorem 5.2.12.
\(J\) embeds isometrically \(X\) in \(X^{**}\text{.}\)Proof.
\begin{equation*}
\|J_v\|_{X^{**}} = \sup_{\|\eta\|_{X^*}=1}|J_v(\eta)|.
\end{equation*}
Since \(|J_v(\eta)|\leq \|\eta\|_{X^*}\|v\|_X,\) this means that
\begin{equation*}
\|J_v\|_{X^{**}}\leq\|v\|_X.
\end{equation*}
By Corollary 5.2.8, for every \(v\in X\) there is an \(\eta\in X^*\) such that \(\eta(v)=\|v\|_X\) and \(\|\eta\|_{X^*}=1\text{,}\) so \(\|J_v\|_{X^{**}}=\|v\|_X\text{.}\)Definition 5.2.13.
We say that \(X\) is reflexive if the map \(J\) is surjective.Proposition 5.2.14.
Let \(X\) be a normed space, \(v^{(1)},\dots,v^{(n)}\) a collection of linearly independent vectors and \(c^{(1)},\dots,c^{(n)}\) a list of scalars. Then there exists a covector \(\eta\in X^*\) such that
\begin{equation*}
\eta(v^{(i)}) = c^{(i)},\qquad i=1,\dots,n.
\end{equation*}
