FUNCTIONAL ANALYSIS
1
Douglas N. Arnold
2
References:
John B. Conway,
A Course in Functional Analysis
, 2nd Edition, Springer-Verlag, 1990.
Gert K. Pedersen,
Analysis Now
, Springer-Verlag, 1989.
Walter Rudin,
Functional Analysis
, 2nd Edition, McGraw Hill, 1991.
Robert J. Zimmer,
Essential Results of Functional Analysis
, University of Chicago Press,
1990.
CONTENTS
I. Vector spaces and their topology
:::::::::::::::::::::::::::::::::::::::::::::::
2
Subspaces and quotient spaces
::::::::::::::::::::::::::::::::::::::::::::
4
Basic properties of Hilbert spaces
:::::::::::::::::::::::::::::::::::::::::
5
II. Linear Operators and Functionals
::::::::::::::::::::::::::::::::::::::::::::::
9
The Hahn{Banach Theorem
::::::::::::::::::::::::::::::::::::::::::::::
10
Duality
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
10
III. Fundamental Theorems
:::::::::::::::::::::::::::::::::::::::::::::::::::::::
14
The Open Mapping Theorem
:::::::::::::::::::::::::::::::::::::::::::::
14
The Uniform Boundedness Principle
::::::::::::::::::::::::::::::::::::::
15
The Closed Range Theorem
::::::::::::::::::::::::::::::::::::::::::::::
16
IV. Weak Topologies
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
18
The weak topology
:::::::::::::::::::::::::::::::::::::::::::::::::::::::
18
The weak* topology
::::::::::::::::::::::::::::::::::::::::::::::::::::::
19
V. Compact Operators and their Spectra
::::::::::::::::::::::::::::::::::::::::
22
Hilbert{Schmidt operators
:::::::::::::::::::::::::::::::::::::::::::::::
22
Compact operators
:::::::::::::::::::::::::::::::::::::::::::::::::::::::
23
Spectral Theorem for compact self-adjoint operators
::::::::::::::::::::::
26
The spectrum of a general compact operator
:::::::::::::::::::::::::::::
28
VI. Introduction to General Spectral Theory
::::::::::::::::::::::::::::::::::::::
31
The spectrum and resolvent in a Banach algebra
:::::::::::::::::::::::::
31
Spectral Theorem for bounded self-adjoint operators
::::::::::::::::::::::
35
1
These lecture notes were prepared for the instructor's personal use in teaching a half-semester course
on functional analysis at the beginning graduate level at Penn State, in Spring 1997. They are certainly
not meant to replace a good text on the subject, such as those listed on this page.
2
Department of Mathematics, Penn State University, University Park, PA 16802.
Email: dna@math.psu.edu. Web: http://www.math.psu.edu/dna/.
Typeset by
A
M
S
-TEX
1
2
I. Vector spaces and their topology
Basic denitions: (1) Norm and seminorm on vector spaces (real or complex). A norm
denes a Hausdor topology on a vector space in which the algebraic operations are con-
tinuous, resulting in a
normed linear space
. If it is complete it is called a Banach space.
(2) Inner product and semi-inner-product. In the real case an inner product is a positive
denite, symmetric bilinear form on
X X
!
R
. In the complex case it is positive denite,
Hermitian symmetric, sesquilinear form
X X
!
C
. An (semi) inner product gives rise
to a (semi)norm. An inner product space is thus a special case of a normed linear space.
A complete inner product space is a Hilbert space, a special case of a Banach space.
The polarization identity expresses the norm of an inner product space in terms of the
inner product. For real inner product spaces it is
(
xy
) = 14(
k
x
+
y
k
2
;
k
x
;
y
k
2
)
:
For complex spaces it is
(
xy
) = 14(
k
x
+
y
k
2
+
i
k
x
+
iy
k
2
;
k
x
;
y
k
2
;
i
k
x
;
iy
k
2
)
:
In inner product spaces we also have the parallelogram law:
k
x
+
y
k
2
+
k
x
;
y
k
2
= 2(
k
x
k
2
+
k
y
k
2
)
:
This gives a criterion for a normed space to be an inner product space. Any norm coming
from an inner product satises the parallelogram law and, conversely, if a norm satises the
parallelogram law, we can show (but not so easily) that the polarization identity denes
an inner product, which gives rise to the norm.
(3) A
topological vector space
is a vector space endowed with a Hausdor topology such
that the algebraic operations are continuous. Note that we can extend the notion of Cauchy
sequence, and therefore of completeness, to a TVS: a sequence
x
n
in a TVS is Cauchy if
for every neighborhood
U
of 0 there exists
N
such that
x
m
;
x
n
2
U
for all
mn
N
.
A normed linear space is a TVS, but there is another, more general operation involving
norms which endows a vector space with a topology. Let
X
be a vector space and suppose
that a family
fk
k
g
2A
of seminorms on
X
is given which are
su cient
in the sense that
T
fk
x
k
= 0
g
= 0. Then the topology generated by the sets
fk
x
k
< r
g
,
2
A
,
r >
0,
makes
X
a TVS. A sequence (or net)
x
n
converges to
x
i
k
x
n
;
x
k
!
0 for all
. Note
that, a fortiori,
j
k
x
n
k
;
k
x
k
j
!
0, showing that each seminorm is continuous.
If the number of seminorms is nite, we may add them to get a norm generating the
same topology. If the number is countable, we may dene a metric
d
(
xy
) =
X
n
2
;
n
k
x
;
y
k
n
1 +
k
x
;
y
k
n
3
so the topology is metrizable.
Examples: (0) On
R
n
or
C
n
we may put the
l
p
norm, 1
p
1
, or the weighted
l
p
norm with some arbitrary positive weight. All of these norms are equivalent (indeed
all norms on a nite dimensional space are equivalent), and generate the same Banach
topology. Only for
p
= 2 is it a Hilbert space.
(2) If is a subset of
R
n
(or, more generally, any Hausdor space) we may dene the
space
C
b
() of bounded continuous functions with the supremum norm. It is a Banach
space. If
X
is compact this is simply the space
C
() of continuous functions on .
(3) For simplicity, consider the unit interval, and dene
C
n
(0
1]) and
C
n
(0
1]),
n
2
N
,
2
(0
1]. Both are Banach spaces with the natural norms.
C
0
1
is the space of
Lipschitz functions.
C
(0
1])
C
0
C
0
C
1
(0
1]) if 0
<
1.
(4) For 1
p <
1
and an open or closed subspace of
R
n
(or, more generally, a
-nite
measure space), we have the space
L
p
() of equivalence classes of measurable
p
-th power
integrable functions (with equivalence being equality o a set of measure zero), and for
p
=
1
equivalence classes of essentially bounded functions (bounded after modication
on a set of measure zero). For 1
< p <
1
the triangle inequality is not obvious, it is
Minkowski's inequality. Since we modded out the functions with
L
p
-seminorm zero, this
is a normed linear space, and the Riesz-Fischer theorem asserts that it is a Banach space.
L
2
is a Hilbert space. If meas()
<
1
, then
L
p
()
L
q
() if 1
q
p
1
.
(5) The sequence space
l
p
, 1
p
1
is an example of (4) in the case where the
measure space is
N
with the counting measure. Each is a Banach space.
l
2
is a Hilbert
space.
l
p
l
q
if 1
p
q
1
(note the inequality is reversed from the previous example).
The subspace
c
0
of sequences tending to 0 is a closed subspace of
l
1
.
(6) If is an open set in
R
n
(or any Hausdor space), we can equip
C
() with the
norms
f
7!
j
f
(
x
)
j
indexed by
x
2
. This makes it a TVS, with the topology being that
of pointwise convergence. It is not complete (pointwise limit of continuous functions may
not be continuous).
(7) If is an open set in
R
n
we can equip
C
() with the norms
f
7!
k
f
k
L
1
(
K
)
indexed
by compact subsets of , thus dening the topology of uniform convergence on compact
subsets. We get the same toplogy by using only the countably many compact sets
K
n
=
f
x
2
:
j
x
j
n
dist(
x@
)
1
=n
g
:
The topology is complete.
(8) In the previous example, in the case is a region in
C
, and we take complex-
valued functions, we may consider the subspace
H
() of holomorbarphic functions. By
Weierstrass's theorem it is a closed subspace, hence itself a complete TVS.
(9) If
fg
2
L
1
(
I
),
I
= (0
1) and
Z
1
0
f
(
x
)
(
x
)
dx
=
;
Z
1
0
g
(
x
)
0
(
x
)
dx
4
for all innitely dierentiable
with support contained in
I
(so
is identically zero near
0 and 1), then we say that
f
is weakly dierentiable and that
f
0
=
g
. We can then dene
the
Sobolev space
W
1
p
(
I
) =
f
f
2
L
p
(
I
) :
f
0
2
L
p
(
I
)
g
, with the norm
k
f
k
W
1
p
(
I
)
=
Z
1
0
j
f
(
x
)
j
p
dx
+
Z
1
0
j
f
0
(
x
)
j
p
dx
1
=p
:
This is a larger space than
C
1
(
I
), but still incorporates rst order dierentiability of
f
.
The case
p
= 2 is particularly useful, because it allows us to deal with dierentiability
in a Hilbert space context. Sobolev spaces can be extended to measure any degree of
dierentiability (even fractional), and can be dened on arbitrary domains in
R
n
.
Subspaces and quotient spaces.
If
X
is a vector space and
S
a subspace, we may dene the vector space
X=S
of cosets.
If
X
is normed, we may dene
k
u
k
X=S
= inf
x
2
u
k
x
k
X
, or equivalently
k
x
k
X=S
= inf
s
2
S
k
x
;
s
k
X
:
This is a seminorm, and is a norm i
S
is closed.
Theorem.
If
X
is a Banach space and
S
is a closed subspace then
S
is a Banach space
and
X=S
is a Banach space.
Sketch.
Suppose
x
n
is a sequence of elements of
X
for which the cosets
x
n
are Cauchy.
We can take a subsequence with
k
x
n
;
x
n
+1
k
X=S
2
;
n
;1
,
n
= 1
2
:::
. Set
s
1
= 0, dene
s
2
2
S
such that
k
x
1
;
(
x
2
+
s
2
)
k
X
1
=
2, dene
s
3
2
S
such that
k
(
x
2
+
s
2
)
;
(
x
3
+
s
3
)
k
X
1
=
4,
:::
. Then
f
x
n
+
s
n
g
is Cauchy in
X :::
A converse is true as well (and easily proved).
Theorem.
If
X
is a normed linear space and
S
is a closed subspace such that
S
is a
Banach space and
X=S
is a Banach space, then
X
is a Banach space.
Finite dimensional subspaces are always closed (they're complete). More generally:
Theorem.
If
S
is a closed subspace of a Banach space and
V
is a nite dimensional
subspace, then
S
+
V
is closed.
Sketch.
We easily pass to the case
V
is one-dimensional and
V
\
S
= 0. We then have that
S
+
V
is algebraically a direct sum and it is enough to show that the projections
S
+
V
!
S
and
S
+
V
!
V
are continuous (since then a Cauchy sequence in
S
+
V
will lead to a
Cauchy sequence in each of the closed subspaces, and so to a convergent subsequence).
Now the projection
:
X
!
X=S
restricts to a 1-1 map on
V
so an isomorphism of
V
onto
its image
V
. Let
:
V
!
V
be the continuous inverse. Since
(
S
+
V
)
V
, we may form
the composition
j
S
+
V
:
S
+
V
!
V
and it is continuous. But it is just the projection
onto
V
. The projection onto
S
is
id
;
, so it is also continuous.
5
Note.
The sum of closed subspaces of a Banach space need not be closed. For a coun-
terexample (in a separable Hilbert space), let
S
1
be the vector space of all real sequences
(
x
n
)
1
n
=1
for which
x
n
= 0 if
n
is odd, and
S
2
be the sequences for which
x
2
n
=
nx
2
n
;1
,
n
= 1
2
:::
. Clearly
X
1
=
l
2
\
S
1
and
X
2
=
l
2
\
S
2
are closed subspaces of
l
2
, the space
of square integrable sequences (they are dened as the intersection of the null spaces of
continuous linear functionals). Obviously every sequence can be written in a unique way
as sum of elements of
S
1
and
S
2
:
(
x
1
x
2
:::
) = (0
x
2
;
x
1
0
x
4
;
2
x
3
0
x
6
;
3
x
5
:::
) + (
x
1
x
1
x
3
2
x
3
x
5
3
x
5
:::
)
:
If a sequence has all but nitely many terms zero, so do the two summands. Thus all
such sequences belong to
X
1
+
X
2
, showing that
X
1
+
X
2
is dense in
l
2
. Now consider the
sequence (1
0
1
=
2
0
1
=
3
:::
)
2
l
2
. Its only decomposition as elements of
S
1
and
S
2
is
(1
0
1
=
2
0
1
=
3
0
:::
) = (0
;
1
0
;
1
0
;
1
:::
) + (1
1
1
=
2
1
1
=
3
1
:::
)
and so it does not belong to
X
1
+
X
2
. Thus
X
1
+
X
2
is not closed in
l
2
.
Basic properties of Hilbert spaces.
An essential property of Hilbert space is that the distance of a point to a closed convex
set is alway attained.
Projection Theorem.
Let
X
be a Hilbert space,
K
a closed convex subset, and
x
2
X
.
Then there exists a unique
x
2
K
such that
k
x
;
x
k
= inf
y
2
K
k
x
;
y
k
:
Proof.
Translating, we may assume that
x
= 0, and so we must show that there is a unique
element of
K
of minimal norm. Let
d
= inf
y
2
K
k
y
k
and chose
x
n
2
K
with
k
x
n
k
!
d
.
Then the parallelogram law gives
x
n
;
x
m
2
2
= 12
k
x
n
k
2
+ 12
k
x
m
k
2
;
x
n
+
x
m
2
2
1
2
k
x
n
k
2
+ 12
k
x
m
k
2
;
d
2
where we have used convexity to infer that (
x
n
+
x
m
)
=
2
2
K
. Thus
x
n
is a Cauchy
sequence and so has a limit
x
, which must belong to
K
, since
K
is closed. Since the norm
is continuous,
k
x
k
= lim
n
k
x
n
k
=
d
.
For uniqueness, note that if
k
x
k
=
k
~
x
k
=
d
, then
k
(
x
+ ~
x
)
=
2
k
=
d
and the parallelogram
law gives
k
x
;
~
x
k
2
= 2
k
x
k
2
+ 2
k
~
x
k
2
;
k
x
+ ~
x
k
2
= 2
d
2
+ 2
d
2
;
4
d
2
= 0
:
The unique nearest element to
x
in
K
is often denoted
P
K
x
, and referred to as the
projection of
x
onto
K
. It satises
P
K
P
K
=
P
K
, the denition of a projection. This
terminology is especially used when
K
is a closed linear subspace of
X
, in which case
P
K
is a linear projection operator.
6
Projection and orthogonality.
If
S
is any subset of a Hilbert space
X
, let
S
?
=
f
x
2
X
:
h
xs
i
= 0 for all
s
2
S
g
:
Then
S
?
is a closed subspace of
X
. We obviously have
S
\
S
?
= 0 and
S
S
??
.
Claim: If
S
is a closed subspace of
X
,
x
2
X
, and
P
S
x
the projection of
x
onto
S
, then
x
;
P
S
x
2
S
?
. Indeed, if
s
2
S
is arbitrary and
t
2
R
, then
k
x
;
P
S
x
k
2
k
x
;
P
S
x
;
ts
k
2
=
k
x
;
P
S
x
k
2
;
2
t
(
x
;
P
S
xs
) +
t
2
k
s
k
2
so the quadratic polynomial on the right hand side has a minimum at
t
= 0. Setting the
derivative there to 0 gives (
x
;
P
S
xs
) = 0.
Thus we can write any
x
2
X
as
s
+
s
?
with
s
2
S
and
s
?
2
S
?
(namely
s
=
P
S
x
,
s
?
=
x
;
P
S
x
). Such a decomposition is certainly unique (if
s
+
s
?
were another one we
would have
s
;
s
=
s
?
;
s
?
2
S
\
S
?
= 0.) We clearly have
k
x
k
2
=
k
s
k
2
+
k
s
?
k
2
.
An immediate corollary is that
S
??
=
S
for
S
a closed subspace, since if
x
2
S
??
we
can write it as
s
+
s
?
, whence
s
?
2
S
?
\
S
??
= 0, i.e.,
x
2
S
. We thus see that the
decomposition
x
= (
I
;
P
S
)
x
+
P
S
x
is the (unique) decomposition of
x
into elements of
S
?
and
S
??
. Thus
P
S
?
=
I
;
P
S
. For
any subset
S
of
X
,
S
??
is the smallest closed subspace containing
S
.
Orthonormal sets and bases in Hilbert space.
Let
e
1
,
e
2
,
:::
,
e
N
be orthonormal elements of a Hilbert space
X
, and let
S
be their
span. Then
P
n
h
xe
n
i
e
n
2
S
and
x
;
P
n
h
xe
n
i
e
n
?
S
, so
P
n
h
xe
n
i
e
n
=
P
S
x
. But
k
P
n
h
xe
n
i
e
n
k
2
=
P
N
n
=1
h
xe
n
i
2
, so
N
X
n
=1
h
xe
n
i
2
k
x
k
2
(Bessel's inequality). Now let
E
be an orthonormal set of arbitrary cardinality. It follows
from Bessel's inequality that for
>
0 and
x
2
X
,
f
e
2
E
:
h
xe
i
g
is nite, and
hence that
f
e
2
E
:
h
xe
i
>
0
g
is countable. We can thus extend Bessel's inequality to
an arbitrary orthonormal set:
X
e
2E
h
xe
i
2
k
x
k
2
where the sum is just a countable sum of positive terms.
It is useful to extend the notion of sums over sets of arbitrary cardinality. If
E
is an
arbitary set and
f
:
E
!
X
a function mapping into a Hilbert space (or any normed linear
space or even TVS), we say
(
?
)
X
e
2E
f
(
e
) =
x
7
if the net
P
e
2F
f
(
e
), indexed by the
nite
subsets
F
of
E
, converges to
x
. In other words,
(
?
) holds if, for any neighborhood
U
of the origin, there is a nite set
F
0
E
such that
x
;
P
e
2F
f
(
e
)
2
U
whenever
F
is a nite subset of
E
containing
F
0
. In the case
E
=
N
,
this is equivalent to absolute convergence of a series. Note that if
P
e
2E
f
(
e
) converges,
then for all
there is a nite
F
0
such that if
F
1
and
F
2
are nite supersets of
F
0
, then
k
P
e
2F
1
f
(
e
)
;
P
e
2F
2
f
(
e
)
k
. It follows easily that each of the sets
f
e
2
E
j
k
f
(
e
)
k
1
=n
g
is nite, and hence,
f
(
e
) = 0 for all but countably many
e
2
E
.
Lemma.
If
E
is an orthonormal subset of a Hilbert space
X
and
x
2
X
, then
X
e
2E
h
xe
i
e
converges.
Proof.
We may order the elements
e
1
,
e
2
,
:::
of
E
for which
h
xe
i
6
= 0. Note that
k
N
X
n
=1
h
xe
n
i
e
n
k
2
=
N
X
n
=1
jh
xe
n
ij
2
k
x
k
2
:
This shows that the partial sums
s
N
=
P
N
n
=1
h
xe
n
i
e
n
form a Cauchy sequence, and so
converge to an element
P
1
n
=1
h
xe
n
i
e
n
of
X
. As an exercise in applying the denition,
we show that
P
e
2E
h
xe
i
e
=
P
1
n
=1
h
xe
n
i
e
n
. Given
>
0 pick
N
large enough that
P
1
n
=
N
+1
jh
xe
n
ij
2
<
. If
M > N
and
F
is a nite subset of
E
containing
e
1
,
:::
,
e
N
,
then
k
M
X
n
=1
h
xe
n
i
e
n
;
X
e
2F
h
xe
i
e
k
2
:
Letting
M
tend to innity,
k
1
X
n
=1
h
xe
n
i
e
n
;
X
e
2F
h
xe
i
e
k
2
as required.
Recall the proof that every vector space has a basis. We consider the set of all linearly
independent subsets of the vector space ordered by inclusions, and note that if we have a
totally ordered subset of this set, then the union is a linearly independent subset containing
all its members. Therefore Zorn's lemma implies that there exists a maximal linearly
independent set. It follows directly from the maximality that this set also spans, i.e., is a
basis. In an inner product space we can use the same argument to establish the existence
of an orthonormal basis.
In fact, while bases exist for all vector spaces, for innite dimensional spaces they are
dicult or impossible to construct and almost never used. Another notion of basis is much
8
more useful, namely one that uses the topology to allow innite linear combinations. To
distinguish ordinary bases from such notions, an ordinary basis is called a Hamel basis.
Here we describe an orthonormal Hilbert space basis. By denition this is a maximal
orthonormal set. By Zorn's lemma, any orthonormal set in a Hilbert space can be extended
to a basis, and so orthonormal bases exist. If
E
is such an orthonormal basis, and
x
2
X
,
then
x
=
X
e
2E
h
xe
i
e:
Indeed, we know that the sum on the right exists in
X
and it is easy to check that its inner
product with any
e
0
2
E
is
h
xe
0
i
. Thus
y
:=
x
;
P
e
2E
h
xe
i
e
is orthogonal to
E
, and if it
weren't zero, then we could adjoin
y=
k
y
k
to
E
to get a larger orthonormal set.
Thus we've shown that any element
x
of
X
can be expressed as
P
c
e
e
for some
c
e
2
R
,
all but countably many of which are 0. It is easily seen that this determines the
c
e
uniquely,
namely
c
e
=
h
xe
i
, and that
k
x
k
2
=
P
c
2
e
.
The notion of orthonormal basis allows us to dene a Hilbert space dimension, namely
the cardinality of any orthonormalbasis. To know that this is well dened, we need to check
that any two bases have the same cardinality. If one is nite, this is trivial. Otherwise,
let
E
and
F
be two innite orthonormal bases. For each 0
6
=
x
2
X
, the inner product
h
xe
i
6
= 0 for at least one
e
2
E
. Thus
F
e
2E
f
f
2
F
:
h
fe
i
6
= 0
g
i.e.,
F
is contained in the union of card
E
countable sets. Therefore card
F
@
0
card
E
=
card
E
.
If
S
is any set, we dene a particular Hilbert space
l
2
(
S
) as the set of functions
c
:
S
!
R
which are zero o a countable set and such that
P
s
2S
c
2
s
<
1
. We thus see that via a basis,
any Hilbert space can be put into a norm-preserving (and so inner-product-preserving)
linear bijection (or Hilbert space isomorphism) with an
l
2
(
S
). Thus, up to isomorphism,
there is just one Hilbert space for each cardinality. In particular there is only one innite
dimensional separable Hilbert space (up to isometry).
Example: The best known example of an orthonormal basis in an innite Hilbert space
is the set of functions
e
n
= exp(2
in
) which form a basis for complex-valued
L
2
(0
1]).
(They are obviously orthonormal, and they are a maximal orthonormal set by the Weier-
strass approximation Theorem. Thus an arbitrary
L
2
function has an
L
2
convergent
Fourier series
f
(
) =
1
X
n
=;1
^
f
(
n
)
e
2
in
with ^
f
(
n
) =
h
fe
n
i
=
R
1
0
f
(
)
e
;2
in
d
. Thus from the Hilbert space point of view, the
theory of Fourier series is rather simple. More dicult analysis comes in when we consider
convergence in other topologies (pointwise, uniform, almost everywhere,
L
p
,
C
1
,
:::
).
9
Schauder bases.
An orthonormal basis in a Hilbert space is a special example of a
Schauder basis. A subset
E
of a Banach space
X
is called a Schauder basis if for every
x
2
X
there is a unique function
c
:
E
!
R
such that
x
=
P
e
2E
c
e
e
. Schauder constructed
a useful Schauder basis for
C
(0
1]), and there is useful Schauder bases in many other
separable Banach spaces. In 1973 Per Eno settled a long-standing open question by
proving that there exist separable Banach spaces with no Schauder bases.
II. Linear Operators and Functionals
B
(
XY
) = bounded linear operators between normed linear spaces
X
and
Y
. A linear
operator is bounded i it is bounded on every ball i it is bounded on some ball i it is
continuous at every point i it is continuous at some point.
Theorem.
If
X
is a normed linear space and
Y
is a Banach space, then
B
(
XY
)
is a
Banach space with the norm
k
T
k
B
(
XY
)
= sup
06=
x
2
X
k
Tx
k
Y
k
x
k
X
:
Proof.
It is easy to check that
B
(
XY
) is a normed linear space, and the only issue is to
show that it is complete.
Suppose that
T
n
is a Cauchy sequence in
B
(
XY
). Then for each
x
2
X T
n
x
is Cauchy
in the complete space
Y
, so there exists
Tx
2
Y
with
T
n
x
!
Tx
. Clearly
T
:
X
!
Y
is
linear. Is it bounded? The real sequence
k
T
n
k
is Cauchy, hence bounded, say
k
T
n
k
K
.
It follows that
k
T
k
K
, and so
T
2
B
(
XY
). To conclude the proof, we need to show
that
k
T
n
;
T
k
!
0. We have
k
T
n
;
T
k
= sup
k
x
k1
k
T
n
x
;
Tx
k
= sup
k
x
k1
lim
m
!1
k
T
n
x
;
T
m
x
k
= sup
k
x
k1
limsup
m
!1
k
T
n
x
;
T
m
x
k
limsup
m
!1
k
T
n
;
T
m
k
:
Thus limsup
n
!1
k
T
n
;
T
k
= 0.
If
T
2
B
(
XY
) and
U
2
B
(
YZ
), then
UT
=
U
T
2
B
(
XZ
) and
k
UT
k
B
(
XZ
)
k
U
k
B
(
YZ
)
k
T
k
B
(
XY
)
. In particular,
B
(
X
) :=
B
(
XX
) is a
Banach algebra
, i.e., it has an
additional \multiplication" operation which makes it a non-commutative algebra, and the
multiplication is continuous.
The dual space is
X
:=
B
(
X
R
) (or
B
(
X
C
) for complex vector spaces). It is a Banach
space (whether
X
is or not).
10
The Hahn{Banach Theorem.
A key theorem for dealing with dual spaces of normed
linear spaces is the Hahn-Banach Theorem. It assures us that the dual space of a nontrivial
normed linear space is itself nontrivial. (Note: the norm is important for this. There exist
topological vector spaces, e.g.,
L
p
for 0
< p <
1, with no non-zero continuous linear
functionals.)
Hahn-Banach.
If
f
is a bounded linear functional on a subspace of a normed linear space,
then
f
extends to the whole space with preservation of norm.
Note that there are virtually no hypotheses beyond linearity and existence of a norm.
In fact for some purposes a weaker version is useful. For
X
a vector space, we say that
p
:
X
!
R
is sublinear if
p
(
x
+
y
)
p
(
x
) +
p
(
y
) and
p
(
x
) =
p
(
x
) for
xy
2
X
,
0.
Generalized Hahn-Banach.
Let
X
be a vector space,
p
:
X
!
R
a sublinear functional,
S
a subspace of
X
, and
f
:
S
!
R
a linear function satisfying
f
(
x
)
p
(
x
)
for all
x
2
S
,
then
f
can be extended to
X
so that the same inequality holds for all
x
2
X
.
Sketch.
It suces to extend
f
to the space spanned by
S
and one element
x
0
2
X
n
S
,
preserving the inequality, since if we can do that we can complete the proof with Zorn's
lemma.
We need to dene
f
(
x
0
) such that
f
(
tx
0
+
s
)
p
(
tx
0
+
s
) for all
t
2
R
,
s
2
S
. The case
t
= 0 is known and it is easy to use homogeneity to restrict to
t
=
1. Thus we need to
nd a value
f
(
x
0
)
2
R
such that
f
(
s
)
;
p
(
;
x
0
+
s
)
f
(
x
0
)
p
(
x
0
+
s
)
;
f
(
s
) for all
s
2
S:
Now it is easy to check that for any
s
1
,
s
2
2
S
,
f
(
s
1
)
;
p
(
;
x
0
+
s
1
)
p
(
x
0
+
s
2
)
;
f
(
s
2
),
and so such an
f
(
x
0
) exists.
Corollary.
If
X
is a normed linear space and
x
2
X
, then there exists
f
2
X
of norm
1 such that
f
(
x
) =
k
x
k
.
Corollary.
If
X
is a normed linear space,
S
a closed subspace, and
x
2
X
, then there
exists
f
2
X
of norm 1 such that
f
(
x
) =
k
x
k
X=S
.
Duality.
If
X
and
Y
are normed linear spaces and
T
:
X
!
Y
, then we get a natural
map
T
:
Y
!
X
by
T
f
(
x
) =
f
(
Tx
) for all
f
2
Y
,
x
2
X
. In particular, if
T
2
B
(
XY
), then
T
2
B
(
Y
X
). In fact,
k
T
k
B
(
Y
X
)
=
k
T
k
B
(
XY
)
. To prove
this, note that
j
T
f
(
x
)
j
=
j
f
(
Tx
)
j
k
f
kk
T
kk
x
k
. Therefore
k
T
f
k
k
f
kk
T
k
, so
T
is indeed bounded, with
k
T
k
k
T
k
. Also, given any
y
2
Y
, we can nd
g
2
Y
such that
j
g
(
y
)
j
=
k
y
k
,
k
g
k
= 1. Applying this with
y
=
Tx
(
x
2
X
arbitrary), gives
k
Tx
k
=
j
g
(
Tx
)
j
=
j
T
gx
j
k
T
kk
g
kk
x
k
=
k
T
kk
x
k
. This shows that
k
T
k
k
T
k
. Note
that if
T
2
B
(
XY
),
U
2
B
(
YZ
), then (
UT
)
=
T
U
.
If
X
is a Banach space and
S
a subset, let
S
a
=
f
f
2
X
j
f
(
s
) = 0
8
s
2
S
g
11
denote the
annihilator
of
S
. If
V
is a subset of
X
, we similarly set
a
V
=
f
x
2
X
j
f
(
x
) = 0
8
f
2
V
g
:
Note the distinction between
V
a
, which is a subset of
X
and
a
V
, which is a subset of
X
. All annihilators are closed subspaces.
It is easy to see that
S
T
X
implies that
T
a
S
a
, and
V
W
X
implies that
a
W
a
V
. Obviously
S
a
(
S
a
) if
S
X
and
V
(
a
V
)
a
if
V
X
. The Hahn-Banach
theorem implies that
S
=
a
(
S
a
) in case
S
is a closed subspace of
X
(but it can happen
that
V
(
(
a
V
)
a
for
V
a closed subspace of
X
. For
S
X
arbitrary,
a
(
S
a
) is the smallest
closed subspace of
X
containing the subset
S
, namely the closure of the span of
S
.
Now suppose that
T
:
X
!
Y
is a bounded linear operator between Banach spaces. Let
g
2
Y
. Then
g
(
Tx
) = 0
8
x
2
X
(
)
T
g
(
x
) = 0
8
x
2
X
(
)
T
g
= 0. I.e.,
R
(
T
)
a
=
N
(
T
)
:
Similarly, for
x
2
X
,
Tx
= 0
(
)
f
(
Tx
) = 0
8
f
2
Y
(
)
T
f
(
x
) = 0
8
f
2
Y
, or
a
R
(
T
) =
N
(
T
)
:
Taking annihilators gives two more results:
R
(
T
) =
a
N
(
T
)
R
(
T
)
N
(
T
)
a
:
In particular we see that
T
is injective i
T
has dense range" and
T
is injective if
T
has
dense range.
Note: we will have further results in this direction once we introduce the weak*-topology
on
X
. In particular, (
a
S
)
a
is the weak* closure of a subspace
S
of
X
and
T
is injective
i
T
has weak* dense range.
Dual of a subspace.
An important case is when
T
is the inclusion map
i
:
S
!
X
,
where
S
is a closed subspace of
X
. Then
r
=
i
:
X
!
S
is just the restriction map:
rf
(
s
) =
f
(
s
). Hahn-Banach tells us that
r
is surjective. Obviously
N
(
r
) =
S
a
. Thus we
have a canonical isomorphism
r
:
X
=S
a
!
S
. In fact, the Hahn-Banach theorem shows
that it is an isometry. Via this isometry one often identies
X
=S
a
with
S
.
Dual of a quotient space.
Next, consider the projection map
:
X
!
X=S
where
S
is
a closed subspace. We then have
: (
X=S
)
!
X
. Since
is surjective, this map is
injective. It is easy to see that the range is contained in
S
a
. In fact we now show that
maps (
X=S
)
onto
S
a
, hence provides a canonical isomorphism of
S
a
with (
X=S
)
. Indeed,
if
f
2
S
a
, then we have a splitting
f
=
g
with
g
2
(
X=S
)
(just dene
g
(
c
) =
f
(
x
)
where
x
is any element of the coset
c
). Thus
f
=
g
is indeed in the range of
. This
correspondence is again an isometry.
Dual of a Hilbert space.
The identication of dual spaces can be quite tricky. The case
of Hilbert spaces is easy.
12
Riesz Representation Theorem.
If
X
is a real Hilbert space, dene
j
:
X
!
X
by
j
y
(
x
) =
h
xy
i
. This map is a linear isometry of
X
onto
X
. For a complex Hilbert space
it is a conjugate linear isometry (it satises
j
y
=
j
y
).
Proof.
It is easy to see that
j
is an isometry of
X
into
X
and the main issue is to show
that any
f
2
X
can be written as
j
y
for some
y
. We may assume that
f
6
= 0, so
N
(
f
) is
a proper closed subspace of
X
. Let
y
0
2
N
(
f
)]
?
be of norm 1 and set
y
= (
fy
0
)
y
0
. For
all
x
2
X
, we clearly have that (
fy
0
)
x
;
(
fx
)
y
0
2
N
(
f
), so
j
y
(
x
) =
h
x
(
fy
0
)
y
0
i
=
h
(
fy
0
)
xy
0
i
=
h
(
fx
)
y
0
y
0
i
=
fx:
Via the map
j
we can dene an inner product on
X
, so it is again a Hilbert space.
Note that if
S
is a closed subspace of
X
, then
x
2
S
?
(
)
j
s
2
S
a
. The Riesz map
j
is sometimes used to identify
X
and
X
. Under this identication there is no distinction
between
S
?
and
S
a
.
Dual of
C
()
.
Note: there are two quite distinct theorems referred to as the Riesz
Representation Theorem. The proceeding is the easy one. The hard one identies the
dual of
C
() where is a compact subset of
R
n
(this can be generalized considerably). It
states that there is an isometry between
C
()
and the space of nite signed measures on
. (A nite signed measure is a set function of the form
=
1
;
2
where
i
is a nite
measure, and we view such as a functional on
C
(
X
) by
f
7!
R
f d
1
;
R
f d
2
.) This
is the real-valued case" in the complex-valued case the isometry is with complex measures
+
i
where
and
are nite signed measures.
Dual of
C
1
.
It is easy to deduce a representation for an arbitrary element of the dual
of, e.g.,
C
1
(0
1]). The map
f
7!
(
ff
0
) is an isometry of
C
1
onto a closed subspace of
C C
. By the Hahn-Banach Theorem, every element of (
C
1
)
extends to a functional on
C C
, which is easily seen to be of the form
(
fg
)
7!
Z
f d
+
Z
g d
where
and and
are signed measures ((
X Y
)
=
X
Y
with the obvious identi-
cations). Thus any continuous linear functional on
C
1
can be written
f
7!
Z
f d
+
Z
f
0
d:
In this representation the measures
and
are
not
unique.
Dual of
L
p
.
H#older's inequality states that if 1
p
1
,
q
=
p=
(
p
;
1), then
Z
fg
k
f
k
L
p
k
g
k
L
q
13
for all
f
2
L
p
,
g
2
L
q
. This shows that the map
g
7!
g
:
g
(
f
) =
Z
fg
maps
L
q
linearly into (
L
p
)
with
k
g
k
(
L
p
)
k
g
k
L
q
. The choice
f
= sign(
g
)
j
g
j
q
;1
shows
that there is equality. In fact, if
p <
1
,
is a linear isometry of
L
q
onto (
L
p
)
. For
p
=
1
it is an isometric injection, but not in general surjective. Thus the dual of
L
p
is
L
q
for
p
nite. The dual of
L
1
is a very big space, much bigger than
L
1
and rarely used.
Dual of
c
0
.
The above considerations apply to the dual of the sequence spaces
l
p
. Let
us now show that the dual of
c
0
is
l
1
. For any
c
= (
c
n
)
2
c
0
and
d
= (
d
n
)
2
l
1
, we dene
d
(
c
) =
P
c
n
d
n
. Clearly
j
d
(
c
)
j
sup
j
c
n
j
X
j
d
n
j
=
k
c
k
c
0
k
d
k
l
1
so
k
d
k
c
0
k
d
k
l
1
. Taking
c
n
=
sign(
d
n
)
n
N
0
n > N
we see that equality holds. Thus
:
l
1
!
c
0
is an isometric injection. We now show that it
is onto. Given
f
2
c
0
, dene
d
n
=
f
(
e
(
n
)
) where
e
(
n
)
is the usual unit sequence
e
(
n
)
m
=
mn
.
Let
s
n
= sign(
d
n
). Then
j
d
n
j
=
f
(
s
n
e
(
n
)
), so
N
X
n
=0
j
d
n
j
=
N
X
n
=0
f
(
s
n
e
(
n
)
) =
f
(
N
X
n
=0
s
n
e
(
n
)
)
k
f
k
:
Letting
N
!
1
we conclude that
d
2
l
1
. Now by construction
d
agrees with
f
on all
sequences with only nitely many nonzeros. But these are dense in
c
0
, so
f
=
d
.
The bidual.
If
X
is any normed linear space, we have a natural map
i
:
X
!
X
given
by
i
x
(
f
) =
f
(
x
)
x
2
X f
2
X
:
Clearly
k
i
x
k
k
f
k
and, by the Hahn-Banach theorem, equality holds. Thus
X
may be
identied as a subspace of the Banach space
X
. If we dene ~
X
as the closure of
i
(
X
) in
X
, then
X
is isometrically embedded as a dense subspace of the Banach space ~
X
. This
determines ~
X
up to isometry, and is what we dene as the completion of
X
. Thus any
normed linear space has a completion.
If
i
is onto, i.e., if
X
is isomorphic with
X
via this identication, we say that
X
is
reexive (which can only happen is
X
is complete). In particular, one can check that if
X
is a Hilbert space and
j
:
X
!
X
is the Riesz isomorphism, and
j
:
X
!
X
the Riesz
isomorphism for
X
, then
i
=
j
j
, so
X
is reexive.
Similarly, the canonical isometries of
L
q
onto (
L
p
)
and then
L
p
onto (
L
q
)
compose to
give the natural map of
L
p
into its bidual, and we conclude that
L
p
(and
l
p
) is reexive
for 1
< p <
1
. None of
L
1
,
l
1
,
L
1
,
l
1
,
c
0
, or
C
(
X
) are reexive.
If
X
is reexive, then
i
(
a
S
) =
S
a
for
S
X
. In other words, if we identify
X
and
X
, the distinction between the two kinds of annihilators disappears. In particular, for
reexive Banach spaces,
R
(
T
) =
N
(
T
)
a
and
T
is injective i
T
has dense range.
14
III. Fundamental Theorems
The Open Mapping Theorem and the Uniform Boundedness Principle join the Hahn-
Banach Theorem as the \big three". These two are fairly easy consequence of the Baire
Category Theorem.
Baire Category Theorem.
A complete metric space cannot be written as a countable
union of nowhere dense sets.
Sketch of proof.
If the statement were false, we could write
M
=
S
n
2N
F
n
with
F
n
a closed
subset which does not contain any open set. In particular,
F
0
is a proper closed set, so
there exists
x
0
2
M
,
0
2
(0
1) such that
E
(
x
0
0
)
M
n
F
0
. Since no ball is contained in
F
1
, there exists
x
1
2
E
(
x
0
0
=
2) and
1
2
(0
0
=
2) such that
E
(
x
1
1
)
M
n
F
1
. In this
way we get a nested sequence of balls such that the
n
th ball has radius at most 2
;
n
and is
disjoint from
F
n
. It is then easy to check that their centers form a Cauchy sequence and
its limit, which must exist by completeness, can't belong to any
F
n
.
The Open Mapping Theorem.
The Open Mapping Theorem follows from the Baire
Category Theorem and the following lemma.
Lemma.
Let
T
:
X
!
Y
be a bounded linear operator between Banach spaces. If
E
(0
Y
r
)
T
(
E
(0
X
1))
for some
r >
0
, then
E
(0
Y
r
)
T
(
B
(0
X
2))
.
Proof.
Let
U
=
T
(
E
(0
X
1)). Let
y
2
Y
,
k
y
k
< r
. There exists
y
0
2
U
with
k
y
;
y
0
k
r=
2.
By homogeneity, there exists
y
1
2
1
2
U
such that
k
y
;
y
0
;
y
1
k
r=
4,
y
2
2
1
4
U
such that
k
y
;
y
0
;
y
1
;
y
2
k
r=
8, etc. Take
x
n
2
1
2
n
U
such that
Tx
n
=
y
n
, and let
x
=
P
n
x
n
2
X
.
Then
k
x
k
2 and
Tx
=
P
y
n
=
y
.
Remark.
The same proof works to prove the statement with 2 replaced by any number
greater than 1. With a small additional argument, we can even replace it with 1 itself.
However the statement above is sucient for our purposes.
Open Mapping Theorem.
A bounded linear surjection between Banach spaces is open.
Proof.
It is enough to show that the image under
T
of a ball about 0 contains some ball
about 0. The sets
T
(
E
(0
n
)) cover
Y
, so the closure of one of them must contain an open
ball. By the previous result, we can dispense with the closure. The theorem easily follows
using the linearity of
T
.
There are two major corollaries of the Open Mapping Theorem, each of which is equiv-
alent to it.
Inverse Mapping Theorem or Banach's Theorem.
The inverse of an invertible
bounded linear operator between Banach spaces is continuous.
Proof.
The map is open, so its inverse is continuous.
15
Closed Graph Theorem.
A linear operator between Banach spaces is continuous i its
graph is closed.
A map between topological spaces is called closed if its graph is closed. In a general
Hausdor space, this is a weaker property than continuity, but the theorem asserts that
for linear operators between Banach spaces it is equivalent. The usefulness is that a direct
proof of continuity requires us to show that if
x
n
converges to
x
in
X
then
Tx
n
converges
to
Tx
. By using the closed graph theorem, we get to assume as well that
Tx
n
is converging
to some
y
in
Y
and we need only show that
y
=
Tx
.
Proof.
Let
G
=
f
(
xTx
)
j
x
2
X
g
denote the graph. Then the composition
G
X Y
!
X
is a bounded linear operator between Banach spaces given by (
xTx
)
7!
x
. It is
clearly one-to-one and onto, so the inverse is continuous by Banach's theorem. But the
composition
X
!
G
X Y
!
Y
is simply the
T
, so
T
is continuous.
Banach's theorem leads immediately to this useful characterization of closed imbeddings
of Banach spaces.
Theorem.
Let
T
:
X
!
Y
be a bounded linear map between Banach spaces. Then
T
is
one-to-one and has closed range if and only if there exists a positive number
c
such that
k
x
k
c
k
Tx
k
8
x
2
X:
Proof.
If the inequality holds, then
T
is clearly one-to-one, and if
Tx
n
is a Cauchy sequence
in
R
(
T
), then
x
n
is Cauchy, and hence
x
n
converges to some
x
, so
Tx
n
converges to
Tx
.
Thus the inequality implies that
R
(
T
) is closed.
For the other direction, suppose that
T
is one-to-one with closed range and consider the
map
T
;1
:
R
(
T
)
!
X
. It is the inverse of a bounded isomorphism, so is itself bounded.
The inequality follows immediately (with
c
the norm of
T
;1
).
Another useful corollary is that if a Banach space admits a second weaker or stronger
norm under which it is still Banach, then the two norms are equivalent. This follows
directly from Banach's theorem applied to the identity.
The Uniform Boundedness Principle.
The Uniform Boundedness Principle (or the
Banach-Steinhaus Theorem) also comes from the Baire Category Theorem.
Uniform Boundedness Principle.
Suppose that
X
and
Y
are Banach spaces and
S
B
(
XY
)
. If
sup
T
2S
k
T
(
x
)
k
Y
<
1
for all
x
2
X
, then
sup
T
2S
k
T
k
<
1
.
Proof.
One of the closed sets
f
x
j
j
f
n
(
x
)
j
N
8
n
g
must contain
E
(
x
0
r
) for some
x
0
2
X
,
r >
0. Then, if
k
x
k
< r
,
j
f
n
(
x
)
j
j
f
n
(
x
+
x
0
)
;
f
n
(
x
0
)
j
N
+ sup
j
f
n
(
x
0
)
j
=
M
, with
M
independent of
n
. This shows that the
k
f
n
k
are uniformly bounded (by
M=r
).
In words: a set of linear operators between Banach spaces which is bounded pointwise
is norm bounded.
16
The uniform boundedness theorem is often a way to generate counterexamples. A
typical example comes from the theory of Fourier series. For
f
:
R
!
C
continuous and
1-periodic the
n
th partial sum of the Fourier series for
f
is
f
n
(
s
) =
n
X
k
=;
n
Z
1
;1
f
(
t
)
e
;2
ikt
dte
2
iks
=
Z
1
;1
f
(
t
)
D
n
(
s
;
t
)
dt
where
D
n
(
x
) =
n
X
k
=;
n
e
2
ikx
:
Writing
z
=
e
2
is
, we have
D
n
(
s
) =
n
X
k
=;
n
z
k
=
z
;
n
z
2
n
;
1
z
;
1 =
z
n
+1
=
2
;
z
;
n
;1
=
2
z
1
=
2
;
z
;1
=
2
= sin(2
n
+ 1)
x
sin
x :
This is the Dirichlet kernel, a
C
1
periodic function. In particular, the value of the
n
th
partial sum of the Fourier series of
f
at 0 is
T
n
f
:=
f
n
(0) =
Z
1
;1
f
(
t
)
D
n
(
t
)
dt:
We think of
T
n
as a linear functional on the Banach space of 1-periodic continuous function
endowed with the sup norm. Clearly
k
T
n
k
C
n
:=
Z
1
;1
j
D
n
(
t
)
j
dt:
In fact this is an equality. If
g
(
t
) = sign
D
n
(
t
), then sup
j
g
j
= 1 and
T
n
g
=
C
n
. Actually,
g
is not continuous, so to make this argument correct, we approximate
g
by a continuous
functions, and thereby prove the norm equality. Now one can calculate that
R
j
D
n
j
!
1
as
n
!
1
. By the uniform boundedness theorem we may conclude that there exists a
continuous periodic function for whose Fourier series diverges at
t
= 0.
The Closed Range Theorem.
We now apply the Open Mapping Theorem to better
understand the relationship between
T
and
T
. The property of having a closed range
is signicant to the structure of an operator between Banach spaces. If
T
:
X
!
Y
has
a closed range
Z
(which is then itself a Banach space), then
T
factors as the projection
X
!
X=
N
(
T
), the isomorphism
X=
N
(
T
)
!
Z
, and the inclusion
Z
Y
. The Closed
Range Theorem says that
T
has a closed range if and only if
T
does.
Theorem.
Let
T
:
X
!
Y
be a bounded linear operator between Banach spaces. Then
T
is invertible i
T
is.
Proof.
If
S
=
T
;1
:
Y
!
X
exists, then
ST
=
I
X
and
TS
=
I
Y
, so
T
S
=
I
X
and
S
T
=
I
Y
, which shows that
T
is invertible.
17
Conversely, if
T
is invertible, then it is open, so there is a number
c >
0 such that
T
B
Y
(0
1) contains
B
X
(0
c
). Thus, for
x
2
X
k
Tx
k
= sup
f
2
B
Y
(0
1)
j
f
(
Tx
)
j
= sup
f
2
B
Y
(0
1)
j
(
T
f
)
x
j
sup
g
2
B
X
(0
c
)
j
g
(
x
)
j
=
c
k
x
k
:
The existence of
c >
0 such that
k
Tx
k
c
k
x
k
8
x
2
X
is equivalent to the statement that
T
is injective with closed range. But since
T
is injective,
T
has dense range.
Lemma.
Let
T
:
X
!
Y
be a linear map between Banach spaces such that
T
is an
injection with closed range. Then
T
is a surjection.
Proof.
Let
E
be the closed unit ball of
X
and
F
=
TE
. It suces to show that
F
contains
a ball around the origin, since then, by the lemma used to prove the Open Mapping
Theorem,
T
is onto.
There exists
c >
0 such that
k
T
f
k
c
k
f
k
for all
f
2
Y
. We shall show that
F
contains the ball of radius
c
around the origin in
Y
. Otherwise there exists
y
2
Y
,
k
y
k
c
,
y =
2
F
. Since
F
is a closed convex set we can nd a functional
f
2
Y
such that
j
f
(
Tx
)
j
for all
x
2
E
and
f
(
y
)
>
. Thus
k
f
k
> =c
, but
k
T
f
k
= sup
x
2
E
j
T
f
(
x
)
j
= sup
x
2
E
j
f
(
Tx
)
j
:
This is a contradiction.
Closed Range Theorem.
Let
T
:
X
!
Y
be a bounded linear operator between Banach
spaces. Then T has closed range if and only if
T
does.
Proof.
1)
R
(
T
) closed =
)
R
(
T
) closed.
Let
Z
=
R
(
T
). Then
T
:
X=
N
(
T
)
!
Z
is an isomorphism (Inverse Mapping Theorem).
The diagram
X
T
;
;
;
;
!
Y
?
?
y
x
?
?
X=
N
(
T
)
=
;
;
;
;
!
T
Z:
commutes. Taking adjoints,
X
T
;
;
;
;
Y
x
?
?
?
?
y
N
(
T
)
a
=
;
;
;
;
T
Y
=Z
a
:
This shows that
R
(
T
) =
N
(
T
)
a
.
18
2)
R
(
T
) closed =
)
R
(
T
) closed.
Let
Z
=
R
(
T
) (so
Z
a
=
N
(
T
)) and let
S
be the range restriction of
T
,
S
:
X
!
Z
.
The adjoint is
S
:
Y
=Z
a
!
X
, the lifting of
T
to
Y
=Z
a
. Now
R
(
S
) =
R
(
T
), is
closed, and
S
is an injection. We wish to show that
S
is onto
Z
. Thus the theorem follows
from the preceding lemma.
IV. Weak Topologies
The weak topology.
Let
X
be a Banach space. For each
f
2
X
the map
x
7!
j
f
(
x
)
j
is
a seminorm on
X
, and the set of all such seminorms, as
f
varies over
X
, is sucient by
the Hahn-Banach Theorem. Therefore we can endow
X
with a new TVS structure from
this family of seminorms. This is called the weak topology on
X
. In particular,
x
n
!
x
weakly (written
x
n w
;
!
x
) i
f
(
x
n
)
!
f
(
x
) for all
f
2
X
. Thus the weak topology is
weaker than the norm topology, but all the elements of
X
remain continuous when
X
is
endowed with the weak topology (it is by denition the weakest topology for which all the
elements of
X
are continuous).
Note that the open sets of the weak topology are rather big. If
U
is an weak neighbor-
hood of 0 in an innite dimensional Banach space then, by denition, there exists
>
0
and nitely many functionals
f
n
2
X
such that
f
x
j
j
f
n
(
x
)
j
<
8
N
g
is contained in
U
.
Thus
U
contains the innite dimensional closed subspace
N
(
f
1
)
\
:::
\
N
(
f
n
).
If
x
n w
;
!
x
weakly, then, viewing the
x
n
as linear functionals on
X
(via the canonical
embedding of
X
into
X
), we see that the sequence of real numbers obtained by applying
the
x
n
to any
f
2
X
is convergent and hence bounded uniformly in
n
. By the Uniform
Boundedness Principle, it follows that the
x
n
are bounded.
Theorem.
If a sequence of elements of a Banach space converges weakly, then the sequence
is norm bounded.
On the other hand, if the
x
n
are small in norm, then their weak limit is too.
Theorem.
If
x
n w
;
!
x
in some Banach space, then
k
x
k
liminf
n
!1
k
x
n
k
.
Proof.
Take
f
2
X
of norm 1 such that
f
(
x
) =
k
x
k
. Then
f
(
x
n
)
k
x
n
k
, and taking the
liminf gives the result.
For convex sets (in particular, for subspaces) weak closure coincides with norm closure:
Theorem.
1) The weak closure of a convex set is equal to its norm closure.
2) A convex set is weak closed i it is normed closed.
3) A convex set is weak dense i it is norm dense.
Proof.
The second and third statement obviously follow from the rst, and the weak closure
obviously contains the norm closure. So it remains to show that if
x
does not belong to
the norm closure of a convex set
E
, then there is a weak neighborhood of
x
which doesn't
intersect
E
. This follows immediately from the following convex separation theorem.
19
Theorem.
Let
E
be a nonempty closed convex subset of a Banach space
X
and
x
a point
in the complement of
E
. Then there exists
f
2
X
such that
f
(
x
)
<
inf
y
2
E
f
(
y
)
.
In fact we shall prove a stronger result:
Theorem.
Let
E
and
F
be disjoint, nonempty, convex subsets of a Banach space
X
with
F
open. Then there exists
f
2
X
such that
f
(
x
)
<
inf
y
2
E
f
(
y
)
for all
x
2
F
.
(The previous result follows by taking
F
to be any ball about
x
disjoint from
E
.)
Proof.
This is a consequence of the generalized Hahn-Banach Theorem. Pick
x
0
2
E
and
y
0
2
F
and set
z
0
=
x
0
;
y
0
and
G
=
F
;
E
+
z
0
. Then
G
is a convex open set containing
0 but not containing
z
0
. (The convexity of
G
follows directly from that of
E
and
F
" the
fact that
G
is open follows from the representation of
G
=
S
y
2
E
F
;
y
+
z
0
as a union of
open sets" obviously 0 =
y
0
;
x
0
+
z
0
2
G
, and
z
0
=
2
G
since
E
and
F
and disjoint.)
Since
G
is open and convex and contains 0, for each
x
2
X
,
f
t >
0
j
t
;1
x
2
G
g
is a
nonempty open semi-innite interval. Dene
p
(
x
)
2
0
1
) to be the left endpoint of this
interval. By denition
p
is positively homogeneous. Since
G
is convex,
t
;1
x
2
G
and
s
;1
y
2
G
imply that
(
t
+
s
)
;1
(
x
+
y
) =
t
s
+
tt
;1
x
+
s
s
+
ts
;1
y
2
G
whence
p
is subadditive. Thus
p
is a sublinear functional. Moreover,
G
=
f
x
2
X
j
p
(
x
)
<
1
g
.
Dene a linear functional
f
on
X
0
:=
R
z
0
by
f
(
z
0
) = 1. Then
f
(
tz
0
) =
t
tp
(
z
0
) =
p
(
tz
0
) for
t
0 and
f
(
tz
0
)
<
0
p
(
tz
0
) for
t <
0. Thus
f
is a linear functional on
X
0
satisfying
f
(
x
)
p
(
x
) there. By Hahn-Banach we can extend
f
to a linear functional on
X
satisfying the same inequality. This implies that
f
is bounded (by 1) on the open set
G
, so
f
belongs to
X
.
If
x
2
F
,
y
2
E
, then
x
;
y
+
z
0
2
G
, so
f
(
x
)
;
f
(
y
) + 1 =
f
(
x
;
y
+
z
0
)
<
1,
or
f
(
x
)
< f
(
y
). Therefore sup
x
2
F
f
(
x
)
inf
y
2
E
f
(
y
). Since
f
(
F
) is an open interval,
f
(~
x
)
<
sup
x
2
F
f
(
x
) for all ~
x
2
F
, and so we have the theorem.
The weak* topology.
On the dual space
X
we have two new topologies. We may endow
it with the weak topology, the weakest one such that all functionals in
X
are continuous,
or we may endow it with the topology generated by all the seminorms
f
7!
f
(
x
),
x
2
X
.
(This is obviously a sucient family of functionals.) The last is called the weak* topology
and is a weaker topology than the weak topology. If
X
is reexive, the weak and weak*
topologies coincide.
Examples of weak and weak* convergence: 1) Consider weak convergence in
L
p
()
where is a bounded subset of
R
n
. From the characterization of the dual of
L
p
we see
that
f
n w
;
;
!
f
in
L
1
=
)
f
n w
;
!
f
weak in
L
p
=
)
f
n w
;
!
f
weak in
L
q
20
whenever 1
q
p <
1
. In particular we claim that the complex exponentials
e
2
inx w
;
;
!
0 in
L
1
(0
1]) as
n
!
1
. This is simply the statement that
lim
n
!1
Z
1
0
g
(
x
)
e
2
inx
dx
= 0
for all
g
2
L
1
(0
1]), i.e., that the Fourier coecients of an
L
1
tend to 0, which is known as
the Riemann{Lebesgue Lemma. (Proof: certainly true if
g
is a trigonometric polynomial.
The trig polynomials are dense in
C
(0
1]) by the Weierstrass Approximation Theorem,
and
C
(0
1]) is dense in
L
1
(0
1]).) This is one common example of weak convergence
which is not norm convergence, namely weak vanishing by oscillation.
2) Another common situation is weak vanishing to innity. As a very simple example,
it is easy to see that the unit vectors in
l
p
converge weakly to zero for 1
< p <
1
(and
weak* in
l
1
, but not weakly in
l
1
). As a more interesting example, let
f
n
2
L
p
(
R
) be
a sequence of function which are uniformly bounded in
L
p
, and for which
f
n
j
;
nn
]
0.
Then we claim that
f
n
!
0 weakly in
L
p
if 1
< p <
1
. Thus we have to show that
lim
n
!1
Z
R
f
n
g dx
= 0
for all
g
2
L
q
. Let
S
n
=
f
x
2
R
j
j
x
j
n
g
. Then lim
n
R
S
n
j
g
j
q
dx
= 0 (by the dominated
convergence theorem). But
j
Z
R
f
n
g dx
j
=
j
Z
S
n
f
n
g dx
j
k
f
n
k
L
p
k
g
k
L
q
(
S
n
)
C
k
g
k
L
q
(
S
n
)
!
0
:
The same proof shows that if the
f
n
are uniformly bounded they tend to 0 in
L
1
weak*.
Note that the characteristic functions
nn
+1]
do
not
tend to zero weakly in
L
1
however.
3) Consider the measure
n
= 2
n
;1
=n
1
=n
]
dx
. Formally
n
tends to the delta function
0
as
n
!
1
. Using the weak* topology on
C
(
;
1
1]) this convergence becomes precise:
n w
;
;
!
0
.
Theorem (Alaoglu).
The unit ball in
X
is weak* compact.
Proof.
For
x
2
X
, let
I
x
=
f
t
2
R
:
j
t
j
k
x
k
g
, and set = $
x
2
X
I
x
. Recall that this
Cartesian product is nothing but the set of all functions
f
on
X
with
f
(
x
)
2
I
x
for all
x
. This set is endowed with the Cartesian product topology, namely the weakest topology
such that for all
x
2
X
, the functions
f
7!
f
(
x
) (from to
I
x
) are continuous. Tychono's
Theorem states that is compact with this topology.
Now let
E
be the unit ball in
X
. Then
E
and the topology thereby induced on
E
is precisely the weak* topology. Now for each pair
xy
2
X
and each
c
2
R
, dene
F
xy
(
f
) =
f
(
x
) +
f
(
y
)
;
f
(
x
+
y
),
G
xc
=
f
(
cx
)
;
cf
(
x
). These are continuous functions
on and
E
=
\
xy
2
X
F
;1
xy
(0)
\
\
x
2
X
c
2R
G
;1
cx
(0)
:
Thus
E
is a closed subset of a compact set, and therefore compact itself.
21
Corollary.
If
f
n w
;
;
!
f
in
X
, then
k
f
k
liminf
n
!1
k
f
n
k
X
.
Proof.
Let
C
= liminf
k
f
n
k
and let
>
0 be arbitrary. Then there exists a subsequence
(also denoted
f
n
) with
k
f
n
k
C
+
. The ball of radius
C
+
being weak* compact, and
so weak* closed,
k
f
k
< C
+
. Since
was arbitrary, this gives the result.
On
X
the weak* topology is that induced by the functionals in
X
.
Theorem.
The unit ball of
X
is weak* dense in the unit ball of
X
.
Proof.
Let
z
belong to the unit ball of
X
. We need to show that for any
f
1
::: f
n
2
X
of norm 1, and any
>
0, the set
f
w
2
X
j
j
(
w
;
z
)(
f
i
)
j
< i
= 1
::: n
g
contains a point of the unit ball of
X
. (Since any neighborhood of
z
contains a set of this
form.)
It is enough to show that there exists
y
2
X
with
k
y
k
<
1 +
such that (
y
;
z
)(
f
i
) = 0
for each
i
. Because then
y=
(1 +
) belongs to the closed unit ball of
X
, and
j
((1 +
)
;1
y
;
z
)(
f
i
)
j
=
j
((1 +
)
;1
y
;
y
)(
f
i
)
j
k
((1 +
)
;1
y
;
y
k
=
k
y
k
1 +
< :
Let
S
be the span of the
f
i
in
X
. Since
S
is nite dimensional the canonical map
X
!
S
is surjective. (This is equivalent to saying that if the null space of a linear
functional
g
contains the intersection of the null spaces of a nite set of linear functionals
g
i
, then
g
is a linear combination of the
g
i
, which is a simple, purely algebraic result.
Proof: The nullspace of the map (
g
1
::: g
n
) :
X
!
R
n
is contained in the nullspace of
g
,
so
g
=
T
(
g
1
::: g
n
) for some linear
T
:
R
n
!
R
.]) Consequently
X=
a
S
is isometrically
isomorphic to
S
.
In particular
z
j
S
concides with
y
+
a
S
for some
y
2
X
. Since
k
z
k
S
1, and we can
choose the coset representative
y
with
k
y
k
1 +
as claimed.
Corollary.
The closed unit ball of a Banach space
X
is weakly compact if and only if
X
is reexive.
Proof.
If the closed unit ball of
X
is weakly compact, then it is weak* compact when
viewed as a subset of
X
. Thus the ball is weak* closed, and so, by the previous theorem,
the embedding of the the unit ball of
X
contains the ball of
X
. It follows that the
embedding of
X
is all of
X
.
The reverse direction is immediate from the Alaoglu theorem.
22
V. Compact Operators and their Spectra
Hilbert{Schmidt operators.
Lemma.
Suppose that
f
e
i
g
and
f
~
e
i
g
are two orthonormal bases for a separable Hilbert
space
X
, and
T
2
B
(
X
)
. Then
X
ij
jh
Te
i
e
j
ij
2
=
X
ij
jh
T
~
e
i
~
e
j
ij
2
:
Proof.
For all
w
2
X
,
P
j
jh
we
j
ij
2
=
k
w
k
2
, so
X
ij
jh
Te
i
e
j
ij
2
=
X
i
k
Te
i
k
2
=
X
j
k
T
e
j
k
2
:
But
X
i
k
T
e
i
k
2
=
X
ij
jh
T
e
i
~
e
j
ij
2
=
X
j
k
T
~
e
j
k
2
:
Denition.
If
T
2
B
(
X
) dene
k
T
k
2
by
k
T
k
2
2
=
X
ij
jh
Te
i
e
j
ij
2
=
X
i
k
Te
i
k
2
where
f
e
i
g
is any orthonormal basis for
X
.
T
is called a Hilbert{Schmidt operator if
k
T
k
2
<
1
, and
k
T
k
2
is called the Hilbert{Schmidt norm of
T
.
We have just seen that if
T
is Hilbert{Schmidt, then so is
T
and their Hilbert{Schmidt
norms coincide.
Proposition.
k
T
k
k
T
k
2
.
Proof.
Let
x
=
P
c
i
e
i
be an arbitrary element of
X
. Then
k
Tx
k
2
=
X
i
X
j
c
j
h
Te
j
e
i
i
2
:
By Cauchy{Schwarz
j
X
j
c
j
h
Te
j
e
i
ij
2
X
j
c
2
j
X
j
jh
Te
j
e
i
ij
2
=
k
x
k
2
X
j
jh
Te
j
e
i
ij
2
:
Summing on
i
gives the result.
23
Proposition.
Let
be an open subset of
R
n
and
K
2
L
2
( )
. Dene
T
K
u
(
x
) =
Z
K
(
xy
)
u
(
y
)
dy
for all
x
2
:
Then
T
K
denes a Hilbert{Schmidt operator on
L
2
()
and
k
T
K
k
2
=
k
K
k
L
2
.
Proof.
For
x
2
, set
K
x
(
y
) =
K
(
xy
). By Fubini's theorem,
K
x
2
L
2
() for almost all
x
2
, and
k
K
k
2
L
2
=
Z
k
K
x
k
2
dx:
Now,
T
K
u
(
x
) =
h
K
x
u
i
, so, if
f
e
i
g
is an orthonormal basis, then
k
T
K
k
2
2
=
X
i
k
T
K
e
i
k
2
=
X
i
Z
j
(
T
K
e
i
)(
x
)
j
2
dx
=
X
i
Z
jh
K
x
e
i
ij
2
dx
=
Z
X
i
jh
K
x
e
i
ij
2
dx
=
Z
k
K
x
k
2
dx
=
k
K
k
2
L
2
:
Compact operators.
Denition.
A bounded linear operator between Banach spaces is called compact if it
maps the unit ball (and therefore every bounded set) to a precompact set.
For example, if
T
has nite rank (dim
R
(
T
)
<
1
), then
T
is compact.
Recall the following characterization of precompact sets in a metric space, which is often
useful.
Proposition.
Let
M
be a metric space. Then the following are equivalent:
(1)
M
is precompact.
(2)
For all
>
0
there exist nitely many sets of diameter at most
which cover
M
.
(3)
Every sequence contains a Cauchy subsequence.
Sketch of proof.
(1) =
)
(2) and (3) =
)
(1) are easy. For (2) =
)
(3) use a Cantor
diagonalization argument to extract a Cauchy subsequence.
Theorem.
Let
X
and
Y
be Banach spaces and
B
c
(
XY
)
the space of compact linear
operators from
X
to
Y
. Then
B
c
(
XY
)
is a closed subspace of
B
(
XY
)
.
Proof.
Suppose
T
n
2
B
c
(
XY
),
T
2
B
(
XY
),
k
T
n
;
T
k
!
0. We must show that
T
is
compact. Thus we must show that
T
(
E
) is precompact in
Y
, where
E
is the unit ball in
X
. For this, it is enough to show that for any
>
0 there are nitely many balls
U
i
of
radius
in
Y
such that
T
(
E
)
i
U
i
:
24
Choose
n
large enough that
k
T
;
T
n
k
=
2, and let
V
1
V
2
::: V
n
be nitely many balls
of radius
=
2 which cover
T
n
E
. For each
i
let
U
i
be the ball of radius
with the same
center as
V
i
.
It follows that closure of the nite rank operators in
B
(
XY
) is contained in
B
c
(
XY
).
In general, this may be a strict inclusion, but if
Y
is a Hilbert space, it is equality. To
prove this, choose an orthonormal basis for
Y
, and consider the nite rank operators of the
form
PT
where
P
is the orthogonal projection of
Y
onto the span of nitely many basis
elements. Using the fact that
TE
is compact (
E
the unit ball of
X
) and that
k
P
k
= 1, we
can nd for any
>
0, an operator
P
of this form with sup
x
2
E
k
(
PT
;
T
)
x
k
.
The next result is obvious but useful.
Theorem.
Let
X
and
Y
be Banach spaces and
T
2
B
c
(
XY
)
. If
Z
is another Banach
space and
S
2
B
(
YZ
)
then
ST
is compact. If
S
2
B
(
ZX
)
, then
TS
is compact. If
X
=
Y
, then
B
c
(
X
) :=
B
c
(
XX
)
is a two-sided ideal in
B
(
X
)
.
Theorem.
Let
X
and
Y
be Banach spaces and
T
2
B
(
XY
)
. Then
T
is compact if and
only if
T
is compact.
Proof.
Let
E
be the unit ball in
X
and
F
the unit ball in
Y
. Suppose that
T
is compact.
Given
>
0 we must exhibit nitely many sets of diameter at most
which cover
T
F
.
First choose
m
sets of diameter at most
=
3 which cover
TE
, and let
Tx
i
belong to the
i
th
set. Also, let
I
1
::: I
n
be
n
intervals of length
=
3 which cover the interval
;k
T
k
k
T
k
].
For any
m
-tuple (
j
1
::: j
m
) of integers with 1
j
i
n
we dene the set
f
f
2
F
j
f
(
Tx
i
)
2
I
j
i
i
= 1
::: m
g
:
These sets clearly cover
F
, so there images under
T
cover
T
F
, so it suces to show that
the images have diameter at most
. Indeed, if
f
and
g
belong to the set above, and
x
is any
element of
E
, pick
i
such that
k
Tx
;
Tx
i
k
=
3. We know that
k
f
(
Tx
i
)
;
g
(
Tx
i
)
k
=
3.
Thus
j
(
T
f
;
T
g
)(
x
)
j
=
j
(
f
;
g
)(
Tx
)
j
j
f
(
Tx
)
;
f
(
Tx
i
)
j
+
j
g
(
Tx
)
;
g
(
Tx
i
)
j
+
j
(
f
;
g
)(
Tx
i
)
j
:
This shows that
T
compact =
)
T
compact. Conversely, suppose that
T
:
Y
!
X
is compact. Then
T
maps the unit ball of
X
into a precompact subset of
Y
. But the
unit ball of
X
may be viewed as a subset of the unit ball of its bidual, and the restriction
of
T
to the unit ball of
X
coincides with
T
there. Thus
T
maps the unit ball of
X
to a
precompact set.
Theorem.
If
T
is a compact operator from a Banach space to itself, then
N
(
1
;
T
)
is
nite dimensional and
R
(
1
;
T
)
is closed.
Proof.
T
is a compact operator that restricts to the identity on
N
(
1
;
T
). Hence the
closed unit ball in
N
(
1
;
T
) is compact, whence the dimension of
N
(
1
;
T
) is nite.
25
Now any nite dimensional subspace is complemented (see below), so there exists a
closed subspace
M
of
X
such that
N
(
1
;
T
) +
M
=
X
and
N
(
1
;
T
)
\
M
= 0. Let
S
= (
1
;
T
)
j
M
, so
S
is injective and
R
(
S
) =
R
(
1
;
T
). We will show that for some
c >
0,
k
Sx
k
c
k
x
k
for all
x
2
M
, which will imply that
R
(
S
) is closed. If the desired
inequality doesn't hold for any
c >
0, we can choose
x
n
2
M
of norm 1 with
Sx
n
!
0.
After passing to a subsequence we may arrange that also
Tx
n
converges to some
x
0
2
X
.
It follows that
x
n
!
x
0
, so
x
0
2
M
and
Sx
0
= 0. Therefore
x
0
= 0 which is impossible
(since
k
x
n
k
= 1).
In the proof we used the rst part of the following lemma. We say that a closed
subspace
N
is complemented in a Banach space
X
if there is another closed subspace such
that
M
N
=
X
.
Lemma.
A nite dimensional or nite codimensional closed subspace of a Banach space
is complemented.
Proof.
If
M
is a nite dimensional subspace, choose a basis
x
1
::: x
n
and dene a linear
functionals
i
:
M
!
R
by
i
(
x
j
) =
ij
. Extend the
i
to be bounded linear functionals
on
X
. Then we can take
N
=
N
(
1
)
\
:::
\
N
(
n
).
If
M
is nite codimensional, we can take
N
to be the span of a set of nonzero coset
representatives.
A simple generalization of the theorem will be useful when we study the spectrum of
compact operators.
Theorem.
If
T
is a compact operator from a Banach space to itself,
a non-zero complex
number, and
n
a positive integer, then
N
(
1
;
T
)
n
]
is nite dimensional and
R
(
1
;
T
)
n
]
is closed.
Proof.
Expanding we see that (
1
;
T
)
n
=
n
(
1
;
S
) for some compact operator
S
, so the
result reduces to the previous one.
We close the section with a good source of examples of compact operators, which in-
cludes, for example, any matrix operator on
l
2
for which the matrix entries are square-
summable.
Theorem.
A Hilbert{Schmidt operator on a separable Hilbert space is compact.
Proof.
Let
f
e
i
g
be an orthonormal basis. Let
T
be a given Hilbert{Schmidt operator
(so
P
i
k
Te
i
k
2
<
1
). Dene
T
n
by
T
n
e
i
=
Te
i
if
i
n
,
T
n
e
i
= 0 otherwise. Then
k
T
;
T
n
k
k
T
;
T
n
k
2
=
P
1
i
=
n
+1
k
Te
i
k
2
!
0.
26
Spectral Theorem for compact self-adjoint operators.
In this section we assume
that
X
is a
complex
Hilbert space. If
T
:
X
!
X
is a bounded linear operator, we view
T
as a map from
X
!
X
via the Riesz isometry between
X
and
X
. That is,
T
is dened
by
h
T
xy
i
=
h
xTy
i
:
In the case of a nite dimensional complex Hilbert space,
T
can be represented by a
complex square matrix, and
T
is represented by its Hermitian transpose.
Recall that a Hermitian symmetric matrix has real eigenvalues and an orthonormal
basis of eigenvectors. For a self-adjoint operator on a Hilbert space, it is easy to see that
any eigenvalues are real, and that eigenvectors corresponding to distinct eigenvalues are
orthogonal. However there may not exist an orthonormal basis of eigenvectors, or even any
nonzero eigenvectors at all. For example, let
X
=
L
2
(0
1]), and dene
Tu
(
x
) =
xu
(
x
) for
u
2
L
2
. Then
T
is clearly bounded and self-adjoint. But it is easy to see that
T
does not
have any eigenvalues.
Spectral Theorem for Compact Self-Adjoint Operators in Hilbert Space.
Let
T
be a compact self-adjoint operator in a Hilbert space
X
. Then there is an orthonormal
basis consisting of eigenvectors of
T
.
Before proceeding to the proof we prove one lemma.
Lemma.
If
T
is a self-adjoint operator on a Hilbert space, then
k
T
k
= sup
k
x
k1
jh
Txx
ij
:
Proof.
Let
= sup
k
x
k1
jh
Txx
ij
. It is enough to prove that
jh
Txy
ij
k
x
kk
y
k
for all
x
and
y
. We can obviously assume that
x
and
y
are nonzero. Moreover, we may
multiply
y
by a complex number of modulus one, so we can assume that
h
Txy
i
0. Then
h
T
(
x
+
y
)
x
+
y
i
;
h
T
(
x
;
y
)
x
;
y
i
= 4Re
h
Txy
i
= 4
jh
Txy
ij
:
so
jh
Txy
ij
4(
k
x
+
y
k
2
+
k
x
;
y
k
2
) =
2(
k
x
k
2
+
k
y
k
2
)
:
Now apply this result with
x
replaced by
p
k
y
k
=
k
x
k
x
and
y
replaced by
p
k
x
k
=
k
y
k
y
.
Proof of spectral theorem for compact self-adjoint operators.
We rst show that
T
has a
nonzero eigenvector. If
T
= 0, this is obvious, so we assume that
T
6
= 0. Choose a sequence
x
n
2
X
with
k
x
n
k
= 1 so that
jh
Tx
n
x
n
ij
!
k
T
k
. Since
T
is self-adjoint,
h
Tx
n
x
n
i
2
R
,
so we may pass to a subsequence (still denoted
x
n
), for which
h
Tx
n
x
n
i
!
=
k
T
k
.
27
Since
T
is compact we may pass to a further subsequence and assume that
Tx
n
!
y
2
X
.
Note that
k
y
k
j
j
>
0.
Using the fact that
T
is self-adjoint and
is real, we get
k
Tx
n
;
x
n
k
2
=
k
Tx
n
k
2
;
2
h
Tx
n
x
n
i
+
2
k
x
n
k
2
2
k
T
k
2
;
2
h
Tx
n
x
n
i
!
2
k
T
k
2
;
2
2
= 0
:
Since
Tx
n
!
y
we infer that
x
n
!
y
as well, or
x
n
!
y=
6
= 0. Applying
T
we have
Ty=
=
y
, so
is indeed a nonzero eigenvalue.
To complete the proof, consider the set of all orthonormal subsets of
X
consisting of
eigenvectors of
T
. By Zorn's lemma, it has a maximal element
S
. Let
W
be the closure
of the span of
S
. Clearly
TW
W
, and it follows directly (since
T
is self-adjoint), that
TW
?
W
?
. Therefore
T
restricts to a self-adjoint operator on
W
?
and thus, unless
W
?
= 0,
T
has an eigenvector in
W
?
. But this clearly contradicts the maximality of
S
(since we can adjoin this element to
S
to get a larger orthonormal set of eigenvectors).
Thus
W
?
= 0, and
S
is an orthonormal basis.
The following structure result on the set of eigenvalues is generally considered part of
the spectral theorem as well.
Theorem.
If
T
is a compact self-adjoint operator on a Hilbert space, then the set of
nonzero eigenvalues of
T
is either a nite set or a sequence approaching
0
and the corre-
sponding eigenspaces are all nite dimensional.
Remark.
0 may or may not be an eigenvalue, and its eigenspace may or may not be nite.
Proof.
Let
e
i
be an orthonormal basis of eigenvectors, with
Te
i
=
i
e
i
. Here
i
ranges over
some index set
I
. It suces to show that
S
=
f
i
2
I
j
j
i
j
g
is nite for all
>
0. Then
if
ij
2
I
k
Te
i
;
Te
j
k
2
=
k
i
e
j
;
j
e
j
k
2
=
j
i
j
2
+
j
j
j
2
so if
ij
2
S
, then
k
Te
i
;
Te
j
k
2
2
2
. If
S
were innite, we could then choose a sequence
of unit elements in
X
whose image under
T
has no convergent subsequence, which violates
the compactness of
T
.
Suppose, for concreteness, that
X
is an innite dimensional separable Hilbert space and
that
f
e
n
g
n
2N
is an orthonormal basis adapted to a compact self-adjoint operator
T
on
X
.
Then the map
U
:
X
!
l
2
given by
U
(
X
n
c
n
e
n
) = (
c
0
c
1
:::
)
is an isometric isomorphism. Moreover, when we use this map to transfer the action of
T
to
l
2
, i.e., when we consider the operator
UTU
;1
on
l
2
, we see that this operator is simply
multiplication by the bounded sequence (
0
1
:::
)
2
l
1
. Thus the spectral theorem says
that every compact self-adjoint
T
is unitarily equivalent to a multiplication operator on
l
2
.
(An isometric isomorphism of Hilbert spaces is also called a unitary operator. Note that
it is characterized by the property
U
=
U
;1
.)
A useful extension is the spectral theorem for commuting self-adjoint compact operators.
28
Theorem.
If
T
and
S
are self-adjoint compact operators in a Hilbert space
H
and
TS
=
ST
, then there is an orthonormal basis of
X
whose elements are eigenvectors for both
S
and
T
.
Proof.
For an eigenvalue
of
T
, let
X
denote the corresponding eigenspace of
T
. If
x
2
X
, then
TSx
=
STx
=
Sx
, so
Sx
2
X
. Thus
S
restricts to a self-adjoint
operator on
X
, and so there is an orthonormal basis of
S
{eigenvectors for
X
. These are
T
{eigenvectors as well. Taking the union over all the eigenvalues
of
T
completes the
construction.
Let
T
1
and
T
2
be any two self-adjoint operators and set
T
=
T
1
+
iT
2
. Then
T
1
=
(
T
+
T
)
=
2 and
T
2
= (
T
;
T
)
=
(2
i
). Conversely, if
T
is any element of
B
(
X
), then we
can dene two self-adjoint operators from these formulas and have
T
=
T
1
+
iT
2
. Now
suppose that
T
is compact and also
normal
, i.e., that
T
and
T
commute. Then
T
1
and
T
2
are compact and commute, and hence we have an orthonormal basis whose elements
are eigenvectors for both
T
1
and
T
2
, and hence for
T
. Since the real and imaginary parts
of the eigenvalues are the eigenvalues of
T
1
and
T
2
, we again see that the eigenvalues form
a sequence tending to zero and all have nite dimensional eigenspaces.
We have thus shown that a compact normal operator admits an orthonormal basis
of eigenvectors. Conversely, if
f
e
i
g
is an orthonormal basis of eigenvectors of
T
, then
h
T
e
i
e
j
i
= 0 if
i
6
=
j
, which implies that each
e
i
is also an eigenvector for
T
. Thus
T
Te
i
=
TT
e
i
for all
i
, and it follows easily that
T
is normal. We have thus shown:
Spectral Theorem for compact normal operators.
Let
T
be a compact operator on a
Hilbert space
X
. Then there exists an orthonormal basis for
X
consisting of eigenvectors of
T
if and only if
T
is normal. In this case, the set of nonzero eigenvalues form a nite set or
a sequence tending to zero and the eigenspaces corresponding to the nonzero eigenvalues are
nite dimensional. The eigenvalues are all real if and only if the operator is self-adjoint.
The spectrum of a general compact operator.
In this section we derive the structure
of the spectrum of a compact operator (not necessarily self-adjoint or normal) on a complex
Banach space
X
.
For any operator
T
on a complex Banach space, the
resolvent set
of
T
,
(
T
) consists
of those
2
C
such that
T
;
1
is invertible, and the spectrum
(
T
) is the complement.
If
2
(
T
), then
T
;
1
may fail to be invertible in several ways. (1) It may be that
N
(
T
;
1
)
6
= 0, i.e., that
is an eigenvalue of
T
. In this case we say that
belongs to the
point spectrum
of
T
, denoted
p
(
T
). (2) If
T
;
1
is injective, it may be that its range is
dense but not closed in
X
. In this case we say that
belongs to the
continuous spectrum
of
T
,
c
(
T
). Or (3) it may be that
T
;
1
is injective but that its range is not even dense
in
X
. This is the
residual spectrum
,
r
(
T
). Clearly we have a decomposition of
C
into the
disjoint sets
(
T
),
p
(
T
),
c
(
T
), and
r
(
T
). As an example of the continuous spectrum,
consider the operator
Te
n
=
n
e
n
where the
e
n
form an orthonormal basis of a Hilbert
space and the
n
form a positive sequence tending to 0. Then 0
2
c
(
T
). If
Te
n
=
n
e
n
+1
,
0
2
r
(
T
).
29
Now if
T
is compact and
X
is innite dimensional, then 0
2
(
T
) (since if
T
were
invertible, the image of the unit ball would contain an open set, and so couldn't be pre-
compact). From the examples just given, we see that 0 may belong to the point spectrum,
the continuous spectrum, or the residual spectrum. However, we shall show that all other
elements of the spectrum are eigenvalues, i.e., that
(
T
) =
p
(
T
)
f
0
g
, and that, as in the
normal case, the point spectrum consists of a nite set or a sequence approaching zero.
The structure of the spectrum of a compact operator will be deduced from two lemmas.
The rst is purely algebraic. To state it we need some terminology: consider a linear
operator
T
from a vector space
X
to itself, and consider the chains of subspaces
0 =
N
(
1
)
N
(
T
)
N
(
T
2
)
N
(
T
3
)
:
Either this chain is strictly increasing forever, or there is a least
n
0 such that
N
(
T
n
) =
N
(
T
n
+1
), in which case only the rst
n
spaces are distinct and all the others equal the
n
th one. In the latter case we say that the kernel chain for
T
stabilizes
at
n
. In particular,
the kernel chain stabilizes at 0 i
T
is injective. Similarly we may consider the chain
X
=
R
(
1
)
R
(
T
)
R
(
T
2
)
R
(
T
3
)
and dene what it means for the range chain to stabilize at
n >
0. (So the range stabilizes
at 0 i
T
is surjective.) It could happen that neither or only one of these chains stabilizes.
However:
Lemma.
Let
T
be a linear operator from a vector space
X
to itself. If the kernel chain
stabilizes at
m
and the range chain stabilizes at
n
, then
m
=
n
and
X
decomposes as the
direct sum of
N
(
T
n
)
and
R
(
T
n
)
.
Proof.
Suppose
m
were less than
n
. Since the range chain stabilizes at
n
, there exists
x
with
T
n
;1
x =
2
R
(
T
n
), and then there exists
y
such that
T
n
+1
y
=
T
n
x
. Thus
x
;
Ty
2
N
(
T
n
),
and, since kernel chain stabilizes at
m < n
,
N
(
T
n
) =
N
(
T
n
;1
). Thus
T
n
;1
x
=
T
n
y
, a
contradiction. Thus
m
n
. A similar argument, left to the reader, establishes the reverse
inequality.
Now if
T
n
x
2
N
(
T
n
), then
T
2
n
x
= 0, whence
T
n
x
= 0. Thus
N
(
T
n
)
\
R
(
T
n
) = 0.
Given
x
, let
T
2
n
y
=
T
n
x
, so
x
decomposes as
T
n
y
2
R
(
T
n
) and
x
;
T
n
y
2
N
(
T
n
).
The second lemma brings in the topology of compact operators.
Lemma.
Let
T
:
X
!
X
be a compact operator on a Banach space and
1
2
:::
a
sequence of complex numbers with
inf
j
n
j
>
0
. Then the following is impossible: There
exists a strictly increasing chain of closed subspaces
S
1
S
2
with
(
n
1
;
T
)
S
n
S
n
;1
for all
n
.
Proof.
Suppose such a chain exists. Note that each
TS
n
S
n
for each
n
. Since
S
n
=S
n
;1
contains an element of norm 1, we may choose
y
n
2
S
n
with
k
y
n
k
2, dist(
y
n
S
n
;1
) = 1.
If
m < n
, then
z
:=
Ty
m
;
(
n
1
;
T
)
y
n
n
2
S
n
;1
30
and
k
Ty
m
;
Ty
n
k
=
j
n
jk
y
n
;
z
n
k
j
n
j
:
This implies that the sequence (
Ty
n
) has no Cauchy subsequence, which contradicts the
compactness of
T
.
We are now ready to prove the result quoted at the beginning of the subsection.
Theorem.
Let
T
be a compact operator on a Banach space
X
. Then any nonzero ele-
ment of the spectrum of
T
is an eigenvalue. Moreover
(
T
)
is either nite or a sequence
approaching zero.
Proof.
Consider the subspace chains
N
(
1
;
T
)
n
] and
R
(
1
;
T
)
n
] (these are closed
subspaces by a previous result). Clearly
1
;
T
maps
N
(
1
;
T
)
n
] into
N
(
1
;
T
)
n
;1
],
so the previous lemma implies that the kernel chain stabilizes, say at
n
. Now
R
(
1
;
T
)
n
] =
a
N
(
1
;
T
)
n
] (since the range is closed), and since these last stabilize, the range chain
stabilizes as well.
Thus we have
X
=
N
(
1
;
T
)
n
]
R
(
1
;
T
)
n
]. Thus
R
(
1
;
T
)
6
=
X
=
)
R
(
1
;
T
)
n
6
=
X
=
)
N
(
1
;
T
)
n
6
= 0 =
)
N
(
1
;
T
)
6
= 0
:
In other words
2
(
T
) =
)
2
p
(
T
).
Finally we prove the last statement. If it were false we could nd a sequence of
eigenvalues
n
with inf
j
n
j
>
0. Let
x
1
x
2
:::
be corresponding nonzero eigenvectors
and set
S
n
= span
x
1
::: x
n
]. These form a strictly increasing chain of subspaces (re-
call that eigenvectors corresponding to distinct eigenvalues are linearly independent) and
(
n
1
;
T
)
S
n
S
n
;1
, which contradicts the lemma.
The above reasoning also gives us the Fredholm alternative:
Theorem.
Let
T
be a compact operator on a Banach space
X
and
a nonzero complex
number. Then either (1)
1
;
T
is an isomorphism, or (2) it is neither injective nor
surjective.
Proof.
Since the kernel chain and range chain for
S
=
1
;
T
stabilize, either they both
stabilize at 0, in which case
S
is injective and surjective, or neither does, in which case it
is neither.
We close this section with a result which is fundamental to the study of Fredholm
operators.
Theorem.
Let
T
be a compact operator on a Banach space
X
and
a nonzero complex
number. Then
dim
N
(
1
;
T
) = dim
N
(
1
;
T
) = codim
R
(
1
;
T
) = codim
R
(
1
;
T
)
:
31
Proof.
Let
S
=
1
;
T
. Since
R
(
S
) is closed
X=
R
(
S
)]
=
R
(
S
)
a
=
N
(
S
)
:
Thus
X=
R
(
S
)]
is nite dimensional, so
X=
R
(
S
) is nite dimensional, and these two
spaces are of the same dimension. Thus codim
R
(
S
) = dim
N
(
S
).
For a general operator
S
we only have
R
(
S
)
N
(
S
)
a
, but, as we now show, when
R
(
S
) is closed,
R
(
S
) =
N
(
S
)
a
. Indeed,
S
induces an isomorphism of
X=
N
(
S
) onto
R
(
S
), and for any
f
2
N
(
S
)
a
,
f
induces a map
X=
N
(
S
) to
R
. It follows that
f
=
gS
for some bounded linear operator
g
on
R
(
S
), which can be extended to an element of
X
by Hahn-Banach. But
f
=
gS
simply means that
f
=
S
g
, showing that
N
(
S
)
a
R
(
S
)
(and so equality holds) as claimed.
Thus
N
(
S
)
=
X
=
N
(
S
)
a
=
X
=
R
(
S
)
so codim
R
(
S
) = dim
N
(
S
)
= dim
N
(
S
).
We complete the theorem by showing that dim
N
(
S
)
codim
R
(
S
) and dim
N
(
S
)
codim
R
(
S
). Indeed, since
R
(
S
) is closed with nite codimension, it is complemented by a
nite dimensional space
M
(with dim
M
= codim
R
(
S
). Since
N
(
S
) is nite dimensional,
it is complemented by a space
N
. Let
P
denote the projection of
X
onto
N
(
S
) which is
a bounded map which to the identity on
N
(
S
) and to zero on
N
. Now if codim
R
(
S
)
<
dim
N
(
S
), then there is a linear map of
N
(
S
) onto
M
which is not injective. But then
T
;
fP
is a compact operator and
1
;
T
+
fP
is easily seen to be surjective. By the Fredholm
alternative, it is injective as well. This implies that
f
is injective, a contradiction. We
have thus shown that dim
N
(
S
)
codim
R
(
S
). Since
T
is compact, the same argument
shows that dim
N
(
S
)
codim
R
(
S
). This completes the proof.
VI. Introduction to General Spectral Theory
In this section we skim the surface of the spectral theory for a general (not necessarily
compact) operator on a Banach space, before encountering a version of the Spectral The-
orem for a bounded self-adjoint operator in Hilbert space. Our rst results don't require
the full structure of in the algebra of operators on a Banach space, but just an arbitrary
Banach algebra structure, and so we start there.
The spectrum and resolvent in a Banach algebra.
Let
X
be a Banach algebra with
an identity element denoted
1
. We assume that the norm in
X
has been normalized so that
k
1
k
= 1. The two main examples to bear in mind are (1)
B
(
X
), where
X
is some Banach
space" and (2)
C
(
G
) endowed with the sup norm, where
G
is some compact topological
space, the multiplication is just pointwise multiplication of functions, and
1
is the constant
function 1.
In this set up the resolvent set and spectrum may be dened as before:
(
x
) =
f
2
C
j
x
;
1
is invertible
g
,
(
x
) =
C
n
(
x
). The
spectral radius
is dened to be
r
(
x
) =
sup
j
(
x
)
j
. For
2
(
x
), the resolvent is dened as
R
x
(
) = (
x
;
1
)
;1
.
32
Lemma.
If
xy
2
X
with
x
invertible and
k
x
;1
y
k
<
1
, then
x
;
y
is invertible,
(
x
;
y
)
;1
=
1
X
n
=0
(
x
;1
y
)
n
x
;1
and
k
(
x
;
y
)
;1
k
k
x
;1
k
=
(1
;
k
x
;1
y
k
)
.
Proof.
k
X
(
x
;1
y
)
n
x
;1
k
k
x
;1
k
X
k
x
;1
y
k
n
k
x
;1
k
=
(1
;
k
x
;1
y
k
)
so the sum converges absolutely and the norm bound holds. Also
1
X
n
=0
(
x
;1
y
)
n
x
;1
(
x
;
y
) =
1
X
n
=0
(
x
;1
y
)
n
;
1
X
n
=0
(
x
;1
y
)
n
+1
=
1
and similarly for the product in the reverse order.
As a corollary, we see that if
j
j
>
k
x
k
, then
1
;
x
is invertible, i.e.,
2
(
x
). In other
words:
Proposition.
r
(
x
)
k
x
k
.
We also see from the lemma that lim
!1
k
R
x
(
)
k
= 0. Another corollary is that if
2
(
x
) and
j
j
<
k
R
x
(
)
k
;1
, then
;
2
(
x
) and
R
x
(
;
) =
1
X
n
=0
R
x
(
)
n
+1
n
:
Theorem.
The resolvent
(
x
)
is always open and contains a neighborhood of
1
in
C
and
the spectrum is always non-empty and compact.
Proof.
The above considerations show that the resolvent is open, and so the spectrum is
closed. It is also bounded, so it is compact.
To see that the spectrum is non-empty, let
f
2
X
be arbitrary and dene
(
) =
f
R
x
(
)]. Then
maps
(
x
) into
C
, and it is easy to see that it is holomorphic (since we
have the power series expansion
(
;
) =
1
X
n
=0
f
R
(
)
n
+1
]
n
if
is suciently small). If
(
x
) =
, then
is entire. It is also bounded (since it tends to 0
at innity), so Liouville's theorem implies that it is identically zero. Thus for any
f
2
X
,
f
(
1
;
x
)
;1
] = 0 . This implies that (
1
;
x
)
;1
= 0, which is clearly impossible.
33
Corollary (Gelfand{Mazur).
If
X
is a complex Banach division algebra, then
X
is
isometrically isomorphic to
C
.
Proof.
For each 0
6
=
x
2
X
, let
2
(
x
). Then
x
;
1
is not invertible, and since
X
is a
division algebra, this means that
x
=
1
. Thus
X
=
C
1
.
Now we turn to a bit of \functional calculus." Let
x
2
X
and let
f
be a complex
function of a complex variable which is holomorphic on the closed disk of radius
k
x
k
about
the origin. Then we make two claims: (1) plugging
x
into the power series expansion of
f
denes an element
f
(
x
)
2
X
" and (2) the complex function
f
maps the spectrum of
x
into
the spectrum of
f
(
x
). (In fact onto, as we shall show later in the case
f
is polynomial.)
To prove these claims, note that, by assumption, the radius of convergence of the power
series for
f
about the origin exceeds
k
x
k
, so we can expand
f
(
z
) =
P
1
n
=0
a
n
z
n
where
P
j
a
n
jk
x
k
n
<
1
. Thus the series
P
a
n
x
n
is absolutely convergent in the Banach space
X
" we call its limit
f
(
x
). (This is the
denition
of
f
(
x
). It is a suggestive abuse of
notation to use
f
to denote the this function, which maps a subset of
X
into
X
, as well as
the original complex-valued function of a complex variable.) Now suppose that
2
(
x
).
Then
f
(
)
1
;
f
(
x
) =
1
X
n
=1
a
n
(
n
1
;
x
n
) = (
1
;
x
)
1
X
n
=1
a
n
P
n
=
1
X
n
=1
a
n
P
n
(
1
;
x
)
where
P
n
=
n
;1
X
k
=0
k
x
n
;
k
;1
:
Note that
k
P
n
k
n
k
x
k
n
;1
, so
P
1
n
=1
a
n
P
n
converges to some
y
2
X
. Thus
f
(
)
1
;
f
(
x
) = (
1
;
x
)
y
=
y
(
1
;
x
)
:
Now
f
(
)
1
;
f
(
x
) can't be invertible, because these formulas would then imply that
1
;
x
would be invertible as well, but
2
(
x
). Thus we have veried that
f
(
)
2
;
f
(
x
)
for
all
2
(
x
).
Theorem (Spectral Radius Formula).
r
(
x
) = lim
n
!1
k
x
n
k
1
=n
= inf
n
k
x
n
k
1
=n
.
Proof.
If
2
(
x
), then
n
2
(
x
n
) (which is also evident algebraically), so
j
n
j
k
x
n
k
.
This shows that
r
(
x
)
inf
n
k
x
n
k
1
=n
.
Now take
f
2
X
, and consider
(
) =
f
(
I
;
x
)
;1
] =
1
X
n
=0
;
n
;1
f
(
x
n
)
:
Then
is clearly holomorphic for
>
k
x
k
, but we know it extends holomorphically to
> r
(
x
) and tends to 0 as
tends to innity. Let
(
) =
(1
=
). Then
extends
34
analytically to zero with value zero and denes an analytic function on the open ball of
radius 1
=r
(
x
) about zero, as does, therefore,
(
)
=
=
1
X
n
=0
f
(
n
x
n
)
:
This shows that for each
j
j
<
1
=r
(
x
) and each
f
2
X
,
f
(
n
x
n
) is bounded. By the
uniform boundedness principle, the set of elements
n
x
n
are bounded in
X
, say by
K
.
Thus
k
x
n
k
1
=n
K
1
=n
=
j
j
!
1
=
j
j
. This is true for all
j
j
<
1
=r
(
x
), so limsup
k
x
n
k
1
=n
r
(
x
).
Corollary.
If
H
is a Hilbert space and
T
2
B
(
H
)
a normal operator, then
r
(
T
) =
k
T
k
.
Proof.
k
T
k
2
= sup
k
x
k1
h
TxTx
i
= sup
k
x
k1
h
T
Txx
i
=
k
T
T
k
since
T
T
is self-adjoint. Using the normality of
T
we also get
k
T
T
k
2
= sup
k
x
k1
h
T
TxT
Tx
i
= sup
k
x
k1
h
TT
TxTx
i
= sup
k
x
k1
h
T
T
2
xTx
i
= sup
k
x
k1
h
T
2
xT
2
x
i
=
k
T
2
k
2
:
Thus
k
T
k
2
=
k
T
2
k
. Replacing
T
with
T
2
gives,
k
T
k
4
=
k
T
4
k
, and similarly for all powers
of 2. The result thus follows from the spectral radius formula.
As mentioned, we can now show that
p
maps
(
x
)
onto
;
p
(
x
)
if
p
is a polynomial.
Spectral Mapping Theorem.
Let
X
be a complex Banach algebra with identity,
x
2
X
,
and let
p
a polynomial in one variable with complex coe cients. Then
p
;
(
x
)
=
;
p
(
x
)
.
Proof.
We have already shown that
p
;
(
x
)
;
p
(
x
)
. Now suppose that
2
;
p
(
x
)
.
By the Fundamental Theorem of Algebra we can factor
p
;
, so
p
(
x
)
;
1
=
a
$
ni
=1
(
x
;
i
1
)
for some nonzero
a
2
C
and some roots
i
2
C
. Since
p
(
x
)
;
1
is not invertible, it
follows that
x
;
i
1
is not invertible for at least one
i
. In orther words,
i
2
(
x
), so
=
p
(
i
)
2
p
;
(
x
)
.
35
Spectral Theorem for bounded self-adjoint operators in Hilbert space.
We now
restrict to self-adjoint operators on Hilbert space and close with a version of the Spectral
Theorem for this class of operator. We follow Halmos's article \What does the Spec-
tral Theorem Say?" (American Mathematical Monthly 70, 1963) both in the relatively
elementary statement of the theorem and the outline of the proof.
First we note that self-adjoint operators have real spectra (not just real eigenvalues).
Proposition.
If
H
is a Hilbert space and
T
2
B
(
H
)
is self-adjoint, then
(
T
)
R
.
Proof.
jh
(
1
;
T
)
xx
ij
j
Im
h
(
1
;
T
)
xx
ij
=
j
Im
jk
x
k
2
so if Im
6
= 0,
1
;
T
is injective with closed range. The same reasoning shows that
(
1
;
T
)
=
1
;
T
is injective, so
R
(
1
;
T
) is dense. Thus
2
(
T
).
Spectral Theorem for self-adjoint operators in Hilbert space.
If
H
is a complex
Hilbert space and
T
2
B
(
H
)
is self-adjoint, then there exists a measure space
with
measure
, a bounded measurable function
:
!
R
, and an isometric isomorphism
U
:
L
2
!
H
such that
U
;1
TU
=
M
where
M
:
L
2
!
L
2
is the operation of multiplication by
. (Here
L
2
means
L
2
(
"
C
)
,
the space of complex-valued functions on
which are square integrable with respect to the
measure
.)
Sketch of proof.
Let
x
be a nonzero element of
H
, and consider the smallest closed sub-
space
M
of
H
containing
T
n
x
for
n
= 0
1
:::
, i.e.,
M
=
f
p
(
T
)
x
j
p
2
P
C
g
. Here
P
C
is the
space of polynomials in one variable with complex coecients. Both
M
and its orthogonal
complement are invariant under
T
(this uses the self-adjointness of
T
). By a straightfor-
ward application of Zorn's lemma we see that
H
can be written as a Hilbert space direct
sum of
T
invariant spaces of the form of
M
. If we can prove the theorem for each of these
subspaces, we can take direct products to get the result for all of
H
. Therefore we may
assume from the start that
H
=
f
p
(
T
)
x
j
p
2
P
C
g
for some
x
. (In other terminology, that
T
has a
cyclic vector
x
.)
Now set =
(
T
), which is a compact subset of the real line, and consider the space
C
=
C
(
R
), the space of all continuous real-valued functions on . The subspace of
real-valued polynomial functions is dense in
C
(since any continuous function on can
be extended to the interval
;
r
(
T
)
r
(
T
)] thanks to Tietze's extension theorem and then
approximated arbitrarily closely by a polynomial thanks to the Weierstrass approximation
theorem). For such a polynomial function,
p
, dene
Lp
=
h
p
(
T
)
xx
i
2
R
. Clearly
L
is
linear and
j
Lp
j
k
p
(
T
)
kk
x
k
2
=
r
;
p
(
T
)
k
x
k
2
by the special form of the spectral radius formula for self-adjoint operators in Hilbert space.
Since
;
p
(
T
)
=
p
;
(
T
)
, we have
r
;
p
(
T
)
=
k
p
k
L
1
()
=
k
p
k
C
, and thus,
j
Lp
j
k
x
k
2
k
p
k
C
:
36
This shows that
L
is a bounded linear functional on a dense subspace of
C
and so extends
uniquely to dene a bounded linear functional on
C
.
Next we show that
L
is positive in the sense that
Lf
0 for all non-negative functions
f
2
C
. Indeed, if
f
=
p
2
for some polynomial, then
Lf
=
h
p
(
T
)
2
xx
i
=
h
p
(
T
)
xp
(
T
)
x
i
0
:
For an arbitrary non-negative
f
, we can approximate
p
f
uniformly by polynomials
p
n
, so
f
= lim
p
2
n
and
Lf
= lim
Lp
2
n
0.
We now apply the Riesz Representation Theorem for the representation of the linear
functional
L
on
C
. It state that there exists a nite measure on such that
Lf
=
R
f d
for
f
2
C
(it is a positive measure since
L
is positive). In particular,
h
p
(
T
)
xx
i
=
R
pd
for all
p
2
P
R
.
We now turn to the space
L
2
of complex-valued functions on which are square inte-
grable with respect to the measure
. The subspace of complex-valued polynomial func-
tions is dense in
L
2
(since the measure is nite, the
L
2
norm is dominated by the supremum
norm). For such a polynomial function,
q
, dene
Uq
=
q
(
T
)
x
. Then
k
Uq
k
2
=
k
q
(
T
)
x
k
2
=
h
q
(
T
)
xq
(
T
)
x
i
=
h
q
(
T
)
q
(
T
)
xx
i
=
Z
j
q
j
2
d
=
k
q
k
2
L
2
:
Thus
U
is an isometry of a dense subspace of
L
2
into
H
and so extends to an isometry of
L
2
onto a closed subspace of
H
. In fact,
U
is onto
H
itself, since, by the assumption that
x
is a cyclic vector for
T
, the range of
U
is dense.
Finally, dene
:
!
R
by
(
) =
. If
q
is a complex polynomial, then (
M
q
)(
) =
q
(
), which is also a polynomial. Thus
U
;1
TUq
=
U
;1
Tq
(
T
)
x
=
U
;1
;
(
M
q
)(
T
)
x
=
M
q:
Thus the bounded operators
U
;1
TU
and
M
coincide on a dense subset of
L
2
, and hence
they are equal.
For a more precise description of the measure space and the extension to normal oper-
ators, see Zimmer.
MATH 502
Second exam
May 9, 1997
Professor Arnold
Do 3 problems, omit 1. State which problem you omit. Partial credit will be awarded, but
errors and unacknowledged omissions count negatively.
1. Suppose that
x
x
1
x
2
:
:
:
are elements in a Hilbert space such that
x
n
converges to
x
weakly and
kx
n
k
converges to
kxk
. Prove that
x
n
converges to
x
in norm. Give a
counterexample in Banach space.
2. Let
X
and
Y
be Banach spaces and set
U
=
f
T
2
B
(
X
Y
)
j
T
(
Y
) =
X
g:
Prove that
U
is open in
B
(
X
Y
).
3. Prove that the sum of a closed subspace and a nite dimensional subspace of a Banach
space is closed.
4. Prove that any compact subset of the complex plane is the spectrum of a bounded
linear operator on a Banach algebra.
MATH 502
Second exam
May 9, 1997
Professor Arnold
Do 3 problems, omit 1. State which problem you omit. Partial credit will be awarded, but
errors and unacknowledged omissions count negatively.
1. Suppose that
x
x
1
x
2
:
:
:
are elements in a Hilbert space such that
x
n
converges to
x
weakly and
kx
n
k
converges to
kxk
. Prove that
x
n
converges to
x
in norm. Give a
counterexample in Banach space.
Under the given hypotheses,
kx
n
;
xk
2
=
kx
n
k
2
+
kxk
2
;
2
hx
n
xi
=
kx
n
k
2
;
kxk
2
;
2
hx
n
;
x
xi
!
0
:
For a counterexample in Banach space, consider
x
n
=
e
1
+
e
n
in
c
0
, where the
e
n
are the
usual sequence space basis elements. Then
kx
n
k
= 1 for all
n
>
1 and
x
n
!
e
1
weakly.
2. Let
X
and
Y
be Banach spaces and set
U
=
f
T
2
B
(
X
Y
)
j
T
(
Y
) =
X
g:
Prove that
U
is open in
B
(
X
Y
).
Since the null-space of
T
is the annihilator of range of
T
,
T
is one-to-one i
T
has dense
range. Combining with the closed range theorem, we get that
T
is one-to-one with closed
range i
T
has dense, closed range, i.e., i
T
is surjective. Thus
U
is the set of operators
which are one-to-one with closed range, which is the set of operators for which there exists
a
c
>
0 such that
kT
xk
ckxk
for all
x
. If
T
2
U
and
c
>
0 is the corresponding constant,
we show that
T
+
S
2
U
for all
kS
k
c=
2. Indeed
k
(
T
+
S
)
xk
kT
xk
;
kS
xk
ckxk
;
c=
2
kxk
=
c=
2
kxk:
3. Prove that the sum of a closed subspace and a nite dimensional subspace of a Banach
space is closed.
It is enough to consider a closed space
S
and one element
x
=
2
S
and show that
S
+
Rx
is closed. By the Hahn-Banach Theorem there is a bounded linear functional
f
which
vanishes on
S
and satises
f
(
x
) = 1. Now suppose that
y
n
2
S
+
R
x
and
y
n
!
y
. Say
y
n
=
s
n
+
r
n
x
,
s
n
2
S
,
r
n
2
R
. Then
r
n
=
f
(
y
n
)
!
f
(
y
). Also
s
n
=
y
n
;
r
n
x
!
y
;
f
(
y
)
x
,
and, since
S
is closed,
y
;
f
(
y
)
x
2
S
. Thus
y
=
y
;
f
(
y
)
x
] +
f
(
y
)
x
2
S
+
Rx
.
Alternate proof:
S
+
F
=
;1
(
F
) where
is the projection of the Banach space
X
onto
X=S
. Since
F
is nite dimensional,
F
is nite dimensional, hence closed in
X=S
, and its
inverse image under a continuous map is closed as well.
4. Prove that any compact subset of the complex plane is the spectrum of a bounded
linear operator on a Banach algebra.
Let
K
be the compact set and let
X
be the space of continuous complex valued functions
on
K
. Dene
T
:
X
!
X
by
T
f
(
z
) =
z
f
(
z
). If
=
2
K
, then dist(
K
)
>
0, so 1
=
(
z
;
)
is bounded on
K
, so multliplication by 1
=
(
z
;
) is continuous on
K
, and provides the
inverse of
T
;
1
. Thus
=
2
(
T
). Conversely, if
2
K
, then we can take
f
2
X
of norm
one with (
T
;
1
)
f
arbitrarily small (e.g., if
f
is chosen to takes values in 0 1] and vanish
when
jz
;
j
, then
k
(
T
;
I
)
f
k
). This shows that
T
;
1
cannot be invertible
i.e.,
2
(
T
). Note: there are variants of this construction. E.g., we could choose a dense
sequence of numbers in
K
and consider the corresponding diagonal matrix on
`
2
.