Advanced Calculus
Advanced Calculus
Real Analysis
Second Edition
Robert G. Bartle
Professor of Mathematics
University of Illinois
Urbana-Champaign
John Wiley
& Sons., New York - London - Sydney - Toronto
Copyright © 1964, 1976 by John Wiley & Sons, Inc.
10987654321
To my parents
Preface
vii
viii PREFACE
Sections 4 to 10
29, 30, 31
PREFACE ix
xi
xii CHAPTER SUMMARIES
. Connected Sets, 80
The connectedness of intervals in R, polygonally connected open
sets are connected, connected sets in R are intervals
. The Complex Number System, 86
Definition and elementary properties
Convergence 90
14. Introduction to Sequences, 90
Convergence, uniqueness of the limit, examples
15. Subsequences and Combinations, 98
Subsequences, algebraic combinations of sequences
16. Two Criteria for Convergence, 104
Monotole Convergence Theorem, Bolzano-Weierstrass
Theorem, Cauchy sequences, the Cauchy Criterion
17. Sequences of Functions, 113
Convergence, uniform convergence, the uniform norm, Cauchy
Criterion for Uniform Convergence
18. The Limit Superior, 123
The limit superior and inferior of a sequence in R, unbounded
sequences, infinite limits
19, Some Extensions, 128
Order of magnitude, Cesaro summation, double sequences,
iterated limits
References 456
Index 475
INTRODUCTION:
A GLIMPSE AT
SET THEORY
1
2 INTRODUCTION: A GLIMPSE AT SET THEORY
member of the set A, or that the set A contains the element x, or that x is
in A. We shall not examine the nature of this property of being an element
of a set any further. For most purposes it is possible to employ the naive
meaning of ‘‘membership,” and an axiomatic characterization of this rela-
tion is not necessary.
If A is a set and x is an element which does not belong to A, we shall
often write
x€A.
y
(4)
Figure 1.1
1. THE ALGEBRA OF SETS 3
Thus in order to show that the sets A and B are equal we must show that the
possibilities (2) and (3) mentioned above cannot occur. Equivalently, we must
show that both AC B and BCA,
{x : P(x)}
for the set of all elements x for which the property P holds. We usually
read this as ‘‘the set of all x such that P(x).” It is often worthwhile to
specify which elements we are testing for the property P. Hence we shall
often write
{x eS: P(x)}
for the subset of S for which the property P holds.
Set Operations
We now introduce some methods of constructing new sets from given ones.
1.2. DEFINITION. If A andB aresets, then their intersection
is the set of
all elements that belong to both A and B. We shall denote the
intersection of the sets A, B by the symbol ANB, which is read “A
intersect B.” (See Figure 1.2.)
1.3. Derinrrion. If A and B are sets, then their union is the set of all
elements which belong either to A or to B or to both A and B. We shall
denote the union of the sets A, B by the symbol A UB, which is read “A
union B.” (See Figure 1.2.)
We could also define A NB and AUB by
ANB={x:xeA and xeB},
AUB={x:xeEA or xeBh.
In connection with the latter, it is important to realize that the word “‘or’’ is
being used in the inclusive sense that is customary in mathematics and
logic. In legal terminology this inclusive sense is sometimes indicated by
“and/or.”
We have tacitly assumed that the intersection and the union of two sets is again a
set. Among other things this requires that there must exist a set which has no elements
atall(forifA andB have nocommon elements, their intersection has no elements).
1.4 DEFINITION. The set which has no elements is called the empty or
the void set and will be denoted by the symbol#. If A and B are sets withno
common elements (thatis, if AM B = 9), then we say that A and B are disjoint
or that they are non-intersecting.
1. THE ALGEBRA OF SETS 5
ANB KY
aus TMM
Figure 1.2. The intersection and union of two sets.
The next result gives some of the algebraic properties of the operations on
sets that we have just defined. Since the proofs of these assertions are
routine, we shall leave most of them to the reader as exercises.
x<€BorxeC. Hence we either have (i) x ¢ A and x €B, or we have (ii) xe A and
xéC. Therefore, either xe ANB or xe ANC, so xE(ANB)U(ANC). This
shows that AN(B UC) is a subset of (AN B)U(ANC).
Conversely, let y be anelementof(A NB)U (AMC). Then, either (iii) ye AM B,
or (iv) ye ANC. It follows that y € A, and either ye BoryeC. Therefore, ye A
and ye BUC so that ye AN(BUC). Hence (ANB)U(ANC) is a subset of
AN(BUC). In view of Definition 1.1, we conclude that the sets A M(B UC) and
(A NB)U(ANC) are equal.
Asan indication of an alternate method, we note that there are, in principle, a total of
8( = 2°) possibilities for an element x relative to three sets A, B, C (see Figure 1.3);
namely:
The proof consists in showing that both sides of the first equation in (d) contain those
and only those elements x belonging to the cases (1), (2), or (3).
A=A,UA,U-:-*UA,, B=A:iNA2N-*
+N Aas.
es
(en
(8)
Figure 1.3
1. THE ALGEBRA OF SETS 7
Sometimes, in order to save space, we mimic the notation used in calculus for
sums and employ a more condensed notation, such as
A=Uiz A= UfAsi=12,...,0),
A\B={xEA:x¢B}.
ANB ES
1.7 THEOREM. The sets AMB and A\B are non-intersecting and
A=(ANB)U(A\B).
PROOF. Suppose xe AMB and xe A \B. The latter asserts that xe A
and x¢B which contradicts the relation xe¢ AM B. Hence the sets are
disjoint.
IfxeA, then eitherx¢ Borx¢B. Inthe formercase x ¢ A andx€B so
that xe AMB. In the latter situation, xe A and x¢B so thatxeA\B.
This shows that A is a subset of (AMB)U(A\B). Conversely, if ye
(AN B)U(A \B), then either ye ANB, or ye A\B. In either case we
have y € A, showing that (A 1 B) U(A \ B) isa subset of A. QED.
We shall now state the De Morgan? laws for three sets; a more general
formulation will be given in the exercises.
A\(BUC)=(A\B)N(A\C),
A\(BNC)=(A\B)U(A\C).
PROOF. We shall carry out a demonstration of the first relation, leaving
the second one to the reader. To establish the equality of the sets, we show
that every element in A\(BUC) is contained in both (A\B) and
(A \C) and conversely.
Ifxisin A \(B UC), then x isinA butxisnotinBUC. Hencex isin A,
but x isneither in B norin C.(Why?) Therefore, x isin A but not B, and x is
in A but not C. That is, xe A\B and xeA\C, showing that xe
(A\B)N(A\C).
Conversely, if xe(A\B)N(A\C), then xe(A\B) and xe(A\C).
Thus x¢ A and bothx¢B andx¢C. It follows that xe A andx¢(BUC),
so thatxe A\(BUC).
Since the sets (A \B)N(A\C) and A \(BUC) contain the same ele-
ments, they are equal by Definition 1.1. OED,
Cartesian Product
1.9 DeFINITION. If A and B are two non-void sets, then the Cartesian
product A x B of A and B is the set of all ordered pairs (a, b) with ae A and
be B. (See Figure 1.5.)
AxB
(The definition just given is somewhat informal as we have not defined what is
meant by an ‘ordered pair.” We shall not examine the matter further except to
mention that the ordered pair (a, b) could be defined to be the set whose sole
elements are {a}, {a, b}. Itcan then be shown that the ordered pairs (a, b) and (a’, b’)
are equal if and only if a=a' and b=b’. This is the fundamental property of
ordered pairs.)
Thus if A ={1, 2, 3} and B = {4, 5}, then the set A x B is the set whose
elements are the ordered pairs
(1, 4), (1,5), (2, 4), (2, 5), (3, 4), (G, 5).
We may visualize the set A x B as the set of six points in the plane with the
coordinates which we have just listed.
We often draw a diagram (such as Figure 1.5) to indicate the Cartesian
product of two sets A, B. However, it should be realized that this dia-
gram may be somewhat of a simplification. For example, if A=
{xe R:l<x<2}andB={xeR:0<x<1 o0r2<x <3}, then instead of
a rectangle, we should have a drawing like Figure 1.6.
2 ~
AxB
1 2
ee)
1 2
Exercises
1.A. Draw a diagram to represent each of the sets mentioned in Theorem 1.5.
1.B. Prove part (c) of Theorem 1.5.
1.C. Prove the second part of (d) of Theorem 1.5.
1.D. Prove that A cB if and only if ANB=A.
1.E. Show that the set D of all elements that belong either to A or to B but not to
both is given by
D=(A\B)U(B\A).
1.J. If {Ay, Az,..., A,}is a collection of sets, and if E is any set, show that
ENN
A =(\(ENA), EU A=[) (EVA).
rl mi yal an
1.K. Let E be aset and {A,, A.,..., A,} bea collection of sets. Establish the De
Morgan laws:
E\MQA=U(B\A)
V= a
E\U A= pel I=
(ENA).
Note that if E \ A, is denoted by €(A,), these relations take the form
e(Aa)-Uea),
jel j=l
(0 a)-A 6a),
ye! rl
1.L. Let J be any set and, for each j € J, let A, be contained in X. Show that
AXB=(AXB,)U(AXB,).
Section 2. Functions
We now turn to a discussion of the fundamental notion of a function or
mapping. It will be seen that a function is a special kind of a set, although
2. FUNCTIONS il
there are other visualizations which are often suggestive. All of the later
sections will be concerned with various types of functions, but they will
usually be of less abstract nature than considered in the present intoductory
section.
To the mathematician of a century ago the word “function” ordinarily
meant a definite formula, such as
f(x)=x?+3x—-5,
which associates to each real number x another real number f(x). The fact
that certain formulas, such as
g(x)=vx—-5
do not give rise to real numbers for all real values of x was, of course,
well-known but was not regarded as sufficient grounds to require an
extension of the notion of function. Probably one could arouse controversy
among those mathematicians as to whether the absolute value
h(x) =|x|
of a real number is an “honest function” or not. For, after all, the definition
of |x| is given “in pieces” by
I={ x, if x20,
lax if x<0.
As mathematics developed, it became increasingly clear that the require-
ment that a function be a formula was unduly restrictive and that a more
general definition would be useful. It also became evident that it isimportant
to make a clear distinction between the function itself and the values of the
function. The reader probably finds himselfin the position of the mathemati-
cian of a century ago in these two respects due to no fault of his own. We
propose to bring him up to date with the current usage, but we shall do so in
two steps. Our first revised definition of a function would be:
A function f fromaset A toa set Bisa rule of correspondence that assigns
to each x in a certain subset D of A, a uniquely determined element f(x)
of B.
Certainly, the explicit formulas of the type mentioned above are includedin
this tentative definition. The proposed definition allows the possibility that
the function might not be defined for certain elements of A and also allows the
consideration of functions for which the sets A and B are not necessarily real
numbers (but might even be desks and chairs—or cats and dogs).
12 INTRODUCTION: A GLIMPSE AT SET THEORY
b=f(a) or frarb
instead of (a, b)ef. We often refer to the element b as the value off at the
point a, or the image under f of the point a.
Tabular Representation
TABLE 2.1
Player Free Throws Made
Anderson
OTAPRPUCN
Bade
Bateman
Hochschild
Kakutani
Kovalevsky
Osborn
Peressini
SPN
Rosenberg
14 INTRODUCTION: A GLIMPSE AT SET THEORY
\
f(x)
Figure 2.3. A function as a machine.
2. FUNCTIONS 15
This last visualization makes clear the distinction between f and f(x): the
first is the machine, the second is the output of the machine when we put x into
it. Certainly it is useful to distinguish between a machine and its outputs.
Only afool would confuse a meat grinder with ground meat; however, enough
people have confused functions with their values that itis worthwhile to make
a modest effort to distinguish between them notationally.
If f isa function with domain D(f) and D1 isa subset of D(f), itis often useful
to define a new function f, with domain D, by fi(x)=f(x) for all xeD,.
This, function f, is called the restriction off to the set D,. In terms of
Definition 2.2, we have
fi={(a, De f:ae Dy}.
Sometimes we write f; =f | D, to denote the restriction of the functionf to the
set Dj.
A similar construction (that appears less artificial) is the notion of an
“extension.” If g is a function with domain D(g) and D2 > D(g), then any
function g2 with domain D2 such that g2(x) = g(x) for all x € D(g) is called an
extension of g to the set Dz.
Composition of Functions
2.4 ExampLes. (a) Letf, g befunctions whose values at the real number
x are the real numbers given byt
Since D(g) is the set R of all real numbers and R(f)< D(g), the domain
D(gef) is also R and gef(x) =3(2x)’>—1=12x’—1. On the other hand,
D(feg)=R, but fog(x) = 2(3x?— 1) = 6x?—2.
(b) If h is the function with D(h) = {x € R:x = 1} defined by
h(x) =vx-1,
g(y)
= Are siny or g(y) =Sin’' y.
surjective (that is, it maps D(f) onto B). Iff is bijective, we may say that f
is a bijection.
(c) Since ECE UF and FCEUB, it follows from part (a) that f(E)U
f(F)Sf(E UF). Conversely, if y <¢f(E UF), then there exists an element
x€EUF such that y=f(x). Since x€E or x€F, it follows that either
y = f(x) €f(E) or that yef(F). Therefore, we conclude that f(E UF) ¢
f(E) Uf(F), which completes the proof of part (c).
(d) Part (d) follows immediately from (a). O.E.D.
It will be seen in Exercise 2.J that, in general, it is not possible to replace the
inclusion sign in (b) by equality.
We now introduce the notion of the inverse image of a set under a function.
Note that it is not required that the function be injective.
2.12 DeFINITION. If H is a subset of B, then the inverse image of H
under f is the subset of D(f) given by
{x : f(x)
€ H}.
We usually denote the inverse image of a set H underf by the symbol f~'(H).
(See Figure 2.7.)
Once again, we emphasize thatf need not be injective so that the inverse function f~'
need not exist. (However, if f-’ does exist, then f-'(H) is the direct image of H
under f7').
Exercises
2.4. Prove that Definition 2.2 actually yields a function and not just a subset.
2.B. Let A= B=R and consider the subset C = {(x, y):x°+ y’=l} of AX B. Is
this set a function with domain in R and range in R?
2.C. Consider the subset of R x R defined by D = {(x, y):|x|+|y|= 1}. Describe
this set in words. Is it a function?
2.D. Give an example of two functions f, g on R to R such that f# g, but such that
fog=gof.
2.E. Prove that if f is an injection from A to B, then f"'={(b, a): (a, b)ef}isa
function. Then prove it is an injection.
2.F. Suppose f is an injection. Show that f’ef(x)=x for all x in D(f) and
fef(y)=y for all y in R(f).
2.G. Let f and g be functions and suppose that gef(x) =x for all x in D(f).
Show that f is injection and that R(f)< D(g) and R(g)S D(f).
2.H. Let f, g be functions such that
Ifn, m € N, we all have an intuitive idea of what is meant by saying that n is less
than or equal to m. We now borrow this notion, realizing that complete
precision requires more analysis than we have given. We assume that every
non-empty subsetof Nhasaleastelement. Thisisan important property of N;
we sometimes say that N is well-ordered meaning that N has this property.
This Well-Ordering Property is equivalent to mathematical induction. We
shall feel free to make use of arguments based on mathematical induction,
which we suppose to be familiar to the reader.
By an initial segment of N is meant a set consisting of all of the natural
numbers which are less than or equal to some fixed element of N. Thus an
initial segment S, of N determines and is determined by an element n of N
as follows:
An element x of N belongs to S, if and only if x <n.
For example: the subset S, = {1, 2} is the initial segment of N determined by
the natural number 2; the subset S,={1, 2, 3, 4} is the initial segment of N
determined by the natural number 4; but the subset {1, 3, 5} of N is not an
initial segment of N, since it contains 3 but not 2, and 5 but not 4.
+ A reader wishing to learn about these topics should consult the book of Halmos cited in the
References.
3. FINITE AND INFINITE SETS 23
E ={2,4,6,8,...}
O={1,3,5,7,...}
are not initial segments of N. However, since they can be put into one-one
correspondence with all of N (how?), they are both denumerable.
Even though the set Z of all integers
Z={...,—2,-1,0,1,2,...},
contains the set N, it may be seen that Z is a denumerable set. (How?)
We now state some theorems without proof. At first reading itis probably
best to accept them without further examination; ona later reading, however,
the reader will do well to attempt to provide proofs for these statements. In
doing so, he will find the inductive property of the set N of natural numbers to
be useful.t
It is a consequence of the second part of Theorem 3.4 that the set Q of all
rational numbersformsacountableset. (Werecallthatarationalnumberisa
fraction m/n, where m and n are integers andn#0. To see that Q isa
countable set we form the sets
Ao= {0},
Ai=fi, —t i i 1
Note that each of the sets A, is countable and that their union is all of Q.
Hence Theorem 3.4 asserts that Q iscountable. Infact, wecan enumerate Q
by the “diagonal procedure”’:
0 1112 _ 11
915 49 2p 1p 2p Bp ee ee
Despite the fact that the set of rational numbers is countable, the entire
set R of real numbers is not countable. In fact, the set I of real numbers x
satisfying 0 = x = 1 isnot countable. To demonstrate this, we shall use the
elegant “‘diagonal’’ argument of G. Cantor.j We assume it is known that
every real number x with 0 =x <1 has a decimal representation in the
form x = 0.a,a2a3 +--+, where each a, denotes one of the digits 0, 1, 2, 3, 4,
5, 6, 7, 8, 9. It is to be realized that certain real numbers have two
representations in this form (for example, the rational number 7 has the
two representations
0.1000 -- - and 0.0999 ---).
X2=0.bib.bs see
X3=0.€1C2€3° °°
Nowlet y; bea digit different from 0, a,, and 9; let y. bea digit different from 0,
bo, and 9; let ys bea digit different from 0, c3,and 9, etc. Consider the number
y with decimal representation
y= O.yi1y2y3 tte
which clearly satisfies 0< y <1. The number y is not one of the numbers
with two decimal representations, since y,#0,9. Atthesame time y # x, for
any n (since the nth digits in the decimal representations for y and x, are
different). Therefore, any denumerable collection of real numbers in this
interval will omit at least one real number belonging to this interval.
Therefore, this interval is not a countable set.
Suppose that a set A is infinite; we shall suppose that there is a one-one
correspondence with a subset of A and allof N. In other words, we assume
that every infinite set contains a denumerable subset. This assertion is a weak
form of the so-called ‘‘Axiom of Choice,”’ which is one of the usual axioms of
set theory. After the reader has digested the contents of this book, he may
turn to an axiomatic treatment of the foundations which we have been
discussing in a somewhat informal fashion. However, for the moment he
would do well to take the above statement as a temporary axiom. It can be
replaced later by a more far-reaching axiom of set theory.
Exercises
27
28 THE REAL NUMBERS
In this section we shall give the ‘‘algebraic” structure of the real number
system. Briefly expressed, the real numbers form a ‘“‘field’’ in the sense
of abstract algebra. We shall now explain what that means.
By a binary operation in a set F we mean a function B with domain F x F
and rangein F. Instead of using the notation B(a, b) to denote the value of
the binary operation B at the point (a, b) in F x F, it is conventional to use
a notation such as aBb, or a+b, or a: b.
4.1 ALGEBRAIC PROPERTIES OF R. In the set R of real numbers
there are two binary operations (denoted by+ and - and called addition and
multiplication, respectively) satisfying the following? properties:
(Al) a+b=b+a_ forall a, bin R;
(A2) (a+b)+c=a+(b+c) for all a, b,c in R;
(A3) there exists an element 0 in R with O+a=a and a+0=
a for all a in R;
(A4) for each element a in R there is an element —a in R such that
at+(-a)=0 and (-a)+a=0;
(M1) a-b=b-a_ forall a, bin R;
(M2) (a: b)-c=a-:(b:c) for all a, b,c in R;
(M3) the element 1 in R is distinct from 0 and has the property that
1l-a=a and a-:l=a forallainR;
(M4) for each element a# 0 in R there is an element 1/a in R such that
a:(t/a)=1 and (1/a)-a=1;
(D) a-(b+c)=(a-b)+(a-c) and (b+c)-a=(b-a)t+(c-a) for
all a, b, c in R.
These properties are certainly familiar to the reader. We will now
obtain a few easy (but important) consequences of them. First of all we
shall prove that 0 is the only element of R that satisfies (A3), and 1 is the
only element that satisfies (M3).
4.2 THEOREM. (a) If z and a are elements of R such that z+a=a,
then z =0.
(b) If w and b40 are elements ofR such that w - b=), thenw=1.
PROOF. (a) The hypothesis is that z+ a= a. Add —a to both sides and
use (A4), (A2), (A4), and (A3) to obtain
0=a+(—a)=(z+a)+(—a)=z+(a+(-a))
=z+0=z.
t This list is not intended to be “‘minimal.’” Thus the second assertions in (A3) and (A4)
follow from the first assertions by using (A1).
4. THE ALGEBRAIC PROPERTIES OF R 29
The proof of part (b) is left as an exercise. Note that it uses the
hypothesis that b# 0. Q.E.D.
We now show that the elements —a and 1/a (when a# 0) are uniquely
determined by the properties given in (A4) and (M4).
4.3 THEOREM. (a) If a and b are elements of R and a+b=0, then
b=~a.
(b) If a#0 and b are elements ofR and a - b =1, then b = 1/a.
PROOF. (a) Ifa+b=0, add —a to both sides to obtain (-a)+(a+b)=
—a+0. Now use (A2) on the left and (A3) on the right side to obtain
((-a)+a)+b=—a.
If we use (A4) and (A3) on the left side, we obtain b = —a.
The proof of (b) is left as an exercise. Note that it uses the hypothesis
that a#0. Q.E.D.
Properties (A4) and (M4) guarantee the possibility of solving the equa-
tions
=(-a)+(at+x1)=(-a)+b.
Hence x; = (—a)+b.
The proof of part (b) is left as an exercise. QED.
30 THE REAL NUMBERS
—(-1)=(-1): (2).
Hence the assertion follows from part (d) with a = 1. Q.E.D.
4.6 THEOREM (a) Ifae Rand a#0, then 1/a40 and 1/(1/a) =a.
(b) Ifa, be Randa - b=0, then either a=0 or b=0.
(c) If a, bE R, then (—a) + (—b) =a - b.
(d) IfaeR and a# 0, then 1/(-a)=—(1/a).
b=1-b=((1/a)-a)-b=(1/a)-(a-b)
=(1/a)-0=0.
Rational Numbers
From now on we shall generally drop the use of the dot to denote
multiplication and write ab for ab. As usual we shall write a’ for aa,
nti
a’ for aaa =(a’)a, and if ne N we define a"**=(a")a. It follows by use
of mathematical induction that if m, ne N, then
(*) a™*" =q"q"
b/a or b
a
instead of (1/a)-b=b-+(1/a). We shall also write a for 1/a, and a™ for
1/a". It can then be shown that formula (*) above holds for m, ne Z when
ax 0.
Elements of R which are of the form
b —b
aon |
for a, bE N, a#0, are said to be rational numbers, and the set of all
rational numbers in R will be denoted by the standard notation Q. All of
the elements of R which are not rational numbers are said to be irrational
numbers. Although this terminology is unfortunate, it is also quite
standard and we shall adopt it.
We shall close this section with a proof of the fact that there does not
exist a rational number whose square is 2.
4.7 THEOREM. There does not exist a rational number r such that
2
r=2.,
32 THE REAL NUMBERS
Exercises
It should be noted that, according to the terminology just introduced, the number
0 is both positive and negative; it is the only number with this dual status. This
terminology may seem a bit strange at first, but it will prove to be a convenience.
Some authors reserve the term “‘positive’’ for the elements of the set P and use the
term ‘non-negative’ for the elements of P U {0}.
a<b<c or c>b>a.
axb<c or c>b=a.
PROOF. (a) If a—b and b—c belong to P, then from 5.1(i) we infer
that a—c=(a—b)+(b—c) also belongs to P. Hence a>c.
(b) By 5.1(iii) exactly one of the following possibilities takes place:
a—beP, a-b=0, b-a=-(a-—b)eP.
(c) If a# b, then from part (b) we must have either a—b or b—a in P.
Hence, either a>b or b>a. In either case one of the hypotheses is
contradicted. O.E.D.
34 THE REAL NUMBERS
Absolute Value
The trichotomy property 5.1(iii) assures that if a~0, then one of the
numbers a and —a is strictly positive. The absolute value of a#0 is
defined to be the strictly positive one of the pair {a, —a}, and the absolute
value of 0 is defined to be 0.
5.10 DerFinirion. If ae R, the absolute value of a is denoted by |a|
and is defined by
laj=a if a0,
=—a if a<0.
Thus the domain of the absolute value function is all of R, its range is
PU {0}, and it maps the elements a, —a, into the same element.
5.11 THEOREM. (a) |al=0 if and only if a=0.
(b) |-al=|a| forallaeR.
(c) |ab| =a] |b| forall a, bER.
(d) Ifc = 0, then ja|=c if and only if -cs asc.
(e) -lja|<a<ja| forallaeR.
PROOF. (a) If a=0, then by definition |0}=0. If a#0, then also
—a#0, so that |a| #0.
(b) If a=0, then |0|=0=|-O|. If a>0, then JaJ=a=|-al. If a<0,
then |a| =—a =|-al.
(c) If a>0 and b>0, then ab>0 so that |ab|=ab=|a||b|. If a<0
and b>0, then ab<0 so that |ab|=—(ab)=(—a)b=|a||b|. The other
cases are handled similarly.
36 THE REAL NUMBERS
(d) If(a|=c, then both a=c and—a=c. (Why?) From the latter and
Theorem 5.6(c’) we infer that —-c < a so that -c<a=c. Conversely, if
this relation holds then both a < c and —a < c, whence |a| = ¢.
(e) Use part (d) with c =|a| = 0. QED.
The next result will be used very frequently in the sequel. (Recall that
a+b means both a+b and a—b.)
5.12 THE TRIANGLE INEQUALITY. If a, b are any real numbers, then
5.13. CoROLLARY. [If a1, @2,...,@, are any n real numbers, then
a: ta2t- + aa] = lai]t+la2}+---+fanl.
PROOF. If n=2, the conclusion is precisely 5.12. If n>2, we use
mathematical induction and the fact that
<|ait+aot+-+-+axl+la.i|. o8D.
Exercises
5.1. Show that ifa<x<b and a<y<=b, then |x—y|<b-—a. Interpret this
geometrically.
5.J. Let 6>0, ae R. Show that a—8<x<a+té if and only if |x—al/<6.
Similarly, a—8 <x <a+6 if and only if |x—a| = 8.
5.K. If a, be R and b# 0, show that |a/b| = |a|/|b].
5.L. Show if a, be R, then |a + b|=|a|+|b| if and only if ab = 0.
5.M. Sketch the points (x, y) in the plane R < R for which |y| = |x].
5.N. Sketch the points (x, y) in the plane R x R for which |x|/+|y|= 1.
5.0. If x, y, z belong to R, then x = y = z if and only if |x — y|+|y —z|=|x—z|.
5.P. If0<a<1, then 0<a*<a<1, while if 1<a, then 1<a<a’,
In this section we shall present one more property of the real number
system which is often called the ‘completeness property” since it guaran-
tees the existence of elements in R when certain hypotheses are satisfied.
There are various versions of this completeness property, but we choose to
give here what is probably the most efficient method by assuming that
bounded sets in R have a supremum.
™\ aN
Lower bounds for S Upper bounds for S
proof. Suppose that u satisfies (i) and (ii). The condition (i) implies
that u is an upper bound of S._ If v is any number with v <u, then property
(ii) shows that v cannot be an upper bound of S. Hence u is the supremum
of S.
Conversely, let u be the supremum of S. Since u is an upper bound of
S, condition (i) holds. If o<u, then v is not an upper bound of S.
Therefore, there exists an element s, € S such that v < s,. Q.E.D.
The reader should convince himself that the number 1] is the supremum
of both of the sets S$; and S, which were defined after Definition 6.1. We
note that S, contains its supremum, but that S, does not contain its
supremum. Thus, when we say that a set has a supremum, we are
making no statement as to whether the set contains the supremum as an
element or not.
It is a deep and fundamental property of the real number system that
every non-empty subset of R which is bounded above has a supremum. We
shall make frequent and essential use of this property, which we take as
our final assumption about R.
6.4 SUPREMUM PROPERTY. Every non-empty set of real numbers
which has an upper bound has a supremum.
The analogous property of infima can be readily established from the
Supremum Property.
6.5 INFIMUM PROPERTY. Every non-empty set of real numbers which
has a lower bound has an infimum.
PROOF. Let S be bounded below and let S;={-s:s¢S} so that S, is
bounded above. The Supremum Property assures that S, has a supremum
u. We leave it to the reader to show that —u is the infimum of S. O.E.D.
The Existence of V2
One important property of the Supremum Property is that, as we have
said before, it assures the existence of certain real numbers. We shall
make use of it many times in this way. At the moment we will show that it
guarantees the existence of a positive real number x such that x* = 2; that
is, a positive square root of 2. This result complements Theorem 4.7.
6.8 THEOREM. There exists a positive number x © R such that x* = 2.
PROOF, Let S={y€R:0<y,y’= 2}. The set S is bounded above by
2; for, if not, then there exists an element s¢S such that 2<s whence it
follows that 4<s? <2, a contradiction. By the Supremum Property the
set S has a supremum and we let x= sup S. Clearly x >0.
We claim that x°=2. If not, then either x°<2 or x’°>2. If x?<2, let
néN be chosen such that 1/n <(2-x”)/(2x +1). In this case
2
b=Va or b=a"’,
We now know that there exists at least one irrational element, namely,
V2 (the positive square root of 2). Actually there are “more” irrational
numbers than rational numbers in the sense that (as we have seen in
Section 3) the set of rational numbers is countable while the set of irra-
tional numbers is not countable. We shall now show that there are
arbitrarily small irrational numbers; this result complements Corollary 6.7.
6.9 CoROLLARY. Let €>0 be an irrational number and let z>0.
Then there exists a natural number m such that the irrational number E/m
satisfies O0<é/m <z.
PROOF. Since €>0, z>0, it follows from Theorem 5.6(d) and 5.6(c)
that é/z>0. By the Archimedean Property there exists a natural number
m such that 0<é/z<m. Therefore 0<é/m<z and it is an exercise to
show that &/m is irrational. Q.E.D.
We now show that between any two distinct real numbers there is a
rational number and an irrational number. (In fact, there are infinitely
many of both kinds!)
6.10 THEOREM. Let x and y be real numbers with x <y.
(a) Then there is a rational number r such thatx<r<y.
(b) If €>0 is any irrational number, then there is a rational number s such
that the irrational number sé satisfies x < s& <y.
PROOF. It is no loss of generality to assume that 0<x. (Why?)
(a) Since y —x >0, it follows from Corollary 6.7(b) that there is a natural
number m such that 0<1/m<y—x. From Corollary 6.7(a) there is a
natural number k such that
k =k _— Xx,
m m
and we let n be the least such natural number. Therefore
n-1 n
—s=x<—.
m m
42 THE REAL NUMBERS
which implies that y-x =< 1/m contrary to the choice of m. Therefore
x<nim<y.
(b) Supposing that O0<x<y and €>0, we have x/E<y/&é& By part
(a) there exists a rational number s such that x/é<s<y/é Therefore
x<s&<y. (Show that sé is irrational.) Q.E.D.
Exercises
6.A. Prove that a non-empty finite set of real numbers has a supremum and an
infimum.
6.B. If a subset S of R contains an upper bound, then this upper bound is the
supremum of S.
6.C. Give an example of a set of rational numbers which is bounded but does not
have a rational supremum.
6.D. Give an example of a set of irrational numbers that has a rational sup-
remum.
6.E. Prove that the union of two bounded sets is bounded.
6.F. Give an example of a countable collection of bounded sets whose union is
bounded, and an example where the union is unbounded.
6.G. If S is a bounded set in R and if S, is a nonempty subset of S, then show
that
inf S <= inf S, <= sup S, = sup S.
Sometimes it is more convenient to express this in another way. Let D#§ and let
f:D— R have bounded range. If D, is a non-empty subset of D, then
inf {f(x):x € D} = inf {f(x):x € Do} = sup {f(x):x € Do} = sup {f(x):x € D}.
6.H. Let X and Y be non-empty sets and let f:X x Y — R have bounded range
in R. Let
f(x) = sup {f(x, y):y € ¥}, fly) = sup {f(x, y):x © X}.
sup
my f(x, y) = supeysup f(x, y) = supyo supx f(x, y).
6.1. Let f and f, be as in the preceding exercise and let
8aly)
= inf {f(x, y):x
© X}
6. THE COMPLETENESS PROPERTY OF R 43
Prove that
sup {g.(y):y © Y} s inf {f,(x):x € X}.
Show that strict inequality can hold. We sometimes express this inequality by
6.J. Let X be a non-empty set and let f: X > R have bounded range in R. If
aéR, show that
sup {a+ f(x):x €X}=a-+sup {f(x):x € X},
inf {a+ f(x):x€X}=a+tinf {f(x):x © X}.
6.K. Let X be a non-empty set and let f and g be defined on X and have
bounded ranges in R. Show that
exists and has the property that b?=a. This number will be denoted by Va or a” ?
and is called the positive square root of a.
6.N. Use Exercise 5.P to show that if O0<a<1, then 0<a<vVa< 1, while if
1<a, then 1<Va<a.
Projects}
6.a. If a and b are strictly positive real numbers and if n € N, we have defined a"
and b”. It follows by mathematical induction that if m, n €N, then
(i) a"a* =a":
t The projects are intended to be somewhat more challenging to the reader, but they differ
considerably in difficulty. We have put these three (rather difficult) projects here because
they belong here logically. The reader should return to them later after he has
accumulated more experience with suprema.
44 THE REAL NUMBERS
We shall adopt the convention that a°= 1 anda" = 1/a". Thus we have defined a’*
for x in Z and it is readily checked that properties (i)—(iii) remain valid.
We wish to define a* for rational numbers x in such a way that (i)-(iii) hold. The
following steps can be used as an outline. Throughout we shall assume that a and b
are real numbers exceeding 1.
(a) If r is a rational number given by r= m/n, where m and n are integers and
n>0 we define S,(a)={xeR:0<x"<a™}. Show that S,(a) is a bounded non-
empty subset of R and define a’ =sup S,(a).
(b) Prove that z = a' is the unique positive root of the equation z*=a"™. (Hint:
there is a constant K such that if O0<e<1, then (1+e)"<1+Ke. Hence if
x" <a" <y’, there exists an ¢ >0 such that
x"(1+e)"<a™<y"/(i+e)")
(c) Show that the value of a’ given in part (a) does not depend on the representa-
tion of rin the form m/n. Also show that if r is an integer, then the new definition
of a’ gives the same value as the old one.
(d) Show that if r, se Q, then a‘a’ =a" and (a‘') =a”.
(e) Show that a’b' = (ab).
(f) If re Q, r>0, then a <b if and only if a’<b’.
(g) If r, s¢ Q, then r<s if and only if a’<a‘.
(h) If c is a real number satisfying O0<c<1, we define c'=(1/c)"".. Show that
parts (d) and (e) hold and that a result similar to (g), but with the inequality
reversed, holds.
6.8. Now that a* has been defined for rational numbers x, we wish to define it for
real x. In doing so, make free use of the results of the preceding project. As
before, let a and b be real numbers exceeding 1. If ueR, let
Ta)
= {a :reQ, r <u}.
a“ =sup T,(a).
Prove that this definition yields the same result as the previous one when u is
rational. Establish the properties that correspond to the statements given in parts
(d)-(g) of the preceding project. The very important function which has been
defined on R in this project is called the exponential function (to the base a).
Some alternative definitions will be given in later sections. Sometimes it is
convenient to denote this function by the symbol
eXxPa
and denote its value at the real number u by exp,(u) instead of a".
6.y. Making use of the properties of the exponential function that were estab-
lished in the preceding project, show that exp, is an injective function with domain
R and range {y¢R:y>0}. Under our standing assumption that a > 1, this expo-
nential function is strictly increasing in the sense that if x<u, then exp,(x)<
exp.(u). Therefore, the inverse function exists with domain {v€R:v >0} and
7. CUTS, INTERVALS, AND THE CANTOR SETS 45
range R. We call this inverse function the logarithm (to the base a) and denote it
by
log..
A {
NF
B
Ae
are called the closed rays determined by a. The point a is called the end
point of these rays. These sets are often denoted by the notations
are called the half-open (or half-closed) cells determined by a and b and
are denoted by
h2hoah2>:::Dh2Iwi2D:::
of inclusions holds. It is important to notice that a nested sequence of
intervals does not need to have acommon point. Indeed, it is an exercise
to show that if I, = (n, +~), ne N, then the sequence of intervals obtained
is nested but has no common point. Similarly, if J, =(0, 1/n), ne N, then
the sequence is nested but has no common point.
However, it is a very important property of R that every nested se-
quence of closed cells has a common point. We shall now prove that fact.
7.3. NESTED CELLS PROPERTY. If néEN, let I, be a non-void closed
cell in R and suppose that this sequence is nested in the sense that
Lh2h>-:-2hL2:--
We note that, under the hypotheses of 7.3, there may be more than one
common element. In fact, if we let yn =inf{b, :n¢ N}, it is an exercise to
show that
[Enl= 1b
The Cantor Set
Fi
Fs
Fy
Figure 7.2. The Cantor set.
We now give two senses in which F is “thin.” First we observe that F does not
contain any non-void interval. For if x belongs to F and (a, b) is an open interval
containing x, then (a, b) contains some middle thirds that were removed to obtain
F. (Why?) Hence (a, b) is not a subset of the Cantor set, but contains infinitely
many points in its complement €(F).
A second sense in which F is thin refers to ‘length.””. While it is not possible to
define length for arbitrary subsets of R, it is easy to convince oneself that F cannot
have positive length. For, the length of F, is 3, that of F, is 3, and, in general, the
length of F, is 3)". Since F is a subset of F,, it cannot have length exceeding that of
F,. Since this must be true for each n in N, we conclude that F, although uncount-
able, cannot have positive length.
As strange as the Cantor set may seem, it is relatively well-behaved in many
respects. It provides us with a bit of insight into how complicated subsets of R can
be and how little our intuition guides us. It also serves as a test for the concepts
that we will introduce in later sections and whose import are not fully grasped in
terms of intervals and other very elementary subsets.
Models for R
In Sections 4—6, we have introduced R axiomatically in the sense that we
have listed some properties that we assume it to have. This approach
raises the question as to whether such a set actually exists and to what
extent it is uniquely determined. While we shall not settle these questions,
a few remarks about them is certainly appropriate.
The existence of a set which is a complete ordered field can be demon-
strated by actual construction. If one feels sufficiently familiar with the
rational field Q, one can define real numbers to be special subsets of Q and
define addition, multiplication, and order relations between these subsets
in such a way as to obtain a complete ordered field. There are two
standard procedures that are used in doing this: one is Dedekind’s method
of ‘‘cuts’’ which is discussed in the book of Rudin that is cited in the
References. The second way is Cantor’s method of “Cauchy sequences”
which is discussed in the book of Hamilton and Landin.
50 THE REAL NUMBERS
Exercises
7.A. If (A, B) is a cut in R, show that sup A =inf B.
7.B. If the cuts (A, B) and (A’, B’) determine the real numbers & and &’,
respectively, show that €<&€’ implies that AC A’, AZ A’.
7.C. Is the converse of the preceding exercise true?
7.D. Let A={xER:x =0 or x’? <2} and B={xeR:x>0 and x’>2}. Show
that (A, B) is a cut in R.
+ LEOPOLD KRONECKER (1823-1891) studied with Dirichlet in Berlin and Kummer in Bonn.
After making a fortune before he was thirty, he returned to mathematics. He is known for his
work in algebra and number theory and for his personal opposition to the ideas of Cantor on
set theory.
7. CUTS, INTERVALS, AND THE CANTOR SETS 51
7.E. Let I, = (n, +%) for ne N. Show that the sequence of intervals is nested,
but that there is no common point.
7.F. Let J, =(0,1/n) for ne N. Show that this sequence of intervals is nested,
but that there is no common point.
7.G. If I, =[a,, b.], 1é N, is a nested sequence of closed cells, show that
Q80,.8::'<a<'''sb
S's b sb.
If we put € =sup
{a, 2 € N} and 7 =inf {b,,:m
€ N}, show that [é q]= ial I.
7.H. Show that every number in the Cantor set has a ternary (= base 3) expan-
sion using only the digits 0, 2.
7.1. Show that the collection of “right hand” end points in F is denumerable.
Show that if all these end points are deleted from F, then what remains can be put
onto one-one correspondence with all of [0,1). Conclude that the set F is not
countable.
7,J. Every open interval (a, b) which contains a point of F also contains an entire
“middle third”’ set which belongs to €(F). Hence F does not contain any non-void
open interval.
7.K. By removing sets with ever decreasing length, show that we can construct a
“Cantor-like’’ set which has positive length. How large can we make the length of
this set?
7.L. Show that F is not the union of a countable collection of closed intervals.
I
THE TOPOLOGY
OF CARTESIAN SPACES
52
8. VECTOR AND CARTESIAN SPACES 53
for all s eS, then it can readily be checked that R* is a vector space under
these operations. [Here 0 is the function identically equal to zero, and —u
is the function whose value at s ES is —u(s).]
In later sections, we shall encounter many other vector spaces.
Generally we shall write x — y instead of x +(—y).
The reader will note that the scalar multiplication in a vector space Visa
function with domain R x V and range V. Many vector spaces are also
equipped with a function with domain VxV and range R that is of
importance.
8.3 DEFINITION. If V is a vector space, then an inner product (or dot
product) is a function on V x V to R, denoted by (x, y)> x - y, satisfying
the properties:
@) x-x20 forall xeV;
Gi) x-x=0 if and only if x=0;
(iii) x-y=y-x forall x, yeV;
(iv) x-(y+z)=x-yt+x-z and (x+y)-z=x-z+y-z forall x, y,
zéEV;
(v) (ax): y=a(x-y)=x-(ay) for all ac R, and x, yeV.
A vector space in which an inner product has been defined is called an
inner product space.
It is possible for different inner products to be defined in the same vector
space (cf. Exercise 8.D).
8.4 EXAMPLES. (a) The ordinary multiplication in R satisfies the
above properties, so R is an inner product space.
(b) In R’, we define
(x1, X2) + (y1, y2) = Xiyit X2ye2.
It is easy to check that this defines an inner product on R’.
(c) In R?’, we define
(X1, X25 2+ +5 Xp) * (Yay Vos - ++ Vp) = X1 Yr tH H2y2t+ + -+Xpyp.
It is easy to check that this defines an inner product on R?.
8.5 DeFiniTion. If V is a vector space, then a norm on V is a
function on V to R denoted by x + ||x|| satisfying the properties:
(i) |x| =O for all xeV;
(ii) ||x|=0 if and only if x=0;
(iii) ||ax||=|a||[x|| for all ae R, xeV;
(iv) |x + yll = [xl|+ lly] for all x, ye Vv.
A vector space in which a norm has been defined is called a normed space.
8. VECTOR AND CARTESIAN SPACES 55
As we will see in the exercises, the same vector space can have several
interesting norms.
Properties (i), (ii), and (iii) are very easily checked. Property (iv) is a bit
more complicated.
(c) In R?, we define
We shall now give a theorem which asserts that an inner product can
always be used to define a norm in a very natural way.
8.7 THEOREM. Let V be an inner product and define ||x|| by
= |[x|P+2(x - y+ flylP
< ||x{? +2 laf lly lly lP
< (lhxl]+lyl)*,
whence it follows that ||x + y|| < |]x||+ly|| for all x, y € V. QED.
We leave the proof of the following corollary as an exercise.
8.8 COROLLARY. If x, y are elements of V, then
(**) x y| = [lxllllyll.
Moreover, if y#0 then the equality can hold in (**) if and only if there is a
real number c such that x = cy.
Both of the inequalities (*) and («*) are called the Schwarz Inequality, or
the Cauchy-Bunyakovskii-Schwarz Inequality.; They will be frequently
used. The inequality 8.5(iv) is called the Triangle Inequality. We leave it
to the reader to show that
The real numbers xi, X2,...,Xp are called the first, second,..., p-th
coordinates (or components) of the vector x = (x1, X2,..., Xp).
In R’, the real number |x|] can be thought of either as the “length” of x
or as the distance from x to 0. More generally, we think of {|x — y|| as the
distance from x to y. With this interpretation, property 8.5(ii) asserts that
the distance from x to y is zero if and only if x =y. Property 8.5(iii) with
a =—1 asserts that ||x — y||=|ly — x|], which means that the distance from x
to y is equal to the distance from y to x. The Triangle Inequality implies
that
IIx—yll = lx —z{|+llz - yl
which means that the distance from x to y is no greater than the sum of the
distance from x to z and the distance from z to y.
8.9 Derinition. Let x€R° and let r>0Q. Then the set {ye R?:
jx — yl|<r} is called the open ball with center x and radius r. The
set {ye R?:|x—yllar} is called the closed ball with center x and
radius r. The set {ye R? :||x —y||=r} is called the sphere with center x
and radius r.
The notion of a ball depends on the norm. It will be seen in the
exercises that some balls are not very “round.”
It is often convenient to have relations between the norm of a vector in
R?’ and the magnitude of its components.
8.10 THEOREM. If x ={xi, X2,..., Xp) is any element of R°, then
Exercises
8.A. If V is a vector space and if x + z =x for some x and z in V, show that z=0.
Hence the zero element in V is unique.
8.B. If x+y =0 for some x and y in V, show that y=—x.
8.C. Let S={1,2,...,p}, for some peN. Show that the vector space R* is
“essentially the same”’ as the space R’.
8.D. If w, and w, are strictly positive, show that the definition
(1, X2) * (Yi, Yo) = X1Y1Wi
+ X2VoWo,
yields an inner product on R*. Generalize this to R’.
8.E. The definition
(X1, X2) - (Ys Ya) = X11
[xh= [xa]
+ [a] ++ + - + [xp
Prove that x +> ||x|], is a norm on R’.
8.G. If x = (x1, %2,...,%,) ER’, define |x|. by
(1-£)x
+ ty =x+t(y—x)
also belongs to K. Interpret this condition geometrically and show that the subsets
K,= {x € R?:||x||= 1}
is not convex.
8.R. The intersection of any collection of convex subsets of R’ is convex. The
union of two convex subsets of R’ may not be convex.
8.S. If M is any set, then a function d: M x M —> R is called a metric on M if it
satisfies:
(i) d(x, y)=0 for all x, y in M;
(ii) d(x, y)=0 if and only if x=y;
(iii) d(x, y)=d(y, x) for all x, y in M;
(iv) d(x, y) <d(x,z)+d(z, y) for all x, y, z in M.
Show that if x ||x|| is any norm on a vector space V and if we define d by
d(x, y) =||x — yl, for x, y € V, then d is a metric on V.
8.T. Suppose that d is a metric on aset M. By using Definition 8.9 as a model,
define an open ball with center x € M and radius r. Interpret the sets S, and S., in
Exercise 8.H as open balls in R® with respect to two different metrics. Interpret
Exercise 8.K as saying that a ball with center 0 relative to the metric d, (derived from
the norm in 8.6(b)) contains and is contained in balls with center 0 relative to the
metric d, derived from {| ||. Make similar interpretations of Exercise 8.L and
Theorem 8.10.
60 THE TOPOLOGY OF CARTESIAN SPACES
8.U. Let M be any set and let d be defined on M x M by the requirement that
_ fo if x=y,
dx y={} if x#y.
Show that d gives a metric on M in the sense defined in Exercise 8.S. Ifx is any point
in M, then the open ball with center x and radius 1 (relative to the metric d) consists
of precisely one point. However, the open ball with center x and radius 2 (relative to
d) consists of all of M. This metric d, is sometimes called the discrete metric on the
set M.
Projects
ab = (a’+b’)/2,
and that the equality holds if and only if a= b. (Hint: consider (a — b)’.)
(b) Let a, and a, be positive real numbers. Show that
(San) =(SefSe}
n
j=l
2 n
j=l
n
k=1
Show that the equality holds if and only if the ordered sets (a,, a2,...,4a,) and
(by, b2,..., b,) are proportional.
+ JosEpH-LoUIS LAGRANGE (1736-1813) was born in Turin, where he become professor at the
age of nineteen. He later went to Berlin for twenty years as successor to Euler and then to
Paris. He is best known for his work on the calculus of variations and analytical mechanics.
8. VECTOR AND CARTESIAN SPACES 61
= si j=l
8.8. In this project, let {a,, a,,...,a@,}, and so forth, be sets of n positive real
numbers.
(a) Itcan be proved (for example, by using the Mean Value Theorem) that if a and
b are positive and 0<a<1, then
a*b'* <aa+(1—a)b
and that the equality holds if and only if a=b. Assuming this, let r>1 and let s
satisfy .
Lt,
r os
(so that s>1 andr+s=rs). Show that if A and B are positive, then
AB<* 4B
ros
and that the equality holds if and only if A’ = B’.
(b) Let {a,,...,a,} and {b,,...,b,} be positive real numbers. If r, s>1 and
(1/r) + (1/s) = 1, establish Hélder’s Inequality+
n n tree yds
2» ab, = {2 ai} ‘{e or} .
(Hint: Let A = {3 a‘}"" and B = {> b;}"* and apply part (a) to a/A and b/B.)
(c) Using Hélder’s Inequality, establish the Minkowski Inequality?
(e) If a, a, and b, < by, then (a,— a2)(b:— b,) = O and hence
a,b, + ayb, = a,b.+ azb,.
Show that if a,< a,<-::<a, andb,<b,<--:< b,, then
9.3. Orkn Set Properties. (a) The empty set @ and the entire space
R?® are open in R?.
(b) The intersection of any two open sets is open in R?.
(c) The union of any collection of open sets is open in R®.
The intersection of the sets G, is the set F={x € R:0 < x = 1}, which is not
open.
Closed Sets
(9.2) A={xeR:0<x<1}.
This set A fails to be open in R, since it contains the point 0. Similarly, it fails to be
closed in R, because its complement in R is the set {x € R:x <0 or x = 1}, which is
not open since it contains the point 1. The reader should construct other examples
of sets which are neither open nor closed in R’.
9. OPEN AND CLOSED SETS 65
We now state the fundamental properties of closed sets. The proof of this
result follows directly from Theorem 9.3 by using DeMorgan’s laws
(Theorem 1.8 and Exercise 1.K).
9.6 CLOSED SET PROPERTIES. (a) The empty set @ and the entire
space R? are closed in R’.
(b) The union of any two closed sets is closed in R?.
(c) The intersection of any collection of closed sets is closed in R’.
Neighborhoods
9.7 DEFINITION. (a) Ifx €R’, then any set which contains an open set
containing x is called a neighborhood of x.
(b) A point x € R? is called an interior point of a set A < R? in case there
is a neighborhood of x which is entirely contained in A.
(c) A point x € R? is called a boundary point of aset A < R? in case every
neighborhood of x contains points in A and points in €(A).
(d) A point x € R? is called an exterior point of aset A < R? in case there
exists a neighborhood of x which is entirely contained in @(A).
It should be noted that given x € R’ and A € R’, there are three mutually exclusive
possibilities: (i) x is an interior point of A, (ii) x is a boundary point of A, or (iii)
x is an exterior point of A.
It follows from what we have shown that an open set contains none of its
boundary points. Closed sets are the other extreme in this respect.
Open Sets in R
(nb ineBe(e-2.x12)c
m m m m
this contradicts the choice of the m,. Therefore we have x ¢J, for this
value of n. Since xéG is arbitrary, we infer that
GoUh.
neN
It follows from the theorem just given that a subset of R is closed if and
only if it is the intersection of a countable collection of closed intervals.
(Why?) It does not follow that the countable union of closed intervals must
be closed, nor does every closed set have this property.
A generalization of this result is given in Exercise 9.G.
Exercises
9.A. Justify the assertion made about the set G, F made in Example 9.2(b).
9.B. Justify the assertions made in Example 9.2(c).
9.C. Prove that the intersection of any finite collection of open sets is open in R?’.
(Hint: use 9.3(b) and induction.)
9.D. What are the interior, boundary, and exterior points in R of the set [0, 1).
Conclude that it is neither open nor closed.
9.E. Give an example in R? which is neither open nor closed. Prove your
assertion.
9.F. Write out the details of the proof of Theorem 9.6.
9.G. Show that a subset of R’ is open if and only if it is the union of a countable
collection of open balls. (Hint: the set of all points in R? all of whose coordinates are
rational numbers is countable.)
9.H. Every open subset of R? is the union of a countable collection of closed sets.
9.1, Every closed subset of R° is the intersection of a countable collection of
open sets.
9.J. If A is any subset of R’, let A° denote the union of all open sets which are
contained in A; the set A° is called the imterior of A. Note that A° is an open set;
prove that it is the largest open set contained in A. Prove that
ACSA, (A= A°
(ANBY=A°NB’, — (R’)°= RR’.
Give an example to show that (A UB)°= A°U B® may not hold.
9.K. Prove that a point belongs to A° if and only if it is an interior point of A.
68 THE TOPOLOGY OF CARTESIAN SPACES
9.L. If A is any subset of R’, let A~ denote the intersection of all closed sets
containing A; the set A” is called the closure of A. Note that A” is a closed set;
prove that it is the smallest closed set containing A. Prove that
ASA, (AY =Aq
(AUB) =A°UB, f =o.
In this section we shall present two very important results that will be
often used in later chapters. In a sense they can be regarded as the
Completeness Property for R’, when p> 1.
We recall from Section 7 that if a = b, then the open cell in R, denoted by
(a, b), is the set defined by
(a, b)={xE R:a<x <b},
It is readily seen that such a set is open in R. Similarly, the closed cell [a, b]
in R is the set
[a, b])={xeR:a<x
<b},
which is closedin R. The Cartesian product of two intervals is usually called
a rectangle and the Cartesian product of three intervals is often called a
10. NESTED CELLS AND BOLZANO-WEIERSTRASS THEOREMS 69
The reader will recall from Section 7 that the Supremum Property of the
real number system implies that every nested sequence of non-empty closed
cells in R has a common point. We shall now prove that this property
carries over to the space R?.
10.2 NESTED CELLS THEOREM. Let (i.) be a sequence of non-empty
closed ‘cells in R? which is nested in the sense that L2In2-:--2h2°°°
Then there exists a point in R’ which belongs to all of the cells.
It is easy to see that the cells [aui, ii], kK EN, form a nested sequence of
non-empty closed cells of real numbers and hence by the completeness of
the real number system R, there is a real number y; which belongs to all
of these cells. Applying this argument to each coordinate, we obtain a
point y=(yi,..-, yp) of R? such that if j satisfies j=1,2,...,p, then y,
belongs to all the cells {[axj, bij]: k €N}. Hence the point y belongs to
all of the cells (I,,). QED.
The next result is one of the most important results in this book. It is of
basic importance and will be frequently used. It should be noted that the
conclusion may fail if either hypothesis is removed [see Examples 10.4(f, g)].
t BERNARD BoLZzAno (1781-1848) was professor of the philosophy of religion at Prague, but
he had deep thoughts about mathematics. Like Cauchy, he was a pioneer in introducing a
higher standard of rigor in mathematical analysis. His treatise on the paradoxes of the infinite
appeared after his death.
KARL WEIERSTRASS (1815-1897) was for many years a professor at Berlin and exercised a
profound influence on the development of analysis. Always insisting on rigorous proof he
developed, but did not publish, an introduction to the real number system. He also made
important contributions to real and complex analysis, differential equations, and the calculus of
variations.
10. NESTED CELLS AND BOLZANO-WEIERSTRASS THEOREMS 71
Jo
ly — wll s Vp Uh) = 357 ML).
—-——--¥_ i
Figure 10.1
72 THE TOPOLOGY OF CARTESIAN SPACES
Jp
ai Wh) <r.
For such a value of k we have . © V. Since I, contains infinitely many
elements of B, it follows that V contains at least one element of B different
from y. Therefore, y isa cluster point of B. QED.
Exercises
10.A. Let I, c R? be the open cells given by I, = (0, 1/n)*-++X(0,1/n). Show
that these cells are nested but that they do not contain any common point.
10.B. Let J, R’ be the closed intervals given by J, =[n, +%)x-+--*[n, +00).
Show that these intervals are nested, but that they do not contain any common point.
10.C. A point x is a cluster point of a set A ¢ R? if and only if every neighborhood
of x contains infinitely many points of A.
10.D. Let A={1/n:neN}. Show that every point of A isa boundary pointin R,
but that 0 is the only cluster point of A in R.
10.E. Let A, B be subsets of R’ and let x beacluster pointof AN Bin R’. Prove
that x is a cluster point of both A and B.
10.F. Let A, B be subsets of R° and let x be acluster point of AUB in R’. Prove
that x is either a cluster point of A or of B.
10.G. Show that every point in the Cantor set F is a cluster point of both F and
€(F).
10.H. If A is any subset of R’, then there exists a countable subset C of A such
that if x « A and ¢ >0, then there is an element z € C such that ||x — z||<e. Hence
every element of A is either in C or is a cluster point of C.
Projects
10.a. Let M be a set and d be a metric on M as defined in Exercise 8.5.
Reexamine the definitions and theorems of Sections 9 and 10, in order to determine
which carry over for sets that have a metric. It will be seen, for example, that the
notions of open, closed, and bounded set carry over. The Bolzano-Weierstrass fails
for suitable M and d, however. Whenever possible, either show that the theorem
extends or give a counterexample to show that it may fail.
10.8. Let F be a family of subsets of a set X which (i) contains # and X, (ii)
contains the intersection of any finite family of sets in J, and (iii) contains the union
of any family of setsin Y. Wecall J atopology for X, and refer to the setsin J as the
open sets. Reexamine the definitions and theorem s of Secticns 9 and 10, trying to
determine which carry over for sets X which have a topology 7.
Gu,
Cr
i=!
It is usually not an easy matter to prove that a set is compact, using the
definition only. We now present a remarkable and important theorem
which completely characterizes compact subsets of R’. In fact, part of the
importance of the Heine-Borel Theorem{ is due to the simplicity of
the conditions for compactness in R?.
It is readily seen that each set Gn, mE N, is open in R®. Also, the union of
all the sets G,,, m € N, consists of all points of R’ except x. Since x€ K, each
point of K belongs to some set Gn. In view of the compactness of K, it
follows that there exists a natural number M such that K is contained in the
union of the sets
Gi, G2,..., Gu.
Since the sets G,, increase with m, then K is contained in Gu. Hence the
neighborhood {z € R? :||z
— x||<1/M} does not intersect K, showing that
€(K) is open. Therefore, K is closed in R®. (See Figure 11.1 where the
closed balls complementary to the G,, are depicted.)
Next we show that if K is compact in R”, then K is bounded (that is, K is
contained in some set {x € R? :||x||<r} for sufficiently large r). In fact, for
ft EpUARD HEINE (1821-1881) studied at Berlin under Weierstrass and later taught at Bonn
and Haile. In 1872 he proved that a continuous function on a closed interval is uniformly
continuous.
(F. BE. J.) Emie Borel (1871-1956), a student of Hermite, was professor at Paris and one of
the most influential mathematicians of his day. He made numerous and deep contributions to
analysis and probability. In 1895 he proved that if a countable collection of open intervals
cover a closed interval, then they have a finite subcovering.
11. THE HEINE-BOREL THEOREM 75
ee ee ee _ hy
Figure 11.2
Some Applications
Asaconsequence of the Heine-Borel Theorem, we obtain the next result,
which is due to Cantor. It is a strengthening of the Nested Cells Theorem,
since general closed sets are considered here and not just closed cells.
11. THE HEINE-BOREL THEOREM 77
Figure 11.3
11. THE HEINE-BOREL THEOREM 79
Figure 11.4
finite number of the sets in G, say G(x.),..., G(x), contains the compact
set F. By using arcs from the circles with centers x, and radii 3 (x,), we
obtain the desired curve C. (See Figure 11.4). The detailed construction
of the curve will not be given here. O.E.D.
Exercises
11.A. Show directly from the definition (i.e., without using the Heine-Borel
Theorem) that the open ball given by ({(x, y):x?+y’?<1} is not compact in R’.
11.B. Show directly that the entire space R* is not compact.
11.C. Prove directly that if K is compact in R’ and FC K isa closed set, then F
is compact in R°.
11.D. Prove that if K is a compact subset of R, then K is compact when
regarded as a subset of R’.
11.E. By modifying the argument in Example 11.2(d), prove that the interval
J={(x, y):0=x<1,0<y = l}is compact in R’.
11.F. Locate the places where the hypotheses that the set K is bounded and that
it is closed were used in the proof of the Heine-Borel Theorem.
11.G. Prove the Cantor Intersection Theorem by selecting a point x, from F,
and then applying the Bolzano- Weierstrass Theorem 10.6 to the set {x, :n © N}.
11.H. If F is closed in R® and if
+ RENE Louls BaIRE (1874-1932) was a professor at Dijon. He worked in set-theory and
real analysis.
12. CONNECTED SETS &1
R?
(d) If 0<c<1, then the sets A={xER,x =< ¢}, B={xeER:x>c} split
the unit interval I={xeR:0 =x <= 1} into disjoint, non-empty sets with
union I. However since A is not open, this example does not show that I is
disconnected. In fact, we shall show below that the set Fis connected.
12.3. THEOREM. The closed unit interval I =[0, 1] is a connected subset
of R.
PROOF. We proceed by contradiction and suppose that A, B are open
sets forming a disconnection of I. Thus AMF and BOF are non-empty
bounded disjoint sets whose union is F. Since A and B are open, the sets
ANfFand BMI cannot consist of only one point. (Why?) For the sake of
definiteness, we suppose that there exist points a¢ A, b€B such that
O0<a<b<1. Applying the Supremum Property 6.4, we let c=
sup {x € A:x<b} so that O0<c<1; hence ce AUB. If ceA, thenc#b
and since A is open there is a point a1¢ A, c< ai, such that the interval
{c,a:] is contained in {x€A:x<b}, contrary to the definition of c.
Similarly, if c € B, then since B is open there is a point b, € B, b,<c, such
that the interval [b,,c] is contained in BNI, contrary to the definition
ofc. Hence the hypothesis that F is disconnected leads to a contradiction.
QED.
The reader should note that the same proof can be used to show that the
Open interval (0, 1) is connected in R.
82 THE TOPOLOGY OF CARTESIAN SPACES
RP
Figure 12.2
S={x+t(y—x):tel.
then it is easily seen that A, and B, are disjoint non-empty open subsets of
R. Hence the pair Ai, B: form a disconnection for the unit interval 5
contradicting Theorem 12.3. Therefore, if G is not connected, there exist
two points in G which cannot be joined by a polygonal curve in G.
Next, suppose that G is a connected open set in R” and that x belongs to
G. Let G, be the subset of G consisting of all points in G which can be
ani
—_————oo
> ae a“ —
~N a _"
Figure 12.4
Connected Sets in R
We close this section by showing that the connected subsets of R are
precisely the intervals (see Section 7).
12.8 THEOREM. A subset R is connected if and only if it is an
interval.
PARTIAL PROOF. The proof given in Theorem 12.3 can be readily
modified to establish the connectedness of an arbitrary non-void interval.
We leave the details to the reader.
Conversely, let C&R be connected and suppose that C#@. We note
that C has the property that if a, be C and a<b, then any number c
satisfying a<c<b must also belong to C; for if c€ C, then the sets A =
{xe R:x<c} and B={xeR:x>c} form a disconnection of C.
12. CONNECTED SETS 8&5
(i) Now suppose that C is bounded above and below, and let a = inf C
and b=sup C. We shall show that C must have one of the four forms
Exercises
12.A. If A and B are connected subsets of R°, give examples to show that A U B,
ANB, A \B can be either connected or disconnected.
12.B. If C&R? is connected and x is a cluster point of C, then CU {x} is
connected.
12.C. If CCR? is connected, show that its closure C™ (see Exercise 9.L) is also
connected.
12.E. If K CR? is convex (see Exercise 8.Q), then K is connected.
12.F. The Cantor set F is wildly disconnected. Show that if x, ye F, x# y, then
there is a disconnection A, B of F such that xe A, ye B.
12.G. If C, and C, are connected subsets of R, then the product C,xC, is a
connected subset of R’.
12.H. Show that the set
is connected in R*. However there does not exist a polygonal curve lying entirely in
A joining (0, 0) to other points in the set.
86 THE TOPOLOGY OF CARTESIAN SPACES
Once the real number system is at hand, it is a simple matter to create the
complex number system. We shall indicate in this section how the complex
field can be constructed.
As seen before, the real number system is a field which satisfies certain
additional properties. In Section 8, we constructed the Cartesian space R?
and introduced some algebraic operations in the p-fold Cartesian product of
R. However, we did not make R? into a field. It may come as a surprise
that it is not possible to define a multiplication which makes R’, p = 3, intoa
field. Nevertheless, it is possible to define a multiplication operation in
RXR which makes this set into a field. We now introduce the desired
operations.
13.1 DEFINITION. The complex number system C consists of all or-
dered pairs (x, y) of real numbers with the operation of addition defined by
(x yt yJa(xtx5,y+y'),
and the operation of multiplication defined by
(= x+y? , x74
3)
y?
Exercises
13.C. Describe the geometrical relation between the complex numbers z and
az+b,wherea#0. Show that the mapping defined for z € C, by f(z) =az + b, sends
circles into circles and lines into lines.
13.D. Describe the geometrical relations among the complex numbers z, Z and
1/z for z#0. Show that the mapping defined by g(z) =Z sends circles into circles
and lines into lines. Which circles and lines are left fixed under g?
13.E. Show that the inversion mapping, defined by h(z) = 1/z, sends circles and
lines into circles and lines. Which circles are sent into lines? Which lines are sent
into circles? Examine the images under h of the vertical lines given by the equation
Re z =constant, the horizontal lines Im z = constant, the circles |z|= constant.
13.F, Investigate the geometrical character of the mapping defined by g(z) =z’.
Determine if the mapping g is one-one and if it maps C onto allof C. Examine the
inverse images under g of the lines
Re z =constant, Im z = constant,
90
14. INTRODUCTION TO SEQUENCES 91
(2, 4, 6, 8, ...)
for the sequence of even integers. A more satisfactory method is to specify a
formula for the general term of the sequence, such as
(Qn:neEN).
X=(2,4,6,...,2n,...),
_ y=(L5.4..04,..),
-(,1i1 1
then we have
xey=(32,0%...,2m#),
_(,9 19 Qn?+1
_(,717
x-v=(15,4,...,7%-4,..),
Qn?-1
XY =(2,2,2,...,2,...),
3X =(6, 12, 18,...,6n,...),
X= (2, 8,18,...,2n?,...).
Similarly, if Z denotes the sequence in R given by
-=(1,0,1,..., 1-1)" )
Qos ds
then we have defined X + Z, X— Z and XZ; but X/Z is not defined, since
some of the elements in Z are zero.
We now come to the notion of the limit of a sequence.
14.3. DeFINiTION. Let X=(x,) be a sequence in R’. An element x
of R? is said to be a limit of X if, for each neighborhood V of x there is a
natural number Ky such that for all n = Ky, then x, belongsto V. Ifxisa
limit of X, we also say that X converges to x. If a sequence has a limit, we
say that the sequence is convergent. If a sequence has no limit then we say
that it is divergent.
The notation Ky is used to suggest that the choice of K will depend on
V. It is clear that a small neighborhood V will usually require a large
value of Ky in order to guarantee that x, € V for all n = Ky.
We have defined the limit of a sequence X = (x,) in terms of neighbor-
hoods. It is often convenient to use the norm in R? to give an equivalent
definition, which we now state as a theorem.
Some Examples
<
Loot .
9 l+na na
1 J
0
<TTna na = Kea”
whence it follows that |1/(1+ na)—0|<e« forn=K(e). Since ¢ >0 is arbi-
trary this shows that lim X =0.
(c) Let bE R satisfy 0<b <1 and consider the sequence (b"). We shall
show that lim(b")=0. To do this, it is cohvenient to write b in the form
__1
lta
where a >0, and to use Bernoulli’s Inequality (1+ a)" = 1+na for nEN.
(See Exercise 5.C.) Hence
nl 1 1
0<b ~ (1+a)" =TSna~ na’
e >0, then there is a natural number K(e) such that if n > K(e), then
0<c™=1=d, 5 —*<e,
0<1-c'" <h,<_t<e.
+h,
= hn
nc
Vn
Therefore |c 1|<e when n > K(e), as desired.
(e) Consider the sequence X =(n"); we shall show that lim X=1, a
tather non-obvious fact. Write n’"=1+k, with k,>0 for n>1; hence
n=(1+k,)". By the Binomial Theorem, when n> 1 we have
n= 14 nk, ED gay MD
Now let « >0 be given. Then there exists K(<) such that if n = K(e), then
1/(n—1)<e?/2; whence it follows that 0<k, <« and so
O<n”"-1=k,.<e
for n= K(e). Since ¢ >0 is arbitrary, this proves that lim (n™") = 1.
These examples show that a body of results which will make the in-
genuity employed here unnecessary would prove highly useful. We shall
obtain such results in the following two sections; but we close this section
with a result which is very often useful.
Exercises
0 (2, »
© (5). (@) (0).
is called a subsequence of X.
It may be helpful to connect the notion of a subsequence with that of the
composition of two functions. Let g be a function with domain N and range in N
and let g be strictly increasing in the sense that ifn <m, then g(n)<g(m). Then g
defines a subsequence of X = (x,) by the formula
Conversely, every subsequence of X has the form X ¢ g for some strictly increasing
function g with D(g)=N and R(g)cN.
X! = (X45 Xray ey Xm ee de
15. SUBSEQUENCES AND COMBINATIONS 99
The preceding results have been mostly directed towards proving that a
sequence converges to a given point. It is also important to know precisely
what it means to say that a sequence X does not converge to x. The next
result is elementary but not trivial and its verification is an important part
of everyone’s education. Therefore, we leave its detailed proof to the
reader.
that no number z<0O can be a limit for Z. In fact, the open set V=
{x €R:x <0} is a neighborhood of z containing none of the elements of Z.
This shows (why?) that z cannot be the limit of Z. Hence if Z has a limit,
this limit must be positive.
Combinations of Sequences
The next theorem enables one to use the algebraic operations of Defini-
tions 14,2 to form new sequences whose convergence can be predicted
from the convergence of the given sequences.
conclude that if « > 0 is given, then there exist natural numbers K,, K2 such
that if n= K,, then |ly.—yl|<«/2M and if n =K, then |x, —x||<«/2M.
Now choose K = sup {K:, K2}; then, if n = K, we infer from (15.2) that
bebfelibsfo)e(be-b)
np? xX} = b. 5 5 b*
=|p-$ 1
[ell + py Hse — a
~ |b, Bb
=e allell
Xn —X
1
well and |x||<M.
It follows that there exists a natural number Ko such that if n = Ko, then
1
Mle and |[x,||<_M.
_2n+1
neN.
" nt+5?
102 CONVERGENCE
lim X =
lim Y_2_ 2,
limZ 1
(b) If X =(xn) is a sequence in R which converges to x and if p isa
polynomial, then the sequence defined by (p(x,):n €N) converges to p(x).
(Hint: use Theorem 15.6 and induction.)
(c) Let X =(x,) be a sequence in R which converges to x and let r be a
rational function; that is, r(y) = p(y)/q(y), where p and q are polynomials.
Suppose that q(x.) and q(x) are non-zero, then the sequence (r(x,):né
N) converges to r(x). (Hint: use part (b) and Theorem 15.6.)
We conclude this section with a result which is often useful. It is
sometimes described by saying that one “‘passes to the limit in an
inequality.”
15.8 LEMMA. Suppose that X=(x,) is a convergent sequence in R?
with limit x. If there exists an element c in R? and a number r>0 such that
\|xn — cl] = r for n sufficiently large, then ||x — cll <r.
PROOF. The set V={y€R?’:||y—cl|>r} is an open subset of R’. If
x eV, then V is a neighborhood of x and so x,€ V for sufficiently large
values of n, contrary to the hypothesis. Therefore x¢ V and hence we
have |x —cl| sr. QED.
It is important to note that we have assumed the existence of the limit in this
result, for the remaining hypotheses are not sufficient to enable us to prove its
existence.
Exercises
15.A. If (x,) and (y,) are convergent sequences of real numbers and if x, = y, for
all n EN, then lim (x,) = lim (y,).
15.B. If X =(x,) and Y =(y,) are sequences of real numbers which both con-
verge to c and if Z = (z,) is a sequence such that x, = z, = y, fornéN, then Z also
converges to c.
15.C. For x, given by the following formulas, either establish the convergence or
the divergence of the sequence X = (x,):
n (-1'n
(a) X= Fa (b) x, = atl?
15. SUBSEQUENCES AND COMBINATIONS 103
2n _2n?4+3
(©) ~=307> ) x =3 ayy
(e) =n’ —n, (f) x, =sin n.
15.D. If X and Y are sequences in R’ and if X+Y converges, do X and Y
converge and have lim (X+ Y)=lim X+lim Y?
15.E. If X and Y are sequences in R’ and if X - Y converges, do X and Y
converge and have lim X - Y = (lim X) - (lim Y)?
15.F. If X =(x,) is a positive sequence which converges to x, then (Vx) con-
verges to vx. (Hint: ¥x,—Vx= (x — x)/(V%,+-Vx) when x# 0.)
15.G. If X =(x,) is a sequence of real numbers such that Y = (x,”) converges to
0, then does X converge to 0?
15.H. If x, =Vn+1-—Vn, do the sequences X = (x,) and ¥ =(Vnx,) converge?
15.1. Let (x,) be a sequence in R? such that the subsequences (x.,) and (xon+1)
converge toxe€ R’. Prove that (x,) converges to x.
15.J. Let (x,) and (y,) be sequences in R such that lim (x,) 40 and lim (x,y,.)
exists. Prove that lim (y,) exists.
15.K. Does Exercise 15.J remain true in R??
15.L. If0<a<=b and if x, =(a"+ b")'”, then lim (x,) = b.
15.M. Every irrational number in R is the limit of a sequence of rational
numbers. Every rational number in R is the limit of a sequence of irrational
numbers.
15.N. Let ACR?’ andx¢R’. Then x is a boundary point of A if and only if
there is a sequence (a,) of elements in A and a sequence (b,) of elements in @{A)
such that
lim (a,) = x = lim (6,).
15.0. Let ACR’ andx eR’. Then x isa cluster point of A if and only if there
is a sequence (a,) of distinct elements in A such that x =lim (a,).
15.P. Ifx =lim (x,) and if |x, —cl|<r for all n €N, does it follow that ||x — cl] <r?
Projects
15.a. Let d be a metric on a set M in the sense of Exercise 8.8. If X =(x,) isa
sequence in M, then an element x € M is said to be a limit of X if, for each « >0
there exists a number K(e) in N such that for all n > K(e), then d(x,, x)<«. Use
this definition and show that Theorems 14.5, 14.4, 15.2, 15.3, and 15.4 can be
extended to metric spaces. Show that the metrices d,, d,, d.. in R® give rise to the
same convergent sequences in R’. Show that if d is the discrete metric on a set,
then the only sequences which converge relative to d are those which are “‘constant
after some natural number.”
15.8. Let m denote the collection of all bounded sequences in R; let c denote
the collection of all convergent sequences in R; and let c, denote the collection of
all sequences in R which converge to zero.
(a) With the sum X+ Y and product cX as given in Definition 14.2, show that
each of the above collections is a vector space in which the zero element is the
sequence 0=(0,0,...).
104 CONVERGENCE
(b) In each of the collections m, c, Co, define the norm of X =(x,) by ||x||=
sup {|x,|:2¢€N}. Show that this definition actually yields a norm.
(c) If X and Y belong to either m, c, or co, then the product XY also belongs to it
and |X Y|| = |[X|| | Y||.. Give an example to show that equality may hold in this last
relation, and one to show that equality may fail.
(d) Show that the metric induced by the norm in part (b) in these spaces is given
by d(X, Y) =sup {|x, — y,|: 1 © N}.
(e) Show that if a sequence (X,) converges to Y relative to the metric in (d), then
each “coordinate sequence” converges to the corresponding coordinate of Y.
(Warning: X, is a sequence in R, while (X,) is a sequence in m, ¢, of co; that is, a
“sequence of sequences’”’ in R.)
(f) Give an example of a sequence (X,) in co where each coordinate sequence
converges to 0, but where d(X,, 0) does not converge to 0.
Until now the main method available for showing that a sequence is
convergent is to identify it as a subsequence or an algebraic combination of
convergent sequences. When this can be done, we are able to calculate the
limit using the results of the preceding section. However, when this
cannot be done, we have to fall back on Definition 14.3 or Theorem 14.4 in
order to establish the existence of the limit. The use of these latter tools
has the noteworthy disadvantage that we must already know (or at least
suspect) the correct value of the limit and we then verify that our suspicion
is correct.
There are many cases, however, where there is no obvious candidate for
the limit of a given sequence, even though a preliminary analysis has led to
the belief that convergence does take place. In this section we give some
results which are deeper than those in the preceding sections and which can
be used to establish the convergence of a sequence when no particular
element presents itself as the value of the limit. The first result in this
direction is very important. Although it can be generalized to R°, it is
convenient to restrict its statement to the case of sequences in R.
XiSXoSt
SM SX St -
Then the sequence X converges if and only if it is bounded, in which case
lim (x,) = sup {xa}.
PROOF. It was seen in Lemma 14.6 that a convergent sequence is
bounded. If x =lim(x,) and e >0, then there exists a natural number
16. TWO CRITERIA FOR CONVERGENCE 105
xtreme xi de
NY x* f
4 4 i 1
° wre T "T T
Figure 16.1
=>1_1
13> >. ++ poa1 >--- >):0;
106 CONVERGENCE
Dividing the powers of n into the numerators of the binomial coefficients, we have
_ 1 1 df, 1 2
recast ea} t at wet wi)
44 2 n-1
mo +moaaa) (tag) i)
1 1
<u <1t1tp+sete- tyes, n>2.
Cauchy Sequences
We now introduce the important notion of a Cauchy sequence in R”’. It
will turn out that a sequence in R? is convergent if and only if it is a Cauchy
sequence.
16.6 DEFINITION. A sequence X =(x,) in R” is said to be a Cauchy
sequence in case for every « > 0 there is a natural number M(e) such that
for all m, n= M(e), then |lxn —x,||<e.
In order to help motivate the notion of a Cauchy sequence, we shall
show that every convergent sequence in R” is a Cauchy sequence.
16.7 Lemma. If X=(x,) is a convergent sequence in R’, then X is a
Cauchy sequence.
16. TWO CRITERIA FOR CONVERGENCE 109
[x — Xml} == [x — xx]
+ [xx ~ Hal] <€,
when n= M(e/2). Therefore, the sequence X converges to the element x,
which is the limit of the subsequence X’. QED.
We are now prepared to obtain the important Cauchy Criterion. Our
proof is deceptively short, but the reader will note that the work has
already been done and we are merely putting the pieces together.
16.10 CAUCHY CONVERGENCE CRITERION. A sequence in R? is con-
vergent if and only if it is a Cauchy sequence.
PROOF. It was seen in Lemma 16.7 that a convergent sequence must be
a Cauchy sequence.
110 CONVERGENCE
x =4(x+x).
However, since the sequence X converges, so does the subsequence with
odd indices. By induction we can establish that
1 1,1
xi=1, w=1+5, Xs=1l+a+53.---,
1,1 1
Xone Lt a taat- . “baat wae
16. TWO CRITERIA FOR CONVERGENCE 111
It follows that
1 1 1
Xanti1 = 145 (145+: . +71)
aqoi.iel#_
=1+5 1-1/4 1+3Al 1 -7)
a).
Since each of these m—n terms exceeds 1/m, this difference exceeds
(m—n)/m=1-—n/m. In particular, if m =2n, we have
X2n — Xn >y
Exercises
16.A. Let x,ER satisfy x,>1 and let x,.,=2-—1/x, for neN. Show that the
sequence (x,) is monotone and bounded. What is its limit?
112 CONVERGENCE
16.B. Let y,=1 and y,,,=(2+,)"” for neN. Show that (y,) is monotone and
bounded. What is its limit?
16.C. Let a>Q and let z,>0. Define z,,,=(a+z,)'? forneN. Show that (z,)
converges.
16.D. If a satisfies O0<a<1, show that the sequence X =(a") is convergent.
Since Y=(a*") is a subsequence, we have lim X =lim Y=(lim X)’, and that
lim X =0.
16.E. Show that every sequence in R either has a monotone increasing subse-
quence or a monotone decreasing subsequence.
16.F. Use Exercise 16.E to prove the Bolzano-Weierstrass Theorem for sequ-
ences in R.
16.G. Determine the convergence or divergence of the sequence (x,), where
Xn
1 1 1 for neN.
~atitns2’ tn
16.H. Let X =(x,) and Y =(y,) be sequences in R’ and let Z=(z,) be the
“shuffled” sequence defined by z;=X,, Z2=Yi,..-5 Zan =Xns Zonet =Ynp---- Is it
true that Z is convergent if and only if X and Y are convergent and lim X =
lim Y?
16.1. Show directly that the following are Cauchy sequences:
@ (3),
1
w) (>),
+1
() (14+
1
+5).
1
16.3. Show directly that the following are not Cauchy sequences:
Project
16.«. In this project, let m, c, and cy designate the collections of real sequences
that were introduced in Project 15.8 and let d denote the metric defined in part (d)
of that project.
(a) IfreFandr=0. rn4r.-:-+4, +--+ is its decimal expansion, consider the element
X,=(r,) in m. Conclude that there is an uncountable subset A of m such that if X,
and X, are distinct elements of A, then d(X,, X,) => 1.
(b) Suppose that B is a subset of c with the property that if X and Y are distinct
elements of B, then d(X, Y)=1. Prove that B is a countable set.
(c) If j EN, let Z, =(z,,:n EN) be the sequence whose first j elements are 1 and
whose remaining elements are 0. Observe that Z, belongs to each of the metric
spaces m, c, and cy and that d(Z, Z,)=1 for j#k. Show that the sequence
(Z,:}€N) is monotone in the sense that each coordinate sequence (z,;:j €N) is
monotone. Show that the sequence (Z,) does not converge with respect to the
metric d in any of the three spaces.
(d) Show that there is a sequence (X;) in m, c, and cy) which is bounded (in the
sense that there exists a constant K such that d(X, 0) = K for all j € N) but which
possesses no convergent subsequence.
(e) (if d is a metric on a set M, we say that a sequence (X;) in M is a Cauchy
sequence if for every « >0 there exists K(e)<N such that d(X,, X,)<« whenever
j,k = K(e). We say that M is complete with respect to d in case every Cauchy
sequence in M converges to an element of M.) Prove that the sets m, c, and cy are
complete with respect to the metric d we have been considering.
(f) Let f be the collection of all real sequences which have only a finite number
of non-zero elements and define d as before. Show that d is a metric on f, but that
f is not complete with respect to d.
this sequence may diverge. For each of those points x for which the
sequence (17.1) converges there is, by Theorem 14.5, a uniquely deter-
mined point of R*. In general, the value of this limit, when it exists, will
depend on the choice of the point x. In this way, there arises a function
whose domain consists of all points x in D<R? for which the sequence
(17.1) converges in R*.
We shall now collect these introductory words in a formal definition of
convergence of a sequence of functions.
17.1 DEFINITION. Let (f,) be a sequence of functions on DCR? to
R‘, let Do be a subset of D, and let f be a function with domain containing
Do and range in R*. We say that the sequence (f,) converges on D, to f if,
for each x in Do the sequence (f,(x)) converges in R‘ to f(x). In this case
we call the function f the limit on D, of the sequence (f,). When such a
function f exists we say that the sequence (f,) converges to f on Do, or
simply that the sequence is convergent on Do.
It follows from Theorem 14.5 that, except for possible change in the
domain Do, the limit function is uniquely determined. Ordinarily, we
choose Dp to be the largest set possible; that is, the set of all x in D for
which (17.1) converges. In order to symbolize that the sequence (f,)
converges on D, to f we sometimes write
f=lim(f,) on Do, or fa—>f on Do.
We shall now consider some examples of this idea. For simplicity, we
shall treat the special case p =q = 1.
17.2. ExXAMPLEs. (a) For each natural number n, let f, be defined for
xin D=R by f.(x)=x/n. Let f be defined for all x in D=R by f(x) =0.
(See Figure 17.1). The statement that the sequence (f,) converges on R
to f is equivalent to the statement that for each real number x the numeri-
cal sequence (x/n) converges to 0. To see that this is the case, we apply
Example 14.8(a) and Theorem 15.6(b).
(b) Let D={x €R:0 <x = 1} and for each natural number n let f, be
defined by f,(x) =x" for all x in D and let f be defined by
f(x) =0, O<x<1,
=1, x=1.
(See Figure 17.2.) Itis clear that when x = 1, then f,(x) =f,(1) = 1" =1 so
that f.(1)— fl). We have shown in Example 14.8(c), that if O<x<1,
then f,(x) =x" — 0. Therefore, we conclude that (f,) converges on D to f.
(It is not difficult to prove that if x >1 then (f,(x)) does not converge at
all.)
(c) Let D=R and for each natural number n, let f, be the function
17. SEQUENCES OF FUNCTIONS 115
Figure 17.1
defined for x in D by
_ x?+nx
fi(x)=———,
and let f(x)=x. (See Figure 17.3.) Since f,(x) = (x’/n) +x, it follows from
Example 14.8(a) and Theorem 15.6(b) that (f,(x)) converges to f(x) for all
xeER.
(d) Let D=R and, for each natural number n, let f, be defined to be
f.(x) =(1/n) sin (nx +n). (See Figure 17.4.) (A rigorous definition of the
(140)
Figure 17.2
116 CONVERGENCE
lly,
\
Figure 17.3
sine function is not needed here; in fact, all we require is that |sin y| = 1 for
any real number y.) If f is defined to be the zero function f(x) =0, x ER,
then f=lim(f,) on R. Indeed, for any real number x, we have
| fu(x) —fx)|<e
Figure 17.4
17. SEQUENCES OF FUNCTIONS 117
no matter what the value of x. Therefore, we infer that the sequence (f.)
converges to f. (Note that by choosing n sufficiently large, we can make
the differences |f,,(x)—f(x)| less than ¢ for all values of x simultaneously!)
Partly to reinforce Definition 17.1 and partly to prepare the way for the
important notion of uniform convergence, we formulate the following
restatement of Definition 17.1.
17.3. LemMMa. A sequence (f.) of functions on D < R? to R* converges
to a function f on DoS D if and only if for each e > 0 and each x in Do there
is a natural number K(e, x) such that for alln = K(e, x), then
(17.2) IIf.(x) —f(x)i<e.
Since this is just a reformulation of Definition 17.1, we shall not go
through the details of the proof, but leave them to the reader as an
exercise. We wish only to point out that the value of n required in
inequality (17.2) will depend, in general, on both e>0 and xeDp. An
alert reader will have already noted that, in Examples 17.2(a—-c) the value
of n required to obtain (17.2) does depend on both e>0 and xeDo.
However, in Example 17.2(d) the inequality (17.2) can be satisfied for all
x in Do provided n is chosen sufficiently large but dependent on « alone.
It is precisely this rather subtle difference which distinguishes between
the notions of “ordinary” convergence of a sequence of functions (in the
sense of Definition 17.1) and “uniform” convergence, which we now
define.
17.4 Derinirion. A sequence (f,) of functions on DCR? to R’
converges uniformly on a subset Do of D to a function f in case for each
e >0 there is a natural number K(e) (depending on « but not on x € Dy)
such that for all n = K(e) and x € Do, then
Figure 17.5
The proof of this result merely requires that the reader negate Definition
17.4. It will be left to the reader as an essential exercise. The preceding
lemma is useful to show that Examples 17.2(a-c) do not converge uni-
formly on the given sets Do.
17.6 ExAMpLEs. (a) We consider Example 17.2(a). If m=k and
x = k, then f(x) =1 so that
| fu (i)
— f(24.)| = [1-0] = 1.
This shows that the sequence (f,) does not converge uniformly on R to f.
(b) We consider Example 17.2(b). If m =k and x. =()"", then
| fc (2) — fated] = [fi (20) =}.
Therefore, we infer that the sequence (f,) does not converge uniformly on
[0, 1] tof.
(c) We consider Example 17.2(c). If m =k and x, =k, then
[fi (x) — f (%e)| = k,
showing that (f,) does not converge uniformly on R to f.
(d) We consider Example 17.2(d). Then since
|fn(x}—f(x)| <= 1/n
for all x in R, the sequence (f,) converges uniformly on R to f.
If D&R?® and f:D > R‘, we say that f is bounded in case there exists
M>O such that ||f(x)||= M for allxeD. If f:D — R?* is bounded, then it
follows that the number ||f|lp defined by
(17.5) IIfllo
= sup {||f@)||:x
€ D}
existsin R. (We note that the norm on the right side of this equation is the
norm in the space R*.).
17.7. DeFinition. If D¢R?, then the collection of all bounded func-
tions on D to R‘ is denoted by B,,(D) or (when p and q are understood) by
B(D).
In the space B,,(D) we define the vector sum of two functions f, g and
the scalar multiple of ce R and f by
I[fn(x)
— FO) = -— NGOs If. — fll = e.
Therefore the sequence (f,,) converges uniformly on D to f. Q.E.D.
We now illustrate the use of this lemma as a tool in examining a
sequence of functions for uniform convergence. We observe first that the
norm has been defined only for bounded functions; hence we can employ it
(directly, at least) only when the sequence consists of bounded functions.
17.10 EXAMPLES. (a) We cannot apply Lemma 17.9 to the example
considered in 17.2(a) and 17.6(a) for the reason that the functions f,,
defined to be f,(x)=x/n, are not bounded on R, which was given as the
domain. For the purpose of illustration, we change the domain to obtain a
bounded sequence on the new domain. For convenience, let us take
E=(0,1]. Although the sequence (x/n) did not converge uniformly to the
zero function on the domain R [as was seen in Example 17.6(a)], the
convergence is uniform on E=[0, 1]. To see this, we calculate
Since this norm does not converge to zero, we infer that the sequence (fn)
does not converge uniformly on D =[0, 1] to f. This bears out our earlier
considerations.
(c) We consider Example 17.2(c). Once again we cannot apply Lemma
17.9, since the functions are not bounded. Again, we choose a smaller
17. SEQUENCES OF FUNCTIONS 121
(17.6) || fm(e)—
fn (0) || = [fin — fall
< € for m,n>= M(e).
Hence the sequence (f,(x)) is a Cauchy sequence in R* and so converges to
some element of R*. We define f for x in D by
we have
Iifmn (20) — fa (X)|-<e.
If we apply Lemma 15.8 it follows that if m = M(e) and xeD, then
\lfm(x)—f(x)|| Se. Since f,, is a bounded function, it follows readily from
this (how?) that f is bounded and hence belongs to B,,(D). Moreover we
conclude that (f,) converges uniformly to f on D. Q.E.D.
Exercises
In these exercises you may make use of the elementary properties of the
trigonometric and exponential functions from earlier courses.
17.A. For each neEN, let f, be defined for x >0 by f,(x)=1/(mx). For what
values of x does lim (f,(x)) exist?
17.B. For each nEN, let g, be defined for x = 0 by the formula
g(xy=nx, Osxsi/n,
_ nx
f.(x) ~ Lttn’x?’
17.H. Show that lim (e~™”) exists for x =O. Also consider the existence of
lim (xe™*).
17.1. Suppose that (x,) is a convergent sequence of points which lies, together
with its limit x, ina set DCR’. Suppose that (f,) converges on D to the function f.
Is it true that f(x) =lim (f,(x,))?
17.J. Consider the preceding exercise with the additional hypothesis that the
convergence of the (f,,) is uniform on D.
17.K. Prove that the convergence in Exercise 17.A is not uniform on the entire
set of convergence, but that it is uniform for x => 1.
17.L. Show that the convergence in Exercise 17.B is not uniform on the domain
x = O, but that it is uniform on a set x = c, where c > 0.
17.M. Is the convergence in Exercise 17.D uniform on R?
17.N. Is the convergence in Exercise 17.E uniform on I?
17.0. Is the convergence in Exercise 17.F uniform on I? Is it uniform on [c, 1]
for c >0?
—nx
-17.P. Does the sequence (xe™") converge uniformly for x = 0?
17.Q. Does the sequence (x’e~™) converge uniformly for x = 0?
17.R. Let (f.) be a sequence of functions which converges on D to a function f.
If A and B are subsets of D and it is known that the convergence is uniform on A
and also on B, show that the convergence is uniform on A U B.
17.S. Give an example of a sequence (f,) in B,,(I) such that ||f.|) = 1 for allneN
which does not have a uniformly convergent subsequence. (Hence the Bolzano-
Weierstrass Theorem does not hold in B,,().)
is the infimum of the set V of véR such that there are at most a finite
number of néN such that v< x,.
(b) The limit inferior of X, which we denote by
lim inf X, lim inf (x,), or lim (xa),
124 CONVERGENCE
HEH
+ ttt t+} + HH
Figure 18.1
is the supremum of the set W of w € R such that there are at most a finite
number of m & N such that x, <w.
While a bounded sequence need not have a limit, it always has a unique
limit superior (and a unique limit inferior). That is clear from the fact that
the number v = sup {x,:n€N} belongs to the set V, while the number
inf {x,:1 €N}—1 is a lower bound for V.
There are several equivalent ways, which are often useful, that one can
define the limit superior of a bounded sequence. (The reader is strongly
urged to attempt to prove this result before reading the proof.)
18.2 THEOREM. If X=(x,) is a bounded sequence in R, then the
following statements are equivalent for a real number x*.
(a) x*=lim sup (x,).
(b) If e >0, there are at most a finite number of ne N such that x*+e<
Xn, Dut there are an infinite number such that x*—¢ <x,
(c) If vo. =sup {x.:n = m}, then x* = inf fon :ne N}.
(d) If tn =sup {x,:n = mb}, then x*= lim (vn).
(e) If Lis the set of »v € R such that there exists a subsequence of X which
converges to v, then x* = sup L.
PROOF. Let x*=lim sup (x,) and let e>0. By Definition 18.1, there
exists a ve V withx*<v<x*+e. Therefore x*+< also belongs to V, so
there can be at most a finite number of n < N such that x*+2<x,. On the
other hand x*—« is not in V, so there are an infinite number of né N such
that x*—«<-x,. Hence (a) implies (b).
If (b) holds, given « >0, then for all sufficiently large m we have v, =
x*+e; therefore inf{v,:meN}<x*+«. But since there is an infinite
number of néN such that x*—«<x,, then x*—e<v,, for all meN and
hence x*—e <inf{om:meN}. Since e >0 is arbitrary, we deduce that
x* = inf {v, 1m €N} and (c) holds.
If the sequence (v,,.) is defined as in (c), then it is monotone decreasing
and hence inf (v,,) = lim (v,), so that (c) implies (d).
Now suppose that x* satisfies (d) and X'=(x,) is a convergent subse-
quence of X; since n, = k we have x,, <1 and hence lim X' < lim(uv,) = x*.
Conversely, note that there exists ni1¢N such that v:.-1<x,< 01.
Inductively choose n.+:> mm. such that
1
< Xn SS Ue
K+
18. THE LIMIT SUPERIOR 125
Since lim (v.) = x*, it follows that x* = lim (xa,). Therefore (d) implies (e).
Finally, let w= sup L. If « >0 is given, then there can be at most a finite
number of ne N with w+e<x, (by the Bolzano Weierstrass Theorem
16.4). Therefore w+eeV and limsupX=w+e. On the other hand
there exists a subsequence X’ converging to some number exceeding w —«;
hence w—¢ is not in V andso w—e <= limsup X. Since ¢ >0 is arbitrary,
we infer that w =lim sup X. Therefore (e) implies (a). QED.
Both of the characterizations (d) and (e) can be regarded as justifying the term
“limit superior.” There are corresponding characterizations for the limit inferior
of a bounded sequence which the reader should write out and prove.
We now establish the basic algebraic properties of the limit superior and
the limit inferior of bounded sequences.
PROOF. (a) If w<liminf(x.) and v >lim sup (x,), then there are in-
finitely many néN such that w = x,, while there are only a finite number
such that v<x,. Therefore, we must have w =< v, which implies (a).
(b) If c = 0, then multiplication by c preserves all inequalities of the
form w <= x, ete.
(b’} If c <0, then multiplication by c reverses inequalities and converts
the limit superior into the limit inferior, and conversely.
Statement (c) is dual to (d) and can be derived directly from (d) or proved
by using the same type of argument. To prove (d), let v > lim sup (x,) and
u>lim sup (y,); by definition there are only a finite number of n € N such
that v <x, and a finite number such that u<y,. Therefore there can be
only a finite number of n such that v+u<xt+y,, showing that
lim sup (x, + y,) = vu+u. This proves statement (d).
We now prove the second assertion in (e). If u>lim sup (y,), then there
can be only a finite number of natural numbers n such that u<y,. Since
Xn <= Yn, then lim sup (x,) = u, and so lim sup (x,) = lim sup (y,). QED.
Unbounded Sequences
the set V of numbers v € R such that there are at most a finite number of
néN such that v<x, isempty. Hence theinf V=+. Thus if X =(x,) is
a sequence in R which is not bounded above, then we have
lim sup (x,) = +9.
Similarly, if Y = (y,) is a sequence in R which is not bounded below, then
we have
lim inf (y.) =—2.
We note that if X =(x,) is a sequence in R that is not bounded above,
then the sets {x,:n = m} are not bounded above and so
Um = sup {Xin= m}= +00
for allmeN.
Infinite Limits
18.A. Determine the limit superior and the limit inferior of the following
bounded sequences in R.
18.G. Determine the limit superior and the limit inferior of the following sequ-
ences in R.
(a) (-1)’n), {b) (nsinn),
(c) (n(sin n)?), (d) (ntann).
18.H. Show that the sequence X =(x,) in R diverges to + if and only if
lim inf X = +0,
18.I. Show that lim sup X = + if and only if there is a subsequence X’ of X such
that lim X’ = +09,
18.J. Interpret Theorem 18.3 for unbounded sequences.
Cesaro Summation
1
On X= (Kit Aot +Xn)—X
(19.1) 1
=p lGa-x)t+Ga-x)t- + +(x, —x)}.
Since x = lim (x,), then given e >0 there is a natural number N(e) such
that if m = N(e), then ||x.—x||<e. Also, since the sequence X =(xn) is
t ERNESTO CESARO (1859-1906) studied in Rome and taught at Naples. He did work in
geometry and algebra as well as analysis.
130 CONVERGENCE
Observe that, in this array, the first index refers to the row in which the
element Xnn appears and the second index refers to the column.
19.4 DeFiniTiIon. If X =(%m.) is a double sequence in R’, then an
element x is said to be a limit (or a double limit) of X if for each positive
number ¢ there is a natural number N(e) such that for all m, n= N(e)
then ||Xmn —x||<. In this case we say that the double sequence converges
to x and write
x= lim (Xmn) or x =lim X.
19. SOME EXTENSIONS 131
| Xn — Xs|| <e.
We shall not pursue in any more detail that part of the theory of double
sequences which is parallel to the theory of (single) sequences. Rather, we
propose to look briefly at the relation between the limit as defined in 19.4
and the “iterated” limits.
To begin with, we note that a double sequence can be regarded (in at
least two ways) as giving a sequence of sequences! On one hand, we can
regard each row in the array given in (19.2) as a sequence in R’. Thus the
first row in the array (19.2) yields the sequence Y;=(x1,:ne€N)=
(X11, X12,-.+5 Xin ...)3 the second row in (19.2) yields the sequence Y2=
(x21:n€N); etc. It makes perfectly good sense to consider the limits of
the row sequences Yi, Yo,..., Ym,..- (when these limits exist).
Supposing that these limits exist and denoting them by yu, y2,..-, Ym-- +>
we obtain a sequence of elements in R’ which might well be examined for
convergence. Thus we are considering the existence of y =lim (ym).
Since the elements y,, are given by ym =lim Y,, where Yn =(X%m.:néN),
we are led to denote the limit y = lim (y,,) (when it exists) by the expression
y = lim lim (Xm).
We shal! refer to y as an iterated limit of the double sequence (or more
precisely as the row iterated limit of this double sequence).
What has been done for rows can equally well be done for columns.
Thus we form the sequences
Z1= (%mi:m EN), Z2= (Xm2:m EN),
and so forth. Supposing that the limits z;=lim Z), z2=lim Z,..., exist,
we can then consider z = lim (z,). When this latter limit exists, we denote
it by
and refer to z as an iterated limit, or the column iterated limit of the double
sequence X = (Xinn).
The first question we might ask is: if the double limit of the sequence
X =(Xmn) exists, then do the iterated limits exist? The answer to this
question may come as a surprise to the reader; it is negative. To see this,
let X be the double sequence in R which is given by X%m.=
(-1)"""(i/m+1/n), then it is readily seen that the double limit of this
sequence exists and is 0. However, it is also readily verified that none of
the sequences
Yi=(xnineEN),...,
Ym =(XmninEN),...
has a limit. Hence neither iterated limit can possibly exist, since none of
the “‘inner’’ limits exists.
The next question is: if the double limit exists and if one of the iterated
limits exists, then does this iterated limit equal the double limit? This time
the answer is affirmative. In fact, we shall now establish a somewhat
stronger result.
19.6 DovuBLe Limir THEOREM. If the double limit x = littan (%mn)
exists, and if for each natural number m the limit ym =limn (mu) exists, then
the iterated limit limm lim, (Xnn) exists and equals x.
PROOF. By hypothesis, given ¢ >0 there is a natural number N(e) such
that if m,n > N(e), then |xn.—x||<e. Again by hypothesis, the limits
Ym = lim, (Xmn) exist, and from the above inequality and Lemma 15.8 it
follows that |lym—x||= © for all m= N(e). Therefore, we conclude that
x =lim (ym). OED.
The preceding result shows that if the double limit exists, then the only
thing that can prevent the iterated limits from existing and being equal to
the double limit is that the “inner’’ limits may not exist. More precisely,
we have the following result.
19.7 COROLLARY. Suppose the double limit exists and that the limits
yn = lim (Xmn), Zn = lim (Xmn)
exist for all natural numbers m,n. Then the iterated limits
lim lim (Xmn)5 lim lim (Xn)
exist and are equal, but the double limit does not exist. However, under
some additional conditions, we can establish the existence of the double
limit from the existence of one of the iterated limits.
The reader will do well to compare this definition with Definition 17.4
and observe that they are of the same character. Partly in order to
motivate Theorem 19.10 to follow, we show that if each of the sequences
Yin is convergent, then the existence of the double limit implies that the
sequences {Y,, :m € N} are uniformly convergent.
19.9 Lemma. If the double limit of the double sequence X = (Xmn)
exists and if, for each natural number m, the sequence Yn = (X%mnin EN) is
convergent, then this collection is uniformly convergent.
PROOF, Since the double limit exists, given « >0 there is a natural
number N(«) such that if m,n = N(e), then ||xun—x|}<e. By hypothesis,
the sequence Yn =(Xmn:n EN) converges to an element y, and, applying
Lemma 15.8, we find that if m = N(e), then |ly.—x||< «. Thus if m,n =
N(e), we infer that
[inn
— Yu] = [tn — xI|+ [x — yal] < 20.
In addition, for m=1,2,..., N(e)—1 the sequence Y,, converges to Yn;
hence there is a natural number K(e) such that if m > K(e), then
||Xan — Yn | <e, m=1,2,...,N(e)—-1.
exist for all m,n éN, and that the convergence of one of these collections is
uniform. Then both iterated limits and the double limit exist and all three are
equal.
PROOF. Suppose that the convergence of the collection {Yn :m €N} is
uniform. Hence given « >0, there is a natural number N(e) such that if
n= N(e), then
(19.3) [Xn — Yl] <e
for all natural numbers m. To show that lim (ym) exists, take a fixed
number q = N(e). Since z, =lim (x,,:réN) exists, we know that if r, s =
R(e, q), then
We now show that the double limit exists. Since y=lim (yn), given
¢ >0 there is an M(e) such that if m= M(e), then lly, —yll<e. Letting
K(e)=sup {N(e), M(e)}, we again use (19.3) to conclude that if m,n =
K(e), then
It might be conjectured that, although the proof just given makes use of the
existence of both collections of single limits and the uniformity of one of them, the
conclusion may follow with the existence (and uniformity) of just one collection of
single limits. We leave it to the reader to investigate the truth or falsity of this
conjecture.
Exercises
19.C. If X,= O(Y) and X,= O(Y), we conclude that X,+ X,= O(Y) and sum-
marize this in the “‘equation”
(a) OCY)+O(Y)=OCY). Give similar interpretations for and prove that
(b) of Y)+0(Y) =o0(Y).
(c) If c#0, then o(cY) = 0(Y) and O(cY) = OCY).
(d) O(0(Y))= oY), o(OCY)) =o(Y).
(e) OCX)OCY) = O(XKY), O(X)0o(Y) =o(XY), o(X)o(Y)=o0(XY).
19.D. Show that X = o(Y) and Y = o(X) cannot hold simultaneously. Give an
example of sequences such that X = O(Y) but Y# O(X).
19.E. If X is a monotone sequence in R, show that the sequence of arithmetic
means is monotone.
19.F. If X =(x,) is a sequence in R and (a,) is the sequence of arithmetic means,
then lim sup (¢,,) = lim sup (x,). Give an example where inequality holds.
19.G. If X =(x,) is a sequence of positive real numbers, then is (o,.) monotone
increasing?
19.H. If a sequence X =(x,) in R’ is Cesaro summable, then X=o(n). (Hint:
X, = no, —(— 1)6,-4.)
19.J. Let X be a monotone sequence in R. Is it true that X is Cesaro summable
if and only if it is convergent?
19.J. Give a proof of Theorem 19.5.
19.K. Consider the existence of the double and the iterated limits of the double
sequences (x,,,), Where X,, is given by
136
20. LOCAL PROPERTIES OF CONTINUOUS FUNCTIONS 137
f(a) Vv
R’
Figure 20.1
We now give two equivalent statements which could have been used as
the definition.
20.2 THEOREM. Leta be a point in the domain D(f) of the function f.
The following statements are equivalent:
(a) fis continuous at a.
(b) If « >0, there exists a number &(e)>0 such that if xe D(f) and
|x — al|< 8(e), then ||f(x)— fla)i<e.
(c) If (x.) is any sequence of elements of D(f) which converges to a, then
the sequence (f(x,)) converges to f(a).
PROOF. Suppose that (a) holds and that e>0, then the ball V.=
{fy € R*:|ly — f(a)||<} is a neighborhood of the point f(a). By Definition
20.1 there is a neighborhood U of a, such that if x «UN D(f), then
f(x)e V.. Since U is a neighborhood of a, there is a positive real number
8(e) such that the open ball with radius 6(¢) and center a is contained
in U. Therefore, condition (a) implies (b).
Suppose that (b) holds and let (x,) be a sequence of elements in D(f)
which converges to a. Let ¢>0 and invoke condition (b) to obtain a
138 CONTINUOUS FUNCTIONS
—_
———
-—-—
a
Figure 20.2
fx)=0, x <0,
=1, x>0.
It may be seen that f is continuous at all points a#0. We shall show that f
is not continuous at 0 by using the Discontinuity Criterion 20.3. In fact, if
Xn = 1/n, then the sequence (f(1/n)) = (1) does not converge to f(0). (See
Figure 20.3.)
{ PETER GusTAV LEJEUNE DIRICHLET (1805-1859) was born in the Rhineland and taught at
Berlin for almost thirty years before going to Gottingen as Gauss’ successor. He made
fundamental contributions to number theory and analysis.
20. LOCAL PROPERTIES OF CONTINUOUS FUNCTIONS 141
ce
yt 4 !
“lyn 1
n 3 2
Figure 20.3
(h) Let D(f)={x e.R:x>0}. For any irrational number x >0, we define f(x) =
0; for a rational number of the form m/n, with the natural numbers m, n having no
common factor except 1, we define f(m/n) =1/n. We shall show that f is continu-
ous at every irrational number in D(f) and discontinuous at every rational number
in D(f). The latter statement follows by taking a sequence of irrational numbers
converging to the given rational number and using the Discontinuity Criterion.
Let a be an irrational number and ¢ > 0; then there is a natural number n such that
1lj/n<e. If 6 is chosen so small that the interval (a — 6, a+5) contains no rational
number with denominator less than n, then it follows that for x in this interval we
have |f(x) —f(a)| =|f(x)| <= 1/n<e. Thus f is continuous at the irrational number
a. Therefore, this function is continuous precisely at the irrational points in its
domain.
G) This time, let D(f) = R? and let f be the function on R’ with values in R?
defined by
. &
se)=inf{ al
if {|(x, y)—(a, b)||<8(e), then we have ||f(x, y)—f(a, b)||<e, proving that f is con-
tinuous at the point (a, b).
Combinations of Functions
Figure 20.4
Exercises
f(x+y)=f(x)+fly)
for all x, ye R. Show that an additive function which is continuous at x =0 is
continuous at any point of R. Show that a monotone additive function is continu-
ous at every point.
20.M. Suppose that f is a continuous additive function on R. If c = f(1), show
that f(x)=cx for all x in R. (Hint: first show that if r is a rational number, then
f(r) =cr)
20.N. Let g: R — R satisfy the relation
g(xt+y)=ge(x)g(y) forx,yeR.
Show that, if g is continuous at x =0, then g is continuous at every point. Also, if
g(a) =0 for some ac R, then g(x) =0 for allx ER.
20.0. If |f| is continuous at a point, then is it true that f is also continuous at this
point?
20.P. Let f, g:R’ — R be continuous at a point a ¢ R’ and let h, k be defined on
R’ to R by
h(x) =sup {f(x), g(x)}, k(x) = inf (f(x), g()}-
Show that h and k are continuous at a. (Hint: note that sup {b, c}=i(b+c+|b—e])
and inf {b, c)={b +c —|b —c]).)
20.Q. If xe R, we often define [x] to be the greatest integer n¢ Z such that
n= x. The map x +> [x] is called the greatest integer function. Sketch the graphs
and determine the points of continuity of the functions defined for x ¢ R by
exercise. Let
Projects
20.8. Let P={x €R:x >0} and let h: P— R be a function not identically zero
which satisfies the functional equation
f(ax
+ by +- -+ez)= af(x)+bf(y)+-+-+cf(z).
It is readily seen that the functions in Examples 20.6(b) and 20.6(i) are
linear functions for the case p = q = 1 and p = q = 2, respectively. In fact it
is not difficult to characterize the most general linear function from R? to
R’.
21.2 THEOREM. [If f is a linear function with domain R?’ and range in
R?, then there are pq real numbers (cy), 1=i<q, 1=j sp, such that if
xX = (x1, X2,...,Xp) is any point in R°, and ify =(y1, y2,.--, Ya) = f(x) is its
image under f, then
Yr = Ci1X1 + Cy2X2 + + ++ CipXp,
(21.2)
Yq = CqiX1+ CqoX2 t+ + + + CopXp.
Thus the real number c, is the ith coordinate of the point f(e).
148 CONTINUOUS FUNCTIONS
f(x) = xifler)
+ x2f(e2) ++ + ++ xpf(ep).
This shows that the coordinates of f(x) are given by the relations (21.2),
as asserted.
Conversely, it is easily verified by direct calculation that if the relations
(21.2) are used to obtain the coordinates y, of y from the coordinates x; of
x, then the resulting function satisfies the relation (21.1) and so is linear.
We shall omit this calculation, since it is straightforward. QED.
C1 C12 Cip
\aibi
+ dab +: +- apbp |? = far?
+ ax? + ++ ap’ Hbi
+ bo? +- +b,
lot? = {2 % lea}
P
i=1j
(21.5) lolUrcot=
{3% lout} te
q Pp 172
Exercises
21.A. Show that f: R’ > R? is a linear function if and only if f(ax) = af(x) and
f(x+y) =f(x)+f(y) for all ae R and all x, ye R’.
21.B. If f is a linear function of R’ into R*, show that the columns of the matrix
representation (21.4) of f indicate the elements in R* into which the elements
e,=(1,0,...,0), e2.=(0,1,...,0),...,¢,=(0,0,...,1) of R’ are mapped by f.
21.C. Let f be a linear function of R? into R* which sends the elements e, =
(1,0), e2=(0,1) of R’* into the vectors f(e.)=(2, 1,0), f(e.)=(1, 0, -0) of R°.
Give the matrix representation of f. What vectors in R° are the images under f of
the elements (2,0), (1, 1), and (1, 3)?
21.D. If f denotes the linear function of Exercise 21.C, show that not every
vector in R? is the image umerf of a vector in R?.
150 CONTINUOUS FUNCTIONS
21.E. Let g be any linear function on R’ to R®. Show that not every element of
R’ is the image under g of a vector in R’.
21.F. Let h be any linear function on R* to R’. Show that there exist non-zero
vectors in R® which are mapped into the zero vector of R’ by h.
21.G. Let f be a linear function on R’ to R? and let the matrix representation of
f be given by
a b
c od
21.1. Let g bea linear function from R’ to R*. Show that g is one-one and only
if g(x)=0 implies that x =0.
21.J. If h is a one-one linear function from R’ onto R’, show that the inverse
h” is a linear function from R’ onto R?’.
21.K. Show that the sum and the composition of two linear functions are linear
functions.
21.L. If f is a linear map on R’ to R‘, define
21.P. If (21.4) gives the matrix for f, show that |c,|<|lfll,, for all i, j.
The formulas (22.1) and (22.2) are two representations of D(f) as the
union of f~'(B) with another set with which it has no common points.
Therefore, we have C,N D(f) =f (C).
Suppose that (b) holds and that H is closed in R*. Apply the argument
just completed in the case where B= R*\ H and C=H. Then B and B,
are open sets in R* and R’, respectively, so C, = R* \ B, is closed in R?.
This shows that (b) implies (c).
To see that (c) implies (b), use the above argument with B = R” \ G,
where G is an open set in R*. O.B.D.
In the case where D(f)=R’, the preceding result simplifies to some
extent.
22.2 Coroitary. Let f be defined on all of R?’ and with range in R*.
Then the following statements are equivalent:
(a) f is continuous on R°;
(b) if G is open in R4, then f~'(G) is open in R’;
(c) if His closed in R4, then f *(H) is closed in R’.
It should be emphasized that the Global Continuity Theorem 22.1 does
not say that if f is continuous and if G is an open set in R’, then the direct
image f(G)={f(x):x €G}is open in R*. In general, a continuous function
need not send open sets to open sets or closed sets to closed sets. For
example, the function f on R to R, defined by
fe)=Ths
is continuous on R. [In fact, it was seen in Examples 20.5(a) and (c) that the
functions fi(x)=1, and f.(x)=x’, for x¢R, are continuous at every
point. From Theorem 15.6, it follows that
fox) =14+ 27, xeER,
is continuous at every point and, since f, never vanishes, this same theorem
implies that the function f given above is continuous on R.] If G is the open
set G = (-1, 1), then f(G) = @, 1], which is not open in R. Similarly, if H is
the closed set H={x € R:x = 1}, then f(H) =(0, 3], which is not closed in
R. Similarly, the function f maps the set R, which is both open and closed in
R, into the set f(R) = (0, 1], which is neither open nor closed in R.
The moral of the preceding remarks is that the property of a set being open or
closed is not necessarily preserved under mapping by a continuous function.
However, there are important properties of a set which are preserved under
continuous mapping. For example, we shall now show that the properties of
connectedness and compactness of sets have this charzcter.
22. GLOBAL PROPERTIES OF CONTINUOUS FUNCTIONS 153
Preservation of Connectedness
The very word ‘‘continuous”’ suggests that there are no sudden “‘breaks”
in the graph of the function; hence the next result is by no means
unexpected. However, the reader is invited to attempt to provide a
different proof of this theorem and he will come to appreciate its depth.
22.4 BOLZANO’s INTERMEDIATE VALUE THEOREM. Let H<D(f)
be a connected subset of R® and let f be continuous on H and with values in R.
If k is any real number satisfying
inf {f(x):x € H}<k <sup {f(x):x € H},
then there is at least one point of H where f takes the value k.
PROOF, If kéf(H), then the sets A={teR:t<k}, B={teR:t>k}
form a disconnection of f(H), contrary to the previous theorem. QED.
Preservation of Compactness
We now demonstrate that the important property of compactness is
preserved under continuous mapping. We recall that it is a consequence
of the important Heine-Borel Theorem 11.3 that asubset K of R? is compact
154 CONTINUOUS FUNCTIONS
if and only if it is both closed and bounded in R’. Thus the next result could
be rephrased by saying that if K is closed and bounded in R? and if f
is continuous on K and with range in R*, then f(K) is closed and bounded
in R4,
22.5 PRESERVATION OF COMPACTNESS. If K<&D(f) is compact
and f is continuous on K, then f(K) is compact.
FIRST PROOF. We assume that K is closed and bounded in R? and shall
show that f(K) is closed and bounded in R*. If f(K) is not bounded, for each
neé N there exists a point x, in K with |f(x,)||=n. Since K is bounded, the
sequence X=(x,) is bounded; hence it follows from the Bolzano-
Weierstrass Theorem 16.4 that there is a subsequence of X which converges
toanelement x. Since x, € K for n EN, the point x belongs to the closed set
K. Hence f is continuous at x, so f is bounded by |[f(x)||+ 1 on a neighbor-
hood of x. Since this contradicts the assumption that ||f(x,)|| =, the set
f(K) is bounded.
We shall prove that f(K) is closed by showing that any cluster point y of
f(K) must be contained in this set. In fact, ifn isa natural number, there isa
point z, in K such that ||f(z.)—yl|<1/n. By the Bolzano-Weierstrass
Theorem 16.4, the sequence Z =(z,) has a subsequence Z' = (Znq)) which
converges to anelement z. Since K is closed, then z € K and f is continuous
at z. Therefore,
Now if the value M is not attained, then the union of the family € ={C,} of
open sets contains all of K. Since K is compact and the family {C, N K} is
increasing, there is an re N such that KCC, But then we have f(x)<
M-—1/r for all x € K, contrary to the fact that M =sup f(K). QED.
linearity of f we have
pqlteoll=
1
Geile
u
whence it follows that ||f(u)|] = mllull for all ue R? (since the result is trivial
for u =0).
Conversely, suppose ||f(x)|| < m ||x|| for allx e R®. If f(x:) = f(x2), then we
have
Since Hj, is a subset of f(K) = D(g), we can write this last equation as
Hin D(g)=87"(H).
From the Global Continuity Theorem 22.1(c), we infer that g=f' is
continuous. QED.
We shall close this section with the introduction of some notation that will
be convenient.
22.10 Derimnirion. If DCR?, then the collection of all continuous
functions on D to R° is denoted by C,,(D). The collection of all bounded
continuous functions on D to R* is denoted by BC,,(D). When p and q are
understood, we will denote these collections merely by C(D) and BC(D).
22.11 THEOREM. (a) The spaces C,,(D) and BC,,(D) are vector
spaces under the vector operations
G+ gx)=f(x)+e), (ef)()=ef(x) — forxe D.
(a) The space BC,,(D) is a normed space under the norm
Exercises
22.A. Interpret the Global Continuity Theorem 22.1 for the real-valued func-
tions f(x)=x* and g(x)=1/x, x#0. Take various open and closed sets and
determine their inverse images under f and g.
22.B. Let H:R — R be defined by
Exhibit an open set G such that h~’(G) is not open in R, and a closed set F such that
h“(F) is not closed in R.
22.C. If f is bounded and continuous on R’ to R and if f(x.) >0, show that f is
strictly positive on some neighborhood of x». Does the same conclusion hold if f is
merely continuous at Xo?
22.D. If p:R°—>R is a polynomial and ceR, show that the set
{(x, y): p(x, y)<c} is open in R’.
22.E. If f:R’—R is continuous on R°’ and a<f, show that the set
{xe R’:a=f(x)=8} is closed in R’.
22.F. A subset DC R° is disconnected if and only if there exists a continuous
function f:D — R such that f(D) ={0, 1}.
22.G. Let f becontinuouson R*to R*. Define the functions g,, gon R to R® by
Project
22.a. The purpose of this project is to show that many of the results of Section 22
hold for continuous functions whose domains and ranges are contained in metric
spaces. (In establishing these results we may either observe that earlier definitions
apply to metric spaces or can be reformulated to do so.)
(a) Show that Theorem 20.2 can be reformulated for a function from one metric
space to another one.
(b) Show that the Global Continuity Theorem 22.1 holds without change.
(c) Prove that the Preservation of Connectedness Theorem 22.3 holds.
(d) Prove that the Preservation of Compactness Theorem 22.5 holds.
f(x)
- Flu)| = 2 |x — ul]
and so we can choose 6(e, u)=e/2 for all values of u.
23. UNIFORM CONTINUITY AND FIXED POINTS 159
s@)-awl= 5
and that this inequality cannot be improved, since equality actually holds for
x=u-56. If we want to make |g(x)— g(u)|<e, then the largest value of &
we can select is 2
eu
ale, "Then"
inf {S4—-u>0}=0,
Hence if we define 8(¢) = ea’/(1+ ea), then we can use this number for all
points u = a.
In order to help fix these ideas, the reader should look over Examples 20.5
and determine in which examples the 6 was chosen to depend on the point in
question and in which ones it was chosen independently of the point.
With these preliminaries we now introduce the formal definition.
23.1 DeFINITION. Let f have domain D(f)in R’ andrangein R*. We
say thatf is uniformly continuous on aset A ¢ D(f) if foreach e > Othere isa
8(e)>0 such that if x and u belong to A and |lx—ul|= 8(e), then
Ilf(x) — f(w)|| se.
It is clear that if f is uniformly continuous on A, then it is continuous at
every point of A. In general, however, the converse does not hold. It is
useful to have in mind what is meant by saying that a function is not
uniformly continuous, so we state such a criterion, leaving its proof to the
reader.
23.2 LEMMA. A necessary and sufficient condition that the function f is
not uniformly continuous on AC D(f) is that there exist eo.>0, and two
160 CONTINUOUS FUNCTIONS
sequences X = (xx), Y =(yn) in A such that ifn €N, then ||xn— yn|| = 1/n and
If) — Fn)I] > €0.
As an exercise the reader should apply this criterion to show that
g(x) =1/x is not uniformly continuous on D(g)={x:x >O}.
We now present a very useful result which assures that a continuous
function is automatically uniformly continuous on any compact set in its
domain.
23.3. UNIFORM CONTINUITY THEOREM. Let f be a continuous function
with domain D(f) in R? and range in R*. IfK < D(f) is compact, then f is
uniformly continuous on K.
FIRST PROOF. Suppose that f is not uniformly continuous on K. By
Lemma 23.2 there exists ¢) >0 and two sequences (x,) and (y,) in K such
that if née .N, then
(23.1) llxx— yal} 1/n, —— {If(%
— flyn)||>
n)£0.
Since K is compact in R’, the sequence X is bounded; by the Bolzano-
Weierstrass Theorem 16.4, there is a subsequence (Xn«)) of (x.) which
converges to an element z. Since K is closed, the limit z belongs to K and f
iscontinuous at z. Itis clear that the corresponding subsequence (yna)) of Y
also converges to z.
It follows from Theorem 20.2(c) that both sequences (f(X%.Go)) and (f(yng))
converge to f(z). Therefore, when k is sufficiently great, we have ||f(xna) —
f(yac)||< eo. But this contradicts the second relation in (23.1).
SECOND PROOF. (A shorter proof could be based on the Lebesgue
Covering Theorem 11.5, but we prefer to use the definition of
compactness.) Suppose that f is continuous at every point of the compact
set K. According to Theorem 20.2(b), given « >0 and u in K there is a
number 6(ze,u)>0 such that if x¢K and ||x—ull<SGe,u), then
f(x) -f(@)||<3e. For each u in K, form the open ball G(u)=
{x eR? :|x —ul|<48Ge, u)}; then the set K is certainly contained in the
union of the family @ = {G(u): u € K}, since to each point u in K there is an
open ball G(u) which contains it. Since K is compact, it is contained in the
union of a finite number of sets in the family G, say, G(u:),..., G(un). We
now define
8(e)
= inf {8Ge, u),..., Ge, und},
and we shall show that 6(<) has the desired property. For, suppose that x, u
belong to K and that ||x — ul] = 6(c). Then there exists a natural number k
with 1=<k =< N such that x belongs to the set G(u.); that is, ||x —ul|<
76(e, ux). Since &(e) <46(e, ux), it follows that
||u — a] = [lu — x]]+|]x — ux |< 8 Ge, ux).
23. UNIFORM CONTINUITY AND FIXED POINTS 161
(23.2) If)
— fl] = A |x — ul
for all points x, uin D(f). Incase the inequality (23.2) holds with a constant
A <1, the function is called a contraction.
It is clear that if relation (23.2) holds, then on setting &(e) = e/A one can
establish the uniform continuity of f on D(f). Therefore, if f satisfies a
Lipschitz condition, then f is uniformly continuous. The converse, how-
ever, is not true, as may be seen by considering the function defined for
D(f) =I by f(x) =x. If (23.2) holds, then setting u=0 one must have
|f(x)| = A |x| for some constant A, but it is readily seen that the latter
inequality cannot hold.
By recalling Theorem 21.3, we see that a linear function with domain R?
and range in R® satisfies a Lipschitz condition. Moveover, it will be seen in
Section 27 that any real function with a bounded derivative also satisfies a
Lipschitz condition.
Iff is a function with domain D(f) and range in the same space R®, then a
point u in D(f) is said to be a fixed point of f in case f(u) = u. A number of
important results can be proved on the basis of the existence of fixed points
of functions so it is of importance to have some affirmative criteria in this
direction. The first theorem we give is elementary in character, yet it is
often useful and has the important advantage that it provides a construction
of the fixed point. For simplicity, we shall first state the result when the
domain of the function is the entire space.
Since 0< C <1, the sequence (C””’) converges to zero. Therefore, (x,) isa
Cauchy sequence. If u = lim (x,), then it is clear from (23.3) that u is a fixed
point of f. From (23.5) and Lemma 15.8, we obtain the estimate
Cc
(23.6) lu — xa]] <= 1-¢C \|x2— xall
In case the function f is not defined on all of R’, then somewhat more care
needs to be exercised to assure that the iterative definition (23.3) of the
sequence can be carried out and that the points remain in the domain of f.
Although some other formulations are possible, we shall content ourselves
with the following one.
23.A. Examine each of the functions in Example 20.5 and either show that the
function is uniformly continuous or its domain or that it is not.
23.B. Give a proof of the Uniform Continuity Theorem 23.3 by using the
Lebesgue Covering Theorem 11.5.
+L. E. J. BROUWER (1881-1966) was professor at Amsterdam and dean of the Dutch school of
mathematics. In addition to his nearly contributions to topology, he is noted for his work on
the foundations of mathematics.
164 CONTINUOUS FUNCTIONS
Project
23.a. This project introduces the notion of the “oscillation” of a function on a set
and ata point. Let I=[a, b]<R and let f:I > R be bounded. If A CI we define
the oscillation of f on A to be the number
Also, if w,(c) <a, then there exists 6 >0 such that O,(Ns) <a.
(c) Show that f is continuous at c € I if and only if w,(c) = 0.
(d) Ifa >0 and if w,(x)< a for all x € [, then there exists 6 > 0 such that ifA ¢ I is
such that its diameter d(A) = sup {|x — y|: x, y € A} is less than 8, then 0,(A)<a.
(e) Ifa >0, then the set D, = {x € I: w(x) = a} isaclosed set in R. Show that
D=UD.=U Din
a>d nen
There are many times when one needs to consider a sequence of continu-
ous functions. In this section we shall present several interesting and
important theorems about such sequences. Theorem 24.1 will be used
very often in the following and is a key result. The remaining theorems
will not be used frequently, but the reader should be familiar with their
statements, at least.
In this section the importance of uniform convergence should become
clearer. We recall that a sequence (f,.) of functions on a subset D of R? to
R’‘ is said to converge uniformly on D to f if for every e > 0 there is an N(e)
such that if n= N(e) and x e€D, then |lf.(x)—f(x)||<e. We recall from
Theorem 17.9 that this is true if and only if lf. —fllb > 0, when (f,) is a
bounded sequence.
Thus, despite the simple character of the continuous functions f,,, the limit
function is not continuous at the point x = 1.
Although the extent of the discontinuity of the limit function in the
example just given is not very great, it is evident that more complicated
examples can be constructed which will produce more extensive dis-
continuity. It would be interesting to investigate exactly how discontinu-
ous the limit of a sequence of continuous functions can be, but this
investigation would take us too far afield. Furthermore, for most applica-
tions it is more important to find additional conditions which will guarantee
that the limit function is continuous.
We shall now establish the important fact that uniform convergence of a
sequence of continuous functions is sufficient to guarantee the continuity of
the limit function.
24.1 THsoreM. LetF = (f,) be a sequence of continuous functions with
domain D in R? and range in R° and let this sequence converge uniformly on
D to a function f. Then f is continuous on D.
PROOF. Since (f,) converges uniformly on D to f, given ¢ >0 there is a
natural number N = N(e/3) such that ||fv(x) — f(x)||< e/3 for all x in D. To
show that f is continuous at a point a in D, we note that
(24.1) f(x)
— fl@)Il < [LF Ox) — fin )II+ Ilfie)
— fix (@)I| + [lf (a) — f(a]
< ¢/3 +|lfw(x)— f(a)||+€/3.
Since fy is continuous, there exists a number 8= 8(e/3, a, fy) >0 such
that if |x —al|<6 and x €D, then ||fx(x)—fx(a)||< 2/3. (See Figure 24.1.)
Therefore, for such x we have |f(x)—f(a)||<e. This establishes the con-
tinuity of the limit function f at the arbitrary point ain D. O.E.D.
We remark that, although the uniform convergence of the sequence of
continuous functions is sufficient for the continuity of the limit function, it is
(% fy)
(a, f,,(2))
Figure 24.1
24, SEQUENCES OF CONTINUOUS FUNCTIONS 167
Approximation Theorems
For many applications it is convenient to “approximate” continuous
functions by functions of an elementary nature. Although there are several
reasonable definitions that one can use to make the word “‘approximate”’
more precise, one of the most natural as well as one of the most important is
to require that at every point of the given domain the approximating
function shall not differ from the given function by more than the preas-
signed error. This sense is sometimes referred to as ‘‘uniform approxima-
tion” and it is intimately connected with uniform convergence. We suppose
that f is a given function with domain D = D(f) contained in R’ and range in
R’. We say that a function g approximates f uniformly on D to within < > 0,
(x3, f(x3))
Approximation by Polynomials
We shall now prove a deeper, more useful, and more interesting result
concerning the approximation by polynomials. First, we prove the Weier-
strass Approximation Theorem for p = q = 1, by using the polynomials of S.
Bernstein.
24.6 DEFINITION. Let f bea function with domain I =[0, 1] and range
in R. The nth Bernstein polynomial for f is defined to be
(7)a —x)"* which may be seen to be very small for those values of k for which
k/nisfarfrom x. In fact, the function ¢, is non-negative on I and takes its maximum
value at the point k/n. Moreover, as we shall see below, the sum of all the ¢,.(x),
k=0,1,...,n,is 1 foreachx inL
(2) ~ ce ky!
+ SERGE N, BERNSTEIN (1880-1968) made profound contributions to analysis, approximation
theory, and probability. He was born in Odessa and was a professor in Leningrad and
Moscow.
170 CONTINUOUS FUNCTIONS
(24.4) (CD
m—1\___(n—1)!_—_k(n
er essinices sreerit)
n-2\_— (n-2)! —_sik(k -1)/n
(24.5) (0-3) * Goel net le)
Now let s =x and t =1—x in (24.3), to obtain
_*< k AY ey yack
=D E(h)x (1-x)"™.
Deere £O Qe
Therefore we conclude that
Multiplying (24.6) by x’, (24.7) by —2x, and adding them to (24.8), we obtain
(24.7) says the same thing for the function fi(x) =x. Formula (24.8) asserts
that the nth Bernstein polynomial for the function f.(x) = x? is
B(x; fa) = (1 — 1/n)x?+ (A/n)x,
which converges uniformly on I to f2. We shall now prove that if f is any
continuous function on I to R, then the sequence of Bernstein polynomials
has the property that it converges uniformly on I to f. This will give us a
constructive proof of the Weierstrass Approximation Theorem. In the
process of proving this theorem we shall need formula (24.9).
24.7 BERNSTEIN APPROXIMATION THEOREM. Let f be continuous on
I with values in R. Then the sequence of Bernstein polynomials for f,
defined in equation (24.2), converges uniformly on I to f.
PROOF. On multiplying formula (24.6) by f(x), we get
foo = % #eo(f)sta
xy
Therefore, we obtain the relation
The sum taken over those k for which |x —k/n| = n~™, that is, (x —k/m)’ >
n‘?, can be estimated by using formula (24.9). For this part of the sum in
172 CONTINUOUS FUNCTIONS
1 M
=< 2MVa{— x 1-x ber,
n ( ) Vn
f(x) — Bu(x)|<2e,
independently of the value of x. This shows that the sequence (B,)
converges uniformly on I to f. OED.
Exercises
fi@)=flx)e---=f)eful)e---
If lim (f,(c))=0 for some ceD and « >@; show that there exists meN and a
neighborhood U of c such that if n= m and xe UND, then f,(x)<e.
24.H. Use the preceding exercise to prove the following result of U. Dini.+ If
(f.) is a monotone sequence of continuous functions which converges at each point
of a compact set K in R° to a function f which is continuous on K, then the
convergence is uniform on K.
24.1. Show, by examples, that Dini’s Theorem fails if we drop either of the
hypotheses that K is compact or that f is continuous.
24.J. Prove the following theorem of G. Pélya.¢ If for each n € N the function f,
on I to R is monotone increasing and if f(x) =lim (f,(x)) is continuous on FE, then
the convergence is uniform on I. (Observe that it is not assumed that f, is
continuous.)
¥ ULIisse Dini (1845-1918) studied and taught at Pisa. He worked on geometry and analysis,
particularly Fourier series.
+ GeorGE PoLya (1887- ) was born in Budapest and taught at Ziirich and Stanford. He
is widely known for his work in complex analysis, probability, number theory, and the theory
of inference.
174 CONTINUOUS FUNCTIONS
(c) If c does not belong to the domain D of f, then the deleted limit exists if
and only if the non- deleted limit exists.
Part (b) of the lemma just stated shows that the notion of the non-
deleted limit is somewhat more restrictive than that of the deleted limit.
Part (c) shows that they can be different only in the case where c belongs to
D. To give an example where these notions differ, consider the function f
on R to R defined by
f(x)=0, x#0,
(25.3)
=1, x=0.
If c =0, then the deleted limit of f at c=0 exists and equals 0, while the
non-deleted limit does not exist.
We now state some necessary and sufficient conditions for the existence
of the limits, leaving their proof to the reader. It should be realized that in
part (c) of both results the limit refers to the limit of a sequence, which was
discussed in Section 14.
f(x) belongs to V for all x#c for which xe UND, in which case limf
exists and equals f(c). Conversely, statements (b) and (c) are readily seen
to imply (a). O.E.D.
If f and g are two functions which have deleted (respectively, non-
deleted) limits at a cluster point c of D(f+g)=D(f)ND(g), then their
sum f +g has a deleted (respectively, non-deleted) limit at c and
a=Lim gof.
in formula (25.3) and let g=f and c=0. Then gef is given by
(g°f\(x)=1, x0,
=0, x=0.
(a) p(r)
= sup {f(x):0<||x
—cl| <r, x € D},
(b) (7) = sup {f(x) : |x — cl]<r, x € D}
and set
These quantities are called the deleted limit superior and the non-deleted
limit superior of f at c, respectively.
Since these quantities are defined as the infima of the image under f of
ever-decreasing neighborhoods of c, it is probably not clear that they
deserve the terms ‘‘limit superior.” The next lemma indicates a justifica-
tion for the terminology.
25, LIMITS OF FUNCTIONS 179
sup {f(x) + g(x):x EA} < sup {f(x):xe A}+sup {g(x):x € A},
it is clear that, using notation as in Definition 25.7, we have
Prrelt) = O(t) + Pe(r).
Now use Lemma 25.8 and let r —> 0 to obtain (a). Q.E.D.
Results concerning other algebraic combinations will be found in
Exercise 25.F.
180 CONTINUOUS FUNCTIONS
One of the keys to the importance and the utility of upper semi-continuous
functions is suggested by the following lemma, which may be compared
with the Global Continuity Theorem 22.1.
25.12 Lemma. Let f be an upper semi-continuous function with
domain D in R? and let k be an arbitrary real number. Then there exists an
open set G and a closed set F such that
(25.6) GND={xeD:flx)<k}, FOD={xeD:f(x)=
k}.
PROOF. Suppose that c is a point in D such that f(c)<k. According to
Definition 25.11 and Lemma 25.9(b), there is a neighborhood U(c) of c
such that f(x)<k for allx in DM U(c). Without loss of generality we can
select U(c) to be an open neighborhood; setting
G = U{U(c):c € D},
we have an open set with the property stated in (25.6). If F is the
complement of G, then F is closed in R? and satisfies the stated condition.
OED.
It is possible to show, using the lemma just proved, (cf. Exercise 25.M)
that if K is a compact subset of R? and f is upper semi-continuous on K,
then f is bounded above on K and there exists a point in K where f attains its
supremum. Thus upper semi-continuous functions on compact sets pos-
sess some of the properties we have established for continuous functions,
even though an upper semi-continuous function can have many points of
discontinuity.
It will have occurred to the reader that it is possible to extend the notion
of limit superior at a point to the case where the function is not bounded by
using ideas along the lines given at the end of Section 18. Similarly one
25. LIMITS OF FUNCTIONS 181
can define the limit superior as x +. These ideas are useful, but we
will leave them as exercises.
Exercises
25.A. Discuss the existence of both the deleted and the non-deleted limits of the
following functions at the point x = 0.
(a) f(x) =I, (b) f(x)=1/x, = x #0,
(c) f(x) =x sin (1/x), x40, (d) f(x) =sin (1/x), x#0
x sin Oho. x#0, _ f0, x=0,
(©) f= { x=0, (p fod= {0 x >0.
25.B. Prove bemma 25.2.
25.C. If f denotes the function defined in equation (25.3), show that the deleted
limit at x =0 equals 0 and that the non-deleted limit at x =0 does not exist.
Discuss the existence of these two limits for the composition ff.
25.D. Prove Lemma 25.4.
25.E. Show that statements 25.5(b) and 25.5(c) imply statement 25.5(a).
25.F. Show that if f and g have deleted limits at a cluster point c of the set
D(f)N D(g), then the sum f+ g has a deleted limit at c and
Under the same hypotheses, the inner product f - g has a deleted limit at c and
lim
f = +00
in case for each positive number M there exists a neighborhood U of c such that if
xe€UND(f), x#c, then f(x)>M. Formulate and establish a result analogous to
Theorem 25.3 for this limit.
25.J. In view of Exercises 25.H and 25.I, give a definition of what is meant by
the expressions:
lim f= +, lim f =~
182 CONTINUOUS FUNCTIONS
25.K. Establish Lemma 25.8 for the non-deleted limit superior. Give the proof
of Lemma 25.9(b).
25.L. Define what is meant by lim sup,.4. f= L, and lim inf,_... f= —».
25.M. Show that if f is an upper semi-continuous function on a compact subset K
of R’ with values in R, then f is bounded above and attains its supremum on K.
25.N. Show that an upper semi-continuous function on a compact set may not be
bounded below and may not attain its infimum.
25.0. Show that if A is an open subset of R° and if f is defined on R’ to R by
f(x)=1 for xeA, and f(x)=0 for x¢A, then f is a lower semi-continuous
function. If A is a closed subset of R’, show that f is upper semi-continuous.
25.P. Give an example of an upper semi-continuous function which has an
infinite number of points of discontinuity.
25.Q. Is it true that a function on R’ to R is continuous at a point if and only if it
is both upper and lower semi-continuous at this point?
25.R. If (f,) is a bounded sequence of continuous functions on R’ to R and if f*
is defined on R” by f*(x)=sup {f,(x):n <N} for x eR’, then is it true that f* is
upper semi-continuous on R°?
25.S. If (f,) is a bounded sequence of continuous functions on R® to R and if f, is
defined on R?’ by f,(x) = inf {f.(x):n €N} for x € R’, then is it true that f, is upper
semi-continuous on R??
25.T. Let f be defined on a subset D of R’ x R* and with values in R’. Let
(a, b) be a cluster point of D. By analogy with Definition 19.4, define the double
and the two iterated limits of f at (a, b). Show that the existence of the double and
the iterated limits implies their equality. Show that the double limit can exist
without either iterated limit existing and that both iterated limits can exist and be
equal without the double limit existing.
25.U. Let f be as in the preceding exercise. By analogy with Definitions 17.4
and 19.8, define what it means to say that
We shall present some theorems in this section that will not be applied
later in this book, but which are often useful in topology and analysis. The
26. SOME FURTHER RESULTS 183
Tt MARSHALL H. STONE (1903- ) studied at Harvard and has taught at Harvard and the
Universities of Chicago and Massachusetts. The son of a chief justice, he has made basic
contributions to modern analysis, especially to the theories of Hilbert space and Boolean
algebras.
184 CONTINUOUS FUNCTIONS
Since g.,(x) = F(x), it is seen that h,(x) = F(x) and hence there is an open
neighborhood V(x) of x such that if z belongs to K M V(x), then
(26.3) h,(z)<F(z)+e.
Use the compactness of K once more to obtain a finite number of neigh-
borhoods V(x1),..., V(@%m) and set h = inf {h,,,..., h.,}. Then h belongs
to £& and it follows from (26.2) that
h(z)>F(z)-e« forzeK.
Combining these results, we have |h(z)— F(z)|<«, z €K, which yields the
desired approximation. O.E.D.
The reader will have observed that the preceding result made no use of
the Weierstrass Approximation Theorem. In the next result, we replace
condition (a) above by three algebraic conditions on the set of functions.
Here we make use of the classical Weierstrass Theorem 24.8 for the special
case of the absolute value function ¢ defined for t in R by ¢(t)=|t|, to
conclude that ¢ can be approximated by polynomials on every compact set
of real numbers.
26.2 STONE-WEIERSTRASS THEOREM. Let K be a compact subset of
R? and let & be a collection of continuous functions on K to R with the
properties:
(a) The constant function e(x)=1, x € K, belongs to &.
(b) If f, g belong to Hf, then af+ Bg belongs to & for all a, B in R.
(c) If f, g belong to #, then fg belong to A.
(d) If x# y are two points of K, there exists a function f in of such that
f(x) 4 fly).
Then any continuous function on K to R can be uniformly approximated
on K by functions in xf.
PROOF, Leta,beR andx# y belongto K. According to (d), there isa
26. SOME FURTHER RESULTS 185
function f in sé such that f(x) #f(y). Since e(x) = 1= e(y), it follows that
there are real numbers a, B such that
af(x)+ Be(x) =a, af(y)+ Be(y) = b.
Therefore, by (b) there exists a function g¢.o such that g(x)=a and
gly) =o.
Now let £ be the collection of all continuous functions on K which can
be uniformly approximated by functions in 4. Obviously @ c &, so £ has
property (b) of the Stone Approximation Theorem 26.1. We shall now
show that if he Y, then |h|e ¥. Since
sup {f, g}=2f+ g+l|f- gi),
inf {f, g}=20f+ @—If—e)),
this will imply that & has property 26,1(a) and hence that every continuous
function on K to R belongs to &.
Since h is continuous and K is compact, it follows that there exists an
M>0 such that |lhlk <M. Since he &, there is a sequence (h,) of func-
tions in «¢ which converge uniformly to h on K and we may suppose that
[halk M+1 for allneN. (Why?) If ¢>0 is given we now apply the
Weierstrass Approximation Theorem 24.8 to the absolute value function
on the interval [-(M +1), M +1] to get a polynomial p. such that
In fact, if d(x, A) =inf {\|x—yl]:ye¢A} and d(x, B) = inf {|x — yl]:y € B},
then we can define » for x € R” by the equation
___a(x, A)
0) =a A) +d, B):
26.4 TreTzet EXTENSION THEOREM. Let f be a bounded continuous
function defined on a closed subset D of R’ and with values in R. Then
} HEINRICH TIETZE (1880-1964) was professor at Munich and contributed to topology,
geometry, and algebra. This extension theorem goes back to 1914.
26. SOME FURTHER RESULTS 187
Equicontinuity
We have made frequent use of the Bolzano-Weierstrass Theorem 10.6
for sets (which asserts that every bounded infinite subset of R’ has a cluster
point) and the corresponding Theorem 16.4 for sequences (which asserts
that every bounded sequence in R? has a convergent subsequence). We
now present a theorem which is entirely analogous to the Bolzano-
Weierstrass Theorem except that it pertains to sets of continuous functions
and not sets of points. For the sake of brevity and simplicity, we shall
present here only the sequential form of this theorem.
In what follows we let K be a fixed compact subset of R’, and we shall be
concerned with functions which are continuous on K and have their range
in R*. In view of Theorem 22.5, each such function is bounded, and hence
C,,(K) = BC,,(K). We say that a set ¥ in C,,(K) is bounded (or uniformly
bounded) on K if there exists a constant M such that |lfllx < M, for all f in
#, Jt is clear that any finite set # of such functions is bounded; for if
#F ={fi, fo,..-, fa}, then we can set
is convergent. We proceed in this way and then set g, =f," so that g, is the
nth function in the nth subsequence. It is clear from the construction that
the sequence (g,) converges at each point of C.
We shall now prove that the sequence (g,) converges at each point of K
and that the convergence is uniform. To do this, let e >0 and let 8(e) be
as in Definition 26.6. Let C:={y1,..-, yx} be a finite subset of C such that
every point in K is within 8(e) of some point in C;. Since the sequences
(gn (2), (gn(y2)), «- +» (Gn yie))
converge, there exists a natural number M such that if m,n = M, then
Exercises
26.A. Show that condition (a) of Theorem 26.1 is equivalent to the condition:
(a’') If f belongs to &, then |f| belongs to &.
26.B. Show that every continuous real-valued function on the interval [0, a] is
the uniform limit of a sequence of ‘“‘polynomials in cos x”’ (that is, of functions (P,),
where P,(x)=p,(cos x) for some polynomial p,).
26.C. Show that every continuous real-valued function on [0, 7] is the uniform
limit of a sequence of functions of the form
X +> dota, cosx+a;,cos2x++-++4,
cos nx.
26.D. Explain why the result in Exercise 26.B fails if cos kx is replaced by sin kx,
keEN.
26.E. Use Exercise 26.C to show that every continuous real-valued function f on
[0, a] with f(0) = f(a) is the uniform limit of a sequence of functions of the form
26.F. Use Exercises 26.C and 26.E to show that every continuous real-valued
function f on [—a, 7] with f(—a)= f(s) is the uniform limit of a sequence of
functions of the form
[Hint: split f into the sum f = f. +f, of an even function f,(x) =3(f(x) +f(—x)) and an
odd function f, (x) =3(f(x).]
26.G. Give a proof of the preceding exercise based upon Theorem 26.3 applied
to the unit circle T = {(x, y)e R*:x’+y?=1} and the observations that there is a
one-one correspondence between continuous functions on T to R and continuous
functions on [~—7, a] to R which satisfy f(—a) = f(a).
26.H. Let J&R be a compact interval and let # be a collection of continuous
functions on J > R which satisfy the properties of the Stone-Weierstrass Theorem
26.2. Show that any continuous function on JxJ (in R’) to R can be uniformly
approximated by functions of the form
193
194 FUNCTIONS OF ONE VARIABLE
(27.1) f)-fO_
— ple &.
f(x)
— f(c)| = |x| {f"()|+ I.
The left side of this expression can be made less than ¢ if we take x in D
with |x — c| <inf {8, e/(|f’(c)|+ 1}. OED.
It is easily seen that continuity at ¢ is not a sufficient condition for the derivative
to exist at c. For example, if D=R and f(x)=|x|, then f is continuous at every
point of R but has a derivative at a point c if and only ifc#0. By taking simple
algebraic combinations, it is easy to construct continuous functions which do not
have a derivative at a finite or even a countable number of points. In 1872,
27, THE MEAN VALUE THEOREM 195
can be proved to have this property. We shall not go through the details, but refer
the reader to the books of Titchmarsh and Boas for further details and references.)
-e fO=1O_pye),
Since x —c > 0, this relation implies that
of—-~—-——
ZL \
Figure 27.1
1
!
on
i
% ¢
Figure 27.2. The mean value theorem.
27.7 Coroiiary. Iff has a derivative on J =[a, b], then there exists a
point c in (a, b) such that
Exercises
27.A. Using the definition, calculate the derivative (when it exists) of the func-
tions given by the expressions:
h'(c) =fi(c)g(c)
+ fic)g'(c).
27.C. Show that the function defined for x # 0 by
(fs) =f0)
Xn —€
exists. In this case the limit of all such sequences are equal to f'(c).
27.G. If f:D — R is differentiable at c¢ D andifc+1/neD forall ne N, show
that
[#22101 py |<
27.M. Let f:[a, b]> R be differentiable on [a, b]. Show that f’ is continuous
on (a, b]if and only if for every « > 0 there is a 8(e) > 0 such that if 0<|x — y|< 8(e),
x, y€[a, b], then
fG)-fy)
x-y -f@) |<
27.N. Let f:[a,b]— R be continuous on [a, b] and differentiable in (a, b). If
lim, f'(x) = A show that f’(a) exists and equals A.
27.0. If f: R— R and f'{a) exists, show that
Fa) =f tla)
200 FUNCTIONS OF ONE VARIABLE
However, give an example to show that the existence of this limit does not imply
the existence of the derivative.
27.P. A function f:R — R is said to be even if f(—x) = f(x) for all x ER, and to
be odd if f(—x) =—f(x) for allxe R. Iff is differentiable on R and even (respec-
tively, odd), show that f’ is odd (respectively, even).
27.Q. Let f:(a,b) > R and ce(a,b). We put f(ct+)=hm,.. f(x) (the right-
hand limit of f atc). If the right-hand limit
(x)=
alex) -f(glb))
g(x)—g(b) if g(x) ¥ gc),
fin EMM)
(b) If f(x)> aeR and f'(x)> bER as x > +™, then b=0.
(c) If f'(x) > bE R as x > +, then f(x)/x > b as x > +00,
27.T. Let f:[a,b]—>R be differentiable with 0<m = f'(x)=M for x e[a, b]
and let f(a)<0<f(b). Given x,€[a, b], define the sequence (x,) by
Prove that this sequence is well-defined and converges to the unique root x of the
equation f(x) =0 in [a, b] and that
y ny
\xXn-e1¥| = aml M
forneN. (Hint: let ~:[a, b] > R be defined by ¢ (x) =x —f(x)/M. Show that ¢
is increasing and a contraction (see 23.4) with constant 1—m/M.)
27.U. Let f: R — R have a continuous derivative and be such that f(a) = b an!
f'(a) #0. Let &>0 be such that if |x — a] = 5 then |f’(x)—f’(a)| =<3|f'(a)|, and let
28. FURTHER APPLICATIONS OF THE MEAN VALUE THEOREM 201
Xnat = Xp
f)-¥F neN
converges to the unique point x in [a—6, a+] such that f(x)=y9. (Hint: show
that the function defined by p(x) = x — (f(x)— 9)/f'(a) is a contraction with constant
300 the interval [a— 8, a+6].)
5
ari
<< V105—-10<5-~
xa 2(10) ’
023 <sae os
25) ~¥ 105—
Our improved estimate is 10.243<¥105 < 10.250 and more accurate esti-
mates can be obtained in this way.
28.3. AppLicaTion. The Mean Value Theorem and its corollaries can
be used to establish inequalities and to extend inequalities that are known
for integral or rational values to real values.
For example, we recall that Bernoulli’s Inequality 5.C asserts that if
1+x>0 and neEN, then (1+x)"=i+nx. We shall show that this in-
equality holds for any real exponent r= 1, Todo so, let f(x)=(1+ x)’, so
that f’(x)=r(it+x)". If -1<x<0, then f’(x)<r, while if x>0, then
f'(x)>r. If we apply the Mean Value Theorem to both of these cases, we
obtain the result
(1+x) =1+nrx,
f(b) _ fle)
gb) gc)”
It follows that if lim... f’(x)/g’(x) exists, then
£'Q)
wax_ > 0, as x0.
g'(x)
aLL]
Let e¢ > 0 and choose a fixed number 0< x, <1 such that if0<x<x,, then
|f’(x)/2’(x)|<e. Applying the Cauchy Mean Value Theorem, we have
f(x)
— f(x.) | _ | f(x) <€ ’
g(x) — g(x) g'(x2)
with x2 satisfying O0<x<x.<x,. Since f(x) #0 and g(x) #0 forO0<x<x4,
we can write the quantity appearing on the left side in the more convenient
form
1 fx)
Fx) JU fx)
g(x) 1 — 80) ‘
g(x)
Holding x: fixed, we let x > 0. Since the quantity in braces converges to
the Mean Value Theorem to the difference f,, —f, on an interval with end
points c, x to infer that there exists a point z (depending on m, n) such that
{fm (x) — fn(x)}—
{fm (c) — fale)} = (x — cf f(z) — fa(z)}.
We infer that, when c# x, then
FO)-FO) _ fale)
— filo)
|
x—-C x—-C€
when n= M(e). Since g(c) =lim (f4(c)), there exists an N(e) such that if
n= N(e), then [fi(c)—g(c)|<«. Now let K =sup{M(e), N(e)}. In view
of the existence of f&(c), if 0<|x —c|< 8x(e), then
Pe) fl) _ (0)
x-
Taylor’s Theorem
f BROOK TAYLOR (1685-1731) was an early English mathematician. In 1715 he gave the
infinite series expansion, but—true to the spirit of the time—did not discuss convergence.
The remainder was supplied by Lagrange.
206 FUNCTIONS OF ONE VARIABLE
and that f™ exists in (a, b). If a, B belong to J, then there exists a number y
between a and B such that
8.1) BAP
p— 4p) ~ {70 +49 @-a)
fe (a) ¢
Paap boa" |.
and consider the function ¢ defined on J by
PHF) pp yn
— (n- 1)! (B x) .
This form can be established as above, except that on the left side of
equation (28.1) we put (8 — a@)Q/(n— 1)! and we define ¢ as above except
its last term is (8 —x)OQ/(n—1)! We leave the details as an exercise. (In
Section 31 we shall obtain another form involving use of the integral to
evaluate the remainder term.)
Exercises
28.A. Using the formulas in 28.1, show that if 2 = 0, 1,2,..., then the roots of
the Bessel functions J, and J,,., on (0, +9) interlace each other.
28.B. Show that if x >0, then
28.C. Calculate V1.2 and V2. What is the best accuracy you can be sure of?
28.D. Get estimates similar to those in Exercise 28.B for (1+ x)” on the interval
[0, 7]. Use these to calculate 31.5 and 72.
28.E. Suppose that 0<r<1 and -—1<x. Show that we have (1+x) =1+rx
and that the equality holds if and only if x =0.
28.F. A root x» of a polynomial p is said to be simple (or have multiplicity one) if
p'(%o.) #0, and to have multiplicity n if p(x.) =p'(x.) =---=p (x. =0, but
P (Xo) #0.
If a<b are consecutive roots of a polynomial, then there are an odd number
(counting multiplicities) of roots of its derivative in (a, b).
28.G. Show that if the roots of the polynomial p are all real, then the roots of p’
are allreal. If, in addition, the roots of p are all simple, then the roots of p’ are all
simple.
28.H. If f(x) =(x?— 1)" and if p is the nth derivative of f, then p is a polynomial
of degree n whose roots are simple and lie in the open interval (—1, 1).
28.1. Establish the Cauchy form of the remainder term R, in Taylor’s Theorem
given in formula (28.3).
28.J. A proof of Taylor’s Theorem 28.6 using the Cauchy Mean Value Theorem
can be given by letting
(Bre as (nxt FR
(1+x)™ = 14(™)x+
Projects
28.a. In this project we consider the exponential function from the point of view
of differential calculus.
(a) Suppose that a function E on J = (a, b) to R has a derivative at every point of
J and that E’(x) = E(x) for all x ¢J. Observe that E has derivatives of all orders on
J and they all equal E.
(b) If E(a) = 0 for some a € J, apply Taylor’s Theorem 28.6 and Exercise 14.L to
show that E(x)=0 for all xeJ.
(c) Show that there exists at most one function E on R to R which satisfies
(d) Prove that if E satisfies the conditions in part (c), then it also satisfies the
functional equation
E(x+y)=E(x)E(y) forx, yeR.
Show that the sequence (E,) converges on R to a function E with the properties
displayed in part (c). Therefore, E is the unique function with these properties.
(g) Let E be the function with E’=E and E(0)=1. If we define e to be the
number
e-E(\),
28.f. In this project, you may use the results of the preceding one. Let E
denote the unique function on R such that
5-H
SLG) LQ) <2 (9%).
(Hint: apply the Mean Value Theorem to E.)
(e) The function L has a derivative for x >0O and L'(x) = 1/x.
(f) The number e satisfies
e=tim ((1+4)’), n
(Hint: evaluate L'(1) by using the sequence ((1+1/n)) and the continuity of E.)
h"(x)+h(x) =0
for all x in J, Show that h has derivatives of all orders and that if there is a point a
in J such that h(a) =0, h’(a) = 0, then h(x)=0 for all x EJ. (Hint: use Taylor’s
Theorem 28.6.)
(b) Show that there exists at most one function C on R satisfying the conditions
c’'+C=0, CO)=1, C(O) =0,
and at most one function S on R satisfying
Cix)=1-%7/2, CO)
= Cae) +(-1)" (2n)!"
210 FUNCTIONS OF ONE VARIABLE
4 Ant
Show that (S,) converges uniformly for |x| < A to the unique function S with the
properties in part (b).
(e) Prove that S’=C and C’=~—S.
(f) Establish the Pythagorean Identity $+ C?=1. (Hint: calculate the deriva-
tive of S7+C”’,)
28.8. The project continues the discussion of the sine and cosine functions. Free
use may be made of the properties established in the preceding project.
(a) Suppose that h is a function on R which satisfies the equation
h"+h=0.
Show that there exist constants a, 8 such that h=aC+S. (Hint: a =h(0),
B =h'(0).)
(b) The function C is even and S is odd in the sense that
(f) We define 7 to be the smallest positive root of S. Prove that m=2y and
hence that 2V2 <7 <2v3.
(g) Prove that both C and S are periodic functions with period 27 in the sense
that C(x +27) = C(x) and S(x +2) =S(x) for all x in R. Also show that
28.2. Following the model of the preceding two projects, introduce the hyper-
bolic cosine and sine as functions satisfying
c"=¢, c(0) = 1, c‘(0)=0,
' s"=s, s(0)=0, s‘(0)=1,
respectively. Establish the existence and the uniqueness of these functions and
show that c—s?=1.
Prove results similar to (a)-(d) of Project 28.6 and show that, if the exponential
function is denoted by E, then
for each x, y in J. (In geometrical terms: the midpoint of any chord of the curve
y = ¢(x), lies above or on the curve.) In this project we shall suppose that ¢ is a
continuous convex function.
(a) If n=2™ and if x,,...,x, belong to J, then
oP
Xb Xete
ES)
+X
Fol)
1
+ 96):
(b) Ifm<2” andifx,,...,x, belong to J, let x, forj =n+1,...,2™ be equal to
= (Stet ts)
x= {4 iS
n
Show that the same inequality holds as in part (a).
(c) Since @ is continuous, show that if x, y belong to J and teF, then
oly)—o(x) _ ofz)—e(x)
y-x z-x
(f) Prove that a continuous convex function ¢ on J has a left-hand derivative and
a tight-hand derivative at every point. Furthermore, the subset where o’ does not
exist is countable.
+ (GEORGE FRIEDRICH) BERNHARD RIEMANN (1826-1866) was the son of a poor country
minister and was born near Hanover. He studied at Gottingen and Berlin and taught at
Gottingen. He was one of the founders of the theory of analytic functions, but also made
fundamental contributions to geometry, number theory, and mathematical physics.
THOMAS JOANNES STIELTJES (1856-1894) was a Dutch astronomer and mathematician. He
studied in Paris with Hermite and obtained a professorship at Toulouse. His most famous
work was a memoir on continued fractions, the moment problem, and the Stieltjes integral,
which was published in the last year of his short life.
29. THE RIEMANN-STIELTJES INTEGRAL 213
)
|. —-__N
\
Lp
NN
—- —- —_
—
}— srr —
wD
Leir
a=Xxo *L x2 x3 Xk Xpel Xn =b
x
co
H
Figure 29.1. '
The Riemann sum as an area.
It will be noted that both of the sums (29.1) and (29.2) depend upon the choice of
the ‘intermediate points’’; that is, upon the numbers &, 1 = k =n. Thusit might be
thought advisable to introduce a notation displaying the choice of these numbers.
However, by introducing a finer partition, it can always be assumed that the
intermediate points & are partition points. In fact, if we introduce the partition
Q = (Xo, Er, X1, &2, + + + > Exp Xn) and the sum S(Q; f, g) where we take the intermediate
points to be alternately the right and the left end points of the subinterval,,then the
sum S(Q; f, g) yields the same value as the sum in (29.1). We could always assume
that the partition divides the interval into an even number of subintervals and the
intermediate points are alternately the right and left end points of these subintervals.
However, we shall not find it necessary to require this ‘‘standard’’ partitioning
process, nor shall we find it necessary to display these intermediate points.
g(x)=0, x=a,
=1, a<x<b.
We leave it as an exercise to show that a function f is integrable with respect
to g if and only if f is continuous at a and that in this case the value of the
integral is f(a).
(d) Let c be an interior point of the interval J =[a, b] and let g be defined
by
g(x) =0, asxx<c,
=1, c<x=<b.
It is an exercise to show that a function f is integrable with respect to g if and
only if it is continuous at c from the right (in the sense that for every e >0
there exists 6(¢)>O such that if c<x<c+68(e) and xeJ, then we have
[f(x)—f(c)|<e). If f satisfies this condition, then the value of the integral
is f(c). (Observe that the integrator function g is continuous at c from the
left.)
(e) Modifying the preceding example, let h be defined by
h(x) =0, asxx<c,
=1, csxab.
By taking more points we can obtain a sum involving the values of f at points
in J, weighted by the values of the jumps of g at these points.
(g) Let the function f be Dirichlet’s discontinuous function [cf. Example
20.5(g)] defined by
f(x) =1, if x is rational,
=0, if x is irrational,
[S(Qusf, g)-L|<e/2.
29. THE RIEMANN-STIELTJES INTEGRAL 217
Pll
= sup {x1—%0, X2— X1,-. + Xn Xn-sf < BCE).
This type of limiting is generally used in defining the Riemann integral and
sometimes used in defining the Riemann-Stieltjes integral. However, many authors
employ the definition we introduced, which is due to S. Pollard, for it enlarges slightly
the class of integrable functions. As a result of this enlargement, the next result is
valid without any additional restriction. See Exercises 29. P-R.
29.6 THEorEM. (a) Suppose that a <c < band that f is integrable with
respect to g over both of the subintervals [a, c] and [c, b]. Then f is integrable
with respect to g on the interval [a, b] and
= |[ tas-seP sf 8) + [fag-sers
fe) |<26.
It follows that f is integrable with respect to g over [a, b] and that the value
of its integral is
{ faa+[
Fas,
(b) We shall use the Cauchy Criterion 29.4 to prove that f is integrable
over [a, c]. Since f is integrable over [a, b], given e >0 there is a partition
29. THE RIEMANN-STIELTJES INTEGRAL 219
Q. of [a, b] such that if P, Q are refinements of Q.,, then relation (29.4) holds
for any corresponding Riemann-Stieltjes sums. It is clear that we may
suppose that the point c belongs to Q., and we let Q: be the partition of
[a, c] consisting of those points of Q, which belong to[a, c]. Suppose that
P’ and Q’ are partitions of [a, c] which are refinements of Q! and extend
them to partitions P and Q of [a, b] by using the points in Q, which belong
to [c, b]. Since P, Q are refinements of QO,, then relation (29.4) holds.
However, it is clear from the fact that P, Q are identical on [c, b] that, if
we use the same intermediate points, then
Thus far we have not interchanged the roles of the integrand f and the
integrator g, and it may not have occurred to the reader that it might be
possible to do so. Although the next result is not exactly the same as the
“integration by parts formula” of calculus, the relation is close and this result
is usually referred to by that name.
29.7 INTEGRATION By Parts. A function f is integrable with respect to g
over [a, b] if and only if g is integrable with respect to f over [a,b]. In this
case,
(29.12) {i dg = [te
PROOF. The hypothesis implies that g’ is uniformly continuous on J. If
e >0, let P = (xo, X1,..., Xn) be a partition of J such that if & and ¢ belong
to [xx-1, x] then |g’(&)—g’(&)|<e. We consider the difference of the
Riemann-Stieltjes sum S(P; f, g) and the Riemann sum S(P; fg’), using the
same intermediate points &. In doing so we have a sum of terms of the form
If we apply the Mean Value Theorem 27.6 to g, we can write this difference
in the form
FE Mg") — BCE Xe — Xe)
where & is some point in the interval [x.-1,x.]. Since this term is
dominated by e ||fl| (x. — xx-1), we conclude that
|S(P; f, g)— SCP; fel <e |Ifl (- a),
provided the partition P is sufficiently fine. Since the integral on the left side
of (29.12) exists and is the limit of the Riemann-Stieltjes sums S(P; f, g), we
infer that the integral on the right side of (29.12) also exists and that the
equality holds. Q.E.D.
For an extension of this result, see Theorem 30.13
29. LHI RIEMANN-STIBLTJES INTEGRAL 221
(Here we have made use of results from calculus that will be proved in
Section 30.)
(b) If we apply Theorem 29.7 on Integration by Parts to the functions in
(a), we get
i 1
‘2
[> d(x*)=x* -| x? dx =1-3x .
0 0 0 3
| sin x d(sin x)
= (sin x)” -| sin x d(sin x),
0 0 0
(f) It follows from results in Section 30 that f(x) =x" is integrable with
respect to both gi(x) = x and go(x) =[x]on[0, 5]. Therefore it is integrable
with respect to g(x) = x +[x] and we have
[x d(x+[x) = [x dx + [« d({x)
= 35° 4+1°4+274+374+4°4+ 5",
222 FUNCTIONS OF ONE VARIABLE
Exercises
29.A. Iff is constant on the interval [a, b], then it is integrable with respect to any
function g and
[ far-xg@y'- Gay,
(a) Prove this by examining the two Riemann-Stieltjes sums for a partition
P=(X%o, X1,...,X,) obtained by taking & =x,_, and & =x.
(b) Prove this by using the Integration by Parts Theorem 29.7.
29.F. Show directly that if f is the greatest integer function f(x) =[x] defined in
Example 29.9(e), then f is not integrable with respect to f on the interval (0, 2].
29.G. If f is Riemann integrable on [0, 1], then
[r-nim (5&1)
29.H. Show that if g is not integrable on [0, 1], then the sequence of averages
(2 et)
1g /(k )
frefs
b b
{ f dk -| k df =0.
[ Fes. [ fas,
29.N. Suppose that g is continuous on [a, b], that x+» g’(x) exists and is
continuous on [a, b}\{c}, and that the one-sided limits
exist. If f is integrable with respect to g on [a, b], then fg’ can be defined at c to be
Riemann integrable on [a, b] and such that
[ fas= | te’
b b
[ofde-[r-[r
29.P. If P=(Xo, X:,..., Xn) is a partition of J =[a, b], let ||P|| be defined to be
=1, s<x<l,
29.R. Let g(x)=x for xeJ. Show that for this integrator, a function f is
integrable in the sense of Definition 29.2 if and only if it is (*)-integrable in the
sense of Exercise 29.P.
29.8. Let f be Riemann integrable on J and let f(x)=0O for xeJ. If f is
continuous at a point ceJ and if f(c)>0, then
[ Foo.
29.T. Let f be Riemann integrable on J and let f(x) >0OforxeJ. Show that
[ foo.
(Hint: for each neéN, let H, be the closure of the set of points x €J such that
f(x)>1/a and apply Exercise 11.N.)
Projects
LOPS F, 8) = mgs)
20%}
UP; f, 8)= Y Mile)
— 80-0):
(a) If S(P;f, g) is any Riemann-Stieltjes sum corresponding to P, then
(d) If P,; and P, are any partitions of J, then L(P,; f, g) < U(P.; f, g). [Hint: let
Q be a partition which is a refinement of both P, and P, and apply (c).]
(ec) Define the lower and the upper integral of f with respect to g to be,
respectively
LO, g)=sup {L(P; f, 8},
U(f, g) = ink {U(P; f, gh:
here the supremum and the infimum are taken over all partitions P of J. Show that
L(f, g) = UG, 8).
(f) Prove that f is integrable with respect to the increasing function g if and only
if the lower and upper integrals introduced in (e) are equal. In this case the
common value of these integrals equals
(1s
Show that f is integrable with respect to g if and only if the following Riemann
condition is satisfied: for every «>0 there exists a partition P such that
U(P;f, g)—L(P;f, g) <e.
(g) If f; and f, are bounded on J, then the lower and upper integrals of f,+f,.
satisfy
L(fit+f., g)= Lh, g)t+ Lh, g&)>
29.8. This (and the next two) projects introduce and study the important class of
functions which have “bounded variation” on a compact interval. Let f:[a,b]—>R
be given; if P=(a =x.<x,<:-+<x, =b) isa partition of [a, b], let v,(P) be defined
by .
If the set {v,(P):P a partition of [a, b]} is bounded, we say that f has bounded
variation on [a,b]. The collection of all functions having bounded variation on
[a, b] is denoted by BV({a, b]) or by BV[a, b]. If fe BV{a, b], then we define
We call the number V,[a, b] the total variation of f on [a, b]. Show that Vi[a, b]=0
if and only if f is a constant function on [a, b].
(a) If f:[a,b]> R, if P and Q are partitions of [a, b], and if P > QO, show that
uP) = v,(Q). Iff ¢BV[a, b], show that there exists a sequence (P,) of partitions of
(a, b] such that V[a, b] =lim (v,(P,)).
(b) Iff is monotone increasing on [a, b], show that f ¢ BV[a, b] and that V[a, bJ=
f(b)—f(a). What iff is monotone decreasing on [a, b]?
(c) If g:[a, b] > R satisfies the Lipschitz condition |g(x) — g(y)| = M |x — y| for all
x, y in[a, b], show that g € BV[a, b] and that V,[a, b] = M(b—a). If|h'(x)| = M for
allx €[a, b], then h ¢ BV[a, b] and V,[a, b] = M(b—a). However, consider k(x) =
vx on [0, 1].
226 FUNCTIONS OF ONE VARIABLE
(d) Let f:[0,1]— R be defined by f(x)=0 for x =0 and f(x)=sin (1/x) for
0<x<1. Show that f does not have bounded variation on[0, 1]. If g is defined by
g(x) =xf(x) for x €[0, 1], show that g is continuous but does not have bounded
variation on [0, 1}. However, if h is defined by h(x) = x’f(x) for x €[0, 1], show that
h does have bounded variation on [0, 1].
(e) If f ¢ BV[a, b], show that |f(x)| = |f(a@)|+ Vi[a, b] for all x €[a, b], so that f is
bounded on I =[a, b} and |If\l, = |f(a)|+ ViLa, 6}.
(@ Itf, g¢BV[a, b] anda € R, show that af and f+g belongto BV[a, b] and that
Vis (a, b] = lax| V,[a, b],
Show that the quotient of two functions in BV[a, b] may not belong to BV[a, b].
(h) Show that the mapping f+ Vj[a, b] is not a norm on the vector space
BV[a, b], but that the mapping
is. To prove that fif. is integrable when f, and f, are, note that
FO) = ["Fag,
has a derivative at c and F'(c) = f(c)g’(c).
proor. If h>O is such that c+h belongs to J, then it follows from
Theorem 29.6 and the preceding result that
Specializing this theorem to the Riemann case, we obtain the result which
provides the basis for the familiar method of evaluating integrals in calculus.
30.8 FUNDAMENTAL THEOREM OF INTEGRAL CatcuLus. Let f be continu-
ous on J=[a, b]. A function F on J satisfies
x
(30.6) F(x)~ F(a)= { f for xeJ,
The preceding theorem asserts that F.=fonJ. If F is such that F’ =f, then
it follows from Theorem 27.9(ii), that there exists a constant C such that
F(x) = F.(x)+C, forx €J. Since F,(a) = 0, then C = F(a) whence it follows
that if F'=f on J, then
F(x)—F(a)= [7 Q.E.D.
232 FUNCTIONS OF ONE VARIABLE
g(x)= [pt de
Since p(x)= 0, it is seen that g is increasing and it follows from the
Differentiation Theorem 30.7 that g’=p. By Theorem 29.8, we conclude
that
b b
i fdg - { fp,
and from the First Mean Value Theorem 30.6, we infer that for some c in J,
then
{3 dg = fro). QED.
{ fe'=f0)a(b)-fladeta)—
[F's
The next result is often useful.
30.11 SECOND MEAN VALUE THEOREM. (a) [f f is increasing and g is
continuous on J =[a, b], then there exists a point c in J such that
[ch= er.
PROOF. The hypotheses, together with the Integrability Theorem 30.2
imply that g is integrable with respect to f on J. Furthermore, by the First
Mean Value Theorem 30.6,
{ eaf= e(ertf)-fa)h.
After using Theorem 29.7 concerning integration by parts, we conclude that
f is integrable with respect to g and
a(x)= fh
so that g’=h. The conclusion then follows from part (a) by using Theorem
29.8. To prove (c) define F to be equal to » for x in (a, b] and define
F(a)=0. We now apply part (b) to F. QED.
234 FUNCTIONS OF ONE VARIABLE
Part (c) of the preceding theorem is frequently called the Bonnett form of
the second Mean Value Theorem. Itis evident that there is a corresponding
result for a decreasing function (cf. Exercise 30.N.)
Change of Variable
We shall now establish a theorem justifying the familiar formula relating
to the ‘‘change of variable” in a Riemann integral.
30.12 CHANGE OF VARIABLE THEOREM. Let @ be defined on an interval
[a, B] to R with a continuous derivative and suppose that a= (a) and
b= (8). Iff is continuous on the range of », then
+ OSSIAN BONNET (1819-1892) is primarily known for his work in differential geometry.
30. EXISTENCE OF THE INTEGRAL 235
I= a
where M, = sup {g'(x):x €[x-1, x;]} and m; = inf {g(x):x €[xj)-1, x ]}. If we
use the Mean Value Theorem 27.6, we obtain points ¢; € (x;-1, x;) such that
Since « >0 and the choice of & €[x;-1, x;] are arbitrary, it follows that f is
integrable with respect to g and that (30.11) holds. OED,
REMARK. The proof can be modified to apply to the case where f is
bounded and g is continuous on [a, b], and where g has a derivative except
at a finite number of points at which g’ can be defined so that g’ and fg’ are
Riemann integrable on [a, b].
Exercises
30.A. Show that a bounded function which has at most a finite number of
discontinuities is Riemann integrable.
30.B. If f:[a, b] > R is discontinuous at some point of the interval, then there
exists a monotone increasing function g such that f is not g-integrable.
30.C. Show that the Integrability Theorem 30.2 holds when g is a function of
bounded variation on J.
30.D. Give an example of a function f which is not Riemann integrable over J but
such that |f| and f* are Riemann integrable over J.
236 FUNCTIONS OF ONE VARIABLE
30.E. Let f be positive and continuous on J =[a, b] and let M = sup {f(x):x € J}.
Show that
F(x)= [i dg.
Prove that (a) if g is continuous at c, then F is continuous at c, and (b) if f is
positive, then F is increasing.
30.1. Give an example of a Riemann integrable function f on J such that the
function F, defined for x €J by
Fa)=["f
does not have a derivative at some points of J. Can you find an integrable function f
such that F is not continuous on J?
30.J. If f is Riemann integrable on J =[a, b] and if F’=f on J, then
F(b)—F(a)= { f.
Hint: if P = (xo, X:,.--, %) is a partition of J, write
[on = eta). h.
30, EXISTENCE OF THE INTEGRAL 237
30.0. Let f be continuous on F=(0, 1], let fo=f, and let f,., be defined by
foals) = [ f.i)dt
0
forne NxeL
By induction, show that |f,(x)| = (M/n!)x" = M/n!, where M=sup {|f(x)|:x 6D.
It follows that the sequence (f,) converges uniformly on F to the zero function.
30.P. If fis integrable with respect to g on J =[a, b], if p is continuous and strictly
increasing on [c, d], and if p(c) = a, p(d) = b, then fe@ is integrable with respect to
gem and
[=o
for all continuous functions h, then f(x) = 0 for all x.
30.R. Iff is integrable on [a, b] and if
[im
for all continuous functions h, then f(x) = 0 for all points of continuity of f.
30.S. Let p be continuous and positive on [a, b] and let c>0. If
pix)< cf pt) dt
for all x e[a, b], show that p(x)=0 for all x.
30.T. Let f be continuous and such that f(x) = 0 for all x e[a, b]. If g is strictly
increasing on [a, b] show that
{ fag 0
b
Projects
30.a. The purpose of this project is to develop the logarithm by using an integral
as its definition. Let P={xeR:x>O}.
(a) If x € P, define L(x) to be
L(x)= I ifdat.
Hence L(1)=0. Prove that L is differentiable and that L’(x) = 1/x.
(b) Show that L(x) <0 for O0<x<1 and L{x)>0 forx>1. In fact,
(c) Prove that L(xy) = L({x)+L(y) for x, yin P. Hence L(1/x) =—L(x) for x in P.
(Hint: if y ¢P, let L, be defined on P by Li(x) = L(xy) and show that Li =L’.)
(d) Show that if n € N, then
1,1 1 1 1
ata HE <L(nj<ltste +o.
(e) Prove that L is a one-one function mapping P onto allof R. Letting e denote
the unique number such that L(e)=1, and using the fact that L’(1) = 1, show that
e=lim((14+1/n)*).
(f} Let r be any positive rational number, then lim,_,.. L(x)/x" =0.
(g) Observe that
™* dt [4
LU+
G4+xy=} =| fo]
P= | 4.
iat
Write (1+1)” as a finite geometric series to obtain
k=
30.8. This project develops the trigonometric functions starting with an integral.
(a) Let A be defined for x in R by
A(x) = { dt
14+¢°
Then A is an odd function (that is, A(—x) = —A(x)), it is strictly increasing, and it is
bounded by 2. Define a by a/2= sup {A(x):x © R}.
(b) Let T be the inverse of A, so that T isa strictly increasing function with domain
(—a/2, 7/2). Show that T has a derivative and that T’=1+T’.
(c) Define C and S on (—7/2, a/2) by the formulas
1 oT
Caer SS aeTy
30. EXISTENCE OF THE INTEGRAL 239
Hence C is even and S is odd on (—77/2, 7/2). Show that C(0) =1 and $(0)=0 and
C(x)— 0 and S(x) > 1 asx > 7/2.
(d) Prove that C’(x) =—S(x) and S’(x) = C(x) for x in (7/2, 7/2). Therefore,
both C and S satisfy the differential equation
h’+h=0
on the interval (—7/2, 7/2).
(e) Define C(a/2)=0 and S(a/2)=0 and define C, S, T outside the interval
(-/2, 7/2) by the equations
T(x + 1) = T(x).
If this is done successively, then C and S are defined for all R and have period 27.
Similarly, T is defined except at odd multiples of 7/2 and has period a.
(f} Show that the functions C and S, as defined on R in the preceding part, are
differentiable at every point of R and that they continue to satisfy the relations
C’=-S, S'=C
everywhere on R.
30.y. This project develops the well-known Wallist product formula. Through-
out it we shall let
afd
1-3-5--:Qn—-Da 2° 4-++(2n)
Sm= FG.
6 (On) 2? Sons =
1:3-5-+-(2n+1)°
(c) Show that the sequence (S,,) is monotone decreasing. (Hint: 0 <sinx =< 1.)
(d) Let W, be defined by
°-6°6°++(2n)(2n)
7---(2n—1)(Qn+1)"
Prove that lim (W,) = 7/2. (This is Wallis’s product.)
(e) Prove that lim ((n!)°27"/(2n)! Vn)= Va.
30.8. This project develops the important Stirling} formula, which estimates the
magnitude of n!.
+ JOHN WALLIs (1616-1703), the Savilian professor of geometry at Oxford for sixty years, was a
precurser of Newton. He helped to lay the groundwork for the development of calculus.
+ JAMES STIRLING (1692-1770) was an English mathematician of the Newtonian school. The
formula attributed to Stirling was actually established earlier by ABRAHAM DE Moivre
(1667-1754), a French Huguenot who settled in London and was a friend of Newton.
240 FUNCTIONS OF ONE VARIABLE
(a) By comparing the area under the hyperbola y = 1/x and the area of a trapezoid
inscribed in it, show that
2 1
Int1~ 8 (1 +)
From this, show that e<(14+1/n)"*"”.
(b) Show that
Consider the figure F made up of rectangles with bases [1,3], [n —3, n] and heights 2,
log n, respectively, and with trapezoids with bases [k —3, k +3], k =2,3,...,n—-1,
and with slant heights passing through the points (k, log k). Show that the area of F
is
u, =(wie) 1, nen
(d) Show that the sequence (u,) is monotone increasing. (Hint: consider u,41/t,.)
(e) By considering u,’/u,,, and making use of the result of part (e) of the preceding
project, show that lim (u,) =(27)"*”.
(f) Obtain Stirling’s formula
However, this need not be the case even for very nice functions.
31. FURTHER PROPERTIES OF THE INTEGRAL 241
(1/n, n)
{ j
0 lfn 2jn 1
Exampce. Let J=(0, 1], let g(x) =x, and let f, be defined for n = 2 by
{feo dx =1, n= 2.
[f00 dx =0.
Therefore, equation (31.1) does not hold in this case even though both sides
have a meaning.
Since equation (31.1) is very convenient, we inquire if there are any simple
additional conditions that will imply it. We now show that, if the con-
vergence is uniform, then this relation holds.
Fa) dip — a)
f(b)= fla) +p t+ LO
Ea ®@ py(bay yet + Re
where the remainder is given by
ce vf (b—t)"-7f°" P(t)at}
at
(n-1)
~
a (b-ayr'+ a |, O97
Continuing to integrate by parts in this way, we obtain the stated formula.
OED.
244 FUNCTIONS OF ONE VARIABLE
This form of the remainder can be extended to the case where f has domain
in R? and range in R*.
for all x in[a, b]. Let t, to satisfy this condition and apply the Mean Value
Theorem 27.6 to obtain a ti (which may depend on x and lies between t and
to) such that
for all x in[a, b]. By applying Lemma 30.5, we obtain the estimate
— b b _
F(t) rte) | fix, to) dx\< { fx, t) f(x, to) £ (x, to) dx
t— to la a t— to
<e(b—a),
which establishes equation (31.5). Q.E.D.
Sometimes the parameter ¢ enters in the limits of integration as well as in
the integrand. The next result considers this possibility. In its proof we
shall make use of a very special case of the Chain Rule (to be discussed in
Chapter VID) which will be familiar to the reader.
31.8 Leisniz’s ForMuLA. Suppose that f and f, are continuous on D toR
and that « and B are functions which are differentiable on the interval [c, d]
and have values in [a, b]. If @ is defined on [c, d] by
Bt)
(31.6) o(t)= [.. f(x, t) dx,
We can write the integral on the left of (31.8) in the form of the sum
Applying the First Mean Value Theorem 30.6 twice, we infer that there
exists a number x} in [x;-:, x] and a number fj, in [t.-1, & ] such that
ty
Hence we have
G(f)=0.
A linear functional G on C(J) is said to be bounded if there exists M = 0
such that
IG(f)| = M llflls
for all fe CJ).
31.10 Lemma. If g is a monotone increasing function on J and if G is
defined for f in C(J) by
G1.9) aih= [ fas,
then G is a bounded positive linear functional on C(J).
t The rest of this section may be omitted on a first reading.
248 FUNCTIONS OF ONE VARIABLE
Gin(x)= 1, a=x<t,
(31.10) =1-n(x-1), t<x<t+1/n,
=0, tt+l1lin<x<b.
OS gim(X) = Gn(x) = 1,
+ FREDERIC Riesz (1880-1955), a brilliant Hungarian mathematician, was one of the founders
of topology and functional analysis. He also made beautiful contributions to potential,
ergodic, and integration theory.
31. FURTHER PROPERTIES OF THE INTEGRAL 249
1
|
|
|
|
|
|
|
L @ |
t t+1/n 6
whence it follows that g(t) < g(s). We define g(a) =0 and if ¢,,, denotes
the function g,.(x) = 1, x € J, then we set g(b) = G(q,.). _Ifa<t<bandn
is sufficiently large, then for all x in J we have
[ tag-sasi, 8)|<e.
Now let P = (to, fi, ..., tm) be a partition of J into distinct points which is a
refinement of P, such that sup {t, —t-1}<3 8&(e) and let n be a natural
number so large that
2in< inf {ty _ tes}.
|
|
|
|
|
|
|
|
|
a aaa th tht lfn b
tpi, tifn
Figure 31.3. Graph of @y.n — Putin
{f(x)-f*(x)[<e.
If the x belongs to two intervals in (31.11), then & = x =< & +1/n for some
HG (ign) — Gu-1n)}
£8 (te) — 8 (te—1)}| <€/2m [fs
for k =2,3,...,m. Applying G to the function f* defined by equation
(31.13) and recalling that g(t.) =0, we obtain
and g is continuous from the right at every interior point of J. With these
additional conditions, there is a one-one correspondence between positive
functionals and increasing functions.
Exercises
31.A. If a>0, show directly that
tim | e“ dx =0.
n oO
lim { e™ dx =0.
What happens if a = 0?
31.F. Let h,(x)=nxe™ for x €[0, 1] and let h(x)=0. Show that
o 0
31.H. Give example to show that the conclusion in the preceding exercise may
fail if the convergence is not uniform.
31.1. If a >0, show that fo “(log t)’ dt=2/(a +1).
31.J. Suppose that f and its partial derivative f, are continuous for (x, ft) in
[a, b]x[c, d]. Apply the Interchange Theorem 31.9 to
31.K. Use the Fundamental Theorem 30.8 to show that if a sequence (f,) of
functions converges on J to a function f and if the derivatives (f/) are continuous
and converge uniformly on J to a function g, then f’ exists and equals g. (This
result is less general than Theorem 28.5, but it is easier to establish.)
31.L. Let {r,, 1,...5 tm .-.} be an enumeration of the rational numbers in I. Let
fa, be defined to be 1 if x €{r,,..., 7} and to be 0 otherwise. Then f, is Riemann
integrable on I and the scquence (f,,) converges monotonely to the Dirichlet discon-
tinuous function f (which equals 1 on EN Q and equals 0 on I\Q. Hence the
monotone limit of a sequence of Riemann integrable functions does not need to be
Riemann integrable.
31.M. Let g be a fixed monotone increasing function on J=[a, b]. If f is any
function which is integrable with respect to g on J, then we define ||f||, by
Inh= [inlas,
Show that the following ‘“‘norm properties” are satisfied:
(a) [lfll= 0;
(b) If f(x) =0 for all x € J, then ||f]], = 0;
(c) IfceR, then |lef|l; =|e| [lfll:;
(@) | fl Walls |= fe Alh = IF + lh
However, it is possible to have ||f||, = 0 without having f(x) =O for all x €J. (Can this
occur when g(x) =x?)
31.N. If g is monotone increasing on J, and iff and f,, n €.N, are functions which
are integrable with respect to g, then we say that the sequence (f,) comverges in
mean (with respect to g) in case
lf. fli0.
(The notation here is the same as in the preceding exercise.) Show that if (f.)
converges in mean to f, then
fy=1, xe,
=0, xéI..
Prove that the sequence (f,) converges in mean [with respect to g(x) =x] to the
254 FUNCTIONS OF ONE VARIABLE
zero function on J, but that the sequence (f,) does not converge uniformly. Indeed,
the sequence (f,) does not converge at any point!
31.P. Let g be monotone increasing on J =[a, b]. Iff and h are integrable with
respect to g on J to R, we define the inner product (f, h) off and h by the formula
FW = [ fs 9 dgto.
31.V. Let J,=[a, b] and J,=[c, d]. Assume that the real valued function g is
monotone on J,, that h is monotone on J, and that f is continuous on J,x J».
Define G on J, and H on J, by
steno = [ fle(), 1) dx
Show that S(@) belongs to C(J,), but that, in general, S is not a linear transforma-
tion in the sense of Exercise 31.W. However, show that S sends the collection B,
256 FUNCTIONS OF ONE VARIABLE
Project
31.a. This project establishes the existence of a unique solution of a first order
differential equation under the presence of a Lipschitz condition. Let Q¢ R’ be
open and let f:Q— R be continuous and satisfy the Lipschitz condition: |f(x, y)—
f(x y= K |y —y'| for all points (x, y), (x, y’) in Q. Let I be a closed cell
T={(x, y):|x-a]| <a, ly—b| < B}
contained in 0 and suppose that Ma = B, where |f(x, y)|-< M for (x, y)eL
(a) If J=[a—a, a+a], define ¢.(x) = b for x € J and, if nEN, define
e'(x)=f(x,e@)) — forxeJ.
(d) If % is continuous on J and satisfies
le@)-¥@=K| [leo-vola
K" n
= Fy
lle — lh bx—al’.
Hence |lp — ol, = |e — bl, K"a"/n!, whence it follows that p(x) = W(x) for all x e J.
Unbounded Functions
(32.1) I= {it
We shall define the improper integral of f over J =[a, b] to be the limit of I,
asc—a.
32.1 DeFIniITION. Suppose that the Riemann integral in (32.1) exists
for each c in (a, b]. Suppose that there exists a real number J such that for
every « >0 there is a 8(¢) >0 such that if a<c<a+8&(e) then |I.—I|<e.
In this case we say that I is the improper integral of f over J =[a, b] and
we sometimes denote the value I of this improper integral by
asa"
ati
[ od = 1 (1-¢**?)
c vex atl .
{is [9
p- b
a p+
exist, then we define the improper integral of f over [a, b] to be their sum.
In the limit notation, we define the improper integral of f over [a, b] to be
p-e b
(32.3) lim | f(x) dx + 80+
230+ Jy
lim | Jois f(x) dx.
It is clear that if those two limits exist, then the single limit
also exists and has the same value. However, the existence of the limit
(32.4) does not imply the existence of (32.3). For example, if f is defined
for x €[~1, 1], x#0, by f(x) = 1/x’, then it is easily seen that
Ce (a E)E-3-
[, (5) ar+[ (3) ax =( 2 Netto)
for all ¢ satisfying O0<e<1. However, we have seen in Example 32.2(c)
9
Infinite Integrals
It is important to extend the integral to certain functions which are
defined on unbounded sets. For example, iff is defined on {x Ee R:x = a}
to R and is Riemann integrable over [a, c] for every c >a, we let I, be the
partial integral given by
(32.5) L= [it
We shall now define the “infinite integral” of f for x = a to be the limit of
J, as c increases.
32.3. DeFINITION. If f is Riemann integrable over [a, c] for each c >a,
let I. be the partial integral given by (32.5). A real number I is said to be
the infinite integral of f over {x :x = a} if for every « >0, there exists a real
number M(e) such that if c > M(e) then |I—I.|<e. In this case we denote
Iby
=| £ dx =log clog a.
| e * dx =—(e“—1);
Qo
As in the case of the improper integral, the existence of both of the limits in
(32.8) implies the existence of the limit
(32.9) tim|{ [" f(x) dx+ [#00 ax} = tim [- f(x) dx,
cote
and the equality of (32.8) and (32.9). The limit in (32.9), when it exists, is
often called the Cauchy principal value of the infinite integral over R and is
denoted by
| x dx =(c*?—c”?)=0
for all c. Thus the Cauchy principal value of the infinite integral for
f(x) =x exists and equals 0, but the infinite integral of this function does
not exist, since neither of the infinite integrals in (32.7) exists.
We now obtain a few conditions for the existence of the infinite integral
over the set {x:x = a}. These results can also be applied to give condi-
tions for the infinite integral over R, since the latter involves consideration
of infinite integrals over the sets {x:x < a} and {x:x => a}. First we state
the Cauchy Criterion.
32.5 CAUCHY CRITERION. Suppose that f is integrable over [a, c] for all
c2za. Then the infinite integral
[s
exists if and only if for every « >0 there exists a K(e) such that ifb=>c>=
K(e), then
(32.11) {| <e,
It is seen that (J,) is a Cauchy sequence of real numbers. If J = lim (J,) and
e >0, then there exists N(e) such that if n = N(e), then |I-I.|<e. Let
M(e)=sup {K(e),a+N(e)}+1 and let c>M(e). Then there exists a
natural number n = N(e) such that K(e)<a+n<c. Therefore the par-
tial integral I. is given by
w= [r=[ore[
whence it follows that [I — I,|<2e. OED.
In the important case where f(x)=0O for all x =a, the next result
provides a useful test.
32.6 THEOREM. Suppose that f(x) = 0 for all x = a and that f is inte-
grable over [a,c] for allc =a. Then the infinite integral of f exists if and
only if the set {I,:¢ =< a} is bounded. In this case
32.7 COMPARISON Test. Suppose that f and g are integrable over [a, c]
for allc = a and that |f(x)| = g(x) for allx =a. If the infinite integral of g
exists, then the infinite integral of f exists and
i= [os
proor. If a<c<b, then it follows from Lemma 30.5 that |f| is inte-
grable on [c, b] and that
|Lels[irl=[e
b b b
It follows from the Cauchy Criterion 32.5 that the infinite integrals of f and
|f| exist. Moreover, we have
J =| ifl= | g. QED.
(32.12) a Oe)
dim oq) * °°
proor. In view of the relation (32.12) we infer that there exist positive
numbers A <B and K = a such that
Ag(x) = f(x) = Bg(x) for x => K.
The Comparison Test 32.7 and this relation show that both or neither of
the infinite integrals fx” f, {x g exist. Since both f and g are integrable on
[a, K], the statement follows. OED.
32.9 DrricHLet’s Test. Suppose that f is continuous for x = a, that the
partial integrals
I, = { f, c2a,
are bounded, and that ¢ is monotone decreasing to zero asx —> +. Then
the infinite integral Jz” fo exists.
PROOF. Let A be a bound for the set {|I.|:c = a}. Ife >0, let K(e) be
such that ifx = K(e),then0 < g(x) <e/2A. Ifb=c = K(e), thenit follows
from Exercise 30.N that there exists a number é in [c, b] such that
[te = o(c)|
In view of the estimate
it follows that
| =|I,-L|<2A,
[te J<e
when b= c both exceed K(e). We can then apply the Cauchy Criterion
32.5. QED.
32.10 Exampces. (a) If f(x)=1/(1+x7) and g(x)=1/x? for x =a>0,
then 0 < f(x) = g(x). Since we have already seen in Example 32.4(b) that
the infinite integral J1* (1/x’) dx exists, it follows from the Comparison
Test 32.7 that the infinite integral [}* (1/(1+ x*)) dx also exists. (This
could be shown directly by noting that
also exists. This time, a direct evaluation of the partial integrals is not
possible, using elementary functions. However, it will be seen later that
this infinite integral equals 4V7.
(c) Let p>0 and consider the existence of the infinite integral
It is clear that the integral over [0, 1] exists, so we shall examine only the
integral over {x:x = 1}. If we make the substitution t= x” and apply the
Change of Variable Theorem 30.12, we obtain
. 1(°sint
sin x” dx=>/ ——= dt.
J 2s vt
The preceding example shows that the integral on the right converges when
c > +; hence it follows that fi” sin x” dx exists. (It should be observed
that the integrand does not converge to 0 as x — +0.)
(e) Suppose that a = 1 and let I'(a) be defined by the integral
+o0
(32.13) (a) = { e*x"* dx.
In order to see that this infinite integral exists, consider the function
g(x) =1/x? forx =1. Since
{ e*x""* dx
O+
exists when 0<a<1. Hence we can extend the definition of the Gamma
function to be given for all a >O by an integral of the form of (32.13)
provided it is interpreted as a sum
a +00
| etx! dx+ { ex" ' dx
0. + a
Absolute Convergence
If f is Riemann integrable on [a, c] for every c = a, then it follows from
Theorem 30.4(a) that |f|, the absolute value of f, is also Riemann inte-
grable on [a,c] for c = a. It follows from the Comparison Test 32.7 that
if the infinite integral
(32.14) | [f(x)} dx
exists, then the infinite integral
(32.15) | f(x) dx
also exists and is bounded in absolute value by (32.14).
32.11 DeFinirion. If the infinite integral (32.14) exists, then we say
that f is absolutely integrable over {x :x = a}, or that the infinite integral
(32.15) is absolutely convergent.
We have remarked that if f is absolutely integrable over {x :x > a}, then
the infinite integral (32.15) exists. The converse is not true, however, as
may be seen by considering the integral
[0% as.
+00 Ot
sin X
at x
ke sin x _{ en boi 1 1
I ax | + +{o =isetact +e},
whence it follows (see 16.11(c)) that the function f(x) =sin x/x is not abso-
lutely integrable over {x:x = 7}.
We observe that the Comparison Test 32.7 in fact establishes the abso-
lute convergence of the infinite integral of f over the interval [a, +).
Exercises
© | ‘dx
aitan. ) | ‘dx
gyn
' xdx ‘logx
(c) i Gx)’ (d) i Te dx,
© [2Pee © [ae
x dx
32.E. Determine the values of p and q for which the following integrals con-
verge:
32.G. For what values of p and q are the following integrals convergent? For
what values are they absolutely convergent?
“= xP ™ sinx
(a) { Tan & (b) { xi dx,
* sin x? * 1—cosx
(c) << dx, (d) i x dx.
32.H. If f is integrable on any interval [0,c] for c>0, show that the infinite
integral {>* f exists if and only if the infinite integral [5* f exists.
32.1. Give an example where the infinite integral {>° f exists but where f is not
bounded on the set {x :x = O}.
32.J. If f is monotone and the infinite integral [> f exists, then xf(x)—>0 as
xX > +0,
each « >0 there is a number K(e) such that if b> c = K(e) andteJ, then
Suppose that for each te J, the function (x, t) is monotone decreasing for
X = a, and converges to 0 as x > +~ uniformly forte J. Then the integral
and if we define M(x) =(1+x7)"', then |f(x, t)|< M(x). Since the infinite
integral of M on [0, +) exists, it follows from the Weierstrass M-test that
the infinite integral
+0
cos 1x
5 dx
I, 1+x?
converges uniformly for t in an interval [0, 8B] for any B>0. However, it
does not converge uniformly on {t¢ R:t= 0}. (See Exercise 33.A.)
(c) If f(x, )=e™ sin x for x =O andt= y>0, then
If(x, \lse™se™
If we set M(x) =e ™, then the Weierstrass M-test implies that the integral
40
| e-™ sin x dx
0
converges uniformly for t => y > 0 and an elementary calculation shows that
it converges to (1+ 17)". (Note that if t=0, then the integral no longer
converges.)
(d) Consider the infinite integral
“=. sin x
| e *—— dx fort = 0,
0 x
where we interpret the integrand to be 1 for x =0. Since the integrand is
270 FUNCTIONS OF ONE VARIABLE
Gw=[ Kd ax
is uniformly convergent on J. Then F is differentiable on J and F'=G. In
symbols:
F.(t)= ac t) dx,
then it follows from Theorem 31.7 that F, is differentiable and that
+n
Fi(t) -| g(x, t) dx.
By hypothesis, the sequence (F,) converges on J to F and the sequence
(Fi) converges uniformly on J to G. It follows from Theorem 28.5 that F
is differentiable on J and that F’= G. OED.
i={ e* dx
t lo
x
—px
i
+00
= { xe”™ dx.
t lo
T(a) = { x* te dx
0
(w= -|| ao
+0
Fu e-™ cos ux dx = —3 uz 0.
Arc tan (0) =0 and infer that C=0. Hence, if t>0 and u = 0, then
40 .
_i, SIN UX
Arc tan (u/t) = { e — dx.
0
(e) Now hold u >0 fixed in the last formula and observe, as in Example
33.5(d) that the integral converges uniformly for t = 0 so that the limit is
continuous fort = 0. If we let t > 0+, we obtain the important formula
(33.5) ( = tim fe
PROOF. It follows from the Comparison Test 32.7 that the infinite
integrals
{ f, ( fy neéeN,
exist. If ¢ >0, let K be chosen such that fx° M <e, from which it follows
that
Since f(x) = lim (f.(x)) for all x €[a, K] it follows from the Bounded Con-
vergence Theorem 31.3 that Jo f= lim, Jz f.. Therefore, we have
[or [el=| =
[rf +2e,
[fose{] jam
PROOF. Since the sequence (f,) is monotone increasing, we infer that
the sequence (Jz° f,:néN) is also monotone increasing. If f has an
integral over {x:x = a}, then the Dominated Convergence Theorem (with
M = f) shows that
[ f= tim | fa
Conversely, suppose that the set of infinite integrals is bounded and let $
be the supremum of this set. If c >a, then the Monotone Convergence
Theorem 31.4 implies that
[t=syp['f=sep
(see[ 1}
= sup {sup| fs} = sup| fas QED.
ec 1) dx = [te t) dx
for each b> aandt=a. Therefore, it follows from the Comparison Test
32.7, that
If we repeat this argument and apply equation (33.7’), we obtain the reverse
inequality. Therefore, the equality must hold. Q.E.D.
33.13 THEOREM. Suppose fis continuous forx = a, t = a, and that there
exist positive functions M and N such that the infinite integrals \3° M and J.° N
exist. If the inequality
(33.8) [f(x, O| = M(x)N(), Xx2a, t2a,
holds, then the iterated integrals in (33.6) both exist and are equal.
proor. Let g be defined for x = a, t= a by g(x, t)=f(x, th + M(x)N(D)
so that
0 = g(x, t)= 2M(x)N(t).
Since N is bounded on each interval [a, 8], it follows from the inequality
(33.8) and the Weierstrass M-test 33.3 that the integral
{a t) dx
exists uniformly for t¢[a, B]. By applying Theorem 33.7, we observe that
equation (33.7’) holds (with f replaced by g) for each B >a. Similarly,
(33.7) holds (with f replaced by g) for each b= a. Also the Comparison
Test 32.7 implies that the iterated integrals in (33.6) exist (with f replaced by
g). We deduce from Theorem 33.12 that these iterated integrals of g are
equal. But this implies that the iterated integrals of f exist and are equal.
QED.
The preceding results deal with the case that the iterated integrals are
absolutely convergent. We now present a result which treats the case of
non-absolute convergence.
33.14 THEOREM. Suppose that the real-valued function f is continuous
in (x, t) forx = a andi >= a and that the infinite integrals
(33.10) { F(x, B) dx
converges uniformly for B = a. Then both iterated infinite integrals exist and
are equal.
prRoor. Since the infinite integral (33.10) is uniformly convergent for
8 = a, if « >0 there exists a number A, = a such that if A = A., then
whence it follows that the limit of Jz” F(x, B) dx exists as B > +. After
applying Theorem 33.7 to the uniform convergence of the first integral in
(33.9), we have
Since both terms on the left side of (33.11) have limits as 8 ~+* we
conclude, on passing to the limit, that
(b) If g(x, t)=e™, for x = 0 and t = 0, then we are in trouble on the lines
x=0Oandt=0. However, if a>0,a>0, and x >a and t>a, then we
observe that
e xt e THE g—xtl2 < ere a?
If we set M(x) =e"*” and N(t)=e"”, then Theorem 33.13 implies that
xe —(1+y?)x?
dx = -~f
i ~ 21+y’) |x za 2(1+y*)’
it follows that
1a?
2€ 6
+e gary?
tty? y= [Pen{[
y=
+00
€
ya
o
+00
xe
ae
ay} de
y xX.
It follows that
te grey 2 2
lty 7 dy =2e* i e” dx.
0
33. UNIFORM CONVERGENCE AND INFINITE INTEGRALS 279
We want to take the limit as a > 0. In the last integral this can evidently
be done, and we obtain Jo” (e~™ sin x/x) dx. In view of the fact that
e~” cos a is dominated by 1 for y = 0, and the integral fi°(1/(1+ y’)) dy
exists, we can use the Dominated Convergence Theorem 33.10 to conclude
that
[ase
cosa |
lim
a0 Jy ity’ ree
The second integral is a bit more troublesome as the same type of estimate
shows that
ye™ sina
1+y’ ~14+y"’
“77
and the dominant function is not integrable; hence we must do better. Since
u <e” and |sin u| < u for u = 0, we infer that |e~” sin al =< 1/y, whence we
obtain the sharper estimate
ye” sina 1
< 3.
1+y’ 1+y
Wecan now employ the Dominated Convergence Theorem to take the limit
280 FUNCTIONS OF ONE VARIABLE
*° dy -[-S sinx
am — Are tan a = dx.
lo 1+y? lo x
We now want to take the limit as a0. This time we cannot use the
Dominated Convergence Theorem, since J3” x~* sin x dx is not absolutely
convergent. Although the convergence of e “ to 1 as a > 0 is monotone,
the fact that sin x takes both signs implies that the convergence of the entire
integrand is not monotone. Fortunately, we have already seen in Example
33.5(d) that the convergence of the integral is uniform for oa = 0.
According to Theorem 33.6, the integral is continuous for a = 0 and hence
we once more obtain the formula
(33.14) { SID X dx = 4.
0 x
Exercises
33.A. Show that the integral {5° x‘e™* dx converges uniformly for ¢ in an interval
[0, B] but that it does not converge uniformly for t = 0.
33.B. Show that the integral
[ sin (tx) ax
0 x
is uniformly convergent for t = 1, but that it is not absolutely convergent for any of
these values of t.
33.C. For what values of t do the following infinite integrals converge uniformly?
@) "dx
[oes w [,
*° dx
() { er dy,
0
(f) { Fee dy,
o (OX
Differentiate with respect to t and integrate by parts to prove that F’(t) = (—1/2)iF(t).
Then find F(t) and, after a change of variable, establish the formula
+
| e cos tx dx =$V a/c e7""*, c>0.
12,
Differentiate and change variables to show that G'(t) = —2G(t). Then find G(t) and
establish the formula te
{ ert? dy lag el,
oO
1 atl
=Tloss> ja|>4,
co 2 [em Pacn
33.J. For neN let f, be defined by
fa(x) = 1/x, lax<n,
=0, x>n.
Each f, has an integral for x = 1 and the sequence (f,) is bounded, monotone
increasing, and converges uniformly to a continuous function which is not integrable
over {x ER: x = 1}.
282 FUNCTIONS OF ONE VARIABLE
Each g, has an integral over x = 0 and the sequence (g,) is bounded and converges to
a function g which has an integral over x = 0, but it is not true that
iim 2. = { g.
oO o
B too
| {{ f(x, t)as} dx <0 for each B = 1.
1 1
[= kl
dy =Fe
1+y?
33.N. By considering the iterated integrals of e“°” sin y over the quadrant
x = 0, y = 0, establish the formula
“em. (*siny
{ Taw ® [ aty a>o.
Projects
33.a. This project treats the Gamma function, which was introduced in Example
32.10(e). Recall that T' is defined for x in P ={x € R : x >0} by the integral
+e0
T@)= | O+ er dt.
We have already seen that this integral converges for x € P and that '(@) =z.
(a) Show that T is continuous on P.
(b) Prove that P(x +1)=xI'(x) forx e€ P. (Hint: integrate by parts on the interval
[e, c].)
(c) Show that [(n+ 1) =n! forneN.
(d) Show that lim,..., x['(x)=1. Hence it follows that T is not bounded to the
right of x =0.
(e) Show that T is differentiable on P and that the second derivative is always
positive. (Hence I is a convex function on P.}
33. UNIFORM CONVERGENCE AND INFINITE INTEGRALS 283
33.8. We introduce the Beta function of Euler. Let B(x, y) be defined for x, y in
P={xeR:x
>0} by
L
B(x, y)= { rd —ty dt.
Ifx = 1 and y = 1, this integral is proper, but if0<x <1 or0<y <1, the integral is
improper.
(a) Establish the convergence of the integral for x, y in P.
(b) Prove that B(x, y) = B(y, x).
(c) Show that if x, y belong to P, then
(aP2)—
B(x, y)= 2[ O+ (sin "(cos t)*"' dt
and
+20 x1
Bow | aa
(d) By integrating the positive function
ft, u) = ete? Pend yd
over {(t, u):t?+u’=R?’, t= 0, u = 0} and comparing this integral with the integral
over inscribed and circumscribed squares, derive the important formula
B(x, y) _T@Py)
Ta ty)”
33.y. This and the next project present a few of the properties of the Laplacet
transform, which is important both for theoretical and applied mathematics. To
simplify the discussion, we shall restrict our attention to continuous functions f
defined on {t€R:t=0}toR. The Laplace transform of f is the function f defined at
the real number s by the formula
(a) Suppose there exists a real number c such that |f(t)| = e“ for sufficiently large t.
Then the integral defining the Laplace transform f converges for s>c. Moreover,
it converges uniformly for s =c+6 if6>0.
(b) If f satisfies the boundedness condition in part (a), then f is continuous and has
a derivative for s >c given by the formula
a(s)
-
== 13flsia).
Similarly, if h(t) =(1/a)f(t/a), then h converges for s > sp/a and
h(s) = flas).
(f) Suppose that the Laplace transform f of f exists for s > s, and let f be defined
fort <Otobeequalto0. Ifb >0 andif g(t) = f(t — b), then g converges for s > s, and
&(s) =e f(s).
Similarly, if h(t) = e"f(t) for any real b, then h converges for 5 >s)+b and
A(s) = f(s—b).
33.6. This project continues the preceding one and makes use of its results.
(a) Establish the following short table of Laplace transforms.
1 V/s s>0,
t" nt/s"** s>0,
e* (s—a)y? s>a,
te" nif(s-—ay""" s>a,
: a
sin at vt+ae all s,
s
cos at sta all s,
sinh at 33 s>a,
cosh at z s z s>a,
; sa
sint
1 Arc tan (1/s) s>0.
33. UNIFORM CONVERGENCE AND INFINITE INTEGRALS 285
(b) Suppose that f and f’ are continuous for t = 0, that f converges for s > sy and
that e“f(t) > 0 as t-> + for alls >s.. Then the Laplace transform of f' exists for
5 > S$, and
fP(s) = s7f(s)—sf(0)—f'().
(d) When all or part of an integrand is seen to be a Laplace transform, the integral
can sometimes be evaluated by changing the order of integration. Use this method
to evaluate the integral
[oasis
0
sinS
s
Assume that this equation has a solution y such that the Laplace transforms of y and
y’ exists for sufficiently large s. In this case the transform of y must satisfy the
equation
ya 5t3
WS) = ETDs 1)
Use partial fractions and the table in (a) to obtain y(t)=Je'—3e “, which can be
directly verified to be a solution.
(f) Find the solution of the equation
286
34 CONVERGENCE OF INFINITE SERIES 287
S2=Sitx2. (=x1+Xz2),
both to denote the infinite series generated by the sequence X = (x,) and also to
denote lim S in the case that this infinite series is convergent. In actual practice,
the double use of these notations does not lead to confusion, provided it is
understood that the convergence of the series must be established.
The reader should guard against confusing the words ‘“‘sequence’’ and ‘‘series.”
In non-mathematical language, these words are interchangeable; in mathematics,
however, they are not synonyms. According to our definition, an infinite series is a
sequence S obtained from a given sequence X according to a special procedure that
was stated above. There are many other ways of generating new sequences and
attaching ‘‘sums”’ to the given sequence X. The reader should consult books on
divergent series, asymptotic series, and the summability of series for examples of
such theories.
A final word on notational matters. Although we generally index the elements
of the series by natural numbers, it is sometimes more convenient to start with
n=0, with n=5, or with n=k. When such is the case, we shall denote the
resulting series or their sums by notations such as
34.2 THEOREM. (a) If the series ¥ (x.) and ¥ (ya) converge, then the
series ¥ (xX. + yn) converges and the sums are related by the formula
[[xno1l]
+ [Xns2l| ++ + ++ [xml] <e-
According to the Triangle Inequality, the left-hand side of this relation
dominates
[|Xn+1
t+ Xns2t+ + Xml.
We apply the sufficiency of the Cauchy Criterion to conclude that the }' (x,)
must converge. Q.E.D.
iL 1
Sky = 1 ta
Saewtatylytig
=Tt5tZt Gas st ghyl
3+ G> 5, 42(9)1)_= 145.4 2
Therefore, the subsequence (s,,) is not bounded and the harmonic series
does not converge.
(c) We now treat the p-series ))(1/n’) where O<p=1 and use the
elementary inequality n’ <n, for ne N. From this it follows that, when
O0<p <1, then
A 7p
Slr
= ; neN.,
n
Since the partial sums of the harmonic series are not bounded, this
inequality shows that the partial sums of ¥ (1/n") are not bounded for
O0<p<=1. Hence the series diverges for these values of p.
(d) Consider the p-series for p>1. Since the partial sums are
monotone, it is sufficient to show that some subsequence remains bounded
in order to establish the convergence of the series. If ki = 2'—1=1, then
S,=1. If k2=2’-1=3, we have
1./1,1 2 1
sy qt (get ge) <1 tg Lt ge,
and if k;=2°—1, we have
1,1,1,1 4 1 1
Ska = Sk, + (StS Sts) <atH<lt gat ge.
34 CONVERGENCE OF INFINITE SERIES 291
11 _ 21 _1 0 1
k?+k k(k+1) k eT
This expression shows that the partial sums are telescoping and hence
Sn
4104 L411
~7-2°2-3¢ Tn@tD1 ntl:
It follows that the sequence (s,) is convergent to 1.
Rearrangements of Series
Loosely speaking, a rearrangement of a series is another series which is
obtained from the given one by using all of the terms exactly once, but
scrambling the order in which the terms are taken. For example, the
harmonic series
11,1 1
pratgt ty
has rearrangements
1,1,1,1 1 1
2tttat3t
tant init
1,1,1,1,1,1
llyallt = + yell = K,
whence it follows that ¥ (yn) is absolutely convergent to some element y of
R?’. We wish to show that x=y. If e>0, let N(e) be such that if
m>nz N(e), and 5 =x:+++++x,, then |x—s,||<e and
m
Dd. |pall<e.
k=n+1
Choose a partial sum ¢, of ¥ (ym) such that |ly —14||<« and such that each
X1,X2,...,;%n Occurs in & Having done this, choose m> ni so large that
every yx appearing in ¢, also appears in s,. Therefore
Exercises
34.A. Let ¥ (a,) be a given series and let ¥ (b,) be one in which the terms are the
same as those in ¥ (a,), except those for which a, = 0 have been omitted. Show
that } (a,) converges to a number A if and only if ¥ (b,) converges to A.
34.B. Show that the convergence of a series is not affected by changing a finite
number of its terms. (Of course, the sum may well be changed.)
34.C. Show that grouping the terms of a convergent series by introducing
parentheses containing a finite number of terms does not destroy the convergence or
34 CONVERGENCE OF INFINITE SERIES 293
the value of the limit. However, grouping terms in a divergent series can produce
convergence.
34.D. Show that if a convergent series of real numbers contains only a finite
number of negative terms, then it is absolutely convergent.
34.E. Show that if a series of real numbers is conditionally convergent, then the
series of positive terms is divergent and the series of negative terms is divergent.
34.F. By using partial fractions, show that
1
(b) zy alwe iw e3) “4°
2" a2"
n 1
converges. This result is often called the Cauchy Condensation Test. (Hint: group
the terms into blocks as in Examples 34.8(b, d).)
34.L. Use the Cauchy Condensation Test to discuss the convergence of the
p-series ) (1/n?).
34.M. Use the Cauchy Condensation Test to show that the series
1 1
» nlogn’ » n(log n)(log log n)’
y—_—_,
1
n(log n)(log log n)(log log log n)
are divergent.
34.N. Show that if c >1, the series
yy,
n(log n) YF n(log n)(log log n)°
are convergent.
294 INFINITE SERIES
(35.1) Xn Yn forn = K,
(b) If the limit in (35.2) is zero and > (y,) is convergent, then ¥. (xa) is
convergent.
PROOF. It follows from (35.2) that for some real number c >1 and
some natural number K, then
If we apply the Comparison Test 35.1 twice, we obtain the assertion in part
(a). The proof of (b) is similar and will be omitted. OED.
35.3. Roor Test. (a) IfX =(x,) is a sequence in R? and there exists a
positive number r<1 and a natural number K such that
(35.3) [x.""<r forn=K,
then the series ¥ (xn) is absolutely convergent.
(b) If there exists a number r>1 and a natural number K such that
(35.4) I|xn[]"" =r forn> K,
then the series ¥ (xn) is divergent.
PROOF. (a) If (35.3) holds, then we have |{x,|] <r". Now for 0 <r <1,
the series ) (r") is convergent, as was seen in Example 34.8(a). Hence it
follows from the Comparison Test that ¥ (x,) is absolutely convergent.
(b) If (35.4) holds, then ||x,|| =r". However, since r > 1, it is false that
lim (|lxx{l) = 0. QED.
In addition to establishing the convergence of >) (x,), the root test can be
used to obtain an estimate of the rapidity of convergence. This estimate is
useful in numerical computations and in some theoretical estimates as well.
r ntl
{|in — Sn || = [foenert+ + Xml < [loenealf-+ > + [fxn] sr? +--+ tr <
l-r
Now take the limit with respect to m to obtain (35.5). QE.D.
296 INFINITE SERIES
PROOF. It follows that if the limit in (35.6) exists and is less than 1, then
there is a real number r; with r<r,< 1 anda natural number K such that
I|xn[["" = 11 for n= K.
In this case the series is absolutely convergent. If this limit exceeds 1, then
there is a real number r.>1 and a natural number K such that
x.'"=r forn=K,
in which case series is divergent. QED.
PROOF. The relation (35.7) implies that ||x.+«|| << r* ||xn|| when n = K.
Therefore, if m = n= K, we have
r= lim (lias),
I[xelh
whenever the limit exists. Then the series ¥ (xn) is absolutely convergent
when r<1 and divergent when r> 1.
PROOF. Suppose that the limit exists andr<1. If 1, satisfies r<r.<1,
then there is a natural number K such that
[|xn-+1l]
<n forn>K.
Ilan
In this case Theorem 35.6 establishes the absolute convergence of the
series. If r>1, andif rz satisfies 1<r.<,r, then there is a natural number K
such that
Liesl to for n = K,
Raabe’s Test
If r= 1, both the Ratio and the Root Tests fail and either convergence or
divergence may take place. (See Example 35.13(d)). For some purposes
298 INFINITE SERIES
it is useful to have a more delicate form of the Ratio Test for the case when
r=1. The next result, which is attributed to Raabe?, is usually adequate.
35.9 Raase’s Test. (a) If X=(xn) is a sequence of non-zero
elements of R? and there is a real number a > 1 and a natural number K such
that
[xu+al|>c/n, n= K.
Since the harmonic series ¥ (1/n) diverges, then ’ (x,) cannot be absolutely
convergent. QED.
t JospPH L. RAABE (1801-1859) was born in the Ukraine and taught at Ziirich. He worked in
both geometry and analysis.
35 TESTS FOR ABSOLUTE CONVERGENCE 299
35.14 a= lim(n( 1
G14) Pl )),
whenever this limit exists. Then ©, (xn) is absolutely convergent when a> 1
and is not absolutely convergent when a <1.
PROOF. Suppose the limit (35.14) exists and satisfies a >1. If ai is any
number with a>a,> 1, then there exists a natural number K such that
ee 1- forn=K
n
and Theorem 35.9 assures the absolute convergence of the series. The
case where a <1 is handled similarly and will be omitted. Q.B.D.
We shall show how the results in Theorems 35.1 to 35.12 can be applied
to the p-series, which were introduced in Example 34.8(c).
35.13. Examptes. (a) First we shall apply the Comparison Test.
Knowing that the harmonic series } (1/n) diverges, it is seen that if p = 1,
35 TESTS FOR ABSOLUTE CONVERGENCE 301
a(R
=—_ P
.
3
After using the Comparison Test 35.1, we conclude that the p-series
¥ (/n’) diverges for p <1.
(b) Now consider the case p=2; that is, the series )(1/n?). We
compare the series with the convergent series }' [1/n(n+ 1)] of Example
34.8(e). Since the relation
i et
n(n+1) nn?
holds and the terms on the left form a convergent series, we cannot apply
the Comparison Theorem directly. However, we could apply this theorem
if we compared the nth term of }[1/n(n+1)] with the (n+ 1)st term of
> (1/n’). Instead, we choose to apply the Limit Comparison Test 35.2 and
note that
it nn
n(n+1) n? n(wt+1) nt+1°
Since the limit of this quotient is 1 and } [1/n(n+ 1)] converges, then so
does the series ¥ (1/n’).
(c) Now consider the case p = 2. If we note that n’ = n’ for p = 2, then
1
p72
i
neon
Ait iW
nP on? nn? nP
2)" <ornm= 0
n?
302 INFINITE SERIES
Now it is known (see Example 14.8(e)) that the sequence (n”") converges
to 1. Hence we have
tim((s) =
lfm
so that the Root Test (in the form of Corollary 35.5) does not apply.
In the same way, since
1 o.1 on 1
(n+l? n? (th? (1+1/n)P’
and since the sequence ((1+1/n)’) converges to 1, the Ratio Test (in the
form of Corollary 35.8) does not apply.
(e) In desperation, we apply Raabe’s Test to the p-series for integral
values of p. First, we attempt to use Corollary 35.11. Observe that
2-H)
mf HEEB -a(1(1-gh))
If p is an integer, then we can use the Binomial Theorem to obtain an
estimate for the last term. In fact,
1 \'\_ (p-1)
n(1-(1- 44) )- n(1-1+ Bodie ye )
If we take the limit with respect to n, we obtain p. Hence this corollary to
Raabe’s Test shows that the series converges for integral values of p = 2
(and, if the Binomial Theorem is known for non-integral values of p, this
could be improved). :
(f) Finally, we apply the Integral Test to the p-series. Let f(t)=1t7° and
recall that
[4 de=1og
(n) —tog (1),
[
a
From these relations we see that the p-series converges if p>1 and
diverges if p= 1.
Exercises
35.A. Establish the convergence or the divergence of the series whose nth term
is given by
1 n
) GeD@+2’ ©) GeDm@+d)’
35 TESTS FOR ABSOLUTE CONVERGENCE 303
is convergent, but that both the Ratio and the Root Tests fail to apply.
35.F. If a and b are positive numbers, then
a
(an + by’
n! (nt
@ 55-7 @ntty’ ©) Qnyl
2+4+++(2n) 2-4-+-(2n)
© 35 ant ) San +3)
35.H. The series given by
6) Gy GS
2 2-4 2:4-6
Then } (x,) is absolutely convergent if r<1 and divergent if r>1. [The limit
superior u =lim sup (b,) of a bounded sequence of real numbers was defined in
Section 18. It is the unique number u with the properties that (i) if u<v then
b, = v for all sufficiently large n € N, and (ii) if w<u, then w = b, for infinitely
many neN.]
304 INFINITE SERIES
35.M. Let x,>0 for neN and suppose that n(1—x,,./x.)=a+k,/n’, where
p >0 and (k,) is bounded. Then the series } (x,) converges if a > 1 and diverges if
asl.
35.N. If p>0, q>0, then the series
y et Det?) - (rn)
(q+1)(q+2)---(q+n)
converges for q>p+1 and diverges for q<p+1.
35.0. Show that the series } (2"n!)’/(2n + 1)! is divergent.
35.P. Let x, >0 and let r = lim inf (—log x,/log n). Show that ¥ (x,) converges if
r>1 and diverges if r<1.
25.Q. Suppose that none of the numbers a, b, c is a negative integer or zero.
Prove that the hypergeometric series
Project
35.a. Although infinite products do not occur as frequently as infinite series,
they are of importance in many investigations and applications. For simplicity,
we shall restrict attention here to infinite products with terms a, >0. If A =(a,) is
a sequence of strictly positive real numbers, then the infinite product, or the
sequence of partial products, generated by A is the sequence P = (p,) defined by
Pn = Pn-1@n(
= G12 ** + Onn), 00+.
If the sequence P is convergent to a non-zero number, then we call lim P the
product of the infinite product generated by A. In this case we say that the infinite
product is convergent and write either
The tests given in Section 35 all have the character that they guarantee
that, if certain hypotheses are fulfilled, then the series ¥ (x,) is absolutely
convergent. Now it is known that absolute convergence implies ordinary
convergence, but it is readily seen from an examination of special series,
306 INFINITE SERIES
such as
yey yey
n n
that convergence may take place even though absolute convergence fails.
It is desired, therefore, to have a test which yields information about
ordinary convergence. There are many such tests which apply to special
types of series. Perhaps the ones with most general applicability are
those due to Abelt and Dirichlet.
To establish these tests, we need a lemma which is sometimes called the
partial summation formula, since it corresponds to the familiar integration
by parts formula. In most applications, the sequences X and Y are both
sequences in R, but the results hold when X and Y are sequences in R?
and the inner product is used or when one of X and Y is a real sequence
and the other is in R’.
36.1 AsBevs Lemma. Let X=(x,) in R and Y=(y,) in R? be
sequences and let the partial sums of ¥ (yn) be denoted by (s.). If m=n,
then
m
(36.2) D [xa
— Xn]
+ NiELS HENRIK ABEL (1802-1829) was the son of a poor Norwegian minister. When only
twenty-two he proved the impossibility of solving the general quintic equation by radicals.
This self-taught genius also did outstanding work on series and elliptic functions before his
early death from tuberculosis.
36 FURTHER RESULTS FOR SERIES 307
PROOF. (a) Suppose that ||s,||<B for all j. Using (36.1), we have the
estimate
The next test strengthens the hypothesis on the series ¥ (y,), but relaxes
the hypothesis on ¥ (x,).
36.4 ABEL’s Test. Suppose that the series ¥ (y.) converges in R?.
(a) If the sequence X =(x,) in R is such that
= |xi{+ x |e — Xe+s|
k=1
so that |x.{<A for some A >0. Moreover, there exists N2{e) such that if
m>n => N2(e), then
Now let N3(e) =sup {Ni(e), No(e)} so that if m > n > N;(e), then we have
||Xm+18m — XnSn-1|
|[Xnta8im — Xm +18] + []Xm+18 — Xns|| + [focus — XnSn-l|
S [Xm-e1] [|S — Sl] + [Xn+1 — Xn [|S] + [Xn] []s — Seal
= Ae+eB+Ae=(QA+B)e.
<2A+B)e,
where we have used (36.4) in the last step. Since e >0 is arbitrary, the
convergence of } (x;y) is established.
(b) If the sequence (x,) is monotone and converges to x, then the series
(36.2) is telescoping and converges either to x — x; or to x1—x. QED.
If we use the same type of argument we can establish the following error
estimate.
36.5 COROLLARY. In part (b), we have the error estimate
cS n
d XiYi — a Xi
j=l i=l
E Ixnea| | —sal|-+2B |x — x04).
Alternating Series
There is a particularly important class of conditionally convergent real
series, namely those whose terms are alternately positive and negative.
36.6 DEFINITION. A sequence X =(x,) of non-zero real numbers is
alternating if the terms (—1)"x,, n=1,2,..., are all positive (or all
negative) real numbers. If a sequence X =(x,) is alternating, we say that
the series }' (x,) it generates is an alternating series.
It is useful to set x, =(—1)"z, and require that z, >0 (or z, <0) for all
n=1,2,.... The convergence of alternating series is easily treated when
the next result, proved by Leibniz, can be applied.
36.7 ALTERNATING SERIES TEsT. Let Z=(z,) be a decreasing se-
quence of strictly positive numbers with lim (z,.)=0. Then the alternating
series ¥ ((—1)"zn) is convergent. Moreover, if s is the sum of this series and s,
36 FURTHER RESULTS FOR SERIES 309
it follows that
lcosx+---+cosnx|<——7.
|sin 3x|
We can then apply Dirichlet’s Test 36.2(b) to conclude that the series
¥ (1/n) cos nx converges for all x¢{2ka:k € Z}. We note that this series
diverges when x =2km for some ke Z.
(d) Let xe R and let ke Z. Then since
It is readily seen that the series ) (y,) does not converge, but its partial
sums s, ate bounded; in fact, we have ||s,||</2. Hence Dirichlet’s Test
shows that the series ¥ (1/n)y, is convergent in R’.
Double Series
Smn = Xi.
Ms
M3
j Li 1
By analogy with Definition 34.1, we shall say that the double series > (x;)
converges to an element x in R° if for every « >0 there exists a natural
number M(e) such that if m = M(e) and n= M(e) then
IIx — Sin <e.
By analogy with Definition 34.6, we shall say that the double series © (x,) is
absolutely convergent if the double series > (|lx;||) in R is convergent.
It is an exercise to show that if a double series is absolutely convergent,
then it is convergent. Moreover, a double series is absolutely convergent if
36 FURTHER RESULTS FOR SERIES 311
(36.8) Xij
its
iMs
Le Xij, y
im]j 1
also converge to x.
PROOF. By hypothesis there exists a positive real number A which is an
upper bound for the set in (36.7). If n is fixed, we observe that
(36.9) [Sinn
— xl|<e,
In view of the relation
we infer that
=yityotr+t
yn
312 INFINITE SERIES
This proves that the first iterated sum in (36.8) exists and equals x. An
analogous proof applies to the second iterated sum. QED.
There is one additional method of summing double series that we shall
consider, namely along the diagonals i+j=n.
36.11 THEOREM. Suppose that the double series > (xj) converges
absolutely tox in R°. If we define
Cauchy Multiplication
In the process of multiplying two power series and collecting the terms
according to the powers, there arises very naturally a new method of
generating a series from two given ones. In this connection it is notation-
ally useful to have the terms of the series indexed by 0,1, 2,....
36.12 DEFINITION. If Yo (y,) and Yj-o (z;) are infinite series in R’,
36 FURTHER RESULTS FOR SERIES 313
Here the dot denotes the inner product in R’. In like manner we can
define the Cauchy product of a series in R and a series in R?.
It is perhaps a bit surprising that the Cauchy product of two convergent series
may fail to converge. However, it is seen that the series
= (-1)"
naovnt 1
is convergent, but the nth term of the Cauchy product of this series with itself is
1 1 1
CO-1 |=
ivr tt
avn : +k],
Vast
Since there are n+1 terms in the bracket and each term exceeds 1/(n+2), the
terms in the Cauchy product do not converge to zero. Hence this Cauchy product
cannot converge.
Exercises
36.A. Consider the series
11
—s-5+
1 2 3
36.B. Let a4,éR for néEN and let p<q. If the series ¥ (a,/n’) is convergent,
then the series )' (a,/n*) is also convergent.
36.C. If p and q are strictly positive numbers, then
n n”
(a) (-1) ape (b) @ep?
() yr Se @ oe
36.E. Suppose that } (a,) is a convergent series of real numbers. Either prove
that ¥ (b,) converges or give a counter-example, when we define b, by
111,11
1+5-3+ gts et +e
is divergent.
36.G. If the hypothesis that (z,) is decreasing is dropped, show that the
Alternating Series Test 36.7 may fail.
36.H. For néN, let c, be defined by
11 1
Gi =pt5t : “+7 log ne
Show that (c,) is a decreasing sequence of positive numbers. The limit C of this
sequence is called Euler’s Constant and is approximately equal to 0.577. Show
that if we put
then the sequence (b,) converges to log 2. (Hint: b, =c., —¢, + log 2.)
36.1. Let ¥ (a,.) be the double series given by
=-1, ifm—n=~-1,
= 0, otherwise.
Show that both iterated sums exist, but are unequal, and the double sum does not
exist. However, if (s,.,) denote the partial sums, then lim (s,,) exists.
37 SERIES OF FUNCTIONS 315
36.J. Show that if the double and the iterated series of ¥ (a...) exist, then they
are al] equal. Show that the existence of the double series does not imply the
existence of the iterated series; in fact the existence of the double series does not
even imply that lim, (a...) =0 for each m.
36.K. Show that if p>1 and q>1, then the double series
1 1
» (ca) and (qr 5)
are convergent.
36.L. By separating } (1/n’) into odd and even parts, show that
— 1 — ‘ 1
Lan Lan 3Gnai"
n=l
36.0. Prove Mertens’ Theorem: If ¥ (a,) converges absolutely to A and }' (b,)
converges to B, then their Cauchy product converges to AB. (Hint: Let the partial
sums be denoted by A,, B,, C,, respectively. Show that lim (C,,—A,B,) =0 and
lim (Ci.41— A.B.) = 0.)
36.P. Prove Ceséro’s Theorem: Let ¥ (a,) converge to A and } (b,) converge to
B, and let ¥ (c,) to their Cauchy product. If (C,) is the sequence of partial sums of
¥ (c,), then
(Hint: write C,+---+C,=A,B,+---+A,B,; break this sum into three parts; and
use the fact that A, > A and B, > B.)
si(x) = fi(x),
82(x) = si(x)+fo(x) [=filx) +fa(x)],
Ld), Lik) or Eh
to denote either the series or the limit function, when it exists.
If the series > ((|f.(x)||) converges for each x in D, then we say that } (f.)
is absolutely convergent on D. If the sequence (s,) is uniformly con-
vergent on D to f, then we say that ¥ (f,) is uniformly convergent on D, or
that it converges to f uniformly on D.
One of the main reasons for the interest in uniformly convergent series
of functions is the validity of the following results which give conditions
justifying the change of order of the summation and other limiting
operations.
(37.2) [s = n=1
y "fu
Ja
The assertion follows from the Cauchy Criteria 34.5 and 37.6 and the
convergence of ¥ (M,.). OED.
The next two results are very useful in establishing uniform convergence
when the convergence is not absolute. Their proofs are obtained by
modifying the proofs of 36.2 and 36.4 and will be left as exercises.
37.8 DiIRICHLET’s Test. Let (f,) be a sequence of functions on D < R?
to R* such that the partial sums
fi, neN,
Me:
Sn =
j=l
for |x|<r, it follows from the M-test that the differentiated series is
uniformly convergent on the interval [—r, r].
(c) A direct application of the M-test (with M,=1/n’) shows that
Y2=1 (1/n’) sin nx is uniformly convergent for all x in R.
(d) Since the harmonic series ¥ (1/n) diverges, we cannot apply the
M-test to
(1/n) decreases to zero, Dirichlet’s Test 37.8 implies that the series (37.5)
is uniformly convergent on J.
(e) Consider Y5-1 ((-1)"/nje™ on the interval ¥=[0, 1]. Since the norm
of the nth term on I is 1/n, we cannot apply the Weierstrass Test.
Dirichlet’s Test can be applied if we can show that the partial sums of
X((-1)"e™) are bounded. Alternatively, Abel’s Test applies since
* ((-1)"/n)) is convergent and the bounded sequence (e~™) is monotone
decreasing on I (but not uniformly convergent to zero).
Power Series
(37.6) er
tax
Y aax" = ot aixt tere,
n=0
Even though the functions appearing in (37.6) are defined over all of R,
it is not to be expected that the series (37.6) will converge for all x in R.
For example, by using the Ratio Test 35.8, we can show that the series
6
Y nix",
Mes
x”, YX x"/n!,
n=0 n=0
=
= 1/p, if O<p<+oa,
= +00, if p=0.
show. Since lim(n'")=1 (cf. 14.8(e)), each of these power series has radius of
convergence equal to 1. The first power series converges at neither of the points
x =—1 and x =+1; the second series converges at x =—1 but diverges at x = +1;
and the third power series converges at both x =—1 andx=+1. (Find a power
series with R=1 which converges at x =+1 but diverges at x =~—1.)
It is an exercise to show that the radius of convergence of ¥ (a,x") is also
given by
flan )
(37.9) tim
provided this limit exists. Frequently, it is more convenient to use (37.9)
than Definition 37.12.
The argument used in the proof of the Cauchy-Hadamard Theorem
yields the uniform convergence of the power series on any fixed compact
subset in the interval of convergence (—R, R).
(37.10) x. ei
n!}
oe n-k
An = Dn for allneN.
PROOF. Our preceding remarks show that n!a,=f(0)=n!b, for
neN. QED.
37 SERIES OF FUNCTIONS 323
then their product is given on this interval by the series ¥ (cax”), where the
coefficients (c.) are
n
Cn = > ADa-k forn=0,1,2,....
k=0
PROOF. We have seen in 37.13 that if |x|<r, then the series giving f(x)
and g(x) are absolutely converent. If we apply Theorem 36.13, we obtain
the desired conclusion. O.E.D.
The Multiplication Theorem asserts that the radius of convergence of the
product is at least r. It can be larger, however, as is easily shown.
We have seen that, in order for a function f to be given by a power series
on an interval (—r,r), r>0, it is necessary that all of the derivatives of f
exist on this interval. It might be suspected that this condition is also
sufficient; however, things are not quite so simple. For example, the
function f, given by
f)=e""", = x 40,
37.12
( ) =0, x=0,
can be shown (see Exercise 37.N) to possess derivatives of all orders and
f(0)=0 for n=0,1,2,.... Iff can be given on an interval (—1, r) by a
power series around x = 0, then it follows from the Uniqueness Theorem
37.17 that the series must vanish identically, contrary to the fact that
f(x) 40 for x40.
Nevertheless, there are some useful sufficient conditions that can be
given in order to guarantee that f can be given by a power series. As an
example, we observe that it follows from Taylor’s Theorem 28.6 that if
there exists a constant B>0 such that
(37.13) If) <B
t The rest of this section can be omitted on the first reading.
324 INFINITE, SERIES
for all |x| <r andn=0, 1,2,..., then Yx-o f(0)x"/n! converges to f(x) for
|x|<r. Similar (but less stringent) conditions on the magnitude of the
derivatives can be given which yield the same conclusion.
As an example, we present an elegant and useful result due to Serge
Bernstein concerning the one-sided expansion of a function in a power
series.
37.19 BERNSTEIN’s THEOREM. Let f be defined and possess derivatives
of all orders on an interval [0, r] and suppose that f and all of its derivatives
are positive on the interval [0,r]. If 0<x<r, then f(x) is given by the
expansion
PROOF. We shall make use of the integral form for the remainder in
Taylor’s Theorem given by the relation (31.3). If 0=x <r, then
PROOF. Abel’s Test 37.9, with f,(x) =a, and o.(x) =x", applies to give
the uniform convergence of ¥ (a.x") on IE Hence the limit is continuous
on I; since it agrees with f(x) for 0=x<1, the limit relation (37.17)
follows. QED.
One of the most interesting things about this result is that it suggests a
method of attaching a limit to series which may not be convergent. Thus,
if );-1 (b,) is an infinite series, we can form the corresponding power series
> (b.x"). If the b, do not increase too rapidly, this power series converges
to a function B(x) for |x|<1. If B(x)—> B as x > 1-, we say that the
series ) (b,) is Abel summable to 8. This type of summation is similar to
(but more powerful than) the Cesaro method of arithmetic means men-
tioned in Section 19 and has deep and interesting consequences. The
content of Abel’s Theorem 37.20 is similar to Theorem 19.3; it asserts that
if a series is already convergent, then it is Abel summable to the same
limit. The converse is not true, however, for the series )7~o(—1)" is not
convergent but since
1 . nn
x7 2D ¥,
We shall assess the magnitude of (37.18) for this value of N and Xo.
From (i), (ii), (iii) and the fact that (1—x0)(N+1)=1, we derive the
estimate
N
Since this can be done for each ¢ >0, the convergence of ¥ (a,) to A is
established. O.E.D.
Exercises
37.A. Discuss the convergence and the uniform convergence of the series ¥ (f,),
where f,(x) is given by
(a) (x?+n7)", (b) (nx)?, x#0,
(c) sin (x/n’), (d) (x"+1)',x =0,
(e) x"(x"+1)",x = 0, @® (-D*nt+x)y',x=0.
37.B. If ¥ (a,) is an absolutely convergent series, then the series ¥ (a, sin nx) is
absolutely and uniformly convergent.
37 SERIES OF FUNCTIONS 327
37.C. Let (c,) be a decreasing sequence of positive numbers. If ¥ (c, sin nx) is
uniformly convergent, then lim (nc,) = 0.
37.D. Give the details of the proof of Dirichlet’s Test 37.8.
37.E. Give the details of the proof of Abel’s Test 37.9.
37.F. Discuss the cases R=0, R=+ in the Cauchy-Hadamard Theorem
37.13.
37.G. Show that the radius of convergence R of the power series ¥ (a,x") is
given by lim (|a,|/|a,.,|) whenever this limit exists. Give an example of a power
series where this limit does not exist.
37.H. Determine the radius of convergence of the series )) (a,x"), where a, is
given by
(a) I/n’, (b) n*/n},
(c) nat, (d) (log n)", n= 2.
(e) (a!) /(2n)!, nv
37.1. If a, =1 when n is the square of a natural number and a, = 0 otherwise,
find the radius of convergence of ¥ (a,x"). If b,=1 when n=m! for meéN and
b, = 0 otherwise, find the radius of convergence of ¥ (b,.x").
37.J. Prove in detail that lim sup (|na,|'") = lim sup (|a,|"").
37.K. 1f0<p <|a,| <q for all n EN, find the radius of convergence of © (a,x").
37.L. Let f(x) =¥ (a,x") for |x|<R. If f(x)=f(—x) for all |x}<R, show that
a, = 0 for all odd n.
37.M. Prove that if f is defined for |x|<r and if there exists a constant B such
that |f(x)| = B for all |x}<r and ne N, then the Taylor series expansion
> f°).
M
ao on! *
37.R. Let f(x) =tan x for |x}</2. Use the fact that f is odd and Bernstein’s
Theorem to show that f is given on this interval by its Taylor series expansion about
x=0.
37.8. Use Abel’s Theorem to prove that if f(x)=¥ (a.x") for jx|< R, then
i fo) de =F BR,
provided that the series on the right side is convergent even though the original
series may not converge at x=R. Hence it follows that
1 net .
log 2= roe® , a7 Ler
37.T. By using Abel’s Theorem, prove that if the series }(a,) and ¥ (b,)
converge and if their Cauchy product } (c,) converges, then we have }) (c,)=
¥ (a,) +E (b,)-
37.U. Suppose that a, = 0 and that f(x) = (a,x") has radius of convergence 1.
If ¥ (a,) diverges, prove that f(x)» + asx -—>1—. Use this result to prove the
elementary Tauberian theorem: If a, = 0 and if
A = lm y a,x",
D a,x"
lim =s.
D pax"
xl—
(Hint: it is sufficient to treat the case s=0. Also use the fact that
lim, [Z (p.x") 7 =
37.W. Apply Appell’s Theorem with p(x) = X7_ (x") to obtain Abel’s Theorem.
37.X. If (a,) is a sequence of real numbers and a, =0, let s,=a,+---+a, and
let o, =(s:+-+++5,)/n. Prove Frobenius’t Theorem: If s = lim (o,) then
= Jim 2 ax".
REMARK. In the terminology of summability theory, this result says that if a
sequence (a,) is Cesaro summable to s, then it is also Abel summable to s. (Hint:
apply Appell’s Theorem to p(x)=(1-x)?= ¥"%_.(nx"") and note that
X(n- ox")
= p(x) Y (a,x").)
+ PAUL APPELL (1855-1930) was a student of Hermite at the Sorbonne. He did research in
complex analysis.
= GEORG FROBENIUS (1849-1917) was professor at Berlin. He is known for his work both i.
algebra and analysis.
37 SERIES OF FUNCTIONS 329
Projects
37.a. The theory of power series presented in the text entends to complex power
series.
(a) In view of the observations in Section 13, all of the definitions and theorems
that are meaningful and valid for series in R’ are also valid for series with elements
in C. In particular the results pertaining to absolute convergence extend readily.
(b) Examine the results pertaining to rearrangements and the Cauchy product to
see if they extend to C.
(c) Show that the Comparison, Root, and Ratio Tests extended to C.
(d) Let R be the radius of convergence of a complex power series
x az".
n=0
Prove that the series converges absolutely for |z]}<R and uniformly on any
compact subset of {z €C:|z|<R}.
(ec) Let f and g be functions defined for D ={z €C:|z|<r} with values in C
which are the limits on D of two power series. Show that if f and g agree on
DAR, then they agree on all of D.
(f Show that two power series in C can be multiplied together within their
common circle of convergence.
37.8. In this project we define the exponential function in terms of power
series. In doing so, we shall define it for complex numbers as well as real.
(a) Let E be defined for z € C by the series
E(z) =y3 .
Show that the series is absolutely convergent for all z € C and that it is uniformly
convergent on any bounded subset of C.
(b) Prove that E is a continuous function on C to C, that E(0)=1, and that
E(z+w)=E(z)E(w)
for z, win C. (Hint: the Binomial Theorem for (z+ w)" holds when z, w eC and
néeN,)
(c) If x and y are real numbers, define E, and E, by E,(x) = E(x), E.{y) = E(iy);
hence E(x +iy)=E,(x)E.(y). Show that E, takes on only real values but that E,
has some non-real values. Define C and S on R to R by
The set of all functions f:R —~ R which have period 27 and are piecewise
continuous will be denoted by PC(277). It is readily seen that this set is a
vector space under the operations:
[ f(x) dx = [ Cx) dx
for any ce R.
On the space PC(27) we shall be interested in the two norms
b=2 | g(t)sinmtdt,
TT Jo
n=1,2,....
(Note that the function in (a) is odd.)
(ce) Let f be continuous on R with period 27 and let its derivative f’ be
piecewise continuous on R (and with period 27). We shall relate the
Fourier coefficients an, b, of f with the Fourier coefficients ax, b, of f’ for
n=1,2,.... In fact, integrating by parts, we have
pi 1 " ,
an= = [ f'(t) cos nt dt
1 7 ™ .
=— f(t)}eosnt| — f(t})(—n) sin nt dt}.
If we use the fact that t+ f(t) cos nt has period 27 the first term is seen to
vanish and so a,=nb, for n=1,2,.... Similarly it is shown that
bi=—na, forn=1,2,..., (We note that if f:, f. are the functions in (a)
and (b), then f:(x) = f2(x) for x¢é{na:ne Z}, and that the Fourier coeffi-
cients for f; and f, for n=1,2,... satisfy the above relationships.)
In the next lemma, we shall calculate the square of the distance relative
to the norm ||-||, from f in PC(27) to an arbitrary function T, of the form
If we insert these two relations into the first formula and add and subtract
atsao
+ Yi-1 (a?+ b.2)}, we obtain formula (38.5). QED.
Lemma 38.3 has the following important ‘‘geometrical”’ interpretation:
among all trigonometric polynomials T, of degree n, the one which
minimizes the expression |{f — T, ||.” is uniquely determined and is obtained
by choosing the coefficients a, B, to be the Fourier coefficients ax, b, of f,
k=0,1,...,n. If we denote this (unique) minimizing trigonometrical
polynomial by S,(f), then
The next result is a special case of what is usually called the Riemann-
Lebesgue Lemma.
38.5 RIEMANN-LEBESGUE LEMMa. If g¢€PC(27), then
PROOF. Since sin (n+3)t =sin nt cos 3t+cos nt sin 3t, we have
If we let t=x+s and use the fact that the cosine is an even function and
that the integrand has period 27, we have
fi(c) = lim +9
fle)
10
38.7 PoOINTWISE CONVERGENCE THEOREM. Suppose that f € PC(27)
and that f has right- and left-hand derivatives atc. Then, the Fourier series
for f converges to x{f(c—)+f(c+)} at the point c. In symbols,
? 2, 2 sin it
336 INFINITE SERIES
Multiply by (1/7) f(c +) and integrate with respect to t over [0,7]. Since
So cos kt dt =0 for k EN, we obtain
yle+y-2]
1 1/°
fle-) sin (n+3)t
-
2 sin 3t
dt.
a +{ f(e+t)-
sin ttfle +) 1 dt.
sin (n +3)t
=fi(c) + 1= fic),
it follows that the function
We remark that it follows from the Mean Value Theorem (see Exercise
27.N) that if f’€ PC(277), then the left- and right-hand derivatives of f exist
at the points of discontinuity of f’. We now show that for a function f with
period 27 and such that f’¢ PC(27), the Fourier series for f is uniformly
convergent to f.
38.9 UNIFORM CONVERGENCE THEOREM. Let f be continuous, have
period 27, and suppose that f'e PC(27). Then the Fourier series for f
converges uniformly to f on R.
PROOF. Since f is continuous and the one-sided derivatives of f exists
at every point, it follows from the Pointwise Convergence Theorem 38.7
that the Fourier series for f converges to f at every point. It remains to
show that the convergence is uniform. In view of the inequality
Since a similar inequality holds for }|b,|, the desired assertion follows.
QED.
We now show that the partial sums of the Fourier series for any function
f in PC(2z) converges to f in the norm ||-|,._ While this does not guarantee
that we can recover the value of f at any particular preassigned point, it can
be interpreted as giving f in a certain ‘statistical’ sense. For some
applications this type of convergence is as useful as pointwise convergence,
and there is the advantage that we do not have to impose differentiability
restrictions.
38.10 Norm CONvERGENCE THEOREM. If fe PC(27) and if (S,(f))
is the sequence of partial sums of the Fourier series for f, then
1 + (= ty 0<jtl|< 7,
(38.13) Hp wsDi9+---+D..40)-[2e sin3t /’ =e
an, t=0,
f LEOPOLD FEJER (1880-1959) studied and taught in Budapest. He made many interesting
contributions to various areas of real and complex analysis.
38 FOURIER SERIES 339
and we let K, be this function which is called the nth Fejér kernel. Clearly
K,(t) = 0 and since
+ {7 Da ae=1
for k=0,1,2,..., it follows that
Finally, we note that it follows from Lemma 38.6 that we can express the
Cesaro means by the formula
38.12 FEséR’s THEOREM. [If f is continuous and has period 27, then
the Cesdro means of the Fourier series for f converge uniformly tof on R.
PROOF. It follows from (38.14) that
7 7
340 INFINITE SERIES
11"
a ds Yer) 1001 K.@)
" r= 7=*
~ or aiply(2)
M\8n 8) < n\ 48 (=e)PP
which can be made less than « by taking n sufficiently large. Since a
similar estimate holds for the integral over [—7z, —6], it follows that
IPs fle<(2+2)e1
for n sufficiently large. O.E.D.
Since the function’, (f) is readily seen to be a trigonometric polynomial (of
degree n—1), we have another proof of the following theorem of Weier-
strass.
38.13 WEIERSTRASS APPROXIMATION THEOREM. If f is continuous
and has period 27, then it can be uniformly approximated by trigonometric
polynomials.
Exercises
38.D. Let F and f belong to PC(27r) and have Fourier coefficients A,, B, and a,,
b,, respectively. If a, B € R andifh =aF+ Bf, show that h belongs to PC(27) and
has Fourier coefficients aA, + Ba,, aB, + Bb,. (Hence the Fourier coefficients of a
function depend linearily on the function.)
38.E. (a) Let f, be the function in Example 38.2(a). Calculate the Fourier
series for f,, and show that this Fourier series does not converge uniformly on
(-7, aw].
(b) Let f. be the function in Example 38.2(b). Calculate the Fourier series for
f., and show that the term-by-term derivative of the Fourier series for f, coincides
with the Fourier series for f,.
(c) Using the fact that the Fourier series for f, converges to f,, deduce that
moti ity,
g tg
(d) Let f,(x)=3a—f,(x) so that f,(x)=30—|x| for xe(-—a, 7]. Use Exercise
38.D ta show that the Fourier series for f, is given by
38.F. (a) Let g,€ PC(27) be such that g,(x)=x for x €(—a, w] and g,(a) =0.
Show that g, is an odd function and that its Fourier series is given by
[= x_sin2x sin3x_ |
1 2 3
Note that this Fourier series converges to 0 at x=+m. Use the Pointwise
Convergence Theorem 38.7 to show that this Fourier series converges to g,(x) for
every point x €[—7, zr).
(b) Let g,€ PC(27) be such that g.(x) =x? for x €(—a, 7]. Show that g, is an
even function and that its Fourier series is given by
wi Tart
11ga
72°4172
(d) Let h(x)=37’- g(x) so that h(x)=}0?—x? for xe(—7, 7]. Then the
Fourier series for h is given by
38.G. (a) Let k(x) =x? for allx eR. Show that k is continuous and odd on R.
342 INFINITE SERIES
38.H. Let f:[0, 7]— R be piecewise continuous and let f, € PC(27) be defined
by
(a) Show that f, is an even function; it is called the even extension of f with
period 27.
(b) The Fourier series of f, is called the (Fourier) cosine series of f. Show that it
is given by
where
(c) Show that if c € (0, ar) and f has left- and right-hand derivatives at c, then the
cosine series for f converges to Af(c—)+f(c+)]. Also if f has a right-hand
derivative at 0, then the cosine series for f converges to f(0+). If f has a left-hand
derivative at a, then the cosine series for f converges to f(m—).
38.1. For each of the following functions defined on [0, 7], calculate the cosine
series and determine the limit of this series at each point.
(a) f(@x)=x; (b) f(x)=sinx;
(c) f(x)=1 for O=x <}n, (d) f(x)=3a0-x for 0<x <)n,
=0 for ja <x<7. =0 for ja <x <7.
(e) f(x)=x(a—x).
38.J. Let f:[0, 7]— R be piecewise continuous and let f,¢ PC(27) be defined
by
f =f x) for x < (0, 7],
=0 for x =0,
=—f(-x) for
x € (~7, 0).
38 FOURIER SERIES 343
(a) Show that f, is an odd function; it is called the odd extension of f with period
27.
(b) The Fourier series of f, is called the (Fourier) sine series of f. Show that it is
given by
y b, sin nx,
n=1
where
(c) Show that if c €(0, a) and if f has left- and right-hand derivatives at c, then
the sine series forf converges to 3[f(c—)+f(c+)]. In any case, the sine series forf
converges to 0 at x =0, a.
38.K. For each of the following functions defined on [0, a], calculate the sine
series and determine the limit of this series at each point.
(a) f(x) =1; (b) f(x) =cos x;
(c) f(x)=1 for0 <x <7, (d) f(x)=a7-x;
=0 forja<x<7;
(e) f(x) =x(ar—x).
38.L. Let f, € PC(27) be the function such that f,(x)=n'" for O< x = 1/n and
=0 for other x¢(—a, 7]. Show that |[f|,=1/n’” so that the sequence (f,)
converges to the zero function in the norm ||-||, but, since it is unbounded, the
convergence is not uniform.
38.M. If fe PC(27) and if « >0, show that there exists a continuous function f,
with period 27 such that ||f—filb<e.
38.N. Use Parseval’s Equality 38.11 to establish the following formulas.
2 2 ca
roy l TL 1
(a) ecw (b) 8 =) Gao
> sin nx
1/2
nai ft
converges for all x. Show, however, that this series cannot be the Fourier series of
any function in PC(27).
344 INFINITE. SERIES
38.0. Let L>0 and let PC(2L) be the vector space of all functions f:R > R
which have period 2L and are piecewise continuous.
(a) If we define f-g=J', f(t)g(t) dt for f, ge PC(2L), show that the map
(fg) f-g is an inner product (in the sense of Definition 8.3) on PC(2L).
Moreover the norm induced by this inner product (see 8.7) is
Wk=[[
oor ar] ”
(b) We let Cy, C,, S,, ne N, be the functions in PC(2L) given by
1 NTX
Oar
C(x)==, C(x = Foo
Vi Vi
Soe) =~ sin SF
where 6,,, = 1 if n=m and 6, =Oifn#m. (Hint: if L =, these are the relations
given before 38.3.)
(c) If f € PC(2L), we define the Fourier series of f on [-L, L] to be the series
— . Narx
ao+ 5 (a, cos "7L ~4-b, sin "*),
we
where we have
a= 1/*
[. fds a= 1°(. f(t) cos "2 t at
1/* _ nt
b=7 f(t) sin; dt
forn=1,2,....
(d) Reformulate the Convergence Theorems 38.7, 38.9, and 38.10 for Fourier
series of functions in PC(2L). (Hint: make a change of variable.)
(e) If fe PC(2L), then Parseval’s Equality becomes
i
rll =5 a+ > (a+b),
ol 2 + 2 2
where the norm of f is as in part (a) and the Fourier coefficients are as in part (c).
38.R. For each of the following functions on the specified interval, calculate the
Fourier series on this interval and determine the limit of this series at each point.
(@) f(x)=x on (2,2);
(b) f(x) =0 for —-4<x<0,
=x for O=x=4;
(c) f(x) =0 for —-3<x<0,
=1 for O<x<1l,
=0 for <x <3.
38.8. Let f be continuous and have period 27. Show that if the Fourier series
for f converges at ce[—7z, 7] to some number, then it converges to f(c).
38 FOURIER SERIES 345
38.T. Let f belong to PC(27) and suppose that c €{—7, 7]. If T',(f) denotes the
nth Fejér mean, defined in (38.16), show that
lim Fe(N(e)=S1f(-)
+ le +).
38.U. Suppose that f and f’ are continuous with period 2a and that f’e
PC(27). (a) Show that the Fourier coefficients a,, b, of f are such that the series
Y n*(a +14)
is convergent. Hence, there exists a constant M>0 such that |a,| =< M/n’ and
b,| = M/n? for all ne N.
(b) Show that the Fourier series for f’ is the term-by-term derivative of the
Fourier series for f.
38.V. (a) If ke PC(2m) and if x,, x e[—a, 7], use the Schwarz Inequality to
show that
(b) Use part (a) and the Norm Convergence Theorem 38.10 to show that if
f¢PC(27) and x,€[—m, w], then the Fourier series for f can be integrated term-
by-term:
| f(t) dt= ; a(x —X0) + > | (a, cos nt+b, sin nt) dt,
x0 n=1 dxg
1 2x 1
cot 1x = — +—_ z 23
TX Wax’ -n
* (-4)"
ese mx = 424 C y
TX Wysx'—n
(c) Differentiate the first series in (b) term-by-term (justify this) to show that if
x¢Z, then
Ti, ¥ 1.
2
lim
(sin ax)? noi (x —n)?*
(d) Integrate the first series in (b) term-by-term (justify this) to show that if x¢ Z,
then
L=limf@=fO) |
xe x—-C
when this limit exists. Equivalently, we could have defined this derivative
to be the number L such that
lim f(x)-f()-L@—c) =0.
xc
|x —¢|
This limiting relation can be regarded as making precise the sense in which
we approximate the values f(x), for x sufficiently near c, by the values of
the affine? map
xr f(c)+L(x—c),
whose graph yields the line tangent to the graph of f at the point (c, f(c)).
It is this approach to the derivative that we shall use for functions on R?
to R*, Thus the derivative of a function f defined on a neighborhood of a
t In elementary courses, such a map is called “linear.” However, to be consistent with the
more restricted use of the term “linear” introduced in Section 21, we shall use the term
“affine” to refer to a function obtained by adding a constant to a linear function.
346
39 THE DERIVATIVE IN R? 347
Diff fas OF of
axe
Daf=
-, x2
—4f
axp 7 Def
=, f,
=f
OX,”
39 THE DERIVATIVE IN R?° 349
In case the symbol denoting the function has a subscript, we shall sometimes
insert a comma to indicate a partial derivative; thus, D,f2= f.,.
It should be observed that the partial derivative of a function at a point
with respect to one vector may exist, yet the partial derivative with respect
to another vector need not exist (see Exercise 39.A). It is also plain that,
under appropriate hypotheses, there are algebraic relations between
partial derivatives of sums and products of functions, etc. We shall not
bother to obtain these relations since they are either special cases of what
we shall do below, or can be proved similarly.
A word about terminology is in order. If u is a unit vector in R’, then
the partial derivative D.f(c) = f.(c) is often called the directional derivative
of f at c in the direction of u.
The Derivative
} The reader is warned that L is sometimes called the Fréchet derivative, or the differential, of
f atc, and is sometimes denoted by df(c) or f'(c), etc.
350 DIFFERENTIATION IN R?
f(c)+L(x—c) = (f(c)—L(c))
+ L@).
Hence we are approximating x +> f(x) by a function of the form x +>
yot L(x), where yo is fixed. Such functions are called affine mappings of
R? into R’; they are merely translations of linear mappings and so have a
very simple character.
From a geometric point of view, the existence of the derivative of f at c
reflects the existence of a tangent plane to the surface {(x, f(x)):x¢ A} in
R’ x R‘ at the point (c, f(c)); namely, the plane given by the graph
0<|Li(u)—- L.(u)]|
We now show that the existence of the derivative at a point implies the
existence of all of the partial derivatives at that point.
39.6 THEOREM. If ACRY, if f: A— R* is differentiable at a point
céA, and if uis any element of R’, then the partial derivative D.f(c) off atc
with respect to u exists. Moreover,
(39.8) Df(c)(u)
= ui Dif(c)+---+u,D,f(c).
352 DIFFERENTIATION IN R?
PROOF. The theorem implies that for each of the vectors e;,..., & the
partial derivatives Dif(c),...,D,f(c) exist and equal Df(c)(ei),...,
Df(c)(e,). However, since Df(c) is linear and u=u,e,+---+ ue, we
deduce that
j=l
orp
REMARKS. (a) The converse of Corollary 39.7 is not always true, for the partial
derivatives of f may exist without the derivative existing. For example, let
f:R’—R be defined by
ab?
(39.9) Da f(0, 0) = ath?’ (a, b) # (0, 0).
contrary to (39.9).
(b) We shall see below that if A < R” and if the partial derivatives of f: A > R*
are continuous at c, then Df(c) exists.
lim e910.p¢e
a)
tA0
exists. In this case the derivative Df(c) is the linear function of R into R
defined by
ur fi(c)u.
Thus Df(c) maps ueéR into the product of f’(c) and u. (In matrix
terminology, the derivative Df(c) is the linear mapping represented by the
1X1 matrix whose only element is f’(c).)
Traditionally, instead of writing u for the real number on which the linear
function of Df(c) acts, one writes the somewhat peculiar symbol dx (here the “‘d”’
plays the role of a prefix and has no other significance). When this is done and the
39 THE DERIVATIVE IN R? 353
Leibniz? notation for the derivative is used, the formula Df(c)(u) = f’(c)u becomes
(b) Let ACR and let f: A—» R* (q>1). Hence f can be represented
by the “coordinate functions”
(d) Now let ACR? and f:A — R‘ where both p>1, q>1. In this
case we can represent y = f(x) by a system
+GOTTFRIED WILHELM LEIBNIZ (1646-1716) is, with ISAAC NEWTON (1642-1727), one of the
coinventors of calculus. Leibniz spent most of his life serving the dukes of Hanover and was a
universal genius. He contributed greatly to mathematics, law, philosophy, theology, linguis-
tics, and history.
354 DIFFERENTIATION IN R?
t CARL ({G. J.) JAcosI (1804-1851) was professor at K6nigsberg and Berlin. His main work
was concerned with elliptic functions, but he is also known for his work in determinants and
dynamics.
39 THE DERIVATIVE IN R° 355
39.9 THEOREM. Let ACR?, let f:A — R‘, and let c be an interior
point of A. If the partial derivatives Df ((=1,...,q,,=1,..., p) exist
in a neighborhood of c and are continuous at c, then f is differentiable’
atc. Moreover Df(c) is represented by the q Xp matrix (39.11).
PROOF. We shall treat the case q = 1 in detail. If ¢ >0 let (ce) >0 be
such that if ||y—cl| =< 6(e) and j=1,2,..., p, then
(39.12) ID,f
— Dif(c)|<e.
(y)
Tf x = (x1, X2,..., Xp) and c = (ci, €2,..., Cp), let z:, Z2,..., Zp-1 denote the
points
Z1 = (C1, X2,.. +, Xp), Zo
= (C1, €2, X3,.-- Xp),
wey Zp = (c1, Cay. . 64 Cpa, Xp)
and let zo=x and z,=c. If ||x—cl|<=6(e), then it is easily seen that
||z; -¢l| = 8(e) for 7] =0,1,...,p. We write the difference f(x)—f(c) asa
telescoping sum:
Exercises
39.4. Let f:R’—R be defined by
Show that the partial derivatives D,f(0, 0), D.f(0, 0) exist and equal 0. However,
the derivative of f at (0,0) with respect to a vector u=(a,b) does not exist if
ab#0. Show that f is not continuous at (0, 0); indeed, f is not even bounded on a
neighborhood of (0, 0).
39.B. Let g:R*— R be defined by
Show that the partial derivatives D,g(0, 0), D.g(0, 0) exist and equal 0. However,
the derivative of g at (0,0) with respect to a vector u =(a, b) does not exist if
ab#0. Show that g is not continuous at (0,0); however, g is bounded on a
neighborhood of (0, 0).
39.C. Let h: RR’—> R be defined by
Show that the partial derivatives D,h(0, 0), D2h(0, 0) exist and equal 0. However,
the derivative of h at (0,0) with respect to a vector u=(a,b) does not exist if
ab#0. Show that h is not continuous at (0, 0).
39.D. Let k:R’— R be defined by
Show that the partial derivative of k at (0, 0) with respect to any vector u = (a, b)
exists and that
D,k(0, =e ifax0.
Show that k is not continuous and hence not differentiable at (0, 0).
39.E. Let f:R’— R be defined by
Show that the partial derivative of f at (0,0) with respect to any vector u = (a, b)
exists and that
ab?
D.,f(O, 0) = woab? if (a, b) # (0, 0).
Show that F is continuous only at the point (0, 0) and that it is differentiable there.
39.G. Let G:R?’— R be defined by
G(x, y)=(x?+ y’) sin 1/(x’+ y’) for (x, y) 4 (0, 0),
=0 for (x, y)
= (0, 0).
Show that G is differentiable at every point of R* but the partial derivatives D,G,
D,G are not bounded (and hence not continuous) on a neighborhood of (0, 0).
39.H. Let H: R?— R® be defined by
Show that D,H exists at every point and that D,H exists and is continuous on a
neighborhood of (0,0). Show that H is differentiable at (0, 0).
39.1. Let ACR’, let f: A > R‘ be differentiable at a point c interior to A, and
jet ve R*. If we define g:A > R by g(x) =f(x)- v for all xe A, show that g is
differentiable at c and that
(u)
= (Df
Dg(c) (c)(u)) + v forueR’.
39.J. Let c be an interior point of ACR? and let f:A—>R.
(a) If f is differentiable at c, show that there exists a unique vector v, € R° such
that
c)=v. + u
= Df(c)(u)
D,f( for allue R’.
The vector v, is called the gradient of f at c and is denoted by V-f, or by grad f(c).
Show that
V(af)=aV.f, V.(f+g)=V.f+Veg,
V. (fg) = f(c) V.g + g(c) Vef.
39.L. Find the gradients of the following functions at an arbitrary point in R’.
(a) filx, y, Zz) =x? ty? +275
(b) fax, y, z) =x? yz +275
(c) falx, y, 2) = xyz.
39.M. Find the directional derivatives of each of the functions in 39.L at the
point (0, 1, 2) in the direction toward the point (0, 2, 3).
39.N. Let ACR? and let a function f:A—R represent a surface S,; in R®
explicitly as its graph:
{x y, z) eR? : 2 = f(a, Yo) + Dif(Xos Vox — X0) + Daf (Xo, yoy — Yo).
39.0. Find the tangent planes to the surfaces in R® represented as graphs of the
following functions of the points specified. Draw a sketch.
(a) f(x, y)=x?+y? at (0,0) and at (1, 2).
(b) f.(x, y)=xy at (0, 0) and at (1, 2).
(c) falx, y) =(4—(x?+ y?))'” at (0, 0) and at (1, 1).
39.P. Let J&R be an interval and let g:J— R® represent a curve C, in R®
parametrically:
{(x, y, Z)E R?:x = hy(S0, to} + Diha(So, to)(S — So) + DahalSo, to)(t — to),
yr ha(So, to) + D,hi(so, to)(s _ Si) + D,hi(so, to)(t _ to),
If the vectors (Djhi(So, to), Diho(o, toe), Dihs(So, to)) and (Dh, (So, to), Drha(So, to),
D,h,(So, to)) in R* are not multiples of each other, then this tangent space is a plane
in R° and is called the tangent plane.
39.8. Find parametric equations for the tangent planes to the following surfaces
in R° at the specified points.
(a) h:(s, t)> (x, y, z)=(s, tf, s7+127) at the points corresponding to (s, t) = (0, 0)
and (1, 1).
(b) h:(s, t)> (x, y, z)=(s+4, s—t, s?—t”) at the points corresponding to (s, i) =
(0, 0) and (1, 2).
(c) h:(s, thw (x, y, z)=(s cost, s sin t, t) at the points corresponding to (s, t)=
(1, 0) and (2, 7/2).
(d) h:(s, t) > (x, y, z) = (cos s sin t, sins sin t, cos t) at the points corresponding
to (s, t)=(0, 0), (0, 2/2) and (7/4, 27/4).
39.T. If ACR?’ and f: A > R is such that the partial derivatives D,f,...,D,f
exist and are bounded on some neighborhood of c € A, then f is continuous at c.
(Hint: argue as in the proof of Theorem 39.9.)
39.U. Let f be defined on a neighborhood of a point c ¢ R’ with values in R.
Suppose that D.f exists and is continuous on a neighborhood of c and that D.f
exists at c. Show that f is differentiable at c.
39.V. Let ACR? and let f: A > R‘ and g:A —R’ be given. If F:A > R'X
R’ = R*™ is defined by F(x) = (f(x), g(x)) for x € A, show that F is differentiable at
an interior point c € A if and only iff and g are differentiable at c. In this case we
have
DF(c)(u)
= (Df (c)(u), Dee (u)) for ue R’.
360 DIFFERENTIATION IN R?
Alternatively, we write
If ||x ~ cl] = inf {y, (1/K) 8(e, g)}, then (40.3) implies that |If(x)—f(c)||<
6(¢, g), which means that
and so the derivative of g of at c takes this point of R’ into the real number
g'(b)[Dif(c)wit: + D,f(c)we].
(c) Letq>1, p=r=1. According to Examples 39.8(b), (c) the deriva-
tive Df(c) takes the real number u into the point
Df(c)(u)= uf'(c)=(file)u,..., ftcju) in R*,
and the derivative Dg(b) takes the point w =(wi,..., Wa) in R® into the
real number
Dig(b)wit- + -+Dag(b)w,.
It follows that the derivative of h = gef takes the real number u into the
real number
ay, dx * dy, ax
(d) We consider the case where p=q=2 and r=3. For simplicity in
notation, we denote the coordinate variables in R’ by (x, y), in R* by
(w, z), and in R” by (r,s,t). Then a function f on R?’ to R* can be
expressed in the form
w=WG,y), z=Z(x,y)
and a function g on R‘ to R' can be expressed in the form
r=R(w, z), s = S(w, z), t=T(w, z).
The derivative Df(c) sends (é, 4) into (@, £) according to the formulas
o= W,.(c)é + Wy (c)n,
(40.6)
f= Z.(c)§ + Z,(c)n.
Here we write W, for D:.W=D,W, etc. Also the derivative Dg(b) sends
(w, £) into (p, a, 7) according to the relations
A routine calculation shows that the derivative of gof sends (&, 7) into
(p, o, 7) by
p ={Rw(b) Wc) + Re(b) Zz (C)}E + {Rw (b} Wy(c) + R.(b)Z, (c)}m,
(40.8) o={S..(b) W.(c) + S.(b)Z.(c)}E + {S.(b) W(c) + S.(b)Z,(c)}n,
T= {T.(b) W.(c) + Tb) Z.(c)}E +{T.(b) W,(c) + T(b)Z,(c)}n.
A more classical notation would be to write dx, dy instead of & 1; dw, dz instead of
@, ¢; and dr, ds, dt instead of p, o, +. If we denote the values of the partial
derivative W, at the point c by dw/dx, etc., then (40.6) becomes
dw = dx ey dy,
dz = oe dx +5 dy;
similarly, (40.7) becomes
dr= a dw +t dz,
ds= 2s dw +3 dz,
at at
dt =.
aw OY dw+—
az O23dz;
364 DIFFERENTIATION IN R?
as = (28.2,
5
8 22) gy 5 (89, 9822) ay,
as dW | Os Oz
Ow ax dz Ox x
Os Ow , OS Oz
aw dy dz oy y
In these last three sets of formulas it is important to realize that all of the indicated
partial derivatives are to be evaluated at appropriate points. Hence the coefficients
of dx, dy, etc., turn out to be real numbers.
Similarly, (40.7) asserts that the mapping Dg(b) of («, Z) into (p, a, r) is
given by the 3x2 matrix
or (b)
or
= (b)
Ru(b) Re(b) ow
02
(40.10) S.(b) S.(b) |= 2S
ow
(p) (b) .
T.(b) Tb) ar
aw (6) =)
Finally, relation (40.8) asserts that the mapping D(g°f)(c) of (& 7) into
(p, o, tT) is given by the 3x2 matrix
in R‘, and if a, b belong to R’, then there exists a point c (lying between a,
b) such that
e(t)=(1-that+th=a+t(b—a),
so that (0) =a, o(1)=b, and g(t)e SO for te[0, 1]. Since 0 is open
and ¢ is continuous, there is a number y > 0 such that @ maps the interval
(-y, 1+ y) into O. Now let F:(-y,1+y)— RB be defined by
F(t)=fee(t)=f(1—-tha+tb).
By the Chain Rule [see 40.3(c) and 40.P] it follows that
f(b)
— f(a) = FQ)
— F(0)
= F'(to) = Df(c)(b—a). QED.
Although the most natural extension of the Mean Value Theorem does
not hold when the range space is R*, q>1, there are some extensions
366 DIFFERENTIATION IN R?
Evidently we have
DH(x)(u) ={Df(x)(u)} * y1
forxe S,ueER’. It follows from the Mean Value Theorem 40.4 that there
is a point c on S such that
Since the exact value of the point c is usually not known, the theorem is
often applied by using the following result, whose statement uses the
notion of the norm of a linear map L from R? to R‘ that was introduced in
Exercise 21.L. It is only necessary to recall that ||L(u)l] < M |lull for all
uéR’, if and only if the norm ||Lj|,, =< M.
40.6 CoROLLARY. Suppose the hypotheses of Theorem 40.5 are
satisfied and that there exists M>0O such that |\Df(x)|,q <M for all xeS.
Then we have
Df or of i=1,2,...,p.
of a
Dyf or Bx, ax? Lj=i,2,...,p.
mixed partial derivatives at the point (0,0) are the limit of the quotient
D,.f(0, 0)
= th, Jim, 0)Ato
,
proor. Let ¢>0 and let 6>0 be so small that if |h|<6 and |k|<8,
then the point (h, k) belongs to U and
B(h)
= f(h, k)— f(h, 0),
from which it follows that A(h,k)=B(h)—B(0). By hypothesis, the
partial derivative D.f exists in U and hence B has a derivative. Applying
the Mean Value Theorem 27.6 to B, there exists a number ho with
0<|ho|<|h| such that
(40.15) A(h, k) = B(h)—
B(0) = hB'(ho).
(It is noted that the value of h. depends on the value of k, but this will not
cause any difficulty.) Referring to the definition of B, we have
B'(ho) = D,f (ho, k) — D.f (ho, 0).
Applying the Mean Value Theorem to the right-hand side of the last
equation, there exists a number ky with 0<|ko|<|k| such that
(40.16) B'(ho) = k{Dyf(ho, ko)}.
Combining equations (40.15) and (40.16), we conclude that if 0<]h|<6
and 0<|k|<6, then
A(h, k)_
hk Dyf(ho, ko),
By taking the limit in this inequality with respect to k and using (40.18), we
obtain
Higher Derivatives
for sufficiently small z. This means that Df(c) is the linear function which
most closely approximates the difference f(c+z)—f(c) when z is small.
Any other linear function would lead to a less exact approximation for
small z. From this defining property, it is seen that if Df(c) exists, then it is
necessarily given by the formula
Df(c)(z) = Dif(c)zi
++ + Def(c)zp,
where z=(%,...,Zp) in R’.
Although linear approximations are particularly simple and are suffi-
ciently exact for many purposes, it is sometimes desirable to obtain a finer
degree of approximation than is possible by using linear functions. In such
cases it is natural to turn to quadratic functions, cubic functions, etc., to
effect closer approximations. Since our functions are to have their
domains in R’, we would be led into the study of multilinear functions on
R’ to R for a thorough discussion of such functions. Although such a
study is not particularly difficult, it would take us rather far afield in view of
the limited applications we have in mind.
For this reason we shall define the second derivative D’f(c) of f at c to
be the function on R® x R” to R such that if (y, z) belongs to this product
and y=(yi,.-., yp) and z=(z1,..., z,), then
In discussing the third derivative, we shall assume that all of the third
partial derivatives of f exist and are continuous in a neighborhood of c.
By now the method of formation of the higher derivatives should be
clear. (In view of our preceding remarks concerning the interchange of
order in differentiation, if the resulting mixed partial derivatives are
continuous, then they are independent of the order of differentiation.)
One further notational device: we write
If p=2 and if we denote an element of R®* by (x, y) and w =(h, k), then
D?f(c)(w)* equals the expression
Da f(c)h? + 2Dyf(cyhk + D,,f(c)k?;
Dy...f(c)h + (TD. n
flOn n-1
k+(3)De-.-onflh
n
k
n-2yp,2
D,...yf(c)k”.
Now that we have introduced this notation we shall establish an
important generalization of Taylor’s Theorem for functions on R? to R.
40.9 TayLor’s THEOREM. Suppose that f is a function with open
domain ©, in R? and range in R, and suppose that f has continuous partial
derivatives of order n in a neighborhood of every point on a line segment S
joining two points a, b=a+uin Q. Then there exists a point c on S such
that
F(t)=f(a+tu).
Exercises
40.A. If f(x, y)=x?+y’ and g(t)=(3t+1, 2t—-3), let F(t)=feg(t). Evaluate
F'(t) both directly and by using the Chain Rule.
40.B. lf f(x, y)=xy and g(s, t) =(2s+ 3t, 4s +1), let F(s, t)=feg(s, t), Evaluate
D,F and D,F both directly and by using the Chain Rule.
40.C. lf f(x, y, z)=xyz and g(s, t)= (3s + st, s, t), let F(s, t) =feg(s, t). Evaluate
D,F and D.F both directly and by using the Chain Rule.
40.D. I£ f(x, y, z)=xy + yz + zx and g(s, t) = (cos s, sin s cos f, sin t), let F(s, t)=
feg(s,t). Evaluate D,F and D,F both directly and by using the Chain Rule.
40.E. If Cartesian axes are rotated in the plane by the angle @, then the new
coordinates u, v of a point are related to the original coordinates x, y by
Let f:R’— R be differentiable on R’ and let F(u, v) =f(x, y) for all x, y. Show
that
for all a, be R and all x, x’, y,y’in R®. It can be proved that there exists M>0
such that |[B(x, y)]] = M |[x||[lyll for all x, y in R’. Assuming this, prove that B is
differentiable at every point (x, y)<¢ R’ x R’ = R” and that
h'(c) = Df(g(c))(g'(e)).
40.Q. Let Q2R?’ be open and let f:2— R*. Suppose that O contains the
points a, b and the line segment S joining these points, and that f is differentiable at
every point of S. Show that there exists a linear mapping L: R’ > R* such that
f(b)— fla) =L(b— a).
tT LEONARD EULER (1707-1783), a native of Basel, studied with Johann Bernoulli. He
resided many years at the court in St. Petersburg, but this stay was interrupted by twenty-five
years in Berlin. Despite the fact that he was the father of thirteen children and became
totally blind, he was still able to write over eight hundred papers and books and make
fundamental contributions to all branches of mathematics.
374 DIFFERENTIATION IN R°
40.R. Let QCR® be an open connected set and let f:— R* be differentiable
on ©. If Df(x)=0 for all x €2, show that f(x)= f(y) for all x, ye€Q. Show that
this conclusion may fail if 0 is not connected.
40.8. Let JGR’ be an open cell and suppose that f:J — R is differentiable on
J. Show that if the partial derivative D,f(x) =0 for all x < J, then f does not depend
on the first variable in the sense that
Show that the second partial derivatives D,,f and D,,f exist at (0,0) but are not
equal.
40.V. Use the Mean Value Theorem to determine approximately the distance
from the point (3.2, 4.1) to the origin. Give error bounds for your estimate.
40.W. Let OCR? be open and let f:2— R*. Suppose that 0 contains the
points a, b and the line segment S joining the points, and that f has continuous
partial derivatives on S. Show that
Project
40.a. (This project is a modification of the classical Newton’s Method for the
location of roots when a sufficiently close approximation is known.) Let f be
defined and continuous on an open set containing the closed ball B,(x,)=
{x ER? :||x —x,|| <r} with values in R*. Suppose that f is differentiable at every
point of B,(x,) and that there exists a number C, with 0<C <1, and an injective
linear map T': R* > R® such that |'°f(x.)l| = (1—C)r and such that
(a) Let g:B,(x) > R? be defined by g(x) =x —lof(x) for x € B,(x). Show that
g is differentiable at every point of B,(x.) and that g is a contraction with constant
C<1 (see 23.4) on B,(x).
(b) Define x,= (x0) and x..41=g(x,) for neN. Show that |ba.i—x||s
C* |x: -x<, whence it follows that |[x..1-x,l[<C"r for n=m20. Hence
|x. —xo|<r for k= 0, 1,2,....
(c) Show that (x,) is a Cauchy sequence and hence converges to an element
X¥ © B,(X»), which is such that g(x)=x. Moreover, we have the estimate ||x, — x||<
C*r,
(d) Show that f(%)=0 and x is the only element in B,(x.) where f vanishes.
[Difi(x)— Difiy)].
Since Df(xo) is linear, it follows that Dg(x) = Df(x)— Df(x») for x EQ. If
we apply the Mean Value Theorem 40.5, we infer that there exists a point
céS such that
I|L-Gea— x2)
I] = []f(2e1) — fee)|| = 27 |] x1 — x2].
If we combine this and (41.3) with u=x,—x2, we obtain
(41.4) 2F ||x1
— x2l] <[lf(x.)
— f xa)
for x. € Bs. This proves that the restriction off to B; is an injection; hence
this restriction has an inverse function which we shall denote by g. If
yx € f(Bs), then there exist unique points x,=g(y.) in Bs such that
yx =f(xx). It follows from (41.4) that
A reader familiar with the notion of the “rank” of a linear transformation will
recall that L: R° — R° is surjective if and only if rank (L)=q sp.
41.6 SURJECTIVE MAPPING THEOREM. Let OCR? be open and let
f:0Q— R? belong to Class C'(Q). Suppose that for some c €Q, the linear
function L = Df(c) is a surjection of R’ onto R*. Then there exist numbers
m>0 and a >0 such that if y € R? and |ly — f(c)|| < a/2m, then there exists
an x €Q such that |x —clls a and f(x)=y.
PROOF. Since L is a surjection, each of the standard basic vectors
e,=(1,0,..., 0), e.=(0,1,...,0),...,e,=(0,0,...,
1)
+ Lawrence M. Graves (1896-1973) was born in Kansas, but was associated with the
University of Chicago for many years as student and professor. He is best known for his
contributions to functional analysis and the calculus of variations.
41) MAPPING THEOREMS AND IMPLICIT FUNCTIONS 379
in R* is the image under L of some vector in R°, say ui, u2,..., Ug. Now
let M:R°—R?’ be the linear function mapping ¢ into u; for j=
1,2,...,q5 that is,
m(3 ae)=$ an
It follows that L°M is the identity mapping on R‘; that is, Lo M(y)= y for
all ye R*. If we let
= {dtu}.
2 1/2
i=l
|MOvyl-= 3 La le
={¥ lar} {¥ hair}
=m fyl.
By the Approximation Lemma 41.4 there exists a number a@ >0 such
that if |x. —c||< a, k =1,2, then x € and
Now let B, ={x eR :|x—c|| <a} and suppose that y¢R* is such that
lly — f(c)|| < a/2m. We will show that there exists a vector x with x é B.
such that y = f(x).
Let xo=c and let x1=x0+M(y—ff(c)) so that ||x1—xol| <= m lly —f(|l <
3a, whence
1
Ix ~xol| = 5 and Ix: -cl| = (1-3)a,
Suppose that c = Xo, X1,..., Xn, have been chosen inductively in R? such
that
(41.6) [xx
— xx-1]] = @/2*, [xx
— cl] = 1-1/2" )a,
for k=1,...,n. We now define x41 (n = 1) by
[[>n —~ Xen] <= [fen — Xnval] + [fae — Xneal| +> + + +|[Xm—1— Xml
a a a a
= gut gaat tam = 5a -
2
It follows that (x,) is a Cauchy sequence in R° and therefore converges to
some element x. Since ||x, — cl] =< (1—1/2")a, it follows that |x —c||< « so
that xe B,.
Since x1— x0 = M(y —f(c)), it follows that
whence it follows that y = lim f(x,) = f(x). Hence every point y satisfying
lly —f(c)l| < a/2m is the image under f of a point x €Q with |x —c||< a.
OED.
41.7 Open Maprinc THEOREM. Let OCR? be open and let
f:Q— R’‘ belong to Class C'(Q). If for each x €O the derivative Df(x) is a
surjection, and if G <Q. is open, then f(G) is open in R*.
PROOF, If bef(G), then there exists a point c € G such that f(c)=b. It
follows from the Surjective Mapping Theorem 41.6 applied to f |G that
there exists 8 >0 such that if ||y — b|| =< 6 then there exists an x € G such
that y = f(x). Hence f(G) is open in R*. QED.
and only if the derivative Df(c) has an inverse which, in turn, is true if and
only if the Jacobian determinant
J;(c) = det [Dyfi(c)] = det [f.;(c)]
is different from zero.
A reader familiar with the notion of the “rank” of a linear transformation will
recall that L: R’ — R‘ is bijective if and only if rank (L)=p=q.
Dg(y)=[Df(ety)T" for ye V.
PROOF. By hypothesis L = Df(c) is injective; hence Corollary 22.7
implies that there exists r>0 such that
lle(y)— gy)
— Maly — yall = (2/77) flu(o)Il ly — yall.
Now as y > yi, then x = g(y)—> g(y1) = x1 and so |lu(x)||-> 0. We con-
clude, therefore, that Dg(y1) exists and equals M; = (Df(x.))'.
The fact that g belongs to Class C’(V) follows from the relation
De(y)=[Df(g(y))T* for y € V, and the continuity of the mappings
yr>g(y), x >Df(x), Lr
of V> U, U— £(R’,R’), and ¥(R’, R’) > L(R’, R°), respectively.
(See Exercise 41.L.) QED.
Implicit Functions
F(x, y)=0
for one argument (say, y) in terms of the other in the sense that we find a
function » defined on a subset of R°® with values in R* such that b = ¢(a)
and
F(x, o(x))=0
for all x in the domain of ~. We assume that F is continuous on a
neighborhood of (a,b) and we hope to conclude that the “solution
functionӢ is continuous on a neighborhood of a. It will probably be no
surprise to the reader that we shall assume that F belongs to Class C’ on a
neighborhood of (a, b); however, even this hypothesis is not enough to
guarantee the existence and uniqueness of a continuous solution function ¢
defined on a neighborhood of a.
41 MAPPING THEOREMS AND IMPLICIT FUNCTIONS 383
Indeed, ifp = q = 1, then the function given by F(x, y) = x?— y* has two continuous
solution functions 9,(x)=x and ¢(x)=—x corresponding to the point (0,0). It
also has discontinuous solutions, such as
@(c) = x, x rational,
=-x, x irrational.
The function G(x, y)=x—y’ has two continuous solution functions corresponding
to (0, 0), but neither of them is defined on a neighborhood of the point x =0. To
give a more exotic example, the function H:R’— R defined by
When this is the case, there are functions ¢;, j=1,...,q, defined and
continuous near a =0 such that if we substitute
It follows readily (see Exercise 39.V) that H belongs to Class C’(Q) and
that
L,(u)
= DF(O, 0)(u, 0) forueR’;
then the fact that DF(O, 0)(u, v) = Li(u)+L.(v) shows that the inverse of
DH(0, 0) is the linear mapping K on R? x R‘ defined by
Hence it follows from the Inversion Theorem 41.8 that there is an open
neighborhood U of (0,0)€R’R* such that V=H(U) is an open
neighborhood of (0,0)¢€R’ xR‘ and the restriction of H to U is a
bijection onto V with a continuous inverse @: V — U which belongs to
Class C'(V) and with (0, 0)=(0, 0). Now ©® has the form
P(x, z) = (g(x, z), 2(x, z)) for (x, z)eE Vv
41 MAPPING THEOREMS AND IMPLICIT FUNCTIONS 385
Now if P:R’ xR‘ —R’‘ is defined by P(x, z)=z, then P is linear and
continuous and g2= Pe; therefore g, belongs to Class C’(V) and we
have
z = F(x, ¢2(x, z)) for (x, z)e V.
fap+t —— fav+a
is invertible at the point (a,b). (Recall that f,; denotes the partial
derivative of f, with respect to the jth argument.) In this case the
derivative of the solution function ¢ at a point x is given by
fryer frp+a a fis fip
C={(x,
y)e R’ x R*: F(x, y)
= 0}
passing through the point (a, b), can be parametrized at least locally as the
graph in R? x R‘ of some function defined on a neighborhood W of ae R?
to R‘; that is,
The dimension r(L) of Ry is called the rank of L, and the dimension n(L)
of Nz is called the nullity of L. (Thus the rank of L is the number of
linearly independent vectors in R‘ needed to span the range Rz, and the
nullity of L is the number of linearly independent vectors in R’ needed to
span the null space Nz.) It is an exercise to prove that if {ui,..., ua}
(where n = n(L)) is a linearly independent set of vectors in R’ spanning Nr
to which we adjoin p — n vectors uns, ..., Up to get a basis for R’, then the
set {L(un+1),.--, L(up)} is a linearly independent set of vectors in R?
spanning R,. Therefore it follows that p=n(L)+r(L); hence: the
dimension of the domain of L is equal to the sum of the nullity and the rank
of L.
If we represent L by a q X p matrix as in (23.1), then it can be shown that
the rank of L is the largest number r such that there is at least one rxr
submatrix with non-zero determinant.
{+ For more detail, consult the books of Hoffman and Kunze or Finkbeiner listed in the
References.
388 DIFFERENTIATION IN R?
Qy)= =)
Lay, Qa(y)= a ciyi-
art
hence u belongs to Class C'(Q). Since it is readily seen that Du(0) is the
identity map on R’, then it follows from the Inversion Theorem 41.8 that
there exists an open neighborhood U of a=0 such that U'=u(U) is an
open neighborhood of 0, and that the restriction of u to U is a bijection
onto U’ with inverse w= u~':U’ > R? which belongs to Class C'(U’).
Further, by replacing U and U' by smaller sets, we may also suppose that
U’ is convex (that is, contains the line segment joining any two of its
points).
We now let g:U'— R‘ be defined by
g(z)=f(w(z)), zeEU'cR?.
Clearly g belongs to Class C’(U’) and
p(t)=geC(t),
whence it follows that
p(t) = (few)
C(t) = fe B(t).
Moreover, if x € V, then
f(x) = f(weu(x))
= (few)eu(x) = geu(x);
however, we have seen that geu(x) = g°P,°u(x) so that
F(x)= geu(x)=g°e(CeC™)o(Pieu)(x)
=(geC)o(C'ePieu)(x)
= pea(x).
Hence, f(x) = ¢°a(x) for all xe V. Q.E.D.
PROOF. We assume that a =0 and b=0 and shall employ the notation
and results established during the proof of The Parametrization Theorem.
Let B:R° — R?® be the linear function which maps the standard basis
elements e:,..., e, of R” into the vectors x1, ..., x»; hence B is a bijection
of R?’ onto R’ and so B™ exists. The map o:Q-—R?’ defined by
a(x) = B'ou(x) belongs to Class C'(Q) and, since the restriction of u to U
has an inverse w:U'— R” mapping onto U, it follows that the restriction
of o to U has an inverse o7'=weB mapping B™'(U’) onto U.
Let WC R' and o: W > R* be as in the Parametrization Theorem and
let H:R*— R‘ be the linear function which maps the standard basis
elements e:,..., e, of R* into the vectors y:,..., y,; hence H is a bijection
of R* onto R? and so H™ exists. We define
W'={(c,...,¢,)E
Ro: (c1,...,¢,)€ Wh
and let +: W’— R®* be defined by
T(C1,-.-, Cg) = O(C1,...,G) + H(O,..., 0, Crety-- +5 Cy).
Exercises
41.A. Let QCR?’ be open and f:Q— R*. If Df(x) exists for all xe and if
i=1,...,q, j=1,...,p, then show that |Dji(x)-D,fi(y)| < ||Df(x)- Df(y)lh.
Hence, if f belongs to Class C’(Q) then each of the partial derivatives D,f, is
continuous on ©.
41.B. Let Q¢ R’ be open andf:Q— R*. If f belongs to Class C'(Q) and K <Q
is compact, show that x +> Df(x) is uniformly continuous in the sense that for every
e >0 there exists 6 > 0 such that if x, y ¢ K and |x — yl|< then ||Df(x) — Df(y) lla <
&.
41.C, Let Q¢ R? and 0, < R* be open and let f:Q— R* belong to Class C'(Q)
and g:0,—R’ belong to Class C'(Q,). If f(Q) <0), show that gef belongs to
Class C'(Q).
41.D. Let f:R > R be defined by f(x) =x. Show that f belongs to Class C'(R)
and that it is a bijection of R onto R with inverse g(x) =x'" for allx eR. However
Df(0) is neither injective or surjective. Does g belong to Class C’(R)?
41L.E. Let g:R—R be such that g'(x)#0 for all xe R. Show that g is a
bijection of R onto g({R).
41.F. Let ACR’, let f:A — R’, and let g:f(A)— R? be inverse to f. Suppose
that f is differentiable at ae A and g is differentiable at b=f(a). If Df(a) is not
invertible, then show tnat Dg({b) is not invertible.
41.G. Let f: R’— R’ be given by
uU =x, v= xy.
Draw some curves u = constant, v = constant in the (x, y)-plane and some curves
x = constant, y =constant in the (u, v)-plane. Is this mapping one-one? Does f
map onto all of R*? Show that if x #0, then f maps some neighborhood of (x, y) in
a one-one fashion onto a neighborhood of (x, xy). Into what region in the
(u, v)-plane does f map the rectangle {(x, y):1=<x <2, 0< y <2}? What points
in the (x, y)-plane map under f into the rectangle ((u,v):lsau<2, 0<v = 2}?
41.1. Let f be the mapping of R? into R’* which sends the point (x, y) into the
point (u, v) given by
u=x?~y’, v= 2xy.
What curves in the (x, y)-plane map under f into the lines u=constant, v=
constant? Into what curves in the (u,v)-plane do the lines x =constant, y =
constant map? Show that each non-zero point (u, v) is the image under f of two
points. Into what region does f map the square {(x, y):0 xx <1, 0<y<1}?
What region is mapped by f into the square {(u,v):0=u<1, 0<v <1}?
410 MAPPING THEOREMS AND IMPLICIT FUNCTIONS 393
=0 forx =0.
Show that h does not belong to Class C'(R) and that h is not injective on a
neighborhood of 0. However, it is surjective on a neighborhood of 0 and Dh(0) is
invertible.
41.K. Let f:R’?— R’ be defined by f(x, y)=(y,x+y’) for (x, y)eR®. Show
that f belongs to Class C’(R’) and that f is invertible on some neighborhood of an
arbitrary point of R?. Draw the image under f of the lines x = 0, +1, +2 and y =0,
+1, +2. Find the inverse g=f7':R’—R’ and show that Dg(f(xo, yo)) =
Df(Xu5 yo)".
41.L. (This exercise assumes familiarity with the notion of the determinant of a
square matrix.) Let Le ¥(R°, R’) and tet [c,] be the matrix representation of L
with respect to the standard basis in R°. It is shown in linear algebra that L is
invertible if and only if A=det[c,] is not zero. Furthermore, if A #0, then the
matrix of L“'h the form [p,/A], where the p, are polynomials in the c,.
(a) Show that if L, is invertible and if ||L —Ly||,, is sufficiently small, then L is
invertible.
(b) Show that if L, is invertible, then the map L+>L™ is continuous on a
neighborhood of L, with respect to the norm in £(R’, R°).
(c) Let QE2R® be open and f:Q— R° belong to Class C'(Q). If Df(c) is
invertible for some c€Q,, then Df(x) is invertible on some neighborhood of c.
41.M. Let F:R’— R be defined by F(x, y)=y’—x. Show that F belongs to
Class C'(R’) but that D,F(0,0)=0. Show that there does not exist a function @
defined on a neighborhood W of 0 such that F(x, p(x))=0 for all xe W.
41.N. Let f:R°— R’ be defined by
[; 1 4
1 -1 or
(a) Show that we can solve for (x, y)=¢(z) near z=0 and that
peo=|_ |
Rie wie
_ Zz 2-22?
forz<1.
o(z)= Gc 1)’ 2G- 5)
Check the result of part (a).
(c) Show that we can solve for (y, z)= (x) near x =0 and that
pwo-(.4}
394 DIFFERENTIATION IN R?
(d) Carry out the explicit solution of (y, z)= w(x) to obtain
2x +x _ 2x ) for x <3,
wo)= (FS "2x-1
A cro, 90.20(X) Ys Z) = F (Xo, Yo, Zo) + DF (Xo; Yo Zo)(X —Xas Y — Yos Z — Za)
= DF (Xos Yoo Zo)(X — Xu Y — Vos Z ~ Zo)
(a) Show that the tangent space at (Xo, yo, Zo) is given by
{(x. y, 2): Di F (Xo, Yo. Zo)(% — Xn) + DoF (Xo, You Zo)(Y — Yo) + DsF (Xe, Yor Z0)(Z — Zo) = OF.
Hence the tangent space to S; is a plane if at least one of the numbers
D,F (Xo, Yo, Zo), DoF (Xo, Yor Zo); DaF (Xo, Yo, Zo) is different from 0. In this case the
tangent space to S, is called the tangent plane to Sp at (Xo, yo, Zo).
41.Q. Let F:R’— R, given below, represent a surface S, in R* implicitly as the
level surface
(b) Prove the analogous result for the Implicit Function Theorem 41.9.
41.8. Let f:R’—R belong to Class C'(R’). Show that f is not injective;
indeed, the restriction of f to any open set of R’ is not injective.
41.T. Let g:R— R® belong to Class C’(R). Show that if c¢R, then the
restriction of g to any neighborhood of c is not a surjective map onto a
neighborhood of g(c).
41.U. Let Le £(R’, R*) be injective and let r >0 be such that r |[x|] < |[L(x)|| for
all xe R’. Show that if L,¢ £(R’, R*) is such that |[L,—L],,<r, then L, is
injective. (Hence, the set of injective maps is open in £(R’, R*).)
41.V. Let Le £(R’, R*) be surjective and let m > 0 be as in the proof of 41.6.
Show that if L,¢ £(R’, R*) is such that ||L,—L||,, <m/2, then L, is surjective.
(Hence, the set of surjective maps is open in £CUR’, R*).
41.W. Let g:R’ > R® belong to Class C'(R°) and satisfy ||Dg(x)|,, = a<1 for
allxeR’. If f(x)=x+ g(x) for xe R’, show that f satisfies
Projects
4l.a. (This project gives a direct and elementary proof of the Implicit Function
Theorem.) Let Q¢ R’ be open and let F:0. > R belong to Class C'(Q). Suppose
that (a, b)eQ, that F(a, b)=0, and that D,F(a, b)>0.
(a) Show that there exists a closed cell Q =[a,, a2] <[bi, b2] with center (a, b)
such that D,F(x, y)>0 for all (x, y)¢Q, and such that F(x, b,)<0 and F(x, b,)>0
for all x €[a,, a2].
(b) If x €[a,, ap], then the function F, :[b,, b.] > R defined by F,(y) = F(x, y) for
y €[b,, b.] is such that F,(b,\)<0<F,(b,) and Fi(y)>0 for y €[b,, be].
(c) There exists a function mapping [a,, a,] into [b,, b.] such that F(x, p(x))=0
for all x €[a,, a,].
(d) If xe(a,, a.) and {h| is sufficiently small, show that there exists h, with
0<}h,|<|h[{ such that
O= Fix +h, e(x+h)]- Fx, e(x)]
(a, b,)ER’ XR. Hence F(x,,..., Xpar, @(%1,- ~~, Xp11)) = 0, on this neighborhood.
Now put
(x1, 6, Xpar) = G(X, 2 Xp ary PCH, ~~» 5 Xpar))-
By Chain Rule
Dow = DyiiG + (Dyi2G)(Dp ie),
where these functions are evaluated at the appropriate points. Since Dip =
—(D,aF)(D,.F) we infer that D,,,H =—A/D,,.,.F which does not vanish at
(a, b,). Hence we can use (f) to obtain x,,.= o(x,,..., x,) on a neighborhood of
aéR’. (This establishes the Implicit Function Theorem in the case where q = 2;
extensions to the case of general q are obtained by induction.)
41.8. (This project is parallel to Project 40.@ and gives a more direct proof of
the first part of Inversion Theorem 41.8 then the one given in the text.) We
assume that Q¢R’ is open, that f:(— R? belongs to Class C’(Q), and that for
some x,€Q the linear mapping Df(x,) is a bijection. We let T= Df(x.)".
(a) Show that there exists r>0 such that if [|x — xl <7, then I-Ie Df(x)|},, <3.
(b) Let s = 3r [[[\k¢ and for fixed y with |ly — f(xo)[| < s, we define F,(x)=f(x)—y
for ||x—x. <r. Then F, is differentiable, |[['°F,(x,)| < 4r, and |[{-TeDF,(x)||,, <3
for ||x — xo <7.
(c) If lly —f(.)l}-< s, let G, be defined for |x —xdl|<r by G,(x)=x —Te° F,(x).
Then G, is a contraction with constant 3 on this ball.
(d) If ly —f(xo)|| < 5, define ¢(y) =x. and @,.1(y)= G,(¢,(y)) for n=0,1,2,....
Show that |lg..1(y)-@,(y)|| <2 |le.(y)—@o(y)||
<2 "77, whence it follows that
llenvi(y)— Ga (y)]} = 2-7 for n= m=O. In particular, |lp,.(y) —x,| <1, so that this
iteration is possible.
(e) Show that each of the functions ¢, is continuous for ||y — f(x.)|| < s and that
the sequence (¢,) is uniformly convergent to a continuous function g which is such
that G,(e{y))=¢(y) for |ly—f(x.)l|<s, whence it follows that f(@(y))=y for
ly —f(xo)|| = s. Hence the function ¢ is the inverse of f on the set {y :|ly —f(x0)|| =
s} and maps it into the set {x :||x— xl <r}.
41.y. (This project is parallel to Projects 40.@ and 41.6 and gives a direct proof
of the Implicit Function Theorem.) Let Q¢ R°x R* be open and let (x,, yo) EM.
Suppose that F:0Q-> R* belongs to Class C'(Q), that F(x, yo) =0, and that the
linear map L,: R*— R* defined by
(c) If ||x —xoll <s, define y(x)=yo and Yi(x)=G. (bh, (x)) for n=0,1,2,....
Show that |[h.1(x) — (x) = 2°" 'r for n = m= 0. Hence |ly,(x)— yoll < 4, so that
this iteration is possible.
(d) Show that each of the functions , is continuous for ||x — x,||< s and that the
sequence (if,) is uniformly convergent to a continuous function w such that
F(x, (x))=0 for all |x —x, |<.
(e) To show that is differentiable for ||x = x,|<s use Exercise 39.W and employ
anargument similar to thatin (d) and (e) of Project 41.« foreachcomponentof F.
PROOF. It follows from Corollary 39.7 that each of the partial deriva-
tives D,f(c), j=1,...,p, exist and that if u=(m,...,u,)e¢R°, then
Dflo\(u) = ¥ wf.
By the preceding theorem, Djf(c) = 0 for j =1,..., p, whence Df(c)(u) =0
forallue R?. O.E.D.
f(c
+ tw) = f(c) + Df(c)(tw)
+2D flew)’.
Since c is a critical point, it follows that if ||wl|=1 and 0<1t<6, then
fle + tw)-f(c) =3t’Df(c)\(wy= amt?
> 0,
Thus f(c+u)>f(c) for 0<|lu—cl|<6, whence f has a relative strict
minimum atc. Thus part (a) is proved and the proof of part (b) is similar.
(c) Let ws, w- be unit vectors in R? such that
D’f(c)(w.)’ > 0, D’f(c)(w_-)? <0.
It follows from Taylor’s Theorem that for sufficiently small t > 0 we have
f(c +tw.)>f(c), f(c+tw_)<f(c).
Thus c is a saddle point of f. Q.E.D.
On comparing Theorems 42.4 and 42.5, one is led to make the following
conjectures: (i) if c€Q is a point of relative strict minimum, then
D’f(c)(wy’ > 0 for all we R’, w #0, (ii) if c €O is a saddle point of f, then
D’f(c)(w)* takes on both strictly positive and strictly negative values, (iii) if
D°f(c)(w)’ = O for all we R®, then c is a point of relative minimum. All of
these conjectures are false, as may be seen by examples.
In order to implement Theorem 42.5 it is necessary to know whether the
function w+> D?f(c)(w)’ is of one sign. An important and well-known
result of algebra (see the book of Hoffman and Kunze cited in the
References) can be used to determine this. For each j=1,2,..., p, Ict
A, be the determinant of the (symmetric) matrix
Df (c) Difle)
D°f(c)(w)’
<0 for all w#0 and f has a relative strict maximum atc. In
other cases there can be extreme or saddle points.
In the important special case p = 2 a less elaborate formulation is more
convenient and a bit more information can be derived. Here we need to
examine the quadratic function
O = Au?+2Buv+ Cov’.
If A= AC— B*>0, then A# 0 (and C# 0) and we can complete the square
and write
Hence the sign of Q is the same as that of A (or C). On the other hand, if
A<O, then Q has both strictly positive and strictly negative values. This is
obvious from the above equation if A # 0 and is also readily established if
A=0.
We collect these remarks in a formal statement.
42.6 CoroLLary. Let Qc R? be open, let f:Q— R have continuous
second partial derivatives on ©, let ce€Q, be a critical point of f, and let
(42.3) A=Duf(c)Dzf(c)-[Diaf(c)).
(a) If A>0 and if Dif(c) > 0, then f has a relative strict minimum at c.
(b) If A>0 and if Diif(c) <0, then f has a relative strict maximum atc.
(c) If A<O, then f has a saddle point at c.
Some information concerning the case where A=0 will be given in the
exercises.
Until now we have been discussing the case where the extrema of the
function f:— R belong to the interior of its domain QC R°”. None of our
remarks applies to the location of the extrema on the boundary.
However, if the function is defined on the boundary of 0 and if this
boundary of © can be parametrized by a function , then the extremum
problem is deduced to an examination of the extrema of the composition
fee.
There is a related problem which leads to an interesting and elegant
procedure. Suppose that S is a “‘surface”’ contained in the domain Q of the
real-valued function f. It is often desired to find the values of f that are
maximum or minimum among all! those attained on S. For example, if
Q= R® and f(x) =|{x||, then the problem we have posed is concerned with
finding the points on the surface S$ which are closest to, or farthest from,
402 DIFFERENTIATION IN R?
the origin. If the surface S is given parametrically, then we can treat this
problem by considering the composition of f with the parametric represen-
tation of S. However, it frequently is not convenient to express S in this
fashion and another procedure is often more desirable.
Suppose S can be given as the points x in © satisfying a relation of the
form
g(x) =0,
for a function g defined on Q to R. We are attempting to find the relative
extreme values of f for those points x in © satisfying the constraint (or side
condition) g(x)=0. If we assume that f and g are in Class C’(Q) and that
Dg(c)#0, then a necessary condition that c be an extreme point of f
relative to points x satisfying g(x) =0, is that the derivative Dg(c) is a
multiple of Df(c). In terms of partial derivatives, this condition is that
there exists a real number A such that
Dif(c)
= ADig(c),
(42.4) 0
D,f(c) = AD, g(c).
g(c)=90
are then solved for the p +1 unknown quantities, of which the coordinates
of c are of primary interest.
42.7 LAGRANGE’s THEOREM. Let (.¢ R® be open and suppose thatf
and g are real-valued functions in Class C'\(Q). Suppose c €Q. is such that
g(c)=0 and that there exists on neighborhood U of C such that
fy=flc) — [or f(x) = fle)]
for all points x € U which satisfy g(x)=0. Then there exist real numbers «1,
A, not both zero, such that
(42.5) pDf(c) =ADg(c).
V(x, y, Z) = xyz
subject to the constraint
g(x, y, Z)=xyt+2xz+2yz-A=0.
Since the desired point will have strictly positive coordinates, Lagrange’s
Theorem leads to the system
yz =A(y+2z),
xz =A(x+2z),
xy =A(2x+2y),
xy+2xz+2yz—-A=0.
The first equality implies x = y, and the second implies y = 2z. Hence the
ratio of the sides are 2:2:1 and it follows from the last equation that
4z°+42z7+4z?= A which implies that z =3(A/3)’”. Therefore the volume
of this box is 3(A/3)°”.
Frequently there is more than one constraint; in this case the following
result is useful.
42.9 THEOREM. Let Q<R’ be open and suppose that
f and g1,..., &
are real-valued functions in C'(Q). Suppose that c€Q satisfies the con-
straints
and that there exists an open neighborhood U of a such that f(x) = f(c) [or
f(x) = f(c)] for all x € U satisfying these constraints. Then there exist real
numbers 2, A1,..., Ax not all zero such that
(42.8)
Dygile) Dpge(c)
is equal to k( <p). Then there are real numbers Ai, ..., Ax not all zero such
that
Dif(c) = A:Digi(c) Feb AcDigi(c),
(42.9) ee
Dpf(c) = Ai Dp gi(c) + > +A.Dpg(c).
PROOF. If we apply the formula (42.7) to e1,...,e,¢R°, we obtain a
system of equations with the right-hand side of (42.9) and the left-hand
side of (42.9) multiplied by uw. If =0, then the assumption that the rank
equals k implies that A,.=--:=A,.=0, contrary to hypothesis. Hence
p. #0 and we can normalize this system to obtain (42.9). QED.
42.11 ExampLe. Find the points on the intersection of the cylinder
{(x, y, z):x?+ y? = 4} and the plane {(x, y, z):6x+3y+2z = 6} which are nearest to
the origin and those which are farthest from the origin.
We shall search for relative extrema of the function
fQG y, z= x? +y?4+2?
subject to the constraints
glx, y, z)=x?+y?-4=0,
g(x, yz) = 6x +3y +2z-6=0.
The matrix corresponding to (42.8) in this case is
2x 6
0 2
which has rank 2 except at the point (x, y)=(0,0) which does not satisfy the
constraints. Hence we can apply the corollary to obtain the system
2x =A\(2x)+A,(6),
2y =A,(2Y) + A2(3),
2z= A.(2),
x*+y?=4,
6x+3y+2z =6,
406 DIFFERENTIATION INR?
of five equations in five variables. The third equation gives 4.=z, so we can
eliminate A, from the first two equations. To eliminate A,, we multiply the
resulting first equation by y and the second by x and subtract, to get
Inequality Constraints
Dyhi(c) Dih,(c)
(42.11)
Dyhi(c) Dyh-(c)
corresponding to those h; for which h;(c) = 0, is equal tor. Then we may take
=1 in (42.10). In addition, if f(x)=fic) [respectively, f(x)=f(c)] for
all x€U satisfying the constraints and if we take =1 in (42.10), then
A, <0 [respectively, A,=0] fori=1,...,r.
Dhi(c)(v})= 8.
Exercises
42.A. Find the critical points of the following functions and determine the
nature of these points.
(a) f(x, y)=x?+4xy,
(b) f(x, y)=x*+2y*+32x—-—y+17,
(c) f(x, y) =x? +4y?— 12y?—36y,
(d) f(x, y)=x*—4xy,
(e) f(x, y)=x?+4xy + 2y?—2y,
(f) f(x, y)=x?+3y*—4y?— 12y’,
42.B. Let QC R?® be open, let f:— R have continuous second partial deriva-
tives on Q, let c EQ be a critical point of f, and let 6>0.
(a) Show that if D?f(x)(w)? = 0 for all 0<||x—c||< 6 and we R®, thenc isa point
of relative minimum of f.
(b) Show that if D?f(x)(w)? > 0 for all 0< |x —c||< 8 and we R’, w#0, then c is
a point of relative strict minimum of f.
42.C. Let Qc R’ be open, let f:Q— R have continuous second partial deriva-
tives on Q, let ce be a critical point of f, and let
for xeQ. Suppose that for some 6 >0, then A(x) = 0 for all |x —c/]<4.
(a) If D..f(x) >0 (or if D2.f(x) > 0) for all x such that 0<||x —cl]< 8, show that c
is a point of relative minimum of f.
(b) If Di f(x) <0 (or if D.2f(x) <0) for all x such that 0<||x — cl|< 5, show that c
is a point of relative maximum of f.
42.D. Let f:R° > R be differentiable on R’ and f(x)=0 for all xe R’ with
|x| =1. Show that there exists a point ce R° with ||c||/<1 such that Df(c)=0.
(This is a version of Rolle’s Theorem in R’*.)
42.E. Use the Surjective Mapping Theorem 41.6 to establish Corollary 42.2.
42.F. Show that each of the following functions has a critical point at the origin.
Find which have relative extrema and which have saddle points at the origin.
(a) f(x, y)= xy’, (b) f(x, y)=x?—y’,
(c) fx y)=x?-y?, (d) f(x, yy=x*—x?y*+y4,
(e) f(x, yy=x?y —xy?, (f) f(x, y)=x*ty*
42.G. Show that the function f(x, y)=2x+4y—x’y* has a critical point but no
relative extreme points.
42.H. Study the behavior of the function f(x, y) =x*—3xy’ in a neighborhood of
the origin. The graph of this function is sometimes called a ‘‘monkey saddle.”
Why?
+E. J. McShane (1904— ) received his doctor’s degree from the University of Chicago.
He has long been associated with the University of Virginia and is widely known for his
contributions to integration theory, the calculus of variations, optimal control theory,
and exterior ballistics.
42 EXTREMUM PROBLEMS 409
42.1. Find the minimum distance from the point (2, 1, —3) to the plane 2x + y —
2z =4.
42.J. Find the dimensions of the rectangular box, open at the top, with given
volume and minimum surface area.
42.K. Find the minimum distance between the lines L, = {(x, y,z):x=2-t,
y=3+t, z=1-2t} and L,={(x, y, z):x=1-—s, y=2-s, z=3+s5}.
42.L. Give examples to show that each of the conjectures stated after Theorem
42.5 is false.
42.M. Suppose we are given n points (x;, y,), j= 1,..., in R? and wish to find
the affine function F:R — R given by F(x) = Ax +B, such that the quantity
» (F(x;)— yi)’
is minimized. Show that this leads to the equations
ALtt x +BY
r=1
=) i= xy,
j=l
for the numbers A, B. [This function F is said to be the affine function which “best
fits the n points in the sense of least squares.’’]
42.N. Let f:[0,1]—R be continuous on [0,1]. We wish to choose real
numbers A, B, C in such a way as to minimize the quantity
[The resulting function x > Ax?+ Bx + C is said to be the quadratic function which
“best fits f on [0, 1] in the sense of least squares.’’]
42.0. Use Lagrange’s Theorem to locate points on the curve y = x*+ x — 2 where
the function f(x, y)=x—y may have a relative extremum. Then sketch the curve
and the level curves for f to show that the point(s) located are not point(s) of
relative extrema for f.
42.P. Let f:R’—R be the quadratic function f(x, y)=ax?+2bxy+cy’ for
(x, y)<.R®. We wish to find the relative extrema of f on the unit circle {(x, y):x?+
y’=1}. Use Lagrange’s Theorem to show that the points (xo, yp) at which these
relative extrema are taken must satisfy the system
(a—A)Xo+
by, = 0,
bxo+(c—A)yo
= 0,
410 DIFFERENTIATION INR?
\?-(atc)a+t
(ac —b?) =0.
Show that the corresponding value of the multiplier A is equal to the extreme value
of f at such a relative extremum.
42.Q. The sum of three real numbers is 9. Find these numbers if their product is
to be maximized.
42.R. Show that the volume of the largest box that can be inscribed in the
ellipsoidal region
{x y, 2) a+ otos i}
M = f(c)
> sup {f(x):||xl| =r}= m.
Let g be defined by
» Dyfle) =0
for all x EQ.
(a) Use the argument of the preceding exercise to show that a harmonic function
on © attains its supremum and infimum on b(Q).
42) EXTREMUM PROBLEMS 4it
(b) If f and g are harmonic on © and if f(x) = g(x) for x € b(Q), then f(x) = g(x)
for all xe.
(c) Iff and g are harmonic on © and if f(x) = @(x), g(x) = b(x) for x € b(Q), then
42.Y. Show that the geometric mean of a collection of positive real numbers
{a,,..., a,} does not exceed their arithmetic mean; that is,
42.Z. (a) Let p>1, q>1, 1/p+1/q=1. Show that the minimum of f(x, y)=
(1/p)x* +(1/q)y* (x >0, y >0) subject to the constraint xy = 1 is equal to 1.
(b) Use part (a) to show that if a>0, b>0, then
ab<tartips.
Pp q
(c) Let {a,}, {bh}, i=1,...,n, be positive real numbers. Prove Hdélder’s
Inequality:
by letting A =( a”)’”, B=(© b*)’", and applying part (c) to a=a/A, b=b/B.
(d) Use Hdlder’s Inequality in (c) to obtain Minkowski’s Inequality:
412
43. THE INTEGRAL IN R?° 413
functions whose domain is a subset of the space R’, and the reader will
recall that this was in fact done in calculus courses where one considered
“double” and “‘triple”’ integrals. In this section we shall initiate a study of
the Riemann integral of real-valued functions defined on a suitable
bounded subset of R’. Although many of our results can be extended to
permit the values to be in R‘ for q > 1, we shall leave this extension to the
reader.
Content Zero
We recall from Section 5 that a cell in R is a set having one of the four
forms:
where a = b. The numbers a, b are called the end points of these cells. A
cell in R? is the Cartesian product J=J,x-+-+xJ, of pcellsin R. The cell J
is said to be closed (respectively, open) if each of the cells Jy,..... J, are
closed (respectively, open) in R. If the cells J; have end points a; = b;
(i=1,..., p), we define the content of J=J,x---x J, to be the product
c(J)
= (bi— ai) + + + (by
— Gp).
If p = 1, content is usually called “length”; if p = 2, the content is called ‘area’; if
p =3, the content is called “volume.” We shall use the word ‘‘content’’ because it is
free from special connotations that these other words may have.
It is important that the reader show that one can require the cells
appearing in this definition to be closed. or to be open, or to be cubes, and
the notion of content zero remains exactly the same.
414 INTEGRATION IN RP
cot cSi+---+e(hj<et+0+---+0=6,
and since ¢ >0 is arbitrary, it follows that Z has content zero.
(c) Any subset of a set with content zero has content zero. The union of
a finite number of sets with content zero has content zero.
(d) In the space R*, the diamond-shaped set S$ = {(x, y):|x|+]y|= 1} has
content zero. For, if ne N, we introduce squares with diagonals along S$
and vertices at the points x = y =+k/n (k =0,1,...,"), then we see that
we can enclose § in 4n closed squares each having content 1/n*. Hence
the total content is 4/n, which can be made arbitrarily small. (See Fig.
4.31.)
(e) The circle S$ ={(x, y):x’+y’=1} in R® has content zero. This can
be proved by modifying the argument in (d).
(f) Let f be a continuous function on J=[a, b]to R. Then the graph’
G ={(x, f(x)eR?:xeS}
has content zero. This can be proved by using the uniform continuity of f
and modifying the argument in (d).
Figure 43.1
43. ‘THE INTEGRAL IN RP? 415
(g) The set SR? consisting of all points (x, y) where both x and y
belong to 1M Q is a countable set but does not have content zero. Indeed,
any finite union of cells containing $ must also contain the cell Ix J, which
has content 1.
In contrast to (f) we note that there are ‘“‘continuous curves” in R’ which
have positive content. Indeed, there are continuous functions f, g on
I=[0, 1] to R such that the set
S={(f(t), 3); ce
contains the cell Ix Fin R*. Such a curve is called a space-filling curve, or
a Peano curve. (See Exercise 43.U.)
and, for each k=1,...,p, let P, be a partition of [a,, b.] into a finite
number of closed cells in R. This induces a partition P of I into a finite
number of closed cells in R’. If P and Q are partitions of I, we say that P
is a refinement of Q if each cell in P is contained in some cell in Q.
(Alternatively, noting that a partition is determined by the vertices of its
cells, P is a refinement of O if and only if all of the vertices contained in Q
are also in P.)
43.3. Derinition. A Riemann sum S(P; f) corresponding to a parti-
tion P={J,,...,J,} of I is given by
S(Psf)= ¥ flaw)e(),
where x, is any “intermediate” point in J., k=1,...,n. A real number L
is defined to be the Riemann integral of f over I if, for every ¢ >0 there is a
partition P. of I such that if P is any refinement of P. and S(P; f) is any
Riemann sum corresponding to P, then (S(P;f)-—L|<e. In case this
integral exists we say that f is integrable over I.
=| Ff
416 INTEGRATION INR?
Jr or [ffs yy ae v9,
and when p=3 we occasionally write
filx)=f(x) for xe A,
=0 for xe1\A.
If the function f; is integrable on I in the sense of Definition 43.3, then it is
an exercise (see 43.M) to show that the value f; fr does not depend on the
choice of the closed cell I containing A. Becauuse of this we shall say that
f is integrable on A and define
[t+ [ 1
hoele lt
43. ‘THE INTEGRAL IN R? 417
[, (of+6er-of fre «
PROOF. This result follows from the fact that the Riemann sums for a
partition P of a cell IDA satisfy
S(P; af+ Bg) = aS(P; f) + BS(P; g),
when the same intermediate points x, are used. Q.E.D.
43.6 TuHeorem. If f:A—R is integrable on A and if f(x)=0 for
xeéA, then faf= 0.
PRooF, Note that S(P; f) = 0 for any Riemann sum. Q.E.D.
pRooF.
[ifs
Extend f and g to functions f;, g defined on a closed cell I
containing A. The hypotheses imply that h,=f;—g: is bounded and
equals 0 except on E. By Theorem 43.7 we deduce that hy is integrable
on I and the value of its integral is 0. Applying Theorem 43.5, we infer
that g;=f;—h; is integrable on I and
IS'(P;
f)- SQ; f)| = |S'(P; f)-S'(P.s
f+ |S'(P.s -S'(Q; f)|
= 2ec(]).
Figure 43.2
43. THE INTEGRAL IN R? 419
IS(P;
f)- S(Q; f)| = e(2c(+2M);
since ¢ >0 is arbitrary, we infer from the Cauchy Criterion that f is
integrable on I. Q.E.D.
Necessary and sufficient conditions for integrability will be given in
Exercise 43.P and Project 44.a.
Exercises
43.A. (a) Let a=(a,,...,a,)€R° and let J;=[a,, a,],..., J, =[a,, a,] be cells
in R. Show that J=J,x---xJ, has content zero in R’. Hence the set {a} has
content zero in R’.
(b) If we take J{=(a,, a,), then the cell J’=J{xJ,x---XJ, is empty and has
content zero.
43.B. Show that a set ZR’ has content zero if and only if for each ¢ >0
there exists a finite set K,,..., .K, of cubes whose union contains Z and such that
c(K,)+++++c(K,)<e.
43.C. Write out the details of the proof of the assertion, made in Example
43.2(f), that the graph S ¢ R’ of a continuous function f:[a,b]— R has content
zero.
43.D. If J is a closed cell in R* and g:J-—R is continuous, show that the
graph {(x, y, g(x, y)):(x, y}e D}CR? of g has content zero.
43.E. Let A&R’ be the set consisting of all pairs (i/p, j/p) where p is a prime
number, and i, j=1,2,...,p—1. Show that each horizontal and each vertical
line in R? intersects A in a finite number (often zero) of points. Does A have
content zero?
43.F. Let 12 R” be a closed cell and let P={h,..., 4} and Q={i,..., Jn}
be two partitions of I into closed cells. Show that R={LNJ,:
i=1,...,n;j=1,...,m} is a partition of I and that R is a refinement of both
P and QO. The partition R is called the common refinement of P and Q.
43.G. If I<J are cells in R° and if P is a partition of I, show that there exists a
partition Q of J such that every cell in P belongs to Q.
43.H. Let ZR?’ be aset with content zero and let I be a closed cell containing
Z. If J,,...,J, are cells contained in I whose union contains Z, show that there
exists a partition P of I such that the closure of each J, is the union of cells in P.
43.1. Let Z CR?’ be a set with content zero and let I be a closed cell containing
Z. If ¢ >0, show that there exists a partition P, of I such that the cells in P, which
contain points in Z have total content less than «.
43.J. In the preceding exercise, show that we can choose P, to have the
additional property that the cells in P. which contain any points of Z contain them
in their interior.
420 INTEGRATION IN R?
2. (M,— m,)o(I.)<e,
where M, =sup {f(x):xeJ,} and m, =inf {f(x):xeJ} for j=1,...,”. This result
is called the Riemann Criterion for Integrability (cf. Theorem 30.1).
43.Q. Let Ic R® be a closed cell and let f: I> R be bounded by M. if f is
integrable on I, show that the function |f| is integrable on J and that J,|f| < Mc(J).
43.R. Let I< R® be a closed cell and let f, g:I1 > R be integrable on I. Show
the product function fg is integrable on I.
43.8. Let Ic R® be a closed cell and let (f,) be a sequence of functions which are
integrable on I. If the sequence converges uniformly on I to f, show that f is
integrable on I and that
[r-wm([)
t " 1
43.T. Let K<R?° be a closed cube and let f, g:K > R be continuous. Show
that if « >0, then there exists a partition P, ={K,,..., K,} of K into cubes such that
if x, y, are any points of K, j=1,...,7, then
|, #8 -¥ fereonetK,)
| <eetK).
43.U. (This exercise gives an example due to I. J. Schoenbergt of a space-filling
t Isaac J. SCHOENBERG (1903-— ) was born in Romania and educated there and in
Germany. For many years at the University of Pennsylvania, he has worked in number
theory, real and complex analysis, and the calculus of variations.
43. THE INTEGRAL IN R?° 421
curve.) Let ¢:R—R be continuous, even, with period 2, and such that
e(t)=0 for O<t<},
=1 for f<t<l.
(a) Draw a sketch of the graph of y. Note that |lollx = 1.
(b) If teJ, define f(t) and g(t) by
I
fi) Se) +3 oF +H eB) + ey
1 1 1
git) = 5931) +33 9B) +53 oH +- >
Show that these series are uniformly convergent, so that f and g are continuous
on £
(c) Evaluate f(é) and g(t), where t has the ternary (base 3) expansion given by
0.2020, 0.0220, 0.0022, 0.2002.
(d) Let (x, y) belong to the graph S = {(f(t), g(t)):t € I} and write x and y in their
binary (base 2) expansions:
xX =0.0,0203..., y=0.6,8.B,...,
where a,, 8, are either 0 or 1. Let t be corresponding real number whose ternary
expansion is
t= 0.(2a,)(2B1)(2a2)(2B2) snes
Show that x = f(t) and y = g(t). Hence every point in the cube I x I belongs to the
graph S.
43.V. A set Z&R?’ has measure zero if for each « >0 there is a sequence (J,)
of cells whose union contains Z and such that } c(J,)<e.
(a) Since the empty set is a cell, show that a set which has content zero also has
measure zero.
(b) Show that every countable set in R’ has measure zero. Hence the set in
Example 43.2(g) has measure zero (but it does not have content zero).
(c) Show that, in the definition of ‘‘measure zero” given above, we can require
the cells to be open, or to be cubes.
(d) Show that every compact set with measure zero also has content zero.
(e) The union of a countable family of sets with content zero has measure zero.
Projects
43.a. Let IR’ be a closed cell and let f:1—R be bounded. if P=
{Ji,.-.,J,} is a partition of J, let
m,=inf{fx):xeF}, M,=sup{f(x):xeJ}
for j=1,..., and define the lower and upper sums of f for P to be
(a) If S(P; f) is any Riemann sum corresponding to P, then L(P; f) = S(P; f)<
U(P;f). Ife >0, then there exist Riemann sums S,(P; f) and S.(P; f) correspond-
ing to P such that
SP; f) = L(P; f)te, U(P; f)—«e = S.(P;
f).
In this section we shall introduce the collection of sets with content, and
characterize the content function as a real-valued function defined on this
collection of sets. Next we shall obtain some further properties of the
integral over sets with content, and show how the integral can be evaluated
ee
as an “iterated integral.”
44.1 Derinition. If A CR?, then we recall that a point x € R° is said
to be a boundary point of A if every neighborhood of x contains both
points in A and points in its complement (A). The boundary of A is the
subset of R® consisting of all the boundary points of A; it will be denoted
by b(A).
44. CONTENT AND THE INTLGRAL. 423
(A)= [a=] 1.
Note that if J&R? is a cell, then its boundary consists of the union of a
finite number of “faces,” which are cells each having content zero. [For
example, if J=[(a, b]<[c, d], then b(J) is the union of the four cells
[a, b} x[c, c], [a, b|x[d, d],
[a, a] x[e, d], [b, b] x[c, d].
These same four cells are also the boundary of the cell (a, b) x(c, d).] It
follows that a cell in R’ has content; moreover, one easily sees that if
J= [ai, b.]x nx [a,, bp], then
(= [ 1= (ba)
+++, a).
Hence the content of a cell, as given by Definition 44.2, is consistent with
the definition of the content assigned to a closed cellin Section 43. Similar
remarks apply to other cells in R°; in particular, it is seen that if
K =[a,, bi) X---[a,, b,), then
44.3. Lemma. A set AC R?® has content zero (in the sense of Definition
43.1) if and only if it has content (in the sense of Definition 44.2) and
c(A) =0.
PROOF. Suppose that ACR? has content zero. Then, if e >0, we can
enclose A in the union U of a finite number of closed cells with total
content less than «. Since this union U is a bounded set, then A is
bounded; since U is closed, it also contains b(A). Since e >0 is arbitrary,
we infer that b(A) has content zero; hence A has content and
c(a)= | 1.
Now let I be a closed cell containing A UB and let f., fr, fi, fa be the
functions equal to 1 on A, B, ANB, A UB, respectively, and equal to 0
elsewhere on I. Since each of these functions are continuous except on
sets with content zero, they are integrable on I. Since
fatfo=fitf.,
it follows from Theorem 43.5 and the definition of content that
This establishes the formula given in (a); the one in (b) can be proved
similarly.
To prove (c), note that if « >0 is given and if Ji,...,J, are cells with
total content less than ¢ whose union contains b(A), then x +Ji,...,x+Jn
are cells with total content less than « whose union contains b(x+ A).
Since ¢ >0 is arbitrary, the set x+A belongs to @(R"). To show that
e{x + A)=c(A), let I be a closed cell containing A; hence x +1 is a closed
cell containing x +A. Let f;:1— R be such that fi(y)=1 for ye A and
fily)=0 for yeI\A, and let fp:x+I—R be such that f.(z)=1 for
zEx+A and f.(z)=0 for ze(x+I)\(x+A). Show that to each
Riemann sum for f; there corresponds a Riemann sum for f2 which is equal
to it. Hence
Now c and y are both invariant under translation of the set and agree on
K,. Moreover c and y are additive over disjoint finite unions. Hence it
follows that
PROOF. Since «. possesses properties (i) and (ii), it is easily seen that wu is
monotone in the sense that A ¢ B implies that u(A) = »(B). If u(Ko) =
0, then p» of any bounded set is 0, whence it follows that w(A)=0 for all
A€Q(R°), so we can take m=0. If (Ko) #0, let
1 G ,
(A) =— (Ko) (A) for all Ae @(R").
2(R")
Since it is readily seen that y has properties (i), (ii), (iii), and (iv) of the
theorem, it follows that y=c. Hence we take m = p,(Ko). Q.E.D.
aan Li[rels
(b) Let A belong to DUR”) and let Ai, A2ze BCR") be such that A=
AiU A, and such that A, A2 has content zero. If f: A —> R is integrable
on A, and if the restrictions off to A: and Az are integrable, then (44.1)
holds.
proof. (a) Let I be a closed cell containing A =A,iU Az. and let
fi: I— R, i=1, 2, be equal to f on A; and equal to 0 elsewhere on I. By
428 INTEGRATION IN R°
[eq[s i=1,2.
It follows from Theorem 43.5 that f,+f2 is integrable on I and that
[u+to=[refe
Now since f(x)=fi(x)+/f2(x) provided xe A \(A:N Az), it follows from
Lemma 43.8 that f is integrable on A and that (44.1) holds.
(b) We preserve the notation of the proof of (a). By hypothesis, f, is
integrable on I. Now, fi(x) =fi(x)+f.(x) except for x in A.M Ao, a set
with content zero. It therefore follows from Theorem 43.5 and Lemma
43.8 that
[re t-[aorr-[
re] p
= [. f+ {. f. O.E.D.
(44.2) | f| = Mc(A).
I.
More generally, iff is real valued and m =< f(x) = M for all xe A, then
S(P.sf)-e=
| f=S(Psfite.
We note that if the intermediate points of the Riemann sum are chosen
outside of A whenever possible, we have
S(Ps f= Li fue),
44. CONTENT AND THE INTEGRAL. 429
and since « >0 is arbitrary we obtain the right side of inequality (44.3).
The left side is established in a similar manner. QED.
(44.4) [, F=f),
_ pRoor. If c(A)=0, the conclusion is trivial; hence we suppose that
c(A)#0. Let
1
(44.5) may |, f< M.
If both inequalities in (44.5) are strict, the result follows from Bolzano’s
Intermediate Value Theorem 22.4.
Now suppose that f4f=Mc(A). If the supremum M is attained at
p <A, the conclusion also follows. Hence we assume that the supremum
M is not attained on A. Since c(A) #0, there exists a closed cell KC A
such that c(K)#0 (see Exercise 44.G). Since K is compact and f is
continuous on K, there exists « >0 such that f(x) <= M-e for all xeK.
Since A = K U(A\K) it follows from Theorem 44.9 and 44.10 that
me(A)= |, f= J. t+] f
<(M-—e)c(K)+Mc(A \ K)<Mc(A),
F(y)= [ flay) de
Let c=yo= yi=::-=y,=d be a partition of the interval [c, d], let
a=Xo=x15°''-=x,=b be a partition of [a,b], and let P denote the
partition of J obtained by using the cells [xx-1, xx] X[yj-1, yj]. Let y* be
any point in [y;-1, y] and note that
FO =D flrk yum),
We multiply by (y;— y;-.) and add to obtain
j=lk=1
44. CONTENT AND ‘THE INTEGRAL 431
Now the expression on the left side of this formula is an arbitrary Riemann
sum for the integral
[if {lena}
As a consequence of this theorem, we shall obtain a result which is often
used in evaluating integrals of functions defined on a set which is bounded
by continuous curves. For convenience, we shall state the result in the
case where the set has horizontal line segments as its top and bottom
boundaries, and continuous curves as its lateral boundaries. Clearly, a
similar result holds if the lateral boundaries are vertical line segments and
the top and bottom boundaries are curves. More complicated sets are
handled by decomposing the sets into the union of subsets of these two
types.
44.14 THEoreM. Let ACR? be given by
A={@ y):a(y)
=x = By), c=y=d},
x= aly) x= B(y)
Figure 44.1
432 INTEGRATION IN R?
where a and B are continuous functions on [c, d] with values in [a, b]. Iff is
continuous on A > R, then f is integrable on A and
prooF.
[r-[{[[lmna}e
Let J be a closed cell containing A and let f; be the extension of
ftoJ. A variation of Example 43.2(f) shows that the boundary of A has
content zero; hence f; is integrable on J. Now for each yé[c, d] the
function x + f(x, y) is continuous except possibly at the two points a(y)
and B(y), at which it has one-sided limits. It follows from the preceding
theorem that
c¥(A)
= sup c(U), c*(A) =inf c(V),
where the supremum is taken over the set of all finite unions of cells contained in A,
and the infimum is taken over the set of all finite unions of cells containing points
of A.
44. CONTENT AND THE INTEGRAL, 433
(a) Prove that c,(A)=c*(A) and that A has content if and only if ca(A)=
c*(A), in which case c(A) is this common value.
(b) If A and B are disjoint subsets of R’, show that c*(A UB) < c*(A)+c*(B).
(c) Give an example of disjoint sets A, B such that 0#c*(A)=c*(B)=
c*(A UB).
44.1. Let MEN and let I, ¢ R” be the cube with half length 2” and center 0.
For néN we divide I, into a grid G,,,, of length 2 " formed by the collection of all
cubes in I, with side length 2" and dyadic rational end points (that is, end points of
the form k/2" where k € Z).
(a) If J <Iy is a closed cell and e >0, show that there exists n € N such that the
union of all the cubes in Gy, which are contained in J has a total content more than
c(J)—« and the union of all the cubes in Gy,,, which contain points in J has total
content less than c(J)+e.
(b) If A&I, has content and e >0, show that there exists n€ N such that the
union of all cubes in Gy, which are contained in A has total content exceeding
c{A)-« and the union of all cubes in Gy,, which contain points in A has total
content less than c(A)+<«.
44.3. Let I< R’ be aclosed cell and let f: 1 > R be integrable on I. If ACT has
content, then the restriction of f to A is integrable on A. (Hint: Use Exercise
43.P.)
44.K. Let Ae Q(R") and suppose that f and g are integrable on A and that
g(x)=0 for all xe A. If m=inf f(A), M=sup f(A), then there exists a real
number 2 €[m, M] such that
| fe =u
A A
8.
44.L. If, in addition to the hypotheses of the previous exercise, we suppose that
A is connected and f is continuous on A, then there exists a point p € A such that
[ fg =f) 8.
44.M. Let {(x,, y,):n € N} be an enumeration of the points in (0, 1) x (0, 1) with
rational coordinates. For each néEN, let I, be an open ceil in (0, 1) (0, 1)
containing (x, y.), and let G= Unen I. Show that G is an open set in R’ whose
boundary b(G) is (0, 1)<(0,1)\ G. Show that if © c(,)<1, then the open set G
does not have content.
44.N. Using the terminology of Exercise 7.K, let A <[0, 1] be a “Cantor-like”
set with length }. If K =A x[0, 1], show that K is a compact subset of R’, that
b(U&) = K, and that K does not have content.
44.0. Let a = b and let f:[a, b] — R be continuous and such that f(x) = 0 for all
xef[a, b]. Let S,={(x, y):a=<x <b, 0<y <f(x)} be called the ordinate set of f.
By examining the boundary of S, show that it has content. Show that
that A does not have content and that f is not integrable on Q. However, the
iterated integrals exist and satisfy
LL f[rnaar-c
but that {3 f(x, y) dy does not exist for rational x.
44.R. Let Jc R’ be an open cell containing (0, 0) and let f: J > R be continuous
on J. Define F:J— R by the iterated integral:
[ Fay(x2, «+ «5 Xp) dix...) | {{ fix, X25. + +5 Xp) ax,| d(X2, «2.5 Xp)
(Uy a a
(d) Extend the result to the case where for each point (x.,...,x,) in Jy) the
function x,+> F(x,, X2,...,%,) of [a,, b.]-> R is continuous except possibly for a
finite number of points at which it has one-sided limits.
44.U. (a) Let a, 8:[a, b]> R be continuous with a(x) < B(x) for all x €[a, b].
Show that the set
B={(x, y)eR’?:a=<x=b,a(x)<y
= B(x)}
is a compact set in R? with content.
44. CONTENT AND THE INTEGRAL 435
(b) Now let y, 6:B — R be continuous functions with y(x, y) < 6(x, y) for all
(x, y)€B. Show that the set
Jenn {Ls
44.W. Use the Weierstrass Approximation Theorem to show that if I=
[a,, b,]x-+-x[a,, b,] and if g: I= R is continuous, then
be fhe b,
[e-| {{ ff Blt, Xo 8) day} + dea} de,
J ay kway ap
44.X. Let @:[0, +)— R with ~(0)=0 be continuous, unbounded, and strictly
increasing, and let y be its inverse function. Hence & is also continuous and
strictly increasing on [0, +).
(a) If a, 8 are positive numbers, compare the area of the interval [0, «]x[0, 8]
with the areas bounded by the coordinate axes and the graph of » to obtain
‘Young’s Inequality:
ap = [¢ + [v.
(b) If p=1 and q=1 are such that (1/p)+(1/q)=1, and if g(x)=x' and
w(x) = y”, use Young’s Inequality to establish the inequality
ap = a*/p+ BiY/q.
Y |ab| = AB,
which was obtained in Project 8.8(b).
Projects
44.0. Let IG R” be a closed cell and let f: 1 R be bounded. For a>0, let
D, = {x €I:@,(x) = a}, where w,(x) denotes the oscillation of f at x (see Project
23.a).
436 INTLGRATION IN R?
D (M,~m)c(J,) = ac*(D,).
Deduce that f is not integrable on 1
(d) Conclude that f is integrable on I if and only if the set D, has content zero
for all a>0.
(e) Recall that D = ),.~ D,,, is the set of points where f is discontinuous. Show
that D has measure zero (in the sense of Exercise 43.V) if and only if each set D,,,
has content zero.
(f) Conclude that f is integrable on I if and only if its set D of points of
discontinuity has measure zero. (This result is Lebesgue’s Criterion for
Integrability.)
44.8. This project considers lower and upper integrals (introduced in Project
43.a) and their iterations. Let I< R' and JCR* be closed cells, p=r-+s, and let
K=IxXJcCR’=R'xR*. Suppose that f:K > R is bounded.
(a) For each x € I, define g,:J > R by g.(y)=f(x, y) for yeJ. LetA:I—R be
defined to be the lower integral A{x) = L(g,) of g,, and let ~:1— R be defined to
be the upper integral x(x)= U(g,) of g.. If R is any partition of I, and S is any
partition of J, and P=RXS the resulting partition of K, then show that
L(P; f) = L(R; A) = U(R; A) = U(R; pw) = UP; f).
(b) Show that
[fe Ja-[u
(c) For each ye J, define h,:1—>R by h,(x)=f(x, y) forxel Let A’:J OR
and «.’:J >R be defined by
Te atx) <e.
(c) Let Q=R’. Show that the content function c:@(R’)— R_ has strong
density identically equal to 1 on R?.
(d) Suppose that w:(R’)— R is a positive additive function which is invariant
under the translation of sets [that is, 2(x +A)=p(A) for all xe R’, Ac D(R’)).
Show that » has a strong density on R’ which is a constant on R°.
(e) If f:Q— R is continuous and if F is defined as in (a), show that F has strong
density f on 0.
(f) If G:@(Q) = R is additive and has strong density g:(1— R, show that g is
continuous on Q. Hence g is uniformly continuous on every A € B(Q).
(g) Suppose that G : 9(Q) — R is additive and has strong density identically zero
on 2, Show that if K is a closed cube and if ¢ >0, then there exists a partition of K
into cubes {K,,...,K,} such that |G(K;)|< ec(K) for j=1,...,7, whence it
follows that |G(K)| =< ec(K). Conclude that G(K) = 0 for all closed cubes K <Q.
(h) Suppose that F, and F, are additive functions on @(Q) such that for some
M >0 we have |F,(A)| = Mc(A) for all AG @(Q), j=1, 2. If F.(K)=F,(K) for
every cube K <Q, prove that F,(A)=F,(A) for all A €@(Q).
happen if the mapping is in Class C' and shall study the mapping of sets
with content under C’ maps. The case of a linear map is particularly
important and the result is satisfyingly simple. In the case of a non-linear
mapping, it will be seen that the Jacobian of the mapping indicates the
extent of the ‘‘distortion” of the transformation.
These results will be used to establish a theorem concerned with the
“change of variable” of an integral over a set in R’. The special cases of
polar and spherical coordinates are briefly examined, and a stronger
theorem is given that applies to many transformations that exhibit a mild
amount of singularity.
45.1 Lemma. The 0. R? be open and let p:0.— R?° belong to Class
C'(Q). Let A be a bounded set with AT<Q. Then there exists a bounded
open set QO, with ATS 0, 507 EO and a constant M>0 such that if A is
contained in the union of a finite number of closed cubes in 0, with total
content at most a, then ¢{A) is contained in the union of a finite number of
closed cubes with total content at most (Vp Myra,
proo. If Q=R?", let 6&=1; otherwise let 6=+inf {la—x|:
aéA, x€Q}. Since A™ is compact, it follows that 5>0. (Why?)
Now let 0,={yeR°:||y—al|<6 for some a€ A}, so that 0, is open
and bounded and A-cQ, and 0;<. Since ge C(Q) and 0; is
compact, it follows that M =sup {|De(x)llp:x€Q:} is finite. If Ac
I,U-:+-UI,, where the J; are closed cubes contained in 04, then it follows
from Corollary 40.6 that for x, yeJj we have
pRooF. Let %=2% R’™ so that Qp is open in R’, and define @:Q%—> °
R? by
(Kiyo Xep Netty oe ey Xp)
= WK, 2, Xe).
45. TRANSFORMATION OF SETS AND INTEGRALS 439
Evidently ¢ € C'(Q»). Let Ao= A X{0,..., 0} so that Ap < Me and Avo has
content zero in R’. It follows that #(A) = ¢(A,) has content zero in R’.
Q.E.D.
We note that this corollary asserts that the C’ image of any bounded set of
“lower dimensionality” has content zero.
Since the boundary of a set A with content has content zero, it follows
from Theorem 45.2 that if y is in Class C’ then @(b(A)) has content zero.
Unfortunately ¢(b(A)) need have little relation, in general, to b(@(A)).
This observation enhances the interest of the next two results.
45.4 THEOREM. Let Q.<R? be open and let ¢:Q— R?® belong to Class
C*\(Q). Suppose that A has content, AT <Q, and J,(x) #0 for all xe A®.
Then @{A) has content.
We shall now see that sets with content are mapped by a linear map in
R’ into sets whose content is a fixed multiple of the original content.
Moreover, this multiple is the absolute value of the determinant corre-
sponding to the linear map. (In this theorem we shall assume that the
440 INTEGRATION IN R°
proor. If L is singular (that is, if det L=0), then L maps R?® into a
proper linear subspace of R°. Since this subspace can also be obtained as
the image of some L’: R’ > R? with r<p, it follows from Corollary 45.3
that c(L(A))=0 for all Ae @(R°). Hence the statement is true for linear
maps which are singular.
if L is not singular (that is, if det L #0), then Theorem 45.4 implies that
if A€@(R’), then L(A)eE G(R’). We now define A:D(R°)—R by
A(A)=c(L(A)). @ It is clear that A(A)=O for all Ae BR’).
(ii) Suppose A, Be @(R”) and AM B= 9; then
mimc(A)= ¢(L°M{(A))=c(L(M(A)))
= mc(M(A)) = mumnc{A)).
Hence we have mrw = mmm for-all non-singular L, Me £(R°).
It remains to show that m. =|det L|. To do this we shall use the fact
from linear algebra that every non-singular L € £(R?) is the composition
of linear maps of the following three forms:
¢(L3(Ko)) = c(A2)
+ ¢((1, 0,..., 0)
+ Ai) = c(A2)
+ c(Ar)
=c(A;UA2) =c(Ko).
Hence m ,= 1=|det L;|.
Now let the non-singular linear map L be the composition of linear maps
Li, Ls,...,L, having one of the three forms given above. Since
45.8 THE JACOBIAN THEOREM. Let 0.2 R? be open and suppose that
g :0— R? belongs to Class C'(Q), is injective on Q, and that J,(x) #0 for
x€Q. Suppose that A has content and A~<Q. If ¢ >0 is given, then there
exists y >0O such that if K is a closed cube with center x € A and side length
less then 2y, then
Hence, if K is a closed cube with center x € A and side length less than 2
(where y = Biv), then inequality (45.2) holds. Q.E.D.
Change of Variables
We shall now apply the Jacobian Theorem to obtain an important
theorem which is a generalization to R°® of the Change of Variables
Theorem 30.12. The latter result asserts that if o@:[a,B]—R has a
continuous derivative and if f is continuous on the range of ¢, then
for x €[a, B]. If m happens to be injective, we note that the exact analog for (45.5)
in the case p=1 is
[1 [Geert
where A = inf {e(a), o(8)} and B=sup{p(a), p(B)}. Of course, if ¢’(x)>0 for
a =x < B, then formula (45.5) reduces to (45.4); while if o'(x) <0 fora =x <8,
then formula (45.5) reduces to
[t= [deocen,
whence (45.4) also follows. The explanation for this difference is that the integral
over intervals in R is “oriented” in the sense that we define
[ve-[i
for any real numbers u, v. No such orientation has been defined for integrals over
R’.
TJ. T. SCHWARTZ (1930- ) was graduated from CCNY, received his doctorate at Yale
University, and is a professor at the Courant Institute of New York University. Although he
is best known for his work in functional analysis, he has also contributed to differential
equations, geometry, computer languages, various aspects of mathematical physics, and
mathematical economics.
45. TRANSFORMATION OF SETS AND INTEGRALS 445
Gi) || eel
| Fee) In|
= | I... (fee) |Jo| | = MpMac(A \ B) = [M;Mz]e.
It follows from Lemma 45.1 that c(@(A \ B)) < (Vp M,)’s, so that
(For, if necessary, we can divide the cubes K:,..., Kn into small cubes;
see Exercise 43.T.)
Since @ is injective, two sets from {@(Ki):i=1,..., m} intersect at most
in a set ¢(K, 0 K;) which has content 0 since c(K; 1 K,;)=0. Also, since
¢(k:) has content, then f is integrable on ¢(K;); hence it follows from
Theorem 44,.9(b) that
{Eftp(B) i= Je(kK)
[FH flpdel@K), T= 1, sm
Since p: € e{K;), there exists a unique y; € K; with p;= (yi), i=1,...,m.
Hence we have
Applications
The use of the theorem on the change of variables when p>1 is
generally different from the application of the corresponding theorem
45. TRANSFORMATION OF SETS AND INTEGRALS 447
| x(1+ x7)? dx
QO
and so we have
. — > 3 — oO. .
where A =[1, 9]x[1, 4] and where u(x, y) and v(x, y) are given in (45.6).
system (given by rays through the origin and circles centered at the origin).
Alternatively, we can think of polar coordinates as a map cf (r, 0)€ R’
into (x, y)< R® given by
(45.7) (x, y)= (7, 0)
= (1 cos 4, r sin @).
Any pair of numbers (r, 6) € R’ such that (x, y) =(r cos 6, r sin 0) is called a
set of polar coordinates of the point (x, y). Usually one requires r = 0; even
so, each point (x, y) in R* has infinitely many sets of polar coordinates.
For example, if (x, y) = (0, 0), then (0, @) is a set of polar coordinates of (0, 0) for
all 0€ R; if (x, y) # (0, 0) and (, 6) is a set of polar coordinates for (x, y), then for
each ne Z the pair (r, 9+n277) is also a set of polar coordinates for (x, y).
If (x, y)#(0, 0), then the unique pair (r, @) with r>0, 0<@<27, is
called the principal set of polar coordinates of the point (x, y). Thus the
function » gives rise to an injective map of (0, +%) x[0, 27) onto R*\
{(0, O)}. It also gives a map of [0,+~)x[0, 22r) onto R? but it is not
injective, since it sends all the points (0, 4), 0 =< @<27, into (0,0). Note
also that the Jacobian is given by
If (x, y, z) is such that (x, y) # (0, 0), then the unique triple (r, 6, @) with
r>0,0<0<27, 0<¢<z7, is called the principal set of spherical coordi-
nates of (x,y,z). Thus the function ® yields an injective map of
450 INTEGRATION IN R?
PROOF. Since b(A) and b(Qo) are compact and have content zero, we
may assume that they are contained in E; therefore A°\ECQOo\E. Since
A and E have content, the set A \ E has content; moreover, since E is
closed, then (A\E)°=A°\E so that J,(x)#0 for xe€(A\E)’.
Therefore, by Theorem 45.4 applied to A \ E, we deduce that ¢(A \ E)
has content. It follows from Theorem 45.2 that @(F) has content zero,
and since p(A)=@((A \ E)U(ANE))=@(A\ E)UG(ANE), that e(A)
has content. Since f is bounded on ¢(A) and continuous except on a
subset of @(E), we deduce that f is integrable over p(A). Moreover, since
fe is continuous except on a subset of E, we deduce that (fe) |J.| is
integrable over A. It remains to show that these integrals are equal.
Now apply Lemma 45.1 to E to obtain a bounded open set Q, with
Ec,¢;
<Q and a constant M,>0, with the property that if E is
contained in a finite union of closed cubes in 0, with content at most a >0,
45. TRANSFORMATION OF SETS AND INTEGRALS 451
Exercises
45.A. Let OQ¢ R’ be an open set and let f:— R° satisfy a Lipschitz condition
on Q; that is, for some M > 0, ||f(x)— f(y)|| = M ||x — yl for all x, yeQ. If K cQisa
cube with side length s >0, show that f(K) is contained in a cube with side length
Myps. Show that if AO is a compact set with content zero, then f(A) has
content zero, and if B <Q is a compact set with content, then f(B) has content.
45.B. Consider the polar coordinate map (x, y)=@(r, 6)=(rcos 6,rsin @)
defined on R®, and its behavior on the set A =[0.1]*[0,27]. Use Theorem 45.4 to
obtain the reassuring information that the image D = ¢(A), which is the unit disk
D={(x, y):x?+ y’ = 1}, has content. Investigate the manner in which ¢ maps the
boundary of A. Show that the boundary of D is the image under @ of only one side
of A, and that the other three sides of A get mapped into the interior of D.
452 INTEGRATION IN R°
45.C. Consider the map (x, y)= (u,v) =(sinu,sinv) defined on R’.
Determine the image of the boundary of B=[-—in,ia]x[-ia, ia] under &,
and the boundary of #(B). Show that most, though not quite all, of the boundary
points of (B) are images of interior points of B.
45.D. Given that the area of the circular disk {(x, y):x’+y’ = 1} is equal to a,
find the areas of the elliptical disks given by:
[fernac
y= | fate o=3
B
Uf ve}as
directly. Then use the transformation (x, y)+> (u, v) =(x, y—x’) to evaluate this
integral.
45.H. Determine the area of the region bounded by the curves
In particular, we have
45.J. Let §:R*— R be as in the preceding exercise. Show that y maps the
triangular region A={(x, y):0 =x =1,0<y = x} into the triangular region
A. = #(A)={(u, v):0susil,usvs2-uh.
45.K. Let a<B belong to [0, 27] and let h:[a, 8] > R be continuous and such
that h(@) = 0 for @e[a, B]. Let H={(6,r)eR’:a <6 < B, Osr=sh(@)} be the
ordinate set of h (see Exercise 44.0), so that H has content. The polar curve
generated by h is the curve in R’ defined by 0+ (h(6) cos 6, h(6) sin @), and the
polar ordinate set of this curve is the set
(X) =f (fC) dx
45.M. Let 0 <a<b and let f:[a, b] > R and S, be as in the preceding exercise.
Let p,: R*— R° be defined by p,(x, y, 6) =(x cos 4, y, x sin @) and let Y; be the
image of S,x[0,2a] under p,. (The set Y; is called the ‘‘solid of revolution
obtained by revolving the ordinate set S, about the y-axis.”) Use Theorem 45.11
to show that
[Jew aa y=Fa-e",
CR
(c) From the fact that Cz © Bry < Cavs, show that
. 8 ey V_t
um (flew dx) =F,
whence it follows that fe" e’ dx =iV a.
45.0. Let B = {(x, y):4x7+9y? <= 4}. Use an appropriate change of variables to
evaluate
Project
45.a. This project is based on Project 44.y and provides an alternative approach
to the Change of Variables Theorem 45.9. Let Q¢ R” be open, and let p:Q > R”
belong to Class C'(Q), be injective on ©, and such that J,(x) #0 for allxeQ. For
simplicity, we also suppose that there exists M>O such that ||p(x)—¢(y)||<
M |x —y|| for x, yeQ.
(a) If b:B(Q)— R is defined by
then ® is additive on %(Q) and has strong density equal to |J,|. Moreover, for
some M,>0, we have ®(A) < M,c(A) for all Ae BQ).
(b) If f is a bounded function which is integrable on every set »(A), for
AeQM(Q), and if ¥:2(Q)— R is defined by
wA={ wlA)
of
then W is additive on @(Q). Moreover, for some M,>0, we have |W(A)|=<
M.c(A) for all AE BO).
(c) If f is a bounded and continuous function on @(Q), and if V is defined as in
(b), show that W has strong density equal to (f°@) |J,|.
(d) If f is as in (c), show that
This list includes books and articles that were cited in the text and some
additional references that will be useful for further study.
456
REFERENCES 457
The reader is urged not to look at these hints unless he is stymied. Many of the
exercises call for proofs, and there is no single way that is correct; even if the reader
has given a totally different argument, his may be entirely correct. However, in
order to help the reader learn the material and to develop his technical skill, some
hints and a few solutions are offered. It will be observed that more detail is
presented for the early material.
Section 1
ENUA,=U(ENA)).
The opposite inclusion is proved by reversing these steps. The other equality is
handled similarly.
LL. Ifxe€((\{A, :j EJ), then x¢ (){A,:jeJ}. This implies that there exists a
k eJ such that x¢ A,. Therefore, x ¢€(A,), and hence xe U{€(A,):j es}. This
proves that €((] A,)< LU €(A,). The opposite inclusion is proved by reversing
these steps. The other equality is similar.
Section 2
2.A. If (a, c) and (a, c’) belong to g°f, then there exist b, b’ in B such that (a, b),
(a, b’) belong to f and (b, c), (b’, c’} belong to g. Since f is a function, b = b’; since g
is a function, c=c’.
2.B. No. Both (0, 1) and (0, -1) belong to C.
2.D. Let f(x) = 2x, g(x) =3x.
458
HINTS FOR SELECFED EXERCISES 459
2.E. If (b, a), (b, a’) belong to f~’, then (a, b), (a’, b) belong to f. Since f is
injective, then a=a’. Hence f7' is a function.
2.G. If f(x.) = f(x.) then x1 = gof(x,) = gef(x.)=x.. Hence f is injective.
2.H. Apply Exercise 2.G twice.
Section 3
Section 4
Section 5
5.A. Since a?= 0 and b?= 0, then a’*+b’=0 implies that a? = b?=0.
5.D. If c=1+a with a>O0, then c"=(1+a)"=l+naztit+a=ce.
5.G. Observe that 1<2'=2. Ifk <2" fork = 1, thenk+1<2k<2-2*=2'".
Therefore, n<2" for all neN.
5.H. Note that b"- a" = (b—a)(b""'+---+a"")=(b—a)p, where p>0.
5.M. {(x, y):y =+x}.
5.N. A square with vertices (+1, 0), (0,41).
Section 6
6.A. If A={x}, then x,=supA. If A={x1,...,%)%eit and if u=
sup {x,,...,X,}, show that sup {u, x,.:} is the supremum of A.
6.C. Let S={xeQ:x’?<2}.
6.E. In fact, sup A U B =sup {sup A, sup B}.
6.H. If S=sup {f(x, y):x eX, y € Y}, then f(x, y) < S for all x € X, ye Y, and so
fi(x)=S for all xe X. Hence sup {fi(x):xeX}<S. Conversely, if « >0 there
exists (Xo, Yo) such that S—e <f(Xo, yo). Hence S—e <f,(x,) and therefore S—«<
sup {fi(x):x¢X}. Since «>0 is arbitrary, we infer that S = sup {f,(x):x € X}.
6.K. Since f(x) = sup {f(z):z¢€ X}, it follows that
f(x)+ g(x) = sup {f(z):z © X}+ supig(z):z € X}.
460 HINTS FOR SELECULD EXERCISES
Therefore sup {f(x)+ g(x):x € X} is less than or equal to the right hand side.
Similarly, if x ¢ X, then
inf {f(z):z € X}+ g(x) = f(x)
+ g(x).
If we use 6.J, we infer that
Section 7
7.B. Let aeA; if aéA’ then a€B’ and so €<é’ <a, a contradiction.
Therefore ae A’ and since a€A is arbitrary we have ASA’. Since —< &', there
exists xe R with &é<x<&'. Since <x, we must have x eB. But since x € A’ we
infer that A# A’.
7.C. Let A={x:x<1}, B={x:x = 1} and A’={x:x <= 1}, B’={x:x>1}.
7.E. Ifx € I, for all n, we have a contradiction to the Archimedean Property 6.6.
7.F. If xeJ, for all n, we have a contradiction to Corollary 6.7(b).
7.H. Every element in F, has a ternary expansion whose first digit is either 0 or
2. The points in the four subintervals of F, have ternary expansions beginning
0.00...,0.02...,0.20...,0.22...,
and so forth.
7.J. If n is sufficiently large, 1/3°<b—a.
7.K. As close to 1 as desired.
Section 8
IIx]?
+ 20x > y) + llyl? = Ibe + ylP= Cell l ylb?
= [lxlP +2 [lll lyll+llylP.
Hence x - y =||x|| |ly|| and the condition for equality in Theorem 8.7 holds provided
the vectors are non-zero.
8.P. Since ||x + yl? = |x|? +2(% - y)+|lylf?. the stated relation holds if and only if
x-y=0.
8.Q. A set K is convex if and only if it contains the line segment joining any two
points in K. If vy. ye Ky, then |x
+ —dyll<t|x}+ -ollyj<et+0—-—n=1 so
HINTS FOR SELECTED EXERCISES 46]
ix+(1-t)y¢K, forO<t=<1. The points (+1, 0) belong to K,, but their midpoint
(0, 0) does not belong to K,.
8.R. If x, y belong to (] K,, then x, y¢ K, for alla. Hence tx+(1—t)yeK, for
all a; whence it follows that (| K. is convex. Consider the union of two disjoint
intervals.
Section 9
Section 10
Section 11
Section 12
12.B. Let A, B be a disconnection for C’= C U{x}. Then ANC’ and BNC’
are disjoint, non-empty, and have union C’. One of these sets must contain x;
suppose it is B. Since B is an open set, it also contains points of C so
CN(B \ {x} #@. But then A, B\{x} form a disconnection of C.
12.E. Modify the proof of Theorem 12.4.
12.G. By Theorem 12.8, the sets C, and C, are intervals. It is easily seen that
C, XC; is convex so {2.E applies.
Section 13
13.A. Examine the geometrical position of iz =(—y, x) in terms of z =(x, y).
13.B. Note that cz = (x cos 6— y sin 6, x sin 8+ y cos @), and this corresponds to a
counter-clockwise rotation of @ radians about the origin.
13.C. The circle |z — c| =r is mapped into the circle |w— (ac + b)|=|al|r. Wecan
write z=a 'w—a'b and calculate x =Rez, y=Imz in terms of u=Rew,
v=Imw. Doing so we easily see that the equation ax + by =c transforms into an
equation of the form Au+Bv=C.
HINTS FOR SEEILCTED EXERCISES 463
13.D. A circle is left fixed by g if and only if its center lies on the real axis. The
only lines left fixed by g are the real and the imaginary axis.
13.E. Circles passing through the origin are sent into lines by h. All lines not
passing through the origin are sent into circles passing through the origin; all lines
passing through the origin are sent into lines passing through the origin.
13.F. Every point of C, except the origin, is the image under g of two elements
of C. If Re g(z)=k, then x°-y’=k. If Img(z)=k, then 2xy=k. If |g(z)|=k,
then k = 0 and |z|=~k.
Section 14
Section 15
15.A. Consider z, = y,—x, and apply Example 15.5(c) and Theorem 15.6(a).
15.C. (a) Converges to 1. (b) Diverges. (f) Diverges.
15.D. Let Y=—X.
15.F. Consider two cases: x =0 and x >0.
15.G. Yes.
15.H. Use the hint in Exercise 15.F.
15.L. Observe that b <x, = b2*”.
Section 16
16.A. By induction 1<x, <2 for n= 2. Since xy4)— X. = (Xa — Xn-1)/(%nXn-1), the
sequence is monotone.
16.C. The sequence is monotone and bounded. The limit is (1+ (1 +4a)'”)/2.
16.D. The sequence X is monotone decreasing and bounded.
16.E. An element x, of X =(x,) is called a “peak” for X if x,=x, for n>k.
(i) If there are infinitely many peaks with indices k,<k,<..., then the sequence
(x,,) of peaks is a decreasing subsequence of X.
(ii) If there are only a finite number of peaks with indices k,<---<k,, let
m,>k,. Since x,,, is not a peak, there exists m, > m, such that x,,<X»,. Continuing
in this way we obtain a strictly increasing subsequence (x,,,) of X.
16.G. The sequence is increasing and x, = n/(n+1)<1.
464 HINTS FOR SELECTED EXERCISES
16.K. There exists K € N such that if n = K, then L~e = x,.,/x, 5 L+e. Now
use an argument similar to the one in Exercise 14-1.
16.M. (a) e, (b) e*”, (c) Hint: (1+ 2/n)=(14+ 1/n)(1+ 1/(n+1)), (d) e*.
16.P. Let y, € F be such that ||x — y,||<d+1/n. If y =lim (y,,), then |x — y||=d.
Section 17
17.A. All.
17.C. If x eZ, the limit is 1; if x¢ Z, the limit is 0.
17.E. If x =0, the limit is 1; if x0, the limit is 0.
17.G. If x>0 and O<e<7n/2, then tan(m/2—e)>0. Therefore nx =
tan (7/2—¢) for all n = n,, from which 7/2—e < Arctan nx < 7/2.
17.H. If x>0, then e*<i. 17.J. Not necessarily.
17.M. Consider the sequence (1/n) or note that ||fillp = 3.
17.P. Yes. 17.Q. Yes.
Section 18 .
19.E. If j <n, then x, <x,., and x¥(1+1/n) <= x,+(1/n)x,..1. Now add.
19.1. If X is increasing and not convergent in R, then X is not bounded.
19.K. (a) None exist. (b,c) All three are equal. (d) The iterated limits are
different and the double limit does not exist. (e) The double limit and one iterated
limit are equal. (f) The iterated limits are equal, but the double limit does not
exist.
19.L. Let x,, =n if m=1 and x,,=0 if m>1.
19.N. In (b,c, e).
19.0. Apply Corollary 19.7 to x = sup {X%mn im, n © N}.
19.P. Let x =0 for mn and Xm =(-1)"/n for m =n.
Section 20
Jx—al__ |x—al
We—val= vx+vVa <
Va
20.B. Apply Example 20.5(b) and Theorem 20.6.
20.C. Apply Exercise 20.B and Theorem 20.6.
HINTS FOR SELECTED EXERCISES 465
Section 21
is (x, y)
= (0, 0).
21.1. Note that g(x)=g(y), if and only if g(x—y)=8.
21.P. Note that c; =e, « f(e,) and apply the Schwarz Inequality.
Section 22
Section 23
23.A. The functions in Example 20.5(a, b, i) are uniformly continuous on R.
23.G. The function g is bounded and uniformly continuous on [0, p}.
23.1. If (x,) is a sequence in (0,1) with x,—>0, then (f(x,)) is a Cauchy
sequence and is therefore convergent in R.
23.K. Take f(x) =sin x, g(x) =x, for xER.
Section 24
Section 25
25.G. (b) If « >0, there exists 5(¢)>0O such that if c<x<c+8(e), xe D(f),
then |f(x)—b|<e. (c) If (x,) is any sequence in D(f) such that c<x, and
c=lim (x,), then b = lim (f(x,)).
25.J. (a) If M>O, there exists m>O such that if x =m and xe D(f), then
f(x) =M. (b) IfM <0, there exists § > 0 such that if 0<|x —c|< 6, then f(x)<_M.
25.L. (a) Let p(r) =sup {f(x):x >r} and set L=lim,_,. g(r). Alternatively, if
« >0 there exists m(e) such that if x = m(e), then |sup {f(x):x >r}-L]<e.
25.M. Apply Lemma 25.12.
25.N. Consider the function f(x)=—1/|x| for x#0 and f(0)=0.
25.P. Consider Example 20.5(h).
25.R. Not necessarily. Consider f.(x)=—x" for xe.
25.8. Yes.
Section 26
26.B. Show that the collection # of polynomials in cos x satisfies the hypotheses
of the Stone-Weierstrass Theorem.
26.E. If f(0) = f(a) =0, first approximate f by a function g vanishing on some
intervals [0,5] and [7—6,a]. Then consider h(x)=g(x)/sinx for x €(0, 7),
h(x) =0 for x =0, 7.
26.1. Consider f(x) =sin (1/x) for x#0.
26.K. Use the Heine-Bore] Theorem or the Lebesgue Covering Theorem as in
the proof of the Uniform Continuity Theorem.
HINTS POR SELECTED EXERCISES 467
Section 27
27.D. Observe that g'(0)=0 and that g’(x) = 2x sin (1/x)—cos (1/x) for x #0.
27.E. Yes.
27.L. We can write
(Fx)
— f(n))/x| = |(x — n)/x] [f'n] = [Ge — 2)/x| [b|/2.
Section 28
Section 29
29.D. If e>0, then there are rational numbers n,...,7, in F such that
O<f(x)<e if x#r,. Let P be a partition such that each of the (at most 2m)
subintervals containing one of the r,,..., 7, has length less than e/2m. Show that
Os S(P;f, g) s2¢.
29.5. If fi(x)
= f(x) for x¢{c,...,¢,} and ¢ >0, let P be a partition such that
each of the subintervals containing one of the c.,...,¢m has length less than
e/2mM, where M = sup {|lf|[;, |i}. Using the same intermediate points, we have
IS(P; f, 2) -S(P; f,, g)}<e, where g(x) =x for xeJ.
29.N. Suppose c € (a, b); then f is g-integrable over [a, c] and[c, b]. If g, is the
restriction of g to [a,c], it follows from 27.N than g; is continuous on [a, c];
similarly for the restriction g, of g to[c, b]. It follows from Theorem 29.8 that fg;
is integrable over [a,c] and that fg; is integrable over [c, b] and that
and let 6=e/4nM. If Q=(yo, yi,..-, Ym) is a partition with norm ||Ol|<
4, let
Q*=QUP, so that Q* 2 P and has at most n—1 more points than Q. Show that
S$(Q*; f)—S(Q; f) reduces to at most 2(n—1) terms of the form +{f(€)—f(n)}
(x;— y.) with |x,—y.|<6.
Section 30
where |u,—v,|<6(e) and hence this sum is dominated by eM. Now use the
Cauchy Criterion.
30.E. Direct estimation yields
<= M(b—a)"".
([ voor dx)
Conversely, f(x) = M-—e on some subinterval of [a, b].
30.H. If m = f(x)=M for a =x = 8, there exists an A with m= A <M such
that
m[ p= { fp<M| p-
Section 31
31.K. Since f.(x)-fi(c)=Si ff, we can apply Theorem 31.2 to obtain f(x)—-
f(c)=J§ig for all xeJ. Show that g=f’.
31.8. Apply Theorem 30.9 to (31.2) with h(t)=(b-1).
31.V. Prove that the functions G and H are continuous. The remainder of the
proof is as in 31.9.
31.X. The function f is uniformly continuous on J,
x J.
31.Y. Let g(0)=0, g(x) =} for O0<x <3, and g(x) =1.
HINTS FOR SELECTED EXERCISES 469
Section 32
32.D. (a), (b), (d), (e) are convergent.
32.E. (a) is convergent for p, q>—1. (b) is convergent for p+q>—1.
32.F. (a) and (c) are absolutely convergent. (b) is divergent.
32.G. (a) is absolutely convergent if q>p+t1. (b) is convergent if q>0 and
absolutely convergent if q>1.
Section 33
Section 34
34.C. Group the terms in the series S%_, (—1)" to produce convergence to —1
and to 0.
34.G. Consider = ((—1)"n-*”). However, consider also the case where a, = 0.
34.H. If a, b= 0, then 2(ab)’? < a+b.
34.J. Show that b,+b,+---+b, = aiti+---+1/n).
34.3. Use Exercise 34.F(a).
34.K. Show that a,+a,+--+-+a, is bounded below by }{a,+2a.+: +++ 2"ax}
and above by a,+2a,+: + -+2" anit ay.
34.0. Consider the partial sums s, with n/2=k <n and apply the Cauchy
Criterion.
Section 35
Section 36
Section 37
37.A. (a) and (c) converge uniformly for all x. (b) converges for x#0 and
uniformly for x in the complement of any neighborhood of x =0. (d) converges for
x >1 and uniformly for x = a, where a>1.
37.C. If the series is uniformly convergent, then
lc, sin nx +++ ++ C2, sin 2nx|<e,
provided n is sufficiently large. Now restrict attention to x in an interval such that
sin kx >} for n<k <2n.
37.H. (a) ~, (c) 1/e, (f) 1.
37.L. Apply the Uniqueness Theorem 37.17.
37.N. Show that if néN, then there exists a polynomial P, such that if x #0,
then f(x) =e"'”’P,(1/x).
37.T. The series A(x) = (a,x"), B(x) = (b,x"), and C(x) =% (c,x") converge
to continuous functions on IE By the Multiplication Theorem 37.8, C(x)=
A(x)B(x) for 0=x<1, and by continuity C(1) = A(1)B(1).
37.U. The sequence of partial sums is increasing on the interval [0, 1].
37.V. If ¢ >0, then |a,| < ep, forn >N. Break the sum ¥ (a,x") into a sum over
n=1,...,N and asum over n>N.
Section 38
oases.1 2[cosx
cos2x
cos3x , cos 5x
cos4x cos 6x
(e) | LP? + 7 + 32 |
Section 39
39.G. We have |G(u, v)— G(0, 0)| = Ju?+ v7] =|K(u, v)|? so that DG(O, 0)(u, v) =
0. If (x, y)#(0, 0), then D,G(x, x) = 2x sin (2x?)"'— x7! cos (2x’)"', which is not
bounded as x 0.
39.L. (a) Vine
f: = (2a, 2b, 2c).
(c) Vasofs = (be, ac, ab).
39.M. (a) 0. (c) 4/v6.
39.0. (a) At (1, 2) we have {(x, y, z):z-5=2(x—1)+4(y—2)}.
(c) At (1, 1) we have {(x, y, z):z -V2 = —(x
+ y ~ 2)/V2}.
39.Q. (a) At £=0 we have {(x, y,z):x=t,.y=0,z=0}; at t=1 we have
{(x, y, z):x=1+s, y=14+2s,z=14+3s}.
(c) At t=3a we have {(x, y, z):x=~—2s, y=2, z=ia0+t+s}.
39.8. (b) At the point (3, —1, —3) corresponding to (s, t)=(1, 2) we have:
S. ={, y, z):x =34+(s—1)+(t-2), y=~—14+(s—-1)-(t-2),
z=-34+2(s—1)—-4(t—2)}.
(d) At the point (1,0,0) corresponding to (s,1)=(0,3ar) we have: S,=
{(x, y,z):x=1,y=s,z2 =-(t-ia)}.
39.V. Note that if ye R’, z¢R’, then (y,z)e R*xR'=R" is such that
Ky, 2P = lly P+ lle.
Section 40
40.A. F(t) =2(3t+ 134+ 2(2t—3)2 = 261-6.
40.D. D,F(s, t)= (sin s cos t+sin t)(—sin s)+ (cos s + sin t)(cos s cos t)+ 0.
40.G. (a) D,F(x, y)=f'(xy)y, DaF(x, y) =f'(xy)x.
(d) DiF(x, y)=f'Q?—y’)(2x), DiF(x, y)= f'(x?— y’)-2y).
40.K. (b) Since g’{(t)= D,f(tc)c,+---+D,flte)c,, it follows from Euler’s Rela-
tion that we have
Therefore (why?) g(t) = Ct* for some constant C. Since f(c) = g(J) = C, we deduce
that f(tc) = g(t)=t'f(c), whence f is homogeneous of degree k.
40.M. Since
40.P. Since Dg(c)(u) = (ugi(c), ..., ugi(c)) =ug'(c) for ue R, it follows from
the Chain Rule that
Section 41
Section 42
42.A. (a) Saddle point at (0,0). (b) Relative strict minimum at (—2.}3).
(c) Saddle point at (0, —1); relative strict minimum at (0,3). (f) Saddle point at
(0, 0); relative strict minima at (0, —1) and (0, 2).
42.D. If f is not constant, then either the supremum or the infimum of f on
S ={x ER? :||x|| <1} is not 0. Since $ is compact, this supremum (or infimum) is
attained at a point ce S. The hypothesis rules out the possibility that |lcl|= 1.
42.F. (a,d) Relative minimum at (0,0). (b,c,e) Saddle point at (0,0).
(f) Relative strict minimum at (0, 0).
42.G. Saddle point at (1,1).
42.H. Monkeys have tails.
PINTS TOR SELECTED EXERCISHS 473
42.1, 4. 42.K. 3.
42.S. (a) Maximum value = 1, attained at (+1, 0); minimum value = —1, attained
at (0,+1). (b) Maximum value=3, attained at (1,0); minimum value= ~—1,
attained at (-1,0). (c) Maximum value= 4, attained at (1, +1); minimum value=
—1, attained at (—1,0). (d) Maximum value= 1, attained at (0, 7/2); minimum
value = —1; attained at (0, —7/2).
42.U. Maximum value= 1, attained at (1, 0, 0); minimum value =, attained at
(—s 3 3).
Section 43
43.B. If peN is given, let n>(2'°-1)"'. If a cell I in R°® has side lengths
O<a,<a,<:-+:<4,, let c=a,/n. Then I is contained in the union of n((a,/c]+
1)---([a,/ce]+1) cubes with side length c, having total content less than
2(a,-+-a,)=2c(1). Hence, if Z is contained in the union of cells with total
content less than e, it is contained in the union of cubes with total content less than
26.
43.E. No.
43.H. If the closure of J, is [ai,6,]x---[a,, b,], for j=1,...,n, and if
I=[a,, b,]x---[a,, b,], let P, be the partition of [a,, b,] obtained by using the
points {aj, bi:j=1,...,a},..., and P, be the partition of [a,, b,] obtained by
using {a,,b,:j=1,...,n}. The partitions P,,...,P, induce a partition of I.
43.1. Enclose Z in the union of a finite number of closed cells in I with total
content less than e. Now apply 43.H.
43.J. Enclose Z in the union of a finite number of open cells in I with total
content less then «. Now apply 43.H.
43.K. We form a sequence of partitions of I into cubes with side length 2°"8 by
successive bisection of the sides of I. Given a cube K <I with side length r, enclose
K in the union of all cubes in the nth partition which have non-empty intersection
with K. If n is so large that (1+ 8/2""'r)’ <2, then this union has total content less
than 2c(K).
43.L. Use 43.G. 43.M. Use 43.L.
43.R. First treat the case f= g; then consider (f+).
43.T. Let M>lflL. Ilgl. Since f and g are uniformly continuous on K, if P, is
sufficiently fine, thenf and g vary less than e/2M on each K;, and such that for any
p,¢K, we have |fxfg—L f(p,)g(p,)e(K,)|=(e/2)c(K). Then we have
Section 44
Section 45
475
476 INDEX
Complement of a set, 7 Descartes, R., 8
Complex number system, 86 ff. Diagonal method, 24, 190, 227
Components of a vector, 57 Difference, of two functions, 142
Conditional convergence, 289 of two sequences, 91
Conjugate, of a complex number, 86 symmetric, 10
Connected set, 80 Differentiable function, 349
Connectedness, preservation of, 153 Differential equation, 256
Constraint, 402 Differentiation Theorem, for integrals, 231
Content, inner, 432 for power series, 322
of a cell, 413 Dini, U., 173
outer, 432 Direct image, 19
of a set, 423 Directional derivative, 349
zero, 413 Dirichlet, P. G, L., 140
Continuity, 136 ff. Dirichlet’s discontinuous function, 140
of the inverse function, 156 Test for convergence, 263, 306
one-sided, 215 Test for uniform convergence, 268, 318
uniform, 158 ff. Disconnected set, 80
Contour, 78 Disconnection, 80
Contraction, 161 Discontinuity Criterion, 138
Convergence, absolute, 265 Discrete metric, 60
of Fourier series, 335 ff. Disjoint sets, 4
in mean, 253 Divergence, of a sequence, 92, 127
in mean square, 254 Domain, of a function, 12
in a metric space, 103 Dominated Convergence Theorem, 273
interval of, 320 Dot product, 54
radius of, 320 Double limit, 130
of a sequence, 92 sequence, 130 ff.
ofa sequence of functions, 114 series, 310 ff.
uniform, 117, 133, 267, 316
Convex function, 211 Element, ofa set, 1
set, 59 Equicontinuity, 189 ff.
Coordinates, cylindrical, 454 Equivalent sequences, 128
polar, 449 Euler, L., 373
spherical, 449 Exponential function, 44, 146, 208, 329
ofa vector, 57 Extension, of a continuous function, 186
Correspondence, 23
Cosine series, 329 of a function, 15
Countable set, 23 Exterior point of a set, 65
Covering, 73 Extremum, 397
Critical point, 398
Cube, 413 Fejér, L., 338
Curve, polygonal, 83 Fejér’s Theorem, 339
space-filling, 415 Field, 28
Cut, 45 Finite set, 23
Property, 46 First Mean Value Theorem, 230, 232
Cylindrical coordinates, 454 Fixed point, 161
Flyswatter Principle, 102
D’Alembert, J., 296 Fourier, J.-B. J., 330
Darboux, G., 197 Fourier coefficients, 331
Darboux’s Theorem, 199 cosine series, 342
Decreasing function, 145 series, 330 ff.
sequence, 105 sine series, 343
Dedekind, R., 45 Fresnel, A., 264
De Moivre, A., 239 Fresnel integral, 264
De Morgan, A., 8 Frobenius, G., 328
De Morgan’s Laws, 8 Function, 10 ff.
Density of a set function, 437 absolute value of, 143
of the rational numbers, 41 additive, 145,437
Denumerable set, 23 affine, 350
Derivative, 194 ff., 349 ff. Beta, 283
block partial, 360, 385 bijective, 18
directional, 349 bilinear, 373
one-sided, 200, 335 bounded variation, 225
partial, 348 Class C’ , 376
INDEX 477
composition, 15 Hyperbolic functions, 211
content, 423, 426 Hypergeometric series, 304
continuous, 137
convex, 211 Identity element, of a field, 28
decreasing, 145 Image, 12, 19, 20
derivative of, 194, 349 Imaginary part, 86
differentiable, 349 Implicit Function Theorem, 384, 395, 396
direct image of, 19 Improper integrals, 257 ff.
domain of, 12 Increasing function, 145
even, 200, 332 sequence, 104
exponential, 44, 146, 208, 329 Inequalities, basic properties of, 32 ff.
Gamma, 264, 282 Inequality, arithmetic-geometric, 60, 411
greatest integer, 145, 221 Bernoulli, 36
harmonic, 410 Bessel, 334
homogeneous, 372 Cauchy, 60
hyperbolic, 211 Chebyshev, 62
increasing, 145 Holder, 61, 202, 411, 435
injective, 17 Minkowski, 61, 411
inverse, 17 Schwarz, 56
inverse sine, 18 Triangle, 36,56
inverse image of, 19 Infimum, 38
Laplace transform of, 283 Property, 39
linear, 147 Infinite integral, 259 ff.
logarithm, 45, 146, 209, 238 limits, 127
monotone, 145 product, 305
nondifferentiable, 195 series, 286 ff.
odd, 200, 332 set, 23
periodic, 164, 330 Injection, 17
piecewise, continuous, 330 Injective function, 17
piecewise linear, 168 Injective Mapping Theorem, 377
polynomial, 144 Inner content, 432
positively homogeneous, 372 Innex product, 54, 254
range of, 12 product space, 54
semicontinuous, 180 Integrability theorems, 216, 228-229, 418,
step, 167 420, 436
surjective, 18 Integral, 212 ff., 415 ff.
square root, 18, 157 improper, 257 ff.
trigonometric, 209 ff., 238, 329 infinite, 259 ff.
Fundamental Theorem, of algebra, 88 iterated, 246
of integral calculus, 231 lower, 225, 422
partial, 259
Gamma function, 264, 282 transformation of, 234, 443 ff.
Gauss, C. F., 88 upper, 225, 422
Geometric mean, 60, 411 Integral Test, for series, 300
series, 289 Integrand, 214
Global Continuity Theorem, 151 Integration by parts, 219, 233
Gradient, 357 Integrator, 214
Graves, L. M., 378 Interchange theorems, relating to continuity,
Greatest integer function, 145, 221 166, 244, 270, 316
lower bound (= infimum), 38 telating to differentiation, 204, 245, 271,
Grid, 433 317, 322, 369
relating to infinite integrals, 270 ff.
Hadamard, J., 320 relating to integration, 241 ff., 244 ff.,
Half-closed cell (or interval), 47 316 ff., 321, 430 ff.
Half-open cell (or interval), 47 relating to sequences, 166, 204, 241 ff.,
Hardy, G. H., 260 273 ff.
Harmonic function, 410 relating to series, 316 ff., 321
series, 290 Interior maximum, 195
Heine, E., 74 point, 65
Heine-Borel Theorem, 74 of a set, 67,423
Helly Selection Theorem, 227 Intermediate Value Theorem, 153
Holder, O.,61 Intersection of sets, 4
Holder’s Inequality, 61, 202, 411, 435 Interval, of convergence, 320
Homogeneous function, 372-373 inR,47
478 INDEX
unit, 47 of Fourier series, 337
Inverse function, 17 Mean Value Theorem, for derivatives in R,
continuity of, 156 196 ff.
Inverse image, 20 for derivatives in RP , 364 ff,
Inverse sine function, 18 for integrals in R, 230, 232-233
Inversion mapping in C, 89 for integrals in R? , 429
Inversion Theorem, 381, 396 Means, arithmetic, 129
Irrational elements of a field, 31 Measure zero, 421
Irrational powers of a real number, 44 Member of a set, 2
Iterated integrals, 246, 275 ff., 430 ff. Meriens, F., 313
limits, 131 ff. Metric, 59
suprema, 42 space, 59
Minimum, relative, 195, 397
Jacobi, C. G. J., 354 Value Theorem, 154
Jacobian determinant, 354 Minkowski, H., 61
Jacobian Theorem, 442 Minkowski’s Inequality, 61, 411
Jump ofa function, 146 Models for R, 49
Monotone Convergence Theorem, for
Kronecker, L., 50 infinite integrals, 274
for integrals, 243
Lagrange, J. -L., 60 for sequences, 104
Lagrange identity, 60 Multiplication of power series, 323
multiplier, 402 ff. Multiplier, Lagrange, 402 ff.
Landau, E., 128
Laplace, P, -S., 283 Nearest Point Theorem, 78
Laplace transform, 283 ff. Neighborhood, 65
Least squares, 409 Nested Cells Property, in R, 47
upper bound (= supremum), 38 in RP, 69
Lebesgue, H., 77 Newton’s method, 200-201, 375, 396
Lebesgue Covering Theorem, 77 Nondifferentiable functions, 195
integral, 212 Norm, 54
number, 77 of a function, 119, 253, 254
Lebesgue’s Criterion for Integrability, 436 ofa matrix, 150
Leibniz, G. W., 353 of a partition, 223
Leibniz’s Alternating Series Test, 308 of a vector, 54
formula, 245 Norm Convergence of Fourier series, 337
L’Hospital, G. F., 203 Normed space, 54
Limit, deleted, 175 Null space, 387
of a double sequence, 130 Nullity, 387
ofa function, 175 Nusnbers, complex, 4, 86 ff.
inferior, 123 natural, 3
nondeleted, 175 rational, 4, 31
right-hand, 181 real, 27 ff.
of a sequence, 92
superior, 123, 178 0,0, 128
upper, 178 Open Mapping Theorem, 380
Linear function, 147 Open set, 62
functional, 247 Operation, binary, 28
transformation, 255 Order Properties of R, 32 ff.
Lipschitz, R., 161 Ordered pair, 8
Lipschitz condition, 161 Ordinate set, 433
Logarithm, 45, 146, 209, 238 Orthogonal vectors, 59
Lower bound, 37 Orthonormal set of functions, 344
integral, 225, 422 Oscillation of a function, 164
Outer content, 432
Machine, 14
Macilaurin, C., 299 Pair, ordered, 8
McShane, E. J., 408 Parallelepiped, 69
Mapping, 12 Parallelogram Identity, 58
Matrix, 148 Parametrization Theorem, 388
Maximum, relative, 195, 397 Parseval’s Equality, 338
Value Theorem, 154 Partial derivative, 348
Mean convergence, 253 integral, 259
Mean square convergence, 254 map, 360
INDEX 479
product, 305 sum, 213
sum, 287, 306, 310 Riesz, F., 248
Partition, 213,415 Riesz Representation Theorem, 248
Peano curve, 415 Rolle, M., 196
Periodic function, 164, 330 Rolle’s Theorem, 196
Perpendicular, 59 Root, multiplicity of, 207
Piecewise continuous function, 330 simple, 207
Point, accumulation, 69 Root Test, 295
boundary, 65, 422 Rosenberg, A., 50, 57
cluster, 69 Rota, G. C., 480
critical, 398
exterior, 65 Saddle point, 398
interior, 65 Schoenberg, I. J., 420
saddle, 398 Schwartz, J. T., 444
Pointwise Convergence of Fourier series, Schwarz, H. A., 56
Schwarz Inequality, 56
Polar coordinates, 449 Second Derivative Test, 399
curve, 453 Mean Value Theorem, 233
Pélya, G., 173 Semicontinuity, 180
Polygonal curve, 83 Sequence(s), 91 ff.
Polynomial, Bernstein, 169 of arithmetic means, 129
trigonometric, 332 bounded, 93
Positive class, 32 in a Cartesian space, 91
Power, of a real number, 31, 43-44 Cauchy, 108
series, 319 ff. convergent, 91
Preservation, of Compactness, 154 double, 130
of Connectedness, 153 difference of ,91, 100
Product, Cauchy, 313 divergent, 91, 127
dot, 54 equivalent, 128
of functions, 142 of functions, 113 ff., 165 ff.
infinite, 305 iterated, 131-132
of a real number and a vector, 53 limit of, 91
of sequences, 91 in a metric space, 103
Property, 3 monotone, 104
product of, 91, 100
Quotient, of functions, 143 quotient of 92, 100
of sequences, 92 sum of, 91, 100
unbounded, 126
Raabe, J. L., 298 Series, 286 ff.
Raabe’s Test, 298 absolutely convergent, 289
Radius of convergence, 320 alternating, 308
Range of a function, 12, 387 conditionally convergent, 289
Rank, 387 double, 310 ff,
Rank Theorem, 391 Fourier, 330 ff.
Ratio Test, 296 of functions, 315 ff.
Rational number, 31 geometric, 289
power of a real number, 44 harmonic, 290
Ray, 46 hypergeometric, 304
Real part, 86 p-series, 290
Rearrangement Theorem, 292 power, 319 ff.
Remainder in Taylor’s Theorem, 206, Tearrangements of, 291 ff.
Set(s), accumulation point of, 69
Cauchy’s form, 206 boundary point of, 65, 422
integral form, 243 bounded, 69
Lagrange’s form, 206 Cantor, 48
Restriction of a function, 15 Cartesian product of, 8
Riemann, B., 212 closed, 64
Riemann Criterion for Integrability, 228, closure of, 68, 423
420 cluster point of, 69
integral of a function on R, 214 compact, 73
integral of a function on R”, 415 complement of, 7
sum, 213,415 connected, 80
Riemann-Lebesgue Lemma, 334 content of, 423
Riemann-Stieltjes integral, 212 ff. convex, 59
480 INDEX
denumerable, 23 Surjective function, 18
disconnected, 80 Surjective Mapping Theorem, 378
disjoint, 4 Symmetric difference, 10
empty, 4
enumerable, 23 Tangent line, 358
equality of, 2 plane, 350, 358, 359, 394
exterior point of, 65 space, 358, 394
finite, 22 Tauber, A., 325
infinite, 22 Tauber’s Theorem, 325
interior of, 68, 423 Taylor, B., 205
interior point of, 65 Taylor’s Theorem, 205, 243, 371
intersection of, 4 Tests for convergence of series, 294 ff.
nonintersecting, 4 Tietze, H., 186
open, 62 Tietze’s Extension Theorem, 186
ordinate, 433 Topology, 63, 72
relative complement of, 7 Translation of a set, 80
symmetric difference of, 10 Transformation, 14
union of, 4 of integrals, 234, 443 ff.
void, 4 Triangle Inequality, 36, 56
Shuffled sequence, 112 Trichotomy Property, 32
Side condition, 402 Trigonometric functions, 209 ff., 238, 329
Simple root, 207 polynomial, 332
Sine series, 329
Space, inner product, 54 Uniform continuity, 159
metric, 59, 72 Uniform convergence, of Fourier series, 337
normed, 54 of an infinite integral, 267
topological, 72 ofa sequence of functions, 117
vector, 52 of a sequence of sequences, 133
Space-filling curve, 415 of a series of functions, 316
Sphere in a Cartesian space, 57 Uniform norm, 118 ff.
Spherical coordinates, 449 Union of sets, 4
Solid of revolution, 453 Uniqueness Theorem for power series, 322
Stieltjes, T. J., 212 Unit ball, cell, 47
Stirling, J., 239 content of, 454-455
Stirling’s formula, 239-240 interval, 47
Stone, M. H., 183 Upper bound, 37
Stone Approximation Theorem, 183 integral, 225, 422
Stone-Weierstrass Theorem, 184
Subsequence, 98 Value, of a function, 12
Subset, 2 Vector space, 52
Sum, Riemann, 213, 415
partial, 287 Wallis, J., 239
Riemann-Stieltjes, 213 Wallis product, 239
of two functions, 53, 142 Weierstrass, K., 70
of two sequences, 91 Weierstrass Approximation Theorem, 172,
of two vectors, 52 186, 340
Summability, Abel, 325 Weierstrass M-Test, for infinite integrals,
Cesdro, 129, 339 268
Supremum, 38 for series, 317
iterated, 42 Well-Ordering Property, 22
norm, 119
Property, 39 Zero, content, 413
Surjection, 18 measure, 421