linear algebra

EXERCISE SET 1.1

1. (b) Not linear because of the term x1x3.

(d) Not linear because of the term x–21 .

(e) Not linear because of the term x3/51 .

6. (b) Substituting the given expressions for x and y into the equation x = 5 + 2y yields

Since this equation is valid for all values of t, the proposed solution is, indeed, thegeneral one.

Alternatively, we let x = t and solve for y, obtaining

7. Since each of the three given points must satisfy the equation of the curve, we have thesystem of equations

ax21 + bx1 + c = y1

ax22 + bx2 + c = y2

ax23 + bx3 + c = y3

If we consider this to be a system of equations in the three unknowns a, b, and c, theaugmented matrix is clearly the one given in the exercise.

8. If the system is consistent, then we can, for instance, subtract the first two equations fromthe last. This yields c – a – b = 0 or c = a + b. Unless this equation holds, the system cannothave a solution, and hence cannot be consistent.

y t=1

2–

5

2

t t t t= 5 + 21

2

5

2= 5 + 5 =− −

1

9. The solutions of x1 + kx2 = c are x1 = c – kt, x2 = t where t is any real number. If thesesatisfy x1 + x2 = d, then c – kt + t = d, or ( – k)t = d – c for all real numbers t. Inparticular, if t = 0, then d = c, and if t = 1, then = k.

10. If x – y = 3, then 2x – 2y = 6. Therefore, the equations are consistent if and only if k = 6;that is, there are no solutions if k ≠ 6. If k = 6, then the equations represent the same line,in which case, there are infinitely many solutions. Since this covers all of the possibilities,there is never a unique solution.

12. (a) If the system of equations fails to have a solution, then there are three possibilities:The three lines intersect in (i) distinct points, (ii) 2 distinct points, or (iii) not at all.

(i) (ii) (iii)

(b) If the system of equations has exactly one solution, then all of the lines must passthrough a common point. Moreover, at least two of the lines must be distinct.

2 Exercise Set 1.1

(c) If the system has in infinitely many solutions, then the three lines coincide; that is, thethree equations represent the same line.

14. If k = = m = 0, then all lines must intersect at the origin.

3

EXERCISE SET 1.2

1. (e) Not in reduced row-echelon form because Property 2 is not satisfied.

(f) Not in reduced row-echelon form because Property 3 is not satisfied.

(g) Not in reduced row-echelon form because Property 4 is not satisfied.

2. (c) Not in row-echelon form because Property 1 is not satisfieded.

(d) In row-echelon form.

(f) Not in row-echelon form because Property 1 is not satisfied.

4. (b) Let x4 = t. Then we can read off the solution

x1 = 8 + 7t

x2 = 2 – 3t

x3 = –5 –t

x4 = t

(c) Let x5 = t. Then x4 = 8 – 5t and x3 = 7 – 4t. Let x2 = s. Then x1 = –2 –3t + 6s.

5. (a) The solution is

x3 = 5

x2 = 2 – 2 x3 = –8

x1 = 7 – 4 x3 + 3x2 = –37

5. (b) Let x4 = t. Then x3 = 2 – t. Therefore

x2 = 3 + 9t – 4x3 = 3 + 9t – 4(2 – t) = –5 + 13t

x1 = 6 + 5t – 8x3 = 6 + 5t – 8(2 – t) = –10 + 13t

6. (a) The augmented matrix is

Replace Row 2 by Row 1 plus Row 2 and replace Row 3 by Row 3 plus –3 times Row 1.

Multiply Row 2 by –1.

Note that we resist the temptation to multiply Row 3 by –1/2. While this is sensible, itis not part of the formal elimination procedure. Replace Row 3 by Row 3 plus 10 timesRow 2.

Multiply Row 3 by –1/52.

1 1 2 8

0 1 5 9

0 0 52 104

− −− −

1 1 2 8

0 1 5 9

0 10 2 14

− −− − −

1 1 2 8

0 1 5 9

0 10 2 14

−− − −

1 1 2 8

1 2 3 1

3 7 4 10

– −−

6 Exercise Set 1.2

Now multiply Row 3 by 5 and add the result to Row 2.

Next multiply Row 3 by –2 and add the result to Row 1.

Finally, multiply Row 2 by –1 and add to Row 1.

Thus the solution is x1 = 3, x2 = 1, x3 = 2.

(c) The augmented matrix is

Replace Row 2 by Row 2 plus –2 times Row 1, replace Row 3 by Row 3 plus Row 1,and replace Row 4 by Row 4 plus –3 times Row 1.

1 1 2 1 1

2 1 2 2 2

1 2 4 1 1

3 0 0 3 3

− − −− − −

− −− −

1 0 0 3

0 1 0 1

0 0 1 2

1 1 0 4

0 1 0 1

0 0 1 2

1 1 2 8

0 1 0 1

0 0 1 2

1 1 2 8

0 1 5 9

0 0 1 2

− −

Exercise Set 1.2 7

Multiply Row 2 by 1/3 and then add –1 times the new Row 2 to Row 3 and add –3times the new Row 2 to Row 4.

Finally, replace Row 1 by Row 1 plus Row 2.

Thus if we let z = s and w = t, then we have x = –1 + w = –1 + t and y = 2z = 2s. Thesolution is therefore

x = –1 + t

y = 2s

z = s

w = t

7. (a) In Problem 6(a), we reduced the augmented matrix to the following row-echelonmatrix:

1 1 2 8

0 1 5 9

0 0 1 2

− −

1 0 0 1 1

0 1 2 0 0

0 0 0 0 0

0 0 0 0 0

− −−

1 1 2 1 1

0 1 2 0 0

0 0 0 0 0

0 0 0 0 0

− − −−

1 1 2 1 1

0 3 6 0 0

0 1 2 0 0

0 3 6 0 0

− − −−−−

8 Exercise Set 1.2

By Row 3, x3 = 2. Thus by Row 2, x2 = 5x3 – 9 = 1. Finally, Row 1 implies that x1 = –x2 – 2 x3 + 8 = 3. Hence the solution is

x1 = 3

x2 = 1

x3 = 2

(c) According to the solution to Problem 6(c), one row-echelon form of the augmentedmatrix is

Row 2 implies that y = 2z. Thus if we let z = s, we have y = 2s. Row 1 implies that x= –1 + y – 2z + w. Thus if we let w = t, then x = –1 + 2s – 2s + t or x = –1 + t. Hencethe solution is

x = –1 + t

y = 2s

z = s

w = t

8. (a) The augmented matrix is

Divide Row 1 by 2.

1 3 2 1

2 1 1

3 2 1

− −

2 3 2

2 1 1

3 2 1

− −

1 1 2 1 1

0 1 2 0 0

0 0 0 0 0

0 0 0 0 0

− − −−

Exercise Set 1.2 9

Replace Row 2 by Row 2 plus –2 times Row 1 and replace Row 3 by Row 3 plus –3times Row 1.

Divide Row 2 by 4. Then multiply the new Row 2 by –13/2 and add the result to Row3.

Thus, if we multiply Row 3 by –8/7, we obtain the following row-echelon matrix:

Finally, if we complete the reduction to the reduced row-echelon form, we obtain

The system of equations represented by this matrix is inconsistent because Row 3implies that 0x1 + 0x2 = 0 = 1.

8. (c) The augmented matrix is

Divide Row 1 by 4.

4 8 12

3 6 9

2 4 6

−−

− −

1 0 0

0 1 0

0 0 1

1 3 2 1

0 1 3 4

0 0 1

− −

1 3 2 1

0 1 3 4

0 0 7 8

− −

−

1 3 2 1

0 4 3

0 13 2 4

− −

/

/

10 Exercise Set 1.2

Replace Row 2 by Row 2 plus –3 times Row 1 and replace Row 3 by Row 3 plus 2times Row 1.

If we let x2 = t, then x1 = 3 + 2x2 = 3 + 2t. Hence, the solution is

x1 = 3 + 2t

x2 = t

9. (a) In Problem 8(a), we reduced the augmented matrix of this system to row-echelonform, obtaining the matrix

Row 3 again yields the equation 0 = 1 and hence the system is inconsistent.

(c) In Problem 8(c), we found that one row-echelon form of the augmented matrix is

Again if we let x2 = t, then x1 = 3 + 2x2 = 3 + 2t.

1 2 3

0 0 0

0 0 0

−

1 3 2 1

0 1 3 4

0 0 1

− −

/

/

1 2 3

0 0 0

0 0 0

−

1 2 3

3 6 9

2 4 6

−−

− −

Exercise Set 1.2 11

10. (a) The augmented matrix of the system is

Divide Row 1 by 5.

Replace Row 2 by Row 2 plus 2 times Row 1.

Multiply Row 2 by 5.

Replace Row 1 by Row 1 plus 2/5 times Row 2.

Thus, if we let x3 = t, then x2 = 5 – 27x3 = 5 – 27t and x1 = 2 – 12x3 = 2 – 12t.Therefore, the solution is

x1 = 2 – 12t

x2 = 5 – 27t

x3 = t

1 0 12 2

0 1 27 5

1 2 5 6 5 0

0 1 27 5

−

/ /

1 2 5 6 5 0

0 1 5 27 5 1

−

1 2 5 6 5 0

2 1 3 1

−−

5 2 6 0

2 1 3 1

−−

12 Exercise Set 1.2

(c) The augmented matrix of the system is

Interchange Rows 1 and 4 and multiply the new Row 1 by 1/2.

Interchange Rows 2 and 4. Then replace Row 3 by Row 3 plus –1 times the new Row 2.

Replace Row 4 by Row 4 plus –1 times Row 3.

Replace Row 1 by Row 1 plus –1/2 times Row 2.

1 2 0 5 2 1 2 3 2

0 0 1 2 1 4

0 0 0 1 1 3

0 0 0 0 0 0

−−

1 2 1 2 7 2 0 7 2

0 0 1 2 1 4

0 0 0 1 1 3

0 0 0 0 0 0

−−

1 2 1 2 7 2 0 7 2

0 0 1 2 1 4

0 0 0 1 1 3

0 0 0 1 1 3

−−−

1 2 1 2 7 2 0 7 2

0 0 0 1 1 3

0 0 1 3 2 7

0 0 1 2 1 4

−−−

0 0 1 2 1 4

0 0 0 1 1 3

0 0 1 3 2 7

2 4 1 7 0 7

−−−

Exercise Set 1.2 13

Replace Row 2 by Row 2 plus –2 times Row 3 and replace Row 1 by Row 1 plus –5/2times Row 3.

Thus, if we let y = t and v = s, we have

u = –6 – 2s – 3t

v = s

w = –2 – t

x = 3 + t

y = t

11. (a) From Problem 10(a), a row-echelon form of the augmented matrix is

If we let x3 = t, then Row 2 implies that x2 = 5 – 27t. Row 1 then implies that x1 =(–6/5)x3 + (2/5)x2 = 2 – 12t. Hence the solution is

x1 = 2 – 12t

x2 = 5 – 27t

x3 = t

(c) From Problem 10(c), a row-echelon form of the augmented matrix is

1 2 1 2 7 2 0 7 2

0 0 1 2 1 4

0 0 0 1 1 3

0 0 0 0 0 0

−−

1 2 5 6 5 0

0 1 27 5

−

1 2 0 0 3 6

0 0 1 0 1 2

0 0 0 1 1 3

0 0 0 0 0 0

−−

−

14 Exercise Set 1.2

If we let y = t, then Row 3 implies that x = 3 + t. Row 2 then implies that

w = 4 – 2x + t = –2 – t.

Now let v = s. By Row 1, u = 7/2 – 2s – (1/2)w – (7/2)x = –6 – 2s – 3t. Thus we havethe same solution which we obtained in Problem 10(c).

12. (a) There are more unknowns than equations. By Theorem 1.2.1, the system has infinitelymany solutions.

(d) If we let x2 = t, then x1 = 2–3

t. Thus the system has infinitely many solutions.

13. (b) The augmented matrix of the homogenous system is

This matrix may be reduced to

If we let x3 = 4s and x4 = t, then Row 2 implies that

4x2 = –4t – 4s or x2 = –t – s

Now Row 1 implies that

3x1 = –x2 – 4s – t = t + s – 4s – t = –3s or x1 = –s

Therefore the solution is

x1 = –s

x2 = –(t + s)

x3 = 4s

x4 = t

3 1 1 1 0

0 4 1 4 0

3 1 1 1 0

5 1 1 1 0− −

Exercise Set 1.2 15

14. (b) The augmented matrix of the homogenous system is

This matrix can be reduced to

If we let w = 2s and x = 2t, then the above matrix yields the solution

u = 7s – 5t

v = –6s + 4t

w = 2s

x = 2t

15. (a) The augmented matrix of this system is

2 1 3 4 9

1 0 2 7 11

3 3 1 5 8

2 1 4 4 10

−−

−

1 07

2

5

20

0 1 3 2 0

0 0 0 0 0

0 0 0 0 0

−

−

0 1 3 2 0

2 1 4 3 0

2 3 2 1 0

4 3 5 4 0

−−

−− − −

16 Exercise Set 1.2

Its reduced row-echelon form is

Hence the solution is

I1 = –1

I2 = 0

I3 = 1

I4 = 2

(b) The reduced row-echelon form of the augmented matrix is

If we let z2 = s and z5 = t, then we obtain the solution

Z1 = –s – t

Z2 = s

Z3 = –t

Z4 = 0

Z5 = t

16. (a) The augmented matrix of the system is

2 1

3 6

a

b

1 1 0 0 1 0

0 0 1 0 1 0

0 0 0 1 0 0

0 0 0 0 0 0

1 0 0 0 1

0 1 0 0 0

0 0 1 0 1

0 0 0 1 2

−

Exercise Set 1.2 17

Divide Row 1 by 2 and Row 2 by 3.

Subtract Row 1 from Row 2.

Multiply Row 2 by 2/3.

Thus

Row 1 then implies that

or

xa a b a

= −−

+

= −2

1

2 3

2

9

2

3

6

9

xa y

= −2 2

ya b

=−

+3

2

9

11

2 2

0 13

2

9

a

a b− +

11

2 2

03

2 2

2

3

a

a b− +

11

2 2

1 23

a

b

18 Exercise Set 1.2

17. The Gauss-Jordan process will reduce this system to the equations

x + 2y – 3z = 4

y – 2z = 10/7

(a2 – 16)z = a – 4

If a = 4, then the last equation becomes 0 = 0, and hence there will be infinitely manysolutions—for instance, z = t, y = 2 t + 10—

7 , x = –2 (2t + 10—

7 ) + 3t + 4. If a = – 4, then the last

equation becomes 0 = –8, and so the system will have no solutions. Any other value of a willyield a unique solution for z and hence also for y and x.

19. One possibility is

Another possibility is

20. Let x1 = sin α, x2 = cos β, and x3 = tan γ. If we solve the given system of equations for x1,x2, and x3, we find that x1 = 1, x2 = –1, and x3 = 0. Thus α = π/2, β = π, and γ = 0.

21. If we treat the given system as linear in the variables sin α, cos β, and tan γ, then theaugmented matrix is

This reduces to

1 2 3 0

2 5 3 0

1 5 5 0− −

1 3

2 7

2 7

1 3

1 7 2

1 3

1 7 2

→

→

→

11 1

1 3

2 7

1 3

0 1

→

RREF I= =

3

1 0 0

0 1 0

0 0 1

Exercise Set 1.2 19

so that the solution (for α, β, γ between 0 and 2 π) is

sin α = 0 ⇒ α = 0, π, 2π

cos β = 0 ⇒ β = π/2, 3π/2

tan γ = 0 ⇒ γ = 0, π, 2π

That is, there are 3•2•3 = 18 possible triples α, β, γ which satisfy the system of equations.

22. If (x, y) is a solution, then the first equation implies that y = –(λ – 3)x. If we thensubstitute this value for y into the second equation, we obtain [1 – (λ – 3)2]x = 0. But [1 –(λ – 3)2]x = 0 implies that either x = 0 or 1 – (λ – 3)2 = 0. Now if x = 0, then either of thegiven equations implies that y = 0; that is, if x = 0, then the system of equations has onlythe trivial solution. On the other hand, if 1 – (λ – 3)2 = 0, then either λ = 2 or λ = 4. Foreither of these values of λ, the two given equations are identical—that is, they representexactly the same line—and, hence, there are infinitely many solutions.

23. If λ = 2, the system becomes

– x2 = 0

2x1 – 3x2 + x3 = 0

–2x1 + 2x2 – x3 = 0

Thus x2 = 0 and the third equation becomes –1 times the second. If we let x1 = t, then x3= –2t.

24. We regard this as a system of linear equations in the variables 1/x, 1/y, and 1/z. Theaugmented matrix for this system of equations is therefore

1 2 4 1

2 3 8 0

1 9 10 5

−

−

1 0 0 0

0 1 0 0

0 0 1 0

20 Exercise Set 1.2

which reduces to

Hence, the solution is x = –13/7, y = 91/54, and z = –91/8.

25. Using the given points, we obtain the equations

d = 10

a + b + c + d = 7

27a + 9b + 3c + d = –11

64a + 16b + 4c + d = –14

If we solve this system, we find that a = 1, b = –6, c = 2, and d = 10.

26. Using the given points, we obtain the equations

41a – 4b + 5c + d = 0

53a – 2b + 7c + d = 0

25a + 4b – 3c + d = 0

This homogeneous system with more unknowns than equations will have infinitely manysolutions. Since we know that a ≠ 0 (otherwise we’d have a line and not a circle), we canexpect to solve for the other variables in terms of a. Since the augmented matrix reducesto

we have d = –29, c = –4a, and b = –2a. If we let a = 1, the equation of the circle becomesx2 + y2 – 2x – 4y – 29 = 0. Any other choice of a will yield a multiple of this equation.

29 0 0 1 0

4 0 1 0 0

2 1 0 0 0

1 0 0 7 13

0 1 0 54 91

0 0 1 8 91

−

−

Exercise Set 1.2 21

27. (a) If a = 0, then the reduction can be accomplished as follows:

If a = 0, then b ≠ 0 and c ≠ 0, so the reduction can be carried out as follows:

Where did you use the fact that ad – bc ≠ 0? (This proof uses it twice.)

29. There are eight possibilities. They are

(a)1 0 0

0 1 0

0 0 1

1 0

0 1

0 0 0

, ,

p

q

11 0

0 0 1

0 0 0

0 1 0

0 0 1

0 0 0

1

p

,

,

pp q p

0 0 0

0 0 0

0 1

0 0 0

0 0 0

0 0

, ,

11

0 0 0

0 0 0

∈, , ,where R any real np q uumbers.

and

0 0 0

0 0 0

0 0 0

0

0

1

0

1

0 1

b

c d

c d

b

d

c

b

d

c

→

→

→

→

1 0

0 1

a b

c d

b

a

c d

b

a

ad bc

a

→

→−

11

0

→

→

1

0 1

1 0

0 1

b

a

22 Exercise Set 1.2

(b)

30. (a) If the system of equations has only the trivial solution, then the three lines all passthrough (0,0) and at least two of them are distinct.

(b) Nontrivial solutions can exist only if the three lines coincide; that is, the threeequations represent the same line and that line passes through (0,0).

31. (a) False. The reduced row-echelon form of a matrix is unique, as stated in the remark inthis section.

(b) True. The row-echelon form of a matrix is not unique, as shown in the followingexample:

but

1 2

1 3

1 3

1 2

1 3

0 1

1 3

0 1

→

→

−

→

1 2

1 3

1 2

0 1

→

1 0 0 0

0 1 0 0

0 0 1 0

0 0 0 1

1 0 0

0 1 0

0 0 1

0

,

p

q

r

00 0 0

1 0 0

0 1 0

0 0 0 1

0 0 0 0

,

p

q

,

1 0 0

0 0 1 0

0 0 0 1

0 0 0 0

0 1 0 0

0 0 1 0

p

,00 0 0 1

0 0 0 0

1 0

0 1

0 0 0 0

0 0 0 0

,

p q

r s

,

,

1 0

0 0 1

0 0 0 0

0 0 0 0

1p q

r

p q 00

0 0 0 1

0 0 0 0

0 0 0 0

0 1 0

0 0 1

0 0 0 0

0 0 0

,

p

q

00

0 1 0

0 0 0 1

0 0 0 0

0 0 0 0

,

p

,, ,

0 0 1 0

0 0 0 1

0 0 0 0

0 0 0 0

1

0 0 0 0

0 0

p q r

00 0

0 0 0 0

0 1

0 0 0 0

0 0 0 0

0 0 0 0

,

p q

, ,

0 0 1

0 0 0 0

0 0 0 0

0 0 0 0

0 0 0 1

0

p

00 0 0

0 0 0 0

0 0 0 0

0 0 0 0

0 0 0 0

0 0 0 0

0 0 0 0

and

Exercise Set 1.2 23

(c) False. If the reduced row-echelon form of the augmented matrix for a system of 3equations in 2 unknowns is

then the system has a unique solution. If the augmented matrix of a system of 3equations in 3 unknowns reduces to

then the system has no solutions.

(d) False. The system can have a solution only if the 3 lines meet in at least one pointwhich is common to all 3.

32. (a) False. Let 2 of the equations be x1 + x2 + x3 + x4 + x5 = 0 and x1 + x2 + x3 + x4 + x5= 1 and the system cannot be consistent.

(b) False. Let all 5 equations be the same.

(c) False. The nth row might be n zeros followed by a 1. In that case, the system wouldhave no solution.

(d) False. If one equation is a multiple of another, those 2 equations represent the samesolution set. Thus, 1 of the 2 equations can be disregarded and we will have a systemof n – 1 equations in n unknowns. Such a system need not be inconsistent.

1 1 1 0

0 0 0 1

0 0 0 0

1 0

0 1

0 0 0

a

b

24 Exercise Set 1.2

EXERCISE SET 1.3

1. (c) The matrix AE is 4 × 4. Since B is 4 × 5, AE + B is not defined.

(e) The matrix A + B is 4 × 5. Since E is 5 × 4, E (A + B)is 5 × 5.

(h) Since AT is 5 × 4 and E is 5 × 4, their sum is also 5 × 4. Thus (AT + E)D is 5 × 2.

2. Since two matrices are equal if and only if their corresponding entries are equal, we havethe system of equations

a – b = 8

b + c = 1

c + 3d = 7

2a –4d = 6

The augmented matrix of this system may be reduced to

Hence, a = 5, b = –3, c = 4, and d = 1.

3. (e) Since 2B is a 2 × 2 matrix and C is a 2 × 3 matrix, 2B – C is not defined.

1 0 0 0 5

0 1 0 0 3

0 0 1 0 4

0 0 0 1 1

−

25

3. (g) We have

(j) We have tr(D – 3E) = (1 – 3(6)) + (0 – 3(1)) + (4 – 3(3)) = –25.

4. (b) We have

(d) Since BT is a 2 × 2 matrix and 5CT is a 3 × 2 matrix, the indicated addition isimpossible.

(h) We have

5. (b) Since B is a 2 × 2 matrix and A is a 3 × 2 matrix, BA is not defined (although AB is).

(d) We have

AB =−

−

12 3

4 5

4 1

=−

− −−

=9 1 1

13 2 4

0 1 6

9 −−

− − −

13 0

1 2 1

1 4 6

( – )2 3

12 2 8

2 2 2

6 4 6

3 3

E DT T T =

−

−− 99

15 0 6

6 3 12

T

D ET T– =

−

−−1 1 3

5 0 2

2 1 4

6 1 4

1 1 1

3 2 33

5 0 1

4 1 1

1 1 1

=− −

−− −

= –3

13 7 8

3 2 5

11 4 10

39 21 24

9 6−

=− − −

– −−− − −

15

33 12 30

– ( ) –3 2 3

1 5 2

1 0 1

3 2 4

D E+ = −

+112 2 6

2 2 4

8 2 6

−

26 Exercise Set 1.3

Hence

(e) We have

(f) We have

(j) We have tr(4ET – D) = tr(4E – D) = (4(6) – 1) + (4(1) – 0) + (4(3) – 4) = 35.

6. (a) We have

( – )2

2 2 6

10 0 4

4 2 8

6 1 3

1 1 2D E AT =

−

− −44 1 3

3 0

1 2

1 1

−

=

−− −−

−

4 3 3

11 1 2

0 1 5

3 0

1 2

1 1

− −

=

6 3

36 0

4 7

CCT =

1 4 2

3 1 5

1 3

4 1

2 5

21 17

=117 35

A BC( ) = −

=

3 0

1 2

1 1

1 15 3

6 2 10

33 45 9

11 11 17

7 17 13

−

( )AB C = −

3 45 9

11 11 17

7 17 13

Exercise Set 1.3 27

6. (b) Since (4B)C is a 2 × 3 matrix and 2B is a 2 × 2 matrix, then (4B)C + 2B is undefined.

(d)

Thus,

(f) Each of the matrices DTET and (ED)T is equal to

Thus their difference is the 3 × 3 matrix with all zero entries.

7. (a) The first row of A is

A1 = [3 -2 7]

Thus, the first row of AB is

A B1 3 2 7 6 2 4

0

= [67 41 41]

= −[ – ]

11 3

7 7 5

14 4 12

36 1 26

25 7 21

−

(BA CT T– )2

10 6

14 2

1 8

=−

−− −

BA CT =– 2

4 1

0 2

3 1 1

0 2 1

−

−

− 2

1 4 2

3 1 5

12 6 3

0 4 2

2 8 4

6 2 10

=−

−

=− −

− −

10 14 1

6 2 8

28 Exercise Set 1.3

(c) The second column of B is

Thus, the second column of AB is

(e) The third row of A is

A3 = [0 4 9]

Thus, the third row of AA is

8.

BA =−

6 6 70

6 17 31

63 41 122

6

6

63

=

+−

3

6

0

7

6

2

1

7

+

−

= −

0

4

3

5

6

17

41

2

6

0

7

+−

+

6

2

1

7

4

4

3

5

=

+70

31

122

7

6

0

7

4

−−

+

2

1

7

9

4

3

5

A A3 0 4 9 3 2 7

6 5 4

0 4 9

= −

[ ]

== [ ]24 56 97

AB2

3 2 7

6 5 4

0 4 9

2

1

7

=−

−

=41

21

67

B2

2

1

7

=−

Exercise Set 1.3 29

9. (a) The product yA is the matrix

[y1a11 + y2a21 + … + ym

am

] y1a12 + y2a22 + … + ym

am2

…

y1a1n+ y2a2n

+ … + ym

amn

]

We can rewrite this matrix in the form

y1 [a11 a12… a1n

] + y2 [a21 a22… a2n

] + … + ym

[am1 am2 …

amn]

which is, indeed, a linear combination of the row matrices of A with the scalarcoefficients of y.

(b) Let y = [y1, y2, …, ym

]

Taking transposes of both sides, we have

(yA)T = ATyT = (A1 | A2 | … | Am

)

= (y1A1 | y2A2 | … | ymAm=

y A

y A

y Am m

1 1

2 2

T

y1

ym

by 9a, =yA y A

y A

y Am m

1 1

2 2

and = be the m rowsA A

A

Am

1

2

oof .A

30 Exercise Set 1.3

10. (a) By the result of Exercise 9, the rows of AB are the matrices

3[6 –2 4] – 2[0 1 3] + 7[7 7 5]

6[6 –2 4] + 5[0 1 3] + 4[7 7 5]

0[6 –2 4] + 4[0 1 3] + 9[7 7 5]

11. Let fij

denote the entry in the ith row and jth column of C(DE). We are asked to find f23. Inorder to compute f23, we must calculate the elements in the second row of C and the thirdcolumn of DE. According to Equation (3), we can find the elements in the third column ofDE by computing DE3 where E3 is the third column of E. That is,

12. (a) Suppose that A is m × n and B is r × s. If AB is defined, then n = r. That is, B is n ×s. On the other hand, if BA is defined, then s = m. That is, B is n × m. Hence AB is m× m and BA is n × n, so that both are square matrices.

(b) Suppose that A is m × n and B is r × s. If BA is defined, then s = m and BA is r × n.If A(BA) is defined, then n = r. Thus B is an n × m matrix.

f23 3 1 5

1 5 2

1 0 1

3 2 4

3

= −

[ ] 22

3

19

0

25

= [3 1 5]

= 182

Exercise Set 1.3 31

15. (a) By block multiplication,

16. (b) Call the matrices A and B. Then

17. (a) The partitioning of A and B makes them each effectively 2 × 2 matrices, so blockmultiplication might be possible. However, if

then the products A11B11, A12B21, A11B12, A12B22, A21B11, A22B21, A21B12, and A22B22 areall undefined. If even one of these is undefined, block multiplication is impossible.

AA A

A AB

B B

B=

=11 12

21 22

11 12

21and

BB22

=

− − −

4 7 19 43

2 2 18 17

0 5 25 35

2 3 23 24

AB =

−

−

−2 5

1 3

0 5

2 1 3

0 1 5

2 5

1 3

0 55

4

7

1 42 1 3

0 1 51

−

−

44

4

7

−

AB =

−−

−

+

1 2

0 3

2 1

3 5

1 5

4 2

7 1

0 3

1 2

0 3

4

2

1 5

4 2

−

−−

+

−

−

+

5

3

1 52 1

3 566 1

7 1

0 31 5

4

26 1

−

+

5

3

8 9

9 15

−

=

−−

+

7 14

28 2

0

6

10

14

13 26

−

+

−

− + − +

=

42 3 14 27

− −−

1 23 10

37 13 8

29 23 41

32 Exercise Set 1.3

18. (a) The entry in the ith row and jth column of AB is

ai1b1j

+ ai2b2j

+ … + ain

bnj

Thus, if the ith row of A consists entirely of zeros, i.e., if aik

= 0 for i fixed and for k =1, 2, …, n, then the entry in the ith row and the jth column of AB is zero for fixed i andfor j = 1, 2, …, n. That is, if the ith row of A consists entirely of zeros, then so does theith row of AB.

20. The element in the ith row and jth column of I is

The element in the ith row and jth column of AI is

ai1δ1j

+ ai2δ2j

+ … + ain

δnj

But the above sum equals aij

because δkj

= 0 if k ≠ j and δjj

= 1. This proves that AI = A.

The proof that IA = A is similar.

21. (b) If i > j, then the entry aij

has row number larger than column number; that is, it liesbelow the matrix diagonal. Thus [a

ij] has all zero elements below the diagonal.

(d) If |i – j| > 1, then either i – j > 1 or i – j < –1; that is, either i > j + 1 or j > i + 1. Thefirst of these inequalities says that the entry a

ijlies below the diagonal and also below

the “subdiagonal“ consisting of all entries immediately below the diagonal ones. Thesecond inequality says that the entry a

ijlies above the diagonal and also above the

entries immediately above the diagonal ones. Thus we have

[a

a a

a a a

a a a

ij ] =

11 12

21 22 23

32 33 34

0 0 0 0

0 0 0

0 0 0

00 0 0

0 0 0

0 0 0 0

43 44 45

54 55 56

65 66

a a a

a a a

a a

δij

i j

i j=

0

1

if

if

≠=

Exercise Set 1.3 33

22. (b) For example, a11 = 11–1 = 10 = 1, a13 = 13–1 = 12 = 1, a23 = 23–1 = 22 = 4, and a32 = 32–1

= 31 = 3. Thus the matrix is

23.x f (x)

x

x

x

f (x) =

f (x)

f (x)

) =2

1

f x

) =2

0

f x

) =7

4

f x

) =0

-2

f x

fx

x

x x

x

1

2

1

2

= 2

+

x =

1

1

x =

2

0

x =

4

3

x =−

2

2

1 1 1 1

1 2 4 8

1 3 9 27

1 4 16 64

34 Exercise Set 1.3

(a)

(b)

(c)

(d)

24. (a)

27. The only solution to this system of equations is, by inspection,

29. (a) Let B = Then B2 = A implies that

(*)a2 + bc = 2 ab + bd = 2

ac + cd = 2 bc + d2 = 2

One might note that a = b = c = d = 1 and a = b = c = d = –1 satisfy (*). Solving thefirst and last of the above equations simultaneously yields a2 = d2. Thus a = ±d. Solvingthe remaining 2 equations yields c(a + d) = b(a + d) = 2. Therefore a ≠ –d and a andd cannot both be zero. Hence we have a = d ≠ 0, so that ac = ab = 1, or b = c = 1/a.The first equation in (*) then becomes a2 + 1/a2 = 2 or a4 – 2a2 + 1 = 0. Thus a = ±1.That is,

and

are the only square roots of A.

− −− −

1 1

1 1

1 1

1 1

a b

c d

A = −

1 1 0

1 1 0

0 0 0

f w + z( )α β

α

=

+f

w

w

wm

1

2 β

α

z

z

z

f

w

m

1

2

=

11 1

2 2

++

+

βα β

α β

z

w z

w zm m

=

+

α βf

w

w

f

z

zm m

1 1

= ( ) ( )α βf w + f z

Exercise Set 1.3 35

(b) Using the reasoning and the notation of Part (a), show that either a = –d or b = c = 0.If a = –d, then a2 + bc = 5 and bc + a2 = 9. This is impossible, so we have b = c = 0.This implies that a2 = 5 and d2 = 9. Thus

are the 4 square roots of A.

Note that if A were , say, then B = would be a square root of A for

every nonzero real number r and there would be infinitely many other square roots as well.

(c) By an argument similar to the above, show that if, for instance,

A = and B =

where BB = A, then either a = –d or b = c = 0. Each of these alternatives leads to acontradiction. Why?

31. (a) True. If A is an m × n matrix, then At is n × m. Thus AAT is m × m and AT A is n × n.Since the trace is defined for every square matrix, the result follows.

(b) True. Partition A into its row matrices, so that

A = and AT =

Then

AA

r r r r r r

r r r r r r

r

T

T TmT

T TmT

m

=

1 1 1 2 1

2 1 2 2 2

rr r r r rT

mT

m mT

1 2

r r rT T

mT

1 2

r

r

rm

1

2

a b

c d

−

1 0

0 1

1

4 1

r

r −

5 0

0 5

5 0

0 3

5 0

0 3

5 0

0 3

−

−

−−

5 0

0 3

36 Exercise Set 1.3

Since each of the rows ri

is a 1 × n matrix, each rTinferior/superior alignedi

is an n× 1 matrix, and therefore each matrix r

irTinferior/superior aligned

jis a 1 × 1 matrix.

Hence

tr(AAT) = r1 rT1 + r2 rT

2 + … + rm

rTm

Note that since ri

rTinferior/superior alignedi

is just the sum of the squares of theentries in the ith row of A, r1 rTinferior/superior aligned1 + r2 rTinferior/superioraligned2 + … + r

mrTinferior/superior aligned

mis the sum of the squares of all of the

entries of A.

A similar argument works for ATA, and since the sum of the squares of the entries of AT

is the same as the sum of the squares of the entries of A, the result follows.

31. (c) False. For instance, let A = and B = .

(d) True. Every entry in the first row of AB is the matrix product of the first row of A witha column of B. If the first row of A has all zeros, then this product is zero.

32. (a) True. If the rows of A are r1, … , rn

and the columns are c1, … , cn, then the ith row

of AA is given by ric1

… ric

n. Thus if r

i= r

jfor some i ≠ j, then the ith row of AA will

equal its jth row.

(b) False. For instance if

A = then A2 =

(c) True. If every entry of B is a positive even integer, we can write B = [bij] = [2c

ij] where

each entry cij

is a positive integer. If A = [aij] where each a

ijis also a positive integer,

then each entry of AB and of BA is the sum of numbers of the form 2cijakl. Thus each

will be an even positive integer.

(d) True. If A is m × n and B is r × s, then for AB to be defined we must have n = r andfor BA to be defined, we must have s = m. Thus AB is m × m and BA is n × n. But forAB + BA to be defined, we must have m = n. Thus A and B must both be square m ×m matrices.

1 −

1 0

0 0 0

0 0 0

1 1 0

0 0

0 0 0

−

0

1 1

1 1

0 1

0 1

Exercise Set 1.3 37

EXERCISE SET 1.4

1. (a) We have

Hence,

On the other hand,

B C+ =− −

−

8 5 2

1 8 6

7 2 15

=−

−

10 6 1

1 12 11

5 1 19

( )A B C+ + =− −

−

+10 4

0 5

2 6

2

7

10

0 −−

2 3

1 7 4

3 5 9

A + B =

10 4 2

0 5 7

2 6 10

− −

−

39

Hence,

1. (c) Since a + b = –3, we have

Also

2. (a) We have

On the other hand

( )aB C =− −

−

32 12 20

0 4 8

16 28 24

−

=− − −0 2 3

1 7 4

3 5 9

72 248 132

288 68 88

44 108 152−

=− − −

−

( )4

18 62 33

7 17 22

11 27 38

=− − −

−

72 248 132

28 68 88

44 108 1522

a BC( ) ( )=− −

−

−4

8 3 5

0 1 2

4 7 6

0 2 3

1 7 4

3 55 9

aC bC+ =−

+0 8 12

4 28 16

12 20 36

0 14 −−− − −− − −

=−

− −21

7 49 28

21 35 63

0 6 9

3 21 −−− − −

12

9 15 27

( ) ( )a b C+ = −−

3

2 3

7 4

5 9

0

1

3

==−

− − −− − −

0 6 9

3 21 12

9 15 27

A B C+ + =−

−

+( )

2 1

0 4

2 1

3

5

4

88 5 2

1 8 6

7 2 15

10 6 1

1 12 1

− −

−

=−

11

5 1 19−

40 Exercise Set 1.4

Finally,

(c) Since

and

the two matrices are equal.

3. (b) Since

and


A BT T+ =

−−

−

+2 0 2

1 4 1

3 5 4

88 0 4

3 1 7

5 2 6

10 0 2

4 5 6− −−

= − −−−

2 7 10

( )A BT

T

+ =− −

−

=10 4 2

0 5 7

2 6 10

10 0 2

−− −−

4 5 6

2 7 10

BA CA

–

–

–

+ =− −

+26 25– 11

4 6 13

4 26 1

66 2

6 31 54

12 26 70

10 37

– – –5 20 30 9

–

–

–

= 667

16 0 71–

( )

–

B C A+ =−

8 5

2

2– 2

1 8 6

7 15

–– – –1

0

20 30 93

4 5

2 1 4

1

–

–

= 00 37

16 0–

67

71

B aC( ) =−

8 3 5

7

0 8– –

0 1 2

4 6

12

4 288 16

12 20 36

28 6

=– – –72 248 132

88 88

44 108 152−

Exercise Set 1.4 41

3. (d) Since

and


4. For A, we have ad – bc = 1. Hence,

For B, we have ad – bc = 20. Hence,

For C, we have ad – bc = 2. Hence,

5. (b)

( )BT

T

− =−

=1 1

20

4 3

4 2

1

20

4 3

4 2

1

20

4 4

3 2−

=

−

T

( )BT −

−

=−

=

−

11

2 4

3 4

1

20

4 4

3 2

C− =

− −

1 1

2

1 4

2 6

B− =

−

1 1

20

4 3

4 2

A− =

−−

1 2 1

5 3

B AT T = − −

−

−8 0 4

3 1 7

5 2 6

2 0 2

−−

= −1 4 1

3 5 4

28 20 0

228 31 21

6 38 36

− −

( )ABT

T28 28 6

20 31 38

0 21 36

−−−

=28 20 0

28 31 21

6 38 36

− − −

42 Exercise Set 1.4

6. (a)

7. (b)

Thus,

7. (d)

8. To compute A–3, we can either compute A3 and then take its inverse or else we can find A–1

and then find (A–1)3. We choose the first alternative. Thus

A3 2 0

4 1

2 0

4 1

2 0

4 1=

=

=

4 0

12 1

2 0

4 1

8 0

28 1

2

5

13

2

134

13

1

13

1 0

0 1A =

−

−

=

−

−

,

18

13

2

134

13

12

13

so that A .=−

−

9

13

1

132

13

6

13

If then( ) ,I A I+ =−

+−2

1 2

4 51 22

1 2

4 5

5

13

2

134

13

1

13

1

A =−

=

−

−

. Hence

A =

2 7 1

1 7 3 7

7 73 7

1 2

2 7

1 31 1

1

A A(( ) )= =−

−

=

− −−

We are given that Th( ) .73 7

1 21

A− =

−−

eerefore

B A− − =

−

−−

1 1 1

20

4 3

4 2

2 1

5 3 =−

−

1

20

7 5

18 10

( )AB−

−

=−−

=

−−

11

10 5

18 7

1

20

7 5

18 10

Exercise Set 1.4 43

so that

9. (b) We have

10. (b) Let A be any square matrix. Then

p2(A)p3(A) = (A + 3)(A – 3) = A2 + 3A – 3A – 9I = A2 – 9 = p1(A)

11. Call the matrix A. By Theorem 1.4.5,

since cos2 θ + sin2 θ = 1.

=−

cos sin

sin cos

θ θθ θ

A− =

+

−

12 2

1

cos sin

cos sin

sin cosθ θθ θθ θ

=

20 7

14 6

=

−

+

22 8

16 6

3 1

2 1

1 0

0 1

=

−

+

2

11 4

8 3

3 1

2 1

1 0

0 1

p A( ) =

−

+2

3 1

2 1

3 1

2 11

1 0

0 1

2

A− =

−

3

1

80

7

21

44 Exercise Set 1.4

12. Since the determinant is

12(ex + e–x)1

2(ex + e–x) – 12(ex – e–x)12(ex – e–x)

= 14(e2x + e–2x + 2) – 14(e2x + e–2x – 2)

= 1 ≠ 0

Theorem 1.4.5 applies.

13. If a11a22… a

nn≠ 0, then a

ii≠ 0, and hence 1/a

iiis defined for i = 1,2, . . ., n. It is now easy

to verify that

14. We are given that 3A – A2 = I. This implies that A(3I – A) = (3I – A)A = I; hence A–1 = 3I – A.

15. Let A denote a matrix which has an entire row or an entire column of zeros. Then if B is anymatrix, either AB has an entire row of zeros or BA has an entire column of zeros,respectively. (See Exercise 18, Section 1. 3.) Hence, neither AB nor BA can be the identitymatrix; therefore, A cannot have an inverse.

16. Not necessarily; for example, both I and –I are invertible, but their sum is not.

17. Suppose that AB = 0 and A is invertible. Then A–1(AB) = A–10 or IB = 0. Hence, B = 0.

A

a

a

ann

− =

1

11

22

1 0 0

0 1 0

0 0 1

Exercise Set 1.4 45

18. By Exercise 15 of Section 1.3, we are looking for a matrix C such that

= I4

That is, BA–1 + AC = O. Thus

AC = –BA–1

and hence

C = –A–1BA–1

19. (a) Using the notation of Exercise 18, let

Then

so that

C = −−

−1

4

1 1

1 1

1 1

1 1

1 1

1 1

= −

=−

1

4

0 0

4 0

0 0

1 0

A− =

−

1 1

2

1 1

1 1

A B=−

=

1 1

1 1

1 1

1 1and

=+

−

I O

BA AC I

21

2

A O

B A

A O

C A

AA O O O

BA

=+ +−

−

−1

1

1

−− −+ +

1 1

AC O AA

46 Exercise Set 1.4

Thus the inverse of the given matrix is

20. (a) One such example is

In general, any matrix of the form

will work.

(b) One such example is

In general, any matrix of the form

will work.

0

0

0

b c

b f

c f

−− −

A =−−

0 0 1

0 0 2

1 2 0

a d e

d b f

e f c

A =−−

− −

1 0 1

0 2 2

1 2 3

1

2

1

20 0

1

2

1

20 0

0 01

2

1

2

1 01

2

1

−

−

−22

Exercise Set 1.4 47

21. We use Theorem 1.4.9.

(a) If A = BBT, then

AT = (BBT)T = (BT)TBT = BBT = A

Thus A is symmetric. On the other hand, if A = B + BT, then

AT = (B + BT)T = BT + (BT)T = BT + B = B + BT = A

Thus A is symmetric.

(b) If A = B – BT, then

AT = (B – BT)T = [B + (–1)BT]T = BT + [(–1)BT]T

= BT + (–1)(BT)T = BT + (–1)B = BT – B = –A

Thus A is skew-symmetric.

22. By Theorem 1.4.9, (An)T = (A(An–1))T = (An–1)T AT. If we repeat this process n – 1 times, weget that (An)T = (AT)n which justifies the equality.

23. Let

Then

AA

x x x

x x− =

111 12 13

21 2

1 0 1

1 1 0

0 1 122 23

31 32 33

11 31 12 32 1

x

x x x

x x x x x

=+ + 33 33

11 21 12 22 13 23

21 31 22 32 2

++ + ++ +

x

x x x x x x

x x x x x 33 33+

x

A

x x x

x x x

x x x

− =

111 12 13

21 22 23

31 32 33

48 Exercise Set 1.4

Since AA–1 = I, we equate corresponding entries to obtain the system of equations

The solution to this system of equations gives

24. (a) Let ij

denote the ijth entry of the matrix on the left and rij

be the corresponding entryfor the matrix on the right. Then

ij

= aij

+ (bij

+ cij)

Since the associative rule holds for real numbers, this implies that

ij

= (aij

+ bij) + c

ij= r

ij

(c) Suppose that B is m × n and C is n × p. Then

[a(BC)]ij

= a(bi1c1j

+ bi2c2j

+ … + bin

cnj

)

= (abi1)c1j

+ (abi2)c2j

+ … + (abin

)cnj

= [(aB)C]ij

= bi1 (ac

ij) + b

i2(ac2j) + … + b

in(ac

nj) = [B(aC)]

ij

A− =

−−

−

11 2 1 2 1 2

1 2 1 2 1 2

1 2 1 2 1 2

x x

x x

x x

x x

x x

11 31

12 32

13 33

11 21

12 22

1

0

0

0

+ =+ =

+ =+ =

+ ==+ =

+ =+ =

+ =

1

0

0

0

1

13 23

21 31

22 32

23 33

x x

x x

x x

x x

Exercise Set 1.4 49

25. We wish to show that A(B – C ) = AB – AC. By Part (d) of Theorem 1.4.1, we have A(B – C) = A(B + (–C)) = AB + A(–C). Finally by Part (m), we have A(–C) = –AC and the desired result can be obtained by substituting this result in the above equation.

26. Let aij

denote the ijth entry of the m × n matrix A. All the entries of 0 are 0. In the proofsbelow, we need only show that the ijth entries of all the matrices involved are equal.

(a) aij

+ 0 = 0 + aij

= aij

(b) aij

– aij

= 0

26. (c) 0 – aij

= – aij

(d) ai10 + a

i20 + … + ain

0 = 0 and 0 a1j+ 0a2j

+ … + amj

= 0

27. (a) We have

On the other hand,

( ) ) ( ) (A AA A AA A AA Ar s

r r r

= ( )

factors ffactors aactors

factors

factor

s

rs

AA A

=

ss

A A AA A AA Ar s

r s

= ( (

factors

)

factors

))

= =+

+AA A A

r s

r s

factors

50 Exercise Set 1.4

(b) Suppose that r < 0 and s < 0; let ρ = –r and δ = –s, so that

Ar As = A–ρ A–δ

= (A–1)ρ (A–1)ρ (by the definition)

= (A–1)ρ+δ (by Part (a))

= A–(ρ+δ) (by the definition)

= A–ρ–δ

= Ar+s

Also

(Ar)s = (A–ρ)–δ

= [(A–1)ρ]–δ (by the definition)

= ([(A–1)ρ]–1)δ (by the definition)

= ([(A–1)–1)]ρ)δ (by Theorem 1.4.8b)

= ([A]ρ)δ (by Theorem 1.4.8a)

= Aρδ (by Part (a))

= A (–ρ)(–δ)

= Ars

29. (a) If AB = AC, then

A–1(AB) = A–1(AC)

or

(A–1 A)B = (A–1 A)C

or

B = C

(b) The matrix A in Example 3 is not invertible.

Exercise Set 1.4 51

30. Following the hint, let ij

= [A(BC)]ij

and rij

= [(AB)C]ij

and suppose that A is m × n, B is n × p, and C is p × q. Then

i j

= ai1 [BC]1j

+ ai2[BC]2j

+ … + ain

[BC]nj

= ai1

(b11c1j+ … + b1p

cpi

) + ai2(b

21c

1j+ … + b

2pcpj

) + … + ain

(bn1

c1j

+ … + bnp

cpj

)

= (ai1b11 + a

i2b21 + … + ain

bn1)c1j

+ … + (ai1b1p

+ ai2b2p

+ … + ain

bnp)cpj

= [AB]i1c1j

+ … + [AB]ip

cpj

= rij

For anyone familiar with sigma notation, this proof can be simplified as follows:

31. (a) Any pair of matrices that do not commute will work. For example, if we let

then

whereas

A AB B2 22

1 3

0 1+ + =

( )A B+ =

=

22

1 1

0 1

1 2

0 1

A B=

1 0

0 0==

0 1

0 1

l a BC a b cij is sj is st tjt

p

s

n

= ===∑∑

11ss

n

is sts

n

tj ita b c AB

=

=

∑

∑=

=

1

1

cc

r

tjt

p

t

p

ij

==∑∑

=

11

52 Exercise Set 1.4

(b) In general,

(A + B)2 = (A + B)(A + B) = A2 + AB + BA + B2

32. (a) As in Exercise 31, let A and B be any appropriate matrices which do not commute.

33. If

Thus, A2 = I if and only if a211 = a 2

22 = a233 = 1, or a11 = ±1, a22 = ±1, and a33 = ±1. There are

exactly eight possibilities:

34. (a) The logical contrapositive is “If A is invertible, then AT is invertible.”

(b) The statement is true by Theorem 1.4.10.

35. (b) The statement is true, since (A – B)2 = (–(B – A))2 = (B – A)2.

(c) The statement is true only if A–1 and B–1 exist, in which case

(AB–1)(BA–1) = A(B–1B)A–1 = AInA–1 = AA–1 = I

n

36. D = (CT)–1 B–1 BT C–1 (AT)–1 B–1

1 0 0

0 1 0

0 0 1

1 0 0

0 1 0

0 0 1

1 0

−

00

0 1 0

0 0 1

1 0 0

0 1 0

0 0 1

1

−

−−

− 00 0

0 1 0

0 0 1

1 0 0

0 1 0

0 0 1

−

−

−−

−1 0 0

0 1 0

0 0 1

11 0 0

0 1 0

0 0 1

−−

A

a

a

a

A

a

=

=11

22

33

20 0

0 0

0 0

then

1112

222

332

0 0

0 0

0 0

a

a

Exercise Set 1.4 53

EXERCISE SET 1.5

1. (a) The matrix may be obtained from I2 by adding –5 times Row 1 to Row 2. Thus, it iselementary.

(c) The matrix may be obtained from I2 by multiplying Row 2 of I2 by √3. Thus it iselementary.

(e) This is not an elementary matrix because it is not invertible.

(g) The matrix may be obtained from I4 only by performing two elementary row operationssuch as replacing Row 1 of I4 by Row 1 plus Row 4, and then multiplying Row 1 by 2.Thus it is not an elementary matrix.

3. (a) If we interchange Rows 1 and of A, then we obtain B. Therefore, E1 must be the matrixobtained from I3 by interchanging Rows 1 and 3 of I3, i.e.,

(c) If we multiply Row 1 of A by –2 and add it to Row 3, then we obtain C. Therefore, E3must be the matrix obtained from I3 by replacing its third row by –2 times Row 1 plusRow 3, i.e.,

5. (a) R1 ↔ R2 Row 1 and Row 2 are swapped

(b) R1 → 2R1

R2 → –3R2

(c) R2 → –2R1 + R2

E3

1 0 0

0 1 0

2 0 1

=−

E1

0 0 1

0 1 0

1 0 0

=

55

6. (a)

Therefore

(c)

Thus, the given matrix is not invertible because we have obtained a row of zeros on theleft side.

6 4 1 0

3 2 0 1

1 2 3 1 6 0

3 2 0 1

−−

−−

−

1 2 3 1 6 0

0 0 1 2 1

1 4

2 7

7 4

2 1

1

=

−−

−

1 4 1 0

2 7 0 1

1 4 1 0

0 1 3 1

1 0 7 4

0 1 2 1

− −

−−

56 Exercise Set 1.5

Add –2 times Row1 to Row 2.

Multiply Row 2 by –1;then multiply Row 2 by –4and add to Row 1.

Divide Row 1 by 6.

Multiply Row 1 by 3and add to Row 2.

7. (a)

Thus, the desired inverse is

3

2

11

10

6

51 1 1

1

2

7

10

2

5

− −

−

−

3 4 1 1 0 0

1 0 3 0 1 1

2 5 4 0 0 1

1 0 3 0 1 0

3 4 1

−

−

−− 11 0 0

2 5 4 0 0 1

1 0 3 0 1 0

0 4 10 1 3 0

0

−

− −55 10 0 2 1

1 0 3 0 1 0

0 4 10 1

− −

− −33 0

0 1 0 1 1 1

1 0 3 0 1 0

0 1

−

0 1 1 1

0 0 10 5 7 4

1 0 03

2

−− − −

−− −

−

−

11

10

6

50 1 0 1 1 1

0 0 11

2

7

10

2

5

Exercise Set 1.5 57

Interchange Rows1 and 2.

Add –3 times Row 1to Row 2 and –2 timesRow 1 to 3.

Add –4 times Row 3to Row 2 and inter-change Rows 2 and 3.

Multiply Row 3 by–1/10. Then add –3times Row 3 to Row 1.

Add –1 times Row 2to Row 3.

7. (c)

Thus

(e)

1 0 1

0 1 1

1 1 0

1

2

1

2

1

21

2

1

2

1

1

=

−

−

−

221

2

1

2

1

2−

1 0 1 1 0 0

0 1 1 0 1 0

1 1 0 0 0 1

1 0 1 1 0 0

0 1 1

0 1 0

0 1 1 1 0 1

1 0 1 1 0 0

0 1 1 0 1 0

0 0 11

2

1

2

1

− −

22

1 0 01

2

1

2

1

2

0 1 01

2

1

2

−

−1

2

1 1 01

2

1

2

1

2−

58 Exercise Set 1.5

Subtract Row 1from Row 3.

Subtract Row 2 fromRow 3 and multiplyRow 3 by -1/2.

Subtract Row 3 from Rows 1 and 2.

Thus

1 0 1

1 1 1

0 1 0

1

2

1

2

1

20 0

1

−

=

−−

11

1

2

1

2

1

2−

1 0 1 1 0 0

1 1 1 0 1 0

0 1 0 0 0 1

1 0 1 1

−

00 0

0 1 2 1 1 0

0 0 2 1 1 1

1 0 1 1 0 0

0

− − −

11 0 0 0 1

0 0 11

2

1

2

1

2

1 0 01

2

1

2

1

20

−

−

11 0 0 0 1

0 0 11

2

1

2

1

2−

Exercise Set 1.5 59

Add Row 1 to Row 2and subtract the newRow 2 from Row 3.

Add Row 3 to Row 2and then multiplyRow 3 by -1/2.

Subtract Row 3 from Row 1.

8. (a)

1

5

1

5

2

51 0 0

1

5

1

5

1

100 1 0

1

5

4

5

1

100 0 1

−

−

−

−

1 1 2 5 0 0

1 11

20 5 0

1 41

220 0 5

1 1 2 5 0 0

0 05

25

−

− 55 0

1 55

25 0 5

1 1 2 5 0 0

0

− −

−00 1 2 2 0

0 11

21 0 1

1 1 0

−

− −

11 4 0

0 0 1 2 2 0

0 1 0 0 1 1

1 0 0 1 3 1

−−

00 1 0 0 1 1

0 0 1 2 2 0

−−

60 Exercise Set 1.5

Multiply all threeRows by 5.

Subtract Row 3 from both Row 2 and Row 3.

Multiply Row 2 by 2/5and Row 3 by -1/5.

Add 2 times Row 2 to Row 1 and 1/2times Row 2 to Row 3.

Subtract Row 3 from Row 1 and interchangeRows 2 and 3.

Thus the desired inverse is

(b) Hint: First multiply Rows 1 and 2 by 1/√2.

(c)

1 0 0 0 1 0 0 0

1 3 0 0 0 1 0 0

1 3 5 0 0 0 1 0

1 3 5 7 0 0 0 1

−−−

1 0 0 0 1 0 0 0

0 3 0 0 1 1 0 0

0 3 5 0 1 0 1 0

0 3 5 7 1 0 0 1

−

1 0 0 0 1 0 0 0

0 1 0 01

3

1

30 0

0 0 5 0 0 1 1 0

0 0 5 7 0 1 0 1

1 0 0 0 1

−−

0 0 0

0 1 0 01

3

1

30 0

0 0 1 0 01

5

1

50

0 0 0 1

−

−

0 01

7

1

7−

1 3 1

0 1 1

2 2 0

−−

Exercise Set 1.5 61

Subtract Row 1 from Rows 2-4.

Subtract Row 2 fromRows 3 and 4 andmultiply Row 2 by 1/3.

Subtract Row 3 from Row 4 andmultiply Row 3 by 1/5and Row 4 by 1/7.

Thus the desired inverse is

8. (d) Since the third rowconsists entirely of zeros, this matrix is not invertible.

9. (b) Multiplying Row i of

by 1/ki

for i = 1, 2, 3, 4 and then reversing the order of the rows yields I4 on the leftand the desired inverse

on the right.

(c) To reduce

we multiply Row i by 1/k and then subtract Row i from Row (i + 1) for i = 1, 2, 3.Then multiply Row 4 by 1/k. This produces I4 on the left and the inverse,

k

k

k

k

0 0 0 1 0 0 0

1 0 0 0 1 0 0

0 1 0 0 0 1 0

0 0 1 0 0 0 1

0 0 0 1

0 0 1 0

0 1 0 0

1 0 0 0

4

3

2

1

k

k

k

k

0 0 0 1 0 0 0

0 0 0 0 1 0 0

0 0 0 0 0 1 0

0 0 0 0 0 0 1

1

2

3

4

k

k

k

k

1 0 0 0

1

3

1

30 0

01

5

1

50

0 01

7

1

7

−

−

−

62 Exercise Set 1.5

on the right.

10. (a) It takes 2 elementary rowoperations to convert A to I: first multiply Row 1 by 5 andadd to Row 2, then multiply Row 2 by 1/2. If we apply each of these operations to I, weobtain the 2 elementary matrices

From Theorem 1.5.1, we conclude that

E2E1A = I

(b) Since the inverse of a matrix is unique, the above equation guarantees that (with E1and E2 as defined above)

A–1 = E2E1

(c) From the equation E2E1A = I, we conclude that

E2–1E2E1A = E2

–1I

or

E1A = E2–1

or

E1–1E1A = E1

–1E2–1

or

A = E 1–1E 2

–1

E E121 0

5 1

1 0

0 1 2=

=

and

1 0 0 0

1 1 0 0

1 1 1 0

1 1 1 1

2

3 2

4 3 2

/ k

k k

k k k

k k k k

−

−

−

Exercise Set 1.5 63

so that

12.

13. (a) E3E2E1A =

(b) A = (E3E2E1)–1 = E–1

–1E2–1E3

–1

14. We first reduce A to row-echelon form, keeping track as we do so of the elementary rowoperations which are required. Here

1. If we interchange Rows 1 and 2, we obtain the matrix

1 3 3 8

0 1 7 8

2 5 1 8− − −

A =− − −

0 1 7 8

1 3 3 8

2 5 1 8

=−

1 0 2

0 1 0

0 0 1

1 0 0

0 1 3

0 0 1

1 0 0

0 4 0

0 0 1

1 0 0

0 1 4 0

0 0 1

1 0 0

0 1 3

0 0 1

−

−

1 0 2

0 1 0

0 0 1

1 0 2

0 4 3

0 0 1

= I3

3 2

3 1

1 0

1 1

1 2

0 1

3 0

0 1

−−

=

−

A =−

1 0

5 1

1 0

0 2

64 Exercise Set 1.5

2. If we add twice Row 1 to Row 3, we obtain the matrix

3. If we subtract Row 2 from Row 3, we obtain the matrix

Note that the above matrix R is in row-echelon form. Thus, Theorem 1.5.1 implies thatE3E2E1A = R, where

Thus, A = E1–1E2

–1E 3–1R. We need only let

and

G E= =

−3

11 0 0

0 1 0

0 1 1

F E=−

−2

11 0 0

0 1 0

2 0 1

E E1

11

0 1 0

1 0 0

0 0 1

=

−

E E1 2

0 1 0

1 0 0

0 0 1

1 0 0

0 1 0

2 0 1

,=

=

=−

, and E3

1 0 0

0 1 0

0 1 1

R =

1 3 3 8

0 1 7 8

0 0 0 0

1 3 3 8

0 1 7 8

0 1 7 8

Exercise Set 1.5 65

15. If A is an elementary matrix, then it can be obtained from the identity matrix I by a singleelementary rowoperation. If we start with I and multiply Row 3 by a nonzero constant, thena = b = 0. If we interchange Row 1 or Row 2 with Row 3, then c = 0. If we add a nonzeromultiple of Row 1 or Row 2 to Row 3, then either b = 0 or a = 0. Finally, if we operate onlyon the first two rows, then a = b = 0. Thus at least one entry in Row 3 must equal zero.

17. Every m × n matrix A can be transformed into reduced row-echelon form B by a sequenceof rowoperations. From Theorem 1.5.1,

B = EkE

k–1… E1A

where E1, E2, …, Ek

are the elementary matrices corresponding to the row operations. If wetake C = E

kE

k–1… E1, then C is invertible by Theorem 1.5.2 and the rule following Theorem

1.4.6.

18. The hypothesis that B is rowequivalent to A can be expressed as follows:

B = EkE

k–1… E1A

where E1, E2, …, Ek

are elementary matrices. Since A is invertible, then (by the rule followingTheorem 1.5.3) B is invertible and

B–1 = A–1E1–1E2

–1 … Ek–1

19. (a) First suppose that A and B are row equivalent. Then there are elementary matrices E1, . . ., E

psuch that A = E1

… EpB. There are also elementary matrices E

p+1, . . ., Ep+q

such that Ep+1

… Ep+q

A is in reduced row-echelon form. Therefore, the matrix Ep+1

…E

p+qE1

… EpB is also in (the same) reduced row-echelon form. Hence we have found,

via elementary matrices, a sequence of elementary row operations which will put B inthe same reduced row-echelon form as A.

Now suppose that A and B have the same reduced row-echelon form. Then there are elementary matrices E1, . . ., E

pand E

p+1, . . ., Ep+q

such that E1… E

pA =

Ep+1

… Ep+q

B. Since elementary matrices are invertible, this equation implies that A = E

p–1 … E–1

1Ep+1… E

p+qB. Since the inverse of an elementary matrix is also an

elementary matrix, we have that A and B are row equivalent.

20. There are kinds of elementary row operations, and therefore basic kinds of elementarymatrices. First, suppose that we multiply row k by the constant c ≠ 0. This results in the elementary matrix E with e

kk= c and all other entries identical to those of I

m. Now

consider the product EA. Since

[EA]ij

= ei1a1j

+ ei2a2j

+ … + eim

amj

66 Exercise Set 1.5

where eij

= 0 unless i = j, we have that [EA]ij

= aij

unless i = k, in which case [EA]kj

= cakj

.That is, EA is just A with the kth row multiplied by c.

Next, suppose that we interchange rows k and . This results in the elementary matrix E in which e

k = ek = 1, all other entries in rows k and are zero, and otherwise

E coincides with Im

. Thus [EA]ij

= aij

unless i = k or i = , in which case [EA]kj

= ajand

[EA]j = akj. That is, EA is just A with rows k and interchanged.

Finally, suppose that we multiply row k by c and add the result to row where k ≠ .This results in the elementary matrix E in which ek

= c and all other entries coincide withthose of I

m. Thus [EA]

ij= a

ijunless i = , in which case [EA]j

= cakj

+ aj, so that EA is just

A with c times row k added to row .

21. The matrix A, by hypothesis, can be reduced to the identity matrix via a sequence ofelementary row operations. We can therefore find elementary matrices E1, E2, . . . E

ksuch

that

Ek

… E2• E1

• A = In

Since every elementary matrix is invertible, it follows that

A = E1–1E2

–1 … Ek–1I

n

22. (a) False. A square matrix need not be invertible, but the product of elementary matricesis. Thus the zero matrix provides a counterexample.

(b) False. The product of two elementary matrices is a matrix which represents twosuccessive elementary row operations. These operations can usually not beaccomplished by a single elementary row operation. For instance, let

Then

which is clearly not an elementary matrix since it cannot be obtained from I2 via asingle elementary row operation.

(c) True. If A is invertible and we apply an elementary row operation to A, then we have,in effect, multiplied A by an elementary matrix E. The resulting matrix EA is invertibleas the product of invertible matrices.

E E1 22 0

0 2=

E E1 22 0

0 1

1 0

0 2=

=

and

Exercise Set 1.5 67

23. (a) True. Suppose we reduce A to its reduced row-echelon form via a sequence ofelementary row operations. The resulting matrix must have at least one row of zeros,since otherwise we would obtain the identity matrix and A would be invertible. Thusat least one of the variables in x must be arbitrary and the system of equations willhave infinitely many solutions.

(b) See Part (a).

(d) False. If B = EA for any elementary matrix E, then A = E–1B. Thus, if B wereinvertible, then A would also be invertible, contrary to hypothesis.

24. There is not. For instance, let b = 1 and a = c = d = 0. Then there is no matrix A which willsatisfy the equation.

68 Exercise Set 1.5

EXERCISE SET 1.6

1. This system of equations is of the form Ax = b, where

By Theorem 1.4.5,

Thus

That is,

x1 = 3 and x2 = –1

3. This system is of the form Ax = b, where

By direct computation we obtain

A

x

x

x

=

=

1 3 1

2 2 1

2 3 1

1

2

3

xx

= −

and bb

4

1

3

xx bb= =−

−

=

−−

A1 6 1

5 1

2

9

3

11

A− =

−−

1 6 1

5 1

Ax

x=

1 1

5 61

2xx = and b =

2

9

69

so that

That is,

x1 = –1, x2 = 4, and x3 = –7

5. The system is of the form Ax = b, where

By direct computation, we obtain

Thus,

That is, x1 = 1, x2 = 5, and x3 = –1.

xx bb= =−

−A

11

5

1

A− =

−

−

1 1

5

1 0 1

3 1 1

1 1 0

A

x

x

x

= −−

1 1 1

1 1 4

4 1 1

1

2

3

xx =

=

and bb

5

10

0

xx bb= =−

−

−A

11

4

7

A− =

−−

−

11 0 1

0 1 1

2 3 4

70 Exercise Set 1.6

7. The system is of the form Ax = b where

By Theorem 1.4.5, we have

Thus

That is,

x1 = 2b1 – 5b2 and x2 = –b1 + 3b2

9. The system is of the form Ax = b, where

We compute

so that

xx bb=

− ( ) + ( ) +

( ) −−A

b b b

b1

1 2 3

1

1 3 1 3

1 3 1 33

2 3 1 3

2

1 2 3

( )( ) + ( ) −

b

b b b

A− =

−−

−

11 3 1 3 1

1 3 1 3 0

2 3 1 3 1

A

x

x

x

= −

=1 2 1

1 1 1

1 1 0

1

2

3

xx

=

and bb

b

b

b

1

2

3

xx bb= =−

− +

−A

b b

b b

1 1 2

1 2

2 5

3

A− =

−−

1 2 5

1 3

Ax

x=

=

3 5

1 20 1

2and bb =

b

b

1

2

Exercise Set 1.6 71

9. (a) In this case, we let

Then

That is, x1 = 16/3, x2 = –4/3, and x3 = –11/3.

9. (c) In this case, we let

Then

That is, x1 = 3, x2 = 0, and x3 = –4.

10. We apply the method of Example 2 to the two systems (a) and (c) solved above. Thecoefficient matrix augmented by the two b matrices yields

1 2 1 1 1

1 1 1 3 1

1 1 0 4 3

− −− −

xx bb=−

−A

13

0

4

bb =−−

1

1

3

xx bb= −−

−A

116 3

4 3

11 3

bb =−

1

3

4

72 Exercise Set 1.6

The reduced row-echelon form may be obtained as follows:

This, fortunately, yields the same results as in Exercise 9.

11. The coefficient matrix, augmented by the two b matrices, yields

This reduces to

or

Thus the solution to Part (a) is x1 = 22/17, x2 = 1/17, and to Part (b) is x1 = 21/17, x2 = 11/17.

1 0

0 1

22 17 21 17

1 17 11 17

1 5 1 2

0 17 1 11

− −

1 5 1 2

3 2 4 5

− −

1 2 1 1 1

0 3 0 4 0

0 2 1 1 4

1 0 2 2

− −−

−

− −55

0 1 0 4 3 0

0 0 1 2 11 6 2

1 0 0 16 3 3

0

−−

/

11 0 4 3 0

0 0 1 11 3 4

−− −

Exercise Set 1.6 73


Add –1 times Row 2 toRow 3 and –1 times Row 1to Row 2.

Add –1 times Row 3 toRow 2, divide Row 2 by–3 and ROw 3 by 2, add–1 times Row 2 to Row 3

Multiplt Row 3 by –2 andthen add –2 times Row 3to Row 1.

Divide Row 2 by 17 and add5 times Row 2 to Row 1.

15. As above, we set up the matrix

This reduces to

or

or

Thus if we let x3 = t, we have for Part (a) x1 = –12 – 3t and x2 = –5 – t, while for Part (b) x1 = 7 – 3t and x2 = 3 – t.

16. The augmented matrix for this sytem is

which, when reduced to row-echelon form, becomes

1 2 3 3

0 0 22

1 2

−−

b

b b

6 4

3 21

2

−−

b

b

1 0 3 12 7

0 1 1 5 3

0 0 0 0 0

−−

1 2 1 2 1

0 1 1 5 3

0 0 0 0 0

− −−

1 2 1 2 1

0 1 1 5 3

0 1 1 5 3

− −− − −− − −

1 2 1 2 1

2 5 1 1 1

3 7 2 1 0

− −− −− −

74 Exercise Set 1.6

Add twice Row 2to Row 3.

Add appropriatemultiples of Row 1to Rows 2 and 3.

Add –1 times Row 2 toRow 3 and multiplyRow 2 by –1.

The system is consistent if and only if b1 – 2b2 = 0 or b1 = 2b2. Thus Ax = b is consistent ifand only if b has the form

17. The augmented matrix for this system of equations is

If we reduce this matrix to row-echelon form, we obtain

The third row implies that b3 = b1 – b2. Thus, Ax = b is consistent if and only if b has theform

20. (a) In general, Ax = x if and only if Ax – x = 0 if and only if (A – I)x = 0. In this case,

and hence,

A = −

2 1 2

2 2 2

3 1 1

bb =−

b

b

b b

1

2

1 2

1 2 5

0 1 41

34

0 0 0

1

2 1

1 2 3

−

− −( )− + +

b

b b

b b b

1 2 5

4 5 8

3 3 3

1

2

3

−−

− −

b

b

b

bb =

2 2

2

b

b

Exercise Set 1.6 75

The reduced row-echelon form of A–I is

Therefore, the system of equations (A – I)x = 0 has only the trivial solution x1 = x2 = x3 = 0; that is, x = 0 is the only solution to Ax = x.

22. (a) It is clear that x = 0 is the only solution to the equation Ax = 0. We may then invoke Theorem 1.6.4 to conclude that A is invertible.

23. Since Ax = has only x = 0 as a solution, Theorem 1.6.4 guarantees that A is invertible. ByTheorem 1.4.8 (b), Ak is also invertible. In fact,

(Ak)–1 = (A–1)k

Since the proof of Theorem 1.4.8 (b) was omitted, we note that

Because Ak is invertible, Theorem 1.6.4 allows us to conclude that Akx = 0 has only thetrivial solution.

24. First, assume that Ax = 0 has only the trivial solution. It then follows from Theorem 1.6.4that A is invertible. Now Q is invertible by assumption. Thus, QA is invertible because theproduct of two invertible matrices is invertible. If we apply Theorem 1.6.4 again, we see that(QA)x = 0 has only the trivial solution.

Conversely, assume that (QA)x = 0 has only the trivial solution; thus, QA is invertibleby Theorem 1.6.4. But if QA is invertible, then so is A because A = Q–1(QA) is the productof invertible matrices. But if A is invertible, then Ax = 0 has only the trivial solution.

A A A A A A

k k

− − − =1 1 1

factors factors

II

A I− =

1 0 0

0 1 0

0 0 1

A I− =

1 0 0

0 1 0

0 0 1

76 Exercise Set 1.6

25. Suppose that x1 is a fixed matrix which satisfies the equation Ax1 = b. Further, let x be anymatrix whatsoever which satisfies the equation Ax = b. We must then show that there is amatrix x0 which satisfies both of the equations x = x1 + x0 and Ax0 = 0.

Clearly, the first equation implies that

x0 = x – x1

This candidate for x0 will satisfy the second equation because

Ax0 = A(x – x1) = Ax – Ax1 = b – b = 0

We must also show that if both Ax1 = b and Ax0 = 0, then A(x1 + x0) = b. But

A(x1 + x0) = Ax1 + Ax0 = b + 0 = b

26. We are given that AB = I. By Part (a) of Theorem 1.6.3, A must be the inverse of B. But then

A = B–1 ⇒ A–1 = (B–1)–1 = B

That is, A–1 = B.

27. (a) x ≠ 0 and x ≠ y

(b) x ≠ 0 and y ≠ 0

(c) x ≠ y and x ≠ –y

Gaussian elimination has to be performed on (A I) to find A–1. Then the productA–1B is performed, to find x. Instead, use Gaussian elimination on (A B) to find x. Thereare fewer steps in the Gaussian elimination, since (A B) is a m × (n+1) matrix in general,or n × (n+1) where A is square (n × n). Compare this with (A I) which is n × (2n) in theinversion approach. Also, the inversion approach only works for A n × n and invertible.

28. (a) For the equation x = Ax + b to have a unique solution for x, the equation (I – A)x = bmust also have a unique solution. We are assured of this if the matrix I – A is invertible.

29. No. The system of equations Ax = x is equivalent to the system (A – I)x = 0. For thissystem to have a unique solution, A – I must be invertible. If, for instance, A = I, then anyvector x will be a solution to the system of equations Ax = x.

Note that if x ≠ 0 is a solution to the equation Ax = x, then so is kx for any real number

Exercise Set 1.6 77

k. A unique solution can only exist if A – I is invertible, in which case, x = 0.

30. Yes. The matrices A and B need not be square. For example, if

then AB = I2 but

In fact, only square matrices can be invertible.

31. Let A and B be square matrices of the same size. If either A or B is singular, then AB issingular.

BA =

1 0

0 0 0

0 0 1

0

A =

=

1 0 0

0 0 1

1 0

0 0

0 1

and B

78 Exercise Set 1.6

EXERCISE SET 1.7

6. For A to be symmetric, the following equations must hold:

a – 2b + 2c = 3

2a + b + c = 0

a + c = –2

The solution to this system of equations is a = 11, b = –9, c = –13. This is the only set ofvalues for which A is symmetric.

7. The matrix A fails to be invertible if and only if a + b – 1 = 0 and the matrix B fails to beinvertible if and only if 2a – 3b – 7 = 0. For both of these conditions to hold, we must havea = 2 and b = –1.

8. (a) Since the product is not symmetric, the matrices do not commute.

(b) Since the product is symmetric, the matrices commute.

9. We know that A and B will commute if and only if

is symmetric. So 2b + d = a – 5b, from which it follows that a – d = 7b.

ABa b

b d

a b b d

a b b=

−

=

+ +−

2 1

1 5

2 2

5 −−

5d

79

10. (b) If

so that

Thus there are 8 possible matrices A which satisfy the given equation.

11. (b) Clearly

for any real number k = 0.

13. We verify the result for the matrix A by finding its inverse.

−

−

− − −

1 2 5 1 0 0

0 1 3 0 1 0

0 0 4 0 0 1

1 2 5 1 0 0

0 1 3 0 11 0

0 0 1 0 0 1 4−

A

ka ka ka

ka ka ka

ka ka ka

=

11 12 23

21 22 23

31 32 33

3 0 0

0 5 0

0 0 7

k

k

k

A =

±

±

±

1

30 0

01

20

0 0 1

A A− −=

2 19 0 0

0 4 0

0 0 1

then =±

±±

3 0 0

0 2 0

0 0 1

80 Exercise Set 1.7

Multiply Row 1 by –1and Row 3 by –1/4.

Thus A–1 is indeed upper triangular.

14. (a) Since A–1 = , the inverse is indeed symmetric.

15. (a) If A is symmetric, then AT = A. Then (A2)T = (AA)T = ATAT = A . A = A2, so A2

is symmetric.

(b) We have from part (a) that

(2A2 – 3A + I)T = 2(A2)T + 3AT + IT = 2A2 + 3A + I

16. (a) The statement is true for k = 1, since A is symmetric. It is true for k = 0, since A0 = Iis symmetric. Assume AK is symmetric. Show that AK+1 is also symmetric.

We have (AK+1)T = (AK • A)T = AT • (AK)T = A • AK = AK+1. So AK+1 is symmetric.

By induction, AK is symmetric for all K = 0,1,2,3, . . ., if A is symmetric.

(b) Yes—this follows from part (a), with use of the rules (A + B)T = AT + BT, and (kA)T = KAT. So, let p(x) = a0x

n + a, xn + … + an–1x + a

nbe any real polynomial in

x. Then P(A) = a0An + a, An–1 + … + a

n–1A + anI. Then [P(A)]T = (a0A

n + a, An–1 + …+ a

n–1 A + anI)T

= a0(An)T + a,(An–1)T … + an–1A

T + anIT

= a0 An + a, An–1 + … + an–1A + a

nI = P(A).

So P(A) is symmetric.

3 5 1 5

1 5 2 5

1 0 1 1 2 0

0 1 0 0 1 3 4

0 0 1 0 0 1 4

1 0 0 1 2 1 4

0 1 0

−

−

−00 1 3 4

0 0 1 0 0 1 4−

Exercise Set 1.7 81

Add –1 times Row3 to Row 1.

Add 2 times Row 2 toRow 1 and –3 timesRow 3 to Row 2.

17. From Theorem 1.7.1(b), we have if A is an n × n upper triangular matrix, so is A2. Byinduction, if A is an n × n upper triangular matrix, so is Ak, k = 1, 2, 3, . . . We note that theidentity matrix I

n= A0 is also upper triangular. Next, if A is n × n upper triangular, and k

is any (real) scalar, then KA is upper triangular. Also, if A and B are n × n upper triangularmatrices, then so is A+B. These facts allow us to conclude if p(x) is any (real) polynomial,and A is n × n upper triangular, then p(A) is an n × n upper triangular matrix.

18. If A = AT A, then AT = (AT A)T = AT (AT)T = AT A = A, so A is symmetric. This implies thatA = AT, so A = A2.

19. Let

Then if A2 – 3A – 4I = O, we have

This leads to the system of equations

x2 – 3x – 4 = 0

y2 – 3y – 4 = 0

z2 – 3z – 4 = 0

which has the solutions x = 4, –1, y = 4, –1, z = 4, –1. Hence, there are 8 possible choicesfor x, y, and z, respectively, namely (4, 4, 4), (4, 4, –1), (4, –1, 4), (4, –1, –1), (–1, 4, 4),(–1, 4, –1), (–1, –1, 4), and (–1, –1, –1).

20. (a) Since aij

= i2 + j2 = j2 + i2 = aji, A is symmetric.

(d) Since 2i2 + 2j3 need not equal 2j2 + 2i3, A is not symmetric. For instance a21 ≠ a12.In fact, A will be symmetric only in the trivial case where n = 1.

x

y

z

x

y

z

2

2

2

0 0

0 0

0 0

3

0 0

0 0

0 0

−

−

=4

1 0 0

0 1 0

0 0 1

O

A

x

y

z

=

0 0

0 0

0 0

82 Exercise Set 1.7

22. (a) If A is invertible and skew-symmetric, then by Theorem 1.4.10,

(A–1)T = (AT)–1 = (–A)–1 = –(A–1)

so A–1 is skew-symmetric.

(c) From the hint,

A = 12(A + AT) + 12(A – AT)

so we need only prove that 12 (A + AT) is symmetric and that 1

2 (A – AT) is skew-symmetric. To this end, note that

12(A + AT)T = 12(AT + (AT)T) = 12(A + AT)

and that

12(A – AT)T = 12(AT – (AT)T) = 12(AT – AT – A) = 12(A – AT)

23. The matrix

is skew-symmetric but

is not skew-symmetric. Therefore, the result does not hold.

In general, suppose that A and B are commuting skew-symmetric matrices. Then (AB)T = (BA)T = AT BT = (–A)(–B) = AB, so that AB is symmetric rather than skew-symmetric. [We note that if A and B are skew-symmetric and their product is symmetric,then AB = (AB)T = BT AT = (–B)(–A) = BA, so the matrices commute and thus skew-symmetric matrices, too, commute if and only if their product is symmetric.]

24. (a) Here

AA A= =−

−

2 1 0

0 1

A =−

0 1

1 0

Exercise Set 1.7 83

has the solution y1 = 1, y2 = 0, y3 = –2. Thus

so that x1 = 7/4, x2 = 1, x3 = –1/2.

25. Let

Then

Hence, x3 = 1 which implies that x = 1, and z3 = –8 which implies that z = –2. Therefore,3y = 30 and thus y = 10.

26. An n × n matrix has n2 entries, n of them along the diagonal and (n2 – n)/2 of them abovethe diagonal. Thus we could assign distinct values to n + (n2 – n)/2 = (n2 + n)/2 entries.The remaining (n2 – n)/2 entries below the diagonal would coincide with the correspondingentries above the diagonal.

27. To multiply two diagonal matrices, multiply their corresponding diagonal elements to obtaina new diagonal matrix. Thus, if D1 and D2 are diagonal matrices with diagonal elements d1, . . ., d

nand e1, . . ., e

nrespectively, then D1D2 is a diagonal matrix with diagonal elements

d1e1, . . ., dne

n. The proof follows directly from the definition of matrix multiplication.

Ax y x xz z

z

33 2 2

30

1 30

0 8=

+ +( )

=−

Ax y

z=

0

U

x

x

x

xx

2 1 3

0 1 2

0 0 4

1

2

3

−

=11

0

2−

= yy

L

y

y

y

yy

1 0 0

2 3 0

2 4 1

11

2

3

−

= −22

0

= b

84 Exercise Set 1.7

28. Suppose that D has diagonal elements d1, . . ., dn. Since A is a square matrix and AD = I,

Theorem 1.6.3 guarantees that A = D–1. Thus, A is a diagonal matrix with diagonal elements1/d1, . . ., 1/d

n. Note that since D is invertible, it follows that all of its diagonal elements

must be nonzero.

29. In general, let A = [aij]n×n

denote a lower triangular matrix with no zeros on or below thediagonal and let Ax = b denote the system of equations where b = [b1, b2, . . ., b

n]T. Since

A is lower triangular, the first row of A yields the equation a11x1 = b1. Since a11 ≠ 0, we cansolve for x1. Next, the second row of A yields the equation a21x1 + a22x2 = b2. Since weknow x1 and since a22 ≠ 0, we can solve for x2. Continuing in this way, we can solve forsuccessive values of x

iby back substituting all of the previously found values x1, x2, . . ., x

i–1.

30. (a) False. If A were invertible, then AT would also be invertible, and hence so would AAT.

(c) True. Use Theorems 1.4.10 and 1.5.3.

(d) False. Let A = . Then if A2 is symmetric, we must have ab + bd = ac + cd

so that either a = –d or b = c. If b = c, then A is symmetric. However, if b ≠ c but

a = –d, then A2 is symmetric, but A is not. For instance, let a = .1 0

1 1−

a b

c d

Exercise Set 1.7 85

SUPPLEMENTARY EXERCISES 1

1.

3

5

4

54

5

3

5

14

3

5

34

5

3

5

−

−

x

y

x

y

−

− +

−

14

3

5

3

05

3

4

3

1

x

x y

44

3

5

3

0 14

5

3

5

1 03

5

4

5

0 14

5

x

x y

x y

− +

+

− xx y+

3

5

87



Add –4/5 times Row 1to Row 2.

Add –4/3 times Row 2to Row 1.

Thus,x′ = 3–

5x + 4–

5y

y′ = – 4–5

x + 3–5

y

3. We denote the system of equations by

a11x1 + a12x2 + a13x3 + a14x4 = 0

a21x1 + a22x2 + a23x3 + a24x4 = 0

If we substitute both sets of values for x1, x2, x3, and x4 into the first equation, we obtain

a11 – a12 + a13 + 2a14 = 0

2a11 + 3a13 – 2a14 = 0

where a11, a12, a13, and a14 are variables. If we substitute both sets of values for x1, x2, x3,and x4 into the second equation, we obtain

a21 – a22 + a23 + 2a24 = 0

2a21 + 3a23 – a24 = 0

where a21, a22, a23, and a24 are again variables. The two systems above both yield the matrix

which reduces to

This implies that

a11 = –(3/2)a13 + (1/2)a14

a12 = –(1/2)a13 + (5/2)a14

1 0 3 2 1 2 0

0 1 1 2 5 2 0

−−

1 1 1 2 0

2 0 3 1 0

−−

88 Supplemental Exercises 1

and similarly,

a21

= (–3/2)a23

+ (1/2)a24

a22 = (–1/2)a23 + (5/2)a24

As long as our choice of values for the numbers aij

is consistent with the above, then the system will have a solution. For simplicity, and to insure that neither equation is amultiple of the other, we let a13 = a14 = –1 and a23 = 0, a24 = 2. This means that a11 = 1, a12 = –2, a21 = 1, and a22 = 5, so that the system becomes

x1 – 2x2 – x3 – x4 = 0

x1 + 5x2 + 2x4 = 0

Of course, this is just one of infinitely many possibilities.

4. Suppose that the box contains x pennies, y nickels, and z dimes. We know that

x + 5y + 10z = 83

x + y + z = 13

The augmented matrix of this system is

which can be reduced to the matrix

by Gauss-Jordan elimination. Thus,

x = 5–4

z – 9–2

z and y = 35—2

Since x, y, and z must all be integers between 0 and 13, then z must be either 2, 6, or 10because those are the only values of z for which x and y are both integers. However, z = 2implies that x < 0 while z = 10 implies that y < 0. Therefore, the only solution is x = 3, y = 4, and z = 6.

1 05

4

9

2

0 19

4

35

2

− −

1 5 10 83

1 1 1 13

Supplemental Exercises 1 89

5. As in Exercise 4, we reduce the system to the equations

Since x, y, and z must all be positive integers, we have z > 0 and 35 – 9z > 0 or 4 > z. Thuswe need only check the three values z = 1, 2, 3 to see whether or not they produce integersolutions for x and y. This yields the unique solution x = 4, y = 2, z = 3.

6. The augmented matrix of this system is

which can be reduced to the matrix

Row 3 of the above matrix implies that the system of equations will have a solution if andonly if 2a2 – a – 6 = 0; that is, if and only if a = 2 or a = –3/2. For either of those values ofa, the solution is x1 = 2 – t, x2 = t, and x3 = 2. Hence, the system has infinitely manysolutions provided a = 2 or a = –3/2. For all other values of a, there will be no solution.

8. Let s = xy, t = √y, and u = zy. Then we have a system of linear equations in s, t, and uwith augmented matrix

1 2 3 8

2 3 2 7

1 1 2 4

−−

−

1 1 0 2

0 0 1 2

0 0 0 2 62a a− −

1 1 1 4

0 0 1 2

0 0 4 22a a− −

x

yz–

=+

=

1 5

4

35 9

4

z


This reduces to

so thatxy = 5

√y = 3

zy = 3

Thus from the second equation, y = 9, and from the first and third equations, x = 5/9 andz = 3/9 = 1/3.

9. Note that K must be a 2 × 2 matrix. Let

Then

or

or

2 8 4 4

4 6 2 3 2 3

2 4 2

a c b d b d

a c b d b d

a c b d b

+ + − −− + − + −

− − − + 22

8 6 6

6 1 1

4 0 0d

=−

−−

1 4

2 3

1 2

2

2

8 6 6

6−−

−−

=

−a b b

c d d−−

−

1 1

4 0 0

1 4

2 3

1 2

2 0 0

0 1 1−

−

−

a b

c d==

−−

−

8 6 6

6 1 1

4 0 0

Ka b

c d=

1 0 0 5

0 1 0 3

0 0 1 3


Thus2a + 8c = 8

b + 4d = 6

– 4a + 6c = 6

– 2b + 3d = –1

2a – 4c = –4

b – 2d = 0

Note that we have omitted the 3 equations obtained by equating elements of the lastcolumns of these matrices because the information so obtained would be just a repeat ofthat gained by equating elements of the second columns. The augmented matrix of theabove system is

The reduced row-echelon form of this matrix is

Thus a = 0, b = 2, c = 1, and d = 1.

1 0 0 0 0

0 1 0 0 2

0 0 1 0 1

0 0 0 1 1

0 0 0 0 0

0 0 0 0 0

2 0 8 0 8

0 1 0 4 6

4 0 6 0 6

0 2 0 3 1

2 0 4 0 4

0 1 0 2 0

−− −

− −−


10. If we substitute the given values for x, y, and z into the system of equations, we obtain

a – b – 6 = –3

–2 + b + 2c = –1

a – 3 – 2c = –3

or

a – b = 3

b + 2c = 1

a – 2c = 0

This system of equations in a, b, and c yields the matrix

which reduces to

Hence, a = 2, b = –1, and c = 1.

11. The matrix X in Part (a) must be 2 × 3 for the operations to make sense. The matrices inParts (b) and (c) must be 2 × 2.

(b) Let X = . Then

If we equate matrix entries, this gives us the equations

Xx y x x y

z w z z w

1 1 2

3 0 1

3 2

3 2

−

=

+ − ++ − +

x y

z w

1 0 0 2

0 1 0 1

0 0 1 1

−

1 1 0 3

0 1 2 1

1 0 2 0

−

−


x + 3y = –5 x + 3w = 6

– x = –1 – z = –3

2x + y = 0 2z + w = 7

Thus x = 1 and z = 3, so that the top two equations give y = –2 and w = 1. Sincethese values are consistent with the bottom two equations, we have that

11. (c) As above, let X = , so that the matrix equation becomes

This yields the system of equations

2x – 2y + z = 2

–4x + 3y + w = –2

–x + z – 2w = 5

–y – 4z + 2w = 4

with matrix

which reduces to

Hence, x = –113/37, y = –160/37, z = –20/37, and w = –46/37.

1 0 0 0 113 37

0 1 0 0 160 37

0 0 1 0 20 37

0 0 0 1 46 37

−−

−−

2 2 1 0 2

4 3 0 1 2

1 0 1 2 5

0 1 4 2 4

−− −− −

− −

3 3

2 2

2 4

2 4

x z y w

x z y w

x y x

z w z

+ +

− + − +

−+

+

+−

2 2

5 4

x y

z w

X =−

1 2

3 1


12. (a) By inspection

and

Thus,

14. (a) If A4 = 0, then

(I – A)(I + A + A2 + A3) = I + A + A2 + A3 – A – A2 – A3 – 0 = I

Therefore, (I – A)–1 = I + A + A2 + A3 by Theorem 1.6.3.

(b) If An+1 = 0, then

(I – A)(I + A + A2 + … + An) = I – An + 1 = I + 0 = I

Thus Theorem 1.6.3 again supplies the desired result.

15. Since the coordinates of the given points must satisfy the polynomial, we have

p(1) = 2 ⇒ a + b + c = 2

p(–1) = 6 ⇒ a – b + c = 6

p(2) = 3 ⇒ 4a + 2b + c = 3

=− −

−

1 7 11

14 10 26

1

2

3

x

x

x

z

z

1

2

4 1 1

3 5 1

1 1 1

3 1 4

2 2 3

=

−− −

−−

− −

x

x

x

1

2

3

z

z

y

y

y

1

2

1

2

3

4 1 1

3 5 1

=

−− −

y

y

y

1

2

3

1 1 1

3 1 4

2 2 3

=−

−− −

xx

x

x

1

2

3


The reduced row-echelon form of the augmented matrix of this system of equations is

Thus, a = 1, b = – 2, and c = 3.

16. If p(x) = ax2 + bx + c, then p′(x) = 2ax + b. Hence,

p(–1) = 0 ⇒ a – b + c = 0

p(2) = –9 ⇒ 4a + 2b + c = –9

p′(2) = 0 ⇒ 4a + b = 0

The augmented matrix of this system reduces to

so that a = 1, b = –4, and c = –5.

17. We must show that (I – Jn) (I –

n–11 J

n) = I or that (I –

n–11 J

n) (I – J

n) = I. (By virtue of

Theorem 1.6.3, we need only demonstrate one of these equalities.) We have

But Jn2 = nJ

n(think about actually squaring J

n), so that the right-hand side of the above

equation is just I, as desired.

18. If A3 + 4A2 – 2A + 7I = 0, then (A3 + 4A2 – 2A + 7I)T = 0 as well. Now repeated applicationsof Theorem 1.4.9 will finish the problem.

I J In

J In

IJ J In n n n−( ) −−

= −−

− +1

1

1

1

12

nnJ

In

nJ

nJ

n

n n

−

= −−

+−

1

1

1

1

2

2

1 0 0 1

0 1 0 4

0 0 1 5

−−

1 0 0 1

0 1 0 2

0 0 1 3

−


19. First suppose that AB–1 = B–1 A. Note that all matrices must be square and of the samesize. Therefore

(AB–1)B = (B–1 A)B

or

A = B–1 AB

so that

BA = B(B–1 AB) = (BB–1)(AB) = AB

It remains to show that if AB = BA then AB–1 = B–1 A. An argument similar to the one givenabove will serve, and we leave the details to you.

20. Suppose that A and A + B are both invertible. Then there is a matrix C such that (A + B)C = I. We must show that I + BA–1 is also invertible. Consider the matrix

(I + BA–1)(AC) = AC + (BA–1)(AC)

= AC + BIC

= AC + BC

= (A + B)C

= I

Thus AC is the inverse of I + BA–1.

Now suppose that A is invertible but A + B is not. We must show that (I + BA–1) is also not invertible. Suppose that (I + BA–1)D = I for some matrix D. Then

(A + B)A–1 D = AA–1 D + BA–1 D

= D + BA–1 D

= (I + BA–1)D

= I

This implies that A + B is invertible. Since this is contrary to our hypothesis, I + BA–1 is notinvertible.


21. (b) Let the ijth entry of A be aij. Then tr(A) = a11 + a22 + … + a

nn, so that

tr(kA) = ka11 + ka22 + … + kann

= k (a11 + a22 + … + ann

)

= ktr(A)

(d) Let the ijth entries of A and B be aij

and bij, respectively. Then

tr(AB) = a11b11 + a12b21 + … + a1nb

n1

+ a21b12 + a22b22 + … + a2nb

n2

+ …

+ an1b1n

+ an2b2n

+ … + ann

bnn

and

tr(BA) = b11a11 + b12a21 + … + b1na

n1

+ b21a12 + b22a22 + … + b2na

n2

+ …

+ bn1a1n

+ bn2a2n

+ … + bnn

ann

If we rewrite each of the terms bija

jiin the above expression as a

jib

ijand list the terms

in the order indicated by the arrows below,

tr(BA) = a11b11 + a21b12 + … + an1b1n

+ a12b21 + a22b22 + … + an2b2n

+ …

+ a1nb

n1 + a2nb

n2 + … + ann

bnn

then we have tr(AB) = tr(BA).

22. Suppose that A and B are square matrices such that AB – BA = I. Then

tr(AB – BA) = tr(AB) – tr(BA) = tr(I).

But tr(I) = n for any n × n identity matrix I and tr(AB) – tr(BA) = 0 by Part (d) of Problem 21.


24. (c) Let A = [aij(x)] be an m × n matrix and let B = [b

ij(x)] be an n × s matrix. Then the

ijth entry of AB is

ai1(x)b1j

(x) + … + ain

(x)bnj

(x)

Thus the ijth entry of d—dx

(AB) is

d—dx

(ai1(x)b1 j

(x) + … + ain

(x)bnj

(x))

= a′i1(x)b1j

(x) + ai1(x)b′1 j

(x) + … + a′in

(x)bnj

(x) + ain

(x)b′nj

(x)

= [a′i1(x)b1j

(x) + … + a′in

(x)bnj

(x)] + [ai1(x)b′1j(x) + … + a

in(x)b′nj(x)]

This is just the ijth entry of dA—dx

B + AdB—dx

.

25. Suppose that A is a square matrix whose entries are differentiable functions of x. Supposealso that A has an inverse, A–1. Then we shall show that A–1 also has entries which aredifferentiable functions of x and that

dA–1

——dx

= –A–1 dA—dx

A–1

Since we can find A–1 by the method used in Chapter 1, its entries are functions of x whichare obtained from the entries of A by using only addition together with multiplication anddivision by constants or entries of A. Since sums, products, and quotients of differentiablefunctions are differentiable wherever they are defined, the resulting entries in the inversewill be differentiable functions except, perhaps, for values of x where their denominatorsare zero. (Note that we never have to divide by a function which is identically zero.) Thatis, the entries of A–1 are differentiable wherever they are defined. But since we are assumingthat A–1 is defined, its entries must be differentiable. Moreover,

or

dA

dxA A

dA

dx

−−

+ =11

0

d

dxAA

d

dxI( ) ( )− = =1

0


Therefore

so that

26. Following the hint, we obtain the equation

x2 + x – 2 = A(x2 + 1) + (Bx + C)(3x – 1)

or

x2 + x – 2 = (A + 3B)x2 + (3C – B)x + (A – C)

If we equate coefficients of like powers of x, we obtain the system of equations

A + 3B = 1

– B + 3C = 1

A – C = –2

the solution to this system is

A = –7/5, B = 4/5, C = 3/5

27. (b) Let H be a Householder matrix, so that H = I – 2PPT where P is an n × 1 matrix. Thenusing Theorem 1.4.9,

HT = (I – 2PPT)T

= IT – (2PPT)T

= I – 2(PT)T PT

= I – 2 PPT

= H

––

– –dA

dxA

dA

dxA

11 1=

––A

dA

dx

dA

dxA

11=


and (using Theorem 1.4.1)

HT H = H2 (by the above result)

= (I – 2PPT)2

= I2 – 2PPT – 2PPT + (–2PPT )2

= I – 4PPT + 4PPT PPT

= I – 4PPT + 4PPT (because PT P = I)

= I

28. (a) We are asked to show that (C–1 + D–1)–1 = C(C + D)–1D. Since all the matrices in sightare invertible and the inverse of a matrix is unique, it will suffice to show that theinverses of each side are equal, or that (C(C + D)–1D)–1 = C–1 + D–1. We have

(C(C + D)–1 D)–1 = D–1(C + D)C–1

= D–1CC–1 + D–1 DC–1

= D–1 + C–1

= C–1 + D–1

(b) Since we don’t know whether or not C is invertible, we can’t use the technique of Part (a). Instead, we observe that

(*) (I + CD)C = C(I + DC)

since each side of (*) is just C + CDC. If we multiply both sides of (*) on the left by(I + CD)–1 and on the right by (I + DC)–1, we obtain the desired result.

(c) Start with D + DDT C–1 D, write it in two different ways, and proceed more or less asin Part (b).

29. (b) A bit of experimenting and an application of Part (a) indicates that

An

n

n

n

a

b

d c

=

0 0

0 0

0


where

d = an–1 + an–2 c + … + acn–2 + cn–1 = if a ≠ c

If a = c, then d = nan–1. We prove this by induction. Observe that the result holdswhen n = 1. Suppose that it holds when n = N. Then

Here

Thus the result holds when n = N + 1 and so must hold for all values of n.

a cda c

a c

a c

a a c a c c

a cN

NN N N N N N

++

−−

=− + −

−=

+ +

=

1 1aa c

a ca c

a a Na N a a

N N

N N N

+ +

−

−−

≠

+ ( ) = +( ) =

1 1

1 1

if

if cc

A AA A

a

b

d c

a

N N

N

N

N

+ = =

=1

0 0

0 0

0

NN

N

N cd N

b

a c

+

+

+ +

1

1

1

0 0

0 0

0

a c

a c

n n–

–


EXERCISE SET 2.1

1. (a) The number of inversions in (4,1,3,5,2) is 3 + 0 + 1 + 1 = 5.

(d) The number of inversions in (5,4,3,2,1) is 4 + 3 + 2 + 1 = 10.

2. (a) The permutation is odd because 5 is odd.

(d) The permutation is even because 10 is even.

3.

5.

7.

9.

−− = − − + − + + = −

2 1 4

3 5 7

1 6 2

20 7 72 20 84 6 65( ) ( )

a

aa a a a

−− −

= − − − − = − +3 5

3 23 2 3 5 5 212( )( ) ( )( )

−− −

= − − − − =5 6

7 25 2 7 6 52( )( ) ( )( )

3 5

2 412 10 22

−= − − =( )

103

11.

13. (a)

= λ2 + 2λ – 3 = (λ – 1)(λ + 3)

Hence,det(A) = 0 if and only if λ = 1 or λ = –3.

14. The following permutations are even:

(1,2,3,4) (1,3,4,2) (1,4,2,3)

(2,1,4,3) (2,3,1,4) (2,4,3,1)

(3,1,2,4) (3,2,4,1) (3,4,1,2)

(4,1,3,2) (4,2,1,3) (4,3,2,1)

The following permutations are odd:

(1,2,4,3) (1,3,2,4) (1,4,3,2)

(2,1,3,4) (2,3,4,1) (2,4,1,3)

(3,1,4,2) (3,2,1,4) (3,4,2,1)

(4,1,2,3) (4,2,3,1) (4,3,1,2)

15. If A is a 4 × 4 matrix, then

det(A) = ∑(–1)p a1i1a2i2

a3i3a4i4

where p = 1 if (i1, i2, i3, i4) is an odd permutation of 1,2,3,4 and p = 2 otherwise. Thereare 24 terms in this sum.

det ( ) ( )( )A =−

− += − + +

λλ

λ λ2 1

5 42 4 5

( ) ( )

3 0 0

2 1 5

1 9 4

12 0 0 0 135 0−−

= + + − + + == −123

104 Exercise Set 2.1

17. (a) The only nonzero product in the expansion of the determinant is

a15a24a33a42a51 = (–3)(–4)(–1)(2)(5) = –120

Since (5,4,3,2,1) is even, det(A) = –120.

(b) The only nonzero product in the expansion of the determinant is

a11a25a33a44a52 = (5)(–4)(3)(1)(–2) = 120

Since (1,5,3,4,2) is odd, det(A) = –120.

18. We have

and

Thus x2 – 2x = 3 + x – x2 or 2x2 – 3x – 3 = 0. By the quadratic formula, x = (3 ± 33)/4.

19. The value of the determinant is

sin2 θ – (–cos2 θ) = sin2 θ + cos2 θ = 1

The identity sin2 θ + cos2 θ = 1 holds for all values of θ.

20. We have

and

BAd e

f

a b

c

ad bd ce

cf=

=

+

0 0 0

ABa b

c

d e

f

ad ae bf

cf=

=

+

0 0 0

1 0 3

2 6

1 3 5

5 0 3 2 3 3

−−−

= − + + − − − +x

x

x x x( ) ( )( )( ) ( (−− + = −6 3 0 22)( ) ) x x

x

xx x x x

−−

= − − − = + −1

3 11 3 1 3 2( ) ( )

Exercise Set 2.1 105

Thus AB = BA if and only if ae + bf = bd + ce, which is just the condition that

21. Since the product of integers is always an integer, each elementary product is an integer.The result then follows from the fact that the sum of integers is always an integer.

22. The signed elementary products of an n × n matrix (n > 1) all of whose entries are 1 willall be either 1 or –1.There are n! such elementary products, half of which are 1 and theother half of which are –1. Thus the determinant of such a matrix will always be zero.

23. (a) Since each elementary product in the expansion of the determinant contains a factorfrom each row, each elementary product must contain a factor from the row of zeros.Thus, each signed elementary product is zero and det(A) = 0.

24. Suppose that A = [aij] denotes an n by n diagonal matrix. That is a

ij= 0 unless i = j. If any

elementary product a1j1a2j2

… anjn

contains a factor of the form aiji

with i ≠ ji, then that

elementary product is zero. But, the only elementary product where i = ji

for all i = 1, 2,…, n is a11a22

… ann

, that is, the product of elements from the diagonal of A. Since thecolumn indices for this elementary product are in natural order, the sign of this elementaryproduct is positive. Hence, the determinant of any diagonal matrix is the product of itsdiagonal elements.

25. Let U = [aij] be an n by n upper triangular matrix. That is, suppose that a

ij= 0 whenever

i > j. Now consider any elementary product a1j1a2j2

… anjn

. If k > jk

for any factor akjk

in thisproduct, then the product will be zero. But if k ≤ j

kfor all k = 1, 2, …, n, then k = j

kfor all

k because j1, j2, …, jn

is just a permutation of the integers 1, 2, …, n. Hence, a11a22… a

nn

is the only elementary product which is not guaranteed to be zero. Since the column indicesin this product are in natural order, the product appears with a plus sign. Thus, thedeterminant of U is the product of its diagonal elements. A similar argument works forlower triangular matrices.

b a e

e d fbd bf ae ce .

−−

= − − + = 0


EXERCISE SET 2.2

1. (b) We have

det( )A =−

−=

− −

− −

=

2 1 3

1 2 4

5 3 6

0 5 5

1 2 4

0 13 14

(( )( )

( )( )

− −− −

= − −−

1 5

1 2 4

0 1 1

0 13 14

1 5

1 2 4

0 1 1

0 0 1

== − − − = −

= − − =

( )( )( )

det( )

1 5 1 5

2 1 5

1 2 3

3 4 6

0 5

AT

−−− −

−

= −− −

−

1

1 2 3

0 10 3

1

1 2 3

0 5 1

0 0

( )

−−

= − − − = −

1

1 1 5 1 5( )( )( )( )

107

By Theorem 2.2.2.

Add 13 timesRow 2 to Row 3.

Factor –5 from Row 1and interchange Row 1 and Row 2.

Add 2 times Row 2to Row 1 and 3 timesRow 2 to Row 3.

Add –2 times Row 1 toRow 3, and interchangeRow 1 and Row 2.

By Theorem 2.2.2.

Add –2 times Row 2 toRow 1 and –5 timesRow 2 to Row 3.

2. (a) Since the matrix is upper triangular, its determinant equals the product of the diagonalelements. Hence, if we call the matrix A, then det(A) = –30.

(c) The first and third rows of the given matrix are proportional—in fact, they are equal—and hence, by Theorem 2.2.5, det(A) = 0.

3. (b) Since this matrix is just I4 with Row 2 and Row 3 interchanged, its determinant is –1.

5.

If we factor –5/3 from Row 3 and apply Theorem 2.2.2 we fInd that

det(A) = –3(–5/3)(1) = 5

7.

det( )A =−

− − =−3 6 9

2 7 2

0 1 5

3

1 2 3

0 3 4

0 1 5

det( )A = = −( )0 3 1

1 1 2

3 2 4

1

1 1 2

0 3 1

3 22 4

1

1 1 2

0 3 1

0 1 2

1 3

1 1 2

0 1 1 3

0 1

( )

( )( )

= −− −

= −− −22

3

1 1 2

0 1 1 3

0 0 5 3

= −−


InterchangeRow 1 andRow 2.


Factor 3from Row 2.

Add Row 2to Row 3.

Factor 3 fromRow 1 and Addtwice Row 1 to Row 2.

9.

det( )A = = −( )2 1 3 1

1 0 1 1

0 2 1 0

0 1 2 3

1

1 0 1 1

2 1 3 1

0 2 1 0

0 1 2 33

1

1 0 1 1

0 1 1 1

0 2 1 0

0 1 2 3

1

1 0 1 1

0 1

= −−

= −

( )

( )11 1

0 0 1 2

0 0 1 4

1

1 0 1 1

0 1 1 1

0 0 1 2

0

−−

= −−

−( )

00 0 6

1 1 6 1 6= − − =( )( )( )( )

(= 3))( )3

1 2 3

0 1 4 3

0 0 11 3

911

3

1

−

=

−−

= =

2 3

0 1 4 3

0 0 1

9 11 3 1 33( )( )


Factor 3 fromRow 2 andsubtract Row 2from Row 3.

Factor 11/3from Row 3.



Multiply Row 3 by–1/10. Then add –3times Row 3 to Row 1.


11.

12. Let

We are given that det(A) = –6

A

a b c

d e f

g h i

=

det( )A =− − −1 3 1 5 3

2 7 0 4 2

0 0 1 0 1

0 0 2 1 11

0 0 0 1 1

1 3 1 5 3

0 1 2 6 8

0 0 1 0 1

0 0 0 1 1

0 0 0

=−

−11 1

1 3 1 5 3

0 1 2 6 8

0 0 1 0 1

0 0 0 1 1

0 0 0 0 2

=−

−

Hencee A,det( ) ( )( )( ) .= − = −1 2 1 2


Add 2 times Row 1 toRow 2; add –2 timesRow 3 to Row 4.

Add –1 timesRow 4 to Row 5.

(a)

(b)

(c) Let

Then det(D) = det(A) = –6 because the matrix D may be obtained from A by replacingthe first row by the sum of the first and third rows.

D

a g b h c i

d e f

g h i

=+ + +

det( )

( )( )( )

C

a b c

d e f

g h i

a b c

d

= − − −

= −

3 3 3

4 4 4

3 1 4 ee f

g h i

A= − =12 72det( )

det( )

( )

B

d e f

g h i

a b c

d e f

a b c

g h i

a

= = −( )

= −

1

1 2bb c

d e f

g h i

A= = −det( ) 6



Interchange Row 1and Row 2.

Factor 3 from Row 1,–1 from Row 2, and 4 from Row 3.

12. (d) Let

If we factor –3 from Row 1, we obtain

But the above matrix can be obtained from A by adding –4 times Row 2 to Row 3.Hence det(E) = (–3)det(A) = 18.

13.

Since b2 – a2 = (b – a)(b + a), we add –(b + a) times Row 2 to Row 3 to obtain

= (b –a)[(c2 – a2) – (c – a)(b + a)]

= (b – a)(c –a)[(c + a) – (b + a)]

= (b – a)(c – a)(c – b)

det( )A b a c a

c a c a

= − −

−( ) − −

1 1 1

0

0 0 2 2 (( ) +( )b a

det( )A a b c

a b c

b a c a

b a c a

=

= − −

− −

1 1 1

1 1 1

0

0

2 2 2

2 2 2 22

det( ) ( )E

a b c

d e f

g d h e i f

= −− − −

3

4 4 4

E

a b c

d e f

g d h e i f

=− − −

− − −

3 3 3

4 4 4


Add –a times Row 1 toRow 2; add –a2 timesRow 1 to Row 3.

14. (a) Consider the elementary product p = a1j1a2j2

a3j3. If j1 = 1 or j1 = 2, then p = 0 because

a11 = a12 = 0. If j1 = 3, then the only choices for j2 are j2 = 1 or j2 = 2. But a21 = 0, sothat p = 0 unless j2 = 2. Finally, if j1 = 3 and j2 = 2, then the only choice for j3 is j3 =1. Therefore, a13a22a31 is the only elementary product which can be nonzero. Sincethe permutation (3,2,1) is odd, then

det(A) = –a13a22a31

(b) By an argument which is completely analogous to that given in Part (a), we can arguethat a14a23a32a41 is the only nonzero elementary product. Since (4,3,2,1) is an evenpermutation, then

det(A) = a14a23a32a41

15. In each case, d will denote the determinant on the left and, as usual, det(A) =

∑ ±a1j1a2j2

a3j3, where ∑ denotes the sum of all such elementary products.

(a) d = ∑ ± (ka1j1

)a2j2

a3j3 = k ∑ ±

a1j1

a2j2

a3j3 = k det(A)

(b) d = ∑ ± a2j1a1j2

a3j3= ∑ ± a1j2

a2j1a3j3

= (a11 + ka21)(a22)(a33) + (a12 + ka22)(a23)(a31)

+ (a13 + ka23)(a21)(a32) – (a13 + ka23)(a22)(a31)

– (a12 + ka22)(a21)(a33) – (a11 + ka21)(a23)(a32)

= a11 a22 a33 + a12 a23 a31 + a13 a21 a32 – a13 a22 a31

– a12 a22 a33 – a11 a23 a32

+ ka21 a22 a33 + ka22 a23 a31 + ka23 a21 a32

– ka23 a22 a31 – ka22 a21 a33 – ka21 a23 a32

=a a a

a a a

a a a

11 12 13

21 22 23

31 32 33

a ka a ka a ka

a a a

a a a

11 21 12 22 13 23

21 22 23

31 32 3

+ + +

33


16.

(4)

(5)

R3 → R3 – 3R2

= (–1)(–6 + 1) = 5

(6)

R3 → R3 – 2R2

= –(–10 + 27) = –17

(7)

R2 → R2 + 2R1

= = − =33 4

1 53 15 4 33( )

3 6 9

2 7 2

0 1 5

3

1 2 3

2 7 2

0 1 5

3

1 2 3

0 3 4

0 1 5

−− − =

−− − =

−

1 3 0

2 4 1

5 2 2

1 3 0

2 4 1

9 10 0

1 3

9 10

−−

−=

−−

−= −

−−

0 3 1

1 1 2

3 2 4

0 3 1

1 1 2

0 1 2

13 1

1 2=

− −= −( )

− −

3 6 9

0 0 2

2 1 5

23 6

2 12 3 12(

−−

−=

−= + )) = 30


17. (8)

R2 → R2 – 2R1

R3 → R3 + 2R1

R4 → R4 – 2R1

R3 → R3 + R1 = 39

(9)

R1 → R1 – 2R2 R3 → R3 + 3R1

= −−

= −( ) −( ) =( )1

1 1 1

2 1 0

4 5 0

1 12 1

4 56

2 1 3 1

1 0 1 1

0 2 1 0

0 1 2 3

0 1 1 1

1 0 1 1

0 3 1 0

0 1 2 3

1

1 1

=

−

= −( )−−1

2 1 0

1 2 3

=−−

−=

−− =

−=3

3 5 1

1 2 0

0 12 1

3

3 5 1

1 2 0

3 7 0

31 2

3 739

1 2 3 1

5 9 6 3

1 2 6 2

2 8 6 1

1 2 3 1

3 5 0 1

1 2 0 0

0

−−

− − −=

−−−112 0 1−


(10)

R1 → R1 – 2R2

R1 → R1 + 3R2

= –16 (2

3 + 13 ) = –16

(11)

R2 → R2 – R3 = –2

1 3 1 5 3

2 7 0 4 2

0 0 1 0 1

0 0 2 1 1

0 0 0 1 1

1 3 1 5 3

0 1 2 6 8

0

− − −=

−00 1 0 1

0 0 2 1 1

0 0 0 1 1

1 2 6 8

0 1 0 1

0 2 1 1

0 0 1 1

22 2

=

−

→ +R R R11

1

1 0 1

2 1 1

0 1 1

1

1 0 1

2 0 0

0 1 1

1 20 1

1 1= −( ) = −( ) = −( ) −( )

=

− −

−

=

−

=1

2

1 0 1

23

1

3

1

31

32

30

1

2

1 1 0

23

1

3

1

31

32

30

11

2

1

3

1 1

1

32

3

−

−

0 1 1 1

12

12

1 12

23

1

3

1

30

1

3

2

30 0

1 0 1 0

12

12

1 12

23

−

=

− −

11

3

1

30

1

3

2

30 0−


18. (a) Since A can be obtained from I3 by interchanging the first and third rows, it followsthat det(A) = –1.

19. Since the given matrix is upper triangular, its determinant is the product of the diagonalelements. That is, the determinant is x(x + 1)(2x – 1). This product is zero if and only ifx = 0, x = – 1, or x = 1/2.

20. (a) If x = 1orif x = –3, then two rows of the given matrix will be the same and thedeterminant will be zero.

(b) Since the determinant is 12 – 8x – 4x2 and since a quadratic polynomial has exactlytwo roots, it follows that the values of x which we found in Exercise 18(a) are theonly possibilities.


EXERCISE SET 2.3

1. (a) We have

and

2.

Thus det(A) = 10 and det(B) = –17. Now by direct computation,

det( ) ( )

/

/

det( )

A

B

= = =

=

2 1 0

3 4 0

0 0 2

2

1 1 2 0

0 5 2 0

0 0 2

10

11 1 3

7 1 2

5 0 1

14 1 0

3 1 0

5 0 2

17 0 0

3 1 0

5 0

−=

− −−

=−−

11

17= −

det( ) ( )( ) ( )( ) ( )22 4

6 82 8 4 6 40 2 102

A =−

= − − = − = −

det( )A =−

= − − = −1 2

3 44 6 10

119

If we add 9 times Column 2 to Column 1, we find that

4. (a)

Since the determinant of this matrix is not zero, the matrix is invertible.

(c) Since the first two rows of this matrix are proportional, its determinant is zero. Hence,it is not invertible.

5. (a) By Equation (1),

det(3A) = 33 det(A) = (27)(–7) = –189

(c) Again, by Equation (1), det(2A–1) = 23 det(A–1). By Theorem 2.3.5, we have

det(2A–1) = 8 det(A) = –8

7

1 0 1

9 1 4

8 9 1

1 0 0

9 1 13

8 9 7

1 0 0

9 1 0

8 9 124

−−

−= − = − = −124

det( ) ( )

(

AB =−

=−

=

0 1 8

40 1 17

10 0 2

10

0 1 8

4 1 17

1 0 2

10)) ( )

– det( )

0 1 8

0 1 9

1 0 2

10

1 0 2

0 1 8

0 0 17

170

−= −

= = A det( )B

AB =−

9 1 8

31 1 17

10 0 2


(d) Again, by Equation (1), det(2A) = 23 det(A) = –56. By Theorem 2.3.5, we have

det[(2A)–1] = 1 det(2A) = – 1

56

(e)

7. If we replace Row 1 by Row 1 plus Row 2, we obtain

because the first and third rows are proportional.

8. If we add –1 times Column 1 to Column 3 and –1 times Column 2 to Column 3 in thedeterminant on the left, we obtain the determinant on the right.

10. If we add –t times Row 1 to Row 2 in the determinant on the left, we get

a b t a b t a b t

t b t b t b

c

1 1 2 2 3 32

12

22

31 1 1

+ + +

− − −( ) ( ) ( )

11 2 3c c

b c c a b a

a b c

f

a b c b c a c b a

a b c

f

+ + +=

+ + + + + +=

1 1 1 1

0

a g d

b h e

c i f

a d g

b e h

c f i

a b c

d e f

g h i

= −

= − .

= 7


Interchange Columns2 and 3.

Take the transpose ofthe matrix.

or

Adding –t times Row 2 to Row 1 in the above determinant yields the desired result.

12. (a) We have

det(A) = (k – 3)(k – 2) – 4

= k2 – 5k + 2

From Theorem 2.3.3, A will be invertible if and only if det(A) ≠ 0. But det(A) = 0 if andonly if k = (5 ± 17)/2. Thus A is invertible unless k = (5 ± 17)/2.

13. By adding Row 1 to Row 2 and using the identity sin2 x + cos2 x = 1, we see that thedeterminant of the given matrix can be written as

But this is zero because two of its rows are identical. Therefore the matrix is not invertible.

14. (b) The system can be written as

or

λ1 0

0 1

2 3

4 31

2

–

x

x==

0

0

2 3

4 31

2

1

2

=

x

x

x

xλ

sin sin sin2 2 2

1 1 1

1 1 1

α β γ

( )11 1 2 2 3 3

1 2 3

1 2 3

−+ + +

t

a b t a b t a b t

b b b

c c c

2


This expression can be simplified—

15. We work with the system from Part (b).

(i) Here

so the characteristic equation is λ2 – 5λ – 6 = 0.

(ii) The eigenvalues are just the solutions to this equation, or λ = 6 and λ = –1.

(iii) If λ = 6, then the corresponding eigenvectors are the nonzero solutionsto the equation

The solution to this system is x1 = (3/4)t, x2 = t, so is an eigenvectorwhenever t ≠ 0.

If λ = –1, then the corresponding eigenvectors are the nonzero solutions

to the equation

If we let x1 = t, then x2 = –t, so is an eigenvector whenever t ≠ 0.

It is easy to check that these eigenvalues and their corresponding eigenvectors satisfythe original system of equations by substituting for x1, x2, and λ. The solution is valid for allvalues of t.

x =−

t

t

− −− −

=

3 3

4 4

0

01

2

x

x

x =

x

x

1

2

x = ( )

3 4 t

t

6 2 3

4 6 3

4 3

4 31

2

− −− −

=

−−

x

x

x

x

1

2

0

0

=

x =

x

x

1

2

det( ) ( )( )λλ

λλ λ λI A− =

−− −

= − − − = −

2 3

4 32 3 12 2 55 6λ −

λλ

−− −

=

2 3

4 3

0

01

2

x

x


16. Since A is invertible, det(A) ≠ 0. Thus

det(A–1 BA) = det(A–1) det(B) det(A)

= 1 det(A) det(B) det(A)

= det(B)

17. (a) We have, for instance,

The answer is clearly not unique.

18. We know that

A is invertible if and only if det(A) ≠ 0

if and only if det(AT) = 0 (since det(A) = det(AT)

if and only if det(AT) det(A) ≠ 0

if and only if det(AT A) ≠ 0

if and only if AT A is invertible

19. Let B be an n × n matrix and E be an n × n elementary matrix.

Case 2: Let E be obtained by interchanging two rows of In. Then det(E) = –1 and EA is

just A with (the same) two rows interchanged. By Theorem 2.2.3, det(EA) = –det(A) =det(E) det(A).

Case 3: Let E be obtained by adding a multiple of one row of In

to another. Then det(E)= 1 and det(EA) = det(A). Hence det(EA) = det(A) = det(E) det(A).

a b c d

a b c d

a b c d

a c

a1 1 1 1

2 2 2 2

1 1 1 1

2 2

1+ ++ +

=+ +

++ bb c d

b d

a c

a c

b d

a c

a c

b d

1 1 1

2 2

1 1

2 2

1 1

2 2

1 1

2 2

+

= + + +b d

b d

1 1

2 2


20. No, since det(AB) = det(A) det(B) = det(B) det(A) = det(BA). Hence, the 2 determinantsare always equal.

21. If either A or B is singular, then either det(A) or det(B) is zero. Hence, det(AB) = det(A)det(B) = 0. Thus AB is also singular.

22. (a) False. If A is an n by n matrix, then det(2A) = 2n det(A). Hence, the statement holdsonly in the trivial cases when n = 1 or when det(A) = 0.

(c) False. For instance, let .

(d) True. By Theorem 1.6.1, any system of linear equations has no solution, one solution,or infinitely many solutions. Since A0 = 0, there is at least one solution. Since det(A)= 0, it follows from Theorem 2.3.6 that there cannot be just one solution. Hence, theremust be infinitely many. Also see the argument used in Exercise 20(a) of Section 1.5.

23. (a) False. If det(A) = 0, then A cannot be expressed as the product of elementarymatrices. If it could, then it would be invertible as the product of invertible matrices.

(b) True. The reduced row echelon form of A is the product of A and elementary matrices,all of which are invertible. Thus for the reduced row echelon form to have a row ofzeros and hence zero determinant, we must also have det(A) = 0.

(c) False. Consider the 2 × 2 identity matrix. In general, reversing the order of thecolumns may change the sign of the determinant.

(d) True. Since det(AAT) = det(A) det(AT) = [det(A)]2, det(AAT)cannot be negative.

A =

1 0

0 0


EXERCISE SET 2.4

1. a) 5 b) 9 c) 6d) 10 e) 0 f) 2

2. a) odd b) odd c) evend) even e) even f) even

3. 22

4. 0

5. 52

6. –3 6)

7. a2 – 5a + 21

8. 0

9. –65

10. –4

11. –123

12. –c4 + c3 – 16c2 + 8c – 2

127

13. (a) λ = –3, λ = 1

(b) λ = 4, λ = 3, λ = –2

14. 1 2 3 4 even 2 1 3 4 odd1 2 4 3 odd 2 1 4 3 even1 3 4 2 even 2 3 1 4 even1 3 2 4 odd 2 3 4 1 odd1 4 2 3 even 2 4 1 3 even1 4 3 2 odd 2 4 1 2 odd

3 1 2 4 even 4 1 2 3 odd3 1 4 2 odd 4 1 3 2 even3 2 1 4 odd 4 2 1 3 even3 2 4 1 even 4 2 3 1 odd3 4 1 2 even 4 3 1 2 odd3 4 2 1 odd 4 3 2 1 even

15. (a) detA = a11a23a32a44 + a11a24a32a43 – a11a24a33a42

– a11a23a32a44 + a11a24a32a43 – a11a24a33a42

– a12a21a33a44 + a12a21a34a43 + a12a23a31a44

– a12a23a34a41 + a12a24a33a41 + a12a24a31a43

+ a13a21a32a44 + a13a21a34a42 + a13a22a32a41

+ a13a22a34a41 + a13a24a31a42 + a13a24a32a41

(b) There is no diagonal pattern

16. 275

17. (a) –120

(b) 120


18.

19. det = sin2 θ + cos2 θ = 1

20.

So AB = BA ae + bf = bd + ce.

But = bd – bf – ea + ee

So if = 0, then ae + bf = bd + ce, and conversely, and so AB = BA.

21. detA would also be an integer, since detA is formed by products and sums of its entries, andthe integers are closed under multiplication and addition.

22. detA = 0, since all products are either ±1, half being +1 (corresponding to an evenpermutation), and half being –1 corresponding to an odd permutation.

23. (a) Every product in detA must have an element of any given row.

(b) Likewise, every product in detA must have an element of any given column, or becausedetAT = detA.

24. detA = a11a22a33… a

nn—the product of the diagonal entries.

25. detA = a11a22… a

nn—the product of the diagonal entries, in both cases.

b a c

e d f

−−

b a c

e d f

−−

ABa b

c

d e

f

ad ae bf

cf=

=

+

0 0 0

=

=

+

BA

d e

f

a b

c

ad bd ce

cf0 0 0

x =±3 33

4



1.

3. The determinant of the coefficient matrix is

The system of equations has a nontrivial solution if and only if this determinant is zero;that is, if and only if α = β. (See Theorem 2.3.6.)

1 1 1 11 1

αβ

α β

αβ α

α ββ

α β1 1

1= 0 0 −

1= =–( – )a ––( – )( – )β βa a

′ =

−

−=

+

+=x

x

y x y

45

35

35

45

45

35

35

45

925

1625

3

5

4

5

3545

35

45

45

35

35

45

1

x y

y

x

y y x

+

′ =−

=−

= − +4

5

3

5x y

131

4. Recall that det(A) is the sum of 6 signed elementary products, 3 of which are positive and3 of which are negative. Since each of the factors in these products is either 0 or 1, then theonly values which the products can assume are 0, +1, or –1. If det(A) has 3 signedelementary products which are equal to +1, then the remaining 3 products must equal –1.(Why?) In this case, det(A) = 0. However, det(A) can have 2 signed elementary productswhich are equal to +1 and the remaining 4 products equal to 0. For instance, this is true of

Thus, the largest possible value for det(A) is 2.

5. (a) If the perpendicular from the vertex of angle α to side a meets side a between anglesβ and γ, then we have the following picture:

Thus cos β = 1st sfa1divisor2nd sfc and cos γ = 1st sfa2divisor2nd sfb andhence

a = a1 + a2 = c cos β + b cos γ

This is the first equation which you are asked to derive. If the perpendicular intersectsside a outside of the triangle, the argument must be modified slightly, but the sameresult holds. Since there is nothing sacred about starting at angle α, the sameargument starting at angles β and γ will yield the second and third equations.

Cramer’s Rule applied to this system of equations yields the following results:

a

a1 a2

c b

det

1 0 1

1 1 0

0 1 1

132 Supplementary Exercises 2

6. The system of equations

(1 – λ)x – 2y = 0

x – (1 + λ)y = 0

always has the trivial solution x = y = 0. There can only be a nontrivial solution in case thedeterminant of the coefficient matrix is zero. But this determinant is just

–(1 – λ)(1 + λ) + 2 = λ2 + 1

and λ2 + 1 can never equal zero for any real value of λ.

7. If A is invertible, then adj(A), or adj(A) = [det(A)]A–1. Thus

adj( )det( )

AA

AI= =

AA

–

det( )1 1

=

cos α (–= =

+ +

a c b

b a

c a

c b

c a

b a

a a b c

0

0

0

0

0

2 2 22

2 2

0

2

)

abc

b c a

bc

a b

c b a

b c a

abc

=+ +

=cos β ( )=

− +=

+ −b a b c

abc

a b c

ac

2 2 2 2 2 2

2 2

cos γ ( )= =

+ −=

0

0

2 2

2 2 2

c a

c b

b a c

abc

c a b c

abc

aa b c

ab

2 2 2

2

+ −

Supplementary Exercises 2 133

That is, adj(A) is invertible and

It remains only to prove that A = det(A)adj(A–1).This follows from Theorem 2.4.2 andTheorem 2.3.5 as shown:

8. From the proof of Theorem 2.4.2, we have

(*) A[adj(A)] = [det(A)]I

First, assume that A is invertible. Then det(A) ≠ 0. Notice that [det(A)]I is a diagonalmatrix, each of whose diagonal elements is det(A). Thus, if we take the determinant of thematrices on the right- and left-hand sides of (*), we obtain

det(A[adj(A)]) = det([det(A)]I)

or

det(A) • det(adj(A)) = [det(A)]n

and the result follows.

Now suppose that A is not invertible. Then det(A) = 0 and hence we must show thatdet(adj(A)) = 0. But if det(A) = 0, then, by (*), A[adj(A)] = 0. Certainly, if A = 0, thenadj(A) = 0; thus, det(adj(A)) = 0 and the result holds. On the other hand, if A ≠ 0, then Amust have a nonzero row x. Hence A[adj(A)] = 0 implies that xadj(A) = 0 or

[adj(A)]TxT = 0T

Since xT is a nontrivial solution to the above system of homogeneous equations, it followsthat the determinant of [adj(A)]T is zero. Hence, det(adj(A)) = 0 and the result is proved.

A AA

A A A= = =[ ]det( )

( ) det ( ) ( )– ––

– –1 11

1 11adj adj

[adj( )]det( )

–AA

A1 1=


9. We simply expand W. That is,

= ddx

(f1(x)g2(x) – f2(x)g

1(x))

= f′1(x)g2(x) + f1(x)g′2(x) – f′2(x)g1(x) – f2(x)g′1(x)

= [f′1(x)g2(x) – f′2(x)g1(x)] + [f1(x)g′2(x) – f2(x)g′1(x)]

10. (a) As suggested in the figure and the hints,

area ABC = area ADEC + area CEFB – area ADFB

= 12 (x3 – x1)(y1 + y3) + 12 (x2 – x3)(y3 + y2)

– 12 (x2 – x1)(y1 + y2)

= 12 [x3y1 – x1y1 + x3y3 – x1y3 + x2y3 – x3y3 + x2y2

– x3y2 – x2y1 + x1y1 – x2y2 + x1y2]

= 12 [x1y2 – x1y3 – x2y1 + x2y3 + x3y1 – x3y2]

= 12

x y

x y

x y

1 1

2 2

3 3

1

1

1

=′ ′

+′

f x f x

g x g x

f x f x

g x

1 2

1 2

1 2

1

( ) ( )

( ) ( )

( ) ( )

( ) ′′g x2( )

dW

dx

d

dx

f x f x

g x g x

( ) ( )

( ) ( )= 1 2

1 2


10. (b) The area of the triangle is

12 = 12[3(–1) + 2(3) + 4(3 + 1)] = 19

2

11. Let A be an n × n matrix for which the entries in each row add up tozeroand let x be then × 1 matrix each of whose entries is one. Then all of the entries in the n × 1 matrix Ax arezero since each of its entries is the sum of the entries of one of the rows of A. That is, thehomogeneous system of linear equations

has a nontrivial solution. Hence det(A) = 0. (See Theorem 2.3.6.)

12. To obtain B from A, we first interchange Row 1 and Row n. This changes the sign of det(A).If n = 2 or 3, we are done. If n > 3, then we interchange Rows 2 and n – 1, which againreverses the sign of the determinant. In general, we interchange Rows k + 1 and n – k untilall of the rows have been interchanged.

If n is even, then we will have made n/2 interchanges and the sign of det(A) will havechanged by a factor of (–1)n/2. On the other hand, if n is odd, then we will have made (n – 1)/2 interchanges and the sign of det(A) will have changed by a factor of (–1)(n–1)/2. Allof this can be summarized by the formula

Can you see why?

13. (a) If we interchange the ith and jth rows of A, then we claim that we must interchange theith and jth columns of A–1. To see this, let

A A=

−

Row 1

Row 2

Row n

Mand = Co1 ll.1, Col. 2, , Col. n

det ( ) ( ) det ( )( – )

B An n= − −1

1

2

Ax =

0

0

M

3 3 1

2 1 1

4 0 1

− −


where AA–1 = I. Thus, the sum of the products of corresponding entries from Row s inA and from Column r in A–1 must be 0 unless s = r, in which case it is 1. That is, ifRows i and j are interchanged in A, then Columns i and j must be interchanged in A–1

in order to insure that only 1’s will appear on the diagonal of the product AA–1.

(b) If we multiply the ith row of A by a nonzero scalar c, then we must divide the ith

column of A–1 by c. This will insure that the sum of the products of correspondingentries from the ith row of A and the ith column of A–1 will remain equal to 1.

(c) Suppose we add c times the ith row of A to the jth row of A. Call that matrix B. Nowsuppose that we add –c times the jth column of A–1 to the ith column of A–1. Call thatmatrix C. We claim that C = B – 1. To see that this is so, consider what happens when

Row j → Row j + c Row i [in A]

Column i → Column i – c Column j [in A–1]

The sum of the products of corresponding entries from the jth row of B and any kth

column of C will clearly be 0 unless k = i or k = j. If k = i, then the result will be c –c = 0. If k = j, then the result will be 1. The sum of the products of correspondingentries from any other row of B—say the rth row—and any column of C—say the kth

column—will be 1 if r = k and 0 otherwise. This follows because there have been nochanges unless k = i. In case k = i, the result is easily checked.

14. Let C be the matrix which is the same as A except for its ith row which is the sum of the ith

rows of B1 and B2, or twice the ith row of A. Then by Theorem 2.3.1,

det(B1) + det(B2) = det(C)

But det(C) = 2 det(A), so that

det(A) = 12 [det(B1) + det(B2)]

15. (a) We have

det( )λλ

λI A

a a a

a a a− =− − −

− − −−

11 12 13

21 22 23

aa a a31 32 33− −λ


If we calculate this determinant by any method, we find that

det(λI – A) = (λ – a11)(λ – a22)(λ – a33) – a23a32 (λ – a11)

–a13a31(λ – a22) – a12a21(λ – a33)

–a13a21a32 – a12a23a31

= λ3 + (–a11 – a22 – a33)λ2

+ (a11a22 + a11a33 + a22a33 – a12a21 – a13a31 – a23a32)λ

+ (a11a23a32 + a12a21a33 + a13a22a31

–a11a22a33 – a12a23a31 – a13a21a32)

15. (b) From Part (a) we see that b = –tr(A) and d = –det(A). (It is less obvious that c is thetrace of the matrix of minors of the entries of A; that is, the sum of the minors of thediagonal entries of A.)

16. Recall that sin(a + b) = sin a cos b + cos a sin b. If we multiply the 1st column of thedeterminant by cos δ and the 2nd column by sin δ and then add the results, we get the 3rd

column. Therefore the determinant is 0.

17. If we multiply Column 1 by 104, Column 2 by 103, Column 3 by 102, Column 4 by 10, andadd the results to Column 5, we obtain a new Column 5 whose entries are just the 5numbers listed in the problem. Since each is divisible by 19, so is the resulting determinant.

18. (b) Here we have

If we add λ times Row 2 to Row 1, we have

det( –

– –

–

– – –

λλ λ

λλ

I A =

0 1 1

1 1

1 5 3

2

+−− − −

= (λλ

λλ

– )1

0 1 1

1 1

1 5 3

A I A= −

− =0 1 1

1 0 1

1 5 3

so that λλ −− −

−− − −

1 1

1 1

1 5 3

λλ


(Factor λ – 1 from Row 1.) Now add –1 times Row 2 to Row 3 and interchange Rows1 and 2 to obtain the equation

To simplify, add Row 2 to Row 3 and then add (λ + 1)/4 times Row 3 to Row 2, getting

If we interchange Rows 2 and 3, the last determinant becomes lower triangular, sothat

det(λI – A) = –(λ – 1)(–1)(–1)(–4)

= (λ – 1)(λ2 – 2λ + 1) = (λ – 1)3

Thus, the characteristic equation is (λ – 1)3 = 0 and so λ = 1 is the only eigenvalue. Ifwe let λ = 1, then the eigenvectors are the nonzero solutions to the equation

The augmented matrix reduces to

If we put x3 = 2 t, then x2 = –t and x1 = t, so the eigenvectors are

x = [t –t 2t]T where t ≠ 0

1 1 1 0

0 11

20

0 0 0 0

− −

1 1 1

1 1 1

1 5 2

1

2

3

− −−− − −

x

x

x

=

0

0

0

2 1

4

2λ λ− +

det ( ) ( )λ λλ

λλ

λ

λ

I A− = − −−

+− −

= − −

−

1

1 1

0 1 1

0 4 3

1

1

( )

11

0 02 1

4

0 4 3

2λ λ

λ

− +

− −

det ( ) ( )λ λλ

λλ λ

I A− = − −−

+− − −

1

1 1

0 1 1

0 5 4


TECHNOLOGY EXERCISES 2.1

T3. Let y = ax3 + bx3 + cx + d be the polynomial of degree three to pass through the fourgiven points. Substitution of the x and y coordinates of these points into the equation of thepolynomial yields the system

7 = 27a + 9b + 3c + d

–1 = 8a + 4b + 2c + d

–1 = a + b + c + d

1 = 0a + 0b + 0c + d

Using Cramer’s Rule,

a =

−−

=

7 9 3 1

1 4 2 1

1 1 1 1

1 0 0 1

27 9 3 1

8 4 2 1

1 1 1 1

0 0 0 1

12

1221

27 7 3 1

8 1 2 1

1 1 1 1

0,= =

−−

b11 0 1

12

2

12

27 9 7 1

8 4 1 1

1 1 1 1

0 0

=−

= −

=

−−

c11 1

12

12

121

27 9 3 7

8 4 2 1

1 1

,=−

= − =

−

d

1 1

0 0 0 1

12

12

121

−

= =

141

Plot. y = x3 – 2x2 – x + 1

(1, −1) (2, −1)

(3, 7)

(0, 1)

142 Technology Exercises 2.1

EXERCISE SET 3.1

1.

2.

v1= (3, 6)

x

y

x

y

v3 = ( 4, 3)

(a) (c)

x

(3, 4, 5)

y

z

x

y

z

(3, 4, 5)

x

( 3, 4, 5)

y

z

x

z

y

(3, 0, 3)

(a) (c)

(e) (j)

143

3. (a) → = (3 – 4, 7 – 8) = (–1, –1)

(e)→→ = (–2 – 3, 5 + 7, –4 –2) = (–5, 12, –6)

5. (a) Let P = (x,y,z) be the initial point of the desired vector and assume that this vector

has the same length as v. Since has the same direction as v = (4, –2, –1), we have

the equation

= (3 – x, 0 – y, –5 – z) = (4, –2, –1)

If we equate components in the above equation, we obtain

x = –1, y = 2, and z = –4

Thus, we have found a vector which satisfies the given conditions. Any positive

multiple k will also work provided the terminal point remains fixed at Q. Thus, P

could be any point (3 – 4k, 2k, k – 5) where k > 0.

(b) Let P = (x, y, z) be the initial point of the desired vector and assume that this vector

has the same length as v. Since →→ is oppositely directed to v = (4, –2, –1), we have

the equation

→→ = (3 – x, 0 – y, –5 –z) = (–4, 2, 1)

If we equate components in the above equation, we obtain

x = 7, y = –2, and z = –6

Thus, we have found a vector →→ which satisfies the given conditions. Any positive

multiple k →→ will also work, provided the terminal point remains fixed at Q. Thus, P

could be any point (3 + 4k, –2k, –k – 5) where k > 0.

PQ PQ

PQ

PQ

PQ

PQ

PQ

PQ

P P1 2

P P1 2

v1= (3, 4, 5)

x

z

y

v9= (0, 0, 3)x

z

y

(i)(g)


6. (a) v – w = (4 – 6, 0 – (–1), –8 – (–4)) = (–2, 1, –4)

(c) –v + u = (–4 – 3, 0 + 1, 8 + 2) = (–7, 1, 10)

(e) –3(v – 8w) = –3(4 – 48, 0 + 8, –8 + 32) = (132, –24, –72)

7. Let x = (x1, x2, x3). Then

2u – v + x = (–6, 2, 4) – (4, 0, –8) + (x1,x2, x3)

= (–10 + x1, 2 + x2, 12 + x3)

On the other hand,

7x + w = 7(x1, x2, x3) + (6, –1, –4)

= (7x1 + 6, 7x2 – 1, 7 x3 – 4)

If we equate the components of these two vectors, we obtain

7x1 + 6 = x1 – 10

7x2 – 1 = x2 + 2

7x3 – 4 = x3 + 12

Hence, x = (–8/3, 1/2, 8/3).

9. Suppose there are scalars c1, c2, and c3 which satisfy the given equation. If we equatecomponents on both sides, we obtain the following system of equations:

–2c1 – 3c2 + c3 = 0

9c1 + 2c2 + 7c3 = 5

6c1 + c2 + 5c3 = 4

The augmented matrix of this system of equations can be reduced to

The third row of the above matrix implies that 0c1 + 0c2 + 0c3 = –1. Clearly, there do notexist scalars c1, c2, and c3 which satisfy the above equation, and hence the system isinconsistent.

2 3 1 0

0 2 2 1

0 0 0 1

−− −

−


10. If we equate components on both sides of the given equation, we obtain

c1 + 2c2 = 0

2c1 + c2 + c3 = 0

c2 + c3 = 0

The augmented matrix of this system of equations can be reduced to

Thus the only solution is c1 = c2 = c3 = 0.

11. We work in the plane determined by the three points O = (0, 0, 0), P = (2, 3, –2), and Q =

(7, –4, 1). Let X be a point on the line through P and Q and let t (where t is a positive,

real number) be the vector with initial point P and terminal point X. Note that the length

of t → is t times the length of . Referring to the figure below, we see that

and

Q

O

PX

tPQ

OP PQ OQ

+ =

OP tPQ OX

+ =

PQ

PQ

PQ

1 0 0 0

0 1 0 0

0 0 1 0


Therefore,

(a) To obtain the midpoint of the line segment connecting P and Q, we set t =1/2. Thisgives

(b) Now set t = 3/4.This gives

12. The relationship between the xy-coordinate system and the x′y′-coordinate system is givenby

x′ = x –2, y′ = y –(–3) = y + 3

(a) x′ = 7 – 2 = 5and y′ = 5+ 3 = 8

(b) x = x′ + 2 = –1 and y = y′ –3 = 3

OX = − + − = −

1

42 3 2

3

47 4 1

23

4

9

4

1

4( , , ) ( , , ) , ,

OX OP OX= +

= − + −

1

2

1

21

22 3 2

1

27 4( , , ) ( , ,, )

, ,

1

9

2

1

2

1

2= − −

OX OP t OQ OP

t OP

= +

= +

( – )

( – )1 ttOQ


(c)

(d) (1, 10)

(e) v1′ = v1 – 2, v2′ = v + 3

13. Q = (7, –3, –19)

14. Let (x0, y0, z0) denote the origin in the x′ y′ z′-coordinate system with respect to the xyz-coordinate system. Suppose that P1 and P2 are the initial and terminal points, respectively,for a vector v. Let (x1, y1, z1) and (x2, y2, z2) be the coordinates of P1 and P2 in the xyz-coordinate system, and let (x′1, y′1, z′1) and (z′2, y′2, z′2) be the coordinates of P1 and P2 in thex′y′z′-coordinate system. Further, let (v1, v2, v3) and (v′1, v′2, v′3) denote the coordinates ofv with respect to the xyz- and x′y′z′-coordinate systems, respectively. Then

v1 = x2 – x1, v2 = y2 – y1, v3 = z2 – z1

and

v′1 = x′2 – x′1, v′2 = y′2 – y′1, v′3 = z′2 – z′1

However,

x′1 = x1 – x0, y′1 = y1 – y0, z′1 = z1 – z0

and

x′2 = x2 – x0, y′2 = y2 – y0, z′z′2 = z2 – z0

Q

y y'

x'

x

P85

6

7

5–3


If we substitute the above expressions for x′1 and x′2 into the equation for v′1, we obtain

v′1 = (x2 – x0) – (x1 – x0 ) = x2 – x1 = v1

In a similar way, we can show that v′2 = v2 and v′3 = v3.

16. Let v = (v1, v2) and u = kv = (u1, u2). Let P and Q be the points (v1, v2) and (u1, u2),respectively. Since the triangles OPR and OQS are similar (see the diagram),

we have

Thus, u1 = kv1. Similarly, u2 = kv2.

17. The vector u has terminal point Q which is the midpoint of the line segment connecting P1and P2.

−OP1

OP2 − OP1

(OP2 − OP1)

O

Q P2P1

OP1OP2

12

u OS

OR

1

1υ= =

length of

length of

lengthh of

length of

OQ

OPk=

O R S

Q(u1, u2)

P(v1, v2)

kv

y

x

v


18. We have x + y + z + w = 0 where

19. Geometrically, given 4 nonzero vectors attach the “tail” of one to the “head” of another andcontinue until all 4 have been strung together.The vector from the “tail” of the first vectorto the “head” of the last one will be their sum.

x

yz

w

x + y + z + w

x

yz

w


EXERCISE SET 3.2

1. (a) v = (42 + (–3)2)1/2 = 5

(c) v = [(–5)2 + 02]1/2 = 5

(e) v = [(–7)2 + 22 + (–1)2]1/2 = 54

2. (a) d = [(3 – 5)2 + (4 – 7)2]1/2 = 13

(c) d = [7 – (–7)2 + (–5) – (–2)2 + 1 – (–1)2]1/2 = 209

3. (a) Since u + v = (3, –5, 7), then

u + v = [32 + (–5)2 + 72]1/2 = 83

(c) Since

– 2u = [(–4)2 + 42 + (–6)2]1/2 = 2 17

and

2u = 2 [22 + (–2)2 + 32]1/2 = 2 17

then

2u + 2u = 4 17

(e) Since w = [32 + 62 + (–4)2]1/2 = 61, then

, ,–1

ww=

3

61

6

61

4

61

151

4.

1 = v – w ≤ 5

5. (a) k = 1, l = 3

(b) no possible solution

6. l = 4

7. Since kv = (–k, 2k, 5k), then

kv = [k2 + 4k2 + 25k2]1/2 = |k| 30

If kv = 4, it follows that |k| 30 = 4 or k = ±4/ 30.

9. (b) From Part (a), we know that the norm of v/v is 1. But if v = (3, 4), then v = 5.Hence u = v/v = (3/5, 4/5) has norm 1 and has the same direction as v.

10. (b) By the result of Part (a), we have 4u = 4(u cos 30°, u sin 30°) = 4(3( 3/2),

3(1/2)) = (6 3, 6) and 5 v = 5(v cos(135°), v sin(135°)) = 5(2(–1/ 2), 2(1/

2)) = (–10/ 2, 10/ 2) = (–5 2, 5 2). Thus 4u – 5v = (6 3 + 5 2, 6 – 5 2).

11. Note that p – p0 = 1 if and only if p – p02 = 1. Thus

(x – x0)2 + (y – y0)

2 + (z – z0)2 = 1

The points (x, y, z) which satisfy these equations are just the points on the sphere of radius1 with center (x0, y0, z0); that is, they are all the points whose distance from (x0, y0, z0) is1.

22

33

0

0

w – v2 3w

w wv

v v


12. First, suppose that u and v are neither similarly nor oppositely directed and that neither isthe zero vector.

If we place the initial point of v at the terminal point of u, then the vectors u, v, and u+ v form a triangle, as shown in (i) below.

Since the length of one side of a triangle—say u + v—cannot exceed the sum of thelengths of the other two sides, then u + v < u + v.

Now suppose that u and v have the same direction. From diagram (ii), we see that u+ v = u + v. If u and v have opposite directions, then (again, see diagram (ii)) u + v< u + v.

Finally, if either vector is zero, then u + v = u + v.

13. These proofs are for vectors in 3-space. To obtain proofs in 2-space, just delete the 3rdcomponent. Let u = (u1, u2, u3) and v = (v1, v2, v3). Then

(a) u + v = (u1 + v1, u2 + v2, u3 + v<I>3)

= (v1 + u1, v2 + u2, v3 + u3) = v + u

(c) u + 0 = (u1 + 0, u2 + 0, u3 + 0)

= (0+ u1, 0 + u2, 0 + u3)

= (u21, u

2, u

3) = 0 + u = u

(e) k(lu) = k(lu1, lu2, lu3) = (klu1, klu2, klu3) = (kl)u

14. Again, we work in 3-space. Let u = (u1, u2, u3).

(d) u + (–u) = (u1 + (–u1), u2 + (–u2), u3 + (–u3)) = (0, 0, 0) = 0

(g) (k + l)u = ((k + l)u1, (k + l)u2, (k + l)u3)

= (ku1 + lu1, ku2 + lu2, ku3 + lu3)

= (ku1, ku2, ku3) + (lu1, lu2, lu3)

= ku + lu

(h) 1u = (1u1, 1u2, 1u3) = (u1, u2, u3) = u

u

uu

u

u + vu + v

u + v

u + v

v

v

vv

(i) (ii)


15. See Exercise 9. Equality occurs only when u and v have the same direction or when one isthe zero vector.

16. (a) For p = (a, b, c) to be equidistant from the origin and the xz-plane, we must have

a2 + b2 + c2 = |b|. Thus, a2 + b2 + c2 = b2 or a2 + c2 = 0, so that a = c = 0. That is, p

must lie on the y-axis.

(b) For p = (a, b, c)to be farther from the origin than the xz-plane, we must have a2 +

b2 + c2 > |b|. Thus a2 + b2 + c2 > b2 or a2 + c2 > 0, so that a and c are not both zero.

That is, p cannot lie on the y-axis.

17. (a) If x < 1, then the point x lies inside the circle or sphere of radius one with center atthe origin.

(b) Such points x must satisfy the inequality x – x0 > 1.

18. The two triangles pictured are similar since u and ku and v and kv are parallel and theirlengths are proportional; i.e., ku = ku and kv = kv. Therefore ku + kv = ku + v.That is, the corresponding sides of the two triangles are all proportional, where k is theconstant of proportionality. Thus, the vectors k(u + v) and ku + kv have the same length.Moreover, u + v and ku + kv have the same direction because corresponding sides of thetwo triangles are parallel. Hence, k(u + v) and ku + kv have the same direction. Therefore,k(u + v) = ku + kv.


EXERCISE SET 3.3

1. (a) u • v = (2)(5) + (3)(–7) = –11

(c) u • v = (1)(3) + (–5)(3) + (4)(3) = 0

2. (a) We have u = [22 + 32]1/2 = 13 and v = [52 + (–7)2]1/2 = 74. From Problem

1(a), we know that u • v = –11. Hence,

(c) From Problem 1(c), we know that u • v = 0. Since neither u nor v is the zero vector,this implies that cos θ = 0.

3. (a) u • v = (6)(2)+ (1)(0)+ (4)(–3) =0. Thus the vectors are orthogonal.

(b) u • v = –1 < 0. Thus θ is obtuse.

4. (a) Since u • a = 0 and a ≠ 0, then w1 = (0, 0).

(c) Since u • a = (3)(1) + (1)(0) + (–7)(5) = –32 and a 2 = 12 + 02 + 52 = 26, we have

5. (a) From Problem 4(a), we have

w2 = u – w1 = u = (6, 2)

w132

261 0 5

16

130

80

13= − = − −

( , , ) , ,

cos–

.θ = ≈11

13 74110 8

155

5. (c) From Problem 4(c), we have

w2 = (3, 1, –7) – (–16/13, 0, –80/13) = (55/13, 1, –11/13)

6. (a) Since |u • a| = |–4 + 6| = |2| = 2 and a = 5, it follows from Formula (10) that projau

= 2/5.

(d) Since |u • a| = |3 –4 –42| = |–43 and a = 54, Formula (10) implies that projau =

43/ 54.

8. (c) From Part (a) it follows that the vectors (–4, –3) and (4, 3) are orthogonal to (–3, 4).Divide each component by the norm, 5, to obtain the desired unit vectors.

10. We look for vectors (x, y, z) such that (x, y, z) • (5, –2, 3) = 0. That is, such that 5x – 2y

+ 3z = 0. One such vector is (0, 3, 2). Another is (1, 1, –1). To find others, assign arbitraryvalues, not both zero, to any two of the variables x, y, and z and solve for the remainingvariable.

12. Since the three points are not collinear (Why?), they do form a triangle. Since

• = (1, 3, –2) • (4, –2, 1) = 0

the right angle is at B.

13. Let w = (x, y, z) be orthogonal to both u and v. Then u • w = 0 implies that x + z = 0 andv • w = 0 implies that y + z = 0. That is w = (x, x, –x). To transform into a unit vector, wedivide each component by w = 3x2. Thus either (1/ 3, 1/ 3, –1/ 3) or (–1/ 3,–1/ 3, 1/ 3) will work.

16. (a) We look for a value of k which will make p a nonzero multiple of q. If p = cq, then (2,k) = c(3, 5), so that k/5 = 2/3 or k = 10/3.

(c) We want k such that

p • q = p q cos π3

BC

AB


or

If we square both sides and use the quadratic formula, we find that

The minus sign in the above equation is extraneous because it yields an angle of 2π–/3.

17. (b) Here

17. From the definition of the norm, we have

u + v2 = (u + v) • (u + v)

Using Theorem 3.3.2, we obtain

u + v2 = (u • u) + (u • v) + (v • u)+ (v • v)

or

(*) u + v2 = u2 + 2(u • v) + v2

Similarly,

(**) u – v2 = u2 – 2(u • v) + v2

If we add Equations (*)and (**), we obtain the desired result.

19. If we subtract Equation (**) from Equation (*) in the solution to Problem 16, we obtain

u + v2 – u – v2 = 4(u • v)

If we then divide both sides by 4, we obtain the desired result.

D =+ − −

+=

4 2 1 5 2

4 1

1

172 2

( ) ( )

( ) ( )

k =− ±60 34 3

33

6 5 4 3 51

22 2 2+ = + +

k k


20. Let u1, u2, and u3 be three sides of thecube as shown. The diagonal of the faceof the cube determined by u1 and u2 is b= u1 + u2; the diagonal of the cube itselfis a = u1 + u2 + u3. The angle θ betweena and b is given by

Because u1, u2, and u3 are mutually orthogonal, we have ui

• uj= 0 whenever i ≠ j. Also u

i

• ui

= ui2 and u1 = u2 = u3. Thus

That is, θ =arccos(2/ 6).

21. (a) Let i = (1, 0, 0), j = (0, 1, 0), and k = (0, 0, 1) denote the unit vectors along the x, y,and z axes, respectively.If v is the arbitrary vector (a, b, c), then we can write v = ai

+ bj + ck. Hence, the angle α between v and i is given by

since i = 1 and i • j = i • k = 0.

23. By the results of Exercise 21, we have that if vi

= (ai, b

i, c

i) for i = 1 and 2, then cos α

i=

ai/v

i, cos β

i= b

i/v

i, and cos γ

i= c

i/v

i. Now

cos α =⋅

=+ +

=vv ii

vv ii vv

a

a b c

a

2 2 2

cosθ =+

+ + +

=

uu uu

uu uu uu uu uu

12

22

12

22

32

12

22

2 uuuu

uu uu

22

12

12

3 2

2

6=

cos

( ) ( )

( ) (

θ =⋅

=+ + ⋅ +

+ + ⋅

a b

a b

u u u u u

u u u u1 2 3 1 2

1 2 3 11 2 3 1 2 1 2+ + + ⋅ +u u u u u u) ( ) ( )


24. (a) area = 19 units

(b) β(–3, 12) area = 2 × 19 = 38 units

25. Note that

v • (k1w1 + k2w2) = k1(v • w1) + k2(v • w2) = 0

because, by hypothesis, v • w1 = v • w2 = 0. Therefore v is orthogonal to k1w1 + k2w2 forany scalars k1 and k2.

26. Note that w is a multiple of u plus a multiple of v and thus lies in the plane determined byu and v. Let α be the angle between u and w and let β be the angle between v and w. Then

Similarly, we can show that

Since both α and β lie between 0 and π, then cos α = cos β implies that α = β.

27. (a) The inner product x • y is defined only if both x and y are vectors, but here v • w isa scalar.

(b) We can add two vectors or two scalars, but not one of each.

cosβ = =+v w

v wu v

w⋅ ⋅ kl

cos α =⋅

=⋅ + ⋅

=⋅

uu

uu vv

uu vv uu uu

uu

ww

ww

k l

k

k vv

uu vv

+

=⋅ +

lk

k

lk

2

ww

ww

v v v v1 2 1 2and are orthogonal ⇔ =⇔ +

⋅ 0

1 2 1a a b bb c c

a a b b c c

2 1 2

1 2

1 2

1 2

1 2

1 2

1 2

0

0

+ =

⇔ + + =

⇔

v v v v v v

cosαα α β β γ γ1 2 1 2 1 2 0cos + + =cos cos cos cos


27. (c) The norm of x is defined only for x a vector, but u • v is a scalar.

(d) Again, the dot product of a scalar and a vector is undefined.

28. Assume that neither a nor u is the zero vector.If projau = proj

ua, then

Thus, either u • a = 0 and the two vectors are orthogonal, or

In this case, a is clearly a scalar multiple of u. If we let a = ku, then

Hence, k = 1, so that a = u.

29. If, for instance, u = (1, 0, 0), v = (0, 1, 0) and w = (0, 0, 1), we have u • v = u • w = 0, butv ≠ w.

27. Suppose that r = c1u + c2v + c3w. Then

u • r = c1u • u + c2u • v + c3u • w = c1u2

since u • v = u • w = 0. Similarly, v • r = c2v–2 and w • r = c3w2. Since we know u • r,v • r, and w • r, this allows us to solve for c1, c2, and c3. We find that

31. This is just the Pythagorean Theorem.

u

u + vv

rruu rr

uuuu

vv rr

vvvv

ww rr

wwww= + +

⋅ ⋅ ⋅2 2 2

aa

aa

uu

uu

uu

uu2 2 2 2

= =k

k k

aa

aa

uu

uu2 2

=

uu aa

aaaa

aa uu

uuuu

uu aa

uuuu

⋅ ⋅ ⋅2 2 2

= =


EXERCISE SET 3.4

1. (a)

(c) Since

we have

(e) Since

v – 2w = (0, 2, –3) – (4, 12, 14) = (–4, –10, –17)

we have

2. (a) By Theorem 3.4.1, u × v will be orthogonal to both u and v where

so (1, 2, –1), for instance, is orthogonal to both u and v.

3. (a) Since u × v = (–7, –1, 3), the area of the parallelogram is u × v = 59.

u v× = −− −

=, , ( , ,

4 2

1 5

6 2

3 5

6 4

3 118 36 )−18

u (v – 2w )× =−

− −−

−− − −

, ,2 1

10 17

3 1

4 17

3 2

4 −−

= − −

1044 55 22( , , )

(u v)× × = −− −

=, , (w

9 6

6 7

4 6

2 7

4 9

2 6277 40 42, , )−

u v× =−−

−−−

= −, , ( , ,

2 1

2 3

3 1

0 3

3 2

0 24 9 )6

v w× =−

−−

= −, , ( , ,

2 3

6 7

0 3

2 7

0 2

2 632 6 )−4

161

3. (c) Since u and v are proportional, they lie on the same line and hence the area of theparallelogram they determine is zero, which is, of course, u × v.

4. (a) We have u = →→ = (–1, –5, 2) and v = →→ = (2, 0, 3). Thus

so that

u × v = [(–15)2 + 72 + 102]1/2 = 374

Thus the area of the triangle is 374/2.

6. For Part (a), we have

and

Hence

u × v = –(v × u)

For Part (b), we have v + w = (7, 2, –3). Hence

On the other hand, from Part (a)we have

u × v = (2, 22, 6)

Also

u w× =−

−−

−−

= −, , ( ,

1 2

2 1

5 2

1 1

5 1

1 23 77 11, )

u (v w)× + =−

−−

−−

=, , (

1 2

2 3

5 2

7 3

5 1

7 2−−1 29 17, , )

v u× =−

−−

−−

= − −, , ( ,

0 2

1 2

6 2

5 2

6 0

5 12 222 6, )−

u v× =−

−−

−−

=, , ( , ,

1 2

0 2

5 2

6 2

5 1

6 02 22 )6

u v× =−

−− − −

= −, , ( ,

5 2

0 3

1 2

2 3

1 5

2 015 7,, )10

PR

PQ


Thus

(u × v) + (u × w) = (2, 22, 6) + (–3, 7, 11)

= (–1, 29, 17)

and we have that

u × (v + w) = (u × v) + (u × w)

The proof of Part (c)is similar to the proof of Part (b).

For Part (d), we already have that u×v = (2, –22, 6). Hence k(u×v) = (–10, 110, –30).But ku = (–25, 5, –10) and kv = (–30, 0, 10). Thus

and

Thus

k(u × v) = (ku) × v = u × (kv)

For Part (e), we have

Similarly 0 × u = 0.

Finally, for Part (f), we have

u u× =−−

−−−

=, , ( , ,

1 2

1 2

5 2

5 2

5 1

5 10 0 )0 0=

u 0× =−

−−

=, , ( , ,

1 2

0 0

5 2

0 0

5 1

0 00 0 )0 0=

u ( v)× =−

−−

k , ,1 2

0 10

5 2

30 10

(– , – , – )

5 1

30 0

10 110 30

−−

=

( , ,ku) v× =−−

−− −

−−

5 10

0 2

25 10

6 2

25 5

6 0

= (– , – , – )10 110 30


7. Choose any nonzero vector w which is not parallel to u. For instance, let w = (1, 0, 0) or(0, 1, 0). Then v = u × w will be orthogonal to u. Note that if u and w were parallel, thenv = u × w would be the zero vector.

Alternatively, let w = (x, y, z). Then w orthogonal to u implies 2x – 3y + 5z = 0. Nowassign nonzero values to any two of the variables x, y, and z and solve for the remainingvariable.

8. (a) We have

9. (e) Since (u × w) • v = v • (u × w) is a determinant whose rows are the components ofv, u, and w, respectively, we interchange Rows 1 and 2 to obtain the determinantwhich represents u • (v × w). Since the value of this determinant is 3, we have (u ×w) • v = –3.

10. (a) Call this volume V. Then, since

we have v = 16.

11. (a) Since the determinant

the vectors do not lie in the same plane.

12. For a vector to be parallel to the yz-plane, it must be perpendicular to the vector (1, 0, 0),

so we are looking for a vector which is perpendicular to both (1, 0, 0) and (3, –1, 2). Such

a vector is (1, 0, 0) × (3, –1, 2) = (0, –2, –1). Since we want a unit vector, we divide through

by the norm to obtain the vector (0, –2/ 5, –1/ 5). Obviously any vector parallel to this

one, such as (0, 2/ 5, 1/ 5), will also work.

− −−

−= ≠

1 2 1

3 0 2

5 4 0

16 0

2 6 2

0 4 2

2 2 4

24 2

2 42

6 2

4 216

−−−

=−−

+−

−= −

u v w⋅ × =−

−−

= − + −( ) ( ) (

1 2 4

3 4 2

1 2 5

20 4 2 15 −− + + = −2 4 6 4 10) ( )


15. By Theorem 3.4.2, we have

(u + v) –(u – v) = u × (u – v) + v × (u – v)

= (u × u) + (u × (–v)) + (v × u) + (v × (–v))

= 0 – (u × v) – (u × v) – (v – v)

= –2(u × v)

17. (a) The area of the triangle with sides →→ and →→ is the same as the area of the trianglewith sides (–1, 2, 2) and (1, 1, –1) where we have “moved” A to the origin andtranslated B and C accordingly. This area is 12(–1, 2, 2) × (1, 1, –1) = 12(–4, 1, –3) =

26/2.

18. If the vector u and the given line make an angle θ, then the distance D between the pointand the line is given by

D = u sin θ (Why?)

= u × v/v (Why?)

19. (a) Let u = →→ = (–4,, 2) and v = →→ = (–3, 2, –4). Then the distance we want is

(–4, 0, 2) × (–3, 2, –4)/(–3, 2, –4) = (–4, –22, –8)/ 29. = 2 141/ 29

20. We have that u • v = u v cos θ and u × v = u v sin θ. Thus if u • v ≠ then u, v,and cos θ are all nonzero, and

21. (b) One vector n which is perpendicular to the plane containing v and w is given by

n = w × v = (1, 3, 3) × (1, 1, 2) = (3, 1, –2)

Therefore the angle φ between u and n is given by

φ =⋅

=

≈

− −cos cos

.

1 1 9

14

0 8726

uu nn

uu nn

rradians or( . )49 99

tansin

cosθ θ

θ= =

×=

×uu vv

uu vvuu vvuu vv

uu vv

uu vv

⋅⋅

AB

AP

AC

AB


Hence the angle θ between u and the plane is given by

θ = π2 φ ≈ .6982 radians (or 40°19′′)

If we had interchanged the rôles of v and w in the formula for n so that n = v × w = (–3, –1, 2), then we would have obtained φ = cos–1(– 9

14) ≈ 2.269 radians or 130.0052°. In this case, θ = φ ≈ – π2.

In either case, note that θ may be computed using the formula

24. (a) By Theorems 3.4.2 and 3.2.1 and the definition of k0,

(u + kv) × v = (u × v) + (kv × v)

= (u × v) + k(v × v)

= (u × v) + k0

= u × v

(b) Let u = (u1, u2, u3), v = (v1, v2, v3), and z = (z1, z2, z3). Then we know from the textthat

Since x • y = y • x for any vectors x and y, we have

u (v z)⋅ × =u u u

v v v

z z z

1 2 3

1 2 3

1 2 3

θ sin .=⋅

−1 u nu n


25. (a) By Theorem 3.4.1, we know that the vector v × w is perpendicular to both v and w.Hence v × w is perpendicular to every vector in the plane determined by v and w;moreover the only vectors perpendicular to v × w which share its initial point must bein this plane. But also by Theorem 3.4.1, u × (v × w) is perpendicular to v × w for anyvector u ≠ 0 and hence must lie in the plane determined by v and w.

(b) The argument is completely similar to Part (a), above.

26. Let u = (u1, u2, u3), v = (v1, v2, v3), and w = (w1, w2, w3). Following the hint, we find

v × i = (0, v3, –v2)

and

u × (v × i) = (–u2v2 – u3v3, u1v2, u1v3)

But

(u • i)v – (u • v)i = u1(v1, v2, v3) – (u1u1 + u2v2 + u3v3)(1, 0, 0)

= (u1v1, u1v2, u1v3) – (u1v1 + u2v2 + u3v3, 0, 0)

= (–u2v2 – u23v

3, u

1v

2, u

1v

3)

= u × (v × i)

Similarly,

u × (v × j) = (u • j)v – (u • v)j

and

u × (v × k) = (u • k)v – (u • v)k

(u z) v = v (u z) =× ⋅ ⋅ ×v v v1 2 33

1 2 3

1 2 3

1 2 3

1 2 3

1 2 3

u u u

u u u

v v v

–

z z z

z z z

= −

= ⋅u ((v z)×


Now write w = (w1, w2, w3) = w1i + w2j + w3k. Then

u × (v × w) = u × (v × (w1i + w2j + w3k))

= w1[u × (v × i)] + w2[u × (v × j)] + w3[u × (v × k)]

= w1[(u • i)v – (u • v)i] + w2[(u • j)v –(u • v)j] + w3[(u • k)v – (u • v)k]

= (u • (w1i + w2j + w3k))v – (u • v)(w1i + w2j + w3k)

= (u • w)v – (u • v)w

29. If a, b, c, and d lie in the same plane, then (a × b) and (c × d) are both perpendicular tothis plane, and are therefore parallel. Hence, their cross-product is zero.

30. Recall that |a • (b × c)| represents the volume of the parallelepiped with sides a, b, and c,which is the area of its base times its height. The volume of the tetrahedron is just 13 (areaof its base) times (its height). The two heights are the same, but the area of the base of thetetrahedron is half of the area of the base of the parallelepiped. Hence, the volume of thetetrahedron is 13 (1

2|a • (b × c|).

31. (a) The required volume is

= 2/3

33. Let u = (u1, u2, u3), v = (v1, v2, v3), and w = (w1, w2, w3).

For Part (c), we have

u × w = (u2w3 – u3w2, u3w1 – u1w3, u1w2 – u2w1)

and

v × w = (v2w3 – v3w2, v3w1 – v1w3, u1w2 – v2w1)

1

61 3 2 2 0 3 2 3 1 2 3 3 1 3 0 2 1(– – , , – ) (( – , , – – ) ( – , ,+ ⋅ + × + –– ))

(– , , – ) ( , , )

3

1

64 4 3 6 10 4

( )

( )= ⋅


Thus

(u × w) + (v × w)

= ([u2 + v2]w3 – [u3 + v3]w2, [u3 + v3]w1 – [u1 + u1]w3, [u1 + u1]w2 – [u2 + v2]w1)

But, by definition, this is just (u + v) × w.

For Part (d), we have

k(u × v) = (k[u2v3 – u3v2], k[u3u1 – u1v3], k[u1v2 – u2u1])

and

(ku) × v = (ku2v

3– ku

3v

2, ku

3u

1– ku

1v

3, ku

1v

2– ku

2v

1)

Thus, k(u × v) = (ku) × v. The identity k(u × v) = u × (kv) may be proved in an analogousway.

35. (a) Observe that u × v is perpendicular to both u and v, and hence to all vectors in theplane which they determine. Similarly, w = v × (u × v) is perpendicular to both v andto u × v. Hence, it must lie on the line through the origin perpendicular to v and in theplane determined by u and v.

(b) From the above, v • w = 0. Applying Part (d) of Theorem 3.7.1, we have

w = v × (u × v) = (v • v)u – (v • u)v

so that

u • w = (v • v)(u • u) – (v • u)(u • v)

= v2u2 – (u • v)2

36. It is not valid. For instance, let u = (1, 0, 0), v = (0, 1, 0), and w = (1, 1, 0). Then u × v =u × w = (0, 0, 1), which is a vector perpendicular to u, v, and w. However, v ≠ w.

37. The expression u • (v × w) is clearly well-defined.

Since the cross product is not associative, the expression u × v × w is not well-definedbecause the result is dependent upon the order in which we compute the cross products,i.e., upon the way in which we insert the parentheses. For example, (i × j) × j = k × j = –i

but i × (j × j) = i × 0 = 0.


The expression u • v × w may be deemed to be acceptable because there is only onemeaningful way to insert parenthesis, namely, u • (v × w). The alternative, (u •v) –w, doesnot make sense because it is the cross product of a scalar with a vector.

38. If either u or v is the zero vector, then u × v = 0. If neither u nor v is the zero vector, thenu × v = u v sin θ where θ is the angle between u and v. Thus if u × v = 0, then sin θ= 0, so that u and v are parallel.


EXERCISE SET 3.5

4. (a) We have

= (2, 1, 2) and = (3, –1, –2)

Thus × = (0, 10, –5) is perpendicular to the plane determined by and

and P, say, is a point in that plane. Hence, an equation for the plane is

0(x + 4)+ 10(y + 1) – 5(z + 1) = 0

or

2y – z + 1 = 0

5. (a) Normal vectors for the planes are (4, –1, 2) and (7, –3, 4). Since these vectors are notmultiples of one another, the planes are not parallel.

(b) The normal vectors are (1, –4, –3) and (3, –12, –9). Since one vector is three times theother, the planes are parallel.

6. (a) A normal vector for the plane is (1, 2, 3) and a direction vector for the line is (–4, –1,2). The inner product of these two vectors is –4 –2 + 6 = 0, and therefore they areperpendicular. This guarantees that the line and the plane are parallel.

7. (a) Normal vectors for the planes are (3, –1, 1) and (1, 0, 2). Since the inner product ofthese two vectors is not zero, the planes are not perpendicular.

8. (a) A normal vector for the plane is (2, 1, –1) and a direction vector for the line is (–4, –2,2). Since one of these vectors is a multiple of the other, the line and the plane areperpendicular.

PR PQ

PR

PQ

PR

PQ

171

10. (a) Call the points P and Q and call the line . Then the vector = (2, 4, –8) is parallelto and the point P = (5, –2, 4) lies on . Hence, one set of parametric equations for is: x = 5 + t, y = –2 + 2t, z = 4 –4t where t is any real number.

11. (a) As in Example 6, we solve the two equations simultaneously. If we eliminate y, wehave x + 7z + 12 = 0. Let, say, z = t, so that x = –12 – 7t, and substitute these valuesinto the equation for either plane to get y = –41 – 23t.

Alternatively, recall that a direction vector for the line is just the cross-product ofthe normal vectors for the two planes, i.e.,

(7, –2, 3) × (–3, 1, 2) = (–7, –23, 1)

Thus if we can find a point which lies on the line (that is, any point whose coordinatessatisfy the equations for both planes), we are done. If we set z = 0 and solve the twoequations simultaneously, we get x = –12 and y = –41, so that x = –12 – 7t, y = –41 –23t, z = 0 + t is one set of equations for the line (see above).

13. (a) Since the normal vectors (–1, 2, 4) and (2, –4, –8) are parallel, so are the planes.

(b) Since the normal vectors (3, 0, –1) and (–1, 0, 3) are not parallel, neither are theplanes.

14. (a) Since (–2, 1, 4) • (1, –2, 1) = 0, the planes are perpendicular.

(b) Since (3, 0, –2) • (1, 1, 1) = 1 ≠ 0, the planes are not perpendicular.

16. (a) Since 6(0) + 4t – 4t = 0 for all t, it follows that every point on the line also lies in theplane.

(b) The normal to the plane is n = (5, –3, 3); the line is parallel to v = (0, 1, 1). But n •v = 0, so n and v are perpendicular. Thus v, and therefore the line, are parallel to theplane. To conclude that the line lies below the plane, simply note that (0, 0, 1/3) is inthe plane and (0, 0, 0) is on the line.

(c) Here n = (6, 2, –2) and v is the same as before. Again, n • v = 0, so the line and theplane are parallel. Since (0, 0, 0) lies on the line and (0, 0, –3/2) lies in the plane,then the line is above the plane.

17. Since the plane is perpendicular to a line with direction (2, 3, –5), we can use that vectoras a normal to the plane. The point-normal form then yields the equation 2(x + 2) + 3(y –1)– 5(z –7) = 0, or 2x + 3y – 5z + 36 = 0.

PQ


19. (a) Since the vector (0, 0, 1) is perpendicular to the xy-plane, we can use this as thenormal for the plane. The point-normal form then yields the equation z – z0 = 0. Thisequation could just as well have been derived by inspection, since it represents the setof all points with fixed z and x and y arbitrary.

20. The plane will have normal (7, 4, –2), so the point-normal form yields 7x + 4y – 2 z = 0.

21. A normal to the plane is n = (5, –2, 1) and the point (3, –6, 7) is in the desired plane.Hence, an equation for the plane is 5(x – 3) – 2(y + 6) + (z –7) = 0 or 5x – 2y + z – 34 =0.

22. If we substitute x = 9 – 5t, y = –1 – t, and z = 3 + t into the equation for the plane, we findthat t = 40/3. Substituting this value for t into the parametric equations for the line yieldsx = –173/3, y = –43/3, and z = 49/3.

24. The two planes intersect at points given by (–5 – 2t, –7 – 6t, t). Two such points are (–5, –7,0) and (–3, –1, –1). The plane ax + by + cz + d = 0 through these points and also through(2, 4, –1) will satisfy the equations

–5a – 7b + d = 0

–3a – b – c + d = 0

2a + 4b – c + d = 0

This system of equations has the solution a = (–1/2)t, b = (1/2)t, c = 2t, d = t where t isarbitrary. Thus, if we let t = –2, we obtain the equation

x – y – 4z – 2 = 0

for the desired plane.

Alternatively, note that the line of intersection of the two planes has direction given bythe vector v = (–2, –6, 1). It also contains the point (–5, –7, 0). The direction of the lineconnecting thispoint and (2, 4, –1) is given by w = (7, 11, –1). Thus the vector v – w = (5,–5, –20) is normal to the desired plane, so that a point-normal form for the plane is

(x – 2) – (y – 4) – 4(z + 1) = 0

or

x – y –4z – 2 = 0


25. Call the points A, B, C, and D, respectively. Since the vectors = (–1, 2, 4) and =(–2, –1, –2) are not parallel, then the points A, B, and C do determine a plane (and not justa line). The normal to this plane is × = (0, –10, 5). Therefore an equation for theplane is

2y – z + 1 = 0

Since the coordinates of the point D satisfy this equation, all four points must lie in thesame plane.

Alternatively, it would suffice to show that (for instance) × and ×are parallel, so that the planes determined by A, B, and C and A, D, and C are parallel.Since they have points in common, they must coincide.

27. The normals to the two planes are (4, –2, 2) and (3, 3, –6) or, simplifying, n1 = (2, –1, 1)and n2 = (1, 1, –2). The normal n to a plane which is perpendicular to both of the givenplanes must be perpendicular to both n1 and n2. That is, n = n1 × n2 = (1, 5, 3). The planewith this normal which passes through the point (–2, 1, 5) has the equation

(x + 2) + 5(y – 1) + 3(z – 5) = 0

or

x + 5y + 3z – 18 = 0

28. The line of intersection of the given planes has the equations x = – 109 – t3 , y = 31

18 – t3, z = t and hence has direction (1, 1, –3). The plane with this normal vector which passesthrough the point (2, –1, 4) is the one we are looking for. Its equation is

(x – 2) + (y + 1) – 3(z – 4) = 0

Note that the normal vector for the desired plane can also be obtained by computing thecross product of the normal vectors of the two given planes.

30. The directions of the two lines are given by the vectors (–2, 1, –1) and (2, –1, 1). Sinceeach is a multiple of the other, they represent the same direction. The FIrst line passes(for example) through the point (3, 4, 1) and the second line passes through the points (5,1, 7) and (7, 0, 8), among others. Either of the methods of Example 2 will yield an equationfor the plane determined by these points.

31. If, for instance, we set t = 0 and t = –1 in the line equation, we obtain the points (0, 1, –3)and (–1, 0, –5). These, together with the given point and the methods of Example 2, willyield an equation for the desired plane.

DC

AD

BC

AB

BC

AB

BC

AB


33. The plane we are looking for is just the set of all points P = (x, y, z) such that the distancesfrom P to the two fixed points are equal. If we equate the squares of these distances, wehave

(x + 1)2 + (y + 4)2 + (z + 2)2 = (x – 0)2 + (y + 2)2 + (z – 2)2

or

2x + 1 + 8y + 16 + 4z + 4 = 4y + 4 – 4z + 4

or

2x + 4y + 8z + 13 = 0

34. The vector v = (–1, 2, –5) is parallel to the line and the vector n = (–3, 1, 1) isperpendicular to the plane. But n • v = 0 and the result follows.

35. We change the parameter in the equations for the second line from t to s. The two lines willthen intersect if we can find values of s and t such that the x, y, and z coordinates for thetwo lines are equal; that is, if there are values for s and t such that

4t + 3 = 12s – 1

t + 4 = 6s + 7

1 = 3s + 5

This system of equations has the solution t = –5 and s = –4/3. If we then substitute t = –5into the equations for the first line or s = –4/3 into the equations for the second line, we findthat x = –17, y = –1, and z = 1 is the point of intersection.

36. The vector v1 = (4, 1, 0) is parallel to the first line and v2 = (4, 2, 1) is parallel to thesecond line. Hence n = v1 × v2 = (1, –4, 4) is perpendicular to both lines and is therefore anormal vector for the plane determined by the lines. If we substitute t = 0 into theparametric equations for the first line, we see that (3, 4, 1) must lie in the plane. Hence, anequation for the plane is

(x – 3) – 4(y – 4) + 4(z – 1) = 0

or

x – 4y + 4z + 9 = 0


37. (a) If we set z = t and solve for x and y in terms of z, then we find that

38. Call the plane Ax + By + Cz + D/b = 0. Since the points (a, 0, 0), (0, b, 0), and (0, 0, c)

lie in this plane, we have

aA + D = 0

bB + D = 0

cC + D = 0

Thus A = –D/a, B = –D/b, and c = –D/c and an equation for the plane is

Alternatively, let P, Q, and R denote the points (a, 0, 0), (0, b, 0), and (0, 0, c),respectively. Then

= (–a, b, 0) and = (0, –b, c)

So

n = × = (bc, ac, ab)

is a normal vector for the plane. Since P, say, lies on the plane, we have

bc(x – a)+ acy + abz = 0

Dividing the above equation by abc yields the desired result.

39. (b) By Theorem 3.5.2, the distance is

D =−( ) + ( ) − ( ) −

+ + −( )=

2 1 3 2 4 1 1

2 3 4

1

292 2 2

QR

PQ

QR

PQ

x

a

y

b

z

c+ + = 1

x t y t z t= + = − − =11

23

7

23

41

23

1

23, ,


40. (a) The point (0, 0, 1) lies in the first plane. The distance between this point and thesecond plane is

(b) The two planes coincide, so the distance between them is zero.

41. (a)

(b)

(c)

42. First observe that if we substitute the values for x, y, and z from the line equations into thesymmetric equations, we obtain the equations t = t = t, which hold for all values of t. Thusevery point on the line satisfies the symmetric equations.

It remains to show that every point (x, y, z) which satisfies the symmetric equationsalso lies on the line. That is, we must find a value of t for which the line equations hold.Clearly

44. (a) The symmetric equations for the line are

Thus, two of the planes are given by

or, equivalently,

x y x z− − = + − =2 17 0 4 27 0and

x y x z−=

+ −=

−−

7

4

5

2

7

4

5

1and

x y z−=

+=

−−

7

4

5

2

5

1

tx x

a

y y

b

z z

c=

−=

−=

−0 0 0

d

d

d

=

=

=

30

11

382

110, point is on the line

D =−

+ −( ) + ( )=

2 3

6 8 2

1

2 262 2 2


45. (a) The normals to the two planes are (1, 0, 0) and (2, –1, 1). The angle between them isgiven by

Thus θ = cos–1 (2/ 6) ≈ 35°15′52′′.

47. If we substitute any value of the parameter—say t0—into r = r0 + tv and –t0 into r = r0 – tv,we clearly obtain the same point. Hence, the two lines coincide. They both pass through thepoint r0 and both are parallel to v.

47. Since the vector (a, b, c) is both the normal vector to the plane and the direction vector forthe line, the line must be perpendicular to the plane.

48. The equation r = (1 – t)r1 + tr2 can be rewritten as r = r1 + t(r2 – r1). This represents a linethrough the point P1 with direction r2 – r1. If t = 0, we have the point P1. If t = 1, we havethe point P2. If 0 < t < 1, we have a point on the line segment connecting P1 and P2. Hencethe given equation represents this line segment.

cos, , , ,

θ =( ) ⋅ −( )

+ +=

1 0 0 2 1 1

1 4 1 1

2

6


EXERCISE SET 4.1

3. We must find numbers c1, c2, c3, and c4 such that

c1(–1, 3, 2, 0) + c2(2, 0, 4, –1) + c3(7, 1, 1, 4) + c4(6, 3, 1, 2) = (0, 5, 6, –3)

If we equate vector components, we obtain the following system of equations:

–c1 + 2c2 + 7c3+ 6c4 = 0

3c1 + c3 + 3c4 = 5

2c1 + 4c2 + c3 c4 = 6

–c2 + 4c3 2c4 = 3



Thus c1 = 1, c2 = 1, c3 = –1, and c4 = 1.

1 0 0 0 1

0 1 0 0 1

0 0 1 0 1

0 0 0 1 1

−

−

− −

1 2 7 6 0

3 0 1 3 5

2 4 1 1 6

0 1 4 2 3

179

4. If we equate the second vector components in this equation, we have 0c1 + 0c2 + 0c3 = – 2.Hence, there do not exist scalars c1, c2, and c3 which satisfy the given equation.

5. (c) v = [32 + 42 + 02 + (–12)2]1/2 = 169 = 13

6. (a) u + v = (4, 4, 10, 1) = 42 + 42 + 102 + 12]1/2 = 133

(c) –2u + 2u = [(–8)2 + (–2)2 + (–4)2 + (–6)2]1/2 + 2 [42 + 12 + 22 + 32]1/2

= 1201/2 + 2[30]1/2 = 4 30

(e)

8. Since kv = [(–2k)2 + (3k)2 + 02 + (6k)2]1/2 = [49k2]1/2 = 7|k|, we have |kv| = 5 if and onlyif k = ±5/7.

9. (a) (2,5) • (–4,3) = (2)(–4) + (5)(3) = 7

(c) (3, 1, 4, –5) • (2, 2, –4, –3) = 6 + 2 –16 + 15 = 7

10. (a) Let v = (x, y) where v = 1. We are given that v • (3, –1) = 0. Thus, 3x – y = 0 or y

= 3x. But v = 1 implies that x2 + y2 = x2 + 9x2 = 1 or x = ±1/ 10. Thus, the only

possibilities are v = (1/ 10, 3/ 10) or v = (–1/ 10, –3/ 10).

You should graph these two vectors and the vector (3, –1) in an xy-coordinate system.

(b) Let v = (x, y, z) be a vector with norm 1 such that

x – 3y + 5z = 0

This equation represents a plane through (0,0,0) which is perpendicular to (1, –3, 5).There are infinitely many vectors v which lie in this plane and have norm 1 and initialpoint (0,0,0).

1 1

3 1 2 23 1 2 2

1

22 2 2 2 1 2ww =

+ + +

( ) =, , , ,/

, ,1

3 2

2

3

2

3


11. (a) d(u,v) = [(1 – 2)2 + (–2 – 1)2]1/2 = 10

(c) d(u, v) = [(0 + 3)2 + (–2 – 2)2 + (–1 – 4)2 + (1 – 4)2]1/2 = 59

14. (e) Since u • v = 0 + 6 + 2 + 0 = 8, the vectors are not orthogonal.

15. (a) We look for values of k such that

u • v = 2 + 7 + 3k = 0

Clearly k = –3 is the only possiblity.

16. We must find two vectors x = (x1, x2, x3, x4) such that x • x = 1 and x • u = x • v = x • w

= 0. Thus x1, x2, x3, and x4 must satisfy the equations

x21 + x2

2 + x23 + x2

4 = 1

2x1 + x2 – 4x3 = 0

–x1 – x2 + 2x3+ 2x4 = 0

3x1+ 2x + 5x3+ 4x4 = 0

The solution to the three linear equations is x1 = –34t, x2 = 44t, x3 = –6t, and x4 = 11t. If wesubstitute these values into the quadratic equation, we get

[(–34)2 + (44)2 + (–6)2 + (11)2] t2 = 1

or

Therefore, the two vectors are

± 157(–34, 44, –6, 11)

t = ± = ±1

3249

1

57


17. (a) We have |u • v| = |3(4) + 2(–1)| = 10, while

u v = [32 + 22]1/2[42 + (–1)2]1/2 = 221.

(d) Here |u • v| = 0 + 2 + 2 + 1 = 5, while

u v = [02 + (–2)2 + 22 + 12]1/2[(–1)2 + (–1)2 + 12 + 12]1/2 = 6.

18. (a) In this case

and

20. By Theorem 4.1.6, we have u • v = 14 u + v2 – 1

4 u – v2 = 14 (1)2 – 1

4 (52) = –6.

22. Note that u • a = 4 and a2 = 15. Hence, by Theorem 3.3.3, we have

and

uu uu uuuu aa

aaaa− = −

⋅= −( ) − −proja 2

2 1 4 14

15

4

15

8

15

4, , , , , ,

55

34

15

11

15

52

15

9

5

= −

, , ,

proja 2uu

uu aa

aaaa=

⋅= −( )4

151 1 2 3, , ,

AT u . v =

−

= −

9

1

2

612

u . vA =

⋅

−

= −

3

1

10

1812

u v. AT =

⋅

−

−

3

1

2 3

1 4

2

6 =

⋅

=

3

1

14

2668

Au . v =

⋅

−

=

5

13

2

668


23. We must see if the system

3 + 4t = s

2 + 6t = 3 – 3s

3 + 4t = 5 – 4s

–1 – 2t = 4 – 2s

is consistent. Solving the first two equations yield t = – 4/9, s = 11/9. Substituting into 2rdequation yields 5/3 = 1/9. Thus the system is inconsistent, so the lines are skew.

24. Note that

v1 + … + vr2 = (v1 + …+ v

r) • (v1 + … + v

r)

If vi •vj = 0 whenever i ≠ j, then the result follows.

25. This is just the Cauchy-Schwarz Inequality applied to the vectors vT AT and uT AT withboth sides of the inequality squared. Why?

26. If we apply the Cauchy-Schwarz Inequality to the vectors u = (a, b) and v = (cos θ, sin θ),the result follows directly.

27. Let u = (u1, …, un), v = (v1, …, v

n), and w = (w1, …, w

n).

(a) u • (kv) = (u1, …, un) • (kv1, …, kv

n)

= u1 kv1 + … + un

kvn

= k(u1v1 + … + u

nv

n)

= k(u • v)

= ⋅ + ⋅

= + + + ⋅

= ≠

≠

∑ ∑

∑

v v v v

v v v v

ii

r

i i ji j

r i ji j

1

12 2


27. (b) u • (v + w) = (u1, …, un) • (v1 + w1, …, v

n+ w

n)

= u1(v1 + w1) + … + un

(vn

+ wn)

= (u1v1 + … + unv

n) + (u1w1 + … + u

nw

n)

= u • v + u • w

30. Let u = (u1, …, un) and v = (v1, …, v

n).

(a) u • v = u1v1 + … + unv

n= v1u1 + … + v

nu

n= v • u

(c) (ku) • v = (ku1) + … + (kun)v

n= k(u

1v

1+ … + unvn) = k(u • v)

32. Let u = (u1, u2, …, un) and v = (v1, v2, …, v

n).

(a) Since d(u, v) is a square root of a sum of squares, it cannot be negative.

(b) Since d(u, v) = [(u1 – v1)2 + (u2 – v2)

2 + … + (un

– vn)2]1/2 = if and only if u

i– v

i= 0

for i = 1, 2, …, n, we have d(u, v) = 0 if and only if u = v.

(c) Here d(u, v) = [(u1 – v1)2 + … + (u

n– v

n)2]1/2 = [(v1 – u1)

2 + … + (vn

– un)2]1/2 = d

(v, u).

34. The result follows from the equations used to prove Theorem 4.1.6. It says that the sum ofthe squares of the lengths of the diagonals of the parallelogram with sides u and v is thesum of the squares of the lengths of the 4 sides.

35. (a) By theorem 4.1.7, we have

36. Since u – v2 = (u – v) • (u – v) = u2 + v>2 – 2(u • v) for all vectors u and v in Rn, we

need only show that for the formula to hold. Since this result is valid for cos

θ in R2 and R3, it would seem to be the logical way to define cos θ and hence θ in Rn for n

> 3. To show that this makes sense, we must check that so that cos θ is well

defined. But the above inequality is just the Cauchy-Schwarz inequality in Rn. Thus the

formula is valid in Rn.

uu vv

uu vv

⋅≤ 1

cosθ =⋅uu vv

uu vv

d( , ) .uu vv uu vv uu vv= + = + − =2 22


37. (a) True. In general, we know that

u + v2 = u2 + v2 + 2(u • v)

So in this case u • v = 0 and the vectors are orthogonal.

(b) True. We are given that u • v = u • w = 0. But since u • (v + w) = u • v + u • w, itfollows that u is orthogonal to v + w.

(c) False. To obtain a counterexample, let u = (1, 0, 0), v = (1, 1, 0), and w = (–1, 1, 0).


EXERCISE SET 4.2

1. (b) Since the transformation maps (x1, x2) to (w1, w2, w3), the domain is R2 and thecodomain is R3. The transformation is not linear because of the terms 2x1x2 and 3x1x2.

2. (a) Since we can write the transformation as

3. The standard matrix is A, where

so that

T( , , )− =−

−−

−

1 2 4

3 5 1

4 1 1

3 2 1

1

2

4

= −−

3

2

3

w = Ax =−

−−

3 5 1

4 1 1

3 2 1

1

2

3

x

x

x

the standard matrix is2 3 0 1

3 5 0 1

−−

w

w

x

x

x

x

1

2

1

2

3

4

2 3 0 1

3 5 0 1

=

−−

187

4. (a) The standard matrix is

It is interesting to note that T(1, 0) = (2, 1) and T(0, 1) = (–1, 1).


Note that T(1, 0) = (0, –1, 1, 1) and T(0, 1) = (1, 0, 3, –1).

6. (b) We have

7. (b) Here

8. (c) Here

so the reflection of (–1, 2) is (2, –1).

T( , )− =

−

=

−

1 2

0 1

1 0

1

2

2

1

T( , )2 1

2 1 1

0 1 1

0 0 0

2

1

3

3− =

−

−

= −

0

2

0

T( )x =−

−

=

1 2 0

3 1 5

1

1

3

3

13

0 1

1 0

1 3

1 1

−

−

2 1

1 1

−


9. (a) In this case,

so the reflection of (2, –5, 3) is (2, –5, –3).

10. (a)

12. (b) The image of (3, –4) is

(d) The image of (3, –4) is (4, 3), since

13. (b) The image of (–2, 1, 2) is (0, 1, 2 2), since

cos sin

sin cos

45 0 45

0 1 0

45 0 45

°( ) °( )

− °( ) °( )

−

=

−

2

1

2

1

20

1

20 1 0

1

20

1

2

−

=

2

1

2

0

1

2 2

cos sin

sin cos

90 90

90 90

3°( ) − °( )°( ) °( )

−−

=

−

−

=

4

0 1

1 0

3

4

4

3

or3 4 3

2

4 3 3

2

− +

, .

cos sin

sin cos

60 60

60 60

3

4

°( ) °( )− °( ) °( )

−

=−

−

=

1

2

3

2

3

2

1

2

3

4

3 4 3

2

4 3 3

2

−

−+

The desired projection is1 0

0 0

2

5

−

=

, , ).

2

092 0or

T ( , , )2 5 3

1 0 0

0 1 0

0 0 1

2

5

3

− =−

−

= −−

2

5

3


15. (b) The image of (–2, 1, 2) is (0, 1, 2 2), since

16. (b)


(c) The standard matrix for a counterclockwise rotation of 15° + 105° + 60° = 180° is


1 0 0

0 0 0

0 0 1

1 0 0

0 1 0

0 0 1

−

=−−

1 0 0

0 0 0

0 0 1

cos sin

sin cos

180 180

180 180

°( ) − °( )°( ) °( )

=−

−

1 0

0 1

0 1

1 0

1 0

0 0

1

2

3

2

3

2

1

2

−

=−

0 0

1

2

3

2

The standard matrix is1 2 0

0 1 2

0 0

0 1

=

.

0 0

0 1 2

=

−

−

1

20

1

20 1 0

1

20

1

2

2

1

2

=

0

1

2 2

cos sin

sin cos

− °( ) − − °( )

− °( ) − °( )

45 0 45

0 1 0

45 0 45

−

2

1

2


(b) The standard matrix is

19. (c) The standard matrix is

20. (a) Projection on either axis followed by projection on the other will yield the zero vector.Therefore the transformations commute. In matrix form, we have

(c) Geometrically, it is clear that the transformations do not commute. In matrix form, wehave

[ ] [ ]cos sin

sin cosT T1 2

1 0

0 0=

−

θ θθ θ

==−

cos sinθ θ0 0

[ ] [ ]T T1 21 0

0 0

0 0

0 1

0 0

0 0=

=

[ ] [ ]=

=

0 0

0 1

1 0

0 0 2 1T T

= −−

0 1 0

0 0 1

1 0 0

=−

−

−

1 0 0

0 1 0

0 0 1

0 0 1

0 1 0

1 0 0

1 0 0

0 0 1

0 1 0−

cos sin

sin cos

180 180 0

180 180 0

0 0 1

°( ) − °( )°( ) °( )

°( ) °( )

− °( )

cos sin

sin c

90 0 90

0 1 0

90 0 oos

cos sin

90

1 0 0

0 270 270

°( )

°( ) − °( )00 270 270sin cos°( ) °( )

2 0 0

0 2 0

0 0 2

1

20

1

20 1 0

1

20

1

2

−

=−

1 0 1

0 2 0

1 0 1


and

21. (a) Geometrically, it doesn’t make any difference whether we rotate and then dilate orwhether we dilate and then rotate. In matrix terms, a dilation or contraction isrepresented by a scalar multiple of the identity matrix. Since such a matrix commuteswith any square matrix of the appropriate size, the transformations commute.

22. (a) We consider only T1. It is linear because it can be represented by the matrix

(b) Again we consider only T1. We have T1(x) = (x, 0, 0) and x – T1(x) = (x, y, z) – (x,0, 0) = (0, y, z), so that T1(x) • (x – T1x) = 0 and the vectors are orthogonal.

(c)

23. Set (a, b, c) equal to (1, 0, 0), (0, 1, 0), and (0, 0, 1) in turn.

24.

1

3

1

3

1

3

1

3

1

3

1

3

1

3

1

3

1

3

1

31

3

1

3

1

3

− +

+ −

− +1

3

1

3

−T(x)

x − T(x) x

x

y

T(x)

1 0 0

0 0 0

0 0 0

[ ] [ ]cos sin

sin cosT T2 1

1 0

0 0=

−

θ θθ θ =

cos

sin

θθ

0

0


25. (a) Since T2(T1(x1, x2)) = (3(x1 + x2), 2(x1 + x2) + 4(x1 – x2)) = (3x1 + 3x2, 6x1 – 2x2),we have

We also have

27. Compute the trace of the matrix given in Formula (17) and use the fact that (a, b, c) is aunit vector.

28. (a) By inspection, c1 • c2 = c1 • c3 = c2 • c3 = 0 and c1 = c2 = c3 = 1 where c1, c2, andc3 are the column vectors of A. It is also easy to show that det(A) = 1.

(b) If we let , for instance, an axis of rotation is given by the vector

Thus 13, is an appropriate vector of length 1.

(c) From Exercise 27, we have . Substituting

into Formula (17), we find that θ = 90° + 360°n for n = 0, 1, 2, … .

cos , , , ,θ =−

= ° + ° =1 1

20 90 180 0 1 2so forn n

1

3

2

3

2

3, ,

=

−

+ −

1

98

94

9

1

94

98

9

=

=

2

94

94

9

2

33

1

32

32

3

u =

−

−

1

9

4

9

8

98

9

4

9

1

94

9

7

9

4

9

1

00

0

1

9

8

9

4

94

9

4

9

7

98

9

1

9

4

9

+

−

−

+ −

1

0

0

1 1

1

0

0

x =

1

0

0

[ ] [ ]T T2 13 0

2 4

1 1

1 1

3 3

6 2=

−

=

−

[ ]T T2 13 3

6 2 =

−


29. (a) This is an orthogonal projection on the x-axis and a dilation by a factor of 2.

(b) This is a reflection about the x-axis and a dilation by a factor of 2.

30. (a) This transformation dilates the x coordinates by a factor of 2 and the y coordinates bya factor of 3.

(b) This is a rotation through angle of π/6.

31. Since cos(2θ) = cos2 θ – sin2 θ and sin(2θ) = 2 sin θ cos θ, this represents a rotationthrough an angle of 2θ.

32. We have

which represents a rotation through an angle –θ.

34. No, f(0) = b ≠ 0 unless b = 0; then f(x) = mx + b is not a linear function.

AT =

−

=

−( ) − −(cos sin

sin cos

cos sinθ θθ θ

θ θ ))−( ) −( )

sin cosθ θ


EXERCISE SET 4.3

1. (a) Projections are not one-to-one since two distinct vectors can have the same imagevector.

(b) Since a reflection is its own inverse, it is a one-to-one mapping of R2 or R3 onto itself.

2. (a) The standard matrix is . Since det(A) = 0, A is not invertible and hence

the operator is not one-to-one.

(b) The standard matrix is . Since det(A) = 17 ≠ 0, A is invertible and hence

the operator is one-to-one.

3. If we reduce the system of equations to row-echelon form, we find that w1 = 2w2, so that anyvector in the range must be of the form (2w, w). Thus (3, 1), for example, is not in the range.

4. The range satisfies w1 – w2 + w3 = 0

Thus (1, 1, 1) Range.

5. (a) Since the determinant of the matrix

is 3, the transformation T is one-to-one with

Thus T w w w w w w− = − +

11 2 1 2 1 2

1

3

2

3

1

3

1

3( , ) , .

[ ]T− =

−

1

1

3

2

31

3

1

3

[ ]T =−

1 2

1 1

A =−

2 3

5 1

A =

8 4

2 1

195

5. (b) Since the determinant of the matrix

is zero, T is not one-to-one.

6. (a) Since the matrix

is invertible, T is one-to-one. Since

T–1(w1, w2, w3) = (w1 – 2w2 + 4w3, –w1 + 2w2 – 3w3, –w1 + 3w2 – 5w3)

(b) Since the matrix

is not invertible, T is not one-to-one.

8. (a) T is linear since

T((x1, y1) + (x2, y2))= (2(x21 + x2), y1 + y2)

= (2x1, y1) + (2x2, y2)

= T(x1, y1) + T(x2, y2)

and

[ ]T =−

−−

1 3 4

1 1 1

0 2 5

[ ]T− =

−− −− −

11 2 4

1 2 3

1 3 5

[ ]T =−

1 2 2

2 1 1

1 1 0

[ ]T =−

−

4 6

2 3


T(k(x, y))= T(kx, ky) = (2kx, ky)

= k(2x, y) = kT(x, y)

(b) T is not linear since

T((x1, y1) + (x2, y2)) = (x1 + x2)2, y1 + y2

= (x21, y1) + (2x1x2, 0) + (x2

2, y2)

= T(x1, y1) + T(x2, y2) + (2x1x2, 0)

≠ T(x1, y1) + T(x2, y2) if x1x2 ≠ 0

and

T(k(x,y)) = (k2x2, ky) = k(kx2, y)

≠ kT(x, y) if k ≠ 0, 1 and x ≠ 0


T((x1, y1) + (x2, y2) ) = (2(x1 + x2) + (y1 + y2), (x1 + x2) – (y1 + y2))

= (2x1 + y1, x1 – y1) + (2x2 + y2, x2 – y2)

= T(x1, y1) + T(x2, y2)

and

T(k(x, y)) = (2kx + ky, kx – ky)

= k(2x + y, x – y) = kT(x, y)

(b) Since

T((x1, y1) + (x2, y2))= (x1 + x2 + 1, y1 + y2)

= (x1 + 1, y1) + (x2, y2)

≠ T(x1, y1) + T(x2, y2)

and T(k(x, y)) = (kx + 1, ky) ≠ kT (x, y) unless k = 1, T is nonlinear.



T((x1, y1, z1) + (x2, y2, z2)) = (x1 + x2, x1 + x2 + y1 + y2 + z1 + z2)

= (x1, x1 + y1 + z1) + (x2, x2 + y2 + z2)

= T(x1, y1, z1) + T(x2, y2, z2)

and

T(k(x, y, z)) = (kx, kx + ky + kz)

= kT(x, y, z)

(b) Since for all vectors u and v,

T(u + v) = (1, 1) ≠ T(u) + T(v) = (2, 2)

and

T(ku) = (1, 1) ≠ kT(u) = (k, k) if k = 1

T is nonlinear.

13. (a) The projection sends e1 to itself and the reflection sends e1 to –e1, while the projection

sends e2 to the zero vector, which remains fixed under the reflection. Therefore

T(e1) = (–1, 0) and T(e2) = (0, 0), so that

(b) We have e1 = (1, 0) → (0, 1) → (0, –1) = 0e1 – e2 while e2 = (0, 1) → (1, 0) → (1, 0)

= e1 + 0e2. Hence

(c) Here e1 = (1, 0) → (3, 0) → (0, 3) → (0, 3) = 0e1 + 3e2 and e2 = (0, 1) → (0, 3) →

(3, 0) → (0, 0) = 0e1 + 0e2. Therefore

14. (a) Here e1 = (1, 0, 0) → (1, 0, 0) → (1/5, 0, 0) = 15e1, e2 = (0, 1, 0) → (0, –1, 0) → (0,

–1/5, 0) = – 15e2, and e3 = (0, 0, 1) → (0, 0, 1) → (0, 0, 1/5) = 15e3. Therefore

[ ]T = −

1 5 0 0

0 1 5 0

0 0 1 5

[ ]T =

0 0

3 0

[ ]T =−

0 1

1 0

[ ]T =−

1 0

0 0


16. (c) The mapping is NOT 1 – 1.

For example,

yet

17. (a) By the result of Example 5,

or T(–1, 2) = (1/2, 1/2)

18. (a) This transformation maps vectors on the x-axis to themselves, vectors on the y-axisinto their negatives, and maps no other vectors into scalar multiples of themselves.Thus the eigenvalues are λ = ±1 and the eigenvectors are vectors (x, y) with either x = 0 or y = 0, but not both. To verify this, we observe that since e1 → e1 and

e2 → –e1, the standard matrix for the transformation is . Hence the char-acteristic equation is

or λ2 – 1 = 0 with solutions λ = ±1. If (x, y) is an eigenvector corresponding to λ = 1,then

or y = 0, so the vector must lie on the x-axis. If (x, y) is an eigenvector correspondingto λ = –1, then

or x = 0, so the vector must lie on the y-axis.

−

=

−

=

2 0

0 0

2

0

0

0

x

y

x

0 0

0 2

0

2 0

=

=

x

y y

0

det λ1 0

0 1

1 0

0 10

−

−

=

1 0

0 1−

T−

=

( ) ( )( )( )( ) (

1

2

1 2 1 2 1 2

1 2 1 2 1 2

2

))

−

=

2

1

2

1 2

1 2

A

1

1

1

0

0

0

0

04 1−

=

= ×

A

0

0

0

0

0

0

0

04 1

=

= ×


18. (c) This transformation leaves vectors on the x-axis fixed and maps vectors on the y-axisto (0, 0). No other vectors are mapped into scalar multiples of themselves. Thus theonly eigenvalues are λ = 1 and λ = 0 with corresponding eigenvectors (x, 0) and (0, y),respectively, where xy ≠ 0. To verify this, observe that the characteristic equation is

or λ (λ – 1) = 0. Thus λ = 0 or λ = 1. If (x, y) is an eigenvector corresponding to λ =0, then

so that x = 0. If (x, y) is an eigenvector corresponding to λ = 1, then

so that y = 0. Thus the eigenvectors must indeed lie along the coordinate axes.

19. (a)

Eigenvalue λ1 = 4, eigenvector

Eigenvalue λ3 = 1, eigenvector

(b)

λ1 = 0,

λ2 = 1, ξ ξ21 22

1

0

0

0

0

1

=

=

,

, or in general

s

o

t

ξ1

0

1

0

=

A =

1 0 0

0 0 0

0 0 1

ξ ξ21 22

0

1

0

0

0

1

,=

=

ξ1

1

0

0

=

A =−

1 0 0

0 1 0

0 0 1

0 0

0 1

0 0

0

=

=

x

y y

−

=

−

=

1 0

0 0 0

0

0

x

y

x

det λ1 00 1

λ −1 00

−

=

1 0

0 0 λλ= 0


(c) This transformation doubles the length of each vector while leaving its directionunchanged. Therefore λ = 2 is the only eigenvalue and every nonzero vector in R3 is acorresponding eigenvector. To verify this, observe that the characteristic equation is

or (λ – 2)3 = 0. Thus the only eigenvalue is λ = 2. If (x, y, z) is a correspondingeigenvector, then

Since the above equation holds for every vector (x, y, z), every nonzero vector is aneigenvector.

(d) Since the transformation leaves all vectors on the z-axis unchanged and alters (butdoes not reverse) the direction of all other vectors, its only eigenvalue is λ = 1 withcorresponding eigenvectors (0, 0, z) with z ≠ 0. To verify this, observe that thecharacteristic equation is

or

Since the quadratic (λ – 1/ 2)2 + 1/2 = 0 has no real roots, λ = 1 is the onlyeigenvalue. If (x, y, z) is a corresponding eigenvector, then

You should verify that the above equation is valid if and only if x = y = 0. Therefore thecorresponding eigenvectors are all of the form (0, 0, z) with z ≠ 0.

1 0

1 0

0 0 0

−

− −

1 2 1 2

1 2 1 2x

y

z

=

−( ) + ( )−( ) + ( )

1 1 2 1 2

1 2 1−1 2

x y

x y

0

0

0

0

=

( ) ( )λ λ λ− = − −( ) + ( )

1 1 1 2 1 22 2λ −

− λ −

1 2 1 2

1 2 1 2

= 0

λ −1 2

λ −1 2

λ −1

1 2 0

1 2 0

0 0

0− =

0 0 0

0 0 0

0 0 0

0

0

0

=

x

y

z

λ −λ −

λ −

2 0 0

0 2 0

0 0 2

0=


20. (a) Yes. If T1 and T2 are both one-to-one (whether or not they are linear), then each mapsdistinct vectors to distinct vectors and therefore so does T2

2 T

1.

(b) If T2 is one-to-one and T1 is not, then T2 T1 cannot be one-to-one since there aredistinct vectors u and v such that T1(u) = T1(v) and therefore such that T2 T1 (u)= T2 T1(v). This is true whether or not the transformations are linear. However T1 T2 may indeed be one-to-one. To see this, consider, for instance, the transformationsgiven by

It is easy to check that T2 is one-to-one and T1 is not (since T1(e3) = 0). Moreover, T1 T2 is given by the matrix

which is invertible, so T1 T2 is one-to-one.

21. Since T(x, y) = (0, 0) has the standard matrix , it is linear. If T(x, y) = (1, 1) were

linear, then we would have

(1, 1) = T(0, 0) = T(0 + 0, 0 + 0) = T(0, 0) + T (0, 0) = (1, 1) + (1, 1) = (2, 2)

Since this is a contradiction, T cannot be linear.

22. (a) Since T is linear,

T (0) = T(0 + 0) = T(0) + T(0) = 2T(0)

so T(0) = 0.

(b) y1 = x21 + x2

2 is an R2 → R2 mapping, y2 = x1x2 where , but T is not linear.

23. From Figure 1, we see that T(e1) = (cos 2θ, sin 2θ) and from Figure 2, that T(e2) =

cos ,sin (sin3

22

3

22 2

π θ π θ+

+

= θθ θ, cos )− 2

T( ) ( )0 0=

0 0

0 0

1 0 0

0 1 0

1 0

0 1

0 0

1 0

0 1

=

[ ] [ ]T T1 21 0 0

0 1 0

1 0

0 1

0 0

=

=

and


Figure 1 Figure 2

This, of course, should be checked for all possible diagrams, and in particular for the caseπ2 < θ < π. The resulting standard matrix is

25. (a) False. The transformation T(x1, x2) = x21 from R2 to R1 is not linear, but T(0) = 0.

(b) True. If not, T(u) = T(v) where u and v are distinct. Why?

(c) False. One must also demand that x ≠ 0.

(d) True. If c1 = c2 = 1, we obtain equation (a) of Theorem 4.3.2 and if c2 = 0, we obtainequation (b).

26. (a) Sometimes true

(b) Always false

(c) Sometimes true

27. (a) The range of T cannot be all of Rn, since otherwise T would be invertible and det(A)

≠ 0. For instance, the matrix sends the entire xy-plane to the x-axis.

(b) Since det(A) = 0, the equation T(x) = Ax = 0 will have a non-trivial solution andhence, T will map infinitely many vectors to 0.

1 0

0 0

cos sin

sin cos

2 2

2 2

θ θθ θ−

l

ly

y

x

x

(1, 0)

(0, 1)

(cos 2, sin 2)


EXERCISE SET 4.4

1. (a) (x2 + 2x – 1) – 2(3x2 + 2) = –5x2 + 2x – 5

(b) 5/4x2 + 3x) + 6(x2 + 2x + 2) = 26x2 + 27x + 6

(c) (x4 + 2x3 + x2 – 2x + 1) – (2x3 – 2x) = x4 + x2 + 1

(d) π(4x3 – 3x2 + 7x + 1) = 4πx3 – 3πx2 + 7πx + π

2. (a) Yes,

(b) This is a linear transformation, corresponding to an R4 → R4 mapping (NOT R3 → R3

as in text).

3. (a) Note that the mapping fR3 → R given by f(a, b, c) = |a| has f(1, 0, 0) = 1, f(0, 1, 0) =

0, and f(0, 0, 1) = 0. So If f was a linear mapping, the matrix would be A = (1, 0, 0).

Thus, f(–1, 0, 0) would be found as . Yet, f(–1, 0, 0) = |–1| = 1 ≠ –1.

Thus f is not linear.

(b) Yes, and here A = (1, 0, 0) by reasoning in (a).

1 0 0

1

0

0

1, ,( )−

= −

A =−

−

0 0 1 0

0 1 0 0

1 0 0 0

0 0 0 1

A =

0 0 1

0 1 0

1 0 0

205

4. (a) Yes,

(b) Yes, N =

N is (n + 2) × (n + 1) in dimensions.

(c) No, note that when W(p(x)) = xp(λ) + 1, then w(0pn) = x(0p

n) + 1 = 1 ≠ 0p

n+1 so Wis not linear.

5. (a)

6.

(a0, a1, a2, a3) → (a1, a2, a3) (a0, a1, a2, a3, a4) → (a1, a2, a3, a4)

a b( )

( )0 1 0 0

0 0 2 0

0 0 0 3

0 1 0 0

0 0 2 0

0 0 0 33

A =

( )3 0 0 0

0 2 0 0

0 0 1 0

4 0 0 0 0

0 3 0 0b

00

0 0 2 0 0

0 0 0 1 0

5 0 0 0 0 0

0 4 0 0 0 0

0 0 3

( )c 00 0 0

0 0 0 2 0 0

0 0 0 0 1 0

1 0 0 0 0 0

1 1 0

0 1 1

−

−

0 1

0 1 0

−

00 0 0 1 1 0

0 0 0 0 1 1

0 0 0

−−

0 0 1−

A =

1 0 0

0 1 0

0 0 1

0 0 0


(c)

(a0, a1, a2, a3, a4, a5) → (a1, 2a2, 3a3, 4a4, 5a5)

7. (a) T(ax + b) = (a + b)x + (a – b)

TP1 → P1

(b) T (ax + b) = ax2 + (a + b)x + (2a – b)

T P1 → P2

(c) T(ax3 + bx2 + cx + d) = (a + 2c – d)x + (2a + b + c + 3d)

TP3 → P1

7. (d) T(ax2 + bx + c) = bx

TP2 → P2

(e) T(ax2 + bx + c) = b

T2P2

→ P0

8. (a)

(b)

9. (a) 3et + 3e–t

(b) Yes, since cosh t = 12et + 12e–t, cosht corresponds to the vector (0, 0, 12, 12).

(c) A a b c=

−

0 1 0 0

0 0 0 0

0 0 1 0

0 0 0 1

( , , ,, ) ( , , , )d b c d→ −0

A a b c=−

→0 0 0

0 0 1

0 1 0

0( , , ) ( , cc b, )−

A a b c a c= −

→ −1 0 0

0 0 1

0 1 0

( , , ) ( , , bb)

0 1 0 0 0 0

0 0 2 0 0 0

0 0 0 3 0 0

0 0 0 0 4 0

0 0 0 0 0 5


10. Assume Th 4.3.2 holds.

Then T(c1u + c2v) = T(c1u) + T(c2v) = c1T(u) + c2T(v).

Thus, the principle of superposition holds. Conversely, assume the principle of superpositionis true, i.e., T(c1u + c2v) = c1T(u) + c2T(v). Let c1 = c2 = 1. Then T(u + v) = T(u) + T(v).Next, let c2 = 0. Then T(c1u) = c1T(u). This is Th. 4.3.2.

11. If S(u) = T(u) + f, f ≠ 0, then S(0) = T(0) + f = f ≠ 0. Thus S is not linear.

12. The Vandermonde system is

Solving: a0 = 0, a1 = 0, a2 = 2.

Thus, p(x) = 2x2.

13. (a) The Vandermonde system is

Solving: a0 = 1, a1 = 2, A2 = 1

P(X) = X2 + 2X + 1 = (X + 1)2

(b) The system is:

Solving: b0 = 1, b1 = 0, b2 = 1.

Thus, p(x) = 1 • (x + 2)(x) + 0 • (x + 2) + 1 = (x + 2) • x + 1

= (x + 2) • x + 1

= x2 + 2x + 1

1 0 0

1 2 0

1 3 3

1

1

4

0

1

2

=b

b

b

1 2 4

1 0 0

1 1 1

0

1

2

−

a

a

a

==

1

1

4

1 1 1

1 0 0

1 1 1

0

1

2

−

a

a

a

=

2

2

0



Solving: a0 = 0, a1 = –1, a2 = 0, a2 = 1

Thus p(x) = x3 – x

(b) The system is

Solving: b0 = 0, b1 = 0, b2 = 0, b3 = 1

Thus p(x) = (x + 1)(x – 0)(x – 1)

(c) We have

(d) We have

(e) In this case a0 = a1 = a2 = a3 = 0, so p(x) = 0

Similarly b0 = b1 = b2 = b3 = 0

b

b

b

b

0

1

2

3

1 1 1 1

0 1 1 1

0

=

− −−

00 1 0

0 0 0 1

0

1

0

1

−

=

0

0

0

1

a

a

a

a

0

1

2

3

1 1 0 0

0 1 1 1

0 0 1 0

0 0 0

=−

1

0

0

0

1

0

1

=−

00

1

1 0 0 0 0

1 1 0 0 0

1 2 2 0 0

1 3 6 6 6

1 1 1 1 0

1 0 0 0 0

1 1 1 1 0

1 2 4 8 6

− −



Thus, p(x) = 2x3 – 2x + 2

(b) The system is

Thus, p(x) = 2(x + 2)(x + 1)(x – 1) – 4(x + 2)(x + 1) + 12(x + 2) – 10

(c) We have

(d) We have

16.

det

1

1

1

2

2

2

2 2 2 2 2 2

a a

b b

c c

bc ab a c a b ac b c= + + − − −

det1

1

a

bb a= −

b

b

b

b

0

1

2

3

1 2 4 8

0 1 3 7

0

=

− −−

00 1 2

0 0 0 1

2

2

0

2

−

−

=

−

−

10

12

4

2

a

a

a

a

0

1

2

3

1 2 2 2

0 1 3 1

0 0

=

−−

11 2

0 0 0 1

10

12

4

2

−

−

=−

2

2

0

2

1 0 0 0 10

1 1 0 0 2

1 3 6 0 2

1 4 12 12 14

−

= −== −=

Solving,

b

b

b

b

0

1

2

3

10

12

4

2

1 2 4 8 10

1 1 1 1 2

1 1 1 1 2

1 2 4

− − −− −

8 14

2

2

0

0

1

2

3

== −==

Solving, a

a

a

a 22


= bc2 + ab2 + a2c – a2b – ac2 – b2c = (bc2 + ab2 – b2c – bac) + (–ac2 + A2C – a2b + abc)

= b(c2 + ab – bc – ac) – a(c2 – ac + ab – bc)

= (b – a) (c2 + ab – bc – ac)

= (b – a) [c(c – b) – a(c – b)]

= (b – a (c – b) (c – a).

The Vandermonde system for the curve through the two points (x0, y0), (x1, y1) with x0 ≠x, is

Since det = x1 – x0 ≠ 0, the above system has a unique solution for a0, a1.

Thus p(x) = a0 + a1x is a unique line.

The Vandermonde system for the curve through the data (x0, y0), (x1, y1), (x2, y2), with x0≠ x1, x0 ≠ x2, x1 ≠ x2 is

Since the determinant of = (x1 – x0)(x2 – x1) • (x2 – x0)

the above system has a unique solution for ≠0 a0, a1, a2. Thus p(x = a0 + a, x + a2x2 is

unique. If a2 ≠ 0, the curve is a parabola. If a2 = 0, the curve is a line.

17. (a)

(b)

a

a

a

x x x

x x

0

1

2

0 0 1

0 1

1

0 1

0 0 1

=−

− +

( )

b

b

b

0

1

2

a

a

x b

b

0

1

0 0

1

1

0 1

=

−

1

1

1

0 02

1 12

2 22

x x

x x

x x

1

1

1

0 02

1 12

2 22

0

1

2

x x

x x

x x

a

a

a

=

y

y

y

0

1

2

1

10

1

x

x

1

10

1

0

1

0

1

x

x

a

a

y

y

=


(c)

where

= –(x1x2x3 + x0x2x3 + x0x1x3 + x0x1x2)

= x0x1 + x0x2 + x0x3 + x1x2 + x1x3 + x2x3

18. (a) No, it is not even a function. Ex. T(x) = 12x2 + c, c any constant.

(b) Yes. Let T(f/x)) = ba

a(x)dx

Then T(af(x) + bg(x)) = ba

af(x) + bg(x) dx

= a ba

f(x)dx + b ba

g(x) dx

= aT(f(x)) + bT(g/x).

19. (a) D2 = (2 0 0)

(b)

(c) No. For example, the matrix for 1st diff. from P3 → P2 is

D21 can not be formed.

20. The matrix is the mapping

T(ax2 + bx + c) = bx in P2 → P2

While the matrix [0 1 0] is the mapping

T(ax2 + bx + c) = b in P2 → P0

0 0 0

0 1 0

0 0 0

D2

3 0 0 0

0 2 0 0

0 0 1 0

=

D26 0 0 0

0 2 0 0=

2

1

a

a

a

a

a

x x x x x x0

1

2

3

4

0 0 1 0 11

=

− − 22 0 1 2 3

0 1 1 2 0 2 0 1 1

0

0 1

0 0 1

x x x x

x x x x x x x x

x

− +( ) + +

− +

xx x

x x

1 2 2

2 30 0 0 1

0 0 0 0 1

+( )− +( )

b

b

b

b

b

0

1

2

3

4


21. (a) We first note P(xi) = 4

i. But from the expansion of the Vandermonde determinant,

where the xi

are unique, is nonzero. Hence the Vandermonde system has a uniquesolution that are the coefficients of the polynomial of nth degree through the n + 1data points. So, there is exactly one polynomial of the nth degree through n + 1 datapoints with the x

iunique. Thus, the Lagrange expression must be algebraically

equivalent to the Vandermonde form.

(b) Since ci

= yi, i = 0, 1, …, n, then the linear systems for the Vandermonde and iveritons

method remain the same.

(c) Newtons form alows for the easy addition of another point (xn+1, 4n+1) that does not

have to be in any order with respect to the other xi

values. This is done by adding anext term to p(i), pe.

p(x) = bn+1(x – x0)(x – x1)(x – x2) … (x – x

n)

+ bn(x – x0)(x – x1)(x – x2) … (x – x

n–1)

+ etc. + b1(x – x0) + b0,

where Pn(x) = b

n(x – x0)(x – x1)(x – x2) … (x – x

n–1) + … +b1(x – x0) + b0 is theinterpolant to the points (x0, y0) … (x

n, y

n). The coefficients for p

n+1(x) are found asin (4), giving an n + 1 degree polynomial. The extra point (x

n+1, y

n+1) may be the

desired interpolating value.

22.

det

23. We may assume in all cases that x = 1, since

Let (x1, x2) = (cos θ, sin θ) = since x = 1.

(a) we have T = + = + =max cos sin max cos3 1 3 22 2 2θ θ θ

x

T x

x

T x

x

T x

x

2

2

2

2

( )=

( )=

( )

1

1

1

1

0 02

0

1 12

1

1 12

1

x x x

x x x

x x x

x

n

n

n n nn

n

…

…

…− − −

xx x

x x

n nn

j k

j kj k

n

2 1…

= −

>=

( )

,

∏∏


(b)

(c)

(d) T x x x x= +

+ −

max1

2

1

2

1

2

1

21 2

2

1 2

= + =

2

21

22 1max x x

T x x= + = + =max max sin4 9 4 5 321

22 2 θ

T x x= + =max 12

22 1


EXERCISE SET 5.1

6. The pair (1, –2) is in the set but the pair (–1)(1, –2) = (–1, 2) is not because the firstcomponent is negative; hence Axiom fails. Axiom 5 also fails.

8. Axioms 1, 2, 3, 6, 9, and 10 are easily verified. Axiom 4 holds with 0 = (–1, –1) and Axiom5 holds with –(x, y) = (–x –2, –y –2). Axiom 7 fails because

k((x, y) + (x′, y′) = k(x + x′ + 1, y + y′ + 1)

= (kx + kx′ + k, ky + ky′ + k)

while

k(x, y) + k(x′, y′ = (kx, ky) + (kx′, ky)

= (kx + kx′ + 1, ky + ky′ + 1)

Hence, k(u + v) = ku + kv only if k = 1. Axiom 8 also fails, since if u = (x, y) , then

(k + )u = ((k + )x, (k + )y)

but

ku + u = (kx, ky) + (x, y)

= ((k + )x + 1, (k + )y + 1)

10. This is a vector space. Axioms 2, 3, 7, 8, 9, and 10 follow from properties of matrix additionand scalar multiplication. We verify the remaining axioms.

(1) If we add two matrices of this form, the result will again be a matrix of this form:

(*)a

b

c

d

a c

b d

0

0

0

0

0

0

+

=

++

215

(4) The 2 × 2 zero matrix is of the appropriate form and has the desired properties.

(5) If u is a matrix of the given form, then

is again of the desired form and u + (–u) = (–u) + u = 0.

(6) If u is any matrix of this form, then ku is

(**)

and ku has the desired form.

11. This is a vector space. We shall check only four of the axioms because the others followeasily from various properties of the real numbers.

(1) If f and g are real-valued functions defined everywhere, then so is f + g. We mustalso check that if f(1) = g(1) = 0, then (f + g)(1) = 0. But (f + g)(1) = f(1) + g(1)= 0 + 0 = 0.

(4) The zero vector is the function z which is zero everywhere on the real line. Inparticular, z(1) = 0.

(5) If f is a function in the set, then –f is also in the set since it is defined for all realnumbers and –f(1) = –0 = 0. Moreover, f + (–f) = (–f) + f = z.

(6) If f is in the set and k isany real number, then kf is a real valued function definedeverywhere. Moreover, kf(1) = k0 = 0.

12. This is a vector space and the proof is almost a direct repeat of that for Problem 10. In fact,we need only modify the two equations (*) and (**) in the following way:

Note that if u is any matrix of this form, then

− =− − +( )

− +( ) −

ua a b

a b b

a a b

a b b

c c d

c d d

a c a c++

+

++

=

+ +( ) + bb d

a c b d b d

ka a b

a b b

+( )+( ) + +( ) +

++

=

++

ka ka kb

ka kb kb

ka

b

ka

kb

0

0

0

0

=

− =−

−

u

a

b

0

0


13. This is a vector space with 0 = (1, 0) and –x = (1, –x). The details are easily checked.

15. We must check all ten properties:

(1) If x and y are positive reals, so is x + y = xy.(2) x + y = xy = yx = y + x(3) x + (y + z) = x(yz) = (xy) = (x + y) + z(4) There is an object 0, the positive real number 1, which is such that

1 + x = 1 • x = x = x • 1 = x + 1

for all positive real numbers x.(5) For each positive real x, the positive real 1/x acts as a negative:

x + (1/x) = x(1/x) = 1 = 0 = 1 = (1/x)x = (1/x) + x

(6) If k is a real and x is a positive real, then kx = xk is again a positive real.(7) k(x + y) = (xy)k = xkyk = kx + ky

(8) (k + )x = xk+ = xkx = kx + x

(9) k(x) = (x)k = (x)k = xk = xk = (k)x

(10) 1x = x1 = x

16. This is not a vector space, since properties (7) and (8) fail. It is easy to show, for instance,that ku + kv = k2(u + v). Property (4) holds with 0 = (1, 1) , but Property (5) failsbecause, for instance, (0, 1) has no inverse.

17. (a) Only Axiom 8 fails to hold this space. Let k and m be scalars. Then

(k + m)(x, y, z) = ((k + m)2x, (k + m)2y, (k + m)2z) = (k2x, k2y, k2z) + (2kmx, 2kmy,2kmz) + m2x, m2y, m2z)

= k(x, y, z) + m(x, y, z) + (2kmx, 2kmy, 2kmz)

≠ k(x, y, z) + m(x, y, z),

and Axiom 8 fails to hold.

(b) Only Axioms 3 & 4 fail for this space.

Axiom 3: Using the obvious notation, we have

u + (v + w) = (u1, u2, u3) + v3 + w3, v2 + w2, v1 + w1)= (u3 + v1 + w1, u2 + v2 + w2, u1 + v3 + w3)

whereas

(u + v) + w = (u3 + v3, u2 + v2, u1 + v1) + (w1, w2, w3)= (u1 + v1 + w3, u2 + v2 + w2, u3 + v3 + w1)


Thus, u + (v + w) ≠ (u + w) + w.

Axiom 4: There is no zero vector in this space. If we assume that there is, and let0 = (z1, z2, z3), then for any vector (a, b, c), we have (a, b, c) + (z1, z2,z3) = c + z3, b + z2, a + z1) = (a, b, c). Solving for the z′inferior/superioraligned

is, we have z3 = a – c, z2 = 0 adn z1 = c – a. Thus, there is no one

zero vector that will work for every vector (a, b, c) in R3.

(c) Let V be the set of all 2 × 2 invertible matrices and let A be a matrix in V. Since we areusing standard matrix addition adn scalar multiplication, the majority of axioms hold.However, the following axioms fail for this space V:

Axiom 1: Clearly if A is invertible, then so is –A. However, the matrix A + (–A) =0 is not invertible, and thus A + (–A) is not in V, meaning V is not closedunder addition.

Axiom 4: We’ve shown that the zero matrix is not in V, so this axiom fails.

Axiom 6: For any 2 × 2 invertible matrix A, det(kA) = k2 det(A), so for k ≠ 0, thematrix kA is also invertible. However, if k = 0, then kA is not invertible,so this axiom fails.

Thus, V is not a vector space.

18. Let V be the set of all matrices of the form . We will verify each of the 10 vector

space axioms using matrices A = , B = , and C = , and scalars

k and m.

(1) We can see from the given formula that the sum of two matrices in V is again amatrix in V. Thus, Axiom 1 holds.

(2) A + B = = B + A.

(3) We have

and Axiom 3 is satisfied.

A B Ca

a

b c

b c+ + =

+

++

=( ) 1

2

1 1

2 2

1

1

1

1

a b c

a b c

a b

a b

1 1 1

2 2 2

1 1

2 2

1

1

1

1

+ ++ +

=+

+

+

= + +( )

c

cA B C

1

2

1

1

a b

a b

b a

b a

1 1

2 2

1 1

2 2

1

1

1

1

++

=

++

c

c

1

2

1

1

b

b

1

2

1

1

a

a

1

2

1

1

a

b

1

1


(4) The zero vector in this space is the matrix 0 = . Then

A + 0 = = A

And similarly, 0 + A = A. Thus, Axiom 4 holds.

(5) If A = , then –A = , and we have

A + (–A) = = 0

Similarly, we have (–A) + A = 0.

(6) We can see from the given formula that if A is in V, then kA is also a matrix in V.Thus, Axiom 6 holds.

(7) We have

and Axiom 7 is satisfied.

(8) We have

and Axiom 8 holds.

( )k m Ak m a

k m a

ka ma+ =

+( )+( )

=+1

2

1 11

1

1

1 kka ma

ka

ka

ma

ma

2 2

1

2

1

2

1

1

1

1

+

=

+

= +kA kB

k A Ba b

a b

k a b

k a b( )+ =

++

=

+( )+

1 1

2 2

1 1

2

1

1

1

1 22

1 1

2 2

11

1

1

1

( )

=+

+

=

ka kb

ka kb

ka

kka

kb

kb

kA kB

2

1

2

1

1

+

= +

a

a

a

a

1

2

1

2

1

1

1

1

0 1

1 0

+

−−

=

−−

a

a

1

2

1

1

a

a

1

2

1

1

a

a

a

a

1

2

1

2

1

1

0 1

1 0

0 1

1 0

+

=

++

0 1

1 0


(9) We have

so Axiom 9 holds.

(10) Finally, we have

and Axiom 10 is also satisfied.

19. (a) Let V be the set of all ordered pairs (x, y) that satisfy the equation ax + by = c1 forfixed constants a, b and c. Since we are using the standard operations of addition andscalar multiplication, Axioms 2, 3, 5, 7, 9, 9, 10 will hold automatically. However, forAxiom 4 to hold, we need the zero vector (0, 0) to be in V. Thus a0 + b0 = c, whichforces c = 0. In this case, Axioms 1 and 6 are also satisfied. Thus, the set of all pointsin R2 lying on a line is a vector space exactly in the case when the line passes throughthe origin.

(b) Let V be the set of all ordered triples (x, y, z) that satisfy the equation ax + by + cz

= d, for fixed constants a, b, c and d. Since we are using the standard operations ofaddition and scalar multiplication, Axioms 2, 3, 5, 7, 8, 9, 10 will hold automatically.However, for Axiom 4 to hold, we need the zero vector (0, 0, 0) to be in V. Thus a0 +b0 + c0 = d, which forces d = 0. In this case, Axioms 1 and 6 are also satisfied. Thus,the set of all points in R3 lying on a plane is a vector space exactly in the case whenthe plane passes through the origin.

20. Let V be the set of all 2 × 2 invertible matrices. With the given operations, V is not a vectorspace. We will check all 10 axioms:

(1) The product of two invertible matrices is invertible, so Axiom 1 holds.

(2) In general, matrix multiplication is not commutative, so AB ≠ BA and Axiom 2 fails.

(3) Matrix multiplication is associative so A(BC) = (AB)C and Axiom 3 holds.

(4) The identity matrix functions as the zero vector here: AI = IA = A and Axiom 4holds.

(5) The inverse of A functions as –A in this case: A(A–1) = A–1A = I, so Axiom 5 holds.

a

a

a

a

a

a

1

2

1

2

1

2

1

1

1 1

1 1

1

1

=

=

k m A kma

ma

k ma

k ma( ( )) =

=

( )( )

1

2

1

2

1

1

1

1

=( )

( )

=km a

km akm A

1

2

1

1( )


(6) If k = 0, then kA is not invertible, so kA is not in V and Axiom 6 fails.

(7) Axiom 7 fails: k(AB) ≠ (kA)(kB).

(8) Axiom 8 fails as well: (k + m)A ≠ (kA)(mA).

(9) Since we are using regular scalar multiplication, Axiom 9 holds.(10) Again, since we are using regular scalar multiplication, Axiom 10 holds.

Thus, V is not a vector space since Axioms 2, 6, 7, & 8 fail.

22. Properties (2), (3), and (7)–(10) all hold because a line passing through the origin in 3-space is a collection of triples of real numbers and the set of all such triples with the usualoperations is a vector space. To verify the remaining four properties, we need only checkthat

(1) If u and v lie on the line, so does u + v.

(4) The vector (0,0,0) lies on the line (which it does by hypothesis, since the linepasses through the origin).

(5) If u lies on the line, so does –u.

(6) If u lieson the line, so does any real multiple ku of u.

We check (1), leaving (5) and (6) to you.

The line passes through the origin and therefore has the parametric equations x = at,y = bt, z = ct where a, b, and c are fixed real numbers and t is the parameter. Thus, if u andv lie on the line, we have u = (at1, bt1, ct1) and v = (at2, bt2, ct2). Therefore u + v = (a(t1+ t2), b(t1 + t2), c(t1 + t2)), which is also on the line.

25. No. Planes which do not pass through the origin do not contain the zero vector.

26. No. The set of polynomials of exactly degree 1 does not contain the zero polynomial, andhence has no zero vector.

27. Since this space has only one element, it would have to be the zero vector. In fact, this isjust the zero vector space.

28. If a vector space V had just two distinct vectors, 0 and u, then we would have to definevector addition and scalar multiplication on V. Theorem 5.1.1 ensures that ku ≠ 0 unless k= 0. Thus we would have to define ku = u for all k ≠ 0. But then we would have

2 u = (1 + 1)u = 1u + 1u = u + u ≠ 0,


so that u + u = u. However,

0u = (1 – 1)u = 1u + (–1)u = u + (–1)u = 0

which implies that (–1)u ≠ u. Hence (–1)u = 0, which is contrary to Theorem 5.1.1. ThusV cannot be a vector space.

30. We are given that ku = 0. Suppose that k ≠ 0. Then

1k

(ku) = 1kk u = (1)u = u (Axioms 9 and 10)

But

1k

(ku) = 1k0 = 0 (By hypothesis and Part (b))

Thus u = 0. That is, either k = 0 or u = 0.

33. Suppose that there are two zero vectors, 01 and 02. If we apply Axiom 4 to both of thesezero vectors, we have

01 = 01 + 02 = 02

Hence, the two zero vectorsare identical.

34. Suppose that u has two negatives, (–u)1 and (–u)2. Then

(–u)1 = (–u)1 + 0 = (–u)1 + (u + (–u)2) = ((–u)1 + u) + (–u)2 = 0 + (–u)2 = (–u)2

Axiom 5 guarantees that u must have at least one negative. We have proved that it has atmost one.

35. Following the hint, we have

(u + v) –(v + u) = (u + v) + (–(v + u)) by Theorem 5.1.1

= (u + v) + ((–1)v + (–1)u) by Property (7)

= (u + v) + (–v + (–u)) by Theorem 5.1.1

= ((u + v) + (–v)) + (–u) by Property (3)

= (u + (v + (–v)) + (–u) by Property (3)

= (u + 0) + (–u) by Property (5)

= u + (–u) by Property (4)

= 0 by Property (5)


Thus (u + v) + (–1)(v + u) = 0, from which it follows that

(u + v) + (–(v + u)) = 0 by Theorem 5.1.1

so that

[(u + v) + (–(v + u))] + (v + u) = 0 + (v + u) adding to both sides of the equation

and thus

(u + v) + (–(v + u) + (v + u)) = 0 + (v + u) by Property (3)

or

(u + v) + 0 = 0 + (v + u) by Property (5)

so that finally u + v = v + u by Property (4).


EXERCISE SET 5.2

1. (a) The set is closed under vector addition because

(a, 0, 0) + (b, 0, 0) = (a + b, 0, 0)

It is closed under scalar multiplication because

k(a, 0, 0) = (ka, 0, 0)

Therefore it is a subspace of R3.

(b) This set is not closed under either addition or scalar multiplication. For example, (a,1, 1) + (b, 1, 1) = (a + b, 2, 2) and (a + b, 2, 2) does not belong to the set. Thus it isnot a subspace.

(c) This set is closed under vector addition because

(a1, b1, 0) + (a2, b2, 0) = (a1 + a2, b1 + b2, 0).

It is also closed under scalar multiplication because

k(a, b, 0) = (ka, kb, 0).

Therefore, it is a subspace of R3.

2. (a) This set is closed under addition since the sum of two integers is again an integer.However, it is not closed under scalar multiplication since the product ku where k isreal and a is an integer need not be an integer. Thus, the set is not a subspace.

(c) If det(A) = det(B) = 0, it does not necessarily follow that det(A + B) = 0. For instance,

let A = and B = . Thus the set is not a subspace.0 0

0 1

1 0

0 0

225

(e) This set is closed under vector addition because

It is also closed under scalar multiplication because

Therefore, it is a subspace of M22.

3. (a) This is the set of all polynominals with degree ≤ 3 and with a constant term which isequal to zero. Certainly, the sum of any two such polynomials is a polynomial withdegree ≤ 3 and with a constant term which is equal to zero. The same is true of aconstant multiple of such a polynomial. Hence, this set is a subspace of P3.

(c) The sum of two polynomials, each with degree ≤ 3 and each with integral coefficients,is again a polynomial with degree ≤ 3 and with integral coefficients. Hence, the subsetis closed under vector addition. However, a constant multiple of such a polynomialwill not necessarily have integral coefficients since the constant need not be an integer.Thus, the subset is not closed under scalar multiplication and is therefore not asubspace.

4. (a) The function f(x) = –1 for all x belongs to the set, but the function (–1)f(x) = 1 for allx does not. Hence, the set is not closed under scalar multiplication and is therefore nota subspace.

(c) Suppose that f and g are in the set. Then

(f + g)(0) = f(0) + g(0) = 2 + 2 = 4

and

–2f(0) = (–2)(2) = –4

This set is therefore not closed under either operation.

(e) Let f(x) = a + b sin x and g(x) = c + d sin x be two functions in this set. Then

(f + g)(x) = (a + c) + (b + d) sin x

a a

a a

ka ka

k a k a

ka ka

− −

=

−( ) −( )

=

− kka ka( ) −( )

.

a a

a a

b b

b b

a b a b

a b a− −

+

− −

=

+ +− − − − bb

a b a b

a b a b

=

+ +− +( ) − +( )


and

k(f(x)) = ka + kb sin x

Thus, both closure properties are satisfied and the set is a subspace.

5. (b) If A and B are in the set, then aij

= –aji

and bij

= –bji

for all i and j. Thus aij

+ bij

=–(a

ji+ b

ji) so that A + B is also in the set. Also a

ij= –a

jiimplies that ka

ij= –(ka

ji), so

that kA is in the set for all real k. Thus the set is a subspace.

(c) For A and B to be in the set it is necessary and sufficient for both to be invertible, butthe sum of 2 invertible matrices need not be invertible. (For instance, let B = –A.)Thus A + B need not be in the set, so the set is not a subspace.

6. (b) The matrix reduces to

so the solution space is the line x = 2t, y = t, z = 0.

(d) The matrix reduces to the identity matrix, so the solution space is the origin.

(f) The matrix reduces to

so the solution space is the plane x –3y + z = 0.

7. (a) We look for constants a and b such that au + bv = (2, 2, 2), or

a(0, –2, 2) + b(1, 3, –1) = (2, 2, 2)

Equating corresponding vector components gives the following system of equations:

b = 2

–2a+ 3b = 2

2a – b = 2

1 3 1

0 0 0

0 0 0

−

1 2 0

0 0 1

0 0 0

−


From the first equation, we see that b = 2. Substituting this value into the remainingequations yields a = 2. Thus (2, 2, 2) is a linear combination of u and v.

(c) We look for constants a and b such that au + bv = (0, 4, 5), or

a(0, –2, 2) + b(1, 3, –1) = (0, 4, 5)

Equating corresponding components gives the following system of equations:

b = 0

–2a + 3b = 4

2a – b = 5

From the first equation, we see that b = 0. If we substitute this value into theremaining equations, we find that a = –2 and a = 5/2. Thus, the system of equations isinconsistent and therefore (0, 4, 5) is not a linear combination of u and v.

8. (a) We look for constants a, b, and c such that au + bv + cw = (–9, –7, –15); that is, such that

a(2, 1, 4) + b(1, –1, 3) + c(3, 2, 5) = (–9, –7, –15)

If we equate corresponding components, we obtain the system

2a + b + 3c = –9

a – b + 2c = –7

4a + 3b + 5c = –15

The augmented matrix for this system is


Thus a = –2, b = 1, and c = –2 and (–9, –7, –15) is therefore a linear combination ofu, v, and w.

1 0 0 2

0 1 0 1

0 0 1 2

−

−

2 1 3 9

1 1 2 7

4 3 5 15

−− −

−


8. (c) This time we look for constants a, b, and c such that

au + bv + cw = (0, 0, 0)

If we choose a = b = c = 0, then it is obvious that au + bv + cw = (0, 0, 0). We nowproceed to show that a = b = c = 0 is the only choice. To this end, we equatecomponents to obtain a system of equations whose augmented matrix is

From Part (a), we know that this matrix can be reduced to

Thus, a = b = c = 0 is the only solution.

9. (a) We look for constants a, b, and c such that

ap1

+ bp2

+ cp3

= –9 –7x –15x2

If we substitute the expressions for p1, p2, and p3 into the above equation and equatecorresponding coefficients, we find that we have exactly the same system of equationsthat we had in Problem 8(a), above. Thus, we know that a = –2, b = 1, and c = –2 andthus –2p1 + 1p2 – 2p3 = –9 – 7x – 15x2.

(c) Just as Problem 9(a) was Problem 8(a) in disguise, Problem 9(c) is Problem 8(c) indifferent dress. The constants are the same, so that 0 = 0p1 + 0p2 + 0p3.

10. (a) We ask if there are constants a, b, and c such that

4 0

2 2

1 1

2 3

0 2

1 4− −

+

−

+

=b c

6 8

1 8

−− −

1 0 0 0

0 1 0 0

0 0 1 0

2 1 3 0

1 1 2 0

4 3 5 0

−


If we multiply, add, and equate corresponding matrix entries, we obtain the followingsystem of equations:

4a + b = 6

–b + 2c = –8

–2a + 2b + c = –1

–2a + 3b + 4c = –8

This system has the solution a = 1, b = 2, c = –3; thus, the matrix is a linearcombination of the three given matrices.

(b) Clearly the zero matrix is a linear combination of any set of matrices since we canalways choose the scalars to be zero.

11. (a) Given any vector (x, y, z) in R3, we must determine whether or not there areconstants a, b, and c such that

(x, y, z) = av1 + bv2 + cv3

= a(2, 2, 2) + b(0, 0, 3) + c(0, 1, 1)

= (2a, 2a + c, 2a + 3b + c)

or

x = 2a

y = 2a + c

z = 2a + 3b + c

This is a system of equations for a, b, and c. Since the determinant of the system isnonzero, the system of equations must have a solution for any values of x, y, and z,whatsoever. Therefore, v1, v2, and v3 do indeed span R3.

Note that we can also show that the system of equations has a solution by solvingfor a, b, and c explicitly.

11. (c) We follow the same procedure that we used in Part (a). This time we obtain thesystem of equations

3a + 2b + 5c + d = x

a – 3b – 2c + 4d = y

4a + 5b + 9c – d = z



which reduces to

Thus the system is inconsistent unless the last entry in the last row of the abovematrix is zero. Since this is not the case for all values of x, y, and z, the given vectorsdo not span R3.

12. (a) Since cos(2x) = (1) cos2 x + (–1) sin2 x for all x, it follows that cos(2x) lies in thespace spanned by cos2 x and sin2 x.

(b) Suppose that 3 + x2 is in the space spanned by cos2 x and sin2 x; that is, 3 + x2 = acos2 x + b sin2 x for some constants a and b. This equation must hold for all x. If weset x = 0, we find that a = 3. However, if we set x = π, we find a = 3 + π2. Thus wehave a contradiction, so 3 + x2 is not in the space spanned by cos2 x and sin2 x.

13. Given an arbitrary polynomial a0 + a1x + a2x2 in P2, we ask whether there are numbers a,

b, c and d such that

a0 + a1x + a2x2 = ap1 + bp2 + cp3 + dp4

If we equate coefficients, we obtain the system of equations:

a0 = a + 3b + 5c – 2d

a1 = –a + b – c – 2d

a2 = 2a + 4c + 2d

1 3 2 4

0 1 1 13

11

0 0 0 0

− −

−−

−

y

x y

z 44

17

3

11

y x y−

−

3 2 5 1

1 3 2 4

4 5 9 1

x

y

z

− −−


A row-echelon form of the augmented matrix of this system is

Thus the system is inconsistent whenever –a0 + 3a1 + 2a2 ≠ 0 (for example, when a0 = 0,a1 = 0, and a2 = 1). Hence the given polynomials do not span P2.

14. (a) As before, we look for constants a, b, and c such that

(2, 3, –7, 3) = av1 + bv2 + cv3

If we equate components, we obtain the following system of equations:

2a + 3b – c = 2

a – b = 3

5b + 2c = –7

3a + 2b + c = 3


This reduces to

Thus a = 2, b = –1, and c = –1, and the existence of a solution guarantees that thegiven vector is in spanv1, v2, v3.

1 0 0 2

0 1 0 1

0 0 1 1

0 0 0 0

−−

2 3 1 2

1 1 0 3

0 5 2 7

3 2 1 3

−−

−

1 3 5 2

0 1 1 14

0 0 0 0 3

0

0 1

0

−

−+

− +

a

a a

a aa a1 22+


14. (c) Proceeding as in Part (a), we obtain the matrix

This reduces to a matrix whose last row is [0 0 0 1]. Thus the system is inconsistentand hence the given vector is not in spanv1, v2, v3.

15. The plane has the vector u × v = (0, 7, –7) as a normal and passes through the point(0,0,0). Thus its equation is y – z = 0.

Alternatively, we look for conditions on a vector (x, y, z) which will insure that it liesin spanu, v. That is, we look for numbers a and b such that

(x, y, z) = au + bv

= a(–1, 1, 1) + b(3, 4, 4)

If we expand and equate components, we obtain a system whose augmented matrix is

This reduces to the matrix

Thus the system is consistent if and only if = 0 or y = z.

17. The set of solution vectors of such a system does not contain the zero vector. Hence itcannot be a subspace of Rn.

7

− +y z

1 3

0 17

0 07

− −+

− +

x

x y

y z

−

1 3

1 4

1 4

x

y

z

2 3 1 1

1 1 0 1

0 5 2 1

3 2 1 1

−−


18. Suppose that spanv1, v2, …, vr = spanw1, w2, …, w

k. Since each vector v

iin S belongs to

spanv1, v2, …, vr, it must, by the definition of spanw1, w2, …, w

k, be a linear combination

of the vectors in S′. The converse must also hold.

Now suppose that each vector in S is a linear combination of those in S′ and conversely.Then we can express each vector v

ias a linear combination of the vectors w1, w2, …, w

k,

so spanv1, v2, …, vr ⊆ spanw1, w2, …, w

k. But conversely we have spanw1, w2, …, w

k ⊆

spanv1, v2, …, vr, so the two sets are equal.

19. Note that if we solve the system v1 = aw1 + bw2, we find that v1 = w1 + w2. Similarly, v2 =2w1 + w2, v3 = –w1 + 0w2, w1 = 0v1 + 0v2 – v3, and w2 = v1 + 0v2 + v3.

21. (a) We simply note that the sum of two continuous functions is a continuous function andthat a constant times a continuous function is a continuous function.

(b) We recall that the sum of two differentiable functions is a differentiable function andthat a constant times a differentiable function is a differentiable function.

23. (a) False. The system has the form Ax = b where b has at least one nonzero entry.Suppose that x1 and x2 are two solutions of this system; that is, Ax1 = b and Ax2 = b.Then

A(x1 + x2) = Ax1 + Ax2 = b + b ≠ b

Thus the solution set is not closed under addition and so cannot form a subspace ofR

n. Alternatively, we could show that it is not closed under scalar multiplication.

(b) True. Let u and v be vectors in W. Then we are given that ku + v is in W for all scalarsk. If k = 1, this shows that W is closed under addition. If k = –1 and u = v, then thezero vector of V must be in W. Thus, we can let v = 0 to show that W is closed underscalar multiplication.

(d) True. Let W1 and W2 be subspaces of V. Then if u and v are in W1 ∩ W2, we know thatu + v must be in both W1 and W2, as must ku for every scalar k. This follows from theclosure of both W1 and W2 under addition and scalar multiplication.

(e) False. Spanv = span2v, but v ≠ 2 v in general.

24. (a) Two vectors in R3 will span a plane if and only if one is not a constant multiple of theother. They will span a line if and only if one is a constant multiple of the other.

(b) Spanu = spanv if and only if u is a nonzero multiple of v. Why?

(c) The solution set will be a subspace of Rn if and only if b = 0. See Exercise 23(a).


25. No. For instance, (1, 1) is in W1 and (1, –1) is in W2, but (1, 1) + (1, –1) = (2, 0) is inneither W1 nor W2.

26. (a) The matrices and span M22.

27. They cannot all lie in the same plane.

0 0

0 1

1 0

0 0

0 1

0 0

0 0

1 0

, ,


EXERCISE SET 5.3

2. (a) Clearly neither of these vectors is a multiple of the other. Thus they are linearlyindependent.

(b) Following the technique used in Example 4, we consider the system of linear equations

–3k1 + 5k2 + k3 = 0

–k22

+ k3

= 0

4k1 + 2k2 + 3k3 = 0

Since the determinant of the coefficient matrix is nonzero, the system has only thetrivial solution. Therefore, the three vectors are linearly independent.

(d) By Theorem 5.3.3, any four vectors in R3 are linearly dependent.

3. (a) Following the technique used in Example 4, we obtain the system of equations

3k1 + k2+ 2k3 + k4 = 0

8k1 + 5k2 – k3 + 4k4 = 0

7k1 + 3k2+ 2k3 = 0

–3k1 – k2+ 6k3 + 3k4 = 0

Since the determinant of the coefficient matrix is nonzero, the system has only thetrivial solution. Hence, the four vectors are linearly independent.

3. (b) Again following the technique of Example 4, we obtain the system of equations

3k2 + k3 = 0

3k2 + k3 = 0

2k1 = 0

2k1 – k3 = 0

237

The third equation, above, implies that k1 = 0. This implies that k3 and hence k2 mustalso equal zero. Thus the three vectors are linearly independent.

4. (a) We ask whether there exist constants a, b, and c such that

a(2 – x + 4x2) + b(3 + 6x + 2x2) + c(2 + 10x – 4x2) = 0

If we equate the coefficients of x0, x, and x2 in the above polynomial to zero, we obtainthe following system of equations:

2a + 3b + 2c = 0

–a + 6b+ 10c = 0

4a + 2b – 4c = 0

Since the coefficient matrix of this system is invertible, the trivial solution is the onlysolution. Hence, the polynomials are linearly independent.

(d) If we set up this problem in the same way we set up Part (a), above, we obtain threeequations in four unknowns. Since this is equivalent to having four vectors in R3, thevectors are linearly dependent by Theorem 5.3.3.

5. (a) The vectors lie in the same plane through the origin if and only if they are linearlydependent. Since the determinant of the matrix

is not zero, the matrix is invertible and the vectors are linearly independent. Thusthey do not lie in the same plane.

6. (a) Since v2 = –2v1, the vectors v1 and v2 lie on the same line. But since v3 is not amultiple of v1 or v2, the three vectors do not lie on the same line through the origin.

(c) Since v1 = 2v2 = –2v3, these vectors all lie on the same line through the origin.

7. (a) Note that 7v1 – 2v2 + 3v3 = 0.

2 6 2

2 1 0

0 4 4

−−


8. (a) By inspection, we see that

(1, 2, 3, 4) + (0, 1, 0, –1) = (1, 3, 3, 3)

so we have the linear combination (1, 2, 3, 4) + (0, 1, 0, –1) – (1, 3, 3, 3) = 0 and thisis a linearly dependent set.

(b) Using the notation v1 = (1, 2, 3, 4), v2 = (0, 1, 0, –1), and v3 = (1, 3, 3, 3), the equationfrom Part (a) becomes v3 = v1 + v2. Solving this for v1 and then v2, we have the threedependence relations

v1 = –v2 + v3

v2 = –v1 + v3

v3 = v1 + v2.

9. If there are constants a, b, and c such that

a(λ, –1/2, –1/2) + b(–1/2, λ, –1/2) + c(–1/2, –1/2, λ) = (0, 0, 0)

then

The determinant of the coefficient matrix is

This equals zero if and only if λ = 1 or λ = –1/2. Thus the vectors are linearly dependent forthese two values of λ and linearly independent for all other values.

11. Suppose that S has a linearly dependent subset T. Denote its vectors by w1,…, wm

. Thenthere exist constants k

i, not all zero, such that

k1w1 + … + km

wm

= 0

But if we let u1, …, un–m

denote the vectors which are in S but not in T, then

k1w1 + … + km

wm

+ 0u1 + … + 0un–m

= 0

λ λ λ λ32

3

4

1

41

1

2( )− − = − +

λλ

λ

− −− −− −

1 2 1 2

1 2 1 2

1 2 1 2

a

b

c =

0

0

0


Thus we have a linear combination of the vectors v1, …, vn

which equals 0. Since not all ofthe constants are zero, it follows that S is not a linearly independent set of vectors, contraryto the hypothesis. That is, if S is a linearly independent set, then so is every non-emptysubset T.

13. This is similar to Problem 10. Since v1, v2, …, vr is a linearly dependent set of vectors,

there exist constants c1, c2, …, cr

not all zero such that

c1v1 + c2v2 + … + cr

vr

= 0

But then

c1v1 + c2v2 + … + crv

r+ 0v

r+1 + … + 0vn

= 0

The above equation impliesthat the vectors v1, …, vn

are linearly dependent.

15. Suppose that v1, v2, v3 is linearly dependent. Then there exist constants a, b, and c not allzero such that

(*) av1 + bv2 + cv3 = 0

Case 1: c = 0. Then (*) becomes

av1 + bv2 = 0

where not both a and b are zero. But then v1, v2 is linearly dependent, contrary tohypothesis.

Case 2: c ≠ 0. Then solving (*) for v3 yields

v3 = – ac

v1 – bc

v2

This equation implies that v3 is in spanv1, v2, contrary to hypothesis. Thus, v1, v2, v3is linearly independent.

16. Note that (u – v) + (v – w) + (w – u) = 0.

18. Any nonzero vector forms a linearly independent set. The only scalar multiple of a nonzerovector which can equal the zero vector is the zero scalar times the vector.


20. (a) Since sin2 x + cos2 x = 1, we observe that

2(3 sin2 x) + 3(2 cos2 x) + (–1)(6) = 0

Hence the vectors are linearly dependent.

(c) Suppose that there are constants a, b, and c such that

a(1) + b sin x + c sin 2x = 0

Setting x = 0 yields a = 0. Setting x = π/2 yields b = 0, and thus, since sin 2x 0, wemust also have c = 0. Therefore the vectors are linearly independent.

(e) Suppose that there are constants a, b, and c such that

a(3 – x)2 + b(x2 – 6x) + c(5) = 0

or

(9a + 5c) + (–6a – 6b)x + (a + b)x2 = 0

Clearly a = –b = –(5/9)c. Thus a = 5, b = –5, c = –9 is one solution and the vectors arelinearly dependent.

This conclusion may also be reached by noting that the determinant of the systemof equations

9a + 5c = 0

–6a – 6b = 0

a + b = 0

is zero.

21. (a) The Wronskian is

Thus the vectors are linearly independent.

1

0 1

0 0

0

x e

e

e

e

x

x

x

x= ≡


21. (b) The Wronskian is

Thus the vectors are linearly independent.

23. Use Theorem 5.3.1, Part (a).

24. (a) False. There are 6 such matrices, namely

Since M22 has dimension 4, at least two of these matrices must be linear combinationsof the other four. In fact, it is easy to show that A1, A2, A3, and A4 are linearlyindependent and that A5 = –A2 + A3 + A4 and A6 = –A1 + A3 + A4.

(b) False. One of the vectors might be the zero vector. Otherwise it would be true.

(d) False. A finite set of vectors can be linearly dependent without containing the zerovector.

26. We could think of any 3 linearly independent vectors in R3 as spanning R3. That is, theycould determine directions for 3 (not necessarily orthogonal) coordinate axes. Then anyfourth vector would represent a point in this coordinate system and hence be a linearcombination of the other 3 vectors.

A A A4 5 60 1

1 0

0 1

0 1

0 0

1 1=

=

=

A A A1 2 31 1

0 0

1 0

1 0

1 0

0 1=

=

=

sin cos sin

cos sin sin cos

sin cos

x x x x

x x x x x

x x

− +− − 2ccos sin

sin cos sin

cos sin sin cos

x x x

x x x x

x x x x

−= − + xx

x

x x x x

0 0 2

2 2 02 2

cos

cos sin cos ) cos= − − = − ≡(


EXERCISE SET 5.4

2. (a) This set is a basis. It has two vectors and neither is a multiple of the other.

(c) This set is not a basis since one vector is a multiple of the other.

3. (a) This set has the correct number of vectors and they are linearly independent because

= 6 ≠ 0

Hence, the set is a basis.

(c) The vectors in this set are linearly dependent because

= 0

Hence, the set is not a basis.

4. (a) The vectors in this set are linearly dependent because

= 0

Thus, the set is not a basis. (Compare with Problem 3(c), above.)

1 1 1

3 1 7

2 4 0

− −

2 4 0

3 1 7

1 1 1

− −

1 2 3

0 2 3

0 0 3

243

4. (c) This set has the correct number of vectors and

= 1 ≠ 0

Hence, the vectors are linearly independent and therefore are a basis.

5. The set has the correct number of vectors. To show that they are linearly independent, weconsider the equation

If we add matrices and equate corresponding entries, we obtain the following system ofequations:

3a + d = 0

6a – b – 8c = 0

3a – b – 12c – d = 0

–6a – 4c + 2d = 0

Since the determinant of the coefficient matrix is nonzero, the system of equations hasonly the trivial solution; hence, the vectors are linearly independent.

6. (a) Recall that cos 2x = cos2 x – sin2 x; that is,

1v1 + (–1)v2 + (–1)v3 = 0

Hence, S is not a linearly independent set of vectors.

(b) We can use the above identity to write any one of the vectors vi

as a linearcombination of the other two. Since no one of these vectors is a multiple of any other,they are pairwise linearly independent. Thus any two of these vectors form a basis forV.

7. (a) Clearly w = 3u1 – 7u2, so the coordinate vector relative to u1, u2 is (3, –7)

a b c3 6

3 6

0 1

1 0

0 8

12 4−

+

−−

+

−− −

+

−

=

d

1 0

1 2

0 0

0 0

1 0 0

1 1 0

1 1 1


(b) If w = au1 + bu2, then equating coordinates yields the system of equations

2a + 3b = 1

–4a + 8b = 1

This system has the solution a = 5/28, b = 3/14. Thus the desired coordinate vector is(5/28, 3/14).

8. (a) Since w = 12(u1 + u2), the coordinate vector of w relative to S = u1, u2 is (w)S

= (12, 12).

(b) Since w = 12(–u1 + u2), the coordinate vector of w relative to S = u1, uθ2 is (w)S

=

(– 12, 12).

(c) Since w = u2 = 0u1 + u2, the coordinate vector of w relative to S = u1, u2 is (w)S

=(0, 1).

9. (a) If v = av1 + bv2 + cv3, then

a + 2b + 3c = 2

2b + 3c =–1

3c = 3

From the third equation, c = 1. Plugging this value into the second equation yields b= –2, and finally, the first equation yields a = 3. Thus the desired coordinate vector is(3, –2, 1).

10. (b) If p = ap1 + bp2 + cp3, then equating coefficients yields the system of equations

a + b = 2

a + c = –1

b + c = 1

This system has the solution a = 0, b = 2, and c = –1. Thus the desired coordinatevector is (0, 2, –1).

12. The augmented matrix of the system reduces to

Hence, x1 = s, x2 = 0, and x3 = s. Thus the solution space is spanned by (1, 0, 1) and hasdimension 1.

1 0 1 0

0 1 0 0

0 0 0 0

−


15. If we reduce the augmented matrix to row-echelon form, we obtain

Thus x1 = 3r – s, x2 = r, and x3 = s, and the solution vector is

Since (3, 1, 0) and (–1, 0, 1) are linearly independent, they form a basis for the solutionspace and the dimension of the solution space is 2.

16. Since the determinant of the system is not zero, the only solution is x1 = x2 = x3 = 0.

Hence there is no basis for the solution space and its dimension is zero.

19. (a) Any two linearly independent vectors in the plane form a basis. For instance, (1, –1,–1) and (0, 5, 2) are a basis because they satisfy the plane equation and neither is amultiple of the other.

(c) Any nonzero vector which lies on the line forms a basis. For instance, (2, –1, 4) willwork, as will any nonzero multiple of this vector.

(d) The vectors (1, 1, 0) and (0, 1, 1) form a basis because they are linearly independentand

a(1, 1, 0) + c(0, 1, 1) = (a, a + c, c)

20. This space is spanned by the vectors 0, x, x2 and x3. Only the last three vectors form alinearly independent triple. Thus the space has dimension 3.

21. (a) We consider the three linear systems

− + =− =− =

k k

k k

k k

1 2

1 2

1 2

1 0 0

2 2 0 1 0

3 2 0 0 1

x

x

x

r s

r

s

1

2

3

3 3

1

0

=−

=

+−

r s

1

0

1

1 3 1 0

0 0 0 0

0 0 0 0

−


which give rise to the matrix

A row-echelon form of the matrix is

from which we conclude that e3 is in the span of v1, v2, but e1 and e2 are not. Thusv1, v2, e1 and v1, v2, e2 are both bases for R3.

22. We consider the linear system of equations k1v1 + k2v2 + k3e1 + k4e2 + k5e3 + k6e4 = 0which give rise to the coefficient matrix

A row-echelon form of the matrix is

If we eliminate any two of the final four columns of this matrix, we obtain a 4 × 4 matrix.If the determinant of this matrix is zero, then the system of equations has a nontrivialsolution, so the corresponding vectors are linearly dependent. Otherwise, they are linearlyindependent. If we include column 3 (which corresponds to e1) in the determinant, itsvalue is zero. Otherwise, it is not. Thus we may add any two of the vectors e2, e3, and e4 tothe set v1, v2 to obtain a basis for R4.

1 3 1 0 0 0

0 1 1 1 4 0 0

0 0 0 1 2 0

0 0 0 0 1 2 3

−− −

1 3 1 0 0 0

4 8 0 1 0 0

2 4 0 0 1 0

3 6 0 0 0 1

−−

−−

1 1 1 0 0

0 1 3 0 1

0 0 1 1 2 0

− −

−−−

1 1 1 0 0

2 2 0 1 0

3 2 0 0 1


23. Since u1, u2, u3 has the correct number of elements, we need only show that they arelinearly independent. Let

au1 + bu2 + cu3 = 0

Thus

av1 + b(v1 + v2) + c(v1 + v2 + v3) = 0

or

(a + b + c)v1 + (b + c)v2 + cv3 = 0

Since v1, v2, v3 is a linearly independent set, the above equation implies that a + b + c =b + c = c = 0. Thus, a = b = c = 0 and u1, u2, u3 is also linearly independent.

24. (a) Note that the polynomials 1, x, x2, …, xn form a> set of n + 1 linearly independentfunctions in F(–∞, ∞).

(b) From Part (a), dim F(–∞, ∞) > n for any positive integer n. Thus F(–∞, ∞) is infinite-dimensional.

25. First notice that if v and w are vectors in V and a and b are scalars, then (av + bw)S

=a(v)

S+ b(w)

S. This follows from the definition of coordinate vectors. Clearly, this result

applies to any finite sum of vectors. Also notice that if (v)S

= (0)S, then v = 0. Why?

Now suppose that k1v1 + … + krv

r= 0. Then

(k1v1 + … + krv

r)

S= k1(v1)S

+ … + kr(v

r)

S

= (0)S

Conversely, if k1(v1)S+ … + kr(vr)S = (0)S, then

(k1v1 + … + krvr)S

= (0)S, or k1v1 + … + k

rv

r= 0

Thus the vectors v1, …, vr

are linearly independent in V if and only if the coordinate vectors(v1)S

, …, (vr)

Sare linearly independent in Rn.


26. If every vector v in V can be written as a linear combination v = a1v1 + … + arv

rof v1, …,

vr, then, as in Exercise 24, we have (v)S

= a1(v1)S+ … + a

r(v

r)

S. Hence, the vectors (v1)S

,…, (v

r)

Sspan a subspace of Rn. But since V is an n-dimensional space with, say, the basis

S = u1, …, un, then if u = b1u1 + … + b

nu

n, we have (u)

S= (b1, …, b

n); that is, every

vector in Rn represents a vector in V. Hence (v1)S, …, (v

r)

S spans Rn.

Conversely, if (v1)S, …, (v

r)

S spans Rn, then for every vector (b1, …, b

n) in Rn, there

is an r-tuple (a1, …, ar) such that

(b1, …, bn) = a1(v1)S

+ … + ar (vr)S

= (a1v1 + … + arvr)S

Thus a1v1 + … + arv

r= b1u1 + … + b

nu

n, so that every vector in V can be represented as

a linear combination of v1, …, vr.

27. (a) Let v1, v2, and v3 denote the vectors. SinceS = 1, x, x2 is the standard basis for P2, wehave (v1)S

= (–1, 1, –2), (v2)S= (3, 3, 6), and (v3)S

= (9, 0, 0). Since (–1, 1, –2), (3,3, 6), (9, 0, 0) is a linearly independent set of three vectors in R3, then it spans R3.Thus, by Exercises 24 and 25, v1, v2, v3 is linearly independent and spans P2. Henceit is a basis for P2.

28. (a) It is clear from the picture that the x′-y′ coordinates of (1, 1) are (0, 2).

(d) Let (a, b) and (a′, b′) denote the coordinates of a point with respect to the x-y and x′-y′ coordinate systems, respectively. If (a, b) is positioned as in Figure (1), then wehave

a′ = a – b and b′ = 2b

y

l

l

y'

x and x'45∞

2


(d)

Figure 1 Figure 2

These formulas hold no matter where (a,b) lies in relation to the coordinate axes.Figure (2) shows another configuration, and you should draw similar pictures for all ofthe remaining cases.

30. See Theorem 5.4.7.

31. There is. Consider, for instance, the set of matrices

Each of these matrices is clearly invertible. To show that they are linearly independent,consider the equation

aA + bB + cC + dD =

This implies that

0 1 1 1

1 0 1 1

1 1 0 1

1 1 1 0

a

b

c

d

=

0

0

0

0

0 0

0 0

A B

C D

=

=

=

=

0 1

1 1

1 0

1 1

1 11 1

0 1and

11 0

y'

y

(a, b)

a

b

x and x'

(a', b')

45∞45∞

2b

a' b

y'

y

(a, b)

x and x'

(a', b')

45∞aa

b

2b

45∞


The above 4 × 4 matrix is invertible, and hence a = b = c = d = 0 is the only solution. Andsince the set, A, B, C, and D consists of 4 linearly independent vectors, it forms a basis forM22.

32. (b) The most general 2 × 2 symmetric matrix has the form . Hence, if n = 2,the matrices

form a basis and the dimension is therefore 3.

If n = 3, the matrices

form a basis and the dimension is therefore 6.

In general, there are n2 elements in an n × n matrix with on or

above the main diagonal. Therefore the dimension of the subspace of n × n symmetric

matrices is n(n + 1)/2.

(c) Since there are n2 elements in each n × n matrix, with n elements on the maindiagonal, there are (n2 – n)/2 elements above the diagonal. Thus, any triangular matrixcan have at most (n2 – n)/2 + n = (n2 + n)/2 nonzero elements. The set of n × n

matrices consisting of all zeros except for a 1 in each of these spots will form a basisfor the space. Consequently, the space will have dimension n(n + 1)/2.

33. (a) The set has 10 elements in a 9 dimensional space.

n n n n2

2

1

2

+=

+( )

1 0 0

0 0 0

0 0 0

, ,

0 0 0

0 1 0

0 0 0

,

0 0 0

0 0 0

0 0 1

0 1 0

1 0 0

0 0 0

00 0 1

0 0 0

1 0 0

0 0 0

0 0 1

0 1 0

, and

1 0

0 0and

, ,

0 0

0 1

0 1

1 0

a c

c b


35. (b) The equation x1 + x2 + … + xn

= 0 can be written as x1 = –x2 –x3 – … –xn

where x2,x3, …, x

ncan all be assigned arbitrary values. Thus, its solution space should have

dimension n – 1. To see this, we can write

The n – 1 vectors in the above equation are linearly independent, so the vectors doform a basis for the solution space.

36. (a) If p1(1) = p2(1) = 0, then (p1 + p2)(1) = p1(1) + p2(1) = 0. Also, kp1(1) = 0. Hence Wis closed under both vector addition and scalar multiplication.

(b) We are looking at the set W of polynomials ax2 + bx + c in P2 where a + b + c = 0 orc = –(a + b). Thus it would appear that the dimension of W is 2.

(c) The polynomial p(x) = ax2 + bx + c will be in the subspace W if and only if p(1) = a+ b + c = 0, or c = –a – b. Thus,

where a and b are arbitrary. The vectors and are linearly independent and

hence the polynomials x2 – 1 and x – 1 form a basis for W.

0

1

1−

1

0

1−

a

b

c

a

b

a b

a

=− −

=−

1

0

1

+−

b

0

1

1

−

+

−

1

1

0

0

0

1

0

1

0

0

3

x

+ +

−

xn

1

0

0

0

1

x

x

x

x

x x x

n

n1

2

3

2 3

=

− − − −x

x

2

3

xn


EXERCISE SET 5.5

3. (b) Since the equation Ax = b has no solution, b is not in the column space of A.

(c) Since A = b, we have b = c1 – 3c2 + c3.

(d) Since A = b, we have b = c1 + (t – 1)c2 + tc3 for all real numbers t.

5. (a) The general solution is x1 = 1 + 3t, x2 = t. Its vector form is

Thus the vector form of the general solution to Ax = 0 is

(c) The general solution is x1 = – 1 + 2r – s – 2t, x2 = r, x3 = s, x4 = t. Its vector form is

−

+

+

−1

0

0

0

2

1

0

0

r s

11

0

1

0

2

0

0

1

+

−

t

t3

1

1

0

3

1

+

t

1

1t

t

−

1

3

1

−

253

Thus the vector form of the general solution to Ax = 0 is

6. (a) Since the reduced row-echelon form of A is

the solution to the equation Ax = 0 is x1 = 16t, x2 = 19t, x3 = t. Thus

is a basis for the nullspace.

(c) Since the reduced row-echelon form of A is

one solution to the equation Ax = 0 is x1 = –s + 2t, x2 = –s – 4t, x3 = s, x4 = 7t. Thusthe set of vectors

is a basis for the nullspace.

−−

−

1

1

1

0

2

4

0

7

and

1 0 1 2 7

0 1 1 4 7

0 0 0 0

−

16

19

1

1 0 16

0 1 19

0 0 0

−−

r s t

2

1

0

0

1

0

1

1

+

−

+

−22

0

0

1


(e) The reduced row-echelon form of A is

One solution to the equation Ax = 0 is x1 = –2s – 16t, x2 = 2t, x3 = 5t, x4 = s, and x5= 12t. Hence the set of vectors

is a basis for the nullspace of A.

8. (a) From a row-echelon form of A, we have that the vectors (1, –1, 3) and (0, 1, –19) area basis for the row space of A.

(c) From a row-echelon form of A, we have that the vectors (1, 4, 5, 2) and (0, 1, 1, 4/7)are a basis for the row space of A.

9. (a) One row-echelon form of AT is

Thus a basis for the column space of A is

1

5

7

0

1

1

and

1 5 7

0 1 1

0 0 0

−

−

2

0

0

1

0

16

2

5

0

12

and

1 0 0 2 4 3

0 1 0 0 1 6

0 0 1 0 5 12

0 0 0 0 0

0 0 0 0 0

−−


9. (c) One row-echelon form of AT is

Thus a basis for the column space of A is

10. (a) Since, by 8(a), the row space of A has dimension 2, any two linearly independent rowvectors of A will form a basis for the row space. Because no row of A is a multiple ofanother, any two rows will do. In particular, the first two rows form a basis and Row 3= Row2 + 2(Row 1).

(c) Refer to 8(c) and use the solution to 10(a), above. In particular, the first two rowsform a basis and Row 3 = Row 1 – Row 2.

(e) Let r1, r2, …, r5 denote the rows of A. If we observe, for instance, that r1 = –r3 + r4 andthat r2 = 2r1 + r5, then we see that r3, r4, r5 spans the row space. Since the dimensionof this space is 3 (see the solution to Exercise 6(e)), the set forms a basis.

For those who don’t wish to rely on insight, set ar1 + br2 + cr3 + dr4 + er5 = 0 andsolve the resulting homogeneous system of equations by finding the reduced row-echelon form for AT. This yields

a = –s + 2t

b = –t

c = –s

d = s

e = t

so that (–s + 2t)r1 – tr2 – sr3 + sr4 + tr5 = 0, or

s(–r1 – r3 + r4) + t(2r1 – r2 + r5) = 0

1

2

1

0

1

1−

−

and

1 2 1

0 1 1

0 0 0

0 0 0

−−


Since this equation must hold for all values of s and t, we have

r1 = –r3 + r4 and r2 = 2r1 + r5

which is the result obtained above.

11. (a) The space spanned by these vectors is the row space of the matrix

One row-echelon form of the above matrix is

and the reduced row-echelon form is

Thus (1, 1, –4, –3), (0, 1, –5, –2), (0, 0, 1, –1/2) is one basis. Another basis is (1, 0,0, –1/2), (0, 1, 0, –9/2), (0, 0, 1, –1/2).

12. (a) If we solve the vector equation

(*) av1 + bv2 + cv3 + dv4 = 0

we obtain the homogeneous system

a – 3b – c – 5d = 0

3b + 3c + 3d = 0

a + 7b + 9c + 5d = 0

a + b + 3c – d = 0

1 0 0 1 2

0 1 0 9 2

0 0 1 1 2

−−−

1 1 4 3

0 1 5 2

0 0 1 1 2

− −− −

−

1 1 4 3

2 0 2 2

2 1 3 2

− −−

−


The reduced row-echelon form of the augmented matrix is

Thus v1, v2 forms a basis for the space. The solution is a = –2s + 2t, b = –s – t, c = s,d = t. This yields

(–2s + 2t)v1 + (–s – t)v2 + sv3 + tv4 = 0

or

s(–2v1 – v2 + v3) + t(2v1 – v2 + v4) = 0

Since s and t are arbitrary, set s = 1, t = 0 and then s = 0, t = 1 to obtain thedependency equations

–2v1 – v2 + v3 = 0

2v1 – v2 + v4 = 0

Thus

v3 = 2v1 + v2

and

v4 = –2v1 + v2

12. (c) If we solve the vector equation

(*) av1 + bv2 + cv3 + dv4 + ev5 = 0

we obtain the homogeneous system

a – 2b + 4c – 7e = 0

–a + 3b – 5c + 4d+ 18e = 0

5a + b + 9c + 2d + 2e = 0

2a + 4c – 3d – 8e = 0

1 0 2 2 0

0 1 1 1 0

0 0 0 0 0

0 0 0 0 0

−


The reduced row-echelon form of the augmented matrix is

This tells us that v1, v2, v4 is the desired basis. The solution is a = –2s + t, b = s – 3t,c = s, d = –2t, and e = t. This yields

(–2s + t)v1 + (s – 3t)v2 + sv3 – 2tv4 + tv5 = 0

or

s(–2v1 + v2 + v3) + t(v1 – 3v2 – 2v4 + v5) = 0

Since s and t are arbitrary, set s = 1, t = 0 and then s = 0, t = 1 to obtain thedependency equations

–2v1 + v2 + v3 = 0

v1 – 3v2 – 2v4 + v5 = 0

Thus

v3 = 2v1 – v2

and

v5 = –v1 + 3v2 + 2v4

13. Let A be an n × n invertible matrix. Since AT is also invertible, it is row equivalent to In. It

is clear that the column vectors of In

are linearly independent. Hence, by virtue of Theorem5.5.5, the column vectors of AT, which are just the row vectors of A, are also linearlyindependent. Therefore the rows of A form a set of n linearly independent vectors in Rn,and consequently form a basis for Rn.

15. (a) We are looking for a matrix so that the only solution to the equation Ax = 0 is x = 0.

Any invertible matrix will satisfy this condition. For example, the nullspace of the

matrix A = is the single point (0, 0, 0).1 0 0

0 1 0

0 0 1

1 0 2 0 1 0

0 1 1 0 3 0

0 0 0 1 2 0

0 0 0 0 0 0

−−


(b) In this case, we are looking for a matrix so that the solution of Ax = 0 is

one-dimensional. Thus, the reduced row-echelon form of A has one column without

a leading one. As an example, the nullspace of the matrix A = is

span , a plane in R3.

(c) In this case, we are looking for a matrix so that the solution space of Ax = 0 is

two-dimensional. Thus, the reduced row-echelon form of A has two columns without

leading ones. As an example, the nullspace of the matrix A = is

span , a plane in R3.

16. (a) True. Since premultiplication by an elementary matrix is equivalent to an elementaryrow operation, the result follows from Theorem 5.5.3.

15. (c) False. For instance, let A = and EA = . Then the column space of A is

span and the column space of EA is span . These are not the same

spaces.

(d) True by Theorem 5.5.1

(e) False. The row space of an invertible n × n matrix is the same as the row space of In,

which is Rn. The nullspace of an invertible matrix is just the zero vector.

1

0

1

2

1 1

0 0

1 1

2 2

−

1

1

0

1

0

1

,

1 1 1

0 0 0

0 0 0

−

1

1

1

1 0 1

0 1 1

0 0 0

−−


17. (a) The matrices will all have the form where s andt are any real numbers.

(b) Since A and B are invertible, their nullspaces are the origin. The nullspace of C is theline 3x + y = 0. The nullspace of D is the entire xy-plane.

18. Let A = [1 1 1]. Then Ax = [1] has the particular solution [1 0 0]T and Ax = [0] has thegeneral solution [s t – s – t]T. Thus the general solution can be written as

19. Theorem: If A and B are n × n matrices and A is invertible, then the row space of AB is therow space of B.

Proof: If A is invertible, then there exist elementary matrices E1, E2, …, Ek

such that

A = E1E2 … Ekin

or

AB = E1E2 … EkB

Thus, Theorem 5.5.4 guarantees that AB and B will have the same row spaces.

x

x

x

s

1

2

3

1

0

0

1

0

1

=

+

+−

t

0

1

1

3 5

3 5

3 5

0 0

0 0

3 5

s s

t ts t

−−

=

−

+

−


EXERCISE SET 5.6

2. (a) The reduced row-echelon form for A is

Thus rank (A) = 2. The solution to Ax = 0 is x = 16t, y = 19t, z = t, so that the nullityis one. There are three columns, so we have 2 + 1 = 3.

(c) The reduced row-echelon form for A is

Thus rank (A) = 2. The null space will have dimension two since the solution to Ax =0 has two parameters. There are four columns, so we have 2 + 2 = 4.

4. Recall that rank(A) is the dimension of both the row and column spaces of A. Use theDimension Theorem to find the dimensions of the nullspace of A and of AT, recalling that ifA is m × n, then AT is n × m, or just refer to the chart in the text.

7. Use Theorems 5.6.5 and 5.6.7.

(a) The system is consistent because the two ranks are equal. Since n = r = 3, n – r = 0and therefore the number of parameters is 0.

(b) The system is inconsistent because the two ranks are not equal.

1 0 1 2 7

0 1 1 4 7

0 0 0 0

−

1 0 16

0 1 19

0 0 0

−−

263

7. (d) The system is consistent because the two ranks are equal. Here n = 9 and r = 2, sothat n – r = 7 parameters will appear in the solution.

(f) Since the ranks are equal, the system is consistent. However A must be the zeromatrix, so the system gives no information at all about its solution. This is reflected inthe fact that n – r = 4 – 0 = 4, so that there will be 4 parameters in the solution for the4 variables.

9. The system is of the form Ax = b where rank(A) = 2. Therefore it will be consistent if andonly if rank([A|b]) = 2. Since [A|b] reduces to

the system will be consistent if and only if b3 = 4b2 – 3b1, b4 = –b2 + 2b1, and b5 = 8b2 – 7b1,where b1 and b2 can assume any values.

10. Suppose that A has rank 2. Then two of its column vectors are linearly independent. Thus,by Theorem 5.6.9, at least one of the 2 × 2 submatrices has nonzero determinant.

Conversely, if at least one of the determinants of the 2 × 2 submatrices is nonzero, then,by Theorem 5.6.9, at least two of the column vectors must be linearly independent. Thusthe rank of A must be at least 2. But since the dimension of the row space of A is at most2, A has rank at most 2. Thus, the rank of A is exactly 2.

11. If the nullspace of A is a line through the origin, then it has the form x = at, y = bt, z = ct

where t is the only parameter. Thus nullity(A) = 3 – rank(A) = 1. That is, the row andcolumn spaces of A have dimension 2, so neither space can be a line. Why?

1 3

0 1

0 0 4 3

0 0 2

0 0 8

1

2 1

3 2 1

4 2 1

5 2

−−

− ++ −

−

b

b b

b b b

b b b

b b ++

7 1b


12. (a) If we attempt to reduce A to row-echelon form, we find that

A→ if t ≠ 0

→ if t ≠ 1

→ if t ≠ – 2

Thus rank(A) = 3 if t ≠ 0, 1, –2. If t = 0, rank(A) = 3 by direct computation. If t = 1,rank(A) = 1 by inspection, and if t = –2, rank(A) = 2 by the above reduction.

13. Call the matrix A. If r = 2 and s = 1, then clearly rank(A) = 2. Otherwise, either r – 2 or s– 1 = 0 and rank(A) = 3. Rank(A) can never be 1.

14. Call the matrix A and note that rank(A) is either 1 or 2. Why? By Exercise 10, rank(A) =1 if and only if

Thus we must have x2 – y = xy – z = y2 – xz = 0. If we let x = t, the result follows.

16. (a) The column space of the matrix

is the xy-plane.

1 0 1

0 1 0

0 0 0

x y

x

x z

y

y z

x y10

10 0, ,= = =and

1 1

0 1 1

0 0 2

1 1

0 1 1

0 0 1

t

t

t

−+

→ −

1 1

0 1 1

0 1 1

t

t

−− − +

( )

1 1

0 1 1

1 1

1 1

0 1 1

0 1 1

t

t t

t

t

t t

t t

− −

→ − −

− − 22


16. (b) The nullspace is the line x = t, y = 0, z = –t.

(c) The row space is the plane x – z = 0.

(d) In general, if the column space of a 3 × 3 matrix is a plane through the origin then thenullspace will be a line through the origin and the row space will be a plane throughthe origin. This follows from the fact that if the column space is a plane, the matrixmust have rank 2 and therefore nullity 1. The one dimensional subspaces of R3 arelines through the origin and the two dimensional subspaces are planes through theorigin.

Similarly, if the column space represents a line through the origin, then so does therow space. In this case, the nullspace will represent a plane through the origin.

17. (a) False. Let A =

(c) True. If A were an m × n matrix where, say, m > n, then it would have m rows, eachof which would be a vector in R

n. Thus, by Theorem 5.4.2, they would form a linearly

dependent set.

18. (a) Since the row rank equals the column rank and since A has 3 rows and more than 3columns, its maximum rank is 3. Hence, the number of leading 1’s in its reduced row-echelon form is less than or equal to 3.

(b) Since nullity (A) ≤ 5, there could be as many as 5 parameters in the general solutionof Ax = 0. The maximum of 5 would occur if A were the zero matrix.

(c) Since A has 3 columns and more than 3 rows, its maximum rank is 3. Hence themaximum number of leading 1’s in the reduced row-echelon form of A is 3.

(d) Since nullity (A) ≤ 3, there could be as many as 3 parameters in the general solutionto Ax = 0.

1 0 0

0 1 0



1. (b) The augmented matrix of this system reduces to

Therefore, the solution space is a plane with equation 2x – 3y + z = 0

(c) The solution is x = 2t, y = t, z = 0, which is a line.

2. Let A be the coefficient matrix. Since det(A) = –(1 – s)2(2 + s), the solution space is theorigin unless s = 1 or s = –2. If s = 1, the solution space is the plane x1 + x2 + x3 = 0. If s= –2, the solution space is the line x1 = t, x2 = t, x3 = t.

Alternative Solution: Let A be the coefficient matrix. We can use the DimensionTheorem and the result of Exercise 12(a) of Section 5.6 to solve this problem. Recall thatrank(A) = 1 if s = 1, rank(A) = 2 if s = –2, and rank(A) = 3 for all other values of s. Hencenullity(A) = 2 if s = 1, nullity(A) = 1 if s = –2, and nullity(A) = 0 for all other values of s.Thus, if s = 1, then the solution space is a two-dimensional subspace of R3, i.e., a planethrough the origin. If s = –2, then the solution space is a one-dimensional subspace of R3,i.e., a line through the origin. If s ≠ 1 and s ≠ –2, then the solution space is the zero-dimensional subspace of R3, i.e., 0.

4. (a) The identities

sin(x + θ) = cos θ sin x + sin θ cos x

cos(x + θ) = cos θ cos x – sin θ sin x

hold for all values of x and θ. Hence

2 3 1 0

0 0 0 0

0 0 0 0

−

267

(*)f1 = (cos θ)f + (sin θ)g

g1 = (– sin θ)f + (cos θ)g

That is, f1 and g1 are linear combinations of f and g and therefore belong to W.

4. (b) If we solve the system (*) for f and g, we obtain

f = (cos θ)f1 + (– sin θ)g1

g = (sin θ)f1 + (cos θ)g1

Hence, any linear combination of f and g is also a linear combination of f1 and g1 andthus f1 and g1 span W. Since the dimension of W is 2 (it is spanned by 2 linearlyindependent vectors), Theorem 5.4.6(b) implies that f1 and g1 form a basis for W.

5. (a) We look for constants a, b, and c such that v = av1 + bv2 + cv3, or

a + 3b + 2c = 1

–a + c = 1

This system has the solution

a = t – 1 b = 23 – t c = t

where t is arbitrary. If we set t = 0 and t = 1, we obtain v = (–1)v1 + (2/3)v2 and v =(–1/3)v2 + v3, respectively. There are infinitely many other possibilities.

(b) Since v1, v2, and v3 all belong to R2 and dim(R2) = 2, it follows from Theorem 5.4.2 thatthese three vectors do not form a basis for R2. Hence, Theorem 5.4.1 does not apply.

6. Suppose that there are constants c1, …, cn, not all zero, such that c1Av1 + … + c

nA v

n= 0.

Then

A(c1v1 + … + cnv

n) = 0

Since the vectors v1, …, vn

are linearly independent, the n × 1 matrix c1v1 + … + cnv

n

cannot equal 0. Thus the equation Ax = 0 has a non-trivial solution, and so A is notinvertible. Therefore, by Theorem 1.5.3, A is invertible if and only if Av1, Av2, …, Av

nare

linearly independent.


7. Consider the polynomials x and x + 1 in P1. Verify that these polynomials form a basis forP1.

8. (c) Since the odd numbered rows are all repeats of Row 1 and the even numbered rowsare all repeats of Row 2, while Rows 1 and 2 are linearly independent, an n × n

checker board matrix has rank 2 whenever n ≥ 2. Since the nullity is n minus therank, we have nullity = n – 2.

10. (a) If p belongs to the set, then it contains only even powers of x. Since this set is closedunder polynomial addition and scalar multiplication (Why?), it is a subspace of P

n.

One basis is the set 1, x2, x4, …, x2m where 2m = n if n is even and 2m = n – 1 if nis odd.

(b) If p belongs to this set, then its constant term must be zero. Since this set is closedunder polynomial addition and scalar multiplication (Why?), it is a subspace of P

n.

One basis is the set x, x2, …, xn.

12. (a) A 3 × 3 symmetric matrix has the form A = , so a basis is formed by the

six matrices

(b) A 3 × 3 skew-symmetric matrix has the form A = , so a basis is formed

by the three matrices

13. (a) Since = – 1≠ 0, the rank is 2.

(b) Since all three 2 × 2 subdeterminants are zero, the rank is 1.

1 0

2 1−

0 1 0

1 0 0

1 0 0

0 0 1

0 0 0

1 0 0

−

−

, ,,

0 0 0

0 0 1

1 1 0−

0

0

0

a b

a c

b c

−− −

1 0 0

0 0 0

0 0 0

0 1 0

1 0 0

0 0 0

, , ,

0 0 1

0 0 0

1 0 0

0 0 0

0 1 0

0 0 0

,, ,

0 0 0

0 0 1

0 1 0

0 0 0

0 0 0

0 0 1

a b c

b d e

c e f


(c) Since the determinant of the matrix is zero, its rank is less than 3. Since = –1≠ 0, the rank is 2.

(d) Since the determinant of the 3 × 3 submatrix obtained by deleting the last column is30 ≠ 0, the rank of the matrix is 3.

14. Call the matrix A. Since the determinant of every 5 × 5, 4 × 4, and 3 × 3 submatrix is zero,rank(A) ≤ 2. Since

det = –a5ja

i6

for i = 1, 2, …, 4 and j = 1, …, 5, then rank(A) = 2 if any of these determinants is nonzero.Otherwise, if anyof the numbers a

ij≠ 0, then rank(A) = 1 and if a

ij= 0 for all i and j, then

rank(A) = 0.

15. (b) Let S

= v1, …, vn and let u = u1v1 + … + u

nv

n. Thus (u)

S= (u1, …, u

n). We have

ku = ku1v1 + … + kunv

n

so that (ku)S

= (ku1, …, kun) = k(u

1, …, un). Therefore (ku)S = k(u)S.

0 6

5 56

a

a a

i

j

1 0

2 1−


EXERCISE SET 6.1

1. (c) Since v + w = (3, 11), we have

⟨u, v + w⟩ = 3(3) + (–2)(11) = –13

On the other hand,

⟨u, v⟩ = 3(4) + (–2)(5) = 2

and

⟨u, w⟩ = 3(–1) + (–2)(6) = –15

(d) Since ku = (–12, 8) and kv = (–16, –20) , we have

⟨ku, v⟩ = (–12)(4) + (8)(5) = –8

and

⟨u, kv⟩ = 3(–16) + (–2)(–20) = –8

Since ⟨u,v⟩ = 2, k⟨u, v⟩ = –8.

2. (c) Since v + w = (3, 11), we have

⟨u, v + w⟩ = 4(3)(3) + 5(–2)(11) = –74

On the other hand,

⟨u, v⟩ = 4(3)(4) + 5(–2)(5) = –2

and

⟨u, w⟩ = 4(3)(–1) + 5(–2)(6) = –72

271

3. (a) ⟨u, v⟩ = 3(–1) – 2(3) + 4(1) + 8(1) = 3

4. (a) ⟨p, q⟩ = (–2)(4) + 1(0) + 3(–7) = –29

5. (a) By Formula (4),

(b) We have ⟨u, v⟩ = 9(–3)(1) + 4(2)(7) = 29.

6. (a) Since

then

= [ 5v1 – v2 –v1 + 10v2 ]

= 5u1v1 – u1v2 – u2v1 + 10u2v2

7. (a) By Formula (4), we have ⟨u, v⟩ = vT AT Au where

A =

3 0

0 5

u

u

1

2

v v1 2

1

2

5 1

1 10,

−−

u

u

A AT =

−−

5 1

1 10

uu,,vv =

=

v vu

u1 2

1

2

3 0

0 2

3 0

0 2

vv vu

u

v vu

u

1 21

2

1 21

9 0

0 4

9 4

= 22

1 1 2 29 4

= +u v u v


8. (a) (1) ⟨u, v⟩ = 3u1v1 + 5u2v2 = 3v1u1 + 5v2u2 = ⟨v, u⟩

(2) If w = (w1, w2), then

⟨u + v, w⟩ = 3(u1 + v1)w1 + 5(u2 + v2)w2

= (3u1w1 + 5u2w2) + (3v1w1 + 5v2w2)

= ⟨u, w⟩ + ⟨v, w⟩

(3) ⟨ku, v⟩ = 3(ku1)v1 + 5(ku2)v2

= k(3u1v1 + 5u2v2)

= k⟨u, v⟩

(4) ⟨v, v⟩ = 3v12 + 5v2

2 ≥ 0

Moreover, ⟨v, v⟩ = 0 if and only if v1 = v2 = 0, or v = 0.

9. (b) Axioms 1 and 4 are easily checked. However, if w = (w1, w2, w3), then

⟨u + v, w⟩ = (u1 + v1)2w1

2 + (u2 + v2)2w2

2 + (u3 + v3)2w3

2

= ⟨u, w⟩ + ⟨v, w⟩ + 2u1v1w12 + 2u2v2w2

2 + 2u3v3w32

If, for instance, u = v = w = (1, 0, 0), then Axiom 2 fails.

To check Axiom 3, we note that ⟨ku, v⟩ = k2⟨u, v⟩. Thus ⟨ku, v⟩ ≠ k⟨u, v⟩ unless k= 0 or k = 1, so Axiom fails.

(c) (1) Axiom 1 follows from the commutativity of multiplication in R.

(2) If w = (w1, w2, w3), then

⟨u + v, w⟩ = 2(u1 + v1)w1 + (u2 + v2)w2 + 4(u3 + v3)w3

= 2u1w1 + u2w2 + 4u3w3 + 2v1w1 + v2w2 + 4v3w3

= ⟨u, w⟩ + ⟨v, w⟩

(3) ⟨ku, v = 2(ku1)v1 + (ku2)v2 + 4(ku3)v3 = k⟨u, v⟩

(4) ⟨v, v⟩ = 2v12 + v2

2 + 4v32 ≥ 0

= 0 if and only if v1 = v2 = v3 = 0, or v = 0

Thus this is an inner product for R3.


10. (c) Since , we have

Thus w = 125 = 5 5.

11. We have u – v = (–3, –3).

(b) d(u, v) = (–3, –3) = [3(9) + 2(9)]1/2 = 45 = 3 5

(c) From Problem 10(c), we have

Thus

d(u, v) = 117 = 3 13

12. (a) p = [(–2)2 + (3)2 + (2)2]1/2 = 17

13. (a) A = [(–2)2 + (5)2 + (3)2 + (6)2]1/2 = 74

14. Since p – q = 1 – x – 4x2, we have

d(p, q) = ⟨p – q, p – q⟩1/2 = [12 + (–1)2 + (–4)2]1/2 = 3 2

d( , )u v[ ] = − − −

−

−−

23 3

2 1

1 13

3

3

= 117

w2 1 3

2 1

1 13

1

3= −

−−

−

= [[ ]−−

=5 40

1

3125

A AT =

−−

2 1

1 13


15. (a) Since , we have

d(A, B) = ⟨A – B, A – B⟩1/2 = [62 + (–1)2 + 82 + (–2)2]1/2 = 105

16. (a) We have ⟨u + v, v + w⟩ = ⟨u, v⟩ + ⟨u, w⟩ + ⟨v, v⟩ + ⟨v, w⟩ = 2 + 5 + (2)2 –3 = 8.

(d) We have u + v2 = ⟨u + v, u + v⟩ = ⟨u, u⟩ + 2⟨u, v⟩ + ⟨v, v⟩ = (1)2 + 2(2) + (2)2 = 9.Hence u + v = 3.

17. (a) For instance,

(b) We have

d(p, q)= p – q

= 1 – x

= −

= − +

−

−

∫

∫

( )

( )

/

1

1 2

21

1 1 2

21

1

x dx

x x dx

= − +

=

−

1 2

23

1

1 1 2

3

22

3

/

/

x xx

=1 2

2

36

/

x x dxx

=

=

=−

−∫ 2

1

1 1 2 3

1

1 1 2

3

2

3

1 2

A B− =−−

6 1

8 2


18. (a) Since we are looking for points (x, y) with (x, y) = 1, we have

or 1 . This is the ellipse shown in the figure.

20. Observe that

(*) u + v2 = ⟨u + v, u + v⟩ = ⟨u, u⟩ + 2⟨u, v⟩ + ⟨v, v⟩

and

(**) u – v2 = ⟨u –v, u – v⟩ = ⟨u, u⟩ – 2⟨u, v⟩ + ⟨v, v⟩

Thus if we add (*) and (**), we obtain the equation

u + v2 + u – v2 = 2u2 + 2v2

21. If, in the solution to Exercise 20, we subtract (**) from (*) and divide by 4, we obtain thedesired result.

22. If, for instance, , then ⟨V, V⟩ = –2 ⟨ 0. Thus Axiom 4 fails.V =−

0 1

1 0

y

4

-2

-4

2

x

1

4

1

1612 2

x y+ = .

1

4

1

1612 2

1 2

x y+

=


23. Axioms 1 and 3 are easily verified. So is Axiom 2, as shown: Let r = r(x) be a polynomialin P2. Then

⟨p + q, r⟩ = [(p + q)(0)]r(0) + [(p + q)(1/2)]r(1/2) + [(p + q)(1)]r(1)

= p(0)r(0) + p(1/2)r(1/2) + p(1)r(1) + q(0)r(0) + q(1/2)r(1/2) + q(1)r(1)

= ⟨p, r⟩ + ⟨q, r⟩

It remains to verify Axiom 4:

⟨p, p⟩ = [p(0)]2 + [p(1/2)]2 + [p(1)]2 ≥ 0

and

⟨p, p⟩ = 0 if and only if p(0) = p(1/2) = p(1) = 0

But a quadratic polynomial can have at most two zeros unless it is identically zero. Thus⟨p, p⟩ = 0 if and only if p is identically zero, or p = 0.

24. Using the hint and properties of the transpose, we have

⟨u, Av⟩ = (Av)Tu = (vTAT)u = vT(ATu) = ⟨ATu, v⟩

26. (1) Axiom 1 follows immediately from the commutativity of multiplication in the realnumbers.

(2) If z = (z1, …, zn), then

⟨u + v, z⟩ = w1 (u1 + v1)z1 + … + wn(u

n+ v

n)z

n

= (w1u1z1 + … + wnu

nz

n) + (w1v1z1 + … + w

nv

nz

n)

= ⟨u, z⟩ + ⟨v, z⟩

(3) If k is any constant, then

⟨ku, v⟩ = w1(ku1)v1 + … + wn(ku

n)v

n

= k(w1u1v1 + … + wnu

nv

n)

= k⟨u, v⟩


(4) Since ⟨v, v⟩ = w1v12 + … + w

nv

n2 where w1, …, w

nare positive reals, then ⟨v, v⟩ ≥

0 and ⟨v, v⟩ = 0 if and only if v1 = … = vn

= 0 or v = 0. If one of the constants, wi,

were negative or zero, then Axiom 4 would fail to hold.

27. (b) ⟨p, q⟩ = (x – 5x3) (2 + 8x2) dx = (2x – 2x3 – 40x5) dx

= x2 – x4/2 – 20x6/3 = 0

28. (a) ⟨f, g⟩ = cos 2πx sin 2πx dx = (1/4π) sin2 2πx = 0

29. We have ⟨U, V⟩ = u1v1 + u2v2 + u3v3 + u4v4 and

which does, indeed, equal ⟨U, V⟩.

30. We use the formula ⟨u, v⟩ = vT AT Au as suggested in the hint.

(1) We must show that ⟨u, v⟩ = ⟨v, u⟩. Since ⟨v, u⟩ is a real number which can bethought of as a 1 × 1 matrix, it will suffice to show that ⟨u, v⟩ = ⟨v, u⟩T. But

⟨v, u⟩T = (uT AT Av)T = vT AT Au = ⟨u, v⟩

tr tr 1( )U Vu u

u u

v v

v v

T =

3

2 4

1 2

3 4

=+ ++ +

tru v u v u v u v

u v u v u v u

1 1 3 3 1 2 3 4

1 1 4 3 2 2 4vv

u v u v u v u v

4

1 1 3 3 2 2 4 4

= + + +

0

1

∫0

1

∫

0

1

∫

-1

1

∫-1

1

∫


(2) We have

⟨u + v, w⟩ = wT AT A(u + v)

= wT AT Au + wT AT Av

= ⟨u, w⟩ + ⟨v, w⟩

(3) Here ⟨ku, v⟩ = vT AT A(ku) = k(vT AT Au) = k⟨u, v⟩.

(4) Since ⟨v, v⟩ = vT AT Av= Av • Av ≥ 0, we need only show that ⟨v, v⟩ = 0 if and onlyif v = 0. But ⟨v, v⟩ = 0 if and only if Av = 0, or v = A–10 = 0, since A is invertible.

Therefore ⟨v, v⟩ ≥ 0 for every vector v. It is clear that if v = 0, then ⟨v, v⟩ = 0. It remainsto show that if v ≠ 0, then ⟨v, v⟩ ≠ 0.

To see this, we notice that the only way that ⟨v, v⟩ can equal zero is for v • ri

to equalzero for every row r

iof A. But this implies that a linear combination of the columns of A is

the n × 1 zero matrix. That is, if we let ci

be the ith column of A, then

v1c1 + v2c2 + … + vnc

n= 0

Therefore the column vectors of A are linearly dependent, and hence A is not invertible.This is why the matrix A has to be invertible to generate an inner product. If A is invertible,then ⟨v, v⟩ can only = 0 when v = 0.

31. Calling the matrix A, we have

⟨u, v⟩ = vT AT Au = vT A2u = w1u1

v1 + … + w

nu

nv

n


33. To prove Part (a) of Theorem 6.1.1 first observe that ⟨0, v⟩ = ⟨v, 0⟩ by the symmetry axiom.Moreover,

⟨0, v⟩ = ⟨00, v⟩ by Theorem 5.1.1

= 0⟨0, v⟩ by the homogeneity axiom

= 0

Alternatively,

⟨0, v⟩ + ⟨0, v⟩ = ⟨0 + 0, v⟩ by additivity

= ⟨0, v⟩ by definition of the zero vector

But ⟨0, v⟩ = 2⟨0, v⟩ only if ⟨0, v⟩ = 0.

To prove Part (d), observe that, by Theorem 5.1.1, –v (the inverse of v) and (–1)v arethe same vector. Thus,

⟨u – v, w⟩ = ⟨u + (–v), w⟩

= ⟨u, w⟩ + ⟨–v, w⟩ by additivity

= ⟨u, w⟩ – ⟨v, w by homogeneity

34. The set of points ⟨(x,y), (x,y)⟩ = x2/52 + y2/42 = 1 is the unit circle with respect to theweighted Euclidean inner product ⟨u, v⟩ = (1/25)u1v1 + (1/16)u2v2.


EXERCISE SET 6.2

1. (e) Since u • v = 0 + 6 + 2 + 0 = 8, the vectors are not orthogonal.

2. Let’s assume that these scalars k and l exist. Then for these three vectors to be mutuallyorthogonal we have

⟨u, v⟩ = ⟨(2, k, 6), (l, 5, 3)⟩ = 2l + 5k + 18 = 0

⟨u, w⟩ = ⟨(2, k, 6), (1, 2, 3)⟩ = 2 + 2k + 18 = 0

⟨v, w⟩ = ⟨(l, 5, 3), (1, 2, 3)⟩ = l + 10 + 9 = 0.

Solving the second and third equations we have k = –10 and l = –19. However, if wesubstitute these values into the first equation, we have ⟨u, v⟩ = 2(–19) + 5(–10) + 18 = –70≠ 0. Thus, there are no scalars k and l so that u, v, and w are mutually orthogonal.

3. We have ku + v = (k + 6, k + 7, –k – 15), so

ku + v = ⟨(ku + v), (ku + v)⟩1/2

= [(k + 6)2 + (k + 7)2 + (–k – 15)2]1/2

= (3k2 + 56k + 310)1/2

Since ku + v = 13 exactly when ku + v2 = 169, we need to solve the quadratic equation3k2 + 56k + 310 = 169 to find k. Thus, values of k that give ku + v = 13 are k = –3 or k =–47/3.

4. We have ⟨u, w1⟩ = ⟨u, w3⟩ = 0 but ⟨u, w2⟩ = –2 ≠ 0. Hence, u is not orthogonal to W.

281

5. (a)

(c)

(e)

6. (a)

(Compare with 3(c), above.)

7. ⟨p, q⟩ = (1)(0) + (–1)(2) + (2)(1) = 0

8. (a)

9. (b) = (2)(1) + (1)(1) + (–1)(0) + (3)(–1) = 0

Thus the matrices are orthogonal.

9. (d) = 4 + 1 – 5 + 6 = 6 = 0

Thus the matrices are not orthogonal.

10. (a) We look for values of k such that

⟨u, v⟩ = 2 + 7 + 3k = 0

Clearly k = –3 is the only possible value.

2 1

1 3

2 1

5 2−

,

2 1

1 3

1 1

0 1−

−

,

cos,

θ = =+ + +

=A B

A B

6 12 1 0

50 14

19

10 7

cos,

θ =− + + + −

− + + + −

1 5 2 2 4 9

1 5 2 2 4 9

2 2

2

x x x x

x x x x22

2 20 18

30 1010=

− + −=

1 0 1 0 3 3 3 3

1 0 1 0 3 3 3 3

, , , , , , ,

, , , , , ,

( ) − − − −( )( ) − − − −

==− −

=−3 3

2 36

1

2

cos, , , , ,

, , , ,θ =

−( ) −( )−( ) −

=−1 5 2 2 4 9

1 5 2 2 4 9

22 20 18

30 1010

+ −=

cos( , ),( , )

( , ) ,θ =

−−

=−

=1 3 2 4

1 3 2 4

2 12

10 20

−−1

2


11. We must find two vectors x = (x1, x2, x3, x4) such that ⟨x, x⟩ = 1 and ⟨x, u⟩ = ⟨x, v⟩ = ⟨x,w⟩ = 0. Thus x1, x2, x3, and x4 must satisfy the equations

x12 + x2

2 + x32 + x4

2 = 1

2x1 + x2 – 4x3

= 0

–x1

– x2+ 2x

3+ 2x

4= 0

3x1+ 2x2+ 5x3 + 4x4 = 0

The solution to the three linear equations is x1 = –34t, x2 = 4t, x3 = –6t, and x4 = 11t. If wesubstitute these values into the quadratic equation, we get

[(–34)2 + (44)2 + (–6)2 + (11)2] t2 = 1

or

Therefore, the two vectors are

12. (a) Here ⟨u, v⟩2 = (3(4) + 2(–1))2 = 100, while ⟨u, u⟩ ⟨v, v⟩ = (32 + 22) (42 + (–1)2) = 221.

13. (a) Here ⟨u, v⟩2 = (3(–2)(1) + 2(1)(0))2 = 36, while, on the other hand, ⟨u, u⟩⟨v, v⟩ =(3(–2)2 + 2(1)2) 3(1)2 + 2(0)2 = 42.

14. Since W is the line y = 2x, then the space W of all vectors which are perpendicular to

every vector in W is just the line y = – 12x. Note that .

15. (a) Here W is the line which isnormal to the plane and which passes through the origin.By inspection, a normal vector to the plane is (1, –2, –3). Hence this line hasparametric equations x = t, y = –2t, z = –3t.

( , ) ,x x x x21

20⋅ −

=

± − −1

5734 44 6 11( , , , )

t = ±1

57


16. (a) A row-echelon form of A is

Therefore the vectors v1 = (1, 2, –1, 2) and v2 = (0, 1, –3, 2) form a basis for the rowspace of A. From the above matrix, we see that the nullspace of A consists of allvectors of the form (–5s + 2t, 3s – 2t, s, t) where s and t are parameters. If we set s =1, t = 0 and s = 0, t = 1, we obtain a basis w1 = (–5, 3, 1, 0), w2 = (2, –2, 0, 1) for thenullspace of A. It is easy to check that v1

• w1 = v1• w2 = v2

• w1 = v2• w2 = 0, which

gives the desired result.

17. (a) The subspace of R3 spanned by the given vectors is the row space of the matrix

which reduces to

The space we are looking for is the nullspace of this matrix. From the reduced form,we see that the nullspace consists of all vectors of the form (16, 19, 1)t, so that thevector (16, 19, 1) is a basis for this space.

Alternatively the vectors w1 = (1, –1, 3) and w2 = (0, 1, –19) form a basis for therow space of the matrix. They also span a plane, and the orthogonal complement ofthis plane is the line spanned by the normal vector w1 × w2 = (16, 19, 1).

18. (b) Since v2 = 2v1, we are looking for a basis for the orthogonal complement of the spacespanned by v1. (This space is the line containing v1 and its orthogonal complement isthe plane 2x – z = 0.) The solution space for the equation 2x1 + 0x2 – x3 = 0 is the setof vectors (Alternatively, the subspace W of R3 spanned by the given vectors v1 and v2is the row space of the matrix A, where

Since (2, 0, –1) is a basis for the row space, then (2, 0, –1) is a basis for W. To find abasis for W, note that (1, 0, 2) and (0, 1, 0) are a basis for the null-space of A andhence for W. It is obvious that W is the plane 2x – z = 0.

A =−−

→

−

2 0 1

4 0 2

2 0 1

0 0 0

1 1 3

0 1 19

0 0 0

−−

1 1 3

5 4 4

7 6 2

−− −−

1 2 1 2

0 1 3 2

0 0 0 0

−−


(c) The orthogonal complement of the subspace of R4 spanned by the given vectors is thenullspace of the matrix

which reduces to

Thus the nullspace consists of all vectors of the form (–s + 2t, –s – 4t, s, 7t) and thevectors (–1, –1, 1, 0) and (2, –, 0, 7) form a basis for this space.

19. If u and v are orthogonal vectors with norm 1, then

u – v = ⟨u – v, u – v⟩1/2

= [⟨u, u – 2⟨u, v⟩ + ⟨v, v⟩]1/2

= [1 – 2(0) + 1]1/2

= 2

20. If ⟨w, u1⟩ = ⟨w, u2⟩ = 0, then

⟨w, k1u1 + k2u2⟩ = ⟨w, k1u1⟩ + ⟨w, k2u2⟩

= k1 ⟨w, u1⟩ + k2 ⟨w, u2⟩

= 0

Thus w is orthogonal to k1u1 + k2u2.

Now consider R3 with the Euclidean inner product. If w is perpendicular to both u1 andu2 and they determine a plane, then w is perpendicular to every vector in that plane. If u1and u2 determine a line, then w must be perpendicular to every vector in that line. If u1 andu2 determine neither a plane nor a line, then u1 = u2 = 0 and the result is not of interest.

21. By definition, u is in span u1, u2, …, ur if and only if there exist constants c1, c2, …, c

rsuch

that

u = c1u1 + c2u2 + … + cru

r

But if ⟨w, u1⟩ = ⟨w, u2⟩ = … = ⟨w, ur⟩ = 0, then ⟨w, u⟩ = 0.

1 4 5 2

0 1 1 4 7

0 0 0 0

/

1 4 5 2

2 1 3 0

1 3 2 2−


22. If a vector z is orthogonal to all of the basis vectors, then, by the result of Exercise 19, itis orthogonal to every linear combination of the basis vectors, and hence to every vector inV. Thus, in particular, ⟨z, z⟩ = 0, so, by Axiom 4, z = 0.

23. We have that W = spanw1, w2, …, wk

Suppose that w is in W. Then, by definition, ⟨w, wi⟩ = 0 for each basis vector w

iof W.

Conversely, if a vector w of V is orthogonal to each basis vector of W, then, by Problem20, it is orthogonal to every vector in W.

24. Note that

v1 + … + vr2 = ⟨v1 + … + v

r, v1 + … + v

r⟩f=

=

If vi, v

j= 0 whenever i ≠ j, then the result follows.

25. (c) By Property (3) in the definition of inner product, we have

ku2 = ⟨ku, ku⟩ = k2⟨u, u⟩ = k2u2

Therefore ku = |k| u.

26. (d) We must show that for all u, v, and w

u – v ≤ u – w + w – v

If we apply the triangle inequality to the vectors u – w and w – v, then we have

((u – w) + (w – v) ≤ u – w + w – v

and the result follows.

27. This is just the Cauchy-Schwarz Inequality using the inner product on Rn generated by A(see Formula (4) of Section 6.1).

= + + +≠∑v v v v1

2 2L r i j

i j

,

v v v vi i

i

r

i j

i j

r

, ,= ≠∑ ∑+

1


28. If we let R2 have the Euclidean inner product and apply the Cauchy-Schwarz Inequality tothe vectors u = (a, b) and v = (cos θ, sin θ), the result follows directly.

30. Suppose that

(*) ⟨u, v⟩2 = ⟨u, u⟩ ⟨v, v⟩

Following the proof of Theorem 6.2.1, we see that (*) can hold if and only if either u = 0or the quadratic equation ⟨tu + v, tu + v⟩ = 0 has just one real root. If u = 0, then u and vare linearly dependent. If the equation has one real root, t0, then ⟨t0u + v, t0u + v⟩ = 0.Thus, by Axiom 4, t0u + v = 0, and again u and v are linearly dependent.

Conversely, suppose that u and v are linearly dependent. Then either u = 0 or v can beuniquely expressed as a multiple, –t20, of u; that is, t0u + v = 0. Thus if we work backwardthrough the above argument, we can verify that ⟨u, v⟩2 = ⟨u, u⟩ ⟨v, v⟩.

31. We wish to show that ABC is a right angle, or that and are orthogonal. Observe

that = u – (–v) and = v – u where u and v are radii of the circle, as shown in the

figure. Thus u = v. Hence

⟨ , ⟩ = ⟨u + v, v – u⟩

= ⟨u, v⟩ + ⟨v, v⟩ + ⟨u, –u⟩ + ⟨v, –u⟩

= ⟨v, u⟩ + ⟨v, v⟩ – ⟨u, u⟩ – ⟨v, u⟩

= v2 – u2

= 0

32. Suppose that ⟨u, v⟩ = au1v1 + bu2v2 is a weighted Euclidean inner product on R2. For u andv to be orthogonal unit vectorswith respect to this inner product, we must have

⟨u, u⟩ = a + 3b = 1

⟨v, v⟩ = a + 3b = 1

⟨u, v⟩ = –a + 3b = 0

which yields a = 1/2 and b = 1/6.

33. (a) As noted in Example 9 of Section 6.1, 0

1f(x)g(x)dx is an inner product on C[0, 1].

Thus the Cauchy-Schwarz inequality must hold, and that is exactly what we’re askedto prove.

BC

AB

BC

AB

BC

AB


(b) In the inner product notation, we must show that

⟨f + g, f + g⟩1/2 ≤ ⟨f, f⟩1/2 + ⟨g, g⟩1/2

or, squaring both sides, that

⟨f + g, f + g⟩ ≤ ⟨f, f⟩ + 2⟨f, f⟩1/2 ⟨g, g⟩1/2 + ⟨g, g⟩

For any inner product, we know that

⟨f + g, f + g⟩ = ⟨f, f⟩ + 2⟨f, g⟩ + ⟨g, g⟩

By the Cauchy-Schwarz inequality

⟨f, g⟩2 ≤ ⟨f, f⟩ ⟨g, g⟩

or

⟨f, g⟩ ≤ ⟨f, f⟩1/2 ⟨g, g⟩1/2

If we substitute the above inequality into the equation for ⟨f + g, f + g⟩, we obtain

⟨f + g, f + g⟩ ≤ ⟨f, f⟩ + 2⟨f, f⟩1/2 ⟨g, g⟩1/2 + ⟨g, g⟩

as required.

34. Recall that cos (α + β) = cos α cos β – sin α sin β. Thus

cos(α – β) = cos α cos(–β) – sin α sin(–β)

= cos α cos β + sin α sin β

Adding these two equations gives the identity

cos(α + β) + cos(α – β) = 2 cos α cos β


Thus if k =, then k + and k – are nonzero and

= 0

35. (a) W is the line y = –x.

(b) W is the xz-plane.

(c) W is the x-axis.

36. (a) If the solution space is a line in R3, then, by Theorem 6.2.6, the row space is theorthogonal complement to that line, or the plane through the origin orthogonal to theline.

(c) If the homogeneous system AT x = 0 has a unique solution, then that solution must bethe zero vector. Therefore the nullspace of AT is the vector space consisting of just thezero vector. Its orthogonal complement, which is all of R3, is the column space andhence also, because they have the same dimension, the row space of A. This alsofollows from the fact that AT and hence also A must be invertible.

37. (b) False. Let n = 3, let V be the xy-plane, and let W be the x-axis. Then V is the z-axisand W is the yz-plane. In fact V is a subspace of W

(c) True. The two spaces are orthogonal complements and the only vector orthogonal toitself is the zero vector.

(d) False. For instance, if A is invertible, then both its row space and its column space areall of Rn.

=

= + +

∫ cos( )cos( )

cos( ) cos(

kx lx dx

k l x

0

1

2

π

kk l x dx

k l x

k l

k l

)

sin sin

−

=+( )

++

−

∫0

1

2

π

(( )−

x

k l0

π


38. (a) Let W be the subspace of all diagonal matrices in M22. Thus, a vector D = in W

is fully described by its diagonal elements a and b. A vector M22 is in W

W if and only if

⟨D, V⟩ = tr(DTV) = ax + bw = 0

for all possible values of a and b. This can only happen if x = w = 0. Thus, W is thesubspace of all matrices in M22 with only zeros on the diagonal.

(b) Let W be the subspace of all symmetric matrices in M22. Thus, the elements of W are

matrices of the form S = . A matrix V = in M22 is in W if and only if

⟨S, V⟩ = tr(STV) = tr(SV)

= ax + cz + cy + bw = 0

for all values of a, b, and c. This implies that x = w = 0 and y = –z. Thus, W is thesubspace of all skew-symmetric matrices in M22.

x y

z w

a c

c b

Vx y

z w=

a

b

0

0


EXERCISE SET 6.3

2. (a) The vectors are orthogonal, but since (2, 0) ≠ 1, they are not orthonormal.

(c) The vectors are not orthogonal and therefore not orthonormal.

4. (a) The set is not orthonormal because the last two vectors are not orthogonal.

(c) The set is not orthonormal because the last two vectors are not orthogonal.

5. See Exercise 3, Parts (b) and (c).

6. (a) Denote the vectors by v1, v2, v3, and v4. Then

⟨v1, vi⟩ = 0 for i = 2, 3, 4

⟨v2, v3⟩ = 49 – 29 – 29 = 0

⟨v2, v4⟩ = 29 + 29 – 49 = 0

⟨v3, v4⟩ = 29 – 49 + 29 = 0

Thus, the vectors are orthogonal. Moreover, v1 = 1 and v2 = v3 = v4 =

49 + 19 + 49 = 1. The vectors are therefore orthonormal.

291

7. (b) Call the vectors u1, u2 and u3. Then ⟨u1, u2⟩ = 2 – 2 = 0 and ⟨u1, u3⟩ = ⟨u2, u3⟩ = 0. The

set is therefore orthogonal. Moreover, u1 = 2, u2 = 8 = 2 2, and u3 = 25

= 5. Thus is an orthonormal set.

9. It is easy to verify that v1• v2 = v1

• v3 = v2• v3 = 0 and that v3 = 1. Moreover, v1

2 =(–3/5)2 + (4/5)2 = 1 and v2 = (4/5)2 + (3/5)2 = 1. Thus v1, v2, v3 is an orthonormal set inR3. It will be an orthonormal basis provided that the three vectors are linearly independent,which is guaranteed by Theorem 6.3.3.

(b) By Theorem 6.3.1, we have

10. It is easy to check that v1• v2 = v1

• v3 = v1• v4 = v2

• v3 = v2• v3

4= v

3• v

4= 0, so that

we have 4 orthogonal vectors in R4. Thus, by Theorem 6.3.3, they form an orthogonal basis.

(b) By Formula (1), we have

11. (a) We have (w)S

= (⟨w, u1⟩, ⟨w, u2⟩) .=−

= −( )4

2

10

22 2 5 2, ,

2 3 2 5 2 22 3 2 10 2 2

1 1 4 1

2 2 6, , ,− −( ) =

+ + ++ + +

+− −

v122 15 2 2 2

4 4 9 4

2 6 2 0 2

1 4 0 1

2 0

+ −+ + +

+− + +

+ + ++

+

v

v

2

3++ −

+ + +

= + − +

0 2

1 0 0 1

15 2

7

5 2

21

2 201 2 3 4

v

v v3

v v

4

( , , )3 7 49

5

28

50

12

5

21

51− = − − +

+ − +v 00 4

37 5 9 5 4

2 3

1 2 3

+

= −( ) + −( ) +

v v

v v v

1

2

1

2

1

21 2 3u u u, ,


12. (a) We have u = w1 + w2 = and v = –w1 + 4w2 = .

(b) By Theorem 6.3.2, these numbers can be computed directly using the Euclidean inner

product without reference to the basis vectors. We find that u = 2, d(u, v) = 13,

and ⟨u, v⟩ = 3. Similar computations with the versions of u and v obtained in Part (b)

fortunately yield the same results.

14. (a) u = (1 + 4 + 1 + 9)1/2 = 15

v – w = (2, 1, –2, 4) = (4 + 1 + 4 + 16)1/2 = 5

v + w = (–2, –7, 4, 6) = (4 + 49 + 16 + 36)1/2 = 105

⟨v, w⟩ = 0 + 12 + 3 + 5 = 20

16. (a) Let

and v1, v2 is the desired orthonormal basis.

17. (a) Let

Since ⟨u2, v1⟩ = 0, we have

vu

u22

2

1

2

1

20, ,= = −

vu

u11

1

1

3

1

3

1

3, ,= =

vu

u

Since u v

11

1

2 1

1

10

3

10

2

10

6

10

= = −

= −

,

, == −

− = +

4

10

2 24

10

12 2 1 1

,

, ( , )

we have

u u v v110

3

10

12

5

4

5, ,−

=

This vector hhas norm12

5Thus

v

, .4

5

4 10

5

5

4 10

12

52

=

= ,, ,4

5

3

10

1

10

=

13

6

16

5,

7

5

1

5, −


Since ⟨u3, v1⟩ = and ⟨u3, v2⟩ = , we have

u3 – ⟨u3, v1⟩v1 – ⟨u3, v2⟩v2

This vector has norm Thus

and v1, v2, v3 is the desired orthonormal basis.

19. Since the third vector is the sum of the first two, we ignore it. Let u1 = (0, 1, 2) and u2 =(–1, 0, 1). Then

Since ⟨u2, v1⟩ = , then

where . Hence

Thus v1, v2 is an orthonormal basis.

v25

30

2

30

1

30= −

−

, ,

, ,− −

=12

5

1

5

30

5

u u v v2 2 1 1 12

5

1

5, , ,− = − −

2

5

vu

u11

1

01

5

2

5, ,= =

v31

6

1

6

2

6, ,= −

1

6

1

6

1

3

1

6, , .−

=

= −

1

6

1

6

1

3, ,

= −

− −( , , ) , , ,1 2 14

3

1

3

1

3

1

3

1

2

1

2

1

220,

1

2

4

3


21. Note that u1 and u2 are orthonormal. Thus we apply Theorem 6.3.5 to obtain

and

22. Since u1 and u2 are not orthogonal, we must apply the Gram-Schmidt process before we caninvoke Theorem 6.3.5. Thus we let

Since ⟨u2, v1⟩ = , we have

where . Thus

Hence v1, v2 is the desired orthonormal set. Next, we compute ⟨w, v1⟩ = 2 3 and

⟨w, v2⟩.

Theorem 6.3.5 then gives

w w v v w v v1 1 1 2 213

14

31

14

40

14= + =

, , , ,

= −7 42

6

v25

42

1

42

4

42= − −

, ,

5

3

1

3

4

3

42

3, ,− −

=

u u v v2 2 1 15

3

1

3

4

3, , ,− = − −

1

3

vu

u11

1

1

3

1

3

1

3, ,= =

w w w2 1

9

50

12

5

= −

=

, ,

w w u u w u u1 1 1 2 2

4

50

3

52 0 1

= +

= − −

+

, ,

, , , , 00

4

52

3

5

( )

= −

, ,


and

24. (a) If we apply the Gram-Schmidt process to the column vectors

Thus

and

(c) If we apply the Gram-Schmidt process to the column vectors

Thus

and

A = −

1 3 8 3 26

2 3 11 3 26

2 3 7 3 26

3 1 3

0 26 3

R, ,

,=

=

u q u q

u q1 1 2 1

2 20

3 1 3

0 26 3

u u we obta1 2

1

2

2

1

1

= −

=

,

1

iin q q1 2

1 3

2 3

2 3

8 3 26

11 3 26

7 3 26

= −

=

,

A =−

1 5 2 5

2 5 1 5

5 5

0 5

R, ,

,=

=

u q u q

0 u q1 1 2 1

2 2

5 5

0 5

u u we obtain q1 2 11

2

1

3

1 5

2 5=

=

−

=

,

=−

, q22 5

1 5

w w w2 11

14

1

14

2

14= − = −

, ,


25. By Theorem 6.3.1, we know that

w = a1v1 + a2v2 + a3v3

where ai

= ⟨w, vi⟩. Thus

w2 = ⟨w, w⟩

But ⟨vi, v

j⟩ = 0 if i ≠ j and ⟨v

i, v

i⟩ = 1 because the set v1, v2, v3 is orthonormal. Hence

w2 = a12 + a2

2 + a32

= ⟨w, v1⟩2 + ⟨w, v2⟩

2 + ⟨w, v3⟩2

26. Generalize the solution of Exercise 25.

27. Suppose the contrary; that is, suppose that

(*) u3 – ⟨u3, v1⟩v1 – ⟨u3, v2⟩v2 = 0

Then (*) implies that u3 is a linear combination of v1 and v2. But v1 is a multiple of u1while v2 is a linear combination of 1 and 2. Hence, (*) implies that u3 is a linear combinationof 1 and 2 and therefore that u1, u2, u3 is linearly dependent, contrary to the hypothesisthat u1, …, u

n is linearly independent. Thus, the assumption that (*) holds leads to a

contradiction.

28. The diagonal entries of R are ⟨ui, q

i⟩ for i = 1, …, n, where q

i= v

i/v

i is a vector resulting

from an application of the Gram-Schmidt process. That process first generates anorthogonal set of vectors v1, …, v

nwhere each vector v

iis u

iminus a linear combination of

the vectors v1, …, vi–1. Therefore ⟨u

i, v

i⟩ = ⟨v

i, v

i⟩ for i = 1, …, n. But since ⟨u

i, q

i⟩ = ⟨u

i,

vi⟩/v

i = ⟨v

i, v

i⟩/v

i = v

i and each of the vectors v

iis nonzero, we have that each diagonal

entry of R is nonzero.

= += ≠∑ ∑, ,a a ai i i

i

i j i j

i

2

1

3

1

v v v v


29. We have u1 = 1, u2 = x, and u3 = x2. Since

we let

Then

and thus v2 = u2/u2 where

Hence

In order to compute v3, we note that

and

Thus

and

u u v v u v3 3 1 1 3 22 1

3− − = −, , x

u v3 22

1

11

20, = =

−∫ x dx

u v3 12

1

11

2

2

3, = =

−∫ x dx

v23

2= x

u22

1

1 2

3= =

−∫ x dx

u v2 1 1

11

20, = =

−∫ x dx

v11

2=

u u u112

1 1

11 2= = =

−∫, dx


Hence,

30. (b) If we call the three polynomials v1, v2, and v3, and let u = 2 – 7x2, then, by virtue ofTheorem 6.3.1,

u =⟨u, v1⟩v1 + ⟨u, v2⟩v2 + ⟨u, v3⟩v3

Here

It is easy to check that

31. This is similar to Exercise 29 except that the lower limit of integration is changed from –1to 0. If we again set u1 = 1, u2 = x, and u3 = x2, then u1 = 1 and thus

v1 = 1

Then ⟨u2, v1⟩ = x dx = 12 and thus

v

or

v

2 =−−

= −

= −( )

x

xx

x

1 2

1 212 1 2

3 2 12

( )

0

1

∫

u = − −2

3

28

15

5

21 3v v .

u v

u v

,

,

12

1

1

22

1

22 7

2

3

3

22 7 0

= −( ) = −

= −( ) =

−

−

∫ x dx

x dx11

1

32 2

1

11

2

5

22 7 3 1

28

15

5

2

∫

∫= −( ) −( ) = −−

u v, x x dx

v or v3 3= −

= −( )45

8

1

3

5

2 23 12 2

x x

x x dx2

22

2

1

11

3

1

3

8

45− = −

=−∫


Finally,

and

Thus

or

v3 = 5 (6x2 – 6x + 1)

32. (a) Let S = w1, w2, …, wn be the orthonormal basis. Then (u)

S= (u1, u2, …, u

n) means

that u = u1w1 + u2w2 + … + unw

n. Thus

Taking positive square roots yields Theorem 6.3.2(b).

Note that this result also follows from Theorem 6.3.1 and Exercise 26.

(b) Since d(u, v) = u – v, we can apply Theorem 6.3.2(b) to u – v to obtain Theorem6.3.2(b).

(c) Let S = w1, w2, …, wn be the orthonormal basis. Then

u w and v= w== =∑ ∑u vi i

i

n

i i

i

n

1 1

u w w w w

w w w

21 1 1 1= + + + +

= +

u u u u

u u u u

n n n n

i i i i i j i

L L,

, ,,

(

w

because S is orthono

j

i ji

n

i

i

n

u

≠=

=

∑∑

∑=

1

2

1

rrmal)

v3

2

2

2

13

12

2 1

13

12

2 1

6 51

6=

− − −( )

− − −( )= − +

x x

x x

x x

u v3 23 2

0

13 2

3

6, ( )= − =∫ x x dx

u v3 12

0

1 1

3, = =∫ x dx


Hence

33. Let W be a finite dimensional subspace of the inner product space V and let v1, v2, …, vr

be an orthonormal basis for W. Then if u is any vector in V, we know from Theorem 6.3.4that u = w1 + w2 where w1 is in W and w2 is in W Moreover, this decomposition of u isunique. Theorem 6.3.5 gives us a candidate for w1. To prove the theorem, we must showthat if w1 = ⟨u, v1⟩v1 + … + ⟨u, v

r⟩v

rand, therefore, that w2 = u – w1 then

(i) w1 is in W

and

(ii) w2 is orthogonal to W.

That is, we must show that this candidate “works.” Then, since w1 is unique, it will be proj

Wu.

Part (i) follows immediately because w1 is, by definition, a linear combination of thevectors v1, v2, …, v

r.

⟨w2, vi⟩ = ⟨u – w1, vi

⟩

= ⟨u, vi⟩ – ⟨w1, vi

⟩

= ⟨u, vi⟩ – ⟨u, v

i⟩⟨v

i, v

i⟩

= ⟨u, vi⟩ – ⟨u, v

i⟩

= 0

Thus, w2 is orthogonal to each of the vectors v1, v2, …, vr

and hence w2 is in W.

If the vectors vi

form an orthogonal set, not necessarily orthonormal, then we mustnormalize them to obtain Part (b) of the theorem.

u v w w w

w w w

, ,

, ,

= + +

= +=∑

u u v

u v u v

n n n n

i i i i

i

n

i i i

1 1

1

ww

because is othonormal

i

i j

i i

i

n

u v S

≠

=

∑

∑=1

( )


34. (a) Call the plane and note that n = (a, b, c) is the normal vector to . Choose anarbitrary vector u in and note that u • n = 0. Hence u × n (and n × u) must lie in and be perpendicular to both u and n. Therefore u and u × n (or n × u) form anorthonogonal basis for . Normalize each of these vectors to obtain the requiredorthonormal basis.

(b) In this case, n = (1, 2, –1). The vector u = (1, 1, 3) lies in the plane given by x + 2y

– z = 0. Note that n × u = (7, –4, –1) lies in the plane and that u • (n × u) = 0. So anorthonormal basis for consists of the vectors

and

35. The vectors x = (1/ 3, 0) and y = (0, 1/ 2) are orthonormal with respect to the given

inner product. However, although they are orthogonal with respect to the Euclidean inner

product, they are not orthonormal.

The vectors x = (2/ 30, 3/ 30) and y = (1/ 5, –1/ 5) are orthonormal with respect

to the given inner product. However, they are neither orthogonal or of unit length with

respect to the Euclidean inner product.

37. (a) True. Suppose that v1, v2, …, vn

is an orthonormal set of vectors. If they were linearlydependent, then there would be a linear combination

c1v1 + c2v2 + … + cnv

n= 0

where at least one of the numbers ci

≠ 0. But

ci

= ⟨vi, c1v1 + c2v2 + … + c

nv

n⟩ = ⟨v

i, 0⟩ = 0

for i = 1, …, n. Thus, the orthonormal set of vectors cannot be linearly dependent.

n u

n u

××

=− −

, ,7

66

4

66

1

66

u

u, ,=

1

11

1

11

3

11


(b) False. The zero vector space has basis 0, which cannot be converted to a unit vector.

(c) True, since projW

u is in W and projW u is in W.

(d) True. If A is a (necessarily square) matrix with a nonzero determinant, then A haslinearly independent column vectors. Thus, by Theorem 6.3.7, A has a QR

decomposition.

38. If we project a vector uk

onto a space W that contains it, then the projection does notchange u

k. That is, proj

Wu

k= u

k. Thus, if we start with a linearly dependent set u1, u2, …,

um

so that uk

is a linear combination of the vectors that come before it, then when wecalculate v

k= u

k– proj

Wk – 1u

kwe have v

k= 0. Thus, the set constructed by the Gram-

Schmidt process will contain the zero vector, and will be a linearly dependent set.


EXERCISE SET 6.4

1. (a) If we call the system Ax = b, then the associated normal system is AT Ax = ATb, or

which simplifies to

2. (a) We have

so that det(AT A) = 0. Hence A does not have linearly independent column vectors.

3. (a) The associated normal system is AT Ax = ATb, or

1 1 1

1 1 2

1 1

1 1

1 2

1

2

− −

−

−

x

x

=

− −

−

1 1 1

1 1 2

7

0

7

A AT =

−

−

1 2 0

3 1 1

2 3 1

1 3 2

2 1 3

0 1 1

=−

−

5 1 4

1 11 10

4 10 14

21 25

25 35

20

201

2

=

x

x

1 2 4

1 3 5

1 1

2 3

4 5

1

2−

−

x

x =

−

−

1 2 4

1 3 5

2

1

5

305

or

This system has solution x1 = 5, x2 = 1/2, which is the least squares solution ofAx = b.

The orthogonal projection of b on the column space of A is Ax, or

3. (c) The associated normal system is

or

This system has solution x1 = 12, x2 = –3, x3 = 9, which is the least squares solution ofAx = b.

The orthogonal projection of b on the column space of A is Ax, or

which can be written as (3, 3, 9, 0).

1 0 1

2 1 2

1 1 0

1 1 1

12

3

9

−−

−

−

=

3

3

9

0

7 4 6

4 3 3

6 3 6

11

2

3

−−

− −

=x

x

x

88

12

9−

=− − −

1 2 1 1

0 1 1 1

1 2 0 1

6

0

9

3

1 2 1 1

0 1 1 1

1 2 0 1

1 0 1

2 1 2

1 1 0

1 1 1− − −

−−

−

x

x

x

1

2

3

1 1

1 1

1 2

5

1 2

11 2

9 2

4

−−

= −

−

3 2

2 6

14

71

2

−−

=

−

x

x


4. (a) First we find a least squares solution of Ax = u where A = [u1T|v2

T]. The associatednormal system is

or

This system has solution x1 = –1, x2 = 5/3, which is the least squares solution. Thedesired orthogonal projection is Ax, or

or (2/3, 7/3, 5/3).

5. (a) First we find a least squares solution of Ax = u where A = [v1T|v2

T|v3T]. The associated

normal system is

or

7 4 6

4 3 3

6 3 6

1

2

3

−−

− −

=x

x

x

30

21

21−

=− − −

2 1 1 1

1 0 1 1

2 1 0 1

6

3

9

6

2 1 1 1

1 0 1 1

2 1 0 1

2 1 2

1 0 1

1 1 0

1 1 1− − −

−−

−

x

x

x

1

2

3

1 1

1 2

0 1

1

5 3

2 3

7 3

5 3

−

=

2 3

3 6

3

71

2

=

x

x

1 1 0

1 2 1

1 1

1 2

0 1

1

2

x

x==

=

1 1 0

1 2 1

2

1

3


This system has solution x1 = 6, x2 = 3, x3 = 4, which is the least squares solution. Thedesired orthogonal projection is Ax, or

or (7, 2, 9, 5).

6. The solution of the given system can be expressed as x1 = –s + t, x2 = –s – t, x3 = 2s, x4 =2t. Thus the solution space is spanned by the vectors

Let A = [v1|v2]. We next find a least squares solution of Ax = u. The associated normalsystem is

or

This system has solution x1 = 1/2, x2 = 1/2, which is the least squares solution. The desiredorthogonal projection is Ax, or

or (0, –1, 1, 1).

−− −

=

−1 1

1 1

2 0

0 2

1 2

1 2

0

1

1

1

6 0

0 6

3

31

2

=

x

x

− −−

−− −

1 1 2 0

1 1 0 2

1 1

1 1

2 0

0 2

x

x

1

2

1 1 2 0

1 1 0 2

5

6

7

2

=

− −−

v and v1 2

1

1

2

0

1

1

0

2

=

−−

=−

2 1 2

1 0 1

1 1 0

1 1 1

6

3

4

−−

−

==

7

2

9

5


7. (a) If we use the vector (1, 0) as a basis for the x-axis and let , then we have

[P] = A(AT A)–1 AT = [1] [1 0] =

8. (a) If we use the vectors (1, 0, 0) and (0, 0, 1) asa basis for the xz-plane and let

then we have

9. (a) By inspection, the vectors (1, 0, –5) and (0, 1, 3) lie in the plane W and they form alinearly independent set with the appropriate number of vectors. Hence they are abasis for W.

(b) Using the basis for W found in Part (a), we let

Thus the standard matrix for the orthogonal projection on W is

A =−

1 0

0 1

5 3

P[ ] =

1 0

0 0

0 1

1 0 0

0 0 1

1 0

0 0

0 11

1 0 0

0 0 1

1 0

0 0

0 1

1

=

−

=

1 0

0 1

1 0 0

0 0 1

1 0 0

0 0 00

0 0 1

A =

1 0

0 0

0 1

1 0

0 0

1

0

A =

1

0


10. (c) By Part (b), the point (x0, y0, z0) projects to the point

on the plane W.

(d) By Part (c), the point (1, –2, 4) projects to the point (–8/7, –5/7, 25/7) on the plane.

The distance between these two points is 3 35/7.

11. (a) The vector v = (2, –1, 4) forms a basis for the line W.

(b) If we let A = [vT], then the standard matrix for the orthogonal projection on W is

1

35

10 15 5

15 26 3

5 3 34

0

0

0

−

−

x

y

z

=+ −

+ +−

( )

( )

(

2 3 7

15 26 2 35

5

0 0 0

0 0 0

x y z

x y z

x00 0 03 34 35+ +

y z )

[ ]P =−

−

−

1 0

0 1

5 3

1 0 5

0 1 3

1 0

0 1

5 3

−

=−

−1

1 0 5

0 1 3

1 0

0 1

5 3

−−

−

=

−26 15

15 10

1 0 5

0 1 3

1

1

00

0 1

5 3

2 7 3 7

3 7 26 35

1 0 5

0 1 3−

−

=−

−

=2 7 3 7 1 7

3 7 26 35 3 35

1 7 3 35 34 35

1

335

10 15 5

15 26 3

5 3 34

−

−


(c) By Part (b), the point P0(x0, y0, z0) projects to the point on the line W given by

(d) By the result in Part (c), the point (2, 1, –3) projects to the point (–6/7, 3/7, –12/7).

The distance between these two points is 497/7.

12. We can see by inspection that when t = 1, we have the point P = (t, t, t) = (1, 1, 1), andwhen s = 1, we have the point Q = (s, 2s – 1, 1) = (1, 1, 1). Thus, when s = t = 1, we haveP – Q2 = (0, 0)2 = 0, which is the minimum possible value of P – Q2, since v ≥ 0 forany vector v.

13. (a) Using horizontal vector notation, we have b = (7, 0, –7) and Ax = (11/2, –9/2, –4).Therefore Ax – b = (–3/2, –9/2, 3), which is orthogonal to both of the vectors (1, –1,–1) and (1, 1, 2) which span the column space of A. Hence the error vector isorthogonal to the column space of A.

(c) In horizontal vector notation, b = (6, 0, 9, 3) and Ax = (3, 3, 9, 0). Hence Ax – b =(–3, 3, 0, –3), which is orthogonal to the three vectors (1, 2, 1, 1), (0, 1, 1, 1), and (–1,–2, 0, –1) which span the column space of A. Therefore Ax – b is orthogonal to thecolumn space of A.

14. Since A has linearly independent column vectors, Theorem 6.4.4 guarantees that there is aunique least squares solution x of the system Ax = b. Since x is a least squares solution, itminimizes Ax – b. But if there is a vector x for which Ax = b, then this minimum is zeroand occurs when x = x. Hence x = x by the uniqueness of the least squares solution.

1

21

4 2 8

2 1 4

8 4 16

0

0

0

−− −

−

x

y

z

=− +

− + −−

( )

( )

(

4 2 8 21

2 4 21

8 4

0 0 0

0 0 0

0

x y z

x y z

x yy z0 016 21+

)

=−

− −−

1

21

4 2 8

2 1 4

8 4 16

= −

−

2

1

4

1

212 1 4

P A A A AT T[ ] = = −

− −−( ) 12

1

4

2 1 4

2

11

4

2 1 4

1

−

−


15. Recall that if b is orthogonal to the column space of A, then projW

b = 0.

16. (a) We have that [P] = A(AT A)–1 AT where A is any matrix formed using a set of basisvectors for W as its column vectors. Therefore

[P]2 = [A(AT A)–1 AT][A(AT A)–1 AT]

= A[(AT A)–1(AT A)](AT A)–1 AT

= A(AT A)–1 AT

= [P]

(c) We need to show that [P]T = [P]. Define A as in Part (a). Then

[P]T = [A(AT A)–1 AT]T

= (AT)T[(AT A)–1]T AT because (BC)T = CT BT

= A[(AT A)–1]T AT because (AT)T = A

We need only show that [(AT A)–1]T = (AT A)–1 to have the desired result. This followsfrom the fact that if B is any invertible matrix, then (B–1)T = (BT)–1, so that [(AT A)–1]T

= [(AT A)T]–1 = (AT A)–1. To see the above result, note that if B is invertible, then

BB–1 = (BB–1)T = I

so that

(B–1)T BT = I

or

(B–1)T = (BT)–1

17. If A is an m × n matrix with linearly independent row vectors, then AT is an n × m matrixwith linearly independent column vectors which span the row space of A. Therefore, byFormula (6) and the fact that (AT)T = A, the standard matrix for the orthogonal projection,S, of Rn on the row space of A is [S] = AT(AAT)–1 A.


18. (a) If we assume a relationship V = IR, then the data points give the linear system

1 = 0.1 R

2.1 = 0.2 R

2.9 = 0.3 R

4.2 = 0.4 R

5.1 = 0.5 R

(b) No, this system is not consistent. There is no value of R that works for all 5 equations.

(c) The system in Part (a) can be written as Ax = b, where

Then, we have the least squares solution

x = (AT A)–1 ATb = [0,55]–1[5.62] = [5.62/0.55] = [10.2182].

This says the average resistance is approximately 10.2182 ohms.

19. If we assume a relationship V = IR + c, we have the linear system

1 = 0.1 R + c

2.1 = 0.2 R + c

2.9 = 0.3 R + c

4.2 = 0.4 R + c

5.1 = 0.5 R + c

A =

=

0 1

0 2

0 3

0 4

0 5

1

2 1

.

.

.

.

.

.

and bb 22 9

4 2

5 1

.

.

.


This system can be written as Ax = b, where

Then, we have the least squares solution

.

Thus, we have the relationship V = 10.3 R – 0.03.

20. A relationship V = a4I4 + a3I

3 + a2I2 + a1I + a0, leads to the five equations

1 = a4(0.1)4 + a3(0.1)3 + a2(0.1)2 + a1(0.1) + a0

2.1 = a4(0.2)4 + a3(0.2)3 + a2(0.2)2 + a1(0.2) + a0

2.9 = a4(0.3)4 + a3(0.3)3 + a2(0.3)2 + a1(0.3) + a0

4.2 = a4(0.4)4 + a3(0.4)3 + a2(0.4)2 + a1(0.4) + a0

5.1 = a4(0.5)4 + a3(0.5)3 + a2(0.5)2 + a1(0.5) + a0

To solve this system, we need to row reduce the augmetned matrix

0 0001 0 001 0 01 0 1 1 1

0 0016 0 008 0 04 0 2 1 2 1

0

. . . .

. . . . .

.. . . . .

. . . . .

0081 0 027 0 09 0 3 1 2 9

0 0256 0 064 0 16 0 4 1 4 22

0 0625 0 125 0 25 0 5 1 5 1. . . . .

RREFF →

−

−

1 0 0 0 0 708 333

0 1 0 0 0 841 667

0 0 1 0 0 342 91

.

.

. 77

0 0 0 1 0 65 583

0 0 0 0 1 2 900

.

.−

x b= =

−−

( ). .

.

.

.A A A

T T11

0 55 1 55

1 5 5

5 62

15 3

=

−

.

..

10 3

0 03

A and

.

.

.

.

.

=

0 1 1

0 2 1

0 3 1

0 4 1

0 5 1

bbbb =

.

.

.

.

.

1

2 1

2 9

4 2

5 1


Thus, keeping 3 decimal places of accuracy, we have the fourth-degree polynomial

V = –708.33I4 + 841.667I3 – 342.917I2 + 65.583I – 2.9.

This polynomial is shown in the graph below for 0.1 ≤ I ≤ 0.5, showing the fluctuation in theresistance during this experiment.

22. (a) Call the vector Ax. Then by Theorem 6.4.4, Ax = A(AT A)–1 ATb.

(c) The error is Ax – b.

(d) The standard matrix is A(AT A)–1 AT.

V, volts

I, amps

5

4

3

2

1

0.1 0.2 0.3 0.4 0.5


EXERCISE SET 6.5

2. (b) T(x) = (14/3, –5/3, 11/3) and T(x) = x = 38.

3. (b) Since the row vectors form an orthonormal set, the matrix is orthogonal. Therefore itsinverse is its transpose,

(c) Since the Euclidean inner product of Column 2 and Column 3 is not zero, the columnvectors do not form an orthonormal set and the matrix is not orthogonal.

(f) Since the norm of Column 3 is not 1, the matrix is not orthogonal.

5. (b) We have (w)S

= (a, b) where w = au1 + bu2. Thus

2a + 3b = 1

–4a + 8b = 1

or and Hence (w)a b s.= = =5

28

3

14

5

28,

3

14

5

283

14

[ ] =

and

w s

1 2 1 2

1 2 1 2−

317

6. (a) Let v = av1

+ bv2

+ cv3. Then

a + 2b + 3c = 2

2b + 3c = –1

3c = 3

so that a = 3, b = –2, and c = 1. Thus (v)S

= (3, –2, 1) and

7. (b) Let p = ap1 + bp2 + cp3. Then

a + b = 2

a + c = –1

b + c = 1

or a = 0, b = 2, and c = –1. Thus (v)S

= (0, 2, –1) and

8. Let A = aA1 + bA2 + cA3 + dA4. Then

–a + b = 2

a + b = 0

c = –1

d = 3

so that a = –1, b = 1, c = –1, and d = 3. Thus (A)S

= (–1, 1, –1, 3) and

v[ ] =−

S

0

2

1

v[ ] = −

S

3

2

1


9. (a) We have w = 6v1 – v2 + 4v3 = (16, 10, 12).

(c) We have B = –8A1 + 7A2 + 6A3 + 3A4 =

11. (a) Since v1 = 1310u1 – 25u2 and v2 = – 12u1 + 0u2, the transition matrix is

(b) Since u1 = 0v1 – 2v2 and u2 = – 52v1 – 132 v2, the transition matrix is

Note that P = Q–1.

(c) We find that w = – 1710 u1 + 85u2; that is

and hence

w[ ] =−

− −

−

B′

05

2

213

2

17

108

5

=−−

4

7

w[ ] =−

B

17

108

5

P =−

− −

05

2

213

2

Q =−

−

13

10

1

22

50

15 1

6 3

−

.

A S[ ] =

−

−

1

1

1

3


11. (d) Verify that w = (–4)v1 + (–7)v2.

12. (a) We must find constants cij

such that

uj= c1j

v1 + c2jv2 + c3j

v3

for j = 1, 2, 3. For j = 1, this reduces to the system

–6c11 – 2c21 – 2c31 = –3

–6c11 – 6c21 – c31 = 0

4c21 + 7c31 = –3

which has the solution c11 = 3/4, c21 = –3/4, c31 = 0. For j = 2 and j = 3, we havesystems with the same coefficient matrix but with different right-hand sides. Thesehave solutions c12 = 3/4, c22 = –17/12, c32 = 2/3, c13 = 1/12, c23 = –17/12, and c33 = 2/3.Thus the transition matrix is

(b) We must find constants a, b, c such that w = au1 + bu2 + cu3. This means solving thesystem of equations

–3a – 3b + c = –5

2b + 6c = 8

–3a – b – c = –5

The solution is a = 1, b = 1, c = 1. Thus

and hence

w[ ]

/ / /

/ / /

/ /B′ = − − −

3 4 3 4 1 12

3 4 17 12 17 12

0 2 3 2 33

1

1

1

19 12

43 12

4 3

=

= −

/

/

/

[w]B =

1

1

1

P = − − −

3 4 3 4 1 12

3 4 17 12 17 12

0 2 3 2 3


(c) Let w = av1 + bv2 + cv3 and solve the equations

–6a – 2b – 2c = –5

–6a – 6b – 3c = 8

4b + 7c = –5

for a, b, and c.

14. (a) The transition matrix from B′ to B is

(b) Since p1 = (3/4)q1 + (3/2)q2 and p2 = (7/2)q1 + q2, the transition matrix is

(c) Let p = ap1 + bp2. Thus

6a+ 10b = –4

3a + 2b = 1

so that a = 1 and b = –1. Therefore [p]B

= and hence

(d) Let p = aq1 + bq2 and solve for a and b.

15. (a) By hypothesis, f1 and f2 span V. Since neither is a multiple of the other, then f1, f2 is

a linearly independent set and hence is a basis for V. Now by inspection,

. Therefore, g1, g2 must also be a basis for V

because it is a spanning set which contains the correct number of vectors.

ff gg gg aanndd ff gg11 11 22 22 22= + −

=1

2

1

6

1

3

[ ]/ /

/

/

/p B =

=

−

=

−−3 4 7 2

3 2 1

1

1

11 4

1 2

1

1

1−

3 4 7 2

3 2 1

/ /

/

3 4 7 2

3 2 1

2 9 7 9

1 3 1 6

1/ /

/

/ /

/ /

=

−−

−


15. (b) The transition matrix is

(c) From the observations in Part (a), we have

(d) Since h = 2f1 + (–5)f2, we have [h]B

= ; thus

16. (a) Equation (9) yields

(b) If we compute the inverse of the transition matrix in (a), we have

18. (a) Always true. The transition matrix is given in Equation (6).

(b) Always true. If the basis is B = v1, v2, …, vn, then the coordinate vectors are given by v

iB

= ei, so the transition matrix has columns e1, e2, …, e

n. This is the n × n identity matrix.

x

y

=

− −

−

=

−1 2 1 2

1 2 1 2

5

2

7/ /

/ /

22

3 2

x

y

′′

=

−cos( / ) sin( / )

sin( / ) cos(

3 4 3 4

3 4

π ππ 33 4

1 2 1 2

1 2 1 2

π / )

/ /

/ /

=−

− −

x

y

−

=

−

2

6

4 2

2 2

[ ]h B′ =−

−

=

−

1

20

1

6

1

3

2

5

1

2

2

5−

P =−

1

20

1

6

1

3

1

20

1

6

1

3

2 0

1 3

1

−

=

−


(c) Always true. If A is invertible, then the column u1, u2, …, un

of A form a basis of Rn byTheorem 6.4.5. Denote this basis by B′, and denote the standard basis of Rn by B.Then we have [u

i]B

= ui

for i = 1, 2, …, n, so A = [[ui]B

| [ui]B

| … | [un]B] is the

transition matrix from the standard basis of Rn to the basis B′ consisting of the columnvectors of A.

19. The general transition matrix will be

In particular, if we rotate through θ = π3, then the transition matrix is

21. (a) See Exercise 19, above.

22. Use the result of Example 7 to find P1 where x′ = P1x. Use the result of Exercise 21 (a),above, to find P2 where x′′ = P2x′. Then x′′ = P2P1x, and hence A = P2P1.

23. Since the row vectors (and the column vectors) of the given matrix are orthogonal, thematrix will be orthogonal provided these vectors have norm 1. A necessary and sufficientcondition for this is that a2 + b2 = 1/2. Why?

24. Suppose that A = [aij] is a 2 × 2 orthogonal matrix. Since the column vectors form an

orthonormal set, we have the equations a112 + a21

2 = a122 + a22

2 = 1 and a11a12 + a21a22 = 0. Thus |aij|

≤ 1 for all aij. Since cos α takes on all values between –1 and + 1 for 0 ≤ α ≤ π, we can let a11

= cos θ for some value of θ where 0 ≤ θ ≤ π. Alternatively, we could let a11 = cos (2π – θ).Since a11

2 + a212 = 1, we have a21

2 = 1 – cos2 θ = sin2 θ, so that a21 = ±sin θ, or alternatively, a21 =sin(2π – θ). Thus the orthogonality equation becomes a12 cos θ ± a22 sin θ = 0, so that a12= k sin θ and a22 = k cos θ. Since a12

2 + a222 = 1, k = ±1. Thus we have four possible matrices:

cos sin

sin cos,

cos sin

sin cos

θ θθ θ

θ θθ θ

−

−− −

−

,

cos sin

sin cos,

cos sin

sin

θ θθ θ

θ θθθ θ−

cos

1

20

3

20 1 0

3

20

1

2

−

cos sinθ θ0

0 1 0

3

10

1

2

−


If we substitute θ = 2π – θ for θ in the third and fourth matrices, they become the first andsecond matrices, respectively. Thus for some value of θ where 0 ≤ θ ≤ 2π, the first twomatrices represent all possible forms of a 2 × 2 orthogonal matrix. Since θ = 2π gives thesame value as θ = 0, we can restrict θ to the range 0 ≤ θ < 2π.

25. Multiplication by the first matrix A in Exercise 24 represents a rotation and det(A) = 1. Thesecond matrix has determinant –1 and can be written as

Thus it represents a rotation followed by a reflection about the x-axis.

28. Since a composition of rotations can be represented as a product of orthogonal matriceswith determinant 1 and, by Theorem 6.5.2, such a product is also orthogonal, it mustrepresent either a single rotation or a rotation followed by a reflection. However, if it is theproduct of matrices with determinant 1, its determinant is also 1, and hence it representsa rotation.

29. Note that A is orthogonal if and only if AT is orthogonal. Since the rows of AT are thecolumns of A, we need only apply the equivalence of Parts (a) and (b) to AT to obtain theequivalence of Parts (a) and (c).

30. (a) Rotations about the origin and reflections about any line through the origin are rigid.So is any combination of these transformations.

(b) Rotations about the origin, dilations and contractions, reflections about lines throughthe origin, and combinations of these are all angle preserving.

(c) Suppose that T is a linear operator in R2 and let A be the matrix associated with T. IfT is rigid, then Ax = x for all x in R2. Then Theorem 6.5.3 guarantees that Ax • Ay

= x • y or, since Ax = x, that

Therefore, the cosine of the angle between Ax and Ay is equal to the cosine of the anglebetween x and y and thus the angles themselves are the same. Hence, if T is rigid, then Tis angle preserving.

The converse is not true. Simply note that dilations in R2 by a factor k where k ≠ 1preserve the angle between two vectors but not their lengths.

Ax Ay

Ax Ay=

x y

x y

⋅ ⋅

cos sin

sin cos

cos sθ θθ θ

θ−− −

=

−

−1 0

0 1

iin

sin cos

θθ θ


31. If A is the standard matrix associated with a rigid transformation, then Theorem 6.5.3guarantees that A must be orthogonal. But if A is orthogonal, then Theorem 6.5.2guarantees that det(A) = ±1.

32. If A is orthogonal, then its row vectors must form an orthonormal set r1, r2, r3. Since r1

= a2 + 1, then a = 0. This guarantees that r1 • r2 = r1 • r3 = 0. It remains to find b and c such

that b2 + 1/3 = 1, c2 + 2/3 = 1, and bc + 2/ 18 = 0. The first two equations imply that b =

± 2/ 3 and c = ±1/ 3, while the third equation tells us that b and c must have different

signs. Therefore a = 0, b = 2/ 3, c = 1/ 3 and a = 0, b = – 2/ 3, c = 1/ 3 are the

only two possibilities.


EXERCISE SET 6.6

4. Let A be an orthogonal matrix. To establish that AT is also an orthogonal matrix, we needto show that (AT)–1 = (AT)T. This is easily done using properties of the transpose fromChapter 1. Since A is orthogonal, we know that

A–1 = AT.

Transposing both sides gives

(A–1)T = (AT)T.

and using Theorem 1.4.10, we have

(AT)–1 = (AT)T.

Thus, AT is also an orthogonal matrix.

327


1. (a) We must find a vector x = (x1, x2, x3, x4) such that

The first two conditions guarantee that x1 = x4 = 0. The third condition implies that x2= x3. Thus any vector of the form (0, a, a, 0) will satisfy the given conditions provideda ≠ 0.

(b) We must find a vector x = (x1, x2, x3, x4) such that x • u1 = x • u4 = 0. This implies

that x1 = x4 = 0. Moreover, since x = u2 = u3 = 1, the cosine of the angle between

x and u2 is x • u2 and the cosine of the angle between x and u3 is x • u3. Thus we are

looking for a vector x such that x • u2 = 2x • u3, or x2 = 2x3. Since x = 1, we have x

= (0, 2x3, x3, 0) where 4x32 + x3

2 = 1 or x3 = ±1/ 5. Therefore

2. We first show that A is symmetric, or that A = AT. Observe that

To show that A is orthogonal, it will suffice to show that A–1 = AT, or that AAT = In, or, in this

case, that A2 = In, or that A2 – I

n= O

n. To this end, observe that

A InT T2

2 424 4

= − +x

xxx

xx( )

A I I

I

Tn

T

T

nT T

T

n

= −

= − ( )= −

2 2

2

2 2

2

xxx

xxx

xxTT

TT

nT

I A( ) = − =xx

xx2

2

x = ±

02

5

1

50, , ,

x u =0, x u =0, andx u

x u=

x u

x u1 42

2

3

3⋅ ⋅ ⋅ ⋅

329

or that

Thus it will suffice to show that . To this end, let

x = [x1x2… x

n]T. Then

so that

This completes the proof.

4. Let . By the Cauchy-Schwarz Inequality,

or

n a aa an

21 1

1

1 1≤ + + + +

( )

( ) ( )u v u vterms

⋅ = + + ≤2 2 21 1L

n

u vand= =( ) ( ), , / , , /a a a an n1 11 1

(xx x x xTn

T

T

nT

x x x

x x

x x

x x

) [ ]21 2

1

2=

L

= + + +

= + + +

x x x

x x x

T Tn

T

n

12

22 2

12

22 2

xx xx xxL

L( )xxx x xxT T=

xx x x xT

T

T

nT

n

x x

x x

x x

x x x=

=

1

21 2 L[ ]]

xxx

xxT T( )=1

22

A I

x xn

T T22 2

24 1− = − −

xx xx( )

330 Supplementaary Exercises 6

6. If v = (a, b, c) where v = 1 and v • u1 = v • u2 = v • u3 = 0, then

a + b – c = 0

–2a – b + 2c = 0

–a + c = 0

where a2 + b2 + c2 = 12. The above system has the solution a = t, b = 0, and c = t where tis arbitrary. But a2 + b2 + c2 = 12 implies that 2t2 = 12 or t = ±1/ 2. The two vectors aretherefore v = ±(1/ 2, 0, 1/ 2).

7. Let

(*) ⟨u, v⟩ =w1u1v1 + w2u2v2 + … + wnu

nv

n

be the weighted Euclidean inner product. Since ⟨vi, v

j⟩ = 0 whenever i ≠ j, the vectors v1,

v2, …, vn form an orthogonal set with respect to (*) for any choice of the constants w1, w2,

…, wn. We must now choose the positive constants w1, w2, …, w

nso that v

k = 1 for all k.

But vk2 = kw

k. If we let w

k= 1/k for k = 1, 2, …, n, the given vectors will then form an

orthonormal set with respect to (*).

8. Suppose that ⟨u, v⟩ = w1u1v1 + w2u2v2 were such an inner product. Then

⟨(1, 2), (3, –1)⟩ = 0 ⇒ 3w1 – 2w2 = 0

(1, 2) = 1 ⇒ w1+ 4w2 = 1

(3, –1) = 1 ⇒ 9w1 + w2 = 1

But since the three equations for 1 and 2 above are inconsistent, there is no such innerproduct.

9. Let Q = [aij] be orthogonal. Then Q–1 = QT and det(Q) = ±1. If C

ijis the cofactor of a

ij, then

so that [aij

= det(Q)Cij.

Q a QQ

C QijT

ijT

T

= = =

=−[( )] ( )det( )

( ) det(1 1))( )Cij

Supplementaary Exercises 6 331

10. From the definition of the norm, we have

u – v2 = ⟨u – v, u – v⟩

= ⟨u, u⟩ + ⟨v, v⟩ – 2⟨u, v⟩

= u2 + v2 – 2u v cos θ

11. (a) The length of each “side” of this “cube” is |k|. The length of the “diagonal” is n|k|.The inner product of any “side” with the “diagonal” is k2. Therefore,

(b) As n → + ∞, cos θ → 0, so that θ → π/2.

12. (a) We have that u + v and u – v are orthogonal if and only if

⟨u + v, u – v⟩ = 0

But

⟨u + v, u – v⟩ = u – v

by the properties of an inner product (Why?). Thus u + v and u – v are orthogonal ifand only if u = v.

(b) Since u + v and u – v can be thought of as diagonals of a parallelogram with sides uand v, the above result says that the two diagonals are perpendicular if and only if theparallelogram has equal sides.

13. Recall that u can be expressed as the linear combination

u = a1v1 + … + anvn

where ai

= ⟨u, vi⟩ for i = 1, …, n. Thus

cosθ = =k

k n k n

2 1


Therefore

14. We must show that ⟨u, v⟩ satisfies the four axioms which define an inner product. We show(2) and (4), and leave the others to you.

(2) Using Axiom 3 for ⟨u, v⟩1 and ⟨u, v⟩2, we have

⟨u + v, w⟩ = ⟨u + v, w⟩1 + ⟨u + v, w⟩2

= ⟨u, w⟩1 + ⟨v, w⟩1 + ⟨u, w⟩2 + ⟨v, w⟩2

= ⟨u, w⟩1 + ⟨u, w⟩2 + ⟨v, w⟩1 + ⟨v, w⟩2

= ⟨u, w⟩ + ⟨v, w⟩

(4) Using Axiom 4 for ⟨u, v⟩1 and ⟨u, v⟩2, we have

⟨v, v⟩ = ⟨v, v⟩1 + ⟨v, v⟩2 ≤ 0

since both ⟨v, v⟩1 and ⟨v, v⟩2 are nonnegative. Moreover, ⟨v, v⟩ = 0 if and only if ⟨v,v⟩1 = ⟨v, v⟩2 = 0, which is true if and only if v = 0.

15. Recall that A is orthogonal provided A–1 = AT. Hence

⟨u, v⟩ = vT AT Au

= vT A–1 Au = vT u

which is the Euclidean inner product.

cos cos21

2 12

22 2

12

22 2

α α+ + =+ + +

+ + +=L

L

Ln

n

n

a a a

a a a11

cos,2

2

2

2

12

22

α i

i

i

i

i

u v

a

u

a

a a

=

=

=+

u v

++ L an2

(Why?)

Supplementaary Exercises 6 333

16. To show that (W) = W, we first show that W ⊆ (W

). If w is in W, then w isorthogonal to every vector in W, so that w is in (W

) . Thus W ⊆ (W).

To show that (W) ⊆ W, let v be in (W

). Since v is in V, we have, by the Projection Theorem, that v = w1 + w2 where w1 is in W and w2 is in W . By definition, ⟨v, w2⟩ = ⟨w1, w2⟩ = 0. But

⟨v, w2⟩ = ⟨w1 + w2, w2⟩

= ⟨w1, w2⟩ + ⟨w2, w2⟩

= ⟨w2, w2⟩

so that ⟨w2, w2⟩ = 0. Hence w2 = 0 and therefore v = w1, so that v is in W. Thus (W) ⊆

W.


EXERCISE SET 7.1

1. (a) Since

the characteristic equation is λ2 – 2λ – 3 = 0.

(e) Since

the characteristic equation is λ2 = 0.

3. (a) The equation (λI – A)x = 0 becomes

The eigenvalues are λ = 3 and λ = –1. Substituting λ = 3 into (λI – A)x = 0 yields

or

–8x1 + 4x2 = 0

0 0

8 4

0

01

2−

=

x

x

λλ

−− +

=

3 0

8 1

0

01

2

x

x

det( ) detλλ

λI A− =

0

8

det( ) det ( )( )λλ

λλ λI A− =

−− +

= − +

3 0

8 13 1

335

Thus x1 = 1–2s and x2 = s where s is arbitrary, so that a basis for the eigenspace

corresponding to λ = 3 is . Of course, and are also bases.

Substituting λ = –1 into (λI – A)x = 0 yields

or

–4x1 = 0

–8x1 = 0

Hence, x1 = 0 and x2 = s where s is arbitrary. In particular, if s = 1, then a basis for the

eigenspace corresponding to λ = –1 is .

3. (e) The equation (λI – A)x = 0 becomes

Clearly, λ = 0 is the only eigenvalue. Substituting λ = 0 into the above equation yieldsx1 = s and x2 = t where s and t are arbitrary. In particular, if s = t = 1, then we find

that and form a basis for the eigenspace associated with λ = 0.

4. (c) Since

= λ3 + 8λ2 + λ + 8

the characteristic equation of A is

λ3 + 8λ2 + λ + 8 = 0

det[ ] detλλ

λλ

I A− =+ −

+− − +

2 0 1

6 2 0

19 5 4

0

1

1

0

λλ0

0

0

01

2

=

x

x

0

1

−−

=

4 0

8 0

0

01

2

x

x

ππ2

1

2

1 2

1


5. (c) From the solution to 4(c), we have

λ3 + 8λ2 + λ + 8 = (λ + 8)(λ2 + 1)

Since λ2 + 1 = 0 has no real solutions, then λ = –8 is the only (real) eigenvalue.

6. (c) The only eigenvalue is λ = –8. If we substitute the value λ = –8 into the equation (λI – A)x = 0, we obtain

The augmented matrix of the above system can be reduced to

Thus, x1 = – 1–6

s, x2 = – 1–6s, x3 = s is a solution where s is arbitrary. Therefore

forms a basis for the eigenspace.

(f) The eigenvalues are λ = –4 and λ = 3. If we substitute λ = –4 into the equation (λI – A)x = 0, we obtain

If we reduce the augmented matrix to row-echelon form, we obtain

This implies that x1 = –2 s, x2 = 8–3

s, x3 = s. If we set s = 1, we find that

1 0 2 0

0 1 8 3 0

0 0 0 0

−

− − −−

− −

9 6 2

0 3 8

1 0 2

1

2

3

x

x

x

=

0

0

0

−−

1 6

1 6

1

1 0 1 6 0

0 1 1 6 0

0 0 0 0

− −−

− − −

6 0 1

6 6 0

19 5 4

1

2

3

x

x

x

=

0

0

0


is a basis for the eigenspace associated with λ = –4.

If we substitute λ = 3 into the equation (λI – A)x = 0, we obtain

If we reduce the augmented matrix to row-echelon form, we obtain

It then follows that

is a basis for the eigenspace associated with λ = 3.

7. (a) Since

= λ4 + λ3 – 3λ2 – λ + 2

= (λ – 1)2(λ + 2)(λ + 1)

det[ ] detλ

λλ

λI A− =

−− −

− +

0 2 0

1 1 0

0 1 2 0

0 0 0 λ −

1

5

2

1

−

1 0 5 0

0 1 2 0

0 0 0 0

−

− − −

−

2 6 2

0 4 8

1 0 5

1

2

3

x

x

x

=

0

0

0

−

2

8 3

1


the characteristic equation is

(λ – 1)2 (λ + 2)(λ + 1) = 0

9. (a) The eigenvalues are λ = 1, λ = –2, and λ = –1. If we set λ = 1, then (λI – A)x = 0becomes

The augmented matrix can be reduced to

Thus, x1 = 2s, x2 = 3s, x3 = s, and x4 = t is a solution for all s and t. In particular, if welet s = t = 1, we see that

form a basis for the eigenspace associated with λ = 1.

If we set λ = –2, then (λI – A)x = 0 becomes


− −− − −

−−

2 0 2 0

1 2 1 0

0 1 0 0

0 0 0 3

=

x

x

x

x

1

2

3

4

0

0

0

0

2

3

1

0

0

0

0

1

and

1 0 2 0 0

0 1 3 0 0

0 0 0 0 0

0 0 0 0 0

−−

1 0 2 0

1 1 1 0

0 1 3 0

0 0 0 0

−− −

−

=

x

x

x

x

1

2

3

4

0

0

0

0


This implies that x1 = –s, x2 = x4 = 0, and x3 = s. Therefore the vector

forms a basis for the eigenspace associated with λ = –2.

Finally, if we set λ = –1, then (λI – A)x = 0 becomes


Thus, x1 = –2s, x2 = s, x3 = s, and x4 = 0 is a solution. Therefore the vector

forms a basis for the eigenspace associated with λ = –1.

−

2

1

1

0

1 0 2 0 0

0 1 1 0 0

0 0 0 1 0

0 0 0 0 0

−

1 0 2 0

1 1 1 0

0 1 3 0

0 0 0 0

1

2

3

4

−− −

−

x

x

x

x

=

0

0

0

0

−

1

0

1

0

1 0 1 0 0

0 1 1 0 0

0 0 0 0 0

0 0 0 0 0

−


11. By Theorem 7.1.1, the eigenvalues of A are 1, 1/2, 0, and 2. Thus by Theorem 7.1.3, theeigenvalues of A9 are 19 = 1, (1/2)9 = 1/512, 09 = 0, and 29 = 512.

12. Notice that A2 = I. Therefore A25 = A.

13. The vectors Ax and x will lie on the same line through the origin if and only if there existsa real number λ such that Ax = λx, that is, if and only if λ is a real eigenvalue for A and xis the associated eigenvector.

(a) In this case, the eigenvalues are λ = 3 and λ = 2, while associated eigenvectors are

respectively. Hence the lines y = x and y = 2x are the only lines which are invariantunder A.

(b) In this case, the characteristic equation for A is λ2 + 1 = 0. Since A has no realeigenvalues, there are no lines which are invariant under A.

14. (a) From the proof of Theorem 7.1.4, p(0) = det(–A) = 5. Since each signed elementaryproduct in det(–A) is just (–1)3 times the corresponding signed elementary product indet(A), we have that det(A) = –det(–A) = –5.

15. Let aij

denote the ijth entry of A. Then the characteristic polynomial of A is det(λI – A) or

This determinant is a sum each of whose terms is the product of n entries from the givenmatrix. Each of these entries is either a constant or is of the form λ – a

ij. The only term

with a λ in each factor of the product is

det

λ

λ

− −− −

− −

a a a

a a a

a a a

n

n

n n n

11 12 1

21 22 2

1 2

nn

1

1

1

2

and


(λ – a11)(λ – a22) … (λ – ann

)

Therefore, this term must produce the highest power of λ in the characteristic polynomial.This power is clearly n and the coefficient of λn is 1.

17. The characteristic equation of A is

λ2 – (a + d)λ + ad – bc = 0

This is a quadratic equation whose discriminant is

(a + d)2 – 4ad + 4bc = a2 – 2ad + d2 + 4bc

= (a – d)2 + 4bc

The roots are

If the discriminant is positive, then the equation has two distinct real roots; if it is zero, thenthe equation has one real root (repeated); if it is negative, then the equation has no realroots. Since the eigenvalues are assumed to be real numbers, the result follows.

18. We are given the two distinct eigenvalues λ1 and λ2. To find the corresponding eigenvectors,we set

This yields the equations

(λi – a) x1 – bx2 = 0

–cx1 + (λi

– d)x2 = 0

This system of equations is guaranteed to have a nontrivial solution. Since b and a – λi

are

not both zero, then any nonzero multiple of will be an eignvector correspondingto λ

i. Therefore

−−

b

a iλ

λλ

i

i

a b

c d

x

x

− −− −

=

1

4

0

0

λ = + ± − +

1

24( ) (a )2

a d b bc


are eigenvectors corresponding to λ1 and λ2, respectively.

19. As in Exercise 17, we have

Alternate Solution: Recall that if r1 and r2 are roots of the quadratic equation x2 + Bx + C = 0, then B = –(r1 + r2) and C = r1r2. The converse of this result is also true.Thus the result will follow if we can show that the system of equations

λ1 + λ2 = a + d

λ1λ2 = ad – bc

is satisfied by λ1 = a + b and λ2 = a– c. This is a straightforward computation and we leaveit to you.

20. Suppose that Ax = λx where A is invertible. Then

x = A–1 Ax = A–1 λx = λA–1x

Since A is invertible, we know that λ ≠ 0. Thus

A–1x = 1–λx

That is, 1/λ is an eigenvalue of A–1 and x is a corresponding eigenvector.

λ ( )

( )

=+ ± − +

=+ ± − +

a d a d bc

a d c b

2

2

4

2

44

2

2

2

bcd c b

a d c b

a

because a − = −

=+ ± +

=+

( )

dd c d a b c d

a b

+ + − − +

= + −2 2

or

or a c

−−

−−

b

a

b

aλ λ1 2and


21. Suppose that Ax = λx. Then

(A – sI)x = Ax – sIx = λx – sx = (λ – s)x

That is, λ – s is an eigenvalue of A – sI and x is a corresponding eigenvector.

22. The characteristic equation of A is λ3 – 6λ2 + 11λ – 6 = 0. The roots of this equation are λ= 1, 2, 3, so that these are the eigenvalues of A. If λ = 1, then λI – A reduces to

Thus if x = [x1 x2 x3]T is an eigenvector of A corresponding to λ = 1, then x1 = x3 and x2 =

0, so that

If λ = 2, then λI – A reduces to

Thus any eigenvector x corresponding to λ = 2 is of the form

If λ = 3, then λI – A reduces to

1 0 1

0 1 1

0 0 0

−−

x =

=

s

s s

2

0

1 2

1

0

1 1 2 0

0 0 1

0 0 0

−

x =

=

r

r

r0

1

0

1

1 0 1

0 1 0

0 0 0

−


Thus any eigenvector x corresponding to λ = 2 is of the form

(a) By Exercise 20, the eigenvalues of A–1 are 1, 1/2, and 1/3. The correspondingeigenspaces are spanned by the vectors

respectively.

23. (a) For any square matrix B, we know that det(B) = det(BT). Thus

det(λI – A) = det(λI – A)T

= det(λIT – AT)

= det(λI – AT)

from which it follows that A and AT have the same eigenvalues because they have thesame characteristic equation.

(b) Consider, for instance, the matrix which has λ = 1 as a (repeated) eigen-

value. Its eigenspace is spanned by the vector , while the eigenspace of its

transpose is spanned by the vector

24. (a) False, since x must be nonzero.

(b) True. The eigenvalues of A are the only values of λ for which the matrix λI – A issingular. So if λ is not an eigenvalue, then λI – A is invertible and the homogeneoussystem (λI – A)x = 0 has only the trivial solution.

1

1

−

1

1

2 1

1 0−

1

0

1

1 2

1

0

1

1

1

/

x =

=

t

t

t

t

1

1

1


(c) True. By hypothesis, λ = 0 is an eigenvalue of A. Theorem 7.1.3 then guarantees thatλ2, which is also zero, is an eigenvalue for A2. Theorem 7.1.4 then guarantees that A2

is singular.

Alternatively, observe that if x is the (necessarily nonzero) eigenvector corre-sponding to λ, then Ax = 0. Thus, AAx = 0 for some x ≠ 0, so A2 is singular.

(d) True. Since λ = 0 is not a root of λn + 1, Theorem 7.1.4 guarantees that A is invertible.

25. (a) Since p(λ) has degree 6, A is 6 × 6.

(b) Yes, A is invertible because λ = 0 is not an eigenvalue.

(c) A will have 3 eigenspaces corresponding to the 3 eigenvalues.


EXERCISE SET 7.2

1. The eigenspace corresponding to λ = 0 can have dimension 1 or 2. The eigenspacecorresponding to λ = 1 must have dimension 1. The eigenspace corresponding to λ = 2 canhave dimension 1, 2, or 3.

2. (a) The characteristic equation is

Hence the eigenvalues are λ = 3 and λ = 5.

(b) If λ = 3,

which has rank 1. If λ = 5,

which has rank 2.

(c) The nullities of the above matrices are 2 and 1, so the corresponding eigenspaces willhave dimensions 2 and 1. Therefore, the 3 × 3 matrix A has 3 linearly independenteigenvectors, and is diagonalizable.

λ I A− =−− −− −

→−1 0 1

2 2 2

1 0 1

1 0 1

0 1 0

0 0 00

λ I A− =−− −− −

→1 0 1

2 0 2

1 0 1

1 0 1

0 0 0

0 0 00

λλ

λλ λ

−− − −− −

= − − =4 0 1

2 3 2

1 0 4

3 5 02( ) ( )

347

5. Call the matrix A. Since A is triangular, the eigenvalues are λ = 3 and λ = 2. The matrices3I – A and 2I – A both have rank 2 and hence nullity 1. Thus A has only 2 linearlyindependent eigenvectors, so it is not diagonalizable.

6. Call the matrix A. The characteristic equation of A is (λ – 2)(λ2 + λ + 1) = 0, so λ = 2 is theonly (real) eigenvalue. Thus A has only one linearly independent eigenvector, so it is notdiagonalizable.

8. The characteristic equation is λ2 – 3λ + 2 = 0, the eigenvalues are λ = 2 and λ = 1, and theeigenspaces are spannedby the vectors

respectively. Thus, if we let

then

Clearly, there are other possibilities for P.

10. The characteristic equation is λ(λ – 1)(λ – 2) = 0, the eigenvalues are λ = 0, λ = 1, and λ= 2, and the eigenspaces are spannedby the vectors

Thus, if we let

P = −

0 1 0

1 0 1

1 0 1

0

1

1

1

0

0

0

1

1

−

P AP− =

1 2 0

0 1

P =

3 4

4 5

3 4

1

4 5

1

and


then

12. The characteristic equation is λ3 – λ2 +5λ – 2 = 0, the eigenvalues are λ = 1 and λ = 2, andthe eigenspaces are spanned by the vectors

respectively. Since A is a 3 × 3 matrix with only 2 linearly independent eigenvectors, it isnot diagonalizable.

13. The characteristic equation is λ3 – 6λ2 + 11λ – 6 = 0, the eigenvalues are λ = 1, λ = 2, andλ = 3, and the eigenspaces are spannedby the vectors

Thus, one possibility is

and

P AP− =

11 0 0

0 2 0

0 0 3

P =

1 2 1

1 3 3

1 3 4

1

1

1

2 3

1

1

1 4

3 4

1

1

4 3

1

4 3

4 3

1

and

P AP− =

10 1 0

0 1 0

0 0 2


15. The characteristic equation is λ2(λ – 1) = 0; thus λ = 0 and λ = 1 are the only eigenvalues.

The eigenspace associated with λ = 0 is spanned by the vectors and ; the

eigenspace associated with λ = 1 is spanned by . Thus, one possibility is

and hence

18. The eigenvalues of A are λ = 1 and λ = 2 and the eigenspaces are spanned by the vectors

Thus if we let

then

Now by Formula (10), we have

P AP− =

1 1 0

0 2

P =

1 0

1 1

1

1

and

0

1

P AP− =

10 0 0

0 0 0

0 0 1

P =−

1 0 0

0 1 0

3 0 1

0

0

1

0

1

0

1

0

3−


20. By Theorem 7.1.1, the eigenvalues of A are λ1 = 1 and λ2 = λ3 = –1. Correspondingeigenvectors are

Since A has three linearly independent eigenvectors, it is diagonalizable. Let

Then

Thus, Ak = PDk P–1 for any positive integer k, or

APIP PP I k

PDP A k

k == =

=

− −

−

1 1

1

if is even

if is odd

P A P D− = = −

−

11 0 0

0 1 0

0 0 1

P =−

1 1 4

0 1 0

0 0 1

1

0

0

1

1

0

4

0

1

−

A P P10

1011 0

1 2

1 0

1 1

1 0

0 2

=

=

−

110

10 10

1 0

1 1

1 0

1 2 2

−

=−


Finally, since D is its own inverse, we have

A–1 = (PDP–1)–1 = PD–1 P–1 = A

so that

A–k = (A–1)k = Ak

Of course, all of the above couldhave been greatly simplified if we had been cleverenough to notice at the beginning that A2 = I.

21. The characteristic equation of A is (λ – 1)(λ – 3)(λ – 4) = 0 so that the eigenvalues are λ = 1, 3, and 4. Corresponding eigenvectors are [1 2 1]T, [1 0 –1]T, and [1 –1 1]T,respectively, so we let

Hence

and therefore

22. (a) By Theorem 7.2.3, we can diagonalize A if it has two distinct eigenvalues. According toExercise 17 of Section 7.1, this will occur if and only if (a – d)2 + 4bc > 0.

(b) See Part (a). If the discriminant, (a – d)2 + 4bc, of the characteristic equation isnegative, then there are no real roots and hence no real eigenvalues or eigenvectors.Thus, by Theorem 7.2.1, A is not diagonalizable.

An

n

n

n

= −−

1 1 1

2 0 1

1 1 1

1 0 0

0 3 0

0 0 4

−−

/ / /

/ /

/ / /

1 6 1 3 1 6

1 2 0 1 2

1 3 1 3 1 3

P− = −

−

11 6 1 3 1 6

1 2 0 1 2

1 3 1 3 1 3

/ / /

/ /

/ / /

P = −−

1 1 1

2 0 1

1 1 1


24. Since A is diagonalizable, we have A = PDP–1 where D is a diagonal matrix with theeigenvalues of A as its diagonal elements. Thus the rank of D is the number of nonzeroeigenvalues of A. Since P and P–1 are invertible, they are products of elementary matrices.But multiplication on the left or on the right by an elementary matrix does not change therank of the product. Therefore rank(A) = rank(D), and we are done.

25. (a) False. For instance the matrix , which has linearly independent column

vectors, has characteristic polynomial (λ – 1)2. Thus λ = 1 is the only eigenvalue.

The corresponding eigenvectors all have the form . Thus this 2 × 2 matrix has

only 1 linearly independent eigenvector, and hence is not diagonalizable.

(b) False. Any matrix Q which is obtained from P by multiplying each entry by a nonzeronumber k will also work. Why?

(c) True by Theorem 7.2.2.

(d) True. Suppose that A is invertible and diagonalizable. Then there is an invertiblematrix P such that P–1 AP = D where D is diagonal. Since D is the product of invertiblematrices, it is invertible, which means that each of its diagonal elements d

iis nonzero

and D–1 is the diagonal matrix with diagonal elements 1/di. Thus we have

(P–1 AP)–1 = D–1

or

P–1 A–1 P = D–1

That is, the same matrix P will diagonalize both A and A–1.

26. (a) The dimensions of the eigenspaces corresponding to λ = 1, λ = 3, and λ = 4 are atmost 1, 2, and 3 respectively. The dimension of the eigenspace corresponding to λ =1 must be exactly 1.

(b) If A is diagonalizable, then it follows from Theorem 7.2.1 that the dimensions of the 3spaces will be exactly 1, 2, and 3.

(c) The eigenvalue must be λ = 4.

27. (a) Since A is diagonalizable, there exists an invertible matrix P such that P–1 AP = Dwhere D is a diagonal matrix containing the eigenvalues of A along its diagonal.Moreover, it easily follows that P–1 AkP = Dk for k a positive integer. In addition,Theorem 7.1.3 guarantees that if λ is an eigenvalue for A, then λk is an eigenvalue forAk. In other words, Dk displays the eigenvalues of Ak along its diagonal.

t1

1

0 1

1 2−


Therefore, the sequence

P–1 AP = D

P–1 A2 P = D2

...P–1 Ak P = Dk

...

will converge if and only if the sequence A, A2, . . ., Ak, . . . converges. Moreover, thiswill occur if and only if the sequences λ

i, λ

i2, . . ., λ i

k, . . . converges for each of the neigenvalues λ

iof A.

(b) In general, a given sequence of real numbers a, a2, a3, . . . will converge to 0 if and onlyif –1 < a < 1 and to 1 if a = 1. The sequence diverges for all other values of a.

Recall that P–1 Ak P = Dk where Dk is a diagonal matrix containing the eigenvalues

λ1k, λ 2

k, . . ., λnk on its diagonal. If |λ

i| < 1 for all i = 1, 2, . . ., n, then Dk = O

and hence Ak = O.

If λi

= 1 is an eigenvalue of A for one or more values of i and if all of the other

eigenvalues satisfy the inequality |λj| < 1, then Ak exists and equals PD

LP–1 where

DL

is a diagonal matrix with only 1’s and 0’s on the diagonal.

If A possesses one or more eigenvalues λ which do not satisfy the inequality

–1 < λ ≤ 1, then Ak does not exist.

28. (a) If Sn

= 1 + x + x2 + . . . + xn, then Sn

– xSn

= (1 – x)Sn

= 1 – xn+1 or Sn

= (1 – xn+1)/

(1 – x). If |x| < 1, then Sn

= 1/(1 – x). If |x| ≥ 1, then the limit doesn’t exist.

(b) and (c) If A is diagonalizable and λ1, λ2, . . ., λn

are the eigenvalues of A, then

P–1(I + A + A2 + . . . + Ak)P = I + P–1 AP + P–1 A2 P + . . . + P–1 Ak P

is a diagonal matrix with

di

= 1 + λi

+ λi–2 + . . . + λ

i–k

as its ith entry. If |λi| < 1 for all i = 1, 2, . . ., n, then the ith entry of that diagonal

matrix will converge to 1/(1 – λi).

If A possesses one or more eigenvalues λ such that |λ| ≥ 1, then the limit of thediagonal matrix will not exist.

limk→°

limk→°

limk→°

limk→°

limk→°


29. The Jordan block matrix is

Since this is an upper triangular matrix, we can see that the only eigenvalue is λ = 1, withalgebraic multiplicity n. Solving for the eigenvectors leads to the system

( )λ I Jn− =

x

0 1 0 0 0

0 0 1 0 0

0 0 0 1 0

0 0 0 0 1

x.

Jn =

1 1 0 0 0

0 1 1 0 0

0 0 1 1 0

0 0 0 1 1

.


EXERCISE SET 7.3

1. (a) The characteristic equation is λ(λ – 5) = 0. Thus each eigenvalue is repeated onceand hence each eigenspace is 1-dimensional.

(c) The characteristic equation is λ2(λ – 3) = 0. Thus the eigenspace corresponding to λ= 0 is 2-dimensional and that corresponding to λ = 3 is 1-dimensional.

(e) The characteristic equation is λ3(λ – 8) = 0. Thus the eigenspace corresponding to λ= 0 is 3-dimensional and that corresponding to λ = 8 is 1-dimensional.

2. The eigenvalues of A are λ = 4 and λ = 2. The eigenspaces are spanned by the orthogonalvectors

respectively. If we normalize these vectors, we obtain

Thus

P =−

1

2

1

21

2

1

2

1

21

2

1

21

2

−

and

1

1

1

1

−

and

357

and

If we interchange the order of the vectors which span the eigenspaces, we obtain

and

6. The eigenvalues of A are λ = 2 and λ = 0. The eigenspace associated with λ = 2 is spanned

by ; the eigenspace associated with λ = 0 is spanned by and . If we

normalize these three orthogonal vectors, we obtain

Thus

P = −

1

2

1

20

1

2

1

20

0 0 0

1

21

20

1

21

20

−

0

0

1

0

0

1

1

1

0

−

1

1

0

P AP− =

1 2 0

0 4

P =−

1

2

1

21

2

1

2

P AP− =

1 4 0

0 2


and

Another possibility (which results from changing the order of the spanning vectors) is

so that

8. The eigenvalues of A are λ = 0, λ = 4, and λ = 2. The eigenspace associated with λ = 0 is

spanned by and ; the eigenspaces associated with λ = 4 and λ = 2 are

spanned by and respectively. Hence, the eigenspaces of A are spanned by the

orthogonal vectors

0

0

1

0

0

0

0

1

1

1

0

00

1

1

0

0

−

1

1

0

0

−

1

1

0

0

0

0

0

1

0

0

1

0

P AP− =

10 0 0

0 0 0

0 0 2

P = −

1

20

1

21

20

1

21

21

1

2

P AP− =

12 0 0

0 0 0

0 0 0


Normalizing each of these vectors yields

Thus

and

Changing the order of the spanning vectors may change the position of the eigenvalues onthe diagonal of P–1 AP.

10. The characteristic equation is λ2 – 2aλ + a2 – b2 = 0. By the quadratic formula, λ = a ± |b|,or λ = a ± b. Since b ≠ 0, there are 2 distinct eigenvalues with eigenspaces spanned by theorthogonal vectors

1

1

1

1

−

and

P AP− =

1

0 0 0 0

0 0 0 0

0 0 4 0

0 0 0 2

P

P

= −

−

0 0 1 2 1 2

0 0 1 2 1 2

1 0 0 0

0 1 0 0

11

0 0 0

0 0 0 1

1 2 1 2 0 0

1 2 1 2 0 0

=

−

1

0

0

1

0

0

0

0

1

1 2

1 2

0

0

−

1 2

1 2

0

0


Thus

will orthogonally diagonalize the given matrix.

12. (a) By Theorem 7.3.1, it will suffice to show that I – v vT is symmetric. This follows fromproperties of the transpose.

(b) In this case,

The characteristic equation is (λ – 1) (λ – 1)(λ + 1) = 0, so the eigenvalues are λ = ±1.

Two linearly independent eigenvectors corresponding to λ = 1 are and .

An eigenvector corresponding to λ = –1 is . Thus the matrix

will orthogonally diagonalize I – vvT.

P =

−

1 2 0 1 2

0 1 0

1 2 0 1 2

1

0

1

0

1

0

1

0

1−

IT− =

−

v v

1 0 0

0 1 0

0 0 1

1

0

1

=−

−

1 0 1

0 0 1

0 1 0

1 0 0

1

2

1

21

2

1

2

−


13. By the result of Exercise 17, Section 7.1, the eigenvalues of the symmetric 2 × 2 matrix

, are Since (a – d)2 + 4b2 cannot be negative,

the eigenvalues are real.

14. (a) True, since both are symmetric.

(b) True, since v1 and v2 must be orthogonal.

(c) False. For instance, let . Then A is orthogonal, but not

symmetric and hence not orthogonally diagonalizable.

(d) Since A is orthogonally diagonalizable, it follows from Theorem 7.3.1 that it must besymmetric, that is, A = AT. And since A is also invertible, we have

I = IT = (A–1 A)T = AT (A–1)T = A(A–1)T

This implies that A–1 = (A–1)T, so that A–1 is symmetric and hence is orthogonallydiagonalizable.

15. Yes. Notice that the given vectors are pairwise orthogonal, so we consider the equation

P–1 AP = D

or

A = PDP–1

where the columns of P consist of the given vectors each divided by its norm and where Dis the diagonal matrix with the eigenvalues of A along its diagonal. That is,

P D=

−

=−0 1 0

1 2 0 1 2

1 2 0 1 2

1 0 0

0 3 0

0

and

00 7

A =−

1 2 3 2

3 2 1 2

λ ( )= + ± −( ) +

1

24

2 2a d a d b

a b

b a


From this, it follows that

Alternatively, we could just substitute the appropriate values for λ and x in the equationAx = λx and solve for the matrix A.

16. The converse of Theorem 7.3.2(b) is: “Eigenvectors from the same eigenspace are notorthogonal.” This is false. If we have an n-dimensional eigenspace, with n > 1, then we canuse the Gram-Schmidt process to convert any basis of eigenvectors to an orthogonal basis.Doing this, we have constructed a set of eigenvectors from the same eigenspace that areorthogonal.

A PDP= =

13 0 0

0 3 4

0 4 3



1. (a) The characteristic equation of A is λ2 – 2 cos θ + 1 = 0. The discriminant of thisequation is 4 (cos2 θ – 1), which is negative unless cos2 θ = 1. Thus A can have no realeigenvalues or eigenvectors in case 0 < θ < π.

3. (a) If

then D = S2, where

Of course, this makes sense only if a21

≥ 0, . . ., an ≥ 0.

(b) If A is diagonalizable, then there are matrices P and D such that D is diagonal and D= P–1 AP. Moreover, if A has nonnegative eigenvalues, then the diagonal entries of Dare nonnegative since they are all eigenvalues. Thus there is a matrix T, by virtue ofPart (a), such that D = T2. Therefore,

A = PDP–1 = PT2 P–1 = PTP–1 PTP–1 = (PTP–1)2

That is, if we let S = PTP–1, then A = S2.

S

a

a

an

=

1

2

0 0

0 0

0 0

D

a

a

an

=

1

2

0 0

0 0

0 0

365

3. (c) The eigenvalues of A are λ = 9, λ = 1, and λ = 4. The eigenspaces are spanned by thevectors

Thus, we have

while

Therefore

4. (a) Since (λI – A)T = λI – AT, we have

det(λI – A) = det((λI – A)T)

= det(λI – AT)

Thus A and AT have the same characteristic polynomials, and hence the sameeigenvalues.

(b) By the result of Exercise 18, Section 7.1, if A has distinct eigenvalues λ1 and λ2, andif bc ≠ 0, then the eigenvectors of A are

−−

−−

b

a

b

aλ λ1 2and

S PTP= =

−11 1 0

0 2 1

0 0 3

D T=

=

9 0 0

0 1 0

0 0 4

3 0 0

0 1 0

0 0 2

and

P P=

= −−1 1 1

2 0 1

2 0 0

0 0 1 2

1 2 1 2

0 1

1and

−−

1

1

2

2

1

0

0

1

1

0


Corresponding eigenvectors of AT are

Thus we need only look for a matrix with two distinct eigenvalues, with bc ≠ 0, andwith b ≠ c. For instance, if we let

then the resulting pairs of eigenvectors will be different.

5. Since det(λI – A) is a sum of signed elementary products, we ask which terms involve λn–1.Obviously the signed elementary product

q= (λ – a11)(λ – a22

) …. (λ – ann)

= –n – (a11 + a22 + …. + ann

)λn–1

+ terms involving λr where r < n – 1

has a term – (trace of A)λn–1. Any elementary product containing fewer than n – 1 factorsof the form λ – a

iicannot have a term which contains λn–1. But there is no elementary

product which contains exactly n – 1 of these factors (Why?). Thus the coefficient of λn–1

is minus the trace of A.

6. The only eigenvalue is λ = a and if b ≠ 0, then the eigenspace is spanned by the vector

. Thus, by Theorem 7.2.1, A is not diagonalizable.

7. (b) The characteristic equation is

p(λ) = –1 + 3λ – 3λ2 + λ3 = 0

1

0

A =−

1 1

3 1

−−

−−

c

a

c

aλ λ1 2and


Moreover,

and


p(A) = –I + 3A – 3A2 + A3 = 0

8. (b) Let A be an n × n matrix, so that A = PDP–1, where D is a diagonalizable matrix whoseentries on the main diagonal are the eigenvalues of A. Then, for any positive integer m,we have

Am = (PDP–1)m = (PDP–1)(PDP–1) … (PDP–1) = PDn P–1

If the characteristic equation of A is

p(λ) = a0 + a1λ + a2λ2 + … + a

nλn = 0,

we then have

p(A) = a0I + a1A + a2A2 + … a

nAn

= a0I + a1(PDP–1) + a2(PDP–1)2 + … + an(PDP–1)n

= a0I + a1(PDP–1) + a2)PDP–1)2 + … + an(P⟨nP–1)

=P(a0I)P–1 + P(a1D)P–1 + P(a2D2)P–1 + … P(a

nDn)P–1

= P[a0I + a1D + a2S2 + … + a

nDn]P–1

A3

1 3 3

3 8 6

6 15 10

=−−

−

A2

0 0 1

1 3 3

3 8 6

= −−


However, each λi

is a root of the characteristic polynomial, so p(λi) = 0 for i = 1, . . ., n.

Then,

= 0.

Thus, a diagonalizable matrix satisfies its characteristic polynomial.

9. Since c0 = 0 and c1 = –5, we have A2 = 5A, and, in general, An = 5n–1 A.

p A P P( ) =

−

0 0 0

0 0 0

0 0 0

1

=

+P

a

a

a

a0

0

0

10 0

0 0

0 0

λ11

1 2

20 0

0 0

0 01

a

a

a

n

λ

λ

λ

+

112

2 22

22

0 0

0 0

0 0

a

a n

λ

λ

+ +

a

a

a

nn

nn

n nn

λ

λ

λ

1

2

0 0

0 0

0 0

=

+ + +

−P

P

a a a

a

nn

0 1 1 1

0

0 0

0

λ λ

++ + +

+ + +

a a

a a a

nn

n n nn

1 2 2

0 1

0

0 0

λ λ

λ λ

=

( )

( )

(

p

P

p

p

p

1

1

2

0 0

0 0

0 0

λλ

λ

nn

P

)

−1


11. Call the matrix A and show that A2 = (c1 + c2 + . . . + cn)A = [tr(A)]A. Thus Ak =

[tr(A)]k–1 A for k = 2, 3, . . . . Now if λ is an eigenvalue of A, then λk is an eigenvalue of Ak,so that in case tr(A) ≠ 0, we have that λk[tr(A)]k–1 = [λ/tr(A)]k–1 λ is an eigenvalue of Afor k = 2, 3, . . . . Why? We know that A has at most n eigenvalues, so that this expressioncan take on only finitely many values. This means that either λ = 0 or λ = tr(A). Why? In case tr(A) = 0, then all of the eigenvalues of A are 0. Why? Thus the only possibleeigenvalues of A are zero and tr(A). It is easy to check that each of these is, in fact, aneigenvalue of A.

Alternatively, we could evaluate det(Iλ – A) by brute force. If we add Column 1 toColumn 2, the new Column 2 to Column 3, the new Column 3 to Column 4, and so on, weobtain the equation

If we subtract Row 1 from each of the other rows and then expand by cofactors along thenth column, we have

= (–1)n+1 (λ – tr(A))(–λ)n–1 because the above matrix is triangular

= (–1)2n(λ – tr(A))λn–1

= λn–1(λ – tr(A))

Thus λ = tr(A) and λ = 0 are the eigenvalues, with λ = 0 repeated n – 1 times.

det( ) det

( )

I A

c c c c c c A

− =

− − − − − − −λ λ λ λ1 1 2 1 2 3 tr

−−− −

− − −

=

λλ λ

λ λ λ

0 0 0

0 0

0

( ) ( ( )) det− −

−− −− − −+1

0 0 0

0 0

01ntr Aλ

λλ λλ λ λ

− − − −

λ λ λ λ

det( ) detI A

c c c c c c c

λ

λ λ λ λ

− =

− − − − − − − −1 1 2 1 2 3 1 cc c

c c c c c c c c c

c

n

n

2

1 1 2 1 2 3 1 2

− −− − − − − − − − − −−

λ λ λ

11 1 2 1 2 3 1 2

1 1

− − − − − − − − −

− − −

c c c c c c c c

c c c

nλ λ

22 1 2 3 1 2− − − − − − −

c c c c c cn λ


12. (a) The characteristic polynomial of the given matrix is

If we add λ times the second row to the first row, this becomes

Expanding by cofactors along the first column gives

Adding λ2 times the second row to the first row yields

p

c c c

c

c( ) detλ

λ λ λ

λ

λ=

+ +

−

−

0 0 0

1 0 0

0 1 0

31 2 3

2

3

44

10 0 0 1

− +

−λ cn

p

c c

c

c( ) detλ

λ λ

λ

λ=

+

−

−

21 2

3

4

0 0 0

1 0 0

0 1 0

00 0 0 1 1 − +

−λ cn

p

c

( ) detλ

λ λ

λ

λ=

+

−

−

0 22

1

2

3

0 0 c

1 0 0 c

0 1 0 c

0 0 1 cn 10 −

−λ +

p

c

c

c( ) detλ

λλ

λ=−

−

0 0 0 0

1 0 0 0

0 1 0 0

0 0

1

2

3

−− +

−1 1λ cn


Expanding by cofactors along the first column gives

Eventually this procedure will yield

Adding λn–2 times the second row to the first row gives

or

p(λ) = c1 + c2λ + … + cn–1λ

n–2 + λn–1

which is the desired result.

14. The characteristic equation of an n × n matrix has degree n and therefore n roots. Sincethe complex roots of a polynomial equation with real coefficients occur in conjugate pairs,any such equation of odd degree must have at least one real root. Hence A has at least onereal eigenvalue.

16. The characteristic equation of the matrix is

(λ – 1)(λ + 2 (λ – 3)(λ + 3) = λ4 + λ3 – 11λ2 – 9λ + 18 = 0

(a) By Exercise 14 of Section 7.1, det(A) = 18.

(b) By Exercise 5, above, tr(A) = –1.

17. Since every odd power of A is again A, we have that every odd power of an eigenvalue ofA is again an eigenvalue of A. Thus the only possible eigenvalues of A are λ = 0, ±1.

pc c c

c

nn n

n

( ) detλ λ λ λλ

= + + + +− +

−− −

−

0

11 2 1

2 1

1

pc c c

c

nn

n

n

( ) detλ λ λ λλ

= + + +− +

−−

−

−

21 2 2

3

11

p

c c c

c

c( ) detλ

λ λ λ

λ

λ=

+ +

−

−

0 0 0

1 0 0

0 1 0

1 2 32

4

55

10 0 0 1

− +

−λ cn


EXERCISE SET 8.1

2. Using the usual notation, we have

T(u + v)= (2(u1 + v1) – (u2 + v2) + (u3 + v3), (u2 + v2) – 4(u3 + v3))

= (2u1 – u2 + u3, u2 – 4u3) + (2v1 –v2 + v3, v2 – 4v3)

= T(u) + T(v)

and

T(cu) = (2cu1 – cu2 + cu3, cu2 – 4cu3) = cT(u)

3 Since T (–u) = –u = u = T(u) ≠ –T(u) unless u = 0, the function is not linear.

5. We observe that

T(A1 + A2) = (A1 + A2)B = A1B + A2B = T(A1) + T(A2)

and

T(cA) = (cA)B = c(AB) = cT(A)

Hence, T is linear.

6. Since T(A + B) = tr(A + B) = tr(A) + tr(B) = T(A) + T(B) and T(cA) = tr(cA) = c(tr(A))= cT(A), T is linear.

373

8. (b) Since both properties fail, T is nonlinear. For instance,

10. (a) T is not linear because it sends the zero function to f(x) = 1.

(b) Since

T(f(x) + g(x))= T((f + g)(x)) = (f + g)(x + 1)

= f(x + 1) + g(x + 1) = T(f(x)) + T(g(x))

and

T(cf(x)) = cf(x + 1) = cT(f(x))

T is linear.

12. Let v = (x1, x2) = av1 + bv2 = a(1, 1) + b(1, 0). This yields the equations x1 = a + b and x2 = a, so that v = x2v1 + (x1 – x2)v2. Thus

T(v) = T(x1, x2) = x2T(v1) + (x1 – x2)T(v2)

= x2(1, –2) + (x1 – x2)(–4, 1)

= (–4x1 + 5x2, x1 – 3x2)

so that

T(2, –3) = (–8 – 15, 2 + 9) = (–23, 11)

14. Let v = (x1, x2, x3) = av1 + bv2 + cv3 = a(1, 1, 1) + b(1, 1, 0) + c(1, 0, 0). Thus x1 = a + b+ c, x2 = a + b, and x3 = a, so that v = x3v1 + (x2 – x3)v2 + (x1 – x2)v3. Therefore

T ka b

c dk a k b

kTa b

c d

= +

≠

2 2 2 2

in general.


T(v)= T(x1, x2, x3) = x3T(v1) + (x2 – x3)T(v2) + (x1 – x2)T(v3)

= x3(2, –1, 4) + (x2 – x3)(3, 0, 1) + (x1 – x2)(–1, 5, 1)

= (–x1 + 4x2 – x3, 5x1 – 5x2 – x3, x1 + 3x3)

and

T(2, 4, –1) = (–2 + 16 + 1, 10 –20 + 1, 2 – 3) = (15, –9, –1)

16. We have

T(2v1 – 3v2 + 4v3)= 2T(v1) – 3T(v2) + 4T(v3)

= (2, –2, 4) + (0, –9, –6) + (–12, 4, 8)

= (–10, –7, 6)

17. (a) Since T1 is defined on all of R2, the domain of T2 T1 is R2. We have T2

T1(x, y) =T2(T1(x, y)) = T2(2x, 3y) = (2x – 3y, 2x + 3y). Since the system of equations

2x – 3y = a

2x + 3y = b

can be solved for all values of a and b, the codomain is also all of R2.

(d) Since T1 is defined on all of R2, the domain of T2 T1 is R2. We have

T2(T1(x, y)) = T2(x – y, y + z, x – z) = (0, 2x)

Thus the codomain of T2 T1 is the y-axis.

18. (a) Since T1 is defined on all of R2, the domain of T3 T2

T1 is R2. We have

T3 T2

T1(x, y) = T3 T2 (–2y, 3x, x – 2y)

= T3(3x, x – 2y, –2y)

= (3x – 2y, x)

This vector can assume all possible values in R2, so the codomain is R2.


19. (a) We have

(b) Since the range of T1 is not contained in the domain of T2, T2 T1 is not well defined.

20. We have

(T1 T2)(p(x)) = T1(p(x + 1)) = p((x – 1) + 1) = p(x)

and

(T2 T1)(p(x)) = T2(p(x – 1)) = p((x + 1) –1) = p(x)

22. We have

T2 T1 (a0 + a1x + a2x

2) = T2 (a0 + a1(x + 1) + a2(x + 1)2)

= T2((a0 + a1 + a2) + (a1 + 2a2)x + a2 x2)

= (a0 + a1 + a2)x + (a1 + 2a2)x2 + a2x3

24. (b) We have

(T3 T2) T1(v) = (T3

T2)(T1(v))

= T3(T2(T1(v)))

= T3 T2

T1(v)

25. Since (1, 0, 0) and (0, 1, 0) form an orthonormal basis for the xy-plane, we have T(x, y, z)= (x, 0, 0) + (0, y, 0) = (x, y, 0), which can also be arrived at by inspection. Then T(T(x,y, z)) = T(x, y, 0) = (x, y, 0) = T(x, y, z). This says that T leaves every point in the x-y

plane unchanged.

( )( ) ( )T T A Aa c

b da d

T1 2 = =

= +tr tr


26. (a) From the linearity of T and the definition of kT, we have

(kT)(u + v) = k(T(u + v)) = k(T(u) + T(v))

= kT(u) + kT(v) = (kT)(u) + (kT)(v)

and

(kT)(cu) = k(T(cu)) = kcT(u) = c(kT)(u)

Therefore, kT is linear.

28. (a) F is linear since

F((x1, y1) + (x2, y2)) = F(x1 + x2, y1 + y2)

= (a1(x1 + x2) + b1(y1 + y2), a2 (x1 + x2) + b2 (y1 + y2))

= (a1x1 + b1y1, a2x1 + b2y1) + (a1x2 + b1y2, a2x2 + b2y2)

= F(x1, y1) + F(x2, y2)

and

F(c(x, y)) = F(cx, cy)

= (a1cx + b1cy, a2cx + b2cy)

= c(a1x + b1y, a2x + b2y)

= cF(x, y)

(b) Since F(c(x, y)) = F(cx, cy) = c2F(x, y), F is not a linear operator.

30. Let v = a1v1 + … + anv

nbe any vector in V. Then

T(v)= a1T(v1) + … + anT(v

n)

= a1v1 + … + anv

n

= v

Hence T is the identity transformation on V.


31. (b) We have

(c) We have

33. (a) True. Let c1 = c2 = 1 to establish Part (a) of the definition and let c2 = 0 to establishPart (b).

(b) False. All linear transformations have this property, and, for instance there is morethan one linear transformation from R2 to R2.

(c) True. If we let u = 0, then we have T(v) = T(–v) = –T(v). That is, T(v) = 0 for allvectors v in V. But there is only one linear transformation which maps every vector tothe zero vector.

(d) False. For this operator T, we have

T(v + v) = T(2v) = v0 + 2 v

But

T(v) + T(v) = 2T(v) = 2v0 + 2v

Since v0 ≠ 0, these two expressions cannot be equal.

34. Since B is a basis for the vector space V, we can write any vector v in V in one and only oneway as a linear combination v = c1v1 + c2v2 + … + c

nv

nof vectors from B. Thus for any

linear operator T, we have

T(v) = c1T(v1) + c2T(v2) + … + cnT(v

n)

That is, to define T, we need only specify where it maps each of the basis vectors in B. IfT is to map B into itself, then we have n choices for each of the n vectors T(v

i), and hence

a total of nn possible operators T.

35. Yes. Let T Pn → Pm be the given transformation, and let TR

Tn+1 → Rm+1 be thecorresponding linear transformation in the sense of Section 4.4. Let

n P

n→ Rn+1 be the

function that maps a polynomial in Pn

to its coordinate vector in Rn+1, and let m

Pm

→Rm+1 be the function that maps a polynomial in P

nto its coordinate vector in Rm+1.

( )( ) ( )J D e e dt ex x xx

+ = + ′ = −∫3 3 10

( )(sin ) (sin ) sin( ) – sin( ) sin( )J D x t dt x x= ′ = =00

xx

∫


By Example 7, both n

and m

are linear transformations. Theorem 5.4.1 implies that acoordinate map is invertible, so

m–1 is also a linear transformation.

We have T=m–1

TR

n, so T is a composition of linear transformations. Refer to the

diagram below:

Thus, by Theorem 8.1.2., T is itself a linear transformation.

T

TR

Pn

nϕ ϕ m–1

Pm

Rm+1Rn+1


EXERCISE SET 8.2

1. (a) If (1, –4) is in R(T), then there must be a vector (x, y) such that T(x, y) = (2x – y,–8x + 4y) = (1, –4). If we equate components, we find that 2x – y = 1 or y = t and x= (1 + t)/2. Thus T maps in finitely many vectors into (1, –4).

(b) Proceeding as above, we obtain the system of equations

2x – y = 5

–8x + 4y = 0

Since 2x – y = 5 implies that –8x + 4y = –20, this system has no solution. Hence (5, 0)is not in R(T).

2. (a) The vector (5, 10) is in ker(T) since

T(5, 10) = (2(5) – 10, –8(5) + 4(10)) = (0, 0)

(b) The vector (3, 2) is not in ker(T)since T(3, 2) = (4, –16) ≠ (0, 0).

3. (b) The vector (1, 3, 0) is in R(T) if and only if the following system of equations has asolution:

4x + y – 2z – 3w = 1

2x + y + z – 4w = 3

6x –9z + 9w = 0

This system has infinitely many solutions x = (3/2)(t – 1), y = 10 – 4t, z = t, w = 1where t is arbitrary. Thus (1, 3, 0) is in R(T).

381

4. (b) Since T(0, 0, 0, 1) = (–3, –4, 9) ≠ (0, 0, 0), the vector (0, 0, 0, 1) is not in ker(T).

5. (a) Since T(x2) = x3 ≠ 0, the polynomial x2 is not in ker(T).

6. (a) Since T(1 + x) = x + x2, the polynomial x + x2 is in R(T).

(c) Since there doesn’t exist a polynomial p(x) such that xp(x) = 3 – x2, then 3 – x2 is notin R(T).

7. (a) We look for conditions on x and y such that 2x – y = and –8x + 4y = 0. Since theseequations are satisfied if and only if y = 2x, the kernel will be spanned by the vector(1, 2), which is then a basis.

(c) Since the only vector which is mapped to zero is the zero vector, the kernel is 0 andhas dimension zero so the basis is the empty set.

8. (a) Since –8x + 4y = –4(2x – y) and 2x – y can assume any real value, the range of T isthe set of vectors (x, –4x) or the line y = –4x. The vector (1, –4) is a basis for thisspace.

(c) The range of T is just the set of all polynomials in P3 with constant term zero. The setx, x2, x3 is a basis for this space.

9. (a) Here n = dim(R2) = 2, rank(T) = 1 by the result of Exercise 8(a), and nullity(T) = 1by Exercise 7(a). Recall that 1 + 1 = 2.

(c) Here n = dim(P2) = 3, rank(T) = 3 by virtue of Exercise 8(c), and nullity(T) = 0 byExercise 7(c). Thus we have 3 = 3 + 0.

10. (a) We know from Example 1 that the range of T is the column space of. Using elementarycolumn operations, we reduce to the matrix

Thus and as well as and form a basis for the column space

and therefore also for the range of T.

0

1

1

1

0

2

0

1

1

1

5

7

1 0 0

5 1 0

7 1 0

1 0 0

0 1 0

2 1 0

or


(b) To investigate the solution space of A, we look for conditions on x, y, and z such that

The solution of the resulting system of equations is x = –14t, y = 19t, and z = 11t

where t is arbitrary. Thus one basis for ker(T)is the vector .

(c) By Parts (a) and (b), we have rank(T) = 2 and nullity(T) = 1.

(d) Since rank(A) = 2 and has 3 columns, nullity (A) = 3 – 2 = 1. This also follows fromTheorem 8.2.2 and Part (c) above.

12. (a) Since the range of T is the column space of A, we reduce using column operations.This yields

Thus and form a basis for the range of T.

(b) If

then x = –s – 4r, y = –s + 2r, z = s, and w = 7r where s and r are arbitrary. Thus,

That is, the set [–1, –1, 1, 0]T, [–4, 2, 0, 7]T is one basis for ker(T).

x

y

z

w

s

=

−−

+

1

1

1

0

−−

4

2

0

7

r

4 1 5 2

1 2 3 0

0

0

=

x

y

z

w

0

1

1

0

1 0 0 0

0 1 0 0

−

14

19

11

1 1 3

5 6 4

7 4 2

−−

x

y

z =

0

0

0


12. (c) By Parts (a) and (b), rank(T) = nullity(T) = 2.

(d) Since A has 4 columns and rank(A) = 2, we have nullity(A) = 4 – 2 = 2. This alsofollows from Theorem 8.2.2 and Part (c) above.

14. (a) The orthogonal projection on the xz-plane maps R3 to the entire xz-plane. Hence itsrange is the xz-plane. It maps only the y-axis to (0, 0, 0), so that its nullspace is they-axis.

16. (a) The nullity of T is 5 – 3 = 2.

(c) The nullity of T is 6 – 3 = 3.

18. (a) The dimension of the solution space of Ax = 0 is, by virtue of the Dimension Theorem,7 – 4 = 3.

(b) Since the range space of A has dimension 4, the range cannot be R5, which hasdimension 5.

19. By Theorem 8.2.1, the kernel of T is a subspace of R3. Since the only subspaces of R3 arethe origin, a line through the origin, a plane through the origin, or R3 itself, the resultfollows. It is clear that all of these possibilities can actually occur.

21. (a) If

then x = –t, y = –t, z = t. These are parametric equations for a line through the origin.

(b) Using elementary column operations, we reduce the given matrix to

1 0 0

3 5 0

2 8 0

−−

1 3 4

3 4 7

2 2 0

0

0

0−

=

x

y

z


Thus, (1, 3, –2)T and (0, –5, 8)T form a basis for the range. That range, which we caninterpret as a subspace of R3, is a plane through the origin. To find a normal to thatplane, we compute

(1, 3, –2) × (0, –5, 8) = (14, –8, –5)

Therefore, an equation for the plane is

14x – 8y – 5z = 0

Alternatively, but more painfully, we can use elementary row operations to reducethe matrix

to the matrix

Thus the vector (x, y, z)T is in the range of T if and only if 14x – 8y – 5z = 0.

22. Suppose that v is any vector in V and write

(*) v = a1v1 + a2 v2 + … + anvn

Now define a transformation T V | W by

T(v) = a1w1 + a2w2 + … + anw

n

Note that T is well-definied because, by Theorem 5.4.1, the constants a1, a2, … , an

in (*)are unique.

In order to complete the problem, you must show (i) that T is linear and (ii) that T(v

i) = w

ifor i = 1, 2, … , n.

23. The rank of T is at most 1, since dimR = 1 and the image of T is a subspace of R. So, weknow that either rank(T) = 0 or rank(T) = 1. If rank(T) = 0, then every matrix A is in thekernel of T, so every n × n matrix A has diagonal entries that sum to zero. This is clearlyfalse, so we must have that rank(T) = 1. Thus, by the Dimension Theorem (Theorem 8.2.3),dim (kerT) = n2 – 1.

1 0 1 4 3 5

0 1 1 3 5

0 0 0 14 8 5

− +( )−( )

− −

x y

x y

x y z

1 3 4

3 4 7

2 2 0

x

y

z−


24. (a) Suppose that v1, v2, … , vn is a basis for V and let

(*) v = a1v1 + a2v2 + … + a

nv

n

be an arbitrary vector in V. Since T is linear, we have

(**) T(v) = a1T(v1) + … + anT(v

n)

The hypothesis that dim(ker(T)) = 0 implies that T(v) = 0 if and only if v = 0. Sincev1, … , v

n is a linearly independent set, by (*), v = 0 if and only if

a1 = a2 = … = an

= 0

Thus, by (**), T(v) = 0 if and only if

a1 = a2 = … = an

= 0

That is, the vectors T(v1), T(v2), … , T(vn) form a linearly independent set in R(T).

Since they also span R(T), then dim(R(T)) = n.

26. If J(p) -2b, then the kernel of J is the set of all

p(x) = ax + b for which b = 0 except the y-axis.

27. If f(x)is in the kernel of D D, then f′′(x) = 0 or f(x) = ax + b. Since these are the onlyeligible functions f(x)for which f′′(x) = 0 (Why?), the kernel of D D is the set of allfunctions f(x) = ax + b, or all straight lines in the plane. Similarly, the kernel of D D D

is the set of all functions f(x) = ax2 + bx + c, or all straight lines except the y-axis andcertain parabolas in the plane.

28. (a) kernel, range

(b) (1, 1, 1)

(c) the dimension of V

(d) 3

29. (a) Since the range of T has dimension 3 minus the nullity of T, then the range of T hasdimension 2. Therefore it is a plane through the origin.

( ) ( / )ax b dx ax bx+ = + − −

∫ 21

1

1

1

2


(b) As in Part (a), if the range of T has dimension 2, then the kernel must have dimension1. Hence, it is a line through the origin.

30. (a) If T(f(x)) = f′′′′(x), then T(ax3 + bx2 + cx + d) = 0 for all polynomials in P3. Moreover,these are the only functions which are mapped to the zero vector, that is, the functionwhich is identically zero.

(b) If T transforms f(x) to its (n + 1)st derivative, then kernel (T) = Pn.


EXERCISE SET 8.3

1. (a) Clearly ker(T) = (0, 0), so T is one-to-one.

(c) Since T(x, y) = (0, 0) if and only if x = y and x = –y, the kernel is (0, 0) and T is one-to-one.

(e) Here T(x, y) = (0, 0, 0) if and only if x and y satisfy the equations x – y = 0, –x + y= 0, and 2x – 2y = 0. That is, (x, y) is in ker(T) if and only if x = y, so the kernel ofT is this line and T is not one-to-one.

2. (a) Since det(A) = 1 ≠ 0, then T has an inverse. By direct calculation

and so

3. (a) Since det(A) = 0, or equivalently, rank(A) < 3, T has no inverse.

Tx

xA

x

x

x x

x

− −

=

=

−− +

1 1

2

1 1

2

1 2

1

2

2 55 2x

A− =

−−

1 1 2

2 5

389

3. (c) Since A is invertible, we have

4. (a) Since the kernel of this transformation is the line x = 2y, the transformation is not one-to-one.

(c) Since the kernel of the transformation is (0, 0), the transformation is one-to-one.

5. (a) The kernel of T is the line y = –x since all points on this line (and only those points)map to the origin.

(b) Since the kernel is not (0, 0), the transformation is not one-to-one.

7. (b) Since nullity(T) = n – rank(T) = 1, T is not one-to-one.

(c) Here T cannot be one-to-one since rank(T) ≤ n < m, so nullity(T) ≥ 1.

8. (b) If p(x) = a0 + a1x + a2x2, then

T(p(x)) = a0 + a1 (x + 1) + a2(x + 1)2

= (a0 + a1 + a2) + (a1 + 2a2)x + a2x2

Thus T(p(x)) = 0 if and only if a2 = a1 + 2 a2 = a0 + a1 + a2 = 0, or a2 = a1 = a0 = 0.That is, the nullity of T is zero so T is one-to-one.

T

x

x

x

A

x

x

x

− −

=

=11

2

3

11

2

3

1

2

1

2

1

21

2

1

2

1

21

2

1

2

1

2

−

−

−

x

x

x

1

2

3

=

− +1

2 1 2x x x33

1 2 3

1 2 3

1

21

2

( )

− + +( )

+ −( )

x x x

x x x


10. (a) Since infinitely many vectors (0, 0, … , 0, xn) map to the zero vector, T is not one-to-

one.

(c) Here T is one-to-one and T–1 (x1, x2, … , xn) = (x

n, x1, … , x

n–1).

11. (a) We know that T will have an inverse if and only if its kernel is the zero vector, whichmeans if and only if none of the numbers a

i= 0.

12. Note that T1 and T2 can be represented by the matrices

that is,

(a) Since det(AT1) = –2 ≠ 0 and det(AT2

) = –5 ≠ 0, it follows that both T1 and T2 are one-to-one.

(b) The transformations T1–1 and T2

–1 are represented by AT1–1 and AT2

–1, respectively, where

Then T2 T1 can be represented by

and (T2 T1)–1 can be represented by

A AT T2 1

1 1

10

3 1

1 3( ) =

−

−

A AT T2 1

3 1

1 3=

−

A AT T1 2

1 11

2

1 1

1 1

1

5

2 1

1 2− −=

−

=

−and

Tx

yA

x

yT

x

yA

x

yT T1 21 2

=

=and

A AT T1 2

1 1

1 1

2 1

1 2=

−

=

−

and


that is

12. (c) It is easy to verify that

AT1–1 AT2

–1 = (AT2AT1

)–1

Alternative Solution: If you want to do this the hard way, notice that if T1(x, y) =

(x + y, x – y), then T2–1(x, y) = and if T2(x, y) = (2x + y, x – 2y)

then T2–1(x, y) = . Also T2 T1(x, y) = T2(x + y, x – y) =

(3x + y, –x + 3y) and (T1 T2)–1(x, y) = .

13. (a) By inspection, T1–1(p(x)) = p(x)/x, where p(x) must, of course, be in the range of T1

and hence have constant term zero. Similarly T2–1(p(x)) = p(x – 1), where, again, p(x)

must be in the range of T2. Therefore (T2 T1)–1(p(x)) = p(x – 1)/x for appropriate

polynomials p(x).

16. If there were such a transformation T, it would have nullity zero (because it would be one-to-one) and rank equal to the dimension of W (because its range would be W). But theDimension Theorem guarantees that rank(T) + nullity(T) = dim(V), which is, byhypothesis, greater than dim(V). Thus there can be no such transformation.

17. (a) Since T sends the nonzero matrix to the zero matrix, it is not one-to-one.

(c) Since T sends only the zero matrix to the zero matrix, it is one-to-one. By inspection,T–1(A) = T(A).

Alternative Solution: T can be represented by the matrix

0 1

0 0

3

10

3

10

x y x y− +

,

2

5

2

5

x y x y+ −

,

x y x y+ −

2 2

,

( )T Tx

y

x

y2 1

1 1

10

3 1

1 3 −

=

−

=

−+

1

10

3

3

x y

x y


By direct calculation, TB

= (TB)–1, so T = T–1.

18. If T(x, y) = (x + ky, –y) = (0, 0), then x = y = 0. Hence ker(T) = (0, 0) and therefore Tis one-to-one. Since T(T(x, y)) = T(x + ky, –y) = (x + ky + k(–y), –(–y)) = (x, y), we haveT–1 = T.

19. Suppose that w1 and w2 are in R(T). We must show that

T–1(w1 + w2) = T–1(w1) + T–1(w2)

and

T–1(kw1) = kT–1(w1)

Because T is one-to-one, the above equalities will hold if and only if the results of applyingT to both sides are indeed valid equalities. This follows immediately from the linearity of thetransformation T.

21. It is easy to show that T is linear. However, T is not one-to-one, since, for instance, it sendsthe function f(x) = x – 5 to the zero vector.

24. (a) False. Since T is not one-to-one, it has no inverse. For T–1 to exist, T must map eachpoint in its domain to a unique point in its range.

(b) True. If T1(v1) = T1(v2) where v1 ≠ v2, then T2 T1(v1) = T2

T1(v2) where v1 ≠ v2.

(c) True, since such rotations and reflections are one-to-one linear transformations.

25. Yes. The transformation is linear and only (0, 0, 0) maps to the zero polynomial. Clearlydistinct triples in R3 map to distinct polynomials in P2.

TB =−

−

0 0 0 1

0 1 0 0

0 0 1 0

1 0 0 0


26. It does. T is a linear operator on M22. If EA = O for any matrix A, then A = E–1O = O. ThusT is one-to-one. Alternatively, if EA1 = EA2, then E(A, –A2) = O, so A1 = A2 = E–1O = O. Thatis, A1 = A2.

27. No. T is a linear operator by Theorem 3.4.2. However, it is not one-to-one since T(a) = a× a = 0 = T(0). That is, T maps a to the zero vector, so if T is one-to-one, a must be thezero vector. But then T would be the zero transformation, which is certainly not one-to-one.


EXERCISE SET 8.4

2. (a) Let B = 1, x, x2 = u1, u2, u3 and B′ = 1, x = v1, v2. Observe that

T(u1) = T(1) = 1 = 1v1

T(u2) = T(x) = 1 – 2x = 1v1 – 2v2

T(u3) = T(x2) = – 3x = –3v2

Hence the matrix of T with respect to B and B′ is

(b) We have

and

[T(c0 + c1x + c2x2)]

B′ = [(c0 + c1) – (2c1 + 3c2)x]B′

=+

− −

c c

c c

0 1

1 22 3

[ ] [ ]',T

c

c

c

B B Bx =− −

=1 1 0

0 2 3

0

1

2

c c

c c

0 1

1 22 3

+− −

1 1 0

0 2 3− −

395

4. (a) Since

and

we have

(b) Since

and

Formula (5a) does, indeed, hold.

6. (a) Observe that

T(v1) = (1, –1, 0) = v1 – v2

T

T

v v v v

v

2 1 2 3

3

1 1 13

2

1

2

1

2

0 0 1

( ) = − −( ) = − + +

( ) = (

, ,

, , )) = + −1

2

1

2

1

21 2 3v v v

Tx x

x xx x x

B

B

x( )[ ] =−+

= +( ) +1 2

1 21 2 1 2 22uu uu

[ ] [ ]Tx

xB B

B

x =−

=−

2 1

2 0

2 1

2 0

1

2

−( ) + +( )

=−

x xB1 2 2 1 2

2 1

2 0

u u u

−

=

+

x

x x

x x

x

2

2 1

1 2

22

[ ]T B =−

2 1

2 0

T( )u2 11

1=

−−

= −uu

T( )uu uu uu1 1 20

22 2=

= +


Thus the matrix of T with respect to B is

(b) If we solve for (x1, x2, x3)B= a1 v1 + a2v2 + a3v3, we find that

Thus we have

and

[T(x)]B

= [(x1 – x2, x2 – x1, x1 – x3)]B

=

− − − + −

− − + − +

( ) ( ) ( )

( ) ( )

x x x x x x

x x x x

1 2 2 1 1 3

1 2 2 1

2(( )

( ) ( ) ( )

x x

x x x x x x

1 3

1 2 2 1 1 3

2

2

−

− + − − −

=

− −

− + −

− +

3 2

22

2

2

1 2 3

1 2 3

1 3

x x x

x x x

x x

[ ] [ ]T B Bx =

−

−

−

13

2

1

2

11

2

1

2

01

2

1

2

− +

− + +

+ −

x x x

x x x

x x x

1 2 3

1 2 3

1 2

2

2

33

1 2 3

1 2 3

2

3 2

22

=

− +

− + +

x x x

x x x

22

21 3x x−

x =− + − + + − +

B

x x x x x x x x x1 2 3 1 2 3 1 2 3

2 2 2

T

13

2

1

2

11

2

1

2

01

2

1

2

−

−

−


6. (c) No. The transformation T is not invertible because x1 – x2 = –(x2 – x1) and thusrank(T) = 2 ≠ 3. We can see this because the reduced row-echelon form of the matrix[T]

Bof the transformation is

8. (a) Since

T(1) = x

T(x) = x(x – 3) = –3x + x2

T(x2) = x(x – 3)2 = 9x – 6x2 + x3

we have

(b) Since we are working with standard bases, we have

so that T(1 + x – x2) = –11x + 7x2 – x3.

(c) We have T(1 + x – x2) = x(1 + (x – 3) – (x – 3)2) = –11x + 7x2 – x3.

9. (a) Since A is the matrix of T with respect to B, then we know that the first and secondcolumns of A must be [T(v1)]

Band [T(v2)]

B, respectively. That is

T

T

B

B

( )

( )

v

v

1

1

1

2

3

5

=−

=

[ ] [ ]',T B B Bx =−

−

0 0 0

1 3 9

0 1 6

0 0 1

1

1

1

0

11

7

1−

=−

−

[ ] ,T B B′ =−

−

0 0 0

1 3 9

0 1 6

0 0 1

1 0 1

0 1 1

0 0 0

−−

≠ .I


Alternatively, since v1 = 1v1 + 0v2 and v2 = 0v1 + 1v2, we have

and

(b) From Part (a),

and

(c) Since we already know T(v1) and T(v2), all we have to do is express [x1 x2]T in terms

of v1 and v2. If

then

x1 = a – b

x2 = 3a + 4b

or

a = (4x1 + x2)/7

b = (–3x1 + x2)/7

Thus

x

x

x x1

2

1 24

7

3

5

3

=

+−

+

− xx x

x x

x x

1 2

1 2

1 2

7

2

29

18

7107 24

7

+ −

=

+

− +

x

xa b a b

1

21 2

1

3

1

4

= + =

+

−

v v

T( )v v v2 1 23 52

29= + =

−

T( )v v v1 1 223

5= − =

−

( )T AB

v20

1

3

5 =

=

T AB

( )v1 =

=

−

1

0

1

2


9. (d) By the above formula,

11. (a) The columns of A, by definition, are [T(v1)]B, [T(v2)]

B, and [T(v3)]

B, respectively.

(b) From Part (a),

T(v1) = v1 + 2v2 + 6v3 = 16 + 51x + 19x2

T(v2) = 3v1 –2v3 = –6 – 5x + 5x2

T(v3) = –v1 + 5v2 + 4v3 = 7 + 40x + 15x2

(c) Let a0 + a1x + a2x2 = b0v1 + b1v2 + b2v3. Then

a0 = – b1 + 3b2

a1 = 3b0 + 3b1 + 7b2

a2 = 3b0 + 2b1 + 2b2

This system of equations has the solution

b0 = (a0 – a1 + 2a2)/3

b1 = (–5a0 + 3a1 – 3a2)/8

b2 = (a0 + a1 – a2)/8

Thus

T(a0 + a1x + a2x2) = b0T(v1) + b1T(v2) + b2T(v3)

=− +

+− +

+

239 161 247

24

201 111 247

8

0 1 2

0 1 2

a a a

a a ax

61 31 107

120 1 2 2a a a

x− +

T1

1

19 7

83 7

=

−


(d) By the above formula,

T(1 + x2) = 2 + 56x + 14x2

13. (a) Since

T1(1) =2 and T1(x) = –3x2

T2(1) = 3x T2(x) = 3x2 and T2(x2) = 3x3

T2 T1(1) = 6x and T2 T1(x) = –9x3

we have

and

(b) We observe that here

[T2 T1]B′,B = [T2]B′,B′′ [T1]B′′,B

T TB2 1

0 0

6 0

0 0

0 9

ο′, =

−

Β

[ ] [ ], ,T TB B B B1 2

2 0

0 0

0 3

0 0 0

3 0′′ ′ ′′=

−

=00

0 3 0

0 0 3


15. If T is a contraction or a dilation of V, then T maps any basis B = v1, … , vn of V to

kv1, … , kvn where k is a nonzero constant. Therefore the matrix of T with respect to B is

17. The standard matrix for T is just the m × n matrix whose columns are the transforms of thestandard basis vectors. But since B is indeed the standard basis for Rn, the matrices are thesame. Moreover, since B′ is the standard basis for Rm, the resulting transformation will yieldvector components relative to the standard basis, rather than to some other basis.

18. (a) Since D(1) = 0, D(x) = 1, and D(x2) = 2x, then

is the matrix of D with respect to B.

(b) Since D (2) = 0, D(2 – 3x) = –3 = – 3–2

p1 and D(2 – 3x + 8x2) = – 23—6

p1 – 16—6

p2, the

matrix of D with respect to B is

03

2

23

6

0 016

3

0 0 0

−

−

0 1 0

0 0 2

0 0 0

k

k

k

k

0 0 0

0 0 0

0 0 0

0 0 0


(c) Using the matrix of Part (a), we obtain

(d) Since 6 – 6x + 24x2 = p1 – p2 + 3p3, we have

or

D(6 – 6x + 24x2) = 13(2) – 16(2 – 3x) = –6 + 48x

19. (c) Since D(f1) = 2f1, D(f2) = f1 + 2f2, and D(f3) = 2f2 + 2f3, we have the matrix

20. The upper left-hand corner is all vectors in the space V. The upper right-hand corner is allvectors in the range of T. Thus it is a subspace of W. Since V is a real vector space, thelower left-hand corner is all of R4. The lower right-hand corner is the range of thetransformation obtained by multiplying every element of R4 by the matrix [T]

B′,B. It is thusa subspace of R7.

2 1 0

0 2 2

0 0 2

D x xB

( )6 6 24

03

2

23

6

0 016

30 0 0

2− +

=

−

−

−

= −1

1

3

13

166

0

B

D x x( )6 6 24

0 1 0

0 0 2

0 0 0

6

6

24

2− + =

−

=−

= − +6

48

0

6 48x


EXERCISE SET 8.5

1. First, we find the matrix of T with respect to B. Since

and

then

In order to find P, we note that v1 = 2u1 + u2 and v2 = –3u1 + 4u2. Hence the transitionmatrix from B′ to B is

Thus

P− =

1

4

11

3

111

11

2

11

P =−

2 3

1 4

A TB

= =−−

1 2

0 1

T( )u2

12 =−−

T( )u11

0=

405

and therefore

2. In order to compute A = [T]B, we note that

and

Hence

In order to find P, we note that

v1 = (1.3)u1 + (–0.4)u2

v2 = (–0.5)u1

Hence

P =−

−

. .

.

1 3 0 5

0 4 0

A TB

= =−

0 8 6 1

3 6 3 8

. .

. .

A TB

= =−

= +

3

166 1 3 81 2( . ) ( . )u u

T( ) ( . ) ( . )u u u1 1 216

20 8 3 6=

−

= +

′ = = =−

−−′

−A T P T PB B[ ] [ ]1 1

11

4 3

1 2

1 2

0 1

−

=− −

−

2 3

1 4

3

11

56

112

11

3

11


and


3. Since T(u1) = (1/ 2, 1/ 2) and T(u2) = (–1/ 2, 1/ 2), then the matrix of T withrespect to B is

From Exercise 1, we know that

Thus

5. Since T(e1) = (1, 0, 0), T(e2) = (0, 1, 0), and T(e3) = (0, 0, 0), we have

In order to compute P, we note that v1 = e1, v2 = e1 + e2, and v3 = e1 + e2 + e3. Hence,

A TB

= =

1 0 0

0 1 0

0 0 0

A T P APB

′ = = =−

−'

1 1

11 2

13 25

5 9

P P=−

=

−

−2 3

1 4

1

11

4 3

1 21and

A TB

= =−

1 2 1 2

1 2 1 2

A T P APB

′ ′= = =−−

−1 15 5 4 5

37 5 12 5

. .

. .

P− =

−− −

1 0 2 5

2 6 5

.

.


and

Thus

7. Since

and

we have

We note that q p p and q p p Hence1 1 2 2 1 22

9

1

3

7

9

1

6= − + = − .

TB

=−

2

3

2

91

2

4

3

T x( )p p p2 1 212 22

9

4

3= + = − +

T x( )p p p1 1 29 32

3

1

2= + = +

TB

=−

−

′

1 1 0

0 1 1

0 0 1

1 0 0

0 11 0

0 0 0

1 1 1

0 1 1

0 0 1

1 0 0

0

= 11 1

0 0 0

P− =

−−

11 1 0

0 1 1

0 0 1

P =

1 1 1

0 1 1

0 0 1


and

Therefore

8. (a) Since T(1, 0) = (3, –1) and T(0, 1) = (–4, 7), we have

(c) Since T(1) = 1, T(x) = x – 1, and T(x2) = (x – 1)2, we have

9. (a) If A and C are similar n × n matrices, then there exists an invertible n × n matrix Psuch that A = P–1CP. We can interpret P as being the transition matrix from a basis B′for Rn to a basis B. Moreover, C induces a linear transformation T Rn → Rn where C= [T]

B. Hence A = [T]

B′. Thus A and C are matrices for the same transformation withrespect to different bases. But from Theorem 8.2.2, we know that the rank of T isequal to the rank of C and hence to the rank of A.

det( )T =−

− =1 1 1

0 1 2

0 0 1

1

det( )T =−

−=

3 4

1 717

TB

=

−

′

3

4

7

23

21

2

3

2

91

2

4

3

−

−

=

2

9

7

91

3

1

6

1 1

0 1

P− =

1

3

4

7

23

21

P =−

−

2

9

7

91

3

1

6


Alternate Solution: We observe that if P is an invertible n × n matrix, then P2represents a linear transformation of Rn onto Rn. Thus the rank of the transformationrepresented by the matrix CP is the same as that of C. Since P–1 is also invertible, itsnull space contains only the zero vector, and hence the rank of the transformationrepresented by the matrix P–1 CP is also the same as that of C. Thus the ranks of Aand C are equal. Again we use the result of Theorem 8.2.2 to equate the rank of alinear transformation with the rank of a matrix which represents it.

Second Alternative: Since the assertion that similar matrices have the same rank dealsonly with matrices and not with transformations, we outline a proof which involvesonly matrices. If A = P–1 CP, then P–1 and P can be expressed as products ofelementary matrices. But multiplication of the matrix C by an elementary matrix isequivalent to performing an elementary row or column operation on C. From Section5.5, we know that such operations do not change the rank of C. Thus A and C musthave the same rank.

10. (a) We use the standard basis for P4. Since T(1) = 1, T(x) = 2x + 1, T(x2) = (2x + 1)2,T(x3) = (2x + 1)3, and T(x4) = (2x + 1)4, we have

(b) Thus rank(T) = 5, or [T] is invertible, so, by Theorem 8.4.3, T is one-to-one.

11. (a) The matrix for T relative to the standard basis B is

The eigenvalues of [T]B

are λ = 2 and λ = 3, while corresponding eigenvectors are(1, –1) and (1, –2), respectively. If we let

P =− −

=

− −

1 1

1 2

2 1

1 1then P-1

TB

=−

1 1

2 4

det( )T =

1 1 1 1 1

0 2 4 6 8

0 0 4 12 24

0 0 0 8 32

0 0 0 0 16

1024=


and

is diagonal. Since P represents the transition matrix from the basis B′ to the standardbasis B, we have

as a basis which produces a diagonal matrix for [T]B′.

12. (b) The matrix for T relative to the standard basis B is

The eigenvalues of [T]B

are λ = 1 and λ = –2. The eigenspace corresponding to λ = 1

is spanned by the vectors and and the eigenspace correspond to λ = –2

is spanned by the vector . If we let

then P–1[T]BP will be diagonal. Hence the basis

will produce a diagonal matrix for [T]B′.

B′ =−

1

1

0

1

0

1

1

, , 11

1−

P =−

−

1 1 1

1 0 1

0 1 1

1

1

1−

1

0

1

−

1

1

0

TB

=−

−

0 1 1

1 0 1

1 1 0

B′ = ,1

1

1

2−

−

P T PB− =

1 2 0

0 3[ ]


13. (a) The matrix of T with respect to the standard basis for P2 is

The characteristic equation of A is

λ3 – 2λ2 – 15λ + 36 = (λ – 3)2(λ + 4) = 0

and the eigenvalues are therefore λ = –4 and λ = 3.

(b) If we set λ = –4, then (λI – A)x = 0 becomes

The augmented matrix reduces to

and hence x1 = –2s, x2 = 8–3s, and x3 = s. Therefore the vector

is a basis for the eigenspace associated with λ = –4. In P2, this vector represents thepolynomial –2 + 8–

3x + x2.

If we set λ = 3 and carry out the above procedure, we find that x1 = 5 s, x2 = –2s,and x3 = s. Thus the polynomial 5 – 2x + x2 is a basis for the eigenspace associatedwith λ = 3.

−

2

8 3

1

1 0 2 0

0 1 8 3 0

0 0 0 0

−

− − −−

− −

9 6 2

0 3 8

1 0 2

1

2

3

x

x

x

=

0

0

0

A = − −−

5 6 2

0 1 8

1 0 2


14. (a) We look for values of λ such that

or

If we equate corresponding entries, we find that

λa –2c = 0

–a + λb – c = 0(*)

– b + (λ + 2)c = 0

(λ – 1)d= 0

This system of equations has a nontrivial solution for a, b, c, and d only if

or

(λ – 1)(λ + 2)(λ2 – 1) = 0

Therefore the eigenvalues are λ = 1, λ = –2, and λ = –1.

(b) We find a basis only for the eigenspace of T associated with λ = 1. The basesassociated with the other eigenvalues are found in a similar way. If λ = 1, the equationT(x) = λx becomes T(x) = x and the augmented matrix for the system of equations(*) above is

1 0 2 0 0

1 1 1 0 0

0 1 3 0 0

0 0 0 0 0

−− −

−

det

λλ

λλ

0 2 0

1 1 0

0 1 2 0

0 0 0 1

−− −

− +−

= 0

2

2

c a c

b c d

a b

c d

+−

=

λ λλ λ

Ta b

c d

a b

c d

=

λ


This reduces to

and hence a = 2t, b = 3t, c = t, and d = s. Therefore the matrices

form a basis for the eigenspace associated with λ = 1.

15. If v is an eigenvector of T corresponding to λ, then v is a nonzero vector which satisfiesthe equation T(v) = λv or (λI – T)v = 0. Thus λI – T maps v to 0, or v is in the kernel ofλI – T.

17. Since C[x]B

= D[x]B

for all x in V, we can, in particular, let x = vi

for each of the basisvectors v1, … , v

nof V. Since [v

i]B

= e3i

for each i where e1, … , e

n is the standard basis

for Rn, this yields Cei

= Dei

for i = 1, … , n. But Cei

and Dei

are just the ith columns of Cand D, respectively. Since corresponding columns of C and D are all equal, we have C = D.

18. Let B be the standard basis for R2, and let

B′ = (cos θ, sin θ), (–sin θ, cos θ) = v1, v2

be the basis consisting of the unit vector v1 lying along and the unit vector v2perpendicular to . Note that to change basis from B to B′, we rotate through an angle –θ.Now relative to B′, T is just the orthogonal projection on the v1-axis, which is accomplishedby setting the v2-coordinate equal to zero. Then to return to the basis B, we rotate throughan angle θ. Thus

2 3

1 0

0 0

0 1

and

1 0 2 0 0

0 1 3 0 0

0 0 0 0 0

0 0 0 0 0

−−


19. (a) False. Every matrix is similar to itself, since A = I–1 AI.

(b) True. Suppose that A = P–1 BP and B = Q–1 CQ. Then

A = P–1(Q–1 CQ)P = (P–1Q–1)C(QP) = (QP)–1C(QP)

Therefore A and C are similar.

(c) True. By Table 1, A is invertible if and only if B is invertible, which guarantees that Ais singular if and only if B is singular.

Alternatively, if A = P–1 BP, then B = PAP–1. Thus, if B is singular, then so is A.Otherwise, B would be the product of 3 invertible matrices.

(d) True. If A = P–1 BP, then A–1(P–1 BP)–1 = P–1 B–1(P–1)–1 = P–1B–1P, so A–1 and B–1 aresimilar.

20. If A is invertible and B is singular, then it follows from Exercise 19(c)that A and B cannotbe similar. Hence

are not similar.

Moreover, if A and B are both invertible but have different determinants, then it followsfrom Table 1 that they cannot be similar. Hence.

are not similar.

A B=

=

1 0

0 1

2 0

0 1and

A B=

=

1 0

0 1

1 0

0 0and

Tx

yT T I

x

yB B B B B

=

′ ′ ′[ ] [ ] [ ], ,

=−

cos sin

sin cos

coθ θθ θ

1 0

0 0

ss sin

sin cos

cos sin c

θ θθ θ

θ θ

−

=

x

y

2 oos

sin cos sin

θ

θ θ θ2

x

y


22. Let B = P–1 AP so that A = PBP–1. Let λ be an eigenvalue common to A and B and let x bean eigenvector of A corresponding to λ. Then Ax = λx, or

Ax = (PBP–1)x = λx

so that

BP–1x = P–1 λx

Thus

B(P–1x) = λ(P–1x)

That is, P–1x is an eigenvector of B corresponding to λ.

24. For every f in V, T(f) = f(x – 2π). Thus, if f is an eigenvector with eigenvalue λ = 1, wehave T(f) = 1 • f, so f(x – 2π) = f(x) for all x. Thus, the eigenspace of λ = 1 is precisely theset of all functions f that are periodic with periods that evenly divide 2π.

25. First, we need to prove that for any square matrices A and B, the trace satisfies tr(A) =tr(B). Let

Then,

[ ]AB a b a b a b a b an n j

j

11 11 11 12 21 13 31 1 1 11

= + + + =+=

nn

j

n n

b

AB a b a b a b a b

∑

= + + + =+

1

22 21 21 22 22 23 32 2 2[ ] aa b

AB a b a b a b

j

j

n

j

nn n n n n n n

21

2

1 1 2 2 3 3

=

+

∑

= + + +[ ] aa b a bnn nn nj

j

n

jn==∑

1

.

A

a a a

a a a

a a a

n

n

n n nn

=

11 12 1

21 22 2

1 2

=and

b11

B

b b

b b b

b b

n

n

n n

12 1

21 22 2

1 2 bnn


Thus,

tr(AB) = [AB]11 + [AB]22 + … + [AB]nn

Reversing the order of summation and the order of multiplication, we have

Now, we show that the trace is a similarity invariant. Let B = P–1 AP. Then

tr(B) = tr(P–1 AP)

= tr((P–1 A)P)

= tr(P(P–1 A))

= tr(PP–1)A)

= tr(I A)

= tr(A).

tr( )AB b a

b a b

jk

k

n

jk

j

n

k

k

n

k k

k

n

=

= +

==

= =

∑∑

∑

11

11

1 21

∑∑ ∑+ +

= + + +=

=a b a

BA BA BA

k nk

k

n

kn

nn

21

11 22[ ] [ ] [ ]

ttr BA( ).

= + + +

=

= = =∑ ∑ ∑a b a b a b

a

j

j

n

j j j

j

n

nj jn

j

n

kj

j

21

1 2 21 1

===∑∑

11

n

kj

k

n

b .


EXERCISE SET 8.6

1. (a) This transformation is onto because for any ordered pair (a, b) in R2, T(b, a) = (a, b).

(b) We use a counterexample to show that this transformation is not onto. Since there isno pair (x, y) that satisfies T(x, y) = (1, 0), T is not onto.

(c) This transformation is onto. For any ordered pair (a, b) in R2, T = (a, b).

(d) This is not an onto transformation. For example, there is nopair (x, y) that satisfies T(x, y) = (1, 1, 0).

(e) The image of this transformation is all vectors in R3 of the form (a, –a, 2a). Thus, theimage of T is a one-dimensional subspace of R3 and cannot be all of R3. In particular,there is no vector (x, y) that satisfies T(x, y) = (1, 1, 0), and this transformation is notonto.

(f) This is an onto transformation. For any point (a, b) in R2, there are an infinite number

of points that map to it. One such example is T = (a, b).

2. The transformation TA

is not onto if the image of TA

is not all of Rn. Thus, rank(TA) < n, or

equivalently, null(T) > 0. This is equivalent to the matrix A being singular.

3. (a) We find that rank(A) = 2, so the image of T is a two-dimensional subspace of R3. Thus,T is not onto.

(b) We find that rank(A) = 3, so the image of T is all of R3. Thus, T is not onto.

(c) We find that rank(A) = 3, so the image of T is all of R3. Thus, T is onto.

(d) We find that rank(A) = 3, so the image of T is all of R3. Thus, T is onto.

4. (a) We find that rank(A) = 1 and the codomain is R3, so this is not onto.

(b) We find that rank(A) = 3 and the codomain is R3, so this is an onto transformation.

(c) We find that rank(A) = 2 and the codomain is R3, so this is not onto.

a b a b+ −

2 2

0, ,

a b a b+ −

2 2

,

419

5. (a) The transformation T is not a bijection because it is not onto. There is no p(x) inP2(x) so that xp(x) = 1.

(b) The transformation T(A) = AT is one-to-one, onto, and linear, so it is a bijection.

(c) By Theorem 8.6.1, there is no bijection between R4 and R3, so T cannot be a bijection. Inparticular, it fails being one-to-one. As an example, T(1, 1, 2, 2) = T(1, 1, 0, 0) = (1, 1, 0).

(d) Because dim P3 = 4 and R3 = 3, Theorem 8.6.1 states that there is no bijection betweenP3 and R3, so T cannot be a bijection. In particular, it fails being one-to-one. As anexample, T(x + x2 + x3) = T(1 + x + x2 + x3) = (1, 1, 1).

6. Let T V → W be a bijection. Then T is one-to-one, onto, and invertible.First, we show that T–1 W → V is one-to-one. If T–1(w1) = T–1(w2), then T(T–1(w1)) =

T(T–1(w2)) so w1 = w2, and T–1 is one-to-one. Second, we show that T–1 is onto. Let v beany vector in V, and let w = T(v). Then T–1(w = T(w) = T–1(T(v)) = v, so v is in the imageof T–1 for any v in V, and T–1 is onto. Thus, the inverse of a bijection is again a bijection.

Now, we want to show that if T V → W is a bijective linear transformation, thenT–1 W → V is also a bijective linear transformation.

Because T is onto, for any w in W, there is a vector v in V so that w = T(v). Then,

T–1(cw) = T–1(cT(v)).

Since T is a linear transformation, cT(v) = T(cv), so

T–1(cw) = T–1(T(cv))

= cv

= cT–1(w).

Also, for any two vectors w1 and w2 in W, there exist vectors v1 and v2 in V so that w1 =T(v1) and w2 = T(v2), since T is onto. Thus,

T–1(w1 + w2) = T–1(v1) + T(v2)).

Since T is a linear transformation, T(v1 + T(v2) = T(v1 + v2), and we have

T–1(w1 + w2) = T–1(v1 + v2))

= v1 + v2

= T–1(w1) + T–1(w2).

Thus, T–1 is a linear transformation.

7. Assume there exists a subjective (onto) linear transformation T V → W, where dim W >dim V. Let m = dim V and n = dim W, with m < n. Then, the matrix A

Tof the

transformation is an n × m matrix, with m < n. The maximal rank of AT

is m, so thedimension of the image of T is at most m. Since the dimension of the image of T is smallerthan the dimension of the codomain Rn, T is not onto. Thus, there cannot be a subjectivetransformation from V onto W if dim V < dim W.


If n = dim W ≤ dim V = m, then the matrix AT

of the transformation is an n × m matrixwith maximim possible rank n. If rank(A

T) = n, then T is a subjective transformation.

Thus, it is only possible for T V → W to be a subjective linear transformation if dim W≤ dim V.

8. Let V be the vector of all symmetric 3 × 3 matrices, and let T V → R6 be defined by

Clearly, T is one-to-one and onto, so T is a bijection. Because T is the coordinate mapfrom V to R6, it is a linear transformation (refer to Example 7 in Section 8.1). Thus, T is abijective linear transformation.

9. Let T V → Rn be defined by T(v) = (v)S, where S = u1, u2, … , u

n is a basis of V. We

know from Example 7 in Section 8.1 that the coordinate map is a linear transformation.Let (a1, a2, … , a

n) be any point in Rn. Then, for the vector v = a1u1 + a2u2 + … + a

nu

n, we

have

T(v) = T(a1u1 + a2u2 + … + anu

n) = (a1, a2, … , a

n)

so T is onto.

Also, let v1 = a1u1 + a2u2 + … + anu

nand v2 = b1u1 + b2u2 + … + b

nu

n. If T(v1) =

T(v2), then (v1_S= (v2)S

, and thus a1, a2, … , an) = (b1, b2, … , b

n). It follows that a1 = b1,

a2 = b2, … , an

= bn, and thus

v1 = a1u1 + a2u2 + … + anu

n) = v2.

So, T is one-to-one and is thus an isomorphism.

10. T1 and T2 are both bijective linear transformations, we know from Theorem 8.1.2 thatT1

T2 is also a linear transformation. Let T1 V → W and T2 U → V. Because T1 is onto,we know that for any w in W, there exists a v in V so that T1(v) = w. Similarly, because T2is onto, we know that for any v in V, there exists a u so that T2(u) = v. Then

(T1 T2)(u) = T1(T2(u)) = T1(v) = w,

and T1 T2 is onto.

If there are two vectors u1 and u2 so that (T1 T2)(u1) = (T1

T2)(u2), then T1(T2(u1))= T1(T2(u2)). Since T1 is one-to-one, this means that T2(u1) = T2(u2), and since T2 is one-to-one, this means that u1 = u2. Thus, the composition T1

T2 is one-to-one.

Thus, the composition of two bijective linear transformations is again a bijective lineartransformation.

T

a b c

b d e

c e f

a

b

c

d

e

f

=


11. Let V = Span1, sin x, cos x, sin 2x, cos 2x. Differentiation is a linear transformation (seeExample 11, Section 8.1). In this case, D maps functions in V into other functions in V. Toconstruct the matrix of the linear transformation with respect to the basis B = 1, sin x,cos x, sin 2x, cos 2x, we look at coordinate vectors of the derivatives of the basis vectors:

D(1) = 0 D(sin x) = cos x D(cos x = –sin x D(sin 2x) = 2 cos 2x

D(cos 2x) = 2 sin 2x

The coordinate matrices are:

Thus, the matrix of the transformation is

Then, differentiation of a function in V can be accomplished by matrix multiplication by theformula

[D(f)]B

= SD[f]

B.

The final vector, once transformed back to V from coordinates in R5, will be the desiredderivative.

For example,

Thus, D(3 – 4 sin x + sin 2 x + 5 cos 2x) = –4 cos x – 10 sin 2x + 2 cos 2x.

12. (a) We have

uV

= ⟨u,u⟩ 1/2—V

= ⟨TPu, T(u)⟩1/2—W

= T(u)W

[ ( sin sin cos )]D x x x AB D3 4 2 5 2

3

4

− + + =−

0

1

5

0 0 0 0 0

0 0 1 0

=− 0

0 1 0 0 0

0 0 0 0 2

0 0 0 2 0

−

−

=

3

4

0

1

5

0

.

0

4

10

2

−−

AD =−

−

0 0 0 0 0

0 0 1 0 0

0 1 0 0 0

0 0 0 0 2

0 0 00 2 0

[ ( )] [ (sin )]D D xB B1

0

0

0

0

0

0

0

1=

=00

0

0

1

0

0

=−

[ (cos )]D x B

00

2

0

0

0

0

2

=

[ (sin )]D x B

=−

[ (cos )]D x B2

0

0

0

2

0

.


so an inner product space isomorphism preserves lengths. The angle θV

between uand v in V is given by

and the angle θW

between T(u) and T(v) in W is given by

Thus, because the inner product space isomorphism preserves lengths,

and the inner product space isomorphism also preserves angles.

(b) If B = v1, v2, … , v2 is an orthonormal set in V, then

Thus, the set T(v1, Y(v2), … , T(vn) = is an orthonormal set in W.

However, in general this is not true. Consider the inner transformation T R2 → R2

defined by

This transformation is a change of basis transformation, and is thus one-to-one andonto, but the vectors T(e1) and T(e2) are not orthonormal. In fact, T*e2) = 2, and⟨T(e1), T(e2)⟩ = 1 ≠ 0. Thus, length and angles are not preserved with thistransformation.

(c) The set e1, e2, … , en is an orthonormal basis of W. By the Gram-Schmidt process, we

can construct an orthonormal basis v1, v2, … , vn of V. Define a linear transformation

T V | W by

T(vj) = e

j, j = 1, 2, … , n.

T e T e( ) ( )1 21

0

1

0=

=

and

( ( ), ( )) ( ,T Ti j

i ji j jv v v vif

1 if= =

≠=

10

cos, ,

θWw v

v

T T

T w T w=

( ) ( )( ) ( )

=( )

=u v

u v

u v

u v vcos .θV

cos,

.θWw

T T

T w T w=

( ) ( )( ) ( )

u v

u v

cos,

θ =( )u v

u vv

v v


Let u = a1v1 + a2v2 + … + anv

nand v = b1v1 + b2v2 + … + b

nv

n. Then

T(u) = a1T(v1) + a2T(v2) + … + anT(v

n)

= a1e1 + a2e2 + … + ane

n

Similarly,

T(v) = b1T(v1) + v2T(v2) + … + bnT(v

n)

= b1e1 + v2e2 + … + bne

n

So, if T(u) = T(v), then and a1 = b1, a2 = b2, … , an

= bn. Thus, u = v

and T is one-to-one.

For any vector in W, we have

T(c1v1 + c2v2 + … + cnv

n) = = w.

so T is onto.

c

c

cn

1

2

w =

c

c

cn

1

2

a

a

a

b

b

bn n

1

2

1

2

=

=

b

b

bn

1

2

=

a

b

an

1

2


We now need to show that T preserves the inner product. We have

⟨u, v⟩V

= ⟨a1v1 + a2v2 + … + anv

n, b1v1 + b2v2 + … + b

nv

n⟩

= ajb

k⟨v

j, v

k⟩.

Recall that if j ≠ k, then ⟨vj, v

k⟩ = 0. We then have

⟨u, v⟩V

= a1b1 ⟨v1, v1⟩ + a2b2 ⟨v2, v2⟩ + … + anb

n⟨v

n, v

n⟩.

Now recall that ⟨vj, v

j⟩ = 1. Then

⟨u, v⟩ = ⟨a1b1 + a2b2 + … + anb

n

We also have

⟨T(u), T(v)⟩W

= ⟨a1e1 + a2e2 + … + ane

n, b1e1 + b2e2 + … + b

ne

n⟩

= a1b1 + a2b2 + … + anb

n.

Thus, ⟨T(u), T(v)⟩W

= ⟨u, v⟩V

and T is an inner product space isomorphism.

(d) Let dim V = dim W = n. Then by Part (c), there is an inner product isomorphism T1

V → Rn and another inner product isomorphism T2 W → Rn. Then T2–1 Rn → W is

an isomorphism by Problem 6. And, since

⟨u, v⟩ = ⟨T2(u), T2(v)⟩Rn,

we rename x = T2(u) and y = T2(v) and we have

⟨T2–1(x), T2

–1(y)⟩W

= ⟨x, y⟩Rn.

Thus, T2–1 Rn → W is another inner product space isomorphism. Then, the

composition (T2–1 T1) V → W is an isomorphism by Problem 10. We have

⟨u, v⟩V

= ⟨T1(u), T1(v(⟩Rn

= ⟨T2–1(T1(u)), T2

–1(T1(v))⟩W

.

= ⟨(T2–1 T

1)(u), (T2

–1 T1)(v)⟩W

.

so T2–1 T

1is an inner product isomorphism from V to W.

=

.

a

a

a

b

b

bn n

1

2

1

2

j

n

j

n

==∑∑

11


(e) An orthonormal basis for P5 is 1, x, x2, x3, x4, x5, using the inner product

⟨a0 + a1x + a1x2 + … + a5x

5, b0 =+ b1x + b2x2 + … + b5x

5⟩P

5

= a0b

0+ a

1b

1+ … + a

5b

5.

An orthonormal basis for M22 is

using the inner product

⟨A, B⟩M

23= tr(AT B).

We can define a linear transformation T P5 → M23 by

The transformation T is one-to-one and onto. To see that is an inner productisomorphism, we need to compute ⟨u, v⟩

P5and ⟨T(u), T(v)⟩

M23. Let u = a0 + a1x +

a2x2 + … + a5x

5, and let v = b0 + b1x + b2x2 + … + b5x

5. Then we have

⟨u, v⟩P

5= a0b0 + a1b1 + … a5b5

and

Thus, ⟨u, v⟩P

5= ⟨T(u),T(v)⟩

M23

, and T is an inner product space isomorphism.

T Ta a a

a a a

b b b

bM( ), ( ) ,u v

23

0 1 2

3 4 5

0 1 2=

33 4 5

0 0 1 1 3 3

b b

a b a b a b

= + + + + .

T T x T11 0 0

0 0 0

0 1 0

0 0 0( ) =

( ) =

, , xx

T x T

2

3

0 0 1

0 0 0

0 0 0

1 0 0

( ) =

( ) =

,

, xx T x4 50 0 0

0 1 0

0 0 0

0 0 1( ) =

( ) =

, .

1 0 0

0 0 0

0 1 0

0 0 0

0 0 1

0 0 0

0

, , ,

00 0

1 0 0

0 0 0

0 1 0

0 0 0

0 0 1

, ,

,



3. By the properties ofan inner product, we have

T(v + w) = ⟨v + w, v0⟩v0

= (⟨v, v0J + ⟨w, v0⟩)v0

= ⟨v, v0⟩v0 + ⟨w, v0⟩v0

= T(v) + T(w)

and

T(kv) = ⟨kv, v0⟩v0 = k⟨v, v0⟩v0 = kT(v)

Thus T is a linear operator on V.

4. (a) By direct computation, we have

T(x + y) = ((x + y) • v1, … , (x + y) • vm

)

= (x • v1 + y • v1, … , x • vm

+ y • vm

)

= (x • v1, … , x • vm

) + (y • v1, … , y • vm

)

= T(x) + T(y)

and

T(kx) = ((kx) • v1, … , (kx) • vm

)

= (k(x • v1), … , k(x • vm

))

= kT(x)

427

4. (b) If e1, … , en is the standard basis for Rn, and if v

i= (a1i

, a2i, … , a

ni) for i = 1, … , m,

then

T(ej) = (a

j1, aj2, … , ajm

)

But T(ej), interpreted as a column vector, is just the jth column of the standard matrix

for T. Thus the ith row of this matrix is (a1i, a2i

, … , ani

), which is just vi.

5. (a) The matrix for T with respect to the standard basis is

We first look for a basis for the range of T; that is, for the space of vectors B such thatAx = b. If we solve the system ofequations

x + z + w = b1

2x + y + 3x + w = b2

x + w = b3

we find that = z = b1 – b3 and that any one of x, y, or w will determine the other two.Thus, T(e3) and any two of the remaining three columns of A is a basis for R(T).

Alternate Solution :We can use the method of Section 5.5 to find a basis for thecolumn space of A by reducing AT to row-echelon form. This yields

so that the three vectors

form a basis for the column space of T and hence for its range.

1

2

1

0

1

0

0

0

0

1 1

0 1 0

0 0 1

0 0 0

2

A =

1 0 1 1

2 1 3 1

1 0 0 1


Second Alternative: Note that since rank(A) = 3, then R(T) is a 3-dimensionalsubspace of R3 and hence is all of R3. Thus the standard basis for R3 is also a basis forR(T).

To find a basis for the kernel of T, we consider the solution space of Ax = 0. If we setb1 = b2 = b3 = 0 in the above system of equations, we find that z = 0, x = –w, and y =w. Thus the vector (–1, 1, 0, 1) forms a basis for the kernel.

7. (a) We know that T can be thought of as multiplication by the matrix

where reduction to row-echelon form easily shows that rank([T]B) = 2. Therefore the

rank of T is 2 and the nullity of T is 4 – 2 = 2.

(b) Since [T]B

is not invertible, T is not one-to-one.

9. (a) If A = P–1 BP, then

AT = (P–1 BP)T

= PT BT (P–1)T

= ((PT)–1)–1BT (P–1)T

= ((P–1)T)–1B–1 (PT)T

Therefore AT and BT are similar. You should verify that if P is invertible, then so is PT

and that (PT)–1 = (P–1)T.

10. If statement (i) holds, then the range of T is V and hence rank(T) = n. Therefore, by theDimension Theorem, the nullity of T is zero, so that statement (ii) cannot hold. Thus (i)and (ii) cannot hold simultaneously.

Now if statement (i) does not hold, then the range of T is a proper subspace of V andrank(T) < n. The Dimension Theorem then implies that the nullity of T is greater thanzero. Thus statement (ii) must hold. Hence exactly one of the two statements must alwayshold.

[ ]T B =

−− −

−−

1 1 2 2

1 1 4 6

1 2 5 6

3 2 3 2


11. If we let then we have

The matrix X is in the kernel of T if and only if T(X) = 0, i.e., if and only if

a + b + c = 0

2b + d = 0

d = 0

Hence

The space of all such matrices X is spanned by the matrix and therefore has

dimension 1. Thus the nullity is 1. Since the dimension of M22 is 4, the rank of T musttherefore be 3.

Alternate Solution. Using the computations done above, we have that the matrix for thistransformation with respect to the standard basis in M22 is

Since this matrix has rank 3, the rank of T is 3, and therefore the nullity must be 1.

1 1 1 0

0 2 0 1

0 0 0 1

0 0 0 1

,1 0

1 0−

Xa

a=

−

0

0

Ta b

c d

a c b d b b

d d

=

+ +

+

0 0

=+ + +

a b c b d

d d

2

Xa b

c d=

,


12. We are given that there exist invertible matrices P and Q such that A = P–1 BP and B = Q–1

CQ. Therefore

A = P–1(Q–1 CQ)P = (QP)–1 C(QP)

That is, A and C are similar.

13. The standard basis for M22 is the set of matrices

If we think of the above matrices as the vectors

[1 0 0 0]T, [0 1 0 0]T, [0 0 1 0]T, [0 0 0 1]T

then L takes these vectors to

[1 0 0 0]T, [0 0 1 0]T, [0 1 0 0]T, [0 0 0 1]T

Therefore the desired matrix for L is

14. (a) Reading directly from P, we have

v1 = 2u1 + u2

v2 = –u1 + u2 + u3

v3 = 3u1 + 4u2 + 2u3

1 0 0 0

0 0 0 0

0 1 0 0

0 0 0 1

1 0

0 0

0 1

0 0

0 0

1 0

1 0

0 1

, , ,


14. (b) Since

by direct calculation we have

u1 = –v1 – v2 + v3

u2 = 5v1 + 4v2 – 2v3

u3 = – 7v1 – 5v2 + 3v3

15. The transition matrix P from B′ to B is

Therefore, by Theorem 8.5.2, we have

Alternate Solution: We compute the above result more directly. It is easy to show that u1= v1, u2 = –v1 + v2, and u3 = –v2 + v3. So

T(v1) = T(u1) = –3u1 + u2 = –4v1 + v2

T(v2) = T(u1 + u2) = T(u1) + T(u2)

= u1 + u2 + u3 = v3

T(v3) = T(u1 + u2 + u3) = T(u1) + T(u2) + T(u3)

= 8u1 – u2 + u3

= 9v1 – 2v2 + v3

[ ] [ ]T P T PB B′−= =

−−

14 0 9

1 0 2

0 1 1

P =

1 1 1

0 1 1

0 0 1

P− =

− −− −

−

12 5 7

2 4 5

1 2 3


16. Let and solve for a, b, c, and d so that


a – b = 2a + c

a + 4b = 2b + d

c – d = a + 3c

c + 4d = b + 3c

which has solution b = –a, c = 0, and d = –a. Thus, for instance,

will work.

However, if we try the same procedure on the other pair of matrices, we find that a =3d, b = d, and c = 3d, so that det(P) = 3d2 – 3d2 = 0. Thus P is not invertible.

Alternatively, if there exists an invertible matrix P such that

then we have a matrix with zero determinant on the left, and one with nonzero determinanton the right. Why? Thus the two matrices cannot be similar.

17. Since

T ,

1

0

0

1

0

1

=

T

0

1

0

1

1

0

=−

, and T

0

0

1

=−

1

0

1

3 1

6 2

1 2

1 01

− −

−

−= P P

P P=−−

=

−−

−1 1

0 1

1 1

0 11and

1 1

1 4

2 1

1 3

1 1

1 41

−

=

−

−P P Por

=

2 1

1 3P

Pa b

c d=


we have

In fact, this result can be read directly from [T(X)]B.

18. We know that det(T) ≠ 0 if and only if the matrix of T relative to any basis B has nonzerodeterminant; that is, if and only if every such matrix is invertible. Choose a particular basisB for V and let V have dimension n. Then the matrix [T]

Brepresents T as a matrix

transformation from Rn to Rn. By Theorem 4.3.1, such a transformation is one-to-one if andonly if [T]

Bis invertible. Thus T is one-to-one if and only if det(T) ≠ 0.

19. (a) Recall that D (f + g) = (f(x) + g(x))′′ = f′′(x) + g′′(x) and D(cf) = (cf(x))′′ = cf ′′(x).

(b) Recall that D(f) = 0 if and only if f′(x) = a for some constant a if and only if f(x) = ax

+ b for constants a and b. Since the functions f(x) = x and g(x) = 1 are linearlyindependent, they form a basis for the kernel of D.

(c) Since D(f) = f(x)if and only if f′′(x) = f(x) if and only if f(x) = aex + be–x for a and barbitrary constants, the functions f(x) = ex and g(x) = e–x span the set of all suchfunctions. This is clearly a subspace of C2 (–∞, ∞) (Why?), and to show that it hasdimension 2, we need only check that ex and e–x are linearly independent functions.To this end, suppose that there exist constants c1 and c2 such that c1e

x + c2e–x = 0. If

we let x = 0 and x = 1, we obtain the equations c1 + c2 = 0 and c1e + c2e–1 = 0. These

imply that c1 = c2 = 0, so ex and e–x are linearly independent.

21. (a) We have

and

T kp x

kp x

kp x

kp x

k( ( ))

( )

( )

( )

=

=1

2

3

( )

( )

( )

( ( ))

p x

p x

p x

kT p x

1

2

3

=

T p x q x

p x q x

p x q x( ( ) ( ))

( ) ( )

( ) ( )+ =++

1 1

2 2

pp x q x

p x

p x

p x( ) ( )

( )

( )

( )3 3

1

2

3+

=

+

=( )

( )

( )

(

q x

q x

q x

T

1

2

3

pp x T q x( )) ( ( ))+

[ ]T B =−

−

1 1 1

0 1 0

1 0 1


(b) Since T is defined for quadratic polynomials only, and the numbers x1, x2, and x3 aredistinct, we can have p(x1) = p(x2) = p(x3) = 0 if and only if p is the zero polynomial.(Why?) Thus ker(T) = 0, so T is one-to-one.

(c) We have

T(a1P1(x) + a2P2(x) + a3P3(x)) = a1T(P1(x)) + a2T(P2(x)) + a3T(P3(x))

(d) From the above calculations, we see that the points must lie on the curve.

23. Since

then

where the above vectors all have n + 1 components. Thus the matrix of D with respect toB is

[ ( )]( , , )

( , , , , ) , ,D x

k

k k

kB =

==

0 0 0

0 0 1 2

if

if , n

↑

kth component

D xk

kx k n

k

k( )

, , ,=

=

=

−

0 0

1 21

if

if

=

+

+

a a a1 2 3

1

0

0

0

1

0

0

0

1

=

a

a

a

1

2

3


24. Call the basis B and the vectors v0, v1, … , vn. Notice that D(v

i) = v

i–1 for i = 1, … , n,while D(v0) = 0. That is,

where the above vectors all have n + 1 components. Thus the matrix of D with respect toB is

In fact, the differentiation operator maps Pn

to Pn–1.

25. Let Bn

and Bn+1 denote the bases for P

nand P

n+1, respectively. Since

J(xk) = xk+1

——k + 1

for k = 0, … , n

0 1 0 0 0

0 0 1 0 0

0 0 0 1 0

0 0 0 0 1

0 0 0 0 0

[ ( )]( , , )

( , , , , )D

k

kk Bv

if

if=

==

0 0 0

0 1 0 1,, , ,2 n

↑

kth componeent

0 1 0 0 0

0 0 2 0 0

0 0 0 3 0

0 0 0 0

0 0 0 0 0

n


we have

where [xk]Bn

= [0, … , 1, … , 0]T with the entry 1 as the (k + 1)st component out of a totalof n + 1 components. Thus the matrix of J with respect to B

n+1 is

with n + 2 rows and n + 1 columns.

0 0 0 0

1 0 0 0

0 1 2 0 0

0 0 1 3 0

0 0 0 1 1( )n +

[ ( )] , , , , (J x Bk

nk

n+ =+

+1 01

10 2 )components

↑

+k 2nnd component


EXERCISE SET 9.1

1. (a) The system is of the form y′ = Ay where

The eigenvalues of A are λ = 5 and λ = –1 and the corresponding eigenspaces arespanned by the vectors

respectively. Thus if we let

we have

Let y = Pu and hence y′ = Pu′. Then

or

u′1 = 5u1

u′2 = –u2

′−

u = u

5 0

0 1

D P AP= =−

−1 5 0

0 1

P =−

1 2

1 1

1

1

2

1

−

and

A =

1 4

2 3

439

Therefore

u1 = c1e5x

u2 = c2e–x

Thus the equation y = Pu is

or

y1 = c1e5x– 2c2e

–x

y2 = c1e5x + c2e

–x

1. (b) If y1(0) = y2(0) = , then

c1 – 2c2 = 0

c1 + c2 = 0

so that c1 = c2 = 0. Thus y1 = and y2 = 0.

3. (a) The system is of the form y′ = Ay where

The eigenvalues of A are λ = 1, λ = 2, and λ = 3 and the corresponding eigenspaces arespanned by the vectors


P

/

=− −

0 1 2 1

1 1 1

0 1 1

0

1

0

1 2

1

1

1

1

1

−

−

/

A = −−

4 0 1

2 1 0

2 0 1

y

y

c e

c e

cx

x

1

1

15

2

1 2

1 1

=

−

=−

115

2

15

2

2e c e

c e c e

x x

x x

−

+

−

−


then


so that

u′1 = u1

u′2 = 2u2

u′3 = 3u3

Therefore

u1 = c1ex

u2 = c2e2x

u3 = c3e3x


or

y1 = – 12c2e2x – c3e

3x

y2 = c1ex + c2e

2x + c3e3x

y3 = c2e2x + c3e

3x

y

y

y

1

2

3

=− −

/0 1 2 1

1 1 1

0 1 1

c e

c e

c e

x

x

x

1

22

33

′

u u

1 0 0

0 2 0

0 0 3

D P AP=

−11 0 0

0 2 0

0 0 3


Note: If we use

as basis vectors for the eigenspaces, then

and

y1 = –c2e2x + c

3e3x

y2 = c1ex + 2c2e

2x – c3e3x

y3 = 2c2e2x – c3e

3x

There are, of course, infinitely many other ways of writing the answer, depending uponwhat bases you choose for the eigenspaces. Since the numbers c1, c2, and c3 arearbitrary, the “different” answers do, in fact, represent the same functions.

3. (b) If we set x = 0, then the initial conditions imply that

– 12c2 – c3 = –1

c1 + c2 + c3 = 1

c2 + c3 = 0

or, equivalently, that c1 = 1, c2 = –2, and c3 = 2. If we had used the “different” solutionwe found in Part (a), then we would have found that c1 = 1, c2 = –1, and c3 = –2. Ineither case, when we substitute these values into the appropriate equations, we findthat

y1 = e2x – 2e3x

y2 = ex – 2e2x + 2e3x

y3 = –2e2x + 2e3x

P =−

−−

0 1 1

1 2 1

0 2 1

0

1

0

1

2

2

1

1

1

−

−−


5. Following the hint, let y = f(x) be a solution to y′ = ay, so that f′(x) = af(x). Now considerthe function g(x) = f(x)e–ax. Observe that

g′(x) = f′(x)e–ax – af(x)e–ax

= af(x)e–ax – af(x)e–ax

= 0

Thus g(x) must be a constant; say g(x) = c. Therefore,

f(x)e–ax = c

or

f(x) = ceax

That is, every solution of y′ = ay has the form y = ceax.

6. Suppose that y′ = Ay where A is diagonalizable. Then there is an invertible matrix P suchthat P–1 AP = D where D is a diagonal matrix whose diagonal entries are the eigenvalues ofα. If we let y = Pu so that y′ = Pu′, then y′ = Ay becomes Pu′ = APu or

u′ = P–1 APu

= Du

That is,

u1′ = λ

1u

1. .. .. .

u′n

= λnu

n

and hence

u1 = c1eλ1x

. .. .. .

un

= cneλnx


Therefore y = Pu can be written

y1 = p11c1eλ1x + p12c2e

λ2x + … + p1nc

neλnx

. . . .. . . .. . . .

yn

= pn1c1e

λ1x + pn2c2e

λ2x + … + pnn

cneλnx

where pij

is the ijth entry in the matrix P. That is, each of the functions yi

is a linearcombination of eλ1x, … , eλnx.

7. If y1 = y and y2 = y′, then y1′ = y2 and y2′ = y′′ = y′ + 6y = y2 + 6y1. That is,

y1′ = y2

y2′ = 6y1+ y2

or y′ = Ay where

The eigenvalues of A are λ = –2 and λ = 3 and the corresponding eigenspaces are spannedby the vectors


then


P AP− =

−

1 2 0

0 3

P =−

1 1

2 3

−

1

2and

1

3

A =

0 1

6 1


or

y1 = –c1e–2x + c2e

3x

y2 = 2c1e–2x + 3c2e

3x

Therefore

u1 = c1e–2x

u2 = c2e3x


or

y1 = –c1e–2x + c2e

3x

y2 = 2c1e–2x + 3c2e

3x

Note that y1′ = y2, as required, and, since y1 = y, then

y = –c1e–2x + c2e

3x

Since c1 and c2 are arbitrary, any answer of the form y = ae–2x + be3x is correct.

8. Letting y = y1, y′ = y2, y′′ = y3 leads to the system

y1′ = y2

y2′ = y3

y3′ = 64 – 11y2 + 6y3

y

y

c e

c e

x

x

1

2

12

23

1 1

2 3

=

−

−

′−

u u

2 0

0 3


or

The eigenvalues and corresponding eigenvectors of A are:

The solution to the system is:

Then y= y1 = c1et + c2e

2t + c3e3t .

9. If we let y1 = y, y2 = y′, and y3 = y′′, then we obtain the system

y1′′ = y2

y2′ = y3

y3′ = 6y1 –11y2 + 6y3

The associated matrix is therefore

y

y

y

1

2

3

1

1

1

=

+c e c1t

222t

33te c e

1

2

4

1

3

9

λ

λ

1 1

2 2

1

1

1

1

2

,

,

= =

=

X

X

,

=

= =

1

2

4

3

1

3

93 3λ X

′′′

=−

y

y

y

1

2

3

0 1 0

0 0 1

6 11 6

′ =y

y

y

AY

1

2

3

, or y


The eigenvalues of A are λ = 1, λ = 2, and λ = 3 and the corresponding eigenvectors are

The solution is, after some computation,

y = c1ex + c2e

2x + c3e3x

10. (b) Suppose that the coefficient matrix A of the system y = Ay is diagonalizable and let D= P–1 AP where D and P are as given in the problem. Let y = Pu and y′ = Pu′ so thatu′ = Du. Thus u

i′ = λ

iu

ifrom which it follows that u

i= c

ieλix for i = 1, … , n. Since y

= Pu, we have

which is the desired result.

11. Consider y′ = Ay, where , with aij

REAL. Solving the system

yields the quadratic equation

λ2 – (a11 + a22) λ + a11a22 – a21, a12 = 0, or

λ2 – (TrA)λ + det A = 0.

det ( )λλ

λI-A =

− −− −

=a a

a a

11 12

12 220

Aa a

a a=

11 12

21 22

y

c e

c e

c e

x

x

nnx

=

X X L X1 2 n

11

22

λ

λ

λ

1

1

1

1

2

4

1

3

9

, and

A =−

0 1 0

0 0 1

6 11 6


Let λ1, λ2 be the solutions of the characteristic equation. Using the quadratic formula yields

Now the solutions y1(t) and y2(t) to the system y′ = Ay will approach zero as t → ∞ ifand only if Re(λ1, λ2) < 0. (Both are < 0)

Case I: Tn2A – 4 det A < 0.

In this case Rn

(λ1) = Re

(λ2) = TrA2 . Thus y1(t), y2(t) → 0 if and only if TrA < 0.

Case II: Tr2A – 4 det A = 0. Then λ1 = λ2, Reλ1 = TrA/2, so y1, y2 → 0 (if and only if) ↔ TrA

< 0. Tr2A – 4 det A = 0. Then λ1, λ2, and Re(λ1, λ2) = TrA2 , so y1, y2 → 0 if and only if TrA

< 0.

Case III: Tr2A – 4 det A > 0. Then λ1, λ2 are both real.

Subcase 1: det A > 0.

If TrA > 0, then both (λ1, λ2) > 0, so y1, y2 0.

If TrA < 0, then both (λ1, λ2) < 0, so y1, y2 → 0. TrA = 0 is not possible in this case.

Subcase 2: det A < 0

If TrA > 0, then one root (say λ1) is positive, the other is negative, so y1, y2 0.

If TrA = 0, then again λ1 > 0, λ2 < 0, so y1, y2 0.

Subcase 3: det A = 0. Then λ1 = 0 or λ2 = 0, so y1, y2 0.→

→

→

Then Tn Tn A2A A− > ≥4 0det

→

Then Tn A > Tn2A A− >4 0det

λ1 2

2 4

2,det

=± −Tr A Tr A A


12. From

we get yθ2 = et.

Thus, consider the differential equation

y′ = y = rt.

Solving, y = c1et + tet

13. The system

can be put into the form

The eigenvalues and eigenvectors of A are:

Solving:

14. The system

can be put into the form

y

y=

2

1

y

y+ t1

1

21

1

2

1

2

11

2= Ay+f

′ = + +

′ = − +

−

y y y e

y y y e

k

k

1 1 2

2 1 2

y

yc e c e t

k k1

22

31

1

1

1

0

1

=

−

+

+

−,

+

−

13

23

/

/

λ λ1 1 211

13, ,= =

−

= =x x2

11

1

′′

=

+

y

y

y

yt

1

2

1

2

2 1

1 2

1

22

′ = + +′ = +

y y y t

y y y t

1 1 2

2 1 2

2

2 2_

′ = +′ =

y y y

y y

1 1 2

2 2


The eigenvalues and eigenvectors of A are

Solving

=

− +

+

−+

−y

yc e c e

t t1

2

22

21

1 2

1

1 2,

+

−−

−

−e te

t

t

t2

1

1

2

λ λ1 1 1 2 221

1 22

1

1 2= =

− +

= − =

−+

x x

, ,


EXERCISE SET 9.2

1. (a) Since T(x, y) = (–y, –x), the standard matrix is

(b)

(c) Since T(x, y) = (x, 0), the standard matrix is

(d)

3. (b) Since T(x, y, z) = (x, –y, z), the standard matrix is

0 1 0

1 0 0

0 0 1

−

A =

0 0

0 1

1 0

0 0

A =−

−

1 0

0 1

A =−

−

1 0

0 1

451

5. (a) This transformation leaves the z-coordinate of every point fixed. However it sends (1,0, 0) to (0, 1, 0) and (0, 1, 0) to (–1, 0, 0). The standard matrix is therefore

(c) This transformation leaves the y-coordinate of every point fixed. However it sends (1,0, 0) to (0, 0, –1) and (0, 0, 1) to (1, 0, 0). The standard matrix is therefore

12. (b) To reduce this matrix to the identity matrix, we add –2 times Row 1 to Row 2 and thenadd –4 times Row 2 to Row 1. These operations when performed on I yield

respectively. The inverses of the above two matrices are

respectively. Thus,

and therefore the transformation represents a shear by a factor of 4 in the x-directionfollowed by a shear by a factor of 2 in the y-direction.

(c) To reduce this matrix to the identity matrix, we multiply Row 1 by –1/2, multiply Row2 by 1/4, and interchange Rows 1 and 2. These operations when performed on I yield

−

1 2 0

0 1

1 0

0 1 4

0 1

1 0

/

/

1 4

2 9

1 0

2 1

1 4

0 1

=

1 4

0 1

1 4

0 1

and

1 0

2 1

1 4

0 1−

−

and

0 0 1

0 1 0

1 0 0−

0 1 0

1 0 0

0 0 1

−


respectively. The inverses of the above matrices are

respectively. Thus,

Therefore, the transformation represents a reflection about the line y = x followed byexpansions by factors of 4 and –2 in the y- and x- directions, respectively. Note that theorder in which the two expansion matrices occur is immaterial. However the position ofthe reflection matrix cannot be changed.

We return to the original matrix and note that we could, of course, interchange Rows 1and 2 first and then multiply Row 1 by 1/4 and Row 2 by –1/2. This yields the factorization

Here the order of the two expansion matrices can also be interchanged. Note that there aretwo other factorizations, for a grand total of six.

13. (a)

(c)

14. (a)

15. (b) The matrices which represent compressions along the x- and y- axes are

and , respectively, where 0 < k < 1. But

k k0

0 1

1 0

0 1

1

=

−/

1 0

0 k

k 0

0 1

−−

=

−

=

1 0

0 1

5 0

0 1

1 0

0 1

0 1

−−

5 0

−−

=

−−

1 0

0 1

0 1

1 0

0 1

1 0

1 0

0 5

1 2 0

0 1

1 2 0

0 5

=

/ /

0 2

4 0

0 1

1 0

4 0

0 1

1 0

0 2

−

=

−

0 2

4 0

2 0

0 1

1 0

0 4

0 1

1 0

−

=

−

−

2 0

0 1

1 0

0 4

0 1

1 0


and

Since 0 < k < 1 implies that 1/k > 1, the result follows.

15. (c) The matrices which represent reflections about the x- and y- axes are and

, respectively. Since these matrices are their own inverses, the

result follows.

16. Since , it follows that points (x′, y′) on the image line must satisfy the

equations

x = –2x′ + 3y′

y = –3x′ + 4y′

where y = –4x + 3. Hence –3x′ + 4y′ = 8x′ – 12y′ + 3, or 11x′ – 16y′ + 3 = 0. That is, theimage of y = –4x + 3 has the equation 11x – 16y + 3 = 0.

17. (a) The matrix which represents this shear is ; its inverse is . Thus,

points(x′, y′) on the image line must satisfy the equations

x = x – 3y′

y = y′

where y = 2x. Hence y′ = 2x′ – 6y′, or 2x′ – 7y′ = 0. That is, the equation of the imageline is 2x – 7y = 0.

Alternatively, we could note that the transformation leaves (0, 0) fixed and sends(1, 2) to (7, 2). Thus (0, 0) and (7, 2) determine the image line which has theequation 2x – 7y = 0.

(c) The reflection and its inverse are both represented by the matrix . Thus the

point (x′, y′) on the image line must satisfy the equations

x = y′

y = x′

where y = 2x. Hence x′ = 2y′, so the image line has the equation x – 2y = 0.

0 1

1 0

1 3

0 1

−

1 3

0 1

A− = −

−

1 2 3

3 4

−

1 0

0 1

1 0

0 1−

1 0

0

1 0

0 1

1

k k

=

−

/


(e) The rotation can be represented by the matrix . This sends the

origin to itself and the point (1, 2) to the point ((1 – 2 3 )/2, (2 + 3)/2). Since both

(0, 0) and (1, 2) lie on the line y = 2x, their images determine the image of the line

under the required rotation. Thus, the image line is represented by the equation (2 +

3)x + (2 3 – 1)y = 0.

Alternatively, we could find the inverse of the matrix, , and

proceed as we did in Parts (a) and (c).

18. Since the shear is represented by the matrix , it will send (2, 1) to (2 + k, k) and (3,

0) to (3, 0). The origin, of course, remains fixed. Thus we must find a value of k for whichthe vectors (2 + k, k) and (3, 0) are orthogonal; that is,

(2 + k, k) • (3, 0) = 3(2 + k) = 0

Clearly k = –2.

20. The equation of a line in the plane is Ax + By + C = 0 where not both A and B are zero.Moreover

where ad – bc ≠ 0. Thus (x, y) is transformed into (x′, y′) where

Therefore, the line Ax + By + C = is transformed into

or

dx by

ad bcB

cx ay

ad bc

−−

+ =− ′ + ′

−+ =

−−

+ =− +

C

da cB

ad bcB

bA a

0

BB

ad bcC

−+ = 0

xdx by

ad bc

ycx ay

ad bc

=′ − ′

−

=′ + ′

−

a b

c d ad bc

d b

c a

=

−−

−

−11

1

0 1

k

1 2 3 2

3 2 1 2

/ /

/ /−

1 2 3 2

3 2 1 2

/ /

/ /

−


This is the equation of a line provided dA – cB and –bA + aB are not both zero. But if bothnumbers are zero, then we have

bdA – bcB = 0

–bdA + adB = 0

or

(ad – bc) = 0

This implies that B = 0 since ad – bc ≠ 0. However, B = 0 implies that dA = bA = 0. Thuseither A = 0 or d = b = 0. But since ad – bc ≠ 0, b and d cannot both be zero. Hence A mustequal zero. Finally, since not both A and B can equal zero, then not both dA – cB and –bA

+ aB can equal zero.

21. We use the notation and the calculations of Exercise 20. If the line Ax + By + C = 0 passesthrough the origin, then C = 0, and the equation of the image line reduces to (dA – cB)x +(–bA + aB)y = 0. Thus it also must pass through the origin.

The two lines A1x + B1y + C1 = 0 and A2x + B2y + C2 = 0 are parallel if and only if A1B2 = A2B1. Their image lines are parallel if and only if

(dA1 – cB1)(–bA2 + aB2) = (dA2 – cB2)(–bA1 + aB1)

or

bcA2B1 + adA1B2 = bcA1B2 + adA2B1

or

(ad – bc)(A1B2 – A2B1) = 0

or

A1B2 – A2B1 = 0

Thus the image lines are parallel if and only if the given lines are parallel.

22. (a) Since x is fixed, but y and z are interchanged, we have

1 0 0

0 0 1

0 1 0


23. (a) The matrix which transforms (x, y, z) to (x + kz, y + kz, z) is

24. (a) A reflection about the x-axis will not change the length of any vector. Thus the onlypossible eigenvalues are 1 and –1. The only two unit vectors which are transformed tovectors with the same or opposite direction are the standard basis vectors, e1 and e2.The vector e1 goes to itself and e2 goes to –e2. Therefore e1 and e2 are linearlyindependent eigenvectors with corresponding eigenvalues 1 and –1.

This is easily checked using the matrix

with characteristic equation (λ – 1)(λ + 1) = 0. The eigenspaces will be the x and yaxes.

(c) This is similar to Part (a) above only the two lines through the origin which remainfixed will be y = x and y = –x. Thus 2 linearly independent eigenvectors will be (1, 1)and (–1, 1). The corresponding eigenvalues will be 1 and –1.

Again, this is easily checked using the matrix

with characteristic equation λ2 – 1 = 0.

(e) The only line through the origin which remains fixed when k ≠ 0 is the y-axis. Sincedistance along the y-axis is not changed, the only possible eigenvalue is λ = 1. Acorresponding eigenvector ise2 = (0, 1).

(f) The only angles through which we can rotate R2 and keep a line through the originfixed are integer multiples of 0, π, and 2π. Since rotations by integer multiples of 0 and2π leave the plane unchanged, we consider only rotations by odd multiples of π. Theseleave all lines through the origin fixed but reverse their directions. Thus such rotationshave –1 as an eigenvalue, and (1, 0) and (0, 1) form a basis for the eigenspace.

A =

0 1

1 0

A =−

1 0

0 1

1 0

0 1

0 0 1

k

k


EXERCISE SET 9.3

1. We have

Thus the desired line is y = –1/2 + (7/2)x.

3. Here

M =

1 2 4

1 3 9

1 5 25

1 6 36

a

b

T

=

1 0

1 1

1 2

1 0

1 1

1 2

=

−11 0

1 1

1 2

0

2

7

T

3 3

3 5

9

16

5

6

1

21

2

1

2

1

=

−

−

−

=−

/

9

16

1 2

7 2

459

and

Thus the desired quadratic is y = 2 + 5x – 3x2.

5. The two column vectors of M are linearly independent if and only if neither is a nonzeromultiple of the other. Since all of the entries in the first column are equal, the columns arelinearly independent if and only if the second column has at least two different entries, orif and only if at least two of the numbers x

iare distinct.

6. Since we have

M

x x x

x x x

m

n n nm

=

1

1

1 12

1

2

a

b

c

M M MTT

1

2

3

1

0

10

48

76

=−−−

−( )

=

4 16 74

16 74 376

74 376 2018

−−−

=

−1134

726

4026

221

10

62

5

3

2262

5

649

90

8

93

2

8

9

1

9

134

−−−−

=−

726

4026

2

5

3


the columns of M will be linearly independent if and only if a0 = a1 = … = am

= is the onlysolution to the equation

a0 [1 … 1]T + a1 [x1… x

n]T + a2 [x1

2 … xn2]T + … + a

m[xm

1… x

nm]T = [0 … 0]T

That is, if and only if the system of equations

has only the trivial solution for the numbers a0, … , am

. But if any of the numbers ai

≠ 0,then there are at least m + 1 distinct numbers x

iwhich must satisfy the equation

a0 + a1x + a2x2 + … + a

mxm = 0

This is a polynomial of degree at most m and can therefore have at most m distinct realroots. Thus, indeed, a0 = … = a

m= 0.

8. Let xi

denote the month, starting with January = 1, and yi

denote the sales for that month.Then we have

where

M M MT=

1 1 1

1 2 2

1 3 3

1 4 4

1 5 5

2

2

2

2

2

==

5 15 55

15 55 225

55 225 979

a

a

a

M M MT T

0

1

2

1

4 0

4 4

5 2

6 4

8 0

=

−( )

.

.

.

.

.

a a x a x a xmm

0 1 1 2 12

1 0+ + + + =

a a x a x a xn n m nm

0 1 22+ + + + = 00


so that the desired quadratic is y = 4 – (.2)x + (.2)x2. When x = 12, y = 30.4, so theprojected sales for December are $30,400.

( )M MT − =

−

− −

−

1

23

5

33

10

1

233

10

187

70

3

71

2

3

7

1

14

=and M yT

28 0

94 0

3

.

.

770 8.


EXERCISE SET 9.4

1. (a) Since f(x) = 1 + x, we have

Using Example 1 and some simple integration, we obtain

k = 1, 2, . . .

Thus, the least squares approximation to 1 + x on [0, 2π] by a trigonometric polynomialof order ≤2 is

1 + x (1 + π) – 2 sin x – sin 2x

2. (a) Since f(x) = x2,

Using Example 1 and integration by parts or a table of integrals, we find that

k = 1, 2, …

b x kx dxk k

xk sin( )( )

cos(= = − +1 2 22

2

πππ π

kkx dxk

) = −∫∫4

0

2

0

2 ππ

a x kx dxk

x kx dxk

k cos( ) sin( )= = − =1 2 42

π π 220

2

0

2 ππ∫∫

a x dx02 2

0

21 8

3= =∫π

ππ

b x kx dxk

k ( )sin( )= + = −∫1

12

0

2

ππ

a x kx dxk ( )cos( )= + =∫1

1 00

2

ππ

a x dx0 0

211 2 2( )= + = +∫π

ππ

463

Thus, the least squares approximation to x2 on [0, 2π] by a trigonometric polynomialof order at most 3 is

3. (a) The space W of continuous functions of the form a + bex over [0, 1] is spanned by thefunctions u1 = 1 and u2 = ex. First we use the Gram-Schmidt process to find anorthonormal basis v1, v2 for W.

Since ⟨f, g⟩ = f(x)g(x)dx, then u1 = 1 and hence

v1 = 1

Thus

v2 =

where α is the constant

Therefore the orthogonal projection of x on W is

projW

x x

x x xe e e e

xd

, ,= +− + − +

=

1 11 1

α α

xxe e x e e

dx

e e

x x

x

+− + − +

= +− +

∫ ∫1 1

1

2

1

0

1

0

1

α α

α

( )

( )

3

2

1

2

1

11

−

= +−

− +

= −

e

ee e

x

α

11

2

1

1+

−

ee

x

α ( )

(

/

= − + = − +

=

∫e e e e dxx x1 1 2

0

1 1 2

33 1

2

1 2− −

e e)( )/

e e

e e

e ex x

x x

x−

−=

− +( , )

( , )

1 1

1 1

1

α

0

1

∫

x x x x2 24

34 2

4

93 4cos cos cos sin π π+ + + − xx x xsin sin− −2

4

33π π


(b) The mean square error is

The answer above is deceptively short since a great many calculations are involved.

To shortcut some of the work, we derive a different expression for the mean squareerror (m.s.e.). By definition,

m.s.e. = [f(x) – g(x)]2 dx

= f – g2

= ⟨f – g, f – g⟩

= ⟨f, f – g⟩ – ⟨g, f – g⟩

Recall that g = projW

f, so that g and f – g are orthogonal (see Figure 3). Therefore,

m.s.e. = ⟨f, f – g⟩

= ⟨f, f⟩ – ⟨fg⟩

But g = ⟨f, v1⟩v1 + ⟨f, v2⟩v2, so that

(*) m.s.e. = ⟨f, f⟩ – ⟨f, v1⟩2 – ⟨f, v2⟩

2

Now back to the problem at hand. We know ⟨f, v1⟩ and ⟨f, v2⟩ from Part (a). Thus,in this case,

Clearly the formula (*) above can be generalized. If W is an n-dimensional space withorthonormal basis v1, v2, . . . , v

n, then

(**) m.s.e. = f2 – ⟨f, v1⟩2 – • • • – ⟨f, v

n⟩2

m.s.e. = −

−−

∫ x dx

e22

0

12

1

2

3

2α

== −−−

≈( )

.1

12

3

2 10014

e

e

xe

e dxe

e

x− − +−

= +

+−

(( )

1

2

1

1

13

12

1

2 1

2

== +−−∫

1

2

3

2 10

1 e

e( )


4. (a) The space W of polynomials of the form a0 + a1x over [0, 1] has the basis 1, x. TheGram-Schmidt process yields the orthonormal basis 1, ƒ3(2x – 1). Therefore, theorthogonal projection of ex on W is

(b) From Part (a) and (*) in the solution to 3(b), we have

5. (a) The space W of polynomials of the form a0 + a1x + a2x2 over [–1, 1] has the basis 1,

x, x2. Using the inner product ⟨u, v⟩ = u(x)v(x)dx and the Gram-Schmidt

process, we obtain the orthonormal basis

(See Exercise 29, Section 6.3.) Thus

Therefore,

sinππ

x x3

sin , sin( )

sin , s

π π

π

x x dx

x x

v

v

1 1

1

2

1

20

3

2

= =

=

−∫

iin( )

sin , ( )sin

ππ

π

x dx

x x

−∫ =

= −

1

1

32

2 3

2

1

2

5

23 1v (( )π x dx

−∫ =1

10

1

2

3

2

1

2

5

23 12, , ( )x x −

−∫ 1

1

m.s.e = − − − −

=− −

∫ e dx e e

e e

x2 2 2

0

11 3 3

3 7

( ) ( )

( )( 119

2004

).≈

projWx x x

e e e x x

e

= + − −

= − +

, , ( ) ( )

(

1 1 3 2 1 3 2 1

1 3 3 −− −

= −( ) + −( )e x

e e x

) ( )3 2 1

4 10 18 6


(b) From Part (a) and (**) in the solution to 3(b), we have

8. The Fourier series of π – x is

Thus the required Fourier series is

where

and

k = 1, 2, . . .

Thus the required Fourier series is . Notice that this is consistent with the result

of Example 1, Part (b).

9.

(note a slight correction to definition of f(x) as stated on pg. 466.)

Let ( ),

,f x

x

x=

≤ <≤ ≤

1 0

0 2

ππ π

sin2

1 kkx

k=

∞

∑

b x kx dxk

k = − =∫1 2

0

2

ππ

π( )sin( )

a x kx dxk = − =∫1

00

2

ππ

π( )cos( )

a x dx0 0

210= − =∫π

ππ

( )

aa kx b kxk k

k

0

12( cos sin )+ +

=

∞

∑

m.s.e. = − = − ≈−∫ sin ( ) .2

2 21

1 61

639π

π πx dx


Then

So Fourier Series is

10.

Thus, the Fourier Series for f(x) = sin 3x is sin 3x.

a x dx

a x kxk

0 0

213 0

13

sin

sin cos

= =

=

∫π

π

π

sin sin ,

dx

b x kx dx kk

=

= = ≠

∫ 0

13 0 3

0

2π

π 00

2

32

0

213

1

21 6

π

π

π

π

∫

∫=

= − =

a x dx

x dx

sin

cos 110

2π∫

1

2

11 1

1

+ − −

=

∞

∑ ( ) sink

kxk

kπ

a f x dx dx

a f x kxdxk

0 00

21 11

1

= = =

= =

∫∫π π

π

ππ( )

( )cos11

0

1

00

2

π

π

ππcos

( )sin

kxdx

k

b f x kxk

=

=

=

∫∫1, 2,

ddx kxdx

k

k

=

= − −

∫∫1

11 1

00

2

π

π

ππsin

( ) .


EXERCISE SET 9.5

1. (d) Since one of the terms in this expression is the product of 3 rather than 2 variables,the expression is not a quadratic form.

5. (b) The quadratic form can be written as

The characteristic equation of A is (λ – 7)(λ – 4) – 1/4 = 0 or

4λ2 – 44λ + 111 = 0

which gives us

If we solve for the eigenvectors, we find that for

x1 = (3 + 10)x2

and

x1 = (3 - 10)x2

Therefore the normalized eigenvectors are

3 10

20 6 10

1

20 6 10

3 10

20 6 1

+

+

+

−

−and 00

1

20 6 10−

λ =−11 10

2

λ =+11 10

2

λ =±11 10

2

[ ]x xx

xx Ax

T1 2

1

2

7 1 2

1 2 4

=

469

or, if we simplify,

Thus the maximum value of the given form with its constraint is

The minimum value is

6. (b) We write the quadratic form as

The characteristic equation of a is

λ3 – 4λ2 + 3λ = λ(λ – 3)(λ – 1) = 0

with roots λ = 0, 1, 3. The corresponding normalized eigenvectors are

respectively. Thus the maximum value of the given form with its constraint is

1 3

1 3

1 3

0

1 2

1 2

−

−

−

2 6

1 6

1 6

x x x

x

x

x

1 2 3

1

2

3

2 1 1

1 1 0

1 0 1

= x xTA

11 10

2

1

20 6 10

1

20 6 101 2

−=

−

+=

−at andx x

11 10

2

1

20 6 10

1

20 6 101 2

+=

−=

+at andx x

1

20 6 10

1

20 6 10

1

20 6 10

1

20

−

−

−

+

−

and

66 10


3 at x1 = 2/ 6, x2 = 1/ 6, x3 = 1/ 6

The minimum value is

0 at x1 = 1/ 3, x2 = –1/ 3, x3 = –1/ 3

7. (b) The eigenvalues of this matrix are the roots of the equation λ2 – 10λ + 24 = 0. Theyare λ = 6 and λ = 4 which are both positive. Therefore the matrix is positive definite.

8. (b) The principal submatrices are

Their determinants are 5 and 24, so the matrix is positive definite.

9. (b) The characteristic equation of this matrix is λ3 – 3λ + 2 = (λ + 1)2(λ – 2). Since twoof the eigenvalues are negative, the matrix is not positive definite.

11. (a) Since x12 + x2

2 > unless x1 = x2 =, the form is positive definite.

(c) Since (x1 ~ x2)2 ≥ 0, the form is positive semidefinite. It is not positive definite because

it can equal zero whenever x1 = x2 even when x1 = x2 ≠ 0.

(e) If |x1| > |x2|, then the form has a positive value, but if |x1| < |x2|, then the form has anegative value. Thus it is indefinite.

12. (a) The related quadratic form is 3x12 – 2x2

2 + x32 which is indefinite since its values can be

both positive and negative. Hence the matrix is indefinite.

(c) The related quadratic form is 6x12 + 9x2

2 + x32 + 14x1x2 + 2x1x3 + 4x2x3. The possible

signs of the values which this form can assume are not obvious and its characteristicequation is rather messy. However, the determinant of the matrix is zero while thedeterminants of the remaining principal submatrices are positive. Hence the matrix ispositive semidefinite.

55 1

1 5[ ] −

−

and


13. (a) By definition,

T(x + y) = (x + y)T A(x + y)

= (xT + yT)A(x + y)

= xTAx + xT Ay + yT Ax + yT A y

= T(x) + xT Ay + (xT A Ty)T + T(y)

= T(x) + xT αy + xT ATy + T(y)

(The transpose of a 1 × 1 matrix is itself.)

= T(x) + 2xT Ay + T(y)

(Assuming that A is symmetric, AT = A.)

(b) We have

T(kx) = (kx)T A(kx)

9 = k2xT Ax (Every term has a factor of k2.)

= k2T(x)

(c) The transformation is not linear because T(kx) ≠ kT(x) unless k = or 1 by Part (b).

14. (b) The symmetric matrix associated with the quadratic form is

The determinants of the principal submatrices are 5, 1, and k – 2. These are positiveprovided k > 2. Thus the quadratic form is positive definite if and only if k > 2.

15. If we expand the quadratic form, it becomes

c12x1

2 + c22x2

2 + … + cn2x

n2 + 2c1c2x1x2 + 2c1c3x1x3 +

… + 2c1cnx1xn

+ 2c2c3x2x3 + … + 2cn–1cnx

n–1xn

5 2 1

2 1 1

1 1

−−

− −

k


Thus we have

and the quadratic form is given by xT Ax where x = [x1 x2… x

n]T.

16. (a) We have

Thus sx2 = xT Ax where

(b) Since sx2 is 1/(n –1) times a sum of squares, it cannot be negative. In fact, it can only

be zero if (xi

– x) = 0 for i = 1, … , n, which can only be true in case x1 = x2 = … =x

n. If we let all the values of x

ibe equal but not zero, we have that s

x2 = while x ≠ 0.

Thus the form is positive semidefinite.

1 1

1

1

1

1

1

1

1

1 1

n n n n n n n

n n n n n

−−

−−

−−

−−

−−

( ) ( ) ( )

( ) (

11

1

1

1

1

1

1

1

1

1

) ( )

( ) ( ) ( )

−−

−−

−−

−−

n n

n n n n n n n

sn

x x x x x x nxn nx2

12

22 2

121

12=

−+ + − + +( ) +

=−

+ + − + +( ) + + +( )1

1

2 112 2

12

12

nx x

nx x

nx xn n n

=−

+ + − + + + +1

1

121

2 212 2

1 2n

x xn

x x x xn n ++( )( )

=−

−+ +( ) −

−x x

n

n

nx x

nx

n n

n

1

12 21

1

1 2 11 2 1x x xn n+ +( )

−

A

c c c c c c c

c c c c c c c

c c c c

n

n

=

12

1 2 1 3 1

1 2 22

1 3 2

1 3 2 33 32

3

1 2 32

c c c

c c c c c c c

n

n n n n


17. To show that λn

≤ xT Ax if x = 1, we use the equation from the proof dealing with λ1 andthe fact that λ

nis the smallest eigenvalue. This gives

xT Ax = ⟨x, Ax⟩ = λ1 ⟨x, v1⟩2 + λ2 ⟨x, v2⟩

2 + … + λn

⟨x, vn⟩2

≥ λn

⟨x, v1⟩2 + λ

n⟨x, v2⟩

2 + … + λn

⟨x, vn⟩2

= λn

(⟨x, v1⟩2 + … + ⟨x, v

n⟩2)

= λn

Now suppose that x is an eigenvector of A corresponding to λn. As in the proof dealing

with λ1, we have

xT Ax = ⟨x, Ax⟩ = ⟨x, λnx⟩ = λ

n⟨x, x⟩ = λ

nx2 = λ

n


EXERCISE SET 9.6

1. (a) The quadratic form xT Ax can be written as

The characteristic equation of A is λ2 – 4λ + 3 = 0. The eigenvalues are λ = 3 and λ =1. The corresponding eigenspaces are spanned by the vectors

respectively. These vectors are orthogonal. If we normalize them, we can use the resultto obtain a matrix P such that the substitution x = Py or

will eliminate the cross-product term. This yields the new quadratic form

or 3y12 + y2

2.

2. (a) The quadratic form can be written as xT Ax where

xT = [x1 x2 x3]

y yy

y1 2

1

2

3 0

0 1

x

x

y

y

1

2

1

2

1 2 1 2

1 2 1 2

=

−

1

1

1

1−

and

x xx

x1 2

1

2

2 1

1 2

−−

475

and

The characteristic equation of A is

λ3 – 12λ2 + 39λ – 28 = (λ – 7)(λ – 4)(λ – 1) = 0

The eigenspaces corresponding to λ = 7, λ = 4, and λ = 1 are spanned by the vectors

respectively. If we normalize these vectors, we obtain the matrix

By direct calculation, we have

so the substitution x = Py will yield the quadratic form

yT By or 7y12 + 4y2

2 + y32

7. (a) If we complete the squares, then the equation 9x2 + 4y2 – 36x – 24y + 36 = 0 becomes

9(x2 – 4x + 4) + 4(y2 – 6y + 9) = –36 + 9(4) + 4(9)

or

9(x – 2)2 + 4(y – 3)2 = 36

P AP BT =

=7 0 0

0 4 0

0 0 1

P = −− −

1 3 2 3 2 3

2 3 1 3 2 3

2 3 2 3 1 3

1

2

2

2

1

2

2

2

−

−−11

A = −−

3 2 0

2 4 2

0 2 5


or

This is an ellipse.

(c) If we complete the square, then y2 – 8x – 14y + 49 = becomes

y2 – 14y + 49 = 8x

or

(y – 7)2 = 8x

This is the parabola (y′)2 = 8x′.

(e) If we complete the squares, then 2x2 – 3y2 + 6x + 2y = –41 becomes

or

or

12(x′)2 – 18(y′)2 = –419

8. (a) The matrix form for the conic 2x2 – 4xy – y2 + 8 = 0 is

(*) xT A x + 8 = 0

where

A =−

− −

2 2

2 1

23

23

10

3

419

6

2 2

x y+

− −

= −

2 39

43

20

3

100

941

92 2x x y y+ +

− − +

= − +22

100

3−

′( )+

′( )=

x y2 2

4 91


The eigenvalues of A are λ1 = 3 and λ2 = –2 and the eigenspaces are spanned by thevectors

respectively. Normalizing these vectors yields

We choose

Note that det(P) = 1. (Interchanging columns yields a matrix with determinant –1.) Ifwe substitute x = Px′ into (*), we have

(Px′)T A(Px′) + 8 = 0

or

(x′)T (PT AP)x′ + 8 = 0

Since

this yields the hyperbola

–2(x′)2 + 3(y′)2 + 8 = 0

Note that if we had let

P APT =

−

2 0

0 3

P =

−

1

5

2

52

5

1

5

−

2

51

5

1

52

5

and

−

2

1

1

2and


then det(P) = 1 and the rotation would have produced the diagonal matrix

and hence the hyperbola

3(x′)2 – 2(y′)2 + 8 = 0

That is, there are two different rotations which will put the hyperbola into the twodifferent standard positions.

8. (b) The matrix form for the conic 5x2 + 4xy + 5y2 = 9 is

xT Ax = 9

where

The eigenvalues of A are λ1 = 3 and λ2 = 7 and the eigenspaces are spanned by thevectors

If we normalize these vectors, we obtain

−

1

21

2

1

21

2

and

−

1

1

1

1and

A =

5 2

2 5

P APT =

−

3 0

0 2

P =−

2

5

1

51

5

2

5


Hence we can let

Note that det(P) = 1. Thus if we let x = Px′, then we have

(x′)T (PT AP)x′ = 9

where

This yields the ellipse,

7(x′)2 + 3(y′)2 = 9

Had we let

then we would have obtained

3(x)2 + 7(y′) 2 = 9

which is the same ellipse rotated 90°.

9. The matrix form for the conic 9x2 – 4xy + 6y2 – 10x – 20y = 5 is

xT Ax + Kx = 5

where

and K = [ – 10 – 20]A =−

−

9 2

2 6

P =

−

1

2

1

21

2

1

2

P APT =

7 0

0 3

P =

−

1

2

1

21

2

1

5


The eigenvalues of A are λ1 = 5 and λ2 = 1 and the eigenspaces are spanned by the vectors

Thus we can let

Note that det(P) = 1. If we let x = Px′, then

(x′)T (PT AP)x′ + KPx′ = 5

where

Thus we have the equation

5(x′)2 + 10(y′)2 – 10 5x′ = 5

If we complete the square, we obtain the equation

5(x′)2 – 2 5x′ + 5 + 10 (y) 2 = 5 + 25

or

(x′′)2 + 2(y′′)2 = 6

where x′′ = x′ – 5 and y′′ = y′. This is the ellipse

′′( )+

′′( )=

x y2 2

6 31

P AP KPT =

= −

5 0

0 1010 5 0and

P =

−

1

5

2

52

5

1

5

1

2

2

1

−

and


Of course we could also rotate to obtain the same ellipse in the form 2(x′′)2 + (y′′)2 = 6,which is just the other standard position.

11. The matrix form for the conic 2x2 – 4xy – y2 – 4x – 8y = –14 is

xT Ax + Kx = –14

where

The eigenvalues of A are λ1 = 3, λ2 = –2 and the eigenspaces are spanned by the vectors

Thus we can let


(x′)T (PT AP)x′ + KPx′ = –14

where


3(x′)2 – 2(y′)2 – 4 5y′ = –14

If we complete the square, then we obtain

P AP KPT =

−

= −

3 0

0 20 4 5and

P =−

2

5

1

51

5

2

5

−

2

1

1

2and

A K=−

− −

= − −[ ]2 2

2 14 8and


3(x′)2 – 2((y′)2 + 2 5y′ + 5)= –14 – 10

or

3(x′′)2 – 2(y′′)2 = –24

where x′′ = x′ and y′′ = y′ + 5. This is the hyperbola

We could also rotate to obtain the same hyperbola in the form 2(x′′)2 – 3(y′′)2 = 24.

14. The matrix form for the conic 4x2 – 20xy + 25y2 – 15x – 6y = 0 is

xT Ax + Kx = 0

where

The eigenvalues of A are λ1 = 29 and λ2 = 0 and the eigenspaces are spanned by the vectors

Thus we can let


(x′)T (PT AP)x′ + KPx′ = 0

P =−

2

29

5

295

29

2

29

−

2

5

5

2and

A K=−

−

= − −[ ]4 10

10 2515 6and

′′( )−

′′( )=

x y2 2

12 81


where


which is the parobola . The other standard positions for this parabola are

represented by the equations

15. (a) The equation x2 – y2 = 0 can be written as (x – y)(x + y) = 0. Thus it represents thetwo intersecting lines x ± y = 0.

(b) The equation x2 + 3y2 + 7 = 0 can be written as x2 + 3y2 = –7. Since the left side ofthis equation cannot be negative, then there are no points (x, y) which satisfy theequation.

(c) If 8x2 + 7y2 = 0, then x = y = 0. Thus the graph consists ofthe single point (0, 0).

(d) This equation can be rewritten as (x – y)2 = 0. Thus it represents the single line y =x.

(e) The equation 9x2 + 12xy + 4y2 – 52 = 0 can be written as (3x + 2y)2 = 52 or 3x + 2y

= ± 52. Thus its graph is the two parallel lines 3x + 2y ± 2 13 = 0.

(f) The equation x2 + y2 – 2x – 4y = –5 can be written as x2 – 2x + 1 + y2 – 4y + 4 = 0 or(x – 1)2 + (y – 2)2 = 0. Thus it represents the point (1, 2).

and′( ) = − ′ ′( ) = ± ′x y y x2 23

29

3

29

2′( ) =x33

29′y

2987

290

2′( ) − ′ =x y

P AP KPT =

= −

29 0

0 00 87 29and


EXERCISE SET 9.7

5. (a) ellipse 36x2 + 9y2 = 32

(b) ellipse 2x2 + 6y2 = 21

(c) hyperbola 6x2 – –3y2 = 8

(d) ellipse 9x2 + 4y2 = 1

(e) ellipse 16x2 + y2 = 16

(f) hyperbola 3y2 – 7x2 = 1

(g) circle x2 + y2 = 24

6. (a)

(b)

(c)

(d)

−

=9 0 0

0 4 0

0 0 1

0[ ]x y z

x

y

z

6 0 0

0 3 0

0 0 2

6[ ]x y z

x

y

z

−−

− == 0

[[ ]x y z

x

y

z

2 0 0

0 6 0

0 0 3

18

−

− = 00

[ ]x y z

x

y

z

36 0 0

0 9 0

0 0 4

36 0

− =

485

(e)

(f)

(g)

7. (a) If we complete the squares, the quadratic becomes

9(x2 – 2x + 1) + 36(y2 – 4y + 4) + 4(x2 – 6z + 9)

= –153 + 9 + 144 + 36

or

9(x – 1) 2 + 36(y – 2) 2 + 4z – 3) 2 = 36

or

This is an ellipsoid.

(c) If we complete the square, the quadratic becomes

3(x2 + 14z + 49) – 3y2 – z2 = 144 + 147

or

3(x + 7) 2 – 3y2 – z2 = 3

( ) ( ) ( )′+

′+

′=

x y z2 2 2

4 1 91

x y[ zz

x

y

z

]

1 0 0

0 1 0

0 0 1

25 0

− =

7 0 0

0 3 0

0 0 1

−

x y z[ ]

+

=x

y

z

x

y

z

0 0 1 0

[[ ]x y z

x

y

z

16 0 0

0 1 0

0 0 0

0 0

+ −66 0

=

x

y

z


or

This is a hyperboloid of two sheets.

7. (e) If we complete the squares, the quadric becomes

(x2 + 2x + 1) + 16(y2 – 2y + 1) – 16z = 15 + 1 + 16

or

(x + 1)2 + 16(y – 1)2 – 16(z + 2) = 0

or

This is an elliptic paraboloid.

(g) If we complete the squares, the quadric becomes

(x2 – 2x + 1) + (y2 + 4y + 4) + (z2 – 6z + 9) = 11 + 1 + 4 + 9

or

(x – 1)2 + (y + 2)2 + (z – 3)2 = 25

or

This is a sphere.

( ) ( ) ( )′+

′+

′=

x y z2 2 2

1 1 31

( ) ( ) ( )′+

′+

′=

x y z2 2 2

4 1 91

( ) ( ) ( )′+

′+

′=

x y z2 2 2

1 1 31


8. (a) The matrix form for the quadric is xT Ax + 150 = 0 where

The eigenvalues of A are λ1 = –25, λ2 = 3, and λ3 = 50. The corresponding eigenspacesare spanned by the orthogonal vectors

Thus we can let x = Px′ where

Note that det(P) = 1. (This is the only reason for actually calculating P in thisproblem.) Using the above transformation, we obtain the quadric

–25(x′)2 + 3(y′)2 + 50(z′)2 + 150 = 0

or

25(x′)2 – 3(y′)2 – 50(z′)2 – 150 = 0

This is a hyperboloid of two sheets.

(c) The matrix form for the quadric is xT Ax + Kx = where

The eigenvalues of A are λ1 = 225, λ2 = 100, and λ3 = 0. The eigenspaces are spannedby the vectors

4

0

3

0

1

0

3

0

4−

A K=−

−

= − −144 0 108

0 100 0

108 0 81

540 0and 7720

P P APT=

−

=−4 5 0 3 5

0 1 0

3 5 0 4 5

25 0 0

0 3 0

0

and

00 50

4

0

3

0

1

0

3

0

4−

A =

2 0 36

0 3 0

36 0 23


Thus we can let x = Px′ where

Note that det(P) = 1. Hence the quadric becomes

xT (PT AP)x + KPx′ = 0

or

225(x′)2 + 100(y′)2 – 900z′ = 0

This is the elliptic paraboloid

9(x′)2 + 4(y′)2 – 36z′ = 0

9. The matrix form for the quadric is xT Ax + Kx = –9 where

The eigenvalues of A are λ1 = λ2 = –1 and λ3 = 2, and the vectors

span the corresponding eigenspaces. Note that u1 • u3 = u2 • u3 = 0 but that u1 • u2 ≠ 0.Hence, we must apply the Gram-Schmidt process to u1, u2. We must also normalize u3.This gives the orthonormal set

−

−

−

1

2

0

1

2

1

65

61

6

−

1

31

31

3

u

-1

0

0

u and u1 1 1

1

1

0

1

1

1

=

=−

=

A K=

= − − −

0 1 1

1 0 1

1 1 0

6 6 4and

P P APT=

−

=4 5 0 3 5

0 1 0

3 5 0 4 5

225 0 0

0 100and 00

0 0 0


Thus we can let

Note that det(P) = 1,

Therefore the transformation x = Px′ reduces the quadric to

If we complete the squares, this becomes

Letting and yie′′ = ′ + ′′ = ′ + ′′ = ′ −x x y y z z1

2

1

16

4

3, , llds

This is the hyperb

( ) ( ) ( )′′ + ′′ − ′′ = −x y z2 2 22 1

ooloid of two sheets

( ) ( ) ( )′′−

′′−

′′=

z x y2 2 2

1 2 1 11

( ) ( )

(

′ + ′ +

+ ′ + ′ +

− ′

x x y y

z

2 221

2

2

6

1

6

2 ))2 8

3

16

39

1

2

1

6

32

3− ′ +

= + + −z

− ′ − ′ + ′ − ′ − ′ ′ = −( ) ( ) ( ) –x y z x y z2 2 22 2

2

6

16

39

P AP KPT =

= −− −

-1 0 0

0 -1 0

0 0 2

and 22

6

16

3

P =

−

− −

1

2

1

6

1

3

02

6

1

31

2

1

6

1

3


11. The matrix form for the quadric is xT Ax + Kx – 31 = where

The eigenvalues of A are λ1 = 1, λ2 = –1, and λ3 = 0, and the corresponding eigenspaces arespanned by the orthogonal vectors

Thus, we let x = Px where

Note that det(P) = 1,

Therefore, the equation of the quadric is reduced to

(x′)2 – (y′)2 + 2√2x′ + 8√2y′ + z′ – 31 = 0

If we complete the squares, this becomes

[(x′)2 + 2√2x′ + 2] – [(y′)2 – 8√2y′ + 32] + z′ = 31 + 2 –32

Letting x′′ = x′ + √2, y′′ = y′ – 4√2, and z′′ = z′ – 1 yields

(x′′)2 – (y′′)2 + z′′ = 0

This is a hyperbolic paraboloid.

P AP KPT = −

=

1 0 0

0 1 0

0 0 0

2 2 8 2 1and

P =

−

1

2

1

20

1

2

1

20

0 0 1

1

1

0

1

1

0

0

0

1

−

A K=

= −

0 1 0

1 0 0

0 0 0

6 10 1and


13. We know that the equation of a general quadric Q can be put into the standard matrix formxT Ax + Kx + j = where

Since a is a symmetric matrix, then A is orthogonally diagonalizable by Theorem 7.3.1.Thus, by Theorem 7.2.1, A has 3 linearly independent eigenvectors. Now let T be the matrixwhose column vectors are the 3 linearly independent eigenvectors of A. It follows from theproof of Theorem 7.2.1 and the discussion immediately following that theorem, that T–1 AT

will be a diagonal matrix whose diagonal entries are the eigenvalues λ1, λ2, and λ3 of A.Theorem 7.3.2 guarantees that these eigenvalues are real.

As noted immediately after Theorem 7.3.2, we can, if necessary, transform the matrix Tto a matrix S whose column vectors form an orthonormal set. To do this, orthonormalize thebasis of each eigenspace before using its elements as column vectors of S.

Furthermore, by Theorem 6.5.1, we know that S is orthogonal. It follows from Theorem6.5 2 that det(S) = ±1.

In case det(S) = –1, we interchange two columns in S to obtain a matrix P such thatdet(P) = 1. If det(S) = 1, we let P = S. Thus, P represents a rotation. Note that P isorthogonal, so that P–1 = PT, and also, that P orthogonally diagonalizes a. In fact,

Hence, if we let x = Px′, then the equation of the quadric Q becomes

(x′)T (PT AP)x′ + KPx′ + j = 0

or

λ1(x′)2 + λ2(y′)2 + λ3(z′)2 + g′x′ + h′y′ + i′z′ + j = 0

where

[g′ h′ i′] = KP

Thus we have proved Theorem 9.7.1.

P APT =

λλ

λ

1

2

3

0 0

0 0

0 0

A

a d e

d b f

e f c

K g h i=

= and


EXERCISE SET 9.8

1. If AB = C where A is m × n, B is n × p, and C is m × p, then C has mp entries, each of theform

cij

= ai1b1j

+ ai2b2j

+ … + ain

bnj

Thus we need n multiplications and n – 1 additions to compute each of the numbers cij.

Therefore we need mnp multiplications and m(n – 1)p additions to compute C.

2. If A is an n × n matrix, then for any other n × n matrix, B, it will take (by Exercise 1) n3

multiplications and n2(n – 1) additions to compute AB. To compute Ak, we must carry outk – 1 of these matrix multiplications, one to compute a2, a second to compute a3 = A(A2),and in all, k – 1 to compute Ak = A(Ak – 1). Therefore we must carry out a total of n3(k –1) multiplications and n2(n – 1)(k – 1) additions to compute the product Ak.

5. Following the hint, we have

Sn

= 1 + 2 + 3 + … + n

Sn

= n + (n – 1) + (n – 2) + … + 1

or

2Sn

= (n + 1) + (n + 1) + (n + 1) + … + (n + 1)

Thus

Sn n

n =+( )1

2

493

7. (a) By direct computation,

(k + 1)3 – k3 = k3 + 3k2 + 3k + 1 – k3 = 3k2 + 3k + 1

(b) The sum “telescopes”. That is,

[23 – 13] + [33 – 23] + [43 – 33] + … + [(n + 1)3 – n3]

= 23 – 13 + 33 – 23 + 43 – 33 + … + (n + 1)3 – n3

= (n + 1)3 – 1

(c) By Parts (a) and (b), we have

3(1)2 + 3(1) + 1 + 3(2)2 + 3(2) + 1 + 3(3)2 + 3(3) + 1 + … + 3n2 + 3n + 1

= 3(12 + 22 + 32 + … + n2) + 3(1 + 2 + 3 + … + n) + n

= (n + 1)3 – 1

(d) Thus, by Part (c) and exercise 6, we have

1 2 31

31 1 3

1

22 2 2 2 3+ + + + = + − −

+−

=

n nn n

n( )( )

(nn n n n

n n n n

+− −

+−

=+ − − + −

1

3

1

3

1

2 3

2 1 2 3 1 2

6

3

3

) ( )

( ) ( )

==+ + − −

=+ + + − −

( )[ ( ) ]

( )( )

n n n

n n n n

1 2 1 3 2

6

1 2 4 2 3 2

2

2

66

1 2

61 2 1

6

2

=+ +

=+ +

( )( )

( )( )

n n n

n n n


9. Since R is a row-echelon form of an invertible n × n matrix, it has ones down the maindiagonal and nothing but zeros below. If, as usual, we let x = [x1 x2

… xn]T and b = [b1 b2

… bn]T, then we have x

n= b

nwith no computations. However, since x

n–1 = bn–1 – cx

nfor

some number c, it will require one multiplication and one addition to find xn–1. In general,

xi

= bi

– some linear combination of xi+1, xi+2, …, x

n

Therefore it will require two multiplications and two additions to find xn–2, three of each to

find xn–3, and finally, n–1 of each to find x1. That is, it will require

multiplications and the same number of additions to solve the system by back substitution.

10. To reduce an invertible n × n matrix to In, we first divide Row 1 by a11 (which we are

assuming is not zero since no row interchanges are required). This requires n – 1multiplications because we can ignore the first row, first column position which must beone. We then subtract a

i1 times Row 1 from Row i for i = 2, 3, … , n to reduce the firstcolumn to that of I

n. This requires (n – 1)2 multiplications and the same number of

additions because we can ignore those entries which we know must be zero. Thus we haveused n – 1 + (n – 1)2 = n(n – 1) multiplications and (n – 1)2 additions.

We then repeat the process on the (n – 1) × (n – 1) submatrix obtained from theoriginal by deleting the first row and the first column. This will require (n – 1)(n – 2)multiplicaitons and (n – 2)2 additions.

We continue in this fashion until we have reduced every diagonal entry to one and everyentry below the diagonal to zero. The number of multiplications required so far is

n n n n n n

n n

( ) ( )( ) ( )( )− + − − + − − + + • + •

= −

1 1 2 2 3 3 2 2 12 ++ − − − + − − − + − + −

= +

( ) ( ) ( ) ( )

(

n n n n

n

1 1 2 2 3 3 2 22 2 2 2

2nn n n

n n

− + + +

− + − + + +

=+

1 3 2 1 3 2

1

2 2 2) ( )

( )(( ) ( )2 1

61

1

21

3

3

n n n

n n

+− −

+−

=−

1 2 3 11

1+ + + + − =

−( )

( )n

n n


and the number of additions is

Finally, we must reduce the entries above the main diagonal to zero. We start with thenth column. We multiply Row n by a

infor i = 1, 2, … , n – 1 and subtract the result from

Row i. This would appear to involve n – 1 multiplications and the same number of additionssince we can ignore everything but the entries in the last column. However, the result is aforegone conclusion. In fact, once we have reduced the matrix to row-echelon form, thefinal reduction is automatic. Thus the above formulas give all the multiplications andadditions required.

11. To solve a linear system whose coefficient matrix is an invertible n × n matrix, A, we formthe n × (n + 1) matrix [A|b] and reduce A to I

n. Thus we first divide Row 1 by a11, using n

multiplications (ignoring the multiplication whose result must be one and assuming thata11 ≠ 0 since no row interchanges are required). We then subtract a

i1 times Row 1 fromRow i for i = 2, … , n to reduce the first column to that of I

n. This requires n(n – 1)

multiplications and the same number of additions (again ignoring the operations whoseresults we already know). The total number of multiplications so far is n2 and the totalnumber of additions is n(n – 1).

To reduce the second column to that of In, we repeat the procedure, starting with Row

2 and ignoring Column 1. Thus n – 1 multiplications assure us that there is a one on themain diagonal, and (n – 1)2 multiplications and additions will make all n – 1 of theremaining column entries zero. This requires n(n – 1) new multiplications and (n – 1)2

new additions.

In general, to reduce Column i to the ith column of In, we require n + 1 – i

multiplications followed by (n + 1 – i)(n – 1) multiplications and additions, for a total ofn(n + 1 – i) multiplications and (n + 1 – i)(n – 1) additions.

If we add up all these numbers, we find that we need

n n n n n n n n n n

n n

2

2

1 2 2 1 1+ − + − + + + = + −

=

( ) ( ) ( ) ( ) ( ( )

( ++

= +

1

2

2 2

3 2

)

n n

( ) ( )( )( ) )

n nn n n

n n

− + − + + + =− −

= −

1 2 2 11 2 1

6

3

2 2 2 2

3 2

22 6+

n


multiplications and

additions to compute the reduction.

12.

Since we have

lim li

lim ( ) ,

( )

x

xk

k

x

P x

a x

→+∞

→+∞

=

=

1 0

mm

lim

x

kk

kk

xk

kk

a a x a x

a x

a

a x

a

a x

→+∞

→+∞

+ + +

= +

0 1

0 1kk

k

k

a

a x−−+ +

= + + + +

=

11

0 0 0 1

1

n n n n n n n

n

( ) ( ) ( )( ) ( ) ( )

(

− + − + − − + + − + −

= −

1 1 2 1 2 1 12

11 1 2 1

1 1

2

2 2

3

)( ( ) )

( )( )( )

n n

n n n

n n

+ − + + +

=− +

= −


EXERCISE SET 9.9

1 The system in matrix form is

This reduces to two matrix equations

and

The second matrix equation yields the system

3y1 = 0

–2y1 + y2 = 1

which has y1 = 0, y2 = 1 as its solution. If we substitute these values into the first matrixequation, we obtain the system

x1 – 2x2 = 0

x2 = 1

This yields the final solution x1 = 2, x2 = 1.

3 0

2 1

0

11

2−

=

y

y

1 2

0 11

2

1

2

−

=

x

x

y

y

3 6

2 5

3 0

2 1

1 2

0 11

2

−− −

=

−

−

x

x

=

x

x

1

2

0

1

499

3. To reduce the matrix of coefficients to a suitable upper triangular matrix, we carry out thefollowing operations:

These operations involve multipliers 1/2, 1, and 1/3. Thus the corresponding lowertriangular matrix is

We therefore have the two matrix equations

and


2y1 = –2

–y1 + 3y2 = –2

which has y1 = –1, y2 = –1 as its solution. If we substitute these values into the first matrixequation, we obtain the system

x1 + 4x2 = –1

x2 = –1

This yields the final solution x1 = 3, x2 = –1.

2 0

1 3

2

21

2−

=

−−

y

y

1 4

0 11

2

1

2

=

x

x

y

y

L =−

2 0

1 3

2 8

1 1

1 4

1 1

1 4

0 3

1 4

0 1− −

→

− −

→

→

= u


5. To reduce the matrix of coefficients to a suitable upper triangular matrix, we carry out thefollowing operations:

These operations involve multipliers of 1/2, 0 (for the 2 row), 1, –1/2, –4, and 1/5. Thus thecorresponding lower triangular matrix is

We therefore have the matrix equations

and


2y1 = –4

–2y2 = –2

–y1+ 4y2 + 5y3= 6

2 0 0

0 2 0

1 4 5

4

2

6

1

2

3

−−

=−−

y

y

y

1 1 1

0 1 1

0 0 1

1

2

3

1

2

− −−

=x

x

x

y

y

yy3

L = −−

2 0 0

0 2 0

1 4 5

2 2 2

0 2 2

1 5 2

1 1 1

0 2 2

1 5 2

− −−

−

→− −−

−

→− −−

→

− −−

1 1 1

0 2 2

0 4 1

1 1 1

0 1 1

0 4 1

→− −

−

→− −

−1 1 1

0 1 1

0 0 5

1 1 1

0 1 1

0 00 1

= U


which has y1 = –2, y2 = 1 and y3 = 0 as its solution. If we substitute these values into thefirst matrix equation, we obtain the system

x1 – x2 – x3 = –2

x2 – x3 = 1

x3 = 0

This yields the final solution x1 = –1, x2 = 1, x3 = 0.

11. (a) To reduce A to row-echelon form, we carry out the following operations:

This involves multipliers 1/2, 2, -2, 1 (for the 2 diagonal entry), and –1. Where nomultiplier is needed in the second entry of the last row, we use the multiplier 1, thusobtaining the lower triangular matrix

In fact, if we compute LU, we see that it will equal A no matter what entry we choosefor the lower right-hand corner of L.

If we stop just before we reach row-echelon form, we obtain the matrices

which will also serve.

U L=−

= −

1 1 2 1 2

0 0 1

0 0 1

2 0 0

2 1 0

2 0 1

L = −

2 0 0

2 1 0

2 1 1

2 1 1

2 1 2

2 1 0

1 1 2 1 2

2 1 2

2 1 0

−− −

→−

− −

→−

→−

1 1 2 1 2

0 0 1

2 1 0

1 1 2 1 2

0 0 1

0 0 1

→−

=1 1 2 1 2

0 0 1

0 0 0

U


(b) We have that A = LU where

If we let

then A = L1DU as desired. (See the matrices at the very end of Section 9.9.)

(c) Let U2 = DU and note that this matrix is upper triangular. Then A = L1U2 is of therequired form.

12. (a) Suppose that

This implies that ad = 0 but that ae = bd = 1. That is, if the matrix has an LU-decomposition, then we must have 4 numbers which satisfy the above equations. Nowif ad = 0, then either a or d must be 0. However, this cannot be true if ae = bd = 1.Thus the matrix has no such decomposition.

(b)

13. (a) If A has such an LU-decomposition, we can write it as

a b

c d w

x y

z

x y

wx yw z

=

=

+

1 0

1 0

A P L U

P L

=

=

=

=

0 1

1 0

0 1

1 0

1 0

0 1

,

,where

,

0 1

1 0

0

0

=

=

+a

b c

d e

f

ad ae

bd be cf

L D1

1 0 0

1 1 0

1 1 1

2 0 0

0 1 0

0 1 1

= −

=

and

L U= −

=−

2 0 0

2 1 0

2 1 1

1 1 2 1 2

0 0 1

0 0 1



x = a y = b wx = c yw + z = d

Since a ≠ 0, this has the unique solution

x = a y = b w = c/a z = (ad – bc)/a

The uniqueness of the solution guarantees the uniqueness of the LU-decomposition.

13. (b) By the above,

15. We have that L = E1–1 E2

–1 … Ek–1 where each of the matrices E

iis an elementary matrix

which does not involve interchanging rows. By Exercise 27 of Section 2.4, we know that ifE is an invertible lower triangular matrix, then E–1 is also lower triangular. Now the matricesE

iare all lower triangular and invertible by their construction. Therefore for i = 1, … , k we

have that Ei–1 is lower triangular. Hence L, as the product of lower triangular matrices, must

also be lower triangular.

17. Let A be any n × n matrix. We know that a can be reduced to row-echelon form and thatthis may require row interchanges. If we perform these interchanges (if any) first, wereduce A to the matrix

Ek

… E1A = B

where Ei

is the elementary matrix corresponding to the ith such interchange. Now we knowthat B has an LU-decomposition, call it LU where U is a row-echelon form of A. That is,

Ek

… E1A = LU

where each of the matrices Ei

is elementary and hence invertible. (In fact, Ei–1 = E

ifor all

Ei. Why?) If we let

P = (Ek

… E1)–1 = E1

–1 … Ek–1 if k > 0

and P = I if no row interchanges are required, then we have A = PLU as desired.

a b

c d c a

a b

ad bc a

=

−( )

1 0

1 0


18. First we interchange Row 2 and Row 3 of A to obtain the equation

Next we find an LU-decomposition of B as follows

This gives us

so that B = EA = LU or A = E–1LU. Since

we have A = ELU, or

A =

−1 0 0

0 0 1

0 1 0

3 0 0

0 2 0

3 0 1

1 1 33 0

0 1 1 2

0 0 1

E E− =

=11 0 0

0 0 1

0 1 0

L =

3 0 0

0 2 0

3 0 1

B →−

−

→−

1 1 3 0

0 2 1

3 1 1

1 1 3 0

0 2 1

0 0 1

→−

=1 1 3 0

0 1 1 2

0 0 1

U

B =−

−

=

3 1 0

0 2 1

3 1 1

1 0 0

0 0 1

0 1 0

33 1 0

3 1 1

0 2 1

−−

= EA


19. Assume A = PLU, where P is a permutation matrix. Then note P–1 = P. To solve AX = B,where A = PLU, set C = P–1B = PB and Y = UX.

First solve LY = C for Y.

Then solve UX = Y for X.

To solve

3 1 0

3 1 1

0 2 1

0

1

0

1

2

3

−−

=

x

x

x

=

=

, ,or AX

set

e

C Pe

2

2

1 0 0

0 0 1

0 1 0

=

0

1

0

0

0

1

Solve LY ==

C

y

y

y

, or =

3 0 0

0 2 0

3 0 1

1

2

3

00

0

1

0

0

1

1

2

3

, to get =

y

y

y

=−

Solve orUX Y ,

1 1 3 0

0 1 1 2

0 0 1

x

x

x

x

x

x

1

2

3

1

2

3

0

0

1

=

so

=−

1 6

1 2

1

If thenA A P L U=−−

=3 1 0

3 1 1

0 2 1

, ,

wwith P L=

=

1 0 0

0 0 1

0 1 0

3 0 0

0 2 0

3 0 1

,

=−

, U

1 1 3 0

0 1 1 2

0 0 1


EXERCISE SET 10.1

3. (b) Since two complex numbers are equal if and only if both their real and imaginary partsare equal, we have

x + y = 3

and

x – y = 1

Thus x = 2 and y = 1.

4. (a) z1 + z2 = 1 – 2i + 4 + 5i = 5 + 3i

(c) 4z1 = 4(1 – 2i) = 4 – 8i

(e) 3z1 + 4z2 = 3(1 – 2i) + 4(4 + 5i)

= 3 – 6i + 16 + 20i

= 19 + 14i

5. (a) Since complex numbers obey all the usual rules of algebra, we have

z = 3 + 2i – (1 – i) = 2 + 3i

(c) Since (i – z) + (2z – 3i) = –2 + 7i, we have

i + (–z + 2z) – 3i = –2 + 7i

or

z = –2 + 7i – i + 3i = –2 + 9i

507

6. (a) z1 + z2 = 4 + 5i and z1 – z2 = 2 – 3i

7. (b) –2z = 6 + 8i

8. (a) We have k1i + k2(1 + i) = k2 + (k1 + k2)i = 3 – 2i. Hence, k2 = 3 and k1 + k2 = – 2or k1 = –5.


9. (c)

10. (a) First, z1z2 = (2 – 5i)(–1 – i) = –7 + 3i. Thus,

z1 – z1z2 = (2 – 5i) – (–7 + 3i) = 9 – 8i

(c) Since 1 + z2 = –i, then z1 + (1 + z2) = 2 – 6i. Thus,

[z1 + (1 + z2)]2 = (2 – 6i)2 = 22(1 – 3i)2 = 4(–8 – 6i) = –32 – 24i

11. Since (4 – 6i)2 = 22(2 – 3i)2 = 4(–5 – 12i) = –4(5 + 12i), then

(1 + 2i)(4 – 6i)2 = –4(1 + 2i)(5 + 12i) = –4(–19 + 22i) = 76 – 88i

13. Since (1 – 3i)2 = –8 – 6i = –2(4 + 3i), then

(1 – 3i)3 = –2(1 – 3i)(4 + 3i) = –2(13 – 9i)

15. Since then( ) ,

( )

21

2

3

4

1

42

21

2

+ +

= +

+ +

i i i

i33

4

1

42

63

16

2 2

i i i

= +

= − +

z z i i i i i1 21

62 4 1 5

2

61 2 1 5

1

31 3= + − = + − = − +( )( ) ( )( ) ( 110

11

34

91 2

4

93 4

1

41 5

12 2

22

)

( ) ( )

(

= −

= + = − +

= −

i

z i i

z ii i i) ( )2 1

424 10 6

5

2= − − = − −


17. Since i2 = –1 and i3 = –i, then 1 + i + i2 + i3 = 0. Thus (1 + i + i2 + i3)100 = 0.

19. (a)

(d)

Hence

20. (a)

(c) We have

Hence

( )CA Bi i

i i

2 6 1

6 5 9=

+− − −

CAi i

i i=

− − −+ − +

=

−6 1

6 5 9

1 0and B2

00 1−

A BC

i

i

i i

i i

i( ) =

+−+ −

+ −3 2 0

2

1 1

5 4 11

3 −− −

=+ − + − −++ −

2 5

13 13 8 12 33 22

1 0

7 9

i

i i i

i i

i 66 6 16 16+ − −

i i

B Ai i

i i

2 2 9 12 2

18 2 13− =

+ +− +

Ai

i

i i

i i

2 2 4

4 10

11 12 6

18 6 23=

−

=

+ +− +

and B2

A iBi

i

i i

i i

i

+ =−

+

− ++

=+

31

3

6 3 6

3 9 12

1 6 −− ++ +

3 7

3 8 3 12

i

i i


21. (a) Let z = x + iy. Then

Im(iz) = Im[i(x + iy)] = Im(–y + ix) = x

= Re(x + iy) = Re(z)

22. (a) We have

Let z1 = –1 + i and z2 = – 1 – i. Then

z12 + 2z1 + 2 = (–1 + i)2 + 2(–1 + i) + 2

= –2i – 2 + 2i + 2

= 0

Thus, z1 is a root of z2 + 2z + 2. The verification that z2 is also a root is similar.

23. (a) We know that i1 = i, i2 = –1, i3 = –i, and i4 = 1. We also know that im + n = imin

and imn = (im)n where m and n are positive integers. The proof can be broken intofour cases:

1. n = 1, 5, 9, … or n = 4k + 1

2. n = 2, 6, 10, … or n = 4k + 2

3. n = 3, 7, 11, … or n = 4k + 3

4. n = 4, 8, 12, … or n = 4k + 4

where k = 0, 1, 2, . . . . In each case, in = i4k+ for some integer between 1 and 4.Thus

in = i4ki = (i4)k i = 1 ki = i


(b) Since 2509 = 4 ⋅ 627 + 1, Case 1 of Part (a) applies, and hence i2509 = i.

z z z i2 2 2 02 4 8

21+ + = = − ± − = − ±⇔


24. Let z1 = x1 + iy1 and z2 = x2 + iy2 and observe that

z1z2 = 0 ⇔ x1x2 = y1y2 and x1 y2 = –x2y1

If we suppose that x1 ≠ 0, then we have

This implies that

But since – (y1/x1)2 ≤ 0, the above equation can hold if and only if y2 = 0. Recall that

x2 = (y1y2)/x1; then y2 = 0 implies that x2 = 0 and hence that z2 = 0. Thus if x1 ≠ 0,then z2 = 0. A similar argument shows that if y1 ≠ 0, then z2 = 0. Therefore, if z1 ≠ 0,then z2 = 0. Similarly, if z2 ≠ 0, then z1 = 0.

25. Observe that zz1 = zz2 ⇔ zz1 – zz2 = 0 ⇔ z(z1 – z2) = 0. Since z ≠ 0 by hypothesis, itfollows from Exercise 24 that z1 – z2 = 0, i.e., that z1 = z2.

26. (a) Let z1 = x1 + iy1 and z2 = x2 + iy2. Then

z1 + z2 = (x1 + iy1) + (x2 + iy2)

= (x1 + x2) + i(y1 + y2)

= (x2 + x1) + i(y2 + y1)

= (x2 + iy2) + (x1 + iy2)

= z2 + z1

yy

xy x2

1

1

2

2= −

xy y

xy

x y

x21 2

12

2 1

1= =

−and


27. (a) Let z1 = x1 + iy1 and z2 = x2 + iy2. Then

z1z2= (x1 + iy1)(x2 + iy2)

= (x1x2 – y1y2) + i(x1y2 + x2y1)

= (x2x1 – y2y1) + i(y2x1 + y1x2)

= (x2 + iy2)(x1 + iy1)

= z2z1

30. They lie on a circle of radius one, centered at the origin.

If z1 = a1b1i, and z2 = a2 + b2i, where a12 + b1

2 = 1, a22 + b2

2 = 1, thenz1z2 = (a1a2 – b1b2) + i(a1b2 + a2b1).

Note that (a1a2 – b1b2)2 + (a1b2 + a2b1)

2

= a12a2

2 – 2a1a2b1b2 + b12b2

2 + a12b2

2 + 2a1a2b1b2 + a22b1

2

= a12a2

2 + b12b2

2 + a12b2

2 + a22b1

2

= (a12 + b1

2)(a22 + b2

2) = 1.

Hence z1z2 also lies on the unit circle.


EXERCISE SET 10.2

2. (a) Since i = 0 + 1i, then

(c)

(e)

3. (a) We have

z–z = (2 – 4i)(2 + 4i) = 20

On the other hand,

z2 = 22 + (–4)2 = 20

(b) We have

z–z = (–3 + 5i)(–3 – 5i) = 34

On the other hand,

z2 = (–3)2 + 52 = 34

4. (a) From Equation (5), we have

z

z

i

i

i i i1

22 2

1 5

3 4

1 5 3 4

3 4

17 19

25= −

+= − −

+= − −( )( )

− = − + = − + =8 8 0 8 0 82 2i ( )

− − = − + − = =3 4 3 4 25 52 2i ( ) ( )

i = + =0 1 12 2

515

ch10.2.qxd 9/30/05 1:29 PM

4. (c) Again using Equation (5), we have

(e) Since z2 = 5, we have

5. (a) Equation (5) with z1 = 1 and z2 = i yields

(c)

6. (a) Since

then

(c) Since

and

then

ziz

zi i i1

2 1

2

23

5

1

5

3

5

11

5− = − − −

= +

z i21 2=

iz

z

i i

i

i

i

i i

i

1

2

1

1 2

1

1 2

1 1 2

1 2= +

−= − +

−= − + +

−( ) ( )( )

( ))( )1 2

3

5

1

5+= − −

ii

zz

zi i i1

1

2

11

5

3

5

6

5

2

5− = + − − +

= +( )

z

z

i

i

i ii1

2

1

1 2

1 1 2

5

1

5

3

5= +

−= + + = − +( )( )

1 7 7

17

z i

ii=

−= =

( )

1 1

1i

ii=

−= −

( )

z

z

ii

1

2

1 5

5

1

5=

−= −

z

z

i

i

i i i1

22 2

1 5

3 4

1 5 3 4

3 4

23 11

25=

−

−=

− +

+ −=

−( )( )

( )


ch10.2.qxd 9/30/05 1:29 PM

7. Equation (5) with z1 = i and z2 = 1 + i gives

9. Since (3 + 4i)2 = –7 + 24i, we have

11. Since

13. We have

and

(1 – 2i) (1 + 2i) = 5

Thus

i

i i i

ii

( )( )( )1 1 2 1 2 5

1

10

1

10

12

12

− − +=

− += − +

i

i

i ii

1

1

2

1

2

1

2−= + = − +( )

3

3

3

4

2 2 3

4

1

2

3

2

3

1 3

2+−

=+

=+

= +

+− −

i

i

i ii

i

i

( )

( )(

then

ii

i

i

i i

)=

+

−=

+( ) +( )=

−+

+

12

32

12

32

1

1

2

1 3

4

1 3

4 i

1

3 4

7 24

7 24

7 24

62 2 2( ) ( ) ( )+=

− −− + −

=− −

i

i i

225

i

i

i ii

1

1

2

1

2

1

2+= − = +( )


ch10.2.qxd 9/30/05 1:29 PM

15. (a) If iz = 2 – i, then

16. (a)

(c) The result follows easily from the fact that This result holdswhenever z ≠ i.

17. (a) The set of points satisfying the

equation z = 2 is the set of all

points representing vectors of length 2.

Thus, it is a circle of radius 2 and center

at the origin.

Analytically, if z = x + iy, then

which is the equation of the above circle.

(c) The values of z which satisfy the equation z – i = z + i are just those z whosedistance from the point i is equal to their distance from the point –i. Geometrically,then, z can be any point on the real axis.

We now show this result analytically. Let z = x + iy. Then

z – i = z + i ⇔ z –i2 = z + i2

⇔ x + i(y – 1)2 = x + i(y + 1)2

⇔ x2 + (y – 1)2 = x2 + (y + 1)2

⇔ – 2y = 2y

⇔ y = 0

z x y

x y

= ⇔ + =

⇔ + =

2 2

4

2 2

2 2

i z i z z i+ = + = − .

z i z i z i+ = + = −5 5 5

zi

i

i ii= − = − − = − −2 2

11 2

( )( )


ch10.2.qxd 9/30/05 1:29 PM

18. (a) This inequality represents the set of

all points whose distance from the

point –i is at most 1.

Analytically let

Thus, the inequality represents the set of all points in or on the circle with center at

–i and radius 1.

(c) Since

this inequality represents the set

of all points whose distance from

the point 2i is less than 1/2.

Analytically, if z = x + iy, then

19. (a) Re(iz–) = Re(i

–z–) = Re[(–i)(x – iy)] = Re(–y – ix) = –y

(c) Re(iz–) = Re[i(x – iy)] = Re(y + ix) = y

20. (a) Since 1/i = –i and (–i)n = (–1)n(i)n, this problem is a variation on Exercise 23 ofSection 10.1.

(b) Since 2509 = 4(627) + 1 and (i)4 = 1, then (i)2509 = i and therefore (1/i)2509 =(–1)2509 • i = –i.

2 4 1 2 4 1

2 2 4 1

2

2

2 2

2 2

z i z i

x y

x y

− < ⇔ − <

⇔ + − <

⇔ + − <

( ) ( )

( )11

4

2 4 1 21

2z i z i− < ⇔ − < ,

z x iy

z i z i

x y

= +

+ ≤ ⇔ + ≤

⇔ + + ≤

.

( )

Then

1 1

1 1

2

2 2


ch10.2.qxd 9/30/05 1:30 PM

21. (a) Let z = x + iy. Then

22. Let z = x + iy. Then

23. (a) Equation (5) gives

Thus

25.

26. Let z1 = x1 + iy1 and z2 = x2 + iy2.

(a) z z x iy x iy

x x i y y

x

1 2 1 1 2 2

1 2 1 2

− = + − +

= − + −=

( ) ( )

( ) ( )

( 11 2 1 2

1 1 2 2

1 2

− − −= − − −= −

x i y y

x iy x iy

z z

) ( )

( ) ( )

z x y x y z= + = + − =2 2 2 2( )

Rez

z

x x y y

x y

1

2

1 2 1 2

22

22

=

+

+

z

z z

z z

x yx iy x iy

x

1

2 22 1 2

22

22 1 1 2 2

22

1

1

1

=

=+

+ −

=

( ) ( )

+++ + −

yx x y y i x y x y

22 1 2 1 2 2 1 1 2( ) ( )

z z x iy x iy y z= ⇔ + = − ⇔ = ⇔0 is real.

1

2

1

2

1

22( ) ( ) ( ) ( ) Re( )z z x iy x iy x x z+ = + + − = = =


ch10.2.qxd 9/30/05 1:30 PM

(c)

27. (a)

(b) We use mathematical induction. In Part (a), we verified that the result holds whenn = 2. Now, assume that (z

–)n = zn—. Then

and the result is proved.

28. Let A denote the matrix of the system, i.e.,

Thus det(A) = i – (–2i) = 3i. Hence

and

xi

i

i ii

ii2

21

3

2

2

1

34

1=−

= +( ) = = −

xi

i

i ii

ii1

21

3

2

1

1

32

1=− −

= − +( ) = − =

Ai i

=−

2 1

( ) ( )z z z

z z

z

n n

n

n

+

+

=

=

=

1

1

z zz z z z2 2= = = ( )

( / )

( )

/ (

z z

z

z z

z

z z

z z

1 2

22 1 2

22 1 2

1 2

1

1

=

=

=

• Part (b)

PPart (d))


ch10.2.qxd 9/30/05 1:30 PM

30. Let A denote the matrix of the system, i.e.,

Then det(A) = –4 and hence

32.

Therefore x1 = –(1 + i)t and x2 = t where t is arbitrary.

− − −− + −

→

+− + −

→

+1 1

1 2

1 1

1 2

1 1

0 0

i

i

i

i

i

x i i

x i

1

2

1

4

3 1 1

2 2 1 1

1 1 1

1

2

1

4

1 3 1

1 2 2 1

1

= − + −− −

= +

= − + −−−

=

= +− −

= −

1 1

2

1

4

1 1 3

1 1 2 2

1 1 1

1

23x i i

A = −−

1 1 1

1 1 1

1 1 1


ch10.2.qxd 9/30/05 1:30 PM

34.

If we let x3 = t, where t is arbitrary, then x2 = –it and x1 = –(1 – i)t.

35. (a)

It is easy to verify that AA–1 = A–1 A = I.

36. From Exercise 27(b), we have that . Since ak

is real, it follows that

Thus,

Hence, p z p z( ) ( ) .= ⇒ = =0 0 0

p z a a z a z a z

a a z a z a

nn( ) ( ) ( )= + + + +

= + + + +

0 1 22

0 1 22

nnn

nn

z

a a z a z a z

p z

= + + + +

=

0 1 22

( )

. . .

. . .

. . .

a z a zkk

kk( ) .=

( )z zn n=

Ai

i

i

i

i

− =+ −

=

−

1

2

1

2

2

1

2

1

1

1 1 2

2 1 2 3

1

0 1

0 1

i i

i i

i i

i i

i

i

−− −

− + −

→−

− −

→

−

→−1

0 1

0 0 0

1 0 1

0 1

0 0

i i

i

i

i

00


ch10.2.qxd 9/30/05 1:30 PM

39. (a)

Thus

A

i i i

i

i i

− =− − − − +

−

12 2 1

1 2

1

1 1 0 1 0 0

0 1 0 1 0

1 2 2 0 0 1

1 1 0 1 0 0

0

+

− −

+

i

i

i i

i

11 0 1 0

0 2 0 1

1 1 0 1 0 0

0 1 0 1 0

0 0 1 1

i

i i

i

i

i i

−

+

+−

1 1 0 1 0 0

0 1 0 1 2

0 0 1 1

1

i

i

i i

00 0 2 2 1

0 1 0 1 2

0 0 1 1

− − − − +−

i i i

i

i i


R R iR3 3 1→ +

R R iR3 3 2→ +

R R iR2 2 3→ −

R R i R2 1 21→ − +( )

ch10.2.qxd 9/30/05 1:30 PM

40. We have

and

Thus, z – 1 = z– – 1.Since z and z– are symmetric about the real axis, they are, in fact, equidistant from any

point on that axis.

41. (a) We have , which is just the distance between the two

numbers z1 and z2 when they are considered as points in the complex plane.

(b) Let z1 = 12, z2 = 6 + 2i, and z3 = 8 + 8i. Then

z1 – z22 = 62 + (–2)2 = 40

z1 – z32 = 42 + (–8)2 = 80

z2 – z32 = (–2)2 + (–6)2 = 40

Since the sum of the squares of the lengths of two sides is equal to the square of thethird side, the three points determine a right triangle.

z z a a b b1 2 1 22

1 22− = − + −( ) ( )

z z z z z z z− = − − = − − = − −1 1 1 1 1 1 12

( )( ) ( )( ) ( )( )

z z z z z− = − − = − −1 1 1 1 12

( )( ) ( )( )


ch10.2.qxd 9/30/05 1:30 PM

ch10.2.qxd 9/30/05 1:30 PM

EXERCISE SET 10.3

1. (a) If z = 1, then arg z = 2kπ where k = 0, ±1, ±2,. . .. Thus, Arg(1) = 0.

(c) If z = –i, then arg z = + 2kπ where k = 0, ±1, ±2,. . .. Thus, Arg(–i) = –π/2.

(e) If then arg z = + 2kπ where k = 0, ±1, ±2,. . .. Thus, Arg

2. We have

(a) Put k = 0 in the above equation.

(b) Put k = –1 in the above equation.

3. (a) Since 2i = 2 and Arg(2i) = π/2, we have

(c) Since and Arg(5 + 5i) = π/4, we have

5 5 5 24 4

cos sin+ =

+

i i

π π

5 5 50 5 2+ = =i

2 22 2

i icos sin=

+

π π

arg( ) , ,1 35

32 0 2− = + = ±i k k

π π K

( ) .− + =1 32

3i

π

2

3

πz i,= − +1 3

3

2

π

527

ch10.3.qxd 9/30/05 2:25 PM

3. (e) since we have

4. We have z1 = 2, Arg(z1) = , z2 = 3, and Arg(z2) = .

(a) Here z1z2 = z1z2 = 6 and Arg(z) = Arg(z1) + Arg(z2) = . Hence

(c) Here z2/z1 = z2/z1 = 3/2 and Arg(z2/z1) = Arg(z2) – Arg(z1) = – . Hence

5. We have z1 = 1, Arg(z1) = , z2 = 2, Arg(z2) = – , z3 = 2, and Arg(z3) = . So

and

Therefore

= cos(0) + i sin(0) = 1

6. (a) We have θ = , and n = 12. Thus

(1 + i)12 = 26[cos(3π) + i sin(3π)] = –64

π4

r ,= 2

z z

z

1 2

3

Arg Arg( Arg Arg11 2 3

z z

zz z z2

3

0

= + − =) ( ) ( )

z z

z

z z

z

1 2

3

1 2

3

1= =

π6

π3

π2

z

zi2

1

3

2 12 12cos sin= −

+ −

π π

π12

z z i1 2 65

12

5

12=

+

cos sin

π π

5

12

π

π6

π4

− − = −

+ −

3 3 3 2

3

4

3

4i icos sin

π π

− − = = − − −3 3 18 3 2 3 33

4i iand Arg( ) =

π,


ch10.3.qxd 9/30/05 2:25 PM

(c) We have r = 2, θ = , and n = 7. Thus

7. We use Formula (10).

(a) We have r = 1, θ = – , and n = 2. Thus

Thus, the two square roots of –i are:

(c) We have r = 27, θ = π, and n = 3. Thus

( ) cos sin− = +

+ +

27 33

2

3 3

2

31 2 π π π πk

ik

=k 0 1 2, ,

cos sin

cos

−

+ −

= −

π π

π4 4

1

2

1

2

3

4

i i

+

= − +i isin3

4

1

2

1

2

π

( ) cos sin ,− = − +

+ − +

=i k i k k1 2

4 40

π π π π 11

π2

3 27

6

7

6

2

7 7+( ) =

+

=

i icos sinπ π

77 3

2

1

264 3 64− −

= − −i i

π6


ch10.3.qxd 9/30/05 2:25 PM

Therefore, the three cube roots of –27 are:

7. (e) Here r = 1, θ = π, and n = 4. Thus

Therefore the four fourth roots of –1 are:

cos sin

cos sin

π π

π π4 4

1

2

1

23

4

3

4

+ = +

+ =

i i

i

cos sin

cos

− +

+ = − −

1

2

1

25

4

5

4

1

2

1

27

i

i iπ π

ππ π4

7

4

1

2

1

2sin+ = −i i

( ) cos sin/− = +

+ +

14 2 4 2

1 4 π π π πki

k

=k 0 1 2 3, , ,

33 3

3

2

3 3cos sin

π π

+

= +i

22

3 3

35

3

i

icos( ) sin( )

cos

π π

π

+ = −

+ ii isin5

3

3

2

3 3

2

π

= −


ch10.3.qxd 9/30/05 2:25 PM

9. We observe that w = 1 is one

sixth root of 1. Since the

remaining 5 must be equally

spaced around the unit circle, any

two roots must be separated

from one another by an angle of

= 60°. We show all six

sixth roots in the diagram.

11. We have z4 = 16 ⇔ z = 161/4. The fourth roots of 16 are 2, 2i, –2, and –2i.

12. All solutions to z4 + 8 = 0 are:

z i i

z

13 4 3 4

2

22

2

2

22

4 4= + +

= +

cos sinπ π

== − +

= +

22

2

2

22

3

4

3

43 4 3 4

i i

z

cos sinπ π

333 4 3 42

2

2

2

22

5

4

5

4= − −

= +

i icos sinπ π

zz i i43 4 3 42

2

2

2

22

7

4

7

4= + −

= +

cos sinπ π

2

6 3

π π=


ch10.3.qxd 9/30/05 2:25 PM

Note: z4 = , z3 = .

Then (z – z1)(z – z4) = z2 – (z1 + )z + z1 = z2 – 25/4z + 23/2

and (z – z2)(z – z3) = z2 – (z2 + )z + z2 = z2 + 25/4z + 23/2

Then z4 + 8 = (z2 – 25/4z + 23/2)(z2 + 25/4z + 23/2).

Since z4 + 8 has exactly four zeros, it follows that

z4 + 8 = (z – z1)(z – z2)(z – z3)(z – z4)

But

(z – z1)(z – z4) = z2 + 25/4z + 23/2

and

(z – z2)(z – z3) = z2 + 25/4z + 23/2

Thus,

z4 + 8 =

14. (a) We have θ = , and n = 8. Thus

(1 + i)8 = 24[cos(2π) + i sin(2π)] = 16

15. (a) Since

z = 3eiπ = 3[cos(π) + i sin(π)] = –3

then Re(z) = –3 and Im(z) = 0.

(c) Since

then and hence Re(z) = 0 and .

16. The values of z1/n given in Formula (10) are all of the form

zk

= r1/n [cos θk

+ i sin θk]

Im( )z = − 2z i= − 2

z e i ii= =

+

=2 2

2 222π π π

cos sin

π4

r ,= 2

z z z z2 5 4 3 2 2 3 22 2 2 2− +( ) + +( )5 4

z2z2

z1z1

z2z1


ch10.3.qxd 9/30/05 2:25 PM

where

Thus, two roots, z and zm

, can be equal if and only if cos θ + i sin θ = cos θm

+ i sin θm

.This can occur if and only if cos θ = cos θ

mand sin θ = sin θ

m, which, in turn, can occur

if and only if θ and θm

differ by an integral multiple of 2π. But

Thus we have:

(a) If ≠ m and and m are integers between 0 and n – 1, then 0 <

cannot be an integral multiple of 2π and hence the two resulting values

of z1/n are different.

(b) If, say, is not in the range 0 to n – 1, then we can write in the form

= nk + m

for some integer m between 0 and n – 1 and for some integer k ≠ 0. Thus

which guarantees that z = zm

.

17. Case 1. Suppose that n = 0. Then

(cos θ + i sin θ)n = 1 = cos(0) + i sin(0)

So Formula (7) is valid if n = 0.

θ θ π π

− =

−( ) ( ) =m

m

nk2 2( )

m

n2

( - )( )π

− <m

n1 so that

θ θ π π π − = − = −m

n

m

n

m

n

2 22

( )( )

θ θ πk

n

k

n= +

2


ch10.3.qxd 9/30/05 2:25 PM

Case 2. In order to verify that Formula (7) holds if n is a negative integer, we first letn = – 1. Then

Thus, Formula (7) is valid if n = –1.

Now suppose that n is a positive integer (and hence that –n is a negative integer).Then


18. From

Substitute x = iθ to get

Rearranging, to get

or

eiθ = cos θ + i sin θ

e iiθ θ θ θ θ= − + −

+ −1

2 4 3

2 4 3

! ! !…

! !+ − +

θ θ5 7

5 7…

e ii iiθ θ θ θ θ θ= + − − + + −1

2 3 4 5

2 3 4 5

! ! ! !

θθ θ θ6 7 8

6 7 8! ! !− + +i …

e xx x xx = + + + + +12 3 4

2 3 4

! ! !…

(cos sin ) cos sin,

[cos

θ θ θ θ+ = +( )

= −

− −i i

nn

1

θθ θ

θ θ

( )+ −( )= − + −

i

n i n

nsin ]

cos( ) sin( ) [By Formulla (7)]

(cos sin )cos sin

cos sin

cos(

θ θθ θ

θ θ

+ =+

= −

= −

−i

i

i

1 1

θθ θ) sin( )+ −i


ch10.3.qxd 9/30/05 2:25 PM

19. We have and . But (see Exercise 17)

If we replace z2 by 1/z2 in Formula (3), we obtain

which is Formula (5).

20. The first equation, when multiplied out, becomes

cos2 θ + 2i cos θ sin θ – sin2 θ = cos 2θ + i sin 2θ

If we equate real and imaginary parts, we obtain the identities

cos2 θ – sin2 θ = cos 2θ

and

2 cos θ sin θ = sin 2θ

Similarly, the second equation yields

cos3 θ – 3 cos θ sin2 θ = cos 3θ

and

3 cos2 θ sin θ – sin3 θ = sin 3θ

z

zz

z

r

ri

1

21

2

1

21 2 1

1=

= + −( )( )+cos sinθ θ θ ++ −( )( )

= −( )+ −( )

θ

θ θ θ θ

2

1

21 2 1 2

r

ricos sin

1 1 1

2 2 22

2

z r e re

i

i= = −θ

θ

z r ei

2 22= θ

z r ei

1 11= θ


ch10.3.qxd 9/30/05 2:25 PM

21. If

eiθ = cos θ + i sin θ

then replacing θ with –θ yields

e–iθ = cos(–θ) + i sin(–θ) = cos θ – i sin θ

If we then compute eiθ + e–iθ and eiθ – e–iθ, the results will follow.

22. If (a + bi)3 = 8, then a + bi = 2. That is, a2 + b2 = 4.

23. Let z = r(cos θ + i sin θ). Formula (5) guarantees that 1/z = r–1 (cos(–θ) + i sin(–θ))since z ≠ 0. Applying Formula (6) for n a positive integer to the above equation yields

which is just Formula (6) for –n a negative integer.

zz

r n i nn

n

n− −=

= −( ) + −( )( )1cos sinθ θ


ch10.3.qxd 9/30/05 2:25 PM

EXERCISE SET 10.4

1. (a) u – v = (2i – (–i), 0 – i, –1 – (1 + i), 3 – (–1))

= (3i, –i, –2 – i, 4)

(c) –w + v = (–(1 + i) – i, i + i, –(–1 + 2i) + (1 + i), 0 + (–1))

= (–1 – 2i, 2i, 2 – i, –1)

(e) –iv = (–1, 1, 1 – i, i) and 2iw = (–2 + 2i, 2, –4 – 2i, 0). Thus

–iv + 2iw = (–3 + 2i, 3, –3 – 3i, i)

2. Observe that ix = u – v – w. Thus

x = –i(u – v – w)

3. Consider the equation c1u1 + c2u2 + c3u3 = (–3 + i, 3 + 2i, 3 – 4i). The augmented matrixfor this system of equations is

The row-echelon form for the above matrix is

1 2 0 3

1 2 3 2

0 1 2 3 4

− − ++ +

− −

i i i

i i i i

i i

537

5.

6. (a) Since u + v = (3i, 3 + 4i, –3i), then

(c) Here

It follows that

(e)

8. Since kv = (3ki, 4ki), then

Thus k kvv = ⇔ = ±1 1 5.

k ki ki kvv = + =3 4 52 2

.

1 1

6

2

60

wwww =

+

i i, ,

Since thenww = + + = + =1 2 2 4 62 2

i i ,

− + = +i i iuu uu 10 10

uu uu= + − = + = − = − − =3 9 1 10 3 0 1 12 2

i i i i, ( , , ),and uu 00.

uu vv+ = + + + − = + + =3 3 4 3 9 25 9 432 2 2

i i i

((aa)) vv

((cc)) vv

= + =

= + + + + − =

1 2

2 0 2 1 1 4

2 2

2 2 2 2

i

i i ( ) ++ + + =0 5 1 10

Hence, ,c i c i i3 223

2

1

2

1

2

1

2= − = + − +

= = − −c c i3 10 2, .and

1 1 0 2

0 11

2

1

2

3

2

1

20 0 1 1

− + − −

+ +

−

i i

i i

i


9. (a) u • v = (–i)(–3i) + (3i)(–2i) = –3 + 6 = 3.

(c) u • v = (1 – i)(4 – 6i) + (1 + i)(5i) + (2i)(–1 – i) + (3)(–i)

= (–2 – 10i) + (–5 + 5i) + (2 – 2i) + (–3i)

= –5 – 10i

11. Let V denote the set and let

We check the axioms listed in the definition of a vector space (see Section 5.1).

(1)

So u + v belongs to V.

(2) Since u + v = v + u and , it follows that u + v = v + u.

(3) Axiom (3) follows by a routine, if tedious, check.

(4) The matrix serves as the zero vector.

(5)

(6) Since ku = , ku will be in V if and only if , which is true if and

only if k is real or u = 0. Thus Axiom (6) fails.

(7)–(9) These axioms all hold by virtue of the properties of matrix addition and scalarmultiplication. However, as seen above, the closure property of scalar multiplicationmay fail, so the vectors need not be in V.

(10) Clearly 1u = u.

Thus, this set is not a vector space because Axiom (6) fails.

ku ku=ku

ku

0

0

Let = Then−−

−

=

−−

uu uu

u

u

u

u

0

0

0

0. ++ ( ) = .− uu 00

0 0

0 0

u v v u+ = +

uu vv+ =+

+

=

+

+

u v

u v

u v

u v

0

0

0

0

uu = a du

u

v

v

0

0

0

0

=

n vv


12. (a) Since

(z1, 0, 0) + (z2, 0, 0) = (z1 + z2, 0, 0)

and

k(z, 0, 0) = (kz, 0, 0)

this set is closed under both addition and scalar multiplication. Hence it is a subspace.

(b) Since

(z1, i, i) + (z2, i, i) = (z1 + z2, 2i, 2i)

this is not a subspace. Part (b) of Theorem 5.2.1 also is not satisfied.

13. Suppose that T(x) = Ax = 0. It is easy to show that the reduced row echelon form for A is

Hence, x1 = (–(1 + 3i)/2)x3 and x2 = (–(1 + i)/2)x3 where x3 is an arbitrary complexnumber. That is,

spans the kernel of T and hence T has nullity one.

Alternatively, the equation Ax = 0 yields the system

ix1 – ix2 – x3 = 0

x1 – ix2 + (1 + i)x3 = 0

0 + (1 – i)x2 + x3 = 0

The third equation implies that x3 = –(1 – i)x2. If we substitute this expression for x3into the first equation, we obtain x1 = (2 + i)x2. The second equation will then be valid

xx =

− +( )− +( )

1 3 2

1 2

1

i

i

1 0 1 3 2

0 1 1 2

0 0 0

+( )+( )

i

i


for all such x1 and x3. That is, x2 is arbitrary. Thus the kernel of T is also spanned by thevector

If we multiply this vector by –(1 + i)/2, then this answer agrees with the previous one.

14. (a) While this set is closed under vector addition, it is not closed under scalarmultiplication since the real entries, when multiplied by a non-real scalar, need notremain real. Hence it does not form a subspace.

(b) This set does form a subspace since

and

That is, the set is closed under both operations.

15. (a) Since

(f + g)(1) = f(1) + g(1) = 0 + 0 = 0

and

kf(1) = k(0) = 0

for all functions f and g in the set and for all scalars k, this set forms a subspace.

(c) Since

( )( ) ( ) ( ) ( ) ( )

( ) ( ) (

f g x f x g x f x g x

f x g x

+ − = − + − = +

= + = ff g x+ )( )

kz z

z z

kz kz

kz kz

1 2

3 1

1 2

3 1−

=

−( )

u u

u u

v v

v v

u v u1 2

3 1

1 2

3 1

1 1 2

−

+

−

=

+ +++ − +( )

v

u v u v

2

3 3 1 1

x

x

x

i

i

1

2

3

2

1

1

=+

− +


the set is closed under vector addition. It is closed under scalar multiplication by a realscalar, but not by a complex scalar. For instance, if f(x) = xi, then f(x) is in the set butif (x) is not.

16. (a) Consider the equation k1u + k2v = (3i, 3i, 3i). Equating components yields

k1i + k22i = 3i

–k1i + k24i = 3i

k13i = 3i

The above system has the solution k1 = k2 = 1. Thus, (3i, 3i, 3i) is a linear combinationof u and v.

(c) Consider the equation k1u + k2v = (i, 5i, 6i). Equating components yields

k1i + k22i = i

–k1i + k24i = 5i

k13i = 6i

The above system is inconsistent. Hence, (i, 5i, 6i) is not a linear combination of u and v.

17. (a) Consider the equation k1u + k2v + k3w = (1, 1, 1). Equating components yields

k1 + (1 + i)k2 = 1

k2 + ik3 = 1

–ik1 + (1 – 2i)k2 + 2k3 = 1

Solving the system yields k1 = –3 – 2i, k2 = 3 – i, and k3 = 1 + 2i.

(c) Let A be the matrix whose first, second and third columns are the components of u, v,and w, respectively. By Part (a), we know that det(A) ≠ 0. Hence, k1 = k2 = k3 = 0.

18. We let A denote the matrix whose first, second and third columns are the components of v1,v2, and v3, respectively.

(a) Since det(A) = 6i ≠ 0, it follows that v1, v2, and v3 span C3.

(d) Since det(A) = 0, it follows that v1, v2, and v3 do not span C3.


19. (a) Recall that eix = cos x + i sin x and that e–ix = cos(–x) + i sin(–x) = cos x – i sin x.Therefore,

and so cos x lies in the space spanned by f and g.

(b) If af + bg = sin x, then (see Part (a))

(a + b)cos x + (a – b)i sin x = sin x

Thus, since the sine and cosine functions are linearly independent, we have

a + b = 0

and

a – b = –i

This yields a = –i/2, b = i/2, so again the vector lies in the space spanned by f and g.

(c) If af + bg = cos x + 3i sin x, then (see Part (a))

a + b = 1

and

a – b = 3

Hence a = 2 and b = –1 and thus the given vector does lie in the space spanned byf and g.

20. (a) Note that iu1 = u2.

21. Let A denote the matrix whose first, second, and third columns are the components of u1,u2, and u3, respectively.

(a) Since the last row of A consists entirely of zeros, it follows that det(A) = 0 and henceu1, u2, and u3, are linearly dependent.

(c) Since det(A) = i ≠ 0, then u1, u2, and u3 are linearly independent.

cos xe e

ix ix

=+

= +−

2

1

2

1

2ff gg


22. Solving k1, .

Equating components:

k1a + k2(b – ci) = 0, k1(b + ci) + k2a = 0.

Equating real and imaginary parts:

k1a + k2b = 0

k1b + k2a = 0

k1c = 0

k2c = 0.

The above has non-trivial solutions for k1, k2 when a = ±b, c = 0.

23. Observe that f – 3g – 3h = 0.

24. (a) Since the dimension of C2 is two, any basis for C2 must contain precisely two vectors.

25. (a) Since = –4 ≠ 0, the vectors are linearly independent and hence form a basis

for C2.

(d) Since = 0, the vectors are linearly dependent and hence are not a

basis for C2.

26. Since the number of vectors is the same as the dimension of C3, the vectors will form a basisif and only if they are linearly independent.

(a) Since = –i ≠ 0, the vectors are linearly independent. Hence, they form

a basis.

(c) From Problem 21(c), we know that the vectors are linearly independent. Hence, theyform a basis.

i i i

i i

i

0

0 0

2 3 3 2

1

− +−

i i

i

2 4

0

i i

i−

v k w k a b ci, ,+ = + +2 0 we get 1 kk b Ci a2 0,− =



27. The row-echelon form of the matrix of the system is

So x2 is arbitrary and x1 = –(1 + i)x2. Hence, the dimension of the solution space is one

and is a basis for that space.

29. The reduced row-echelon form of the matrix of the system is

So x3 is arbitrary, x2 = (–3i)x3, and x1 = (3 + 6i)x3. Hence, the dimension of the solution

space is one and is a basis for that space.

31. Let u = (u1, u2, . . . , un) and v = (v1, v2, . . . , v

n). From the definition of the Euclidean inner

product in Cn, we have

32. (a) Let u = (u1, u2, . . . , un), v = (v1, v2, . . . , v

n) and w = (w1, w2, . . . , w

n). Then, since

we have u + v = (u1 + v1, u2 + v2, . . . , un

+ vn),

33. Hint: Show that

u + kv2 = u2 + –k(v • u) + k(u • v) + k–

k v2

and apply this result to each term on the right-hand side of the identity.

( ) ( ) ( ) . . . ( )uu vv ww+ = + + + + + +• u v w u v w u v wn n n1 1 1 2 2 2

== + + + + + + +[ . . . ] [ . . . ]u w u w u w v w v w v wn n n n1 1 2 2 1 1 2 2

== • •uu ww ++ vv ww

uu vv• = + + +=

( ) ( ) ( ) . . . ( )

(

k u kv u kv u kv

u kv

n n1 1 2 2

1 1)) ( ) . . . ( )

( ) ( ) . . .

+ + += + +

u kv u kv

k u v u u v

n n2 2

1 1 2 2 2 ++= + + += •

u u v

k u v u v u v

k

n n n

n n

( )

[ . . . ]

( )1 1 2 2

uu vv

3 6

3

1

+−

i

i

1 0 3 6

0 1 3

0 0 0

− −

i

i

− +( )

1

1

i

1 1

0 0

+

i

34. Let f(x) = f1(x) + if2(x) and g(x) = g1(x) + ig2(x). Then

(f + g)(x) = f(x) + g(x)

= [ f1(x) + g1(x)] + i[f2(x) + g2(x)]

and

(kf)(x) = kf(x)

= kf1(x) + ikf2(x)

Thus, if f and g are continuous, then f + g is continuous; similarly, if f is continuous, thenkf is continuous. Hence, the result follows from Theorem 5.2.1.


EXERCISE SET 10.5

1. Let u = (u1, u2), v = (v1, v2), and w = (w1, w2). We proceed to check the four axioms

(1)

(2) ⟨u + v, w⟩ = 3(u1 + v1) + 2(u2 + v2)

= [3u1 + 2u2 ] + [3v1 + 2v2 ]

= ⟨u, w⟩ + ⟨v, w⟩

(3) ⟨ku, v⟩ = 3(ku1) + 2(ku2)

= k[3u1 + 2u2 ] = k⟨u, v⟩

(4) ⟨u, u⟩ = 3u1 + 2u2

= 3|u1|2 + 2|u2|

2 (Theorem 10.2.1)

≥ 0

Indeed, ⟨u, u⟩ = 0 ⇔ u1 = u2 = 0 ⇔ u = 0.

Hence, this is an inner product on C2.

2. (a)

(c) uu vv, ( ) ( ) ( ) ( )

( )

= + − + − +

= +

3 1 1 2 1 1

3 2

i i i i

i ( )2 2 2− =i i

uu vv, ( ) ( ) ( )( )= − + − = − − = −3 2 2 3 6 6 12i i i i

u2u1

v2v1

v2v1

w2w1w2w1

w2w1

vv uu

uu vv

, = +

= + =

3 2

3 2

1 1 2 2

1 1 2 2

v u v u

u v u v ,

547

ch10.5.qxd 9/30/05 7:15 AM

3. Let u = (u1, u2) and v = (v1, v2). We check Axioms 1 and 4, leaving 2 and 3 to you.

(1)

(4) Recall that |Re(z)| ≤ |z| by Problem 29 of Section 10.1. Now

Moreover, ⟨u, u⟩ = 0 if and only if both |u2| and |u1| – |u2| = 0, or u = 0.

4. (a) ⟨u, v⟩ = (2i)i + (1 + i)(2i)(–3i) + (1 – i)(–i)(i) + 3(–i)(–3i)

= –4 + 5i

(c) ⟨u, v⟩ = (1 + i)(1 + i) + (1 + i)(1 + i)(1 – i) + (1 – i)(1 – i)(1 + i) + 3(1 – i)(1 – i)

= 4 – 4i

5. (a) This is not an inner product on C2. Axioms 1–3 are easily checked. Moreover,

⟨u, u⟩ = u1 = |u1| ≥ 0

However, ⟨u, u⟩ = 0 ⇔ u1 = 0 ⇔/ u = 0. For example, ⟨i, i⟩ = 0 although i ≠ 0.Hence, Axiom 4 fails.

u1

2

uu uu, ( ) ( )= + + + − +

=

u u i u u i u u u u1 1 1 2 2 1 2 21 1 3

uu i u u i u u u

u

12

1 2 1 2 22

12

1 1 3

2

( ) ( )

Re

+ + + + +

= + ((( ) )

( )

1 3

2 1 3

1 2 22

12

1 2 22

+ +

≥ − + +

i u u u

u i u u u

== − +

= −( ) +

u u u u

u u u

12

1 2 22

1 1

2

22

2 2 3

2

vv uu, ( ) ( )= + + + − +v u i v u i v u v u1 1 1 2 2 1 2 21 1 3

== + − + + +

=

u v i u v i u v u v1 1 2 1 1 2 2 21 1 3( ) ( )

uu vv,


ch10.5.qxd 9/30/05 7:16 AM

(c) This is not an inner product on C2. Axioms 1 and 4 are easily checked. However, for w = (w1, w2), we have

⟨u + v, w⟩ = |u1 + v1|2|w1|

2 + |u2 + v2|2|w2|

2

≠ (|u1|2 + |v1|

2)|w1|2 + (|u2|

2 + |v2|2)|w2|

2

= ⟨u, w⟩ + ⟨v, w⟩

For instance, ⟨1 + 1, 1⟩ = 4, but ⟨1, 1⟩ + ⟨1, 1⟩ = 2. Moreover, ⟨ku, v⟩ = |k|2⟨u, v⟩, so that⟨ku, v⟩ ≠ k⟨u, v⟩ for most values of k, u, and v. Thus both Axioms 2 and 3 fail.

(e) Axiom 1 holds since

A similar argument serves to verify Axiom 2 and Axiom 3 holds by inspection. Finally,using the result of Problem 29 of Section 10.1, we have

⟨u, u⟩ = 2u1u–

1 + iu1u–

2– iu2u–

1 + 2u2u–

2

= 2|u1|2 + 2Re(iu1u

–2) + 2|u2|

2

≥ 2|u1|2 – 2|iu1u

–2| + 2|u2|

2

= (|u1| – |u2|)2 + |u1|

2 + |u2|2

≥ 0

Moreover, ⟨u, u⟩ = 0 ⇔ u1 = u2 = 0, or u = 0. Thus all four axioms hold.

6. ⟨u, v⟩ = (–i)(3) + (1 + i)(–2 + 3i) + (1 – i)(–4i) + i(1)

= –9 – 5i

vv uu, = + − +

=

2 2

2

1 1 1 2 2 1 2 2

1 1

v u iv u iv u v u

u v −− + +

=

iu v iu v u v2 1 1 2 2 22

uu vv,


ch10.5.qxd 9/30/05 7:16 AM

8. First,

Second, if h = h1 (x) + ih2(x), then

Third, it is easily checked that ⟨kf, g⟩ = k⟨f, g⟩. And finally,

However, ⟨f, f⟩ = 0 ⇔ f1(0) = f2(0) = 0 ⇔/ f = 0. For instance, if f = ix, then ⟨f, f⟩ = 0, butf ≠ 0. Therefore, this is not an inner product.

9. (a) w = [3(–i)(i) + 2(3i)(–3i)]1/2 =

(c) w = [3(0)(0) + 2(2 – i)(2 + i)]1/2 =

10. (a) w = [|–i|2 + |3i|2]1/2 =

(c) w = [|0|2 + |2 –i|2]1/2 =

11. (a) w = [(1)(1) + (1 + i)(1)(i) + (1 – i)(–i)(1) + 3(–i)(i)]1/2 =

(c) w = [(3 – 4i)(3 + 4i)]1/2 = 5

12. (a) A = [(–i)(i) + (7i)(–7i) + (6i)(–6i) + (2i)(–2i)]1/2 = 3 10

2

5

10

10

21

ff,,ff ( ) ( ) ( ) ( )

( )

= +( ) +( )=

f if f if

f

1 2 1 2

12

0 0 0 0

0 ++

≥

( )f22

0

0

ff ++ gg,,hh ( ) ( ) ( ) ( )= + + + ( ) ⋅f g i f g1 1 2 20 0 0 0 (( ( ) ( ))h ih1 20 0+

= +ff,, hh gg,, hh

gg ff, ( ( ) ( )) ( ( ) ( ))

(

= + +

=

g ig f if1 2 1 20 0 0 0

ff if g ig1 2 1 20 0 0 0( ) ( )) ( ( ) ( ))+ +

= ff gg,


ch10.5.qxd 9/30/05 7:16 AM

13. (a) Since u – v = (1 – i, 1 + i), then

d(u, v) = [3(1 – i)(1 + i) + 2(1 + i)(1 – i)]1/2 =

14. (a) Since u – v = (1 – i, 1 + i),

d(u, v) = [(1 – i)(1 + i) + (1 + i)(1 –i)]1/2 = 2

15. (a) Since u – v = (1 – i, 1 + i),

d(u, v)= [(1 – i)(1 + i) + (1 + i)(1 – i)(1 – i)

+ (1 – i)(1 + i)(1 + i) + 3(1 + i)(1 – i)]1/2

=

16. (a)

d(A, B) = [(6i)(–6i) + (5i)(–5i) + (i)(–i) + (6i)(–6i)]1/2 = = 7

17. (a) Since u • v = (2i)(–i) + (i)(–6i) + (3i)(k–), then u • v = 0 ⇔ 8 + 3ik

– = 0 or k = –8i/3.

18. (a) Let B denote the given matrix. Then

⟨A, B⟩ = (2i)(–3) + (i)(1 + i) + (–i)(1 + i) + (3i)(2) = 0

Thus, A and B are orthogonal.

(d) Let B denote the given matrix. Then

⟨A, B⟩ = i + (–i)(3 + i) = 1 – 2i

Thus, A and B are not orthogonal.

298

Since A Bi i

i i− =

6 5

6,

2 3

10


ch10.5.qxd 9/30/05 7:16 AM

19.

Also

and

20. (a) Since , the set of vectors is not orthonormal. (However, the set is

orthogonal.)

(b) Since = 0 and the norm of each vector is 1, the

set is orthonormal.

21. (a) Call the vectors u1, u2, and u3, respectively. Then u1 = u2 = u3 = 1 and

u1 • u2 = u1 • u3 = 0. However, u2 • u3 = . Hence the set is not

orthonormal.

22. If we use the Euclidean inner product, then . Hence, the vectors cannot be

orthonormal with respect to the Euclidean inner product. In fact, neither vector is normal

and the vectors are not orthogonal.

xx = ≠2 5 1

i i2 2

6 6

2

60+

−( )= − ≠

i i i i i i

2 2 2 2 2 2

2 2

, ,•−

= − +

( , )0 1 − =i 2

xx,1

3

1

3

( , , ) , , , ,•0 1 1 0i i e i i i

e

i

i

− = ( ) −( )

=

θ

θ(( )0

0

− +

=

i i

xx,1

31

3

( , , ) , , , ,

(

•1 0 1 1 1 0i e i i

e i

i

i

= ( ) ( )

=

θ

θ −− +

=

i 0

0

)

Since =1

3we have

1

3

x e i

e i

i

i

θ

θ

( , , ),

, ,

1 1

1 1xx = ( )) = + + =1

31 1 1 1 11 2( )( )


ch10.5.qxd 9/30/05 7:16 AM

We now let ⟨u, v⟩ = 3u1v–

1 + 2u2v–

2. Then

and

Hence x and y are orthonormal with respect to this inner product.

24. (a) We have

Because

25. (a) We have

vv11 =

i i i

3 3 3, ,

vv22 =

3

10 10

i i,

12

5

4

5

4 10

5

i i, ,

= then

uu uu vv vv22 22 11 11− = +−

=

( )• ( , ) ,2 24

10 10

3

10

12

i ii i

ii i

5

4

5,

vvuu

uu1111

11

= =−

=−

( , ),

i i i i3

10 10

3

10

xx yy, =

−

+ −

−35

2

302

5

3

30

i i i i

= 0

xx2

35 5

25 5

=

−

+ −

=i i i i

11

32

30

2

302

3

30

32yy =

−

+

−i i i ii

301

=


and, since it follows that/ ,•uu vv2 1 4 10= −

ch10.5.qxd 9/30/05 7:16 AM

and since u2 • v1 = 0, then u2 – (u2 • v1)v1 = u2. Thus,

Since the norm of the above vector is , we have

27. Let u1 = (0, i, 1 –i) and u2 = (–i, 0, 1 + i). We shall apply the Gram-Schmidt process

to u1, u2. Since , it follows that

and because the norm of the above vector is , we have

28. Following Theorem 6.3.6, we orthonormalize the set u1, u2. This gives us

and, since uu vv2 1 2 6/ ,• =

vvuu

uu11

1

= = −

i i i

60

6

2

6, , ,

vv2 =− +

3

15

2

15

1

15

i i, ,

15 3/

uu uu vv vv2 2 1 1− = − + − − +

( ) ( , , ) , ,• i i i0 1 02

3

2

3

2

3

= − +

i i, ,2

3

1

3

1

3

Because thenuu vv2 1 2 3/ ,• = i

vv1 =−

03

1

3, ,

i i

u1 = 3

vv3 =−

i i i

6 6

2

6, ,

1 6/

uu uu vv vv uu vv vv3 1 1 3 2− − =−

( ) ( ) , ,• •3 2 6 6

i i i

33

Also, and anduu vv uu vv3 1 3 24 3 1 2/ /• •= = hhence

vv22 = −

i i

2 20, ,


ch10.5.qxd 9/30/05 7:16 AM

so that

Therefore,

and

29. (a) By Axioms (2) and (3) for inner products,

⟨u –kv, u – kv⟩ = ⟨u, u –kv⟩ + ⟨–kv, u – kv⟩

= ⟨u, u – kv⟩ – k⟨v, u – kv⟩

If we use Properties (ii) and (iii) of inner products, then we obtain

⟨u – kv, u – kv⟩ = ⟨u, u⟩ – k–⟨u, v⟩ – k⟨v, u⟩ + kk–⟨v, v⟩

Finally, Axiom (1) yields

⟨u – kv, u – kv⟩ = ⟨u, u⟩ – k–⟨u, v⟩ – k + kk–⟨v, v⟩

and the result is proved.

(b) This follows from Part (a) and Axiom (4) for inner products.

uu vv,

ww ww ww2 1–=

= −

1

4

9

4

19

4

9

4i i i i, , ,

ww ww vv vv ww vv vv

vv vv

1 1 1 2 2

1 2

( ) ( )• •= +

= +−

= −

7

6

1

2 3

5

44

1

4

5

4

9

4i i i i, , ,−

vv22 = −

i i i i

2 3

3

2 3 2 3 2 3, , ,

uu uu vv vv2 0 01

30( ) ( , , , ) ( , ,•− = − −2 1 1 i i i ii i

ii

i

, )

, , ,

2

3

1

3 3= −


ch10.5.qxd 9/30/05 7:16 AM

30. Suppose that v ≠ 0 and hence that ⟨v, v⟩ ≠ 0. Using the hint, we obtain

Since ⟨v, v⟩ is real, this yields the inequality

or

or

as desired.

In case v = 0, Property (i) for inner products guarantees that both sides of the Cauchy-Schwarz inequality are zero, and thus equality holds.

32. If u and v are linearly dependent, then either v = 0 or u is a scalar multiple of v. In eithercase, a routine computation using properties of the inner product shows that equality holds.

Conversely, if equality holds, then we shall show that u and v must be linearlydependent. For suppose that they are not. Then u – kv ≠ 0 for every scalar k, so it followsthat ⟨u – kv, u – kv⟩ is greater than zero for all k. Thus, strict inequality must hold inProblem 29(b). Now if we choose k as in Problem 30 and rederive the Cauchy-Schwarzinequality, we find that strict inequality must hold there also. That is, if u and v are linearlyindependent, then the inequality must hold. Hence, if the equality holds, then we are forcedto conclude that u and v are linearly dependent.

33. Hint: Let v be any nonzero vector, and consider the quantity ⟨v, v⟩ + ⟨0, v⟩.

34. We have

u, v + w , +

+ ,

=

=

uu vv ww

vv ww uu ( )Axiom 1

, ,= +vv uu ww uu ( )Axiom 2

, ,

, ,

= +

= +

vv uu ww uu

uu vv uu ww ( )Axiom 1

uu vv uu uu vv vv, , ,2

≤

uu vv

vv vvuu uu

,

,,

2

≤

0

2 2 2

≤ − − +uu uuuu vv

vv vv

uu vv

vv vv

uu vv

vv vv,

,

,

,

,

,

,

0 ≤ −

−

uu uu

uu vv

vv vvuu vv

uu vv

vv vv,

,

,,

,

,uu vv

uu vv

vv vvvv vv,

,

,,+

2


ch10.5.qxd 9/30/05 7:16 AM

35. (d) Observe that u + v2 = ⟨u + v, u + v⟩. As in Exercise 29,

⟨u + v, u + v⟩ = ⟨u, u⟩ + 2 Re(⟨u, v⟩) + ⟨v, v⟩

Since (see Exercise 29 of Section 10.1)

|Re(⟨u, v⟩)| ≤ |⟨u, v⟩|

this yields

⟨u + v, u + v⟩ ≤ ⟨u, u⟩ + 2|⟨u, v⟩| + ⟨v, v⟩

By the Cauchy-Schwarz inequality and the definition of norm, this becomes

u + v2 ≤ u2 + 2u v + v2 = (u + v)2

which yields the desired result.

(h) Replace u by u – w and v by w – v in Theorem 6.2.2, Part (d).

37. Observe that for any complex number k,

u + kv2= ⟨u + kv, u + kv⟩

= ⟨u, u⟩ + k⟨v, u⟩ + k–⟨u, v⟩ + kk–⟨v, v⟩

= ⟨u, u⟩ + 2 Re(k⟨v, u⟩) + |k|2⟨v, v⟩

Therefore,

u + v2 – u – v2 + iu + iv2 – iu – iv2

= (1 – 1 + i – i)⟨u, u⟩ + 2 Re(⟨v, u⟩) – 2 Re(–⟨v, u⟩) + 2i Re(i⟨v, u⟩)

–2i Re(–i⟨v, u⟩) + (1 – 1 + i – i)⟨v, v⟩

= 4 Re(⟨v, u⟩) – 4i Im(⟨v, u⟩)

=

= 4 ⟨u, v⟩

4 vv uu,


ch10.5.qxd 9/30/05 7:16 AM

38. By the hint, we have

The last step uses the fact that the basis vectors vj

form an orthonormal set.

39. We check Axioms 2 and 4. For Axiom 2, we have

For Axiom 4, we have

Since |f |2 ≥ 0 and a < b, then ⟨f, f⟩ ≥ 0. Also, since f is continuous, |f |2 dx > 0 unless

f = 0 on [a, b]. [That is, the integral of a nonnegative, real-valued, continuous function (which

represents the area under that curve and above the x-axis from a to b) is positive unless the

function is identically zero.]

a

b

∫

ff ff, = =

= ( )( ) + ( )( )

∫ ∫ff ff dxa

b

a

bf dx

f x f x

2

12

22

∫ dxa

b

ff gg hh+ ,

,

= +

= +

=

∫

∫ ∫

( )ff gg hh

ff hh gg hh

ff

dx

dx dx

a

b

a

b

a

b

hhhh gg hh+ ,

uu ww uu vv ww vv vv vv

uu vv ww vv

, ,

,

=

=

=

=

∑ j k

j k

n

j k

j k

j

n

, ,

,

, 1

1

∑∑

uu uu vv vv ww ww vv= == =∑ ∑, and , v Thereforj j

j

n

k k

k

n

1 1

. ee,


ch10.5.qxd 9/30/05 7:16 AM

40. (a) Since we are working in C[0, 1],

Hence .

(b) Since we are again working in C[0, 1],

41. Let vm

= e2πimx = cos(2πmx) + i sin(2πmx). Then if m ≠ n, we have

Thus the vectors are orthogonal.

vv vvm n mx i mx nx, cos sin cos= ( ) + ( ) ( )∫ 2 2 20

1π π π −− ( )

= ( ) ( ) +

i nx dx

mx nx

sin

cos cos sin

2

2 2 2

π

π π π mmx nx dx

i mx

( ) ( )

+

∫ sin

sin

2

2

0

1π

π(( ) ( ) − ( ) ( )

=

∫ cos cos sin

co

2 2 20

1π π πnx mx nx dx

ss sin2 2

1

0

1

0

1π πm n x dx i m n x dx−( ) + −( )

=

∫∫

222

22

0

1

ππ

ππ

m nm n x

i

m nm n

−( ) −( ) −−( ) −sin[ ] cos[ (( )

= −−( ) +

−( )=

x

i

m n

i

m n

]0

1

2 2

0

π π

ff gg, = +( )= −( )

= −

∫

∫

x ix dx

x ix dx

i

1

1

1

2

1

3

0

1

0

1

gg /= 2 3

gg gg gg gggg2 2

0

1

0

11

4

3= = = +( ) =∫∫, dx x dx


ch10.5.qxd 9/30/05 7:16 AM

ch10.5.qxd 9/30/05 7:16 AM

EXERCISE SET 10.6

4. (b) The row vectors of the matrix are

Since r1 = r2 = 1 and r1 • r2 = 0, it follows that the matrix is unitary.

(c) The row vectors of the matrix are

r1 = (1 + i, 1 + i) and r2 = (1 – i, –1 + i)

Although r1 • r2 = (1 + i)(1 + i) + (1 + i)(–1 – i) = 0 (from which it follows thatr1 and r2 are orthogonal), they are not orthonormal because r1 = r2 = 2. Hence, thematrix is not unitary.

5. (b) The row vectors of the matrix are

Since r1 = r2 = 1 and

the matrix is unitary by Theorem 10.6.2. Hence,

A A A

i

i

T− = = =

− +

−

1

1

2

1

2

1

2

1

2

*

rr rr1 21

2

1

2

1

2

1

20• =

− +

+−

=i i

rr rr1 21

2

1

2

1

2

1

2=

= − + +

, ,and

i i

rr rr1 22

1

2 2

1

2=

= −

i i, ,and

561

5. (d) The row vectors of the matrix are

and

We have r1 = r2 = r3 = 1,

and

Hence, by Theorem 10.6.2, the matrix is unitary and thus

6. Using Formula (11) of Section 10.3, we see that and hence that Let

rr rr rr rr1 1 2 21

2

1

21• •= = + =

rr rr1 22 2 2 2

=

=

−

− −e e ie ie

i i i iθ θ θ θ, ,and . Then

e ei i− =θ θ .e e

i iθ θ= −

A A A

i i i

i

i i

T− = = =

− − −

−−

−

1

1

2 3

3

2 151

2

1

3

4 3

2 151

2 3

5

2

*

115

rr rr1 31

2

3

2 15

1

2

4 3

2• •=

+

−

−−i i i

115

1

2

5

2 150+

−=•

i

rr rr1 21

2 3

1

2

1

3

1

2• • •=

+

−

− +i i i

330

1

3

3

2 15

1

3

4 32 3

=

=

−

+−

rr rr• •i ii i i

2 15 3

5

2 150−

−=•

rr33

2 15

4 3

2 15

5

2 15= + +

i i i, ,

rr

rr

1

2

1

2

1

2

1

2

3

1

3 3

=+

−

=−

i

i i

, ,

, ,


and

Hence, by Theorem 10.6.2, the row vectors form an orthonormal set and so the matrix isunitary.

7. The characteristic polynomial of A is

Therefore, the eigenvalues are λ = 3 and λ = 6. To find the eigenvectors of A correspondingto λ = 3, we let

This yields x1 = –(1 – i)s and x2 = s where s is arbitrary. If we put s = 1, we see that

is a basis vector for the eigenspace corresponding to λ = 3. We normalize this

vector to obtain

To find the eigenvectors corresponding to λ = 6, we let

This yields and x2 = s where s is arbitrary. If we put s = 1, we have that


vector to obtain

1 2

1

−( )

i

xi

s11

2=

−

2 1

1 1

0

01

2

− +− −

=

i

i

x

x

PP1

1

31

3

=

− +

i

− +

1

1

i

−− −

=

1

1

0

01

2i

x

x

det ( )( ) ( )(λ

λλ λ λ

− − +− − −

= − − − = −

4 1

1 54 5 2 6

i

iλλ − 3)

rr rr1 21

2

1

2• = −( ) + ( )

=e ie e ie

i i i iθ θ θ θ −− + =i i 0


Thus

diagonalizes A and


Therefore the eigenvalues are λ = 2 and λ = 8. To find the eigenvectors of A correspondingto λ = 2, we let

This yields and x2 = s where s is arbitrary. If we put s = 1, we have that


vector to obtain

− +( )

1 2

1

i

xi

s11

2= −

+

− − −− + −

=

4 2 2

2 2 2

0

01

2

i

i

x

x

det ( )( )λ

λλ λ

− − −− + −

= − −

6 2 2

2 2 46 4

i

i−− = − −( )( )8 8 2λ λ

P AP

i

i

i

− =

− −

+

−1

1

3

1

3

1

6

2

6

4 1

11 5

1

3

1

6

1

3

2

6+

− + −

i

i i

=

3 0

0 6

P

i i

=

− + −

1

3

1

6

1

3

2

6

PP2

1

6

2

6

=

−

i



This yields x1 = (1 + i)s and x2 = s where s is arbitrary. If we set s = 1, we have that


vector to obtain

Thus

diagonalizes A and

P AP

i

i

i

− =

− +

−

+1

1

6

2

61

3

1

3

6 2 2

22 2 4

1

6

1

32

6

1

3−

−+ +

i

i i

=

2 0

0 8

P

i i

=−

+ +

1

6

1

62

6

1

3

PP2 =

+

1

31

3

i

1

1

+

i

2 2 2

2 2 4

0

01

2

− −− +

=

i

i

x

x

PP1

1

62

6

=−

+

i



Therefore, the eigenvalues are λ = 1, λ = 5, and λ = –2. To find the eigenvectors of Acorresponding to λ = 1, we let

This yields x1 = 0, x2 = – and x3 = s where s is arbitrary. If we set s = 1, we have

that is a basis vector for the eigenspace corresponding to λ = 1. We normalize

this vector to obtain


This yields x1 = s and x2 = x3 = 0 where s is arbitrary. If we let s = 1, we have that

is a basis vector for the eigenspace corresponding to λ = 5. Since this vector is already

normal, we let

1

0

0

0 0 0

0 6 1

0 1 5

1

2

3

−+

i

i

x

x

x

=

0

0

0

PP1

2

6

= −−

0

1 i

6

0

1 2

1

− −( )

i

1

2

− is,

−−

+

=4 0 0

0 2 1

0 1 1

0

01

2

3

i

i

x

x

x 00

det ( )( )(

λλ

λλ λ λ

−+ −+

= − −5 0 0

0 1 1

0 1

1 5i

i

++ 2)


To find the eigenvectors corresponding to λ = – 2, we let

This yields x1 = 0, x2 = (1 – i)s, and x3 = s where s is arbitrary. If we let s = 1, we have that

is a basis vector for the eigenspace corresponding to λ = –2. We normalize this

vector to obtain

Thus

diagonalizes A and

Pi i

= −− −

0 1 0

1

60

1

32

60

1

3

PP3

0

1

31

3

=−

i

0

1

1

−

i

−− −+ −

7 0 0

0 1 1

0 1 2

1

2

3

i

i

x

x

x

=

0

0

0

PP2

1

0

0

=


13. The eigenvalues of A are the roots of the equation

The roots of this equation, which are λ = are not real. This shows that the

eigenvalues of a symmetric matrix with nonreal entries need not be real. Theorem 10.6.6

applies only to matrices with real entries.

14. (b) Let A be hermetian and unitary. Then A = AH = A–1.

15. We know that det(A) is the sum of all the signed elementary products ,

where aij

is the entry from the ith row and jth column of A. Since the ijth element of

, then det is the sum of the signed elementary products or

. That is, det is the sum of the conjugates of the terms in det(A). But

since the sum of the conjugates is the conjugate of the sum, we have

det

16. (a) We have

det( *) det( )

det( ) ( det(

A A

A

T=

= =since det( )B BB

A

T ))

det( ) ( )= by Exercise 15

A A( ) = det( ).

( )A±a a aj j njn1 21 2…

±a a aj j njn1 21 2…( )AA aijis

±a a aj j njn1 21 2…

4 16 4 19

2

± − ( ),

detλ

λλ λ

− −− −

= − + =

1 4

4 34 19 02i

i

P AP

i

i

− =

−+

+

1

01

6

2

61 0 0

01

3

1

3

− − +− −

−−

5 0 0

0 1 1

0 1 0

0 1 0

1i

i

i

660

1

32

60

1

3

1 0 0

0 5 0

0 0 2

−

=−

i


(b) If A is Hermitian, then A = A*. Therefore,

det(A) = det(A*)

= (by Part (a))

But z = if and only if z is real.

(c) If A is unitary, then A–1 = A*. Therefore,

det(A) = 1/det(A–1)

= 1/det(A*)

= 1/ (by Part (a))

Thus det(A)2 = 1. But since |z| ≥ 0 for all z, it follows that |det(A)| = 1.

18. (b) We have

(d) We have

19. If A is invertible, then

A*(A–1)* = (A–1 A)* (by Exercise 18(d))

Thus we have (A–1)* = (A*)–1.

I I I IT T* = = =

( )* ( )

( )

* *

AB AB

AB

B A

B A

T

T

T T

=

=

=

=

( )* ( )

( )

( )

* *

A B A B

A B

A B

A B

T

T

T T

+ = +

= +

= += +

det( )A

z

det( )A


20. Hint: use Exercise 18(a) and Exercise 19.

21. Let ri

denote the ith row of A and let cj

denote the jth column of A*. Then, since

we have cj

= for j = 1,…,n. Finally, let

Then A is unitary ⇔ A–1 = A*. But

A–1 = A* ⇔ AA* = I

⇔ ri

cj = δij for all i, j

⇔ ri rj = δij for all i, j

⇔ r1,…,rn is an orthonormal set

22. We know that if A is unitary, then so is A*. But since (A*)* = A, then if A* is unitary, so isA. Moreover, the columns of A are the conjugates of the rows of A*. Thus we have

A is unitary ⇔ A* is unitary

⇔ the rows of A* form an orthonormal set

⇔ the conjugates of the columns of A form an orthonormalset

⇔ the columns of A form an orthonormal set

23. (a) We know that A = A*, that Ax = λIx, and that Ay = µIy. Therefore

x* Ay = x*(µIy) = µ(x* Iy) = µx*y

and

x* Ay = [(x* Ay)*]* = [y* A*x]*

= [y* Ax]* = [y* (λIx)]*

= [λy*x]* = λx*y

The last step follows because λ, being the eigenvalue of an Hermitian matrix, is real.

δij

i j

i j=

≠=

0

1

if

if

rrjA A AT T* ( ),= =


(b) Subtracting the equations in Part (a) yields

(λ – µ)(x* y) = 0

Since λ ≠ µ, the above equation implies that x*y is the 1 × 1 zero matrix. Letx = (x1,…, x

n) and y = (y1,…, y

n). Then we have just shown that

so that

and hence x and y are orthogonal.

x y x yn n1 1 0 0+ + = =

x y x yn n1 1 0+ + =



2. If a and b are not both zero, then | a |2 + | b |2 ≠ 0 and the inverse is

This inverse is computed exactly as if the entries were real.

3. The system of equations has solution x1 = –is + t, x2 = s, x3 = t. Thus

where s and t are arbitrary. Hence

form a basis for the solution space.

−

i

1

0

1

0

1

and

x

x

x

i

s

1

2

3

1

0

1

0

1

=−

+

t

a

a b

b

a b

b

a b

a

a b

2 2 2 2

2 2 2 2

+ +

−

+ +

573

5. The eigenvalues are the solutions, λ, of the equation

or

But so that . Thus we have

λ3 – 1 = 0

or

(λ – 1)(λ2 + λ + 1) = 0

Hence λ = 1, ω, or . Note that = ω 2.

6. (a) The eigenvalues of are

z = a + bi and = a – bi

(b) det = a2 + b2 = z = | z |2

which corresponds to the complex number (ac – bd) + i (ad + bc)

((cc)) Ζ Ζ Ζ− =+ −

=12 2

1 1

a b

a b

b a

T

det

whilee zz

z

a bi c di ac bd i

− =

+ + = − +

12

((dd)) ( )( ) ( ) (aad bc

a b

b a

c d

d c

+

−

−

)

while ==− − −+ −

ac bd ad bc

ad bc ac bc

zΖ

z

Ζ

ωω

ωω

ω+ + = + =11

2 1 0Re( )1

ωω= ,

λ ωω

λ ωω

λ3 211

11

1 0+ + +

− + +

− =

det

λ

λ ωω

λ ωω

0 1

1 11

0 1 11

−

− − − −

− + + +

= 00


7. (c) Following the hint, we let z = cos θ + i sin θ = eiθ in Part (a). This yields

If we expand and equate real parts, we obtain

But

Re Re1

1

1 111

−−

=

−( ) −+( ) +( )e

e

e en i

i

n iθ

θ

θ −−

−

+( )

( )−( ) −( )

= −

i

i i

n i

e e

e

θ

θ θ1 1

11

Reθθ θ θ

θ− +

−

=− +( )

−e e

e

n

i ni

i2 2

1

2

1 1

Re( )

cos θθ θ θ

θ

θ

− −( )+

−

= −

cos cos

cos

cos

n

1

1

2

1

11

1

1

1

21

−+

− +( )( )−

=

cos

cos cos

cosθθ θ

θn n

++− −

cos cos cos sin sin

sin

n n nθ θ θ θ θθ

22

2

= +−( )+1

21

1 22 2

cos cos sin sin cosn nθ θ θ θ θ

222

1

21

22

2

2

2

sin

cos sin sin

θ

θ θ

= ++n nθθ θ θ

θ

sin cos

sin

2 2

22

2

1 21

1

1

+ + + + =−

−

+cos cos cos Re

( )

θ θ θθ

θ ne

e

n i

i

11

12

1

+ + + + =−

−

+e e e

e

e

i i nin i

i

θ θ θθ

θ( )


Observe that because 0 < θ < 2π, we have not divided by zero.

8. Observe that ω 3 = 1. Hence by Exercise 6(b), we have 1 + ω + ω2 = 0. Therefore

Thus, the vectors form an orthonormal set in C3.

9. Call the diagonal matrix D. Then

( )* ( ) *UD UD D U DU DU

T T T= = = = −1

vv vv

vv vv

1 •

•

22 2

1 22

1

31

1

31 0

1

31

= + +( ) = + +( ) =

= + +

ω ω ω ω

ω ω 44 2

1 22 2 4

1

31 0

1

31

1

31

( ) = + +( ) =

= + +( )= +

ω ω

ωω ω ω

ω

vv vv•

55 10 2

2

1 2

1

31

0

1

31 1

+( ) =

= + +( )=

= +

ω ω ω

ω ω

( )

•

since

vv vv ++( ) =

= + +( ) = =

1 1

1

31 1 11 2

2 2

1

vv vv

vv vv

•

•

( )ωω ω ω ωωsince

222 2 41

31 1= + +( ) =ω ω ω ω4 as above( )

= ++

1

21 2 2

2

cos sin sin cos

sin

n nθ θ θ θ

θ

== ++

1

21

12

2

sin

sin

n θ

θ



We need only check that (UD)* = (UD)–1. But

and

Hence (UD)* = (UD)–1 and so UD is unitary.

10. (a) Since A* = –A, we have

Hence iA is Hermitian.

(b) By Part (a), iA is Hermitian and hence unitarily diagonalizable with real eigenvalues.That is, there is a unitary matrix P and a diagonal matrix D with real entries suchthat

P–1(iA)P = D

Thus

i(P–1 AP) = D

or

P–1 AP = –iD

That is, A is unitarily diagonalizable with eigenvalues which are –i times the eigenvaluesof iA, and hence pure imaginary.

( ) ( )* *iA iA i A iA= = − − =

DD

z

z

z

I

n

=

=

12

22

2

0 0

0 0

0 0

…

…

…

( )*( ) ( )( )UD UD DU UD DD= =−1

...

...

...


11. Show the eigenvalues of a unitary matrix have modulus one.

Proof: Let A be unitary. Then A–1 = AH.

Let λ be an eigenvalue of A, with corresponding eigenvector

Then Ax2 = (Ax)H(Ax) = (xHAH)(Ax) = xH(A–1A)x = xHx =

but also

Ax 2 = λx 2 = (λx)H(λx) = ( λ)(xHx)

since λ and are scalars = | λ |2 x 2. So, | λ |2 = 1, and hence the eigenvalues of A havemodulus one.

λ

λ

x1

2

x

EXERCISE SET 11.1

1. (a) Substituting the coordinates of the points into Eq. (4) yields

which, upon cofactor expansion along the first row, yields –3x + y + 4 = 0; that is,y = 3x – 4.

(b) As in (a),

yields 2x + y – 1 = 0 or y = –2x + 1.

2. (a) Using Eq. (9) yields

which, upon first-row cofactor expansion, yields 18(x2 + y2) – 72x – 108y + 72 = 0 or,dividing by 18, x2 + y2 – 4x – 6y + 4 = 0. Completing the squares in x and y yields thestandard form (x – 2)2 + (y – 3)2 = 9.

x y x y2 2 1

40 2 6 1

4 2 0 1

34 5 3 1

0

+

=

x y 1

0 1 1

1 1 1

0

−=

x y 1

1 1 1

2 2 1

0− =

579

2. (b) As in (a),

yields 50(x2 + y2) + 100x – 200y – 1000 = 0; that is, x2 + y2 + 2x – 4y – 20 = 0. Instandard form this is (x + 1)2 + (y – 2)2 = 25.

3. Using Eq. (10) we obtain

which is the same as

by expansion along the second row (taking advantage of the zeros there). Add column fiveto column three and take advantage of another row of all but one zero to get

.

Now expand along the first row and get 160x2 + 320xy + 160(y2 + y) – 320x = 0; that is,x2 + 2xy + y2 – 2x + y = 0, which is the equation of a parabola (since the discriminant c2

2 – 4c1c3 is zero).

x xy y y x2 2

4 0 0 2

4 10 20 2

16 4 0 4

0

+

−−

= .

x xy y x y2 2

0 0 1 0 1

4 0 0 2 0

4 10 25 2 5

16 4 1 4 1

0−

− −− −

=

x xy y x y2 2 1

0 0 0 0 0 1

0 0 1 0 1 1

4 0 0 2 0 1

4 10 25 2 5 1

16 4 1

−

− −− 44 1 1

0

−

=

x y x y2 2 1

8 2 2 1

34 3 5 1

52 4 6 1

0

+−

−

=


4. (a) From Eq. (11), the equation of the plane is

Expansion along the first row yields –2x – 4y – 2z = 0; that is, x + 2y + z = 0.

(b) As in (a),

yields –2x + 2y – 4z + 2 = 0; that is, –x + y – 2z + 1 = 0 for the equation of the plane.

5. (a) Using Eq. (12), the equation of the sphere is

Expanding by cofactors along the first row yields 16(x2 + y2 + z2) – 32x – 64y – 32z +32 = 0; that is, (x2 + y2 + z2) – 2x – 4y – 2x + 2 = 0. Completing the squares in eachvariable yields the standard form (x – 1)2 + (y – 2)2 + (z – 1)2 = 4.

NOTE: When evaluating the cofactors, it is useful to take advantage of the column ofones and elementary row operations; for example, the cofactor of x2 + y2 + z2

above can be evaluated as follows:

by cofactor expansion of the latter determinant along the last column.

1 2 3 1

1 2 1 1

1 0 1 1

1 2 1 1

1 2 3 1

2 0 2 0

0 2 2 0

0 0 4 0

−

−

=− −

− −−

==16

x y z x y z2 2 2 1

14 1 2 3 1

6 1 2 1 1

2 1 0 1 1

6 1 2 1 1

0

+ +

−

−

= .

x y z 1

2 3 1 1

2 1 1 1

1 2 1 1

0− −

=

x y z 1

1 1 3 1

1 1 1 1

0 1 2 1

0−

−−

= .


5. (b) As in (a),

yields –24(x2 + y2 + z2) + 48x + 48y + 72 = 0; that is, x2 + y2 + z2 – 2x – 2y – 3 = 0or in standard form, (x – 1)2 + (y – 1)2 + z2 = 5.

6. Substituting each of the points (x1, y1), (x2, y2), (x3, y3), (x4, y4), and (x5, y5) into theequation

c1x2 + c2xy + c3y

2 + c4x + c5y + c6 = 0

yields

These together with the original equation form a homogeneous linear system with a non-trivial solution for c1, c2, … , c6. Thus the determinant of the coefficient matrix is zero,which is exactly Eq. (10).

7. As in the previous problem, substitute the coordinates (xi, y

i, z

i) of each of the three

points into the equation c1x + c2y + c3z + c4 = 0 to obtain a homogeneous system withnontrivial solution for c1, … , c4. Thus the determinant of the coefficient matrix is zero,which is exactly Eq. (11).

8. Substituting the coordinates (xi, y

i, z

i) of the four points into the equation c1(x2 + y2 + z2)

+ c2x + c3y + c4z + c5 = 0 of the sphere yields four equations, which together with theabove sphere equation form a homogeneous linear system for c1, … , c5 with a nontrivialsolution. Thus the determinant of this system is zero, which is Eq. (12).

c x c x y c y c x c y c

c x

1 12

2 1 1 3 12

4 1 5 1 6

1 52

0+ + + + + =

+

.

cc x y c y c x c y c2 5 5 3 52

4 5 5 5 6 0+ + + + = .

x y z x y z2 2 2 1

5 0 1 2 1

11 1 3 1 1

5 2 1 0 1

11 3 1 1 1

0

+ +−

−−

=


9. By substituting of the coordinates of the three points (x1, y1), (x2, y2) and (x3, y3) weobtain the equations:

c1y + c2x2 + c3x + c4 = 0.

c1y1 + c2x12 + c3x1 + c4 = 0.

c1y2 + c2x22 + c3x2 + c4 = 0.

c1y3 + c2x32 + c3x3 + c4 = 0.

This is a homogeneous system with a nontrivial solution for c1, c2, c3, c4, so the determinantof the coefficient matrix is zero; that is,

y x x

y x x

y x x

y x x

2

1 12

1

2 22

2

3 32

3

1

1

1

1

0= .


EXERCISE SET 11.2

1.

Applying Kirchhoff’s current law to node A in the figure yields

I1 = I2 + I3.

Applying Kirchhoff’s voltage law and Ohm’s law to Loops 1 and 2 yields

5I1 + 13I2 = 8

and

9I3 – 13I2 + 5I3 = 3.

In matrix form these three equations are

with solution I I I1 2 3255

317

97

317

158

317= = =, , .

1 1 1

5 13 0

0 13 14

1

2

3

− −

−

I

I

I

=

0

8

3

loop 1

loop 2

node A

5 Ω

5 Ω

13

9 Ω

3 V

8 V–

–

+

+

I1

I2

I3

585

2.

Node A gives I1 + I2 = I3.

Loop 1 gives –2I1 + 2I2 = –6.

Loop 2 gives –2I2 – 4I3 = –8.

In matrix form we have

with solution

3.

loop 1 loop 2

node A

4 Ω

6 Ω

2 Ω

2 V

1 V

–

–

+

+

I1

I2

I3

I I I1 2 313

5

2

5

11

5= = − =, , .

1 1 1

2 2 0

0 2 4

1

2

3

−−

− −

I

I

I

== −−

0

6

8

loop 1 loop 2

node A

4 Ω2 Ω2 Ω

3 V

8 V

6 V–

–

+

+

I1 I2 I3


Node A gives I1 + I3 = I2.

Loop 1 gives –4I1 – 6I2 = –1.

Loop 2 gives 4I1 – 2I3 = –2.

In system form we have

with solution

4.

Node A gives I1 = I2 + I4.

Node B gives I4 = I3 + I5.

Node C gives I6 = I3 + I5.

Loop 1 gives 20I1 + 20I2 = 10.

Loop 2 gives –20I2 + 20I3 = 0.

Loop 3 gives –20I3 + 20I5 = 10.

loop 1 loop 2 loop 3

node A node B

node C

20 Ω20 Ω 10 V10 V—+ —

+

I1

I2

I4 I5

I6

I3

I I I1 2 35

22

7

22

6

11= − = =, , .

1 1 1

4 6 0

4 0 2

1

2

3

−− −

−

I

I

I

== −−

0

1

2


In matrix form we have

with solution

5.

After setting I5 = 0 we have that:

node A gives I1 = I3

node B gives I2 = I4

loop 1 gives I1R1 = I2R2

loop 2 gives I3R3 = I4R4.

From these four equations it easily follows that R4 = R3R2/R1.

loop 1

loop 2

node AE node B

I1 I2

R2

R5

R1

R3 R4

I3 I4

I5

I0

–+

I I I I I I1 4 5 6 31

20= = = = = =and 2 .

1 1 0 1 0 0

0 0 1 1 1 0

0 0 1 0 1 1

20 20 0 0 0 0

0 20 20 0 0 0

0 0 2

− −−

−

−− 00 0 20 0

1

2

3

4

5

6

I

I

I

I

I

I

=

0

0

0

10

0

10


6.

Node A in the first circuit gives I = I1 + I2.

Loop 1 in the first circuit gives I1R1 = E.

Loop 2 in the first circuit gives I1R1 = I2R2.

Loop 3 in the first circuit gives IR = E.

From these four equations we obtain

RE

I

E

I I

E

E

R

E

R R R

.= =+

=+

=+1 2

1 2 1 2

11 1

loop 1 loop 2

node A

E E–+

I1R1 R2

I

I2

loop 3

–+

R

I


EXERCISE SET 11.3

1.

In the figure, the feasible region is shown and the extreme points are labelled. The valuesof the objective function are shown in the following table:

Thus the maximum, 7-1/3, is attained when x1 = 2 and x2 = 2/3.

(3/2, 1)

(2, 2/3)

(2, 0)(0, 0)

x1 = 2x2

x1

x2 = 1

2x1 + 3x2 = 6

(1/2, 1)

591

Extreme point Value of(x1, x2) z = 3x1 + 2x2

(0, 0) 0

(1/2, 1) 3-1/2

(3/2, 1) 6-1/2

(2, 2/3) 7-1/3

(2, 0) 6

2.

The intersection of the five half-planes defined by the constraints is empty. Thus thisproblem has no feasible solutions.

3.

The feasible region for this problem, shown in the figure, is unbounded. The value ofz = –3x1 + 2x2 cannot be minimized in this region since it becomes arbitrarily negativeas we travel outward along the line –x1 + x2 = 1; i.e., the value of z is –3x1 + 2x2 = –3x1 + 2(x1 + 1) = –x1 + 2 and x1 can be arbitrarily large.

3x1 – x2 = ñ5

2x1 + 4x2 = 12

–x1 + x2 = 1

x1

x2

2x1 − x2 = −2

4x1 − x2 = 0x2 = 3

x1

x2


4.

The feasible region and vertices for this problem are shown in the figure. The maximumvalue of the objective function z = .1x1 + .07x2 is attained when x1 = 6,000 and x2 = 4,000,then z = 880. In other words, she should invest $6,000 in bond A and $4,000 in bond Bfor an annual yield of $880.

5.

The feasible region and its extreme points are shown in the figure. Though the region isunbounded, x1 and x2 are always positive, so the objective function z = 7.5x1 + 5.0x2is also. Thus, it has a minimum, which is attained at the point where x1 = 14/9 andx2 = 25/18. The value of z there is 335/18. In the problem’s terms, we use 7/9 cups ofmilk and 25/18 ounces of corn flakes, a minimum cost of 18.6¢ is realized.

x2

x1

x1 = 3/2 3x1 – 2x1 = 0

x1 – 2x2 = 0

4x1 + 2x2 = 9

(3/2, 9/4)

(14/9, 25/18)

(40/21, 20/21)

x1 + x2 =18

110

13

(3/2, 3/2)

x2

x2 = 2000

x1 – x2 = 0 x1 + x2 = 10,000x1 = 6000

(5000, 5000)

(6000, 4000)

(6000, 2000)

x1

(2000, 2000)


6. Letting x1 be the number of Company A’s containers shipped and x2 be the number ofCompany B’s, the problem is

Maximize z = 2.20x1 + 3.00x2

subject to

40x1 + 50x2 ≤ 37,000

2x1 + 3x2 ≤ 2,000

x1 ≥ 0

x2 ≥ 0.

The feasible region is shown in the figure. The vertex at which the maximum is attainedis when x1 = 550 and x2 = 300, then z = 2110.

7. We must now maximize z = 2.50x1 + 3.00x2. The feasible region is as in 11.6, but now themaximum happens when x1 = 925 and x2 = 0, then z = 2312.50.

8. Let x1 be the number of pounds of ingredient A used, and x2 be the number of pounds ofingredient B. Then the problem is

Minimize z = 8x1 + 9x2

subject to

2x1 + 5x2 ≥ 10

2x1 + 3x2 ≥ 8

6x1 + 4x2 ≥ 12

x1 ≥ 0

x2 ≥ 0

40x1 + 50x2 = 37,000

(550, 300)

(0, 0) (925, 0)

2x1 + 3x2 = 2000

x2

x1

(0, 666 )23


Though the feasible region shown in the figure is unbounded, the objective function isalways positive there and hence must have a minimum. This minimum occurs at the vertexwhere x1 = 2/5 and x2 = 12/5. The minimum value of z is 124/5 or 24.8¢.

9. It is sufficient to show that if a linear functions has the same value at two points, then ithas that value along the line connecting the two points. Let ax1 + bx2 be the function, andax′1 + bx2′ = ax′1′ + bx2′′ = c. Then (tx′1 + (1 – t)x′1′, tx2′ + (1 – t)x2′′) is an arbitrary point on theline connecting (x1′, x2′) to (x1′′, x2′′) and a(tx1′ + (1 – t)x1′′) + b(tx2′ + (1 – t) x2′′)= t(ax1′ + bx2′) + (1 – t)(ax1′′ + bx2′′) = c.

10.

The level curves of the objective function –5x1 + x2 are shown in the figure, and it is readilyseen that the objective function remains bounded in the region.

z decreasing

x2

x1

z increasing

(0,3)

x2

x1

(2/5, 12/5)

(5/2, 1)

6x1 + 4x2 = 12 2x1 + 3x2 = 8 2x1 + 5x2 = 10

(5, 0)


EXERCISE SET 11.4

1. (a) Step 1 yields

and Step 2 yields

We see that two lines are needed to cover the zeros as shown. Thus the test foroptimality fails. Applying Step 5 yields

Now we cannot cover the zeros with less than three lines, so the matrix contains anoptimal assignment. Actually there are two such assignments:

from the original matrix, we see the cost for each is 30.

2 0 2

0 1 0

0 0 0

2 0 2

0 1 0

0 0 0

and

;

2 0 2

0 1 0

0 0 0

.

0 0 0

3 0 3

1 0 1

.

13 0 6

10 0 3

11 0 4

597

1. (b) Step 1 yields and Step 2 yields

The zeros can be covered by three lines (as shown), so we proceed. Step 5 gives

Now we cannot cover the zeros with less than four lines, so there is an optimalassignment. There are two in fact:

each (from the original matrix) with a cost of –2.

(c) Step 1 yields

5 2 0 0 3

7 3 0 5 6

4 1 0 7 7

0 3 7 1 4

0 5 4 1 5

3 0 0 0

8 8 0 6

0 7 3 0

5 0 0 0

3 0 0 0

8 8 0 6

0 7

and33 0

5 0 0 0

,

3 0 0 0

8 8 0 6

0 7 3 0

5 0 0 0

.

5 0 2 3

8 6 0 7

0 5 3 1

7 0 2 3

5 0 2 2

8 6 0 6

00 5 3 0

7 0 2 2

.


and Step 2 yields

The zeros can be covered by four lines, so, Step 5 yields

Now the zeros cannot be covered with fewer than five lines, so we have an optimalassignment. There are three such (one is shown), each having a cost of 39.

2. From the matrix (1), Step 1 yields

Step 2 yields

and this is not optimal. Step 5 yields

10 13 0

0 0 0

18 10 0

16 59 0

6 46 0

24 56 0

6 1 1 0 0

7 1 0 4 2

5 0 1 7 4

0 1 7 0 0

0 3 4 0 1

..

5 1 0 0 0

7 2 0 5 3

4 0 0 7 4

0 2 7 1 1

0 4 4 1 2

.


clearly optimal, and we obtain the solutions given in the example.

3. Another such covering is

Then Step 5 yields

in which there is still a covering by three lines. Step 5 again yields

Since this matrix is the same as (8), we must get the same optimal assignments.

.

40 0 5 0

0 25 0 0

55 0 0 5

0 40 30 40

−

−

20 0 5 0

0 45 20 20

35 0 0 5

0 60 50 60

−

15 0 0 0

0 50 20 25

35 5 0 10

0 65 50 65

.

0 3 0

0 0 10

8 0 0


4. Another such assignment is

giving the matches

Sue - Don (rank = 10)

Ann - Tom (rank = 9)

Bea - Joe (rank = 6)

Fay - Bob (rank = 6)

Hal - unmatched,

a total score of 31.

5. (a) A dummy coin adds a column of zeros, then we negate to obtain a starting matrix:

Step 1 yields

60 145 0 75 210

55 155 0 75 230

65 115 0 60 200

50 120 0 60 1900

30 150 0 40 200

− − − −− − − −− − −

150 65 210 135 0

175 75 230 155 0

135 85 200 −−− − − −− − − −

140 0

140 70 190 130 0

170 50 200 160 0

.

0 3 0 4 0

4 0 6 1 5

3 1 0 4 0

0 1 6 2 1

0 0 0 0 3


and Step 2 yields

which is not optimal. Step 5 yields

which is not optimal. Step 5 yields

This is optimal, containing only the optimal assignment shown. Thus, bidder 1 getscoin 3 ($210), bidder 2 gets coin 1 ($175), bidder 3 gets coin 2 ($85), bidder 5 getscoin 4 ($160), and bidder 4 gets nothing; a total of $630.

(b) We use bidder 2’s row twice (so he has two chances at getting accepted), and addtwo dummy coin columns. Then negating yields the starting matrix

5 5 0 10 0

0 15 0 10 20

35 0 25 20 15

15 0 20 15 0

0 35 25 0 15

.

10 10 0 15 0

5 20 0 15 20

35 0 20 20 10

20 5 20 20 0

0 35 20 0 10

30 30 0 35 20

25 40 0 35 40

35 0 0 20 10

20 5 0 20 0

0 35 0 0 10


After Steps 1 and 2 we get

not optimal. Step 5 yields

still not optimal. Step 5 again

10 10 0 15 0 0

5 20 0 15 20 20

5 20 0 15 20 20

35 0 20 20 10 10

20 5 220 20 0 0

0 35 20 0 10 10

,

30 30 0 35 20 20

25 40 0 25 40 40

25 40 0 35 40 40

35 0 0 20 10 10

220 5 0 20 0 0

0 35 0 0 10 10

,

− − − −− − − −− − −

150 65 210 135 0 0

175 75 230 155 0 0

175 75 2330 155 0 0

135 85 200 140 0 0

140 70 190 130 0 0

−− − − −− − − −−− − − −

170 50 200 160 0 0

.


which is optimal. The matrix contains several optimal assignments, but all result in:bidder 2 gets coins 1 and 3 ($405), bidder 3 gets coin 2 ($85), bidder 5 gets coin 4($160), and bidders 1 and 4 get nothing; this is a total of $650.

6. Negating the original matrix, and applying Steps 1 and 2 yield

This is not optimal, and Step 5 gives

4 6 13 5 5 0 0 20 7

4 1 1 0 3 6 7 8 4

0 0 2 3 0 6 6 0 1

6 2 8 5 2 13 12 0 7

7 3 88 5 3 13 11 0 7

4 2 3 4 2 13 10 0 7

7 2 1 3 5 7 8 0 0

4 0 0 2 2 4 5 0 0

4 0 0 2 22 4 5 0 0

.

5 5 0 10 0 0

0 15 0 10 20 20

0 15 0 10 20 20

35 0 25 20 15 15

15 0 20 15 0 0

0 35 25 0 15 15

,


This is not optimal. Step 5 again yields

This is still not optimal. Step 5 again gives

6 10 17 17 7 0 0 24 11

4 3 3 0 3 4 5 10 6

0 2 4 3 0 4 4 2 3

0 2 4 3 0 9 8 0 7

5 33 8 3 1 9 7 0 7

2 2 3 2 0 9 6 0 7

5 2 1 1 3 3 4 0 0

2 0 0 0 0 0 1 0 0

2 0 0 0 0 0 1 00 0

.

4 8 15 15 5 0 0 22 9

4 3 3 0 3 6 7 10 6

0 2 4 3 0 6 6 2 3

4 2 8 3 0 11 10 0 7

5 33 8 3 1 11 9 0 7

2 2 3 2 0 11 8 0 7

5 2 1 1 3 5 6 0 0

2 0 0 0 0 2 3 0 0

2 0 0 0 0 22 3 0 0

.


This is optimal, yielding the following lineup(s):

P - Pete 2B - Jill or Ann LF - Herb

C - Liz 3B - Alex CF - John or Lois

1B - Sam SS - Ann or Jill RF - Lois or John

All possibilities have a “score” of 125.

7. Proof of part (ii) of the test for optimality: Since each zero entry of an optimal assignmentis in a different row, a horizontal line can cover at most one assigned zero; similarly, avertical line can cover at most one assigned zero. Thus, as many lines as there are zeros isneeded, that is, n lines.

Proof of part (i) of the test for optimality: Suppose we are given an n × n matrix that doesnot contain an optimal assignment of zeros. Then we cannot find a set of n zero entries, notwo of which come from the same row or column. But we can always find a set of fewerthan n zero entries, say k zero entries where k < n, no two of which come from the samerow or column (possibly k is zero). A set of k such zero entries for which k is as large aspossible is called a maximal assignment of zeros, and the zero entries in the set are calledthe assigned zeros of the maximal assignment. We shall prove the following theorem:

THEOREM All of the zero entries in matrix that contains a maximal

assignment of zeros with k assigned zeros can be covered with k vertical

and horizontal lines.

8 10 17 19 9 0 0 26 11

4 1 1 0 3 2 3 10 4

0 0 2 3 0 2 2 2 1

4 0 6 3 0 7 6 0 5

5 11 6 3 1 7 5 0 5

2 0 1 2 0 7 4 0 5

7 2 1 3 5 3 4 2 0

4 0 0 2 2 0 1 2 0

4 0 0 2 2 0 1 22 0


PROOF. The following algorithm shows how the k lines in the statement of the theoremcan be found:

Step 1. Search the matrix for unassigned zeros that do not contain any assigned zeros intheir rows. Cover each of the columns in which these unassigned zeros lie with a verticalline. Label one such unassigned zero in each covered column as 01. Each column coveredin this way must also contain an assigned zero, which we label as 01

*. (If not, then we couldadd the unassigned zero 01 in that column to the maximal assignment of zeros — acontradiction to the definition of a maximal assignment of zeros.)

Step 2. Search the matrix for uncovered, unassigned zeros that contain an assigned zero ofthe form 01

* in their rows. Cover each of the columns in which these uncovered, unassignedzeros lie with a vertical line. Label one such uncovered, unassigned zero in each newly-covered column as 02. Each column covered in this way must also contain an assigned zero,which we label as 02

*. (If not, then we could add the unassigned zero 02 in that column to themaximal assignment of zeros after we reassign the assigned zero of the form 01

* in its row tothe corresponding unassigned zero 01.)

Step 3. Continue Step 2 iteratively, so that at the m-th step we search for uncovered,unassigned zeros that contain an assigned zero of the form 0

m–1* in its row. Cover their

columns with vertical lines and label one such uncovered, unassigned zero from each suchcovered column as 0

m. Each newly covered column must also contain an assigned zero,

which we label as 0m* (otherwise we could increase the number of assigned zeros by one by

replacing m – 1 assigned zeros of the form 01*, 02

*, …, 0*m–1 by m unassigned zeros of the

form 01, 02, …, 0m–1, 0

m). We continue iteratively until we can find no more unassigned

zeros which contain covered, assigned zeros in their rows.

Step 4. After Step 3, all uncovered, unassigned zeros must lie in rows that containuncovered, assigned zeros. Passing horizontal lines through these remaining uncovered,assigned zeros completes the algorithm. All zeros in the matrix are covered and each of thek assigned zeros has either a vertical or a horizontal line through it. There are thus exactlyk covering lines, which is what we wanted to prove.

As an example of this algorithm, the zero entries of the following 7 × 7 matrix containing amaximal assignment of zeros with six assigned zeros (labeled with asterisks):


can be covered with six lines as follows:

Remark: The above theorem was first proved in 1931 by two Hungarian mathematicians, D. König and E. Egerváry. It is after them that the Hungarian Method is named.

8. (a) Let q be the number of intersections of covering lines. Then there are nm – q entriescovered altogether (since each line covers n entries, but q are covered twice). Thereare thus n2 – (nm – q) uncovered entries. The total amount subtracted from C is thusa(n2 – nm + q), and the total amount added to C is aq. So, the sum of the entries ofC′ is equal to the sum of the entries of C plus aq – a(n2 – nm + q), i.e.,

(b) We assume the original matrix contains integers only. Since the difference Σ (cij

– c′ij)

is positive if a > 0 and n > m, the sum of the entries will decrease each time. If theprocess does not terminate, the sequence of sums (being a decreasing sequence ofnonnegative integers) will reach zero. But then the matrix, non-negative, must beidentically zero, so we must get an optimal assignment of zeros either way.

c c an n mij ij∑ ∑− ′ = −( ).

1 0 0 0 1 1 1

1 1 0 0 1 1 0

0 1 1 0 1 0 0

1 0 1 0 1 1 1

1 1 0 1 0 0 1

1 1 1 0 0 1 1

1 0 1 1 0

1 2

2

1

1

*

*

*

*

*

*

11 1 1

.

1 0 1 0 1 1 1

1 1 0 0 1 1 0

0 1 1 0 1 0 0

1 0 1 0 1 1 1

1 1 0 1 0 0 1

1 1 1

*

*

*

*

*

00 0 1 1

1 0 1 1 0 1 1

*

,


EXERCISE SET 11.5

1. We have 2b1 = M1, 2b2 = M2, … , 2bn–1 = M

n–1 from (6).

Inserting in (13) yields

6a1h + M1 = M2

6a2h + M2 = M3

6an–2h + M

n–2 = Mn–1,

from which we obtain

Now S′′ (xn) = M

n, or from (6), 6a

n–1h + 2bn–1 = M

n.

Also, 2bn–1 = M

n–1 from (6) and so

6an–1h + M

n–1 = Mn

or

aM M

h

aM M

h

aM M

hn

n n

12 1

23 2

21 2

6

6

6

=−

=−

=−

−− −

.

609

Thus we have

for i = 1, 2, … , n – 1. From (9) and (11) we have

aih3 + b

ih2 + c

ih + d

i= yi+1, i = 1, 2, … , n – 1.

Substituting the expressions for ai, b

i, and d

ifrom (14) yields

cih + y

i= y

i+1, i = 1, 2, …, n – 1. Solving for ci

gives

for i = 1, 2, … , n – 1.

2. (a) Set h = .2 and

x1 = 0, y1 = .00000.

x2 = 2, y2 = .19867.

x3 = 4, y3 = .38942.

x4 = .6, y4 = .56464.

x5 = .8, y5 = .71736.

x6 = 1.0, y6 = .84147.

Then

6(y1 – 2y2 + y3)/h2 = –1.1880

6(y2 – 2y3 + y4)/h2 = –2.3295

6(y3 – 2y4 + y5)/h2 = –3.3750

6(y4 – 2y5 + y6)/h2 = –4.2915

cy y

h

M Mhi

i i i i=−

−−

+ +1 1 2

6

M M

hh

Mhi i i+ −

+ +1 3 2

6 2

aM M

hi

i=−+1

6

aM M

hn

n n−

−=−

11

6.


and the linear system (21) for the parabolic runout spline becomes

Solving this system yields

M2 = –.15676

M3 = –.40421

M4 = –.55592

M5 = –.74712.

From (19) and (20) we have

M1 = M2 = –0.15676

M6 = M5 = –0.74712.

The specified interval .4 ≤ x ≤ .6 is the third interval. Using (14) to solve for a3, b3,c3 and d3 gives

a3 = (M4 – M3)/6h = –.12643

b3 = M3/2 = –.20211

c3 = (y4 – y3)/h – (M4 + 2M3)h/6 = .92158

d3 = y3 = .38942.

The interpolating parabolic runout spline for .4 ≤ x ≤ .6 is thus

S(x) = –.12643(x – .4)3 – .20211(x –.4)2

+.92158(x – .4) + .38942.

5 1 0 0

1 4 1 0

0 1 4 1

0 0 1 5

2

3

4

5

M

M

M

M

=

−−−−

1 1880

2 3295

3 3750

4 2915

.

.

.

.

.


2. (b)

S(.5) = –.12643(.1)3 – .20211(.1)2 + .92158(.1) + .38942

= .47943.

Since sin(.5) = S(.5) = .47943 to five decimal places, the percentage error is zero.

3. (a) Given that the points lie on a single cubic curve, the cubic runout spline will agreeexactly with the single cubic curve.

(b) Set h = 1 and

x1 = 0 , y1 = 1

x2 = 1 , y2 = 7

x3 = 2 , y3 = 27

x4 = 3 , y4 = 79

x5 = 4 , y5 = 181.

Then

6(y1 – 2y2 + y3)/h2 = 84

6(y2 – 2y3 + y4)/h2 = 192

6(y3 – 2y4 + y5)/h2 = 300

and the linear system (24) for the cubic runout spline becomes


M2 = 14.

M3 = 32.

M4 = 50.

6 0 0

1 4 1

0 0 6

84

1922

3

4

=M

M

M 3300

.



M1 = 2M2 – M3 = –4.

M5 = 2M4 – M3 = 68.

Using (14) to solve for the ai’s, b

i’s, c

i’s, and d

i’s we have

a1 = (M2 – M1)/6h = 3

a2 = (M3 – M2)/6h = 3

a3 = (M4 – M3)/6h = 3

a4 = (M5 – M4)/6h = 3

b1 = M1/2 = –21

b2 = M2/2 = 171

b3 = M3/2 = 161

b4 = M4/2 = 251

c1 = (y2 – y1)/h – (M2 + 2M1)h/6 = 55

c2 = (y3 – y2)/h – (M3 + 2M2)h/6 = 10

c3 = (y4 – y3)/h – (M4 + 2M3)h/6 = 33

c4 = (y5 – y4)/h – (M5 + 2M4)h/6 = 74

d1 = y1 = 1.

d2 = y2 = 7.

d3 = y3 = 2.

d4 = y4 = 79.


For 0 ≤ x ≤ 1 we have

S(x) = S1(x) = 3x3 – 2x2 + 5x + 1.


S(x) = S2(x) = 3(x – 1)3 + 7(x – 1)2 + 10(x – 1) + 7

= 3x3 – 2x2 + 5x + 1.


S(x) = S3(x) = 3(x – 2)3 + 16(x – 2)2 + 33(x – 2) + 27

= 3x3 – 2x2 + 5x + 1.


S(x) = S4(x) = 3(x – 3)3 + 25(x – 3)2 + 74(x – 3) + 79

= 3x3 – 2x2 + 5x + 1.

Thus S1(x) = S2(x) = S3(x) = S4(x),

or S(x) = 3x3 – 2x2 + 5x + 1 for 0 ≤ x ≤ 4.

4. The linear system (16) for the natural spline becomes


4 1 0

1 4 1

0 1 4

0002

3

4

=−.M

M

M

11116

0000816

0000636

−−

.

.


M2 = –.0000252.

M3 = –.0000108.

M4 = –.0000132.


M1 = 0.

M5 = 0.

Solving for the ai’s, b

i’s, c

i’s, and d

i’s from Eqs. (14) we have

a1 = (M2 – M1)/6h = –.00000042

a2 = (M3 – M2)/6h = –.00000024

a3 = (M4 – M3)/6h = –.00000004

a4 = (M5 – M4)/6h = –.00000022

b1 = M1/2 = 0

b2 = M2/2 = –.0000126

b3 = M3/2 = –.0000054

b4 = M4/2 = –.0000066

b5 = M5/2 = 0

c1 = (y2 – y1)/h – (M2 + 2M1)h/6 = –.000214

c2 = (y3 – y2)/h – (M3 + 2M2)h/6 = –.000088

c3 = (y4 – y3)/h – (M4 + 2 M3)h/6 = –.000092

c4 = (y5 – y4)/h – (M5 + 2M4)h/6 = –.000212


d1 = y1 = .99815.

d2 = y2 = .99987.

d3 = y3 = .99973.

d4 = y4 = .99823.

The resulting natural spline is

Assuming the maximum is attained in the interval [0, 10] we set S′(x) equal to zero in thisinterval:

S′(x) = .00000072x2 –.0000252x + .000088 = 0.

To three significant digits the root of this quadratic equation in the interval [0, 10] is x =3.93, and

S(3.93) = 1.00004.

5. The linear system (24) for the cubic runout spline becomes


M2 = –.0000186.

M3 = –.0000131.

M4 = –.0000106.

6 0 0

1 4 1

0 0 6

0002

3

4

=−.M

M

M

11116

0000816

0000636

−−

.

.

.

S x

x x

( )

. ( ) . ( ) .

=

− + + + +00000042 10 000214 10 998153 ,,

. ( ) . ( ) .

− ≤ ≤

− +

10 0

00000024 0000126 0000883 2

x

x x (( ) . ,

. ( ) .

x x

x

+ ≤ ≤

− − −

99987 0 10

0000004 10 0000053 44 10 000092 10 99973 10 20

000

2( ) . ( ) . ,

.

x x x− − − + ≤ ≤

− 000022 20 0000066 20 000212 203 2( ) . ( ) . ( )x x x− − − − − ++ ≤ ≤

. , .99823 20 30x



M1 = 2M2 – M3 = –.0000241.

M5 = 2M4 – M3 = –.0000081.

Solving for the ai’s, b

i’s, c

i’s and d

i’s from Eqs. (14) we have

a1 = (M2 – M1)/6h = .00000009

a2 = (M3 – M2)/6h = .00000009

a3 = (M4 – M3)/6h = .00000004

a4 = (M5 – M4)/6h = .00000004

b1 = M1/2 = –.0000121

b2 = M2/2 = –.0000093

b3 = M3/2 = –.0000066

b4 = M4/2 = –.0000053

c1 = (y2 – y1)/h – (M2 + 2 M1)h/6 = .000282

c2 = (y3 – y2)/h – (M3 + 2M2)h/6 = .000070

c3 = (y4 – y3)/h – (M4 + 2M3)h/6 = .000087

c4 = (y5 – y4)/h – (M5 + 2M4)h/6 = .000207

d1 = y1 = .99815.

d2 = y2 = .99987.

d3 = y3 = .99973.

d4 = y4 = .99823.


The resulting cubic runout spline is

Assuming the maximum is attained in the interval [0, 10], we set S′(x) equal to zero in thisinterval:

S′(x) = .00000027x2 – .0000186x + .000070.

To three significant digits the root of this quadratic in the interval [0, 10] is 4.00 and

S(4.00) = 1.00001.

6. (a) Since S(x1) = y1 and S(xn) = y

n, then from S(x1) = S(x

n) we have y1 = yn.

By definition S′′(x1) = M1 and S′′(xn) = M

n, and so from S′′(x1) = S′′(x

n) we have

M1 = Mn.

From (5) we have

S′(x1) = c1

S′(xn) = 3a

n–1h2 + 2b

n–1h + cn–1.

Substituting for c1, an–1, bn–1, cn–1 from Eqs. (14) yields

S′(x1) = (y2 – y1)/h – (M2 + 2M1)h/6

S′(xn) = (M

n– M

n–1)h/2 + Mn–1h

+ (yn

– yn–1)/h – (M

n+ 2M

n–1)h/6.

S x

x x

( )

. ( ) . ( ) .

=

+ − + +00000009 10 0000121 10 03 2 000282 10 99815 10 0

00000009 3

( ) . ,

. ( )

x x

x

+ + − ≤ ≤

− .. ( ) . ( ) . ,

.

0000093 000070 99987 0 10

0

2x x x+ + ≤ ≤

00000004 10 0000066 10 0000873 2( ) . ( ) . (x x x− − − + −− + ≤ ≤

− −

10 99973 10 20

00000004 20 003

) . ,

. ( ) .

x

x 000053 20 000207 20 99823 20 302( ) . ( ) . ,x x x− + − + ≤ ≤ ..


Using Mn

= M1 and yn

= y1, the last equation becomes

S′(xn) = M1h/3 + M

n–1h/6 + (y1 – yn–1)/h.

From S′(x1) = S′(xn) we obtain

(y2 – y1)/h – (y1 – yn–1)/h = M1h/3 + M

n–1h/6 + (M2 + 2M1)h/6

or

4M1 + M2 + Mn–1 = 6(y

n–1 – 2y1 + y2)/h2.

(b) Eqs. (15) together with the three equations in part (a) of the exercise statement give

4M1 + 4M2 + Mn–1 = 6(y

n–1 – 2y1 + y2)/h2

4M1 + 4M2 + M3 = 6(y1 – 2y2 + y3)/h2

4M2 + 4M3 + M4 = 6(y2 – 2y3 + y4)/h2

Mn–3 + 4M

n–2 + Mn–1

= 6(yn–3 – 2y

n–2 + yn–1)/h2

M1 + 4Mn–2 + 4M

n–1 = 6(yn–2 – 2y

n–1 + y1)/h2.

This linear system for M1, M2, … , Mn–1 in matrix form is

4 1 0 0 0 0 0 1

1 4 1 0 0 0 0 0

0 1 4 1 0 0 0 0

0 0

. . .

. . .

. . .

00 0 0 1 4 1

1 0 0 0 0 0 1 4

1

2

. . .

. . .

M

M

MM

M

M

h

y y

n

n

n

3

2

1

2

1

6

2

−

−

−

=

− 11 2

1 2 3

2 3 4

3 2 1

2

2

2

2

+− +− +

− +− − −

−

y

y y y

y y y

y y y

y

n n n

n

−− +

−2 1 1y yn


7. (a) Since S′(x1) = y1′ and S′(xn) = y

n′, then from (5) we have

S′(x1) = c1

S′(xn) = 3a

n–1h2 + 2b

n–1h + cn–1.

Substituting for c1, an–1, bn–1, c

n–1 from Eqs. (14) yields

y1′ = (y2 – y1)/h – (M2 + 2M1)h/6

yn′ = (M

n– M

n–1)h/2 + Mn–1h

+ (yn

– yn–1)/h – (M

n+ 2M

n–1)h/6.

From the first equation we obtain

2M1 + M2 = 6(y2 – y1 – hy1′)/h2.

From the second equation we obtain

2Mn

+ Mn–1 = 6(y

n–1 – yn

+ hyn′)/h2.

(b) Eqs. (15) together with the two equations in part (a) give

2M1 + 2M2 + M3 = 6(y2 – y1 – hy1′)/h2

2M1 + 4M2 + M3 = 6(y1 – 2y2 + y3)/h2

2M2 + 4M3 + M4 = 6(y2 – 2y3 + y4)/h2

Mn–2 + 4M

n–1 + Mn

= 6(yn–2 – 2y

n–1 + yn)/h2

Mn–1 + 2M

n= 6(y

n–1 – yn

+ hyn′)/h2.


This linear system for M1, M2, … , Mn

in matrix form is

2 1 0 0 0 0 0 0

1 4 1 0 0 0 0 0

0 1 4 1 0 0 0 0

0 0 1 4 0 0 0

. . .

. . .

. . .

. . . 00

0 0 0 0 0 1 4 0

0 0 0 0 0 1 4 1

0 0 0 0 0 0 1 2

. . .

. . .

. . .

−

−

M

M

M

M

M

M

n

n

1

2

3

2

1

nn

h

hy y y

y

=

− ′ − +−

6

2

2

1 1 2

1 yy y

y y y

y y y

y y hy

n n n

n n n

2 3

2 3 4

2 1

1

2

2

+− +

− +− + ′

− −

−


EXERCISE SET 11.6

1. (a)

Continuing in this manner yields

(b) P is regular because all of the entries of P are positive. Its steady-state vector q

solves (P – I)q = 0; that is,

This yields one independent equation, .6q1 – .5q2 = 0, or q1 = q2 Solutions are thus

of the form q = s

2. (a)

and xx( ).

.

.

.3273

396

331

=

xx xx xx( ) ( ) ( ).

.

.

;1 0 27

2

1

= =

=P likewise

..

.

.

23

52

25

5 6

1

156

1

6

11

5 1

=

+= =. Set to obtains qq

11

6 11

.

5

6

−−

=

. .

. ..

6 5

6 5

0

01

2

q

q

xx xx( ) ( ).

.

.

.4 54546

5454

45454

54546=

=

and

.

xx(3) =

.

.,

454

546

xx xx xx xx( ) ( ) ( ) ( ).

.,

.

.1 0 2 14

6

46

54= =

= =

P P

.

623

(b) P is regular because all of its entries are positive.To solve (P – I)q = 0, i.e.,

reduce the coefficient matrix to row-echelon form:

This yields solutions (setting q23 = s) of the form

To obtain a probability vector, take

yielding

3. (a) Solve (P – I)q = 0, i.e.,

The only independent equation is yielding Setting

yields qq =

9 17

8 17

s = 8

17qq =

9 8

1.s

2

3q q1 2

3

4= ,

−−

=

2 3 3 4

2 3 3 4

0

01

2.

q

q

qq =

22

72

29

72

21

72, , .

t

s =+ +

=1

2221

2921

1

21

72,

22 21

29 21

1

.s

−−

−

⇒−

−

8 1 7

6 6 2

2 5 9

2 5 9

0 21 29

0 0 0

⇒

−

−

1 022

21

0 129

210 0 0

.

−−

−

. . .

. . .

. . .

8 1 7

6 6 2

2 5 9

1

2

3

q

q

q

=

,

0

0

0


(b) As in (a), solve

i.e., .19q1 = .26q2. Solutions have the form

Set to get

(c) Again, solve

by reducing the coefficient matrix to row-echelon form:

yielding solutions of the form

Set to get

4. (a) Prove by induction that p12(n) = 0: Already true for n = 1. If true for n – 1, we

have Pn = Pn–1P, so p12(n) = p12 + p22. But p12 = = 0 so

p12(n) = 0 + 0 = 0. Thus, no power of P can have all positive entries, so P is not

regular.

pn

121( )−

pn

121( )−

pn

111( )−

qq =

3 19

4 19

12 19

.s =12

19

qq =

1 4

1 3

1

s.

1 0 1 4

0 1 1 3

0 0 0

−−

−−

−

2 3 1 2 0

1 3 1 0

1 3 1 2 1 4

1

2

3

q

q

q

=

0

0

0

qq =

26 45

19 45.s =

19

45

qq =

26 19

1s.

−−

=

. .

. .

19 26

19 26

0

01

2

q

q


4. (b)

etc. We use induction to show Pnx =

Already true for n = 1, 2. If true for n – 1, then

Since we get if x is a state vector.

(c) Theorem 3 says that the entries of the steady state vector should be positive; they

are not for

5. Then since the row sums of

sums of P are 1. Thus (Pq)i

= qi for all i.

P p qk

pk

pki ij j

j

k

ij

j

k

ij

j

k

qq( ) = = = == = =

∑ ∑ ∑1 1 1

1 1 1,Let qq =

1 1 1

k k k

t

.

0

1

.

limn→∞

=+

=

P

x x

nxx0 0

11 2lim

n→∞

=1

20

n

,

P P P P

xn n

n

n= =

−

−−

( )11

1

1

2

11

2

xx

xx x

x

n

1 2

1

11

2

1

2

+

=

−

111

2

1

2

1

1 2−

+

+

−n n

x x

=

−

+

1

2

11

2

1

1 2

n

n

x

x x

.

1

2

11

2

1

1 2

−

+

n

n

x

x x

.

If xx xx=

=

+

x

xP

x

x x

1

2

1

1 2

1

21

2

,

=+ +

, P

x

x x x

21

1 1 2

1

41

4

1

2

xx


6. Since P has zero entries, consider P2 = , so P is regular.

Note that all the rows of P sum to 1. Since P is 3 × 3, Exercise 5 implies q =

7. Let x = [x1 x2]t be the state vector, with x1 = probability that John is happy and

x2 = probability that John is sad. The transition matrix P will be

since the columns must sum to one. We find the steady state vector for P by solving

i.e., Let and get so 10/13 is the

probability that John will be happy on a given day.

8. The state vector x = [x1 x2 x3]t will represent the proportion of the population living in

regions 1, 2 and 3, respectively. In the transition matrix, pij

will represent the proportionof the people in region who move to region i, yielding

P =

. . .

. . .

. . .

.

90 15 10

05 75 05

05 10 85

qq =

10 13

3 13,s =

3

131

5

2

3

10 3

11 2q q s= =

, .so qq

−−

=

1 5 2 3

1 5 2 3

0

01

2,

q

q

P =

4 5 2 3

1 5 1 3

1

3

1

3

1

3

t

.

1

2

1

4

1

41

4

1

2

1

41

4

1

4

1

2


To find the steady state vector, solve

First reduce to row echelon form , yielding

Set and get , i.e., in the long run 13/24 (or 54 %) of the people

reside in region 1, 4/24 (or 16 %) in region 2 and 7/24 (or 29 %) in region 3.1

6

2

3

1

6q =

13 24

4 24

7 24

s =7

24

qq =

13 7

4 7

1

.s

1 0 13 7

0 1 4 7

0 0 0

−−

−−

−

. . .

. . .

. . .

10 15 10

05 25 05

05 10 15

q11

2

3

0

0

0

q

q

=

.


EXERCISE SET 11.7

1. Note that the matrix has the same number of rows and columns as the graph has vertices,and that ones in the matrix correspond to arrows in the graph. We obtain

2. See the remark in problem 1; we obtain

P3

P3

P4P5P6

P5

P3P4

P1 P1 P1P2

P2

P2P4

(c)(b)(a)

((aa)) ((bb))

0 0 0 1

1 0 1 1

1 1 0 1

0 0 0 0

0 1 1 0 0

0 0 0 0

11

1 0 0 1 0

0 0 1 0 0

0 0 1 0 0

0 1 0 1 0 0

1

((cc))

00 0 0 0 0

0 1 0 1 1 1

0 0 0 0 0 1

0 0 0 0 0 1

0 0 1 0 1 0

629

3. (a) As in problem 2, we obtain

(b) m12 = 1, so there is one 1-step connection from P1 to P2.

So m12(2) = 2 and m12

(3) = 3 meaning there are two 2-step and three 3-step connectionsfrom P1 to P2 by Theorem 1. These are:

1-step: P1 → P2

2-step: P1 → P4 → P2 and P1 → P3 → P2

3-step: P1 → P2 → P1 → P2, P1 → P3 → P4 → P2,

and P1 → P4 → P3 → P2.

(c) Since m14 = 1, m14(2) = 1 and m14

(3) = 2, there are one 1-step, one 2-step and two 3-stepconnections from P1 to P4. These are:

1-step: P1 → P4

2-step: P1 → P3 → P4

3-step: P1 → P2 → P1 → P4 and P1 → P4 → P3 → P4.

4. (a) Note that to be contained in a clique, a vertex must have “two-way” connections withat least two other vertices. Thus, P4 could not be in a clique, so P1, P2, P3 is the onlypossible clique. Inspection shows that this is indeed a clique.

(b) Not only must a clique vertex have two-way connections to at least two other vertices,but the vertices to which it is connected must share a two-way connection. Thisconsideration eliminates P1 and P2, leaving P3, P4, P5 as the only possible clique.Inspection shows that it is indeed a clique.

M M2 3

1 2 1 1

0 1 1 1

1 1 1 0

1 1 0 1

2 3 2 2

1 2=

=and11 1

1 2 1 2

1 2 2 1

.

P2

P4

P1

P3


(c) The above considerations eliminate P1, P3 and P7 from being in a clique. Inspectionshows that each of the sets P2, P4, P6 , P4, P6, P8 , P2, P6, P8 , P2, P4, P8 and P4, P5, P6 satisfy conditions(i) and (ii) in the definition of a clique. But note that P8 can be added to the firstset and we still satisfy the conditions. P5 may not be added, so P2, P4, P6, P8 is aclique, containing all the other possibilities except P4, P5, P6 , which is also a clique.

5. (a) With the given M we get, as in Example 5, that

Since sii(3) = 0 for all i, there are no cliques in the graph represented by M.

(b) As in (a),

The elements along the main diagonal tell us that only P3, P4 and P6 are members ofa clique. Since a clique contains at least three vertices, we must have P3, P4, P6 asthe only clique.

6. As in problem 1,

M =

0 0 1 1

1 0 0 0

0 1 0 1

0 1 0 0

.

S =

0 1 0 1 0 0

1 0 1 0 1 0

0 1 0 1 0 1

1 0 1 0 1 1

0 1 0 1 0 0

0 0 1 1 0 0

=, S3

0 6 1 7 0 2

6 0 7 1 6 3

1 7 2 8 1 4

7 1 8 2 7 55

0 6 1 7 0 2

2 3 4 5 2 2

.

S =

0 1 0 1 0

1 0 1 0 0

0 1 0 0 1

1 0 0 0 1

0 0 1 1 0

,, S3

0 3 1 3 1

3 0 3 1 1

1 3 0 1 3

3 1 1 0 3

1 1 3 3 0

=

.


Then

By summing the rows of M + M2, we get that the power of P1 is 2 + 1 + 2 = 5, the powerof P2 is 3, of P3 is 4, and of P4 is 2.

7. Associating vertex P1 with team A, P2 with B, … , P5 with E, the game results yield thefollowing dominance-directed graph:

which has vertex matrix

Now,

M2

0 1 1 1 2

1 0 0 2 1

1 1 0 1 0

0 0 1 0 1

0 2 1 1 0

=

+ =

, M M2

0 2 2 2 2

1 0 1 2 2

1 1 0 2 1

0 1 1 0 1

1 2 1 2 0

.

M =

0 1 1 1 0

0 0 1 0 1

0 0 0 1 1

0 1 0 0 0

1 0 0 1 0

.

P2

P4

P1

P3

P5

M M M2 2

0 2 0 1

0 0 1 1

1 1 0 0

1 0 0 0

0 2 1 2

=

+ =and11 0 1 1

1 2 0 1

1 1 0 0

.


Summing the rows, we get that the power of A is 8, of B is 6, of C is 5, of D is 3, and of E is 6. Thus ranking in decreasing order we get A in first place, B and E tie for secondplace, C in fourth place, and D last.


EXERCISE SET 11.8

1. (a) From Eq. (2), the expected payoff of the game is

(b) If player R uses strategy [p1 p2 p3] against player C’s strategy

his payoff will be pAq = (–1/4)p1 + (9/4)p2 – p3. Since p1, p2 and p3 are nonnegativeand add up to 1, this is a weighted average of the numbers –1/4, 9/4 and –1. Clearlythis is the largest if p1 = p3 = 0 and p2 = 1; that is, p = [0 1 0].

(c) As in (b), if player C uses [q1 q2 q3 q4]t against , we get pAq = –6q1 +

3q2 + q3 – 12q4. Clearly this is minimized over all strategies by setting q1 = 1 and

q2 = q3 = q4 = 0. That is q = [1 0 0 0]t.

2. As per the hint, we will construct a 3 × 3 matrix with two saddle points, say a11 = a33 = 1.Such a matrix is

A .=

1 2 1

0 7 0

1 2 1

1

20

1

2

1

4

1

4

1

4

1

4

t

,

p qA =

− −−

− −

1

20

1

2

4 6 4 1

5 7 3 8

8 0 6 2

= − .

1

41

41

41

4

5

8

635

(The reader is invited to prove that if the matrix A has two saddle points ars

and atu

, thenwe must necessarily have a

rs= a

tu).

3. (a) Calling the matrix A, we see a22 is a saddle point, so the optimal strategies are pure,namely: p = [0 1], q = [0 1]t; the value of the game is a22 = 3.

(b) As in (a), a21 is a saddle point, so optimal strategies are p = [0 1 0], q = [1 0]t; the value of the game is a21 = 2.

(c) Here, a32 is a saddle point, so optimal strategies are p = [0 0 1], q = [0 1 0]t

and v = a32 = 2.

(d) Here, a21 is a saddle point, so p = [0 1 0 0], q = [1 0 0]t and v = a21 = –2.

4. (a) Calling the matrix A, the formulae of Theorem 2 yield p = [5/8 3/8], q = [1/8 7/8]t,v = 27/8 (A has no saddle points).

(b) As in (a),

(c) For this matrix, a11 is a saddle point, so p = [1 0], q = [1 0]t and v = a11 = 3.

(d) This matrix has no saddle points, so, as in (a),

(e) Again, A has no saddle points, so as in (a),

qq =

=−1

13

12

13

29

13

t

vand .

pp ,=

3

13

10

13

pp qq,=−−

−−

=

=−−

−3

5

2

5

3

5

2

5

3

5

2

−−

=

=−−

=

5

3

5

2

5

19

5

19

5

t t

v .

and

pp qq,=

=

=40

60

20

60

2

3

1

3

10

60

50

600

1

6

5

6

1400

60

70

3

=

= =

,

(

t

v Agaain, A has no saddle points).


5. Let a11 = payoff to R if the black ace and black two are played = 3.

a12 = payoff to R if the black ace and red three are played = –4.

a21 = payoff to R if the red four and black two are played = –6.

a22 = payoff to R if the red four and red three are played = 7.

So, the payoff matrix for the game is

A has no saddle points, so from Theorem 2,

that is, player R Should play the black ace 65 percent of the time, and

player C should play the black two 55 percent of the time. The value of the game is ,that is, player C can expect to collect on the average 15 cents per game.

6. If p* = [0 0 … 1 … 0] (one in r-th place) and

q* = [0 0 … 1 … 0]t (one in s-th place) and

A is an m × n matrix (m = length of p, n = length of q), then

(since other entries of p*, q* are zero), verifying Eq. (6). If ars

is a saddle point then a

rs≤ a

rjfor all j, since a

rsis the smallest in its row. So if q = [q1

… qn]t is any

column strategy, we have

pp** qq pp** qqAA AA= ( ) =

= ≥

= =∑ ∑p a q a qi

i

m

ij j rj j

j

n*

1 1jj

n

rs j

j

n

rs j rs

j

n

rs

a q

a q a a

= =

=

∑ ∑

∑= = =

1 1

1

1( ) ,

pp** qq** == pp** qq **AA AA( ) =

=

= =∑ ∑* *

p a qi

i

m

ij j

j

n

1 1

*p a ai

i

m

is rs

=∑ =

1

−3

20

qq ;=

11

20

9

20

t

pp ,=

13

20

7

20

A .=−

−

3 4

6 7


demonstrating Eq. (7). For Eq. (8), note that ars is a saddle point implies ars

≥ ais

for all i, since a

rsis the largest in its column. So, if p = [p1

… pn] is any row strategy, we

have

7. Take . Suppose a22 – a21 > 0. Since the game is not strictly

determined, a21 is not a saddle point, so a21 < a11. Similarly, a11 is not a saddle point, so

a11 > a12. Then . All the parenthesized expressions being

positive yields If a22 – a21 < 0, get a22 < a12, then a12 > a11, so all the parenthesized

expressions are negative; again

If a11 > a12, then a12 < a22, yielding a22 > a21; i.e., all parenthesized expressions are positive,

so If a11 < a12, then a11 < a21, so a21 > a22, all parenthesized expressions are

negative and

Expressing as respectively,

and using the non-strictly determined property of A as above, we get and as well.0 12

*< <q

0 11*< <q

a a

a a a a

a a

a

22 12

22 12 11 21

11 21−( )−( ) + −( )

−( )and

111 21 22 12−( ) + −( )a a aq q1 2

* *and

0 12 .*< <p

0 12 .*< <p

pa a

a a a a

a a

a a2

11 12

11 22 12 21

11 12

11 12

* =−

+ − −=

−( )−(( ) + −( )a a22 21

.0 11 .*< <p

0 11 .*< <p

pa a

a a a a1

22 21

22 21 11 12

* =−( )

+( ) + −( )

pa a

a a a a1

22 21

11 22 12 21

* =−

+ − −

pp qq** pp qq **AA AA= ( ) =

= ≤

= =∑ ∑p a q p ai ij j

j

n

i is

i

m

*

1 1ii

m

rs i

i

n

rs rs

a p

a a

= =∑ ∑

= =1 1

1( ) .


EXERCISE SET 11.9

1. (a) Calling the given matrix E, we need to solve

This yields , that is, p = s[1 3/2]t. Set s = 2 and get p = [2 3]t.

(b) As in (a), solve

In row-echelon form, this reduces to

Solutions of this system have the form p = s[1 5/6 1]t. Set s = 6 and get p = [6 5 6]t.

(c) As in (a), solve

( )

. . .

. . .

. . .

I E− =− −

− −− −

pp

65 50 30

25 80 30

40 30 60

=

p

p

p

1

2

3

0

0

0

,,

1 0 1

0 1 5 6

0 0 0

1

2

3

−−

=p

p

p

00

0

0

.

( )

/ /

/ /

/

I E− =−

− −− −

pp

1 2 0 1 2

1 3 1 1 2

1 6 1 1

.

p

p

p

1

2

3

0

0

0

=

1

2

1

31 2p p=

( )I Ep

p− =

−−

=

pp

1 2 1 3

1 2 1 3

0

01

2

.

639

which reduces to

Solutions are of the form p = s[78/79 54/79 1]t. Let s = 79 to obtain p = [78 54 79]t.

2. (a) By Corollary 1, this matrix is productive, since each of its row sums is .9.

(b) By Corollary 2, this matrix is productive, since each of its column sums is less thanone.

(c) Try x = [2 1 1]t. Then Cx = [1.9 .9 .9]t, i.e., x > Cx, so this matrix is productive by Theorem 3.

3. Theorem 2 says there will be one linearly independent price vector for the matrix E if somepositive power of E is positive. Since E is not positive, try E2:

4. The exchange matrix for this arrangement (using A, B, and C in that order) is

For equilibrium, we must solve (I – E)p = 0. That is

1 2 1 3 1 4

1 3 2 3 1 4

1 6 1 3 1 2

1

2

3

− −− −− −

p

p

p

=

.

0

0

0

1 2 1 3 1 4

1 3 1 3 1 4

1 6 1 3 1 2

/

.

E2

2 34 1

2 54 6

6 12 3

0=

>. . .

. . .

. . .

.

1 3 4 3 2

0 1 54 79

0 0 0

1

2

3

−−

p

p

p =

.

0

0

0


Row reduction yields solutions of the form p = [18/16 15/16 1]ts.

Set s = 1600/15 and obtain p = [120 100 106.67]t; i.e., the price of tomatoes was $120,corn was $100 and lettuce was $106.67.

5. Taking the CE, EE, and ME in that order, we form the consumption matrix C, where c

ij= the amount (per consulting dollar) of the i-th engineer’s services purchased by the

j-th engineer. Thus,

We want to solve (I – C)x = d, where d is the demand vector, i.e.

In row-echelon form this reduces to

Back-substitution yields the solution x = [1256.48 1448.12 1556.19]t.

6. The i-th column sum of E is eji, and the elements of the i-th column of I – E are

the negatives of the elements of E, except for the ii-th, which is 1 – eii. So, the i-th

column sum of I – E is 1 – eji

= 1 – 1 = 0. Now, (I – E)t has zero row sums, so

the vector x = [1 1 … 1]t solves (I – E)tx = 0. This implies det(I – E)t = 0. But det(I – E)t = det(I – E), so (I – E)p = 0 must have nontrivial (i.e., nonzero) solutions.

j

n

=∑

1

j

n

=∑

1

1 2 3

0 1 43877

0 0 1

1

2

3

− −−

. .

.

x

x

x

=

500

785 31

1556 19

.

.

.

1 2 3

1 1 4

3 4 1

1

2

3

− −− −− −

. .

. .

. .

x

x

x

=

500

700

600

.

C =

0 2 3

1 0 4

3 4 0

. .

. .

. .

.


7. Let C be a consumption matrix whose column sums are less than one; then the row sumsof Ct are less than one. By Corollary 1, Ct is productive so (I – Ct)–1 ≥ 0. But

(I – C)–1 = (((I – C)t))–1)t = ((I – Ct)–1)t ≥ 0.

Thus, C is productive.

8. (I) Let y be a strictly positive vector, and x = (I – C)–1y. Since C is productive (I – C)–1 ≥ 0, so x = (I – C)–1y ≥ 0. But then (I – C)x = y > 0, i.e., x – Cx > 0, i.e., x > Cx.

(II) Step 1: Since both x* and C are ≥ 0, so is Cx*. Thus x* > Cx* ≥ 0.

Step 2: Since x* > Cx*, x* – Cx* > 0. Let ε be the smallest element in x* – Cx*, andM the largest element in x*. Then x* – Cx* > 2M

ε—–x* > 0, i.e., x* – 2Mε—–x* > Cx*.

Setting λ = 1 – 2Mε—– < 1, we get Cx* < λx*.

Step 3: First, we show that if x > y, then Cx > Cy. But this is clear since (Cx)i

=

Now we prove Step 3 by induction on n, the case n = 1

having been done in Step 2. Assuming the result for n – 1, then Cn–1x* < λn–1x*. But then Cnx* = C(Cn–1x*) < C(λn–1x*) = λn–1(Cx*) < λn–1(λx*) = λnx*, proving Step 3.

Step 4: Clearly, Cnx* ≥ 0 for all n. So we have

Denote the elements of Then we have for all i. But

–c

ij≥ 0 and x* < 0 imply –c

ij= 0 for all i and j, proving Step 4.

Step 5: By induction on n, the case n = 1 is trivial. Assume the result true for n – 1.Then

(I – C)(I + C + C2 + … + Cn–1) = (I – C)

(I + C + … + Cn–2) + (I – C)Cn–1 = (I – Cn–1) + (I – C)Cn–1

= I – Cn–1 + Cn–1 – Cn,

= I – Cn,

proving Step 5.

01

==∑ c xij j

j

n*lim .

n

nijC c

→∞by

0 0≤ ≤ = =→∞ →∞ →∞

lim * lim * , lim *n

n

n

n

n

nC Cx x i.e., xλ 00.

c x c y Cyij j

j

n

ij j

j

n

i

= =∑ ∑> =

1 1

( ) .


Step 6: First we show (I – C)–1 exists. If not, then there would be a nonzero vector z such that Cz = z. But then Cnz = z for all n, so z = lim

n → ∞ Cnz = 0, acontradiction, thus I – C is invertible. Thus, I + C + … + Cn–1 = (I – C)–1(I – Cn), so S = lim

n → ∞ (I – C)–1(I – Cn) = (I – C)–1 (I – limn → ∞ Cn) = (I – C)–1, proving

Step 6.

Step 7: Since S is the (infiite) sum of nonnegative matrices, S itself must be non-negative.

Step 8: We have shown in Steps 6 and 7 that (I – C)–1 exists and is nonnegative,thus C is productive.


EXERCISE SET 11.10

1. Using Eq. (18), we calculate

So all the trees in the second class should be harvested for an optimal yield (since s = 1000) of $15,000.

2. From the solution to Example 1, we see that for the fifth class to be harvested in theoptimal case we must have

p5s/ (.28–1 + .31–1 + .25–1 + .23–1) > 14.7s,

yielding p5 > $222.63.

3. Assume p2 = 1, then Yld2 = = .28s. Thus, for all the yields to be the same we

must have

s

.281( )−

Ylds

s

Ylds s

2

3

30

215

50

232

100

7.

= =

=+

=

645

p3s/(.28–1 + .31–1) = .28s

p4s/(.28–1 + .31–1 + .25–1) = .28s

p5s/(.28–1 + .31–1 + .25–1 + .23–1) = .28s

p6s/(.28–1 + .31–1 + .25–1 + .23–1 + .27–1) = 28s

Solving these sucessively yields p3 = 1.90, p4 = 3.02, p5 = 4.24 and p6 = 5.00. Thus the ratio

p2 p3 p4 p5 p6 = 1 1.90 3.02 4.24 5.00 .

4. From Eq. (17) we have

Then Eq. (16) yields

for 1 ≤ i ≤ k – 1. Then (*) and Eq. (13) yield Eq. (19).

5. Since y is the harvest vector, N = yi

is the number of trees removed from the forest.

Then Eq. (7) and the first of Eqs. (8) yield N = g1x1, and from Eq. (17) we obtain

Ng s

g

g

g

g

s

g gk k

.=+ + ⋅ ⋅ ⋅ +

=+ ⋅ ⋅ ⋅ +

− −

1

1

2

1

1 1 11

1 1

i

n

=∑

1

(*)xg x

g

g

g

s

gg g

ii i

k

= =+ ⋅ ⋅ ⋅ +

−

1 1 1

11 1

1 1

=+ ⋅ ⋅ ⋅ +

−

1

1 1

1 1

g

s

g g

i

k

xs

g

g

g

g

g

g

s

gg gk

11

2

1

3

1

11

1 2

1 1 1=

+ + + ⋅ ⋅ ⋅ +=

+ + ⋅−

⋅⋅ ⋅ +

−

1

1gk

.


6. Set g1 = … = gn–1 = g, and p2 = 1. Then from Eq. (18), Yld2 = = gs. Since we want all

of the Yldk’s to be the same, we need to solve Yld

k= = gs for p

kfor 3 ≤ k ≤ n.

Thus pk

= k – 1. So the ratio

p2 p3 p4

… pn

= 1 2 3 … (n – 1).

p s

kg

k

( )−11

p s

g

2

1

1


EXERCISE SET 11.11

1. (a) Using the coordinates of the points as the columns of a matrix we obtain

(b) The scaling is accomplished by multiplication of the coordinate matrix on the left by

resulting in the matrix

0 3

2

3

2

0

0 0 1

2

1

2

0 0 0 0

,

3

20 0

01

20

0 0 1

,

0 1 1 0

0 0 1 1

0 0 0 0

.

649

which represents the vertices (0, 0, 0), as shown below.

1. (c) Adding the matrix

to the original matrix yields

which represents the vertices (–2, –1, 3), (–1, –1, 3), (–1, 0, 3), and (–2, 0, 3) asshown below.

(d) Multiplying by the matrix

cos( ) sin( )

sin( ) cos( )

− − −

− −

30 30 0

30 30 0

0 0 1

,

− − − −− −

2 1 1 2

1 1 0 0

3 3 3 3

,

− − − −− − − −

2 2 2 2

1 1 1 1

3 3 3 3

3

20 0

3

2

1

20 0, , , , , ,

and 0,1

2


we obtain

The vertices are then (0, 0, 0), (.866, – .500, 0), (1.366, .366, 0), and (.500, .866, 0)as shown:

2. (a) Simply perform the matrix multiplication

(b) We multiply

yielding the vertices (0, 0, 0), (1, 0, 0), 3

21 0

1

21 0, , , , , .

and

11

20

0 1 0

0 0 1

0 1 1 0

0 0 1 1

0 0 0 0

=

0 13

2

1

20 0 1 1

0 0 0 0

11

20

0 1 0

0 0 1

x

y

z

i

i

i

=

+

.

x y

y

z

i i

i

i

1

2

0 30 30 30 30

0

cos( ) cos( ) sin( ) sin( )

sin

− − − − − −

(( ) cos( ) sin( ) cos( )− − + − −

30 30 30 30

0 0 0 0

=

− 0 866 1 366 500

0 500 366 866

0 0 0 0

. . .

. . .

.


2. (c) Obtain the vertices via

yielding (0, 0, 0), (1, .6, 0), (1, 1.6, 0) and (0, 1, 0), as shown:

3. (a) This transformation looks like scaling by the factors 1, –1, 1, respectively and indeedits matrix is

(b) For this reflection we want to transform (xi, y

i, z

i) to (–x

i, y

i, z

i) with the matrix

Negating the x-coordinates of the 12 points in view 1 yields the 12 points (–1.000,–.800, .000), (–.500, –.800, –.866), etc., as shown:

−

1 0 0

0 1 0

0 0 1

.

1 0 0

0 1 0

0 0 1

−

.

1 0 0

6 1 0

0 0 1

0 1 1 0

0 0 1 1

0 0 0 0

.

=

. . ,

0 1 1 0

0 6 1 6 1

0 0 0 0


(c) Here we want to negate the z-coordinates, with the matrix

This does not change View 1.

4. (a) The formulas for scaling, translation and rotation yield the matrices

(b) Clearly, P′ = M5M4M3(M1P + M2).

M M1 2

1 2 0 0

0 2 0

0 0 1 3

1

2

1

2

1

2

1

2

=

=

⋅⋅⋅/

/

, 00 0 0 0

0 0 0 0

1 1 0

0 23

⋅⋅⋅

⋅⋅⋅

=

,

M cos 00 20

0 20 20

4

−

=

sin

sin cos

,

cos(

M

−− −

− − −

45 0 45

0 1 0

45 0 45

) sin( )

sin( ) cos( )

=

−

,

cos sin

sin cos

and

M5

90 90 0

90 90 0

00 0 1

.

1 0 0

0 1 0

0 0 1−

.


5. (a) As in 4(a),

(b) As in 4(b), P′ = M7(M6 + M5M4(M3 + M2 M1P)).

6. Using the hint given, we have

R R1 2

0

0 1 0

0

=−

=cos sin

sin cos

,

cosβ β

β β

α −−

=

sin

sin cos ,

cos sin

αα α

θ θ

0

0

0 0 1

0

03R 11 0

0

0

4

−

=− − −

sin cos

,

cos( ) sin( )

θ θ

α αR ssin( ) cos( ) ,− −

α α 0

0 0 1

M M1 2

3 0 0

0 5 0

0 0 1

1 0 0

0 45 4=

= −.

. , cos sin 55

0 45 45

1 1 1 1

0 0 03

sin cos

,

=⋅⋅⋅⋅⋅⋅M 00

0 0 0 0

35 0 35

0 1 04

⋅⋅⋅

=

−

,

cos sin

sin

M

335 0 45

45 45

5

cos

,

cos( ) sin(

=

− − −

M

))

sin( ) cos( ) ,

0

45 45 0

0 0 1

0 0

6

− −

=

M

00 0

0 0 0 0

1 1 1 1

⋅⋅⋅⋅⋅⋅⋅⋅⋅

=, and

2 0 0

0 1 07M

00 0 1

.



7. (a) We rewrite the formula for vi′ as

So

(b) We want to translate xi

by –5, yi

by +9, zi

by –3, so x0 = –5, y0 = +9, z0 = –3. The matrix is

8. This can be done most easily performing the multiplication RRt and showing that this is I.For example, for the rotation matrix about the x-axis we obtain

RRt = −

1 0 0

0

0

1 0 0

0cos sin

sin cos

cθ θθ θ

oos sin

sin cos

θ θθ θ0

1 0 0

0 1 0

0 0 1−

=

.

1 0 0 5

0 1 0 9

0 0 1 3

0 0 0 1

−

−

.

v

x

y

z

x

y

zi

i

i

i

′ =

1 0 0

0 1 0

0 0 1

0 0 0 1

0

0

0

11

.

v

x x

y y

z zi

i

i

i

′ =

⋅ + ⋅⋅ + ⋅⋅ + ⋅

⋅

1 1

1 1

1 1

1 1

0

0

0

.

and R5

0

0 1 0

0

=− −

− − −

cos( ) sin( )

sin( ) cos( )

β β

β β

.

EXERCISE SET 11.12

1. (a) The discrete mean value property yields the four equations

Translated into matrix notation, this becomes

t

t

t

t

1

2

3

4

01

4

1

40

1

40 0

=

11

41

40 0

1

4

01

4

1

40

1

2

t

t

tt

t

3

4

0

1

2

0

1

2

+

.

t t t

t t t

t t t

t

1 2 3

2 1 4

3 1 4

4

1

4

1

41 1

1

4

= +

= + + +

= +

=

( )

( )

( )

11

41 12 3( ).t t+ + +

657

1. (b) To solve the system in part (a), we solve (I – M)t = b for t:

In row-echelon form, this is

Back substitution yields the result t = [1/4 3/4 1/4 3/4]t.

(c)

t(2) = Mt(1) + b = [1/8 5/8 1/8 5/8]t

t(3) = Mt(2) + b = [3/16 11/16 3/16 11/16]t

tt tt bb( )1

01

4

1

40

1

40 0

1

41

40 0

1

4

01

4

1

40

= + =

M(0)

0

0

0

0

+

=

0

1

2

0

1

2

0

1

2

0

11

2

11

4

1

40

0 1 15 4

0 0 11

2

0 0 0 1

− −

−

−

=

t

t

t

t

1

2

3

4

00

0

1 8

3 4

−

.

01

4

1

40

1

41 0

1

41

40 1

1

4

01

4

1

41

− −

− −

− −

− −

t

t

t

t

1

2

3

4

=

.

0

1

2

0

1

2


t(4) = Mt(3) + b = [7/32 23/32 7/32 23/32]t

t(5) = Mt(4) + b = [15/64 47/64 15/64 47/64]t

from Eq. (10).

(d) Using percentage error we have that the

percentage error for t1 and t3 was and for t2 and t4 was

2. The average value of the temperature on the circle is

where r is the radius of the circle and f(θ) is the temperature at the point of the circumfer-ence where the radius to that point makes the angle θ with the horizontal. Clearly f(θ) = 1for –π/2 < θ < π/2 and is zero otherwise. Consequently, the value of the integral above(which equals the temperature at the center of the circle) is 1/2.

3. As in 1(c), but using M and b as in the problem statement, we obtain

t(1) = Mt(0) + b = [3/4 5/4 1/2 5/4 1 1/2 5/4 1 3/4]t

t(2) = Mt(1) + b = [13/16 9/8 9/16 11/8 13/16 7/16 21/16 15/8]t.

4. Using the telephone number 555-1212, we start in row 1, column 2. The ten boundaryvalues we obtain are 0, 0, 2, 2, 0, 0, 0, 2, 1, and 1. The last arrow used was in row 4,column 9, and the average value obtained was .8.

1

2πθ θ

π

π

rf rd( ) ,

−∫

−× = −

.

.% . %.

0129

7629100 1 7

.

.% . %,

0129

2371100 5 4× =

=−

×computed value actual value

actual value1100%


EXERCISE SET 11.13

1. (a) Using the lines in Eq. (5) we have

a1ta1 = 2 a2

ta2 = 5 a3ta3 = 10

b1 = 2 b2 = –2 b3 = 3

Setting x(p)k

= we have

Writing xx xxaa xx

aa aaaak

pkp k k

tkp

kt

k

k

b( ) ( )( )

= +−

−

−1

1

aa xx

aa xx

1 0 01 02

2 1 11 122

t p p p

t p p

x x

x x

( ) ( ) ( )

( ) ( ) (

= +

= − pp

t p p p

t p

t

x x

b

)

( ) ( ) ( )

( )

aa xx

aa xx

aa aa

3 2 21 22

1 1 0

1 1

3= −

−=

22

2

2

01 02

2 2 1

2 2

11

− −

−=

− −

x x

b x

p p

t p

t

p

( ) ( )

( ) ( )aa xx

aa aa

−−

−=

− −

x

b

a a

x x

p

t p

t

p p

12

3 3 2

3 3

21 22

5

3 3

1

( )

( ) ( ) ( )aa xx

00.

x

x

kp

kp

1

2

( )

( )

aa aa aa1 2 31

1

1

2

3

1=

=

−

=

−

661

in the form:

we have

Substituting for the expressions

for k = 2:

for k = 3:

where , which means

x x x xp p p p

011

021

31 32( ) ( ) ( ) ( ), , .+ +( ) = ( )

x xp p

01

3( ) ( )+ =

x xx xp p

p p

31 2121 223 3

103

1

109( ) ( )

( ) ( )

= +− +

= +• xx x

x xx

p p

p pp

21 22

32 2221

3

3 3

( ) ( )

( ) ( )( )

+

= +− + xx

x x

pp p22

21 22101

1

103 3 9

( )( ) ( )( )• − = − + +

x xx x

xp p

p pp

12 0201 02

012

21

1

22( ) ( )

( ) ( )(= +

− −= +•

)) ( )

( ) ( )( ) (

−

= +− − +

x

x xx x

p

p pp p

02

21 1111 122 2 ))

( ) ( )

( ) (

51

1

52 4 211 12

22 12

• = − + +

=

x x

x x

p p

p p))( ) ( )

( )( )+− − +

− = + +•2 2

52

1

54 2 211 12

11 1x x

x x

p pp

22( )p

x xx x

xp p

p p

11 0101 02

012

21

1

22( ) ( )

( ) ( )

= +− −

= +•(( ) ( )p p

x−

02

bkk k

tkp

kt

k

−

=−aa xx

aa aa1 1

( )

gives for :

x xb

akp

kp k k

tkp

kt

k

k1 1 11( )

,( )

( )

= +−

−

−aa xx

aa aa11

2 1 21x x

bk

pk

p k kt

kp

kt

k

( ),

( )( )

= +−

−

−aa xx

aa aaaak2.

x

x

x

x

kp

kp

kp

kp

1

2

1 1

1 2

( )

( )

,( )

,( )

=

−

−

+−

−b

a

a

k kt

kp

kt

k

k

k

aa xx

aa aa1

1

2

( )


1. (b) From part (a) we have

Subsituting the expressions for from part (a) gives

Subsituting the expressions for from part (a) gives

From part (a) we have

Subsituting for from part (a) gives

Subsituting for from part (a) gives

x x x xp p p p

32 01 02 013

101

2

22

1

22( ) ( ) ( ) ( )= + + −( ) + − ++( )

= + −

x

x x

p

p p

02

01 021

2024 3 3

( )

( ) ( ) .

x xp p

11 12( ) ( )and

x x x xp p p

32 11 121

103

3

52 4 2

9

54 2( ) ( ) ( )= − + − + +( ) + + 111 12

11 123

101 2

( ) ( )

( ) ( )

p p

p p

x

x x

+( )

= + +

.

x xp p

21 22( ) ( )and

x x xp p p

32 21 221

103 3 9( ) ( ) ( ) .= − + +

x x x xp p p

31 01 02 011

5055

10

22

5

22( ) ( ) ( ) (= + + −( ) + − pp p

p p

x

x x

) ( )

( ) ( )

+( )

= + −

02

01 021

2028 .

x xp p

11 12( ) ( )and

x x x xp p p

31 11 12 11

109

1

52 4 2

3

54 2( ) ( ) ( )= + − + +( ) + + 11 12

11 121

5055 10 5

( ) ( )

( ) (

p p

p p

x

x x

+( )

= + + )) .

x xp p

21 22( ) ( )and

x x xp p p

31 21 221

109 3( ) ( ) ( ) .= + +


Now

and so

1. (c) The linear system

can be rewritten as

which has the solution

2. (a) Setting and using part (b) of Exercise 1,

we have

x

x

311

321

1

2028 1 40000

1

2024 1

( )

( )

.

.

= =

= = 220000

1

2028 1 4 1 2 1 4100031

2

322

x

x

( )

(

. . .= + − =

)) ( . ) ( . ) .= + − =1

2024 3 1 4 3 1 2 1 23000

x01

011

021

310

320 0( ) ( ) ( ) ( ) ( ), , ( ,= ( ) = ( ) =x x x x ),0

x

x

31

32

31 22

27 22

*

* .

=

=

19 28

3 23 24

31 32

31 32

x x

x x

* *

* * ,

+ =

− + =

x x x

x x x

31 31 32

32 31

1

2028

1

2024 3 3

* = + −

= + −

* *

* *332*

x x x

x

p p p

p

31 311

321

32

1

2028

1

2

( ) ( ) ( )

( )

= + −

=

− −

0024 3 331

102

1+ −

− −x x

p p( ) ( ) .

orx x x x xp p p p p

0 31

01 02 311( ) ( ) ( ) ( ) (,= ( ) =− − )) ( ), ,x

p32

1−( )


(b)

Since x3(1) in this part is the same as x3

(1) in part (a), we will get x3(2) as in part (a)

and therefore x3(3), … , x3

(6) will also be the same as in part (a).

(c)

x01

310

320

311

148 15

1

202

( ) ( ) ( )

( )

( , ) ,= − = ( )=

x x

x 88 148 15 9 55000

1

2024 3 14832

1

+ − − =

= +

( ) .

( )( )x −− − =3 15 25 65000( ) .

x01

310

320

311

1 1

1

2028 1 1

( ) ( ) ( )

( )

( , ) ,= = ( )= + −

x x

x =

= + − =

1 4

1

2024 3 1 3 1 1 232

2

.

( ) ( ) .( )x

x

x

313

323

1

2028 1 41 1 23 1 40900

1

2

( )

( )

. . .= + − =

=00

24 3 1 41 3 1 23 1 22700

1

20231

4

+ − =

=

( . ) ( . ) .

( )x 88 1 409 1 227 1 40910

1

2024 3 132

4

+ − =

= +

. . .

( .( )x 4409 3 1 227 1 22730

1

2028 1 4031

5

) ( . ) .

.( )

− =

= +x 991 1 2273 1 40909

1

2024 3 1 409132

5

− =

= +

. .

( .( )x )) ( . ) .

.( )

− =

= +

3 1 2273 1 22727

1

2028 1 409031

6x 99 1 22727 1 40909

1

2024 3 1 409032

6

− =

= +

. .

( .( )x 99 3 1 22727 1 22727) ( . ) . .− =



3. (a) The three lines are

L1 x1 + x2 = 2

L2 x1 – 2x2 = –2

L3 3x1 – x2 = 3

x L

x

1 1

2

12

11

10

11

22

112*

*

lies on since

lie

+ = =

ss on sinceL

x

2

3

46

552

78

55

110

552−

=−

= −

* lies on sinceL3 331

22

27

22

66

223

− = = .

= + −1

2028 9 55 25 631

2 . .( )x 55 0 59500

1

2024 3 9 55 3 25 6532

2

=

= + −

.

( . ) ( . )( )x = −

= + + =

1 21500

1

2028 0 595 1 21531

3

.

. .( )x 11 49050

1

2024 3 0 595 3 1 21532

3

.

( . ) ( . )( )x = + + = 11 47150

1

2028 1 4905 1 4715 1 4031

4

.

. . .( )x = + − = 0095

1

2024 3 1 4905 3 1 4715 132

4x

( ) ( . ) ( . ) .= + − = 220285

1

2028 1 40095 1 20285 1 4031

5x

( ) . . .= + − = 9991

1

2024 3 1 40095 3 1 2028532

5x

( ) ( . ) ( . )= + − = 11 22972

1

2028 1 40991 1 22972 131

6

.

. . .( )x = + − = 440901

1

2024 3 1 40991 3 1 2297232

6x

( ) ( . ) ( . )= + − = 1 22703.


The slope of line L1 is –1.

The slope of the line

Since (–1) • 1 = –1, it follows——x1*x2* is perpendicular to L1.

The slope of line L2 is 12.

The slope of the line .

Since 12 • (–2) = –1, it follows

——x1*x2* is perpendicular to L2.

The slope of line L3 is 3.

The slope of the line

Since , it follows is perpendicular to L3.

(b) Substituting in the equations for x1(1)from part (a) of Exercise 1

gives

Then gives

x211 1

52 4

12

112

10

11( ) = − +

+

• • =

= +

+

=•

46

55

1

54 2

12

11

10

11

7221

x( ) 88

55

xx11 12

11

10

11( ) ,=

x

x

111

12

1

22

31

22

27

22

48

44

12

11( )

(

= + −

= =

11 1

22

31

22

27

22

40

44

10

11)

.

= − +

= =

xx01 31

22

27

22( ) ,=

x x2 3* *3

1

31• −

= −

x x2 327 22 78 55

31 22 46 55

21

63

1

3* * is

−−

=−

=−

..

isx x1 278 55 10 11

46 55 12 11

28

142* * −

−=

−= −

x x1 327 22 10 11

31 22 12 11

7

71* * .is

−−

= =


and gives

or

4. Referring to the figure below and starting with x0(1) = (0, 0)

x0(1) is projected to x1

(1) on L1,


(1) on L2,


(1) on L3, and so on.

As seen from the graph the points of the limit cycle are

x 1* = A, x2* = B, x3* = A.

Since x1* is the point of intersection of L1 and L3 it follows on solving the system

x12* = 1

x11* – x12* = 0

L1 x1 x1

x2

x0

x3

A

B

x2

x2 = 1

x1 – x2 = 0 x1 – x2 = 2

x1

L3

L2(1)

(1)

(1)

(1)

(2)

xx31 31

22

27

22( ) , .=

x311 1

109

46

553

78

55

775

550( ) = + + ⋅

= =

331

22

1

103 3

46

559

78

55321

x( ) = − + ⋅

+ ⋅

= =

675

550

27

22

xx21 46

55

78

55( ) ,=


that x1* = (1, 1). Since x2* = (x21*, x22*) is on L2, it follows that x21* – x22* = 2. Now is perpendicular to L2; therefore

so we have x22* – 1 = 1 – x21* or x21* + x22* = 2.

Solving the system

x21* – x22* = 2

x21* + x22* = 2

gives x21* = 2 and x22* = 0. Thus the points on the limit cycle are

x1* = (1, 1), x2* = (2, 0), x3* = (1, 1).

5. (a) From Theorem 1:

and so

= atx* + b – atx* = b.

Therefore xp

lies on the line atx = b.

(b) Let z be a point on the line L, so that atz = b. We will prove that xp

– x* isperpendicular to x

p– z by showing that (x

p– z)t (x

p– x* = 0 (i.e., their dot product

is 0). Now

xxp p p

b

b

−( ) − = −−

=

zz xx xx xx zzaa xx

aa aaaa

t t)( ) (**t

t

−−

−

aa xx

aa aaxx zz aa

tt)

*

t( p

aa xx aa xxaa xx

aa aaaa aat

pt

t

t

tb= +

−

**

xx xxaa xx

aa aaaap

t

t

b= +

−

**

x

x

22

21

1

11 1

*

*( )

−

−

= −

x x1 2* *


Since xp

and z are on L, we have

atxp

= b

atz = b ,

and so at (xp

– z) = 0 or (xp

– z)t a = 0.

Thus (xp

– z)t (xp

– x*) = 0, and so xp

– x* is perpendicular to L.

6. Suppose that the center of some pixel, say the K-th pixel, is not crossed by any of theM beams in the scan. Then we have a

iK= 0 for i = 1, 2, … , M. That is, the K-th component

of each of the N-dimensional vectors a1, a2, … , aM

is zero. But then they cannot spanN since, in particular, no linear combination of them could equal e

K, where e

Kis the

N-dimensional vector whose K-th component is one and whose other components are allzero.

7. Let us choose units so that each pixel is one unit wide. Then

aij

= length of the center line of the i-th beam that lies in the j-th pixel

If the i-th beam crosses the j-th pixel squarely, it follows that aij

= 1. From Fig. 11 in thetext, it is then clear that

a17 = a18 = a19 = 1

a24 = a25 = a26 = 1

a31 = a32 = a33 = 1

a73 = a76 = a79 = 1

a82 = a85 = a88 = 1

a91 = a94 = a97 = 1

since beams 1, 2, 3, 7, 8, and 9 cross the pixels squarely.

Next, the centerlines of beams 5 and 11 lie along the diagonals of pixels 3, 5, 7 and 1, 5, 9,respectively. Since these diagonals have length , we have

a53 = a55 = a57 = = 1.41421

a11,1 = a11,5 = a11,9 = = 1.41421.2

2

2


In the following diagram, the hypotenuse of triangle A is the portion of the centerline of the 10th beam that lies in the 2nd pixel. The length of this hypotenuse is twice the height of triangle A, which in turn is – 1. Thus,

a10,2 = 2( – 1) = .82843.

By symmetry we also have

a10,2 = a10,6 = a12,4 = a12,8

= a62 = a64 = a46 = a48 = .82843.

Also from the diagram, we see that the hypotenuse of triangle B is the portion of thecenterline of the 10th beam that lies in the 3rd pixel. Thus,

a10,3 = 2 – = .58579.

By symmetry we have

a10,3 = a12,7 = a61 = a49 = .58579.

1− 1−2

1−2

1

1

B

A

1

2

22

2

2 − 1

2( )2 − 1

2− 2

2

2

2


The remaining aij’s are all zero, and so the 12 beam equations (4) are

x7 + x8 + x9 = 13.00

x4 + x5 + x6 = 15.00

x1 + x2 + x3 = 8.00

.82843 (x6 + x8) + .58579x9 = 14.79

1.41421 (x3 + x5 + x7) = 14.31

.82843 (x2 + x4) + .58579 x1 = 3.81

x3 + x6 + x9 = 18.00

x2 + x5 + x8 = 12.00

x1 + x4 + x7 = 6.00

.82843 (x2 + x6) + .58579x3 = 10.51

1.41421 (x1 + x5 + x9) = 16.13

.82843 (x4 + x8) + .58579x7 = 7.04

8. Let us choose units so that each pixel is one unit wide. Then

aij

= area of the i-th beam that lies in the j-th pixel

Since the width of each beam is also one unit it follows that aij

= 1 if the i-th beam crossesthe j-th pixel squarely. From Fig. 11 in the text, it is then clear that

a17 = a18 = a19 = 1

a24 = a25 = a26 = 1

a31 = a32 = a33 = 1

a73 = a76 = a79 = 1

a82 = a85 = a88 = 1

a91 = a94 = a97 = 1

since beams 1, 2, 3, 7, 8, and 9 cross the pixels squarely.


For the remaining aij’s, first observe

from the figure on the right that anisosceles right triangle of height h, asindicated, has area h2. From the dia-gram of the nine pixels, we then have

Area of triangle B ( ) ( )= −

= − =1

22 1

1

43 2 2

2

..

.

04289

1

2

1

4250

2

Area of triangle C =

= = 000

3

22 1

9

43 2

2

Area of triangle F ( ) (= −

= − 22 38604) .=

1−2

3−2 3−

2

1−2

3−2

1

1

A

B

B

D

C E

F

2

2

( )2 − 1

( )2 − 1

Area = h2 h


We also have

Referring back to Fig. 11, we see that

a11,1 = Area of polygon A = .91421

a10,1 = Area of triangle B = .04289

a11,2 = Area of triangle C = .25000

a10,2 = Area of polygon D = .75000

a10,3 = Area of polygon E = .61396

By symmetry we then have

a11,1 = a11,5 = a11,9 = a53 = a55 = a57 = .91421

a10,1 = a10,5 = a10,9 = a12,1 = a12,5 = a12,9

= a63 = a65 = a67 = a43 = a45 = a47 = .04289

a11,2 = a11,4 = a11,6 = a11,8

= a52 = a54 = a56 = a58 = .25000

a10,2 = a10,6 = a12,4 = a12,8

= a62 = a64 = a46 = a48 = .75000

a10,3 = a12,7 = a61 = a49 = .61396.

The remaining aij’s are all zero, and so the 12 beam equations (4) are

x7 + x8 + x9 = 13.00

x4 + x5 + x6 = 15.00

x1 + x2 + x3 = 18.00

0.04289 (x3 + x5 + x7) + 0.75 (x6 + x8) + 0.61396x9 = 14.79

Area of polygon Area of triangle(A = − ×1 2

Area of polygon Area o

B

D

)

.

(

= − =

= −

21

291421

1 ff triangle C

Area of polygon

)

.= − = =11

4

3

47500

( )

.

E = −

= −( ) =

1

1

418 2 23 6139

Area of triangle F

66

0.91421 (x3 + x5 + x7) + 0.25 (x2 + x4 + x6 + x8) = 14.31

0.04289 (x3 + x5 + x7) + 0.75 (x2 + x4) + 0.61396x1 = 3.81

x3 + x6 + x9 = 18.00

x2 + x5 + x8 = 12.00

x1 + x4 + x7 = 6.00

0.04289 (x1 + x5 + x7) + 0.75 (x2 + x6) + 0.61396x3 = 10.51

0.91421 (x1 + x5 + x7) + 0.25 (x2 + x4 + x6 + x8) = 16.13

0.04289 (x1 + x5 + x7) + 0.75 (x4 + x8) + 0.61396x7 = 7.04


EXERCISE SET 11.14

1. Each of the subsets S1, S2, S3, S4 in the figure is congruent to the entire set scaled by afactor of 12/25. Also, the rotation angles for the four subsets are all 0°. The displacementdistances can be determined from the figure to find the four similitudes that map the entireset onto the four subsets S1, S2, S3, S4. These are, respectively,

where the four values of

Because s = 12/25 and k = 4 in the definition of a self-similar set, the Hausdorff dimensionof the set is d

H(S) = ln(k)/ln(1/s) = ln(4)/ln(25/12) = 1.889 … . The set is a fractal because

its Hausdorff dimension is not an integer.

e

f

i

i

are

0

0

13 25

0

0, ,

113 25

13 25

13 25

, and .

Tx

y

x

yi

=

12

25

1 0

0 1

+

=,

e

fi

i

i

1, 2, 3, 4,

y

x

S3

S1 S2

S4

1

1

1225 1

25

677

2. The rough measurements indicated in the figure give an approximate scale factor of s ≈(15/16)/(2) = .47 to two decimal places. Since k = 4, the Hausdorff dimension of the set isapproximately d

H(S) = ln(k)/ln(1/s) ≈ ln(4)/ln(1/.47) = 1.8 to two significant digits.

Examination of the figure reveals rotation angles of 180°, 180°, 0°, and –90° for the sets S1,S2, S3, and S4, respectively.

3. (a) The figure shows the original self-similar set and a decomposition of the set into sevennonoverlapping congruent subsets, each of which is congruent to the original setscaled by a factor s = 1/3. By inspection, the rotations angles are 0° for all sevensubsets. The Hausdorff dimension of the set is d

H(S) = ln(k)/ln(1/s) = ln(7)/ln(3) =

1.771 … . Because its Hausdorff dimension is not an integer, the set is a fractal.

2"

15 "16

s3

s1 s2

s4


(b) The figure shows the original self-similar set and a decomposition of the set into threenonoverlapping congruent subsets, each of which is congruent to the original setscaled by a factor s = 1/2. By inspection, the rotation angles are 180° for all threesubsets. The Hausdorff dimension of the set is d

H(S) = ln(k)/ln(1/s) = ln(3)/ln(2) =

1.584 … . Because its Hausdorff dimension is not an integer, the set is a fractal.

3. (c) The figure shows the original self-similar set and a decomposition of the set into threenonoverlapping congruent subsets, each of which is congruent to the original setscaled by a factor s = 1/2. By inspection, the rotation angles are 180°, 180°, and –90° for S1, S2, and S3, respectively. The Hausdorff dimension of the set is d

H(s) = ln(k)/ln(1/s) = ln(3)/ln(2) = 1.584 … . Because its Hausdorff dimension is not

an integer, the set is a fractal.

(d) The figure shows the original self-similar set and a decomposition of the set into threenonoverlapping congruent subsets, each of which is congruent to the original setscaled by a factor s = 1/2. By inspection, the rotation angles are 180°, 180°, and –90°for S1, S2, and S3, respectively. The Hausdorff dimension of the set is d

H(S) =

ln(k)/ln(1/s) = ln(3)/ln(2) = 1.584 … . Because its Hausdorff dimension is not aninteger, the set is a fractal.

S3

S1 S2


4. The matrix of the affine transformation in question is The matrix

portion of a similitude is of the form Consequently, we must have

. Solving this pair of equations gives

5. Letting be the vector to the tip of the fern and using the hint, we have

Solving this matrix equation gives

rounded to three decimal places.

6. As the figure indicates, the unit square can be expressed as the union of 16 nonoverlappingcongruent squares, each of side length 1/4. Consequently, the Hausdorff dimension of theunit square as given by Equation (2) of the text is d

H(S) = ln(k)/ln(1/s) = ln(16)/ln(4) = 2.

x

y

=

−

−.15 .

. .

.

.

04

04 15

075

180

1

=

.

.

766

996

Tx

y

x

y

=or2

.

. .

.

.

.85 04

04 85

075

18−

+

x

y 00

.

x

y

=x

y

= −−. ... tan ( /8509 041and θ .. ) . ... .85 2 69= − °

s = ( ) + −( ) =. .85 042 2

s sandcos . sin .θ θ= = −85 04

ssin

sin cos.

cosθ θθ θ

−

.85 .

. ..

04

04 85−

S3S2

S1


7. The similitude T1 maps the unit square (whose vertices are (0, 0), (1, 0), (1, 1), and (0, 1))onto the square whose vertices are (0, 0), (3/4, 0), (3/4, 3/4), and (0, 3/4). The similitudeT2 maps the unit square onto the square whose vertices are (1/4, 0), (1, 0), (1, 3/4), and(1/4, 3/4). The similitude T3 maps the unit square onto the square whose vertices are (0, 1/4), (3/4, 1/4), (3/4, 1), and (0, 1). Finally, the similitude T4 maps the unit square ontothe square whose vertices are (1/4, 1/4), (1, 1/4), (1, 1), and (1/4, 1). The union of thesefour smaller squares is the unit square, but the four smaller squares overlap. Each of thefour smaller squares has side length of 3/4, so that the common scale factor of thesimilitudes is s = 3/4. The right-hand side of Equation (2) of the text gives ln(k)/ln(1/s) =ln(4)/ln(4/3) = 4.818 …. This is not the correct Hausdorff dimension of the square (whichis 2) because the four smaller squares overlap.

8. Because s = 1/2 and k = 8, Equation (2) of the text gives dH(S)= ln(k)/ln(1/s) =

ln(8)/ln(2) = 3 for the Hausdorff dimension of a unit cube. Because the Hausdorffdimension of the cube is the same as its topological dimension, the cube is not a fractal.

9. A careful examination of Figure 27 of the text shows that the Menger sponge can beexpressed as the union of 20 smaller nonoverlapping congruent Menger sponges each ofside length 1/3. Consequently, k = 20 and s = 1/3, and so the Hausdorff dimension of theMenger sponge is d

H(S)= ln(k)/ln(1/s) = ln(20)/ln(3) = 2.726 … . Because its Hausdorff

dimension is not an integer, the Menger sponge is a fractal.

10. The figure shows the first four iterates as determined by Algorithm 1 and starting with theunit square as the initial set. Because k = 2 and s = 1/3, the Hausdorff dimension of theCantor set is d

H(S)= ln(k)/ln(1/s) = ln(2)/ln(3) = 0.6309 … . Notice that the Cantor set is

a subset of the unit interval along the x-axis and that its topological dimension must be 0(since the topological dimension of any set is a nonnegative integer less than or equal to itsHausdorff dimension).

141


11. The area of the unit square S0 is, of course, 1. Each of the eight similitudes T1, T2, …, T8given in Equation 8 of the text has scale factor s = 1/3, and so each maps the unit squareonto a smaller square of area 1/9. Because these eight smaller squares are nonoverlapping,their total area is 8/9, which is then the area of the set S1. By a similar argument, the areaof the set S2 is 8/9-th the area of the set S1. Continuing the argument further, we find thatthe areas of S0, S1, S2, S3, S4, …, form the geometric sequence 1, 8/9, (8/9)2, (8/9)3,(8/9)4, …. (Notice that this implies that the area of the Sierpinski carpet is 0, since thelimit of (8/9)n as n tends to infinity is 0.)

Initial set

First Iterate

Second Iterate

Third Iterate

Fourth Iterate


EXERCISE SET 11.15

1. Because 250 = 2 • 53 it follows from (i) that

Because 25 = 52 it follows from (ii) that


Because 30 = 6 • 5 it follows from (ii) that

Because 10 = 2 • 5 it follows from (i) that

Because 50 = 2 • 52 it follows from (i) that




2. The point (0, 0) is obviously a 1-cycle. We now choose another of the 36 points of the form(m/6, n/6), say (0, 1/6). Its iterates produce the 12-cycle

So far we have accounted for 13 of the 36 points. Taking one of the remaining points,

say (0, 2/6), we arrive at the 4-cycle We

continue in this way, each time starting with some point of the form (m/6, n/6) that has not

0

2 6

2 6

4 6

0

4 6

→

→

→44 6

2 6

.

0

1 6

1 6

2 6

3 6

5 6

→

→

→2 6

1 6

3 6

4 6

1 6

5 6

→

→

→

→

→

0

5 6

5 6

4 6

3 6

1 6

→

→

→

4 6

5 6

3 6

6 6

5 6

1 6

.

( ) .5 2 5 10= =•∏

( ) .6 2 6 12= =•∏

( ) .3750 2 3750 7500= =•∏

( ) .50 3 50 150= =•∏

( ) .10 3 10 30= =•∏

( ) .30 2 30 60= =•∏

( ) .125 2 125 250= =•∏

( ) .25 2 25 50= =∏ •

( ) .250 3 250 750= =∏ •

683

yet appeared in a cycle, until we exhaust all such points. This yields a 3-cycle:

The possible periods of points for the form (m/6, n/6) are thus 1, 3, 4 and 12. The leastcommon multiple of these four numbers is 12, and so

3. (a) We are given that x0 = 3 and x1 = 7. With p = 15 we have

x3 = x2 + x1 mod 15 = 7 + 3 mod 15 = 10 mod 15 = 10,

x4 = x3 + x2 mod 15 = 10 + 7 mod 15 = 17 mod 15 = 2,

x5 = x4 + x3 mod 15 = 2 + 10 mod 15 = 12 mod 15 = 12,

x6 = x5 + x4 mod 15 = 12 + 2 mod 15 = 14 mod 15 = 14,

x7 = x6 + x5 mod 15 = 14 + 12 mod 15 = 26 mod 15 = 11,

x8 = x7 + x6 mod 15 = 11 + 14 mod 15 = 25 mod 15 = 10,

x9 = x8 + x7 mod 15 = 10 + 11 mod 15 = 21 mod 15 = 6,

x10 = x9 + x8 mod 15 = 6 + 10 mod 15 = 16 mod 15 = 1,

x11 = x10 + x9 mod 15 = 1 + 6 mod 15 = 7 mod 15 = 7,

x12 = x11 + x10 mod 15 = 7 + 1 mod 15 = 8 mod 15 = 8,

x13 = x12 + x11 mod 15 = 8 + 7 mod 15 = 15 mod 15 = 0,

x14 = x13 + x12 mod 15 = 0 + 8 mod 15 = 8 mod 15 = 8,

x15 = x14 + x13 mod 15 = 8 + 0 mod 15 = 8 mod 15 = 8,

x16 = x15 + x14 mod 15 = 8 + 8 mod 15 = 16 mod 15 = 1,

x17 = x16 + x15 mod 15 = 1 + 8 mod 15 = 9 mod 15 = 9,

( ) .6 12=∏

3 6

0

3 6

3 6

0

3 6

→

→

→

; another 4-cycle:

4 6

0

4 6

4 6

→

→

;

2 6

0

2 6

2 6

and anoother 12-cycle:1 6

0

1 6

1 6

→

→

→

2 6

3 6

5 6

2 6

→

→

→

1 6

3 6

4 6

1 6

5 6

0

5 6

5 6

4 6

3 6

→

→

→

→

1 6

4 6

55 6

3 6

→

.2 6

5 6


x18 = x17 + x16 mod 15 = 9 + 1 mod 15 = 10 mod 15 = 10,

x19 = x18 + x17 mod 15 = 10 + 9 mod 15 = 19 mod 15 = 4,

x20 = x19 + x18 mod 15 = 4 + 10 mod 15 = 14 mod 15 = 14,

x21 = x20 + x19 mod 15 = 14 + 4 mod 15 = 18 mod 15 = 3,

x22 = x21 + x20 mod 15 = 3 + 14 mod 15 = 17 mod 15 = 2,

x23 = x22 + x21 mod 15 = 2 + 3 mod 15 = 5 mod 15 = 5,

x24 = x23 + x22 mod 15 = 5 + 2 mod 15 = 7 mod 15 = 7,

x25 = x24 + x23 mod 15 = 7 + 5 mod 15 = 12 mod 15 = 12,

x26 = x25 + x24 mod 15 = 12 + 7 mod 15 = 19 mod 15 = 4,

x27 = x26 + x25 mod 15 = 4 + 12 mod 15 = 16 mod 15 = 1,

x28 = x27 + x26 mod 15 = 1 + 4 mod 15 = 5 mod 15 = 5,

x29 = x28 + x27 mod 15 = 5 + 1 mod 15 = 6 mod 15 = 6,

x30 = x29 + x28 mod 15 = 6 + 5 mod 15 = 11 mod 15 = 11,

x31 = x30 + x29 mod 15 = 11 + 6 mod 15 = 17 mod 15 = 2,

x32 = x31 + x30 mod 15 = 2 + 11 mod 15 = 13 mod 15 = 13,

x33 = x32 + x31 mod 15 = 13 + 2 mod 15 = 15 mod 15 = 0,

x34 = x33 + x32 mod 15 = 0 + 13 mod 15 = 13 mod 15 = 13,

x35 = x34 + x33 mod 15 = 13 + 0 mod 15 = 13 mod 15 = 13,

x36 = x35 + x34 mod 15 = 13 + 13 mod 15 = 26 mod 15 = 11,

x37 = x36 + x35 mod 15 = 11 + 13 mod 15 = 24 mod 15 = 9,

x38 = x37 + x36 mod 15 = 9 + 11 mod 15 = 20 mod 15 = 5,

x39 = x38 + x37 mod 15 = 5 + 9 mod 15 = 14 mod 15 = 14,

x40 = x39 + x38 mod 15 = 14 + 5 mod 15 = 19 mod 15 = 4,

x41 = x40 + x39 mod 15 = 4 + 14 mod 15 = 18 mod 15 = 3,

x42 = x41 + x40 mod 15 = 3 + 4 mod 15 = 7 mod 15 = 7,

and finally: x41 = x0 and x42 = x1. Thus this sequence is periodic with period 41.

(b) Step (ii) of the algorithm is

xn+1 = x

n+ x

n–1 mod p. (A)


Replacing n in this formula by n + 1 gives

xn + 2 = xn + 1

+xn

mod p = (xn+ xn–1) + x

nmod p = 2x

n+ x

n–1 mod p. (B)

Equations (A) and (B) can be written as

xn+1 = x

n–1 + xn

mod p

xn+2 = x

n–1 + 2xn

mod p

which in matrix form are

3. (c) Beginning with we obtain

x

x

2

3

1 1

1 2

5

5

=

modd mod2110

1521

10

15

4

5

=

=

x

x

=

mod1 1

1 2

10

15221

25

4021

4

19

6

7

=

mod =

x

x

=

=mod1 1

1 2

4

1921

233

4221

2

0

8

9

mod =

x

x==

=

mod

1 1

1 2

2

021

2

2

=

mod 212

2

1 1

1 2

10

11

=

x

x

=

mod mo2

221

4

6dd 21

4

6=

x

x

0

1

5

5

=

,

x

x

x

x

n

n

n

n

+

+

−

=

1

2

11 1

1 2

mod .p


and we see that

4. (a) We have that

Γ25101

101

7 778m

n

=

, ,, , , , ,

, , , ,

742 049 12 586 269 025

12 586 269 025 20 365,, ,011 074

101

101

m

n

=

+

mod

, , , , , ,

1

7 778 742 049

101

12 586 269 025

101m nn

m n12 586 269 025

101

20 365 011 074

101

, , , , , ,+

=

+

+

+ +

mod1

1

101

1

101

r m sn

sm t

n

m n r swhere , , , ,mod (1 and are integers)t

rm snm

sm tn

=

+( ) +

+( ) +

101

nn

m

n

101

1

101

101

=

mod

+ +(because and are integersrm sn sm tn )).

x

x

x

x

16

17

0

1

=

.

1 1

1 2

12

13

=

x

x

=

mod mod4

621

10

1621 ==

10

16

1 1

1 2

14

15

=

x

xmod mod

10

1621

26

4221

=

==5

0

1 1

1 2

16

17

=

x

x

55

021

5

521

5

5

=

mod mod =

.


4. (b) From (a) we have that every point of the form (m/101, n/101) returns to its startingposition after 25 iterations, and so every such point is a periodic point with a periodthat must divide 25. The only integers that divide 25 are 1, 5, and 25, and so everysuch point must have period 1, 5, or 25.

(c) We have that

Because all five iterates are different, the period of the periodic point (1/101, 0) mustbe greater than 5.

(d) From part (b) all points of the form (m/101, n/101) must have period 1, 5, or 25. Frompart (c) the point (1/101, 0) has period greater that 5, and so must have period 25.Because the least common multiple of 1, 5, and 25 is 25, it follows that

5. If 0 ≤ x < 1 and 0 ≤ y < 1, then T(x, y) = (x + 5/12, y) mod 1, and so T 2(x, y) = (x + 10/12, y) mod 1, T 3(x, y) = (x + 15/12, y) mod 1, . . . , T12(x, y) = (x + 60/12, y) mod 1 =(x + 5, y) mod 1 = (x, y). Thus every point in S returns to its original position after

( ) .101 25=∏

Γ1

101

0

1 1

1 2

1

=

1101

0

1

1

101

1

101

=

mod

=

,

Γ21

101

0

1 1

1 2

=mod

1

101

1

101

1

2

101

3

101

,

Γ31

101

0

=

1 1

1 2

2

101

3

101

mod11

5

101

108

101

1

101

0

4

=

,

Γ

=

1 1

1 2

5

101

8

101

=

mod1

13

101

21

101

,,

Γ51

101

0

1 1

1 2

=

=mod

13

101

21

101

1

34

101

55

1001

,


12 iterations and so every point in S is a periodic point with period at most 12. Becauseevery point is a periodic point, no point can have a dense set of iterates, and so the mappingcannot be chaotic.

6. (a) The matrix of Arnold’s cat map, is one in which (i) the entries are all

integers, (ii) the determinant is 1, and (iii) the eigenvalues, …

and …, do not have magnitude 1. The three conditions of anAnosov automorphism are thus satisfied.

(b) The eigenvalues of the matrix are ±1, both of which have magnitude 1. By

part (iii) of the definition of an Anosov automorphism, this matrix is not the matrix ofan Anosov automorphism.

The entries of the matrix are integers; its determinant is 1; and neither of its

eigenvalues, has magnitude 1. Consequently, this is the matrix of an Anosovautomorphism.

The eigenvalues of the matrix are both equal to 1, and soboth have magnitude

1. By part (iii) of the definition, this is not the matrix of an Anosov automorphism.

The entries of the matrix are integers; its determinant is 1; and neither of its

eigenvalues, has magnitude 1. Consequently, this is the matrix of an Anosovautomorphism.

The determinant of the matrix is 2, and so by part (ii) of the definition, this

is not the matrix of an Anosov automorphism.

(c) The eigenvalues of the matrix are ±i; both of which have magnitude 1.

By part (iii) of the definition, this cannot be the matrix of an Anosov automorphism.

0 1

1 0−

6 2

5 2

4 15,±

5 7

2 3

1 0

0 1

2 3,±

3 2

1 1

0 1

1 0

( ) .3 5 2 0 3819+ =

( ) .3 5 2 2 6180+ =

1 1

1 2

,


Starting with an arbitrary point in the interior of S, (that is, with 0 < x < 1

and 0 < y < 1) we obtain

Thus every point in the interior of S is a periodic point with period at most 4. The

geometric effect of this transformation, as seen by the iterates, is to rotate each point

in the interior of S clockwise by 90° about the center point of S.

Consequently, each point in the interior of S has period 4 with the exception of the

center point which is a fixed point.

For points in the interior of S, we first observe that the origin is a fixed

point, which can easily be verified. Starting with a point of the form with 0 < x < 1,

we obtain a 4-cycle if ,

otherwise we obtain the 2-cycle1 2

0

0

1 2

1 2

0

→

→

.

x ≠ 1 2x

x

x

0

0

1

1

0

0

→

→

→−

−

xx

x

→

0

x

0

0

0

1 2

1 2

1 2

1 2

0 1

1 01

−

=

=

−

mod mod

x

y

y

x11

1

0 1

1 0 1

,

mo

=−

−

=

−

y

x

y

xdd mod ,1

11

1

1

0 1

1 0

=−−

=

−−

−

x

y

x

y

=

−−

=

−− +

mod

1

11

1

1

x

y

y

xmod ,1

1

0 1

1 0

1

=−

−

=

−

y

x

y

x =

− +

=

mod mod .1

11

x

y

x

y

x

y


Similarly, starting with a point of the form with 0 < y < 1, we obtain a 4-cycle

if , otherwise we obtain the

2-cycle . Thus every point not in the interior of S is a

periodic point with point 1, 2, or 4. Finally, because all points in S can have a dense setof iterates and so the mapping cannot be chaotic.

7. That Arnold’s cat map is a one-to-one mapping over the unit square S and that its range isS is fairly obvious geometrically. An analytical proof would proceed as follows: To showthat it is one-to-one, suppose that (x1, y1) and (x2, y2), with 0 ≤ x

i< 1 and 0 ≤ y

i< 1, are

two points in S such that Γ(x1, y1) = Γ(x2, y2). From the definition of Γ it follows that

(x1 + y1) mod 1 = (x2 + y2) mod 1

(x1 + 2 y1) mod 1 = (x2 + 2 y2) mod 1.

From the definition of “mod 1”, there are integers r1, s1, r2, and s2 such that

x1 + y1 – r1 = x2 + y2 – r2 (A)

x1 + 2y1 – s1 = x2 + 2y2 – s2. (B)

Subtracting the first equation from the second gives

y1 – s1 + r1 = y2 – s2 + r2

or

y1 – y2 = s1 – s2 + r2 – r1. (C)

The right-hand side of Eq. (C) is an integer and the left-hand side satisfies –1 < y1 – y2 < 1.Consequently, the integer on the right-hand side must be zero, and so y1 = y2. Putting y1 = y2 into Eq. (A) gives x1 – r1 = x2 – r2 and a similar argument shows that x1 = x2. Thus(x1, y1) = (x2, y2), which shows that the mapping is one-to-one.

0

1 2

1 2

0

0

1 2

→

→

y ≠ 1 2

0

0

0

1

l

y

y

y

→

→−

→−−

→

y

y0

0

0

y


To show that the range of Γ is S, let (u, v) be any point in S, so that 0 ≤ u < 1 and 0 ≤ v < 1. We want to show that there exists a point (x, y) in S such that Γ(x, y) = (u, v);that is, such that

(x + y) mod 1 = u

(x + 2y) mod 1 = v.

From the definition of “mod 1”, this means that we must find integers r and s that

x + y – r = u

x + 2y – s = v

and for which x and y lie in [0, 1). Solving for x and y in this linear system gives

x = (2u – v) + (2r – s)

y = (–u + 2 v) + (–r + s).

Let a and b be integers such that (2u – v) + a and (–u + 2v) + b lie in the interval [0, 1).Solving the system

2r – s = a

–r + s = b

gives r = a + b and s = a + 2b, both of which are integers. This choice of r and s thenprovides the desired values of x and y.

8. Let (x, y) lie in S, so that 0 ≤ x < 1 and 0 ≤ y < 1. We must show that Γ–1(Γ(x, y)) = (x, y). From the definition of Γ–1 and Γ we obtain

Γ–1(Γ(x, y)) = Γ–1(x + y mod 1, x + 2y mod 1)

= Γ–1(x + y – r, x + 2y – s), for certain integers r and s,

= (2(x + y – r) – (x + 2y – s), – (x + y – r) + (x + 2y – s)) mod 1

= (x – 2r + s, y + r – s) mod 1

= (x, y),

where the last step follows because –2r + s and r – s are integers and 0 ≤ x, y < 1.



9. As per the hint, we wish to find the regions in S that map onto the four indicated regionsin the figure below.

We first consider region I′ with vertices (0, 0), (1/2, 1), and (1, 1). We seek points (x1, y1),(x2, y2), and (x3, y3), with entries that lie in [0, 1], that map onto these three points under

the mapping for certain integer values of a and b to be

determined. This leads to the three equations

The inverse of the matrix We multiply the above three matrix

equations by this inverse and set Notice that c and d must be

integers. This leads to

c

d

2

1

=−

−

.1

1

a

b

1

1is

21

2

1

1 1

−

−

.

1

1

1

2

1

1

+

x

y

a

b==

,0

0

1

2

2

2

1

1

x

y

/,+

=

a

b

1 2

1

1

2

1

1

+

=x

y

a

b

3

3

1

1

.

x

y

x

y

a

b

→

+

1 2

1 2

I'

II'

III'

IV'

(1, 1)(0, 1)

(0, 0) (1, 0)( , 0)

( , 1)

12

12

The only integer values of c and d that will give values of xI

and yi

in the interval [0, 1]

are c = d = 0. This then gives a = b = 0 and the mapping that

maps the three points (0, 0), (0, 1/2), and (1, 0) to the three points (0, 0), (0, 1/2), and(1, 1), respectively. The three points (0, 0), (0, 1/2), and (1, 0) define the triangular regionlabeled I in the diagram below, which then maps onto the region I′ above.

For region II′, the calculations are as follows:

1

1

1

2

1

1

+

x

y

a

b==

,1 2

0

1

2

2

2

1

1

x

y++

=

,

a

b

1

1

I

IIIII

IV

(1, 1)(0, 1)

(0, 0) (1, 0)

(1, )12(0, )1

2

x

y

x

y

→

1 1

1 2

x

y

1

1

2 1

1 1

0

0

=−

−

−

= −

,c

d

c

d

x

y

2

2

=−

−

/2 1

1 1

1 2

1 /−

=

−

c

d

c

d

0

1 2

=−

−

,

x

y

3

3

2 1

1 1

1

1

−

=

−c

d

1

0.

c

d


Only c = 1 and d = –1 will work. This leads to a = 0, b = –1 and the mapping

that maps region II with vertices (0, 1/2), (0, 1), and

(1, 0) onto region II′.

For region III′, thew calculations are as follows:

1

1

1

2

1

1

+

x

y

a

b==

+

,0

0

1

2

2

2

1

1

x

y,

a

b

=

1 2

1

1

2

1

1

x

y

a

b

3

3

0

1

+

=

=−

−

;

x

y

1

1

1

1 1

0

0

2

−

= −

c

d

c

d

x

y

2

2

=−

−

−2 1

1 1

1 2

1

c

d

=

−

,0

1 2

3

3

c

d

x

y

=−

−

−2 1

1 1

0

1

cc

d

c

d

=−

−

.1

1

x

y

x

y

→

+

−

1 1

1 2

0

1

1

1

1

2

1

03

3

+

=

x

y

a

b

=

−−

;

x

y

1

1

1

1 1

1 2

0

2

−

=

−

−

/

c

d

c

d

1

1 2

=

−−

,

x

y

2

2

1

1 1

1

1

2

−

=

−

c

d

c

d

0

1,

x

y

3

3

1

1 1

1

0

=

−−

2.−

=

−

−

c

d

c

d

2

1


Only c = –1 and d = 0 will work. This leads to a = –1, b = –1 and the mapping

that maps region III with vertices (1, 0), (1, 1/2), and

(0, 1 ) onto region III′.

For region IV′, the calculations are as follows:

Only c = 0 and d = –1 will work. This leads to a = –1, b = –2 and the mapping

that maps region IV with vertices (0, 0), (1, 1), and

(1/2, 0) onto region IV′.

x

y

x

y

→

+1 1

1 2

−−−

1

2

1

1

1

2

1

1

+

x

y

a

b==

+

,0

0

1

2

2

2

1

1

x

y,

a

b

=

1

1

1

2

1

1

x

y

a

b

3

3

1 2

0

+

=

=−

−

;

x

y

1

1

1

1 1

0

0

2

−

= −

c

d

c

d

x

y

2

2

=−

−

−

2 1

1 1

1

1

c

d

=

−

,1

0

3

3

c

d

x

y

=−

−

−2 1

1 1

1 2

0

c

dd

c

d

−

−

.1

1 2

x

y

x

y

→

+1 1

1 2

−−

−

1

1


10. We first establish the result for n = 2. Given (x0, y0) we have

and

where r0, s0, r1 and s1 are integers. This last expression is the same as

mod 1, which is what we wanted to show.

A similar argument establishes the result for any integer n.

11. As per the hint, we want to show that the only solution of the matrix equation

with 0 ≤ x0 < 1, 0 ≤ y0 < 1, and integer r and s is x0 = y0 = 0. This matrix equation isequivalent to the system

x0 = x0 + y0 – r

y0 = x0 + 2y0 – s.

The first equation is y0 = r, which can only have the solution y0 = r = 0 with the givenconstraints on y0 and r. Putting y0 = 0 into the second equation yields x0 = s, which likewisehas only the solution x0 = s = 0.

x

y

x

y

r

s

0

0

0

0

1

1 2

=

−

1

1.

1

1 2

20

0

x

y

x

y

x

y

2

2

1

1

1

1 21

=

mod

1,=

−

=

1

1

1

1 2

1

1

1

1

1

1

x

y

r

s

22

1 1

1 20

0

0

0

−

−

x

y

r

s

−

=

r

s

x

1

1

201

1 2

1

yy

r s r

r s s0

0 0 1

0 0 12

−

+ ++ +

x

y

x

y

1

1

0

0

1

1 11

=

mod

1==

−

1 1

1 20

0

0

0

x

y

r

s


12. As per the hint, we want to find all solutions of the matrix equation

where 0 ≤ x0 < 1, 0 ≤ y0 < 1, and r and s are nonnegative integers. This equation can berewritten as

which has the solution First trying r = 0 and s = 0, 1,

2, …, then r = 1 and s = 0, 1, 2, …, etc., we find that the only values of r and s that yield

values of x0 and y0 lying in [0, 1) are:

r = 1 and s = 2, which give x0 = 2/5 and y0 = 1/5;



r = 3 and s = 5, which give x0 = 3/5 and y0 = 4/5.

We can then check that (2/5, 1/5) and (3/5, 4/5) form one 2-cycle and (1/5, 3/5) and (4/5,2/5) form another 2-cycle.

13. In the matrix equation mod 1, let the matrix be

denoted by , all of whose entries are positive integers. The matrix equation can

then be written as

where r and s are integers. By Cramer’s rule, this last equation has the solution

xd r b s

a d b cy

an n

n n n n

n0 0

1

1 1

1=

− −− − −

=−( )

( )( )

( )and

rr c r

a d b c

n

n n n n

−− − −( )( )

.1 1

x

y

a b

c d

x

y

n n

n n

0

0

0

0

=

−

−

−

r

s

a b

c d

x

y

n n

n n

or1

1

0

00

=

r

s

a b

c d

n n

n n

,

1 1

1 2

nx

y

x

y

n

0

0

0

0

1 1

1 2

=

xr s

yr s

0 04 3

5

3

5.=

− +=

−and

1 3

3 4

0

0

=

,x

y

r

s

x

y

x

y

0

0

0

0

2 3

3 5

=

−rr

s


These two expressions identify x0 and y0 as the quotients of integers, and hence as rational

numbers. One point remains: to verify that the determinant of the matrix

is not equal to zero, so that Cramer’s rule is valid. This is equivalent to showing that the

matrix does not have the eigenvalue 1. Now, the eigenvalues of this matrix are

λ1n and λ2

n, where λ1 and λ2 are the eigenvalues of Because these eigenvalues are

it is not possible that their

nth powers can be 1 for n = 1, 2, …

λ λ1 3 5 2 2 6180 3 5 2( / ) . ( /= + = = −… and 2 )) . ,= 0 3819 …

1 1

1 2

.

a b

c d

n n

n n

a b

c d

n n

n n

−−

1

1


EXERCISE SET 11.16

1. First we group the plaintext into pairs and add the dummy letter T:

DA RK NI GH TT

or equivalently from Table 1:

4 1 18 11 14 9 7 8 20 20

(a) For the enciphering matrix we have

1 3

2 1

4

1

7

9

1 3

2 1

=

G

I

=

=

18

11

51

47

25

21

=

=

Y

U

1 3

2 1

14

9

41

37

115

1126

1 3

2 1

7

8

O

K(mod )

=

=

31

22

5

22

1 3

2 1

E

V

=

=

20

20

80

60

2

8

B

H

A =

1 3

2 1

701

The Hill cipher is

GIYUOKEV BH

1. (b) For the enciphering matrix we have

The Hill cipher is

SFANEFZW JH

2. (a) For , we have det(A) = 18 – 7 = 11, which is not divisible by 2 or 13.

Therefore by Corollary 2, A is invertible. From Eq. (2):

A =

9 1

7 2

4 3

1 2

4

1

19

6

4 3

1 2

=

S

F

=

=

18

11

105

40

1

14

=

A

N

4 3

1 2

14

9

83

32==

5

626

4 3

1 2

7

8

E

F(mod )

=

=

52

23

0

23

4 3

1 2

Z

W

=

=

20

20

140

60

10

8

J

H

A =

4 3

1 2


Checking:

(b) For , we have det(A) = 9 – 5 = 4, which is divisible by 2. Therefore

by Corollary 2, A is not invertible.

(c) For , we have det(A) = 72 – 11 = 61 = 9 (mod 26), which is not

divisible by 2 or 13. Therefore by Corollary 2, A is invertible. From (2):

A− −=

−

−

=−

−

1 199 11

1 83

9 11

1 8( )

==−

−

=

27 33

3 24

1 19

23 242(mod 66).

A =

8 11

1 9

A =

3 1

5 3

AA− =

=

1 9

7

1

2

12 7

23 15

131 78

130 79

=

=

−

1 0

0 126

12 7

23 15

91

(mod )

A A77

1

2

157 26

312 53

1 0

0 1

=

=

(mod 226) .

A− −=

−

−

=−

−

=

1 1112 1

7 919

2 1

7 9

3

( )

88 19

133 171

12 7

23 1526

−

−

=

(mod )).


Checking:

2. (d) For , we have det(A) = 14 – 1 = 13, which is divisible by 13. Therefore

by Corollary 4, A is not invertible.

(e) For , we have det(A) = 6 – 6 = 0, so that A is not invertible by

Corollary 2.

(f) For , we have det(A) = 3 – 8 = –5 = 21 (mod 26), which is not

divisible by 2 or 13. Therefore by Corollary 2, A is invertible. From (2):

Checking:

AA− =

=11

1

8

3

15 12

21 5

183 52

78 27

=

=

−

1 0

0 126

15 12

21 5

1

(mod )

A A

=

=1

1

8

3

27 156

26 183

1 0

00 126

(mod ).

A− −=

−

−

=−

−

=

1 1213 8

1 15

3 8

1 1

15

( )

−−

−

=

40

5 5

15 12

21 526(mod ).

A =

1 8

1 3

A =

3 1

6 2

A =

2 1

1 7

AA− =

=18 11

1 9

1 19

23 24

261 4116

208 235

1 0

0 126

1

=

−

(mod )

A AA =

=1 19

23 24

8 11

1 9

27 182

208 4469

1 0

0 126

=

(mod ).


3. From Table 1 the numerical equivalent of this ciphertext is

19 1 11 14 15 24 1 15 10 24

Now we have to find the inverse of .

Since det(A) = 8 – 3 = 5, we have from Corollary 2:

To obtain the plaintext, we multiply each ciphertext vector by A–1:

The plaintext is thus

WE LOVE MATH

16 5

15 6

19

1

309

291

23

=

=55

16 5

15 6

11

14

246

2

=

W

E

449

12

15

16 5

15 6

15

2

=

L

O

44

360

369

22

6

=

=

V

E(mod )26

16 5

15 6

1

15

91

105

=

=

13

1

16 5

15 6

10

24

M

A

=

=

280

294

20

8

T

H

A− −=

−

−

=−

−

1 15

2 1

3 421

2 1

3 4( )

=

−

−

=

42 21

63 84

16 5

15 6 (mod ).26

A =

4 1

3 2


4. From Table 1 the numerical equivalent of the known plaintext is

AR MY

1 18 13 25

and the numerical equivalent of the corresponding ciphertext is

SL HK

19 12 8 11

so the corresponding plaintext and ciphertext vectors are

We want to reduce

to I by elementary row operations and simultaneously apply these operations to

The calculations are as follows:

19

8

12 1

11 13

18

25

1

8

132 11

11 13

198

25

P

t

t.=

=

pp

pp

1

2

1 18

13 25

C

t

t=

=

cc

cc

1

2

19 12

8 11

pp cc

pp

1 1

2

=

← → =

=

1

18

19

12

113

25

8

11

← → =

cc2


We multiplied the first row by19–1 = 11 (mod 26) .

We formed the matrix .C P

Thus so the deciphering matrix is

(mod 26).

Since (mod 26), we have

for the enciphering matrix.

A A= =−

−

=

−−

− − −( ) ( )1 1 1235 15

6 717

5 15

6 7

=−

−

=

85 255

102 119

7 5

2 1526(mod )

A−( ) = − = − =1 35 90 55 23

A− =

1 7 15

6 5

( )At− =

1 7 6

15 5

− − −

1

8

2 11

11 13

16

25

1

0

2 11

5 75

16

103

1

0

2 11

21 3

16

1

1

0

2 11

1 15

16

5

−

1

0

0 19

1 15

6

5

1

0

0 7

1 15

6

5


We added –8 times the first rowto the second.

We replaced the entries in thesecond row by their residuesmodulo 26.

We replaced 132 and 198 bytheir residues modulo 26 .

We added –2 times the secondrow to the first.

We replaced –19 by its residuemodulo 26.

We multiplied the second rowby 21–1 = 5 (mod 26).

5. From Table 1 the numerical equivalent of the known plaintext is

AT OM

1 20 15 13

and the numerical equivalent of the corresponding ciphertext is

JY QO

10 25 17 15

The corresponding plaintext and ciphertext vectors are:

We want to reduce



10

17

25 1

15 15

20

13

27

17

40 16

15 15

33

13

P =

1 20

15 13.

C =

10 25

17 15

pp cc

pp

1 1

2

1

20

10

25

15

13

=

← → =

=

← → =

cc2

17

15


We added the second row tothe first (since 10–1 does notexist mod 26).


Thus , and so the deciphering matrix is

From Table 1 the numerical equivalent of the given ciphertext is

LN GI HG YB VR EN JY QO

12 14 7 9 8 7 25 2 22 18 5 14 10 25 17 15

A− =

1 24 5

19 14.

At−( ) =

1 24 19

5 14

− − −

1

17

14 16

15 15

7

13

1

0

14 16

223 257

7

1106

1

0

14 16

11 3

7

24

1

0

14 16

1 57

7

4

556

1

0

14 16

1 5

7

14

1

0

0 54

1 5

189

1

− −

44

1

0

0 24

1 5

19

14


We replaced the entries in thefirst row by their residuesmodulo 26.


We added –17 times the firstrow to the second.


We multiplied the second rowby 11-1 = 19 (mod 26).

We replaced –54 and –189 bytheir residues modulo 26.

We added –14 times the secondrow to the first.

To obtain the plaintext pairs, we multiply each ciphertext vector by A–1:

which yields the message

THEY SPLIT THE ATOM

6. Since we want a Hill 3-cipher, we will group the letters in triples. From Table 1 thenumerical equivalent of the known plaintext are

I H A V E C O M E

9 8 1 22 5 3 15 13 5

24 5

19 14

12

14

358

424

=

=220

8

24 5

19 14

7

9

21

=

T

H

33

259

5

25

24 5

19 14

=

E

Y

88

7

227

250

19

16

2

=

=

S

P

44 5

19 14

25

2

610

503

1

=

=22

926

24 5

19 14

22

18

(mod )L

I

=

=

618

670

20

20

24 5

19

T

T

114

5

14

190

291

8

5

=

=

=

H

E

24 5

19 14

10

25

365

540

=

1

20

24 5

19 14

17

15

A

T

=

=

483

533

15

13

O

M


and the numerical equivalents of the corresponding ciphertext are

H P A F Q G G D U

8 16 1 6 17 7 7 4 21

The corresponding plaintext and ciphertext are

We want to reduce



8

6

7

16

17

4

1 9

7 22

21 15

8

5

13

1

3

5

P =

9 8 1

22 5 3

15 13 5

.

C =

8 16 1

6 17 7

7 4 21

pp cc

pp

1 1

2

=

← → =

=

9

8

1

8

16

1

22

5

3

← → =

=

cc

pp

2

3

6

17

7

15

13

5

← → =

cc3

7

4

21



15

6

7

20

17

4

22 24

7 22

21 15

21

5

13

6

3

5

1

6

7

140

17

4

154 168

7

222

21 15

147

5

13

42

3

5

1

6

7

1

00

17

4

24 12

7 22

21 15

17

5

13

16

3

5

−

−

− −

− −

−

−

1

0

0

10

43

66

24 12

137 50

147 69

17

97

1106

16

93

107

1

0

0

10

9

12

24 12

19

−

−

22

9 9

17

7

24

16

11

23

1

0

0

10

11

12

24 12

57 6

9 9

17

21

24

16

33

23

1

0

0

10

1

12

24 12

5 6

9 9

17

21

24

16

7

23

− −

− −

−1

0

0

0

1

0

26 48

5 6

51 63

1993

21

228

54

7

61−

−

−


We added the third row to thefirst since 8–1 does not existmodulo 26.

We replaced the entries in thefirst row by their residuesmodulo 26.

We replaced the entries in thesecond and third rows by theirresidues modulo 26.

We added –6 times the first rowto the second and –7 times thefirst row to the third.


We added –10 times the secondrow to the first and –12 timesthe second row to the third.

We multiplied the first row by15–1 = 7 (mod 26).

We multiplied the second rowby 9–1 = 3 (modulo 26).

Thus,

and so the deciphering matrix is

From Table 1 the numerical equivalent of the given ciphertext is

H P A F Q G G D U G D D

8 16 1 6 17 7 7 4 21 7 4 4

H P G O D Y N O R

8 16 7 15 4 25 14 15 18

A− =

1

4 9 15

15 17 6

24 0 17

.

At−( ) =

1

4 15 24

9 17 0

15 6 17

1

0

0

0

1

0

0 4

5 6

11 15

15

21

6

24

7

17

1

0

0

0

1

00

0 4

0 69

1 15

15

9

6

24

78

17

1

0

0

− − −

0

1

0

0 4

0 9

1 15

15

17

6

24

0

17


We replaced the entries in thefirst and second row by theirresidues modulo 26.


We added –5 times the thirdrow to the second.

To obtain the plaintext triples, we multiply each ciphertext vector by A–1:

Finally, the message is

I HAVE COME TO BURY CAESAR

4 9 15

15 17 6

24 0 17

8

16

1

191

3

= 998

209

9

8

1

4 9 15

15 17 6

2

=

I

H

A

44 0 17

6

17

7

282

421

263

=

=

22

5

3

4 9 15

15 17 6

24 0 17

V

E

C

=

=

7

4

21

379

299

525

155

13

5

4 9 15

15 17 6

24 0 17

7

O

M

E

44

4

124

197

236

20

15

2

=

=

T

O

B

(mod )26

4 9 15

15 17 6

24 0 17

8

166

7

281

434

311

21

18

25

=

=

U

R

Y

4 9 15

15 17 6

24 0 17

15

4

25

=

=

471

443

785

3

1

5

C

AA

E

4 9 15

15 17 6

24 0 17

14

15

18

==

=

461

573

642

19

1

18

S

A

R


7. Since 29 is a prime number, by Corollary 1 a matrix A with entries in Z29 is invertible if andonly if det(A) ≠ 0 (mod 29).

8. Testing x = 0, 1, 2, … , 25 in 4x = 1 (mod26) gives

4( 0) = 0 ≠ 1

4( 1) = 4 ≠ 1

4( 2) = 8 ≠ 1

4( 3) = 12 ≠ 1

4( 4) = 16 ≠ 1

4( 5) = 20 ≠ 1

4( 6) = 24 ≠ 1

4( 7) = 28 = 2 ≠ 1

4( 8) = 32 = 6 ≠ 1

4( 9) = 36 = 10 ≠ 1

4(10) = 40 = 14 ≠ 1 (mod 26)

4(11) = 44 = 18 ≠ 1

4(12) = 48 = 22 ≠ 1

4(13) = 52 = 0 ≠ 1

4(14) = 56 = 4 ≠ 1

4(15) = 60 = 8 ≠ 1

4(16) = 64 = 12 ≠ 1

4(17) = 68 = 16 ≠ 1

4(18) = 72 = 20 ≠ 1

4(19) = 76 = 24 ≠ 1

4(20) = 80 = 2 ≠ 1

4(21) = 84 = 6 ≠ 1

4(22) = 88 = 10 ≠ 1

4(23) = 92 = 14 ≠ 1

4(24) = 96 = 18 ≠ 1

4(25) = 100 = 22 ≠ 1


9. (a) We have

where Api

= ci

i = 1, 2, … , n.

Assuming A–1 exists, we can write

pi

= A–1ci

i = 1, 2,…, n

or pit = c

it (A–1)t i = 1, 2,…, n

which in matrix form is

P = C (A–1)t.

(b) From part (a), P = C (A–1)t. Multiply both sides by En… E2E1 from the left to

obtain

En… E2E1P = E

n… E2E1C (A–1)t.

But En…E2E1C = I, so that

En… E2E1P = (A–1)t.

10. (a) We have

where Api

= ci

(mod 26)

or pi

= A–1ci

(mod 26)

or pit = c

it (A–1)t (mod 26), i = 1, 2,…, n

P C

t

t

nt

t

t

nt

=

=

pp

pp

pp

cc

cc

cc

1

2

1

2

P C

t

t

nt

t

t

nt

=

=

pp

pp

pp

cc

cc

cc

1

2

1

2


which in matrix form is

P = C (A–1)t (mod 26) .

Multiplying both sides by C–1 gives

C–1P = (A–1)t

or

A–1 = (C–1 P)t (mod 26).

(b) From Example 8, we have

Using Eq. (2) to find C–1 we have

and

C P− =

=

1 22 17

25 21

4 5

1 18

1055 416

121 503

1 0

17 9(mod

=

26)) .

C− − −= −

−−

= −

−1 1 118 2852 15

19 9267

2 1( ) ( )

55

19 9

192 15

19 911

2 151

−

=−

−

=

−−

−119 9

22 165

209 99

22 17

25 21

=−

−

=

(mod ),26

P C=

=

4 5

1 18

9 15

19 2, .


Thus from part (a):

A C Pt− −= =

1 1 1 17

0 9( ) .


EXERCISE SET 11.17

1. Use induction on n, the case n = 1 being already given. If the result is true for n – 1, thenMn = Mn–1 M = (PDn–1 P–1)(PDP–1) = PDn–1(P–1 P)DP–1 = PDn–1 DP–1 = PDn P–1, provingthe result.

2. Using Table 1 and the notations of Example 1, we derive the following equations:

The transition matrix is thus

The characteristic polynomial of M is det(λI – M) = λ3 – (3/2)λ2 + (1/2)λ = λ(λ – 1)

so the eigenvalues of M are and λ3 = 0. Corresponding λ λ,= =11

22λ ,−

1

2

M =

1

2

1

40

1

2

1

2

1

2

01

4

1

2

.

a a b

b a b c

n n n

n n n

= +

= + +

− −

− −

1

2

1

41

2

1

2

1

2

1 1

1 1 nn

n n nc b c

−

− −= +

1

1 11

4

1

2.

719

eigenvectors (found by solving (λI – M)x = 0) are e1 = [1 2 1]t, e2 = [1 0 –1]t ande3 = [1 –2 1]t. Thus

This yields

Remembering a0 + b0 + c0 = 1, we obtain

Since approaches zero as we obtain as

n → ∞.

anda b cn n n→ → →1

4

1

2

1

4, ,n → ∞.1

2

1

+n

a a b c an

n n

= + + +

−

+ +1

4

1

4

1

4

1

2

1

20 0 0

1

0

1

cc

a c

b a b c

n

n

0

1

0 0

0 0 0

1

4

1

2

1

2

1

2

1

2

= +

−( )

= + + =

+

11

2

1

4

1

4

1

2

1

2

1

20 0 0

1

0c a b c an

n

= + + −

+

+ nn

n

c

a c

+

+

= −

−( )

1

0

1

0 01

4

1

2.

xx( )n

n

n

n

a

b

c

=

=

+

1

4

1

2

nn n

n

+ +

+

−

−

1 1

1

1

4

1

4

1

2

1

2

1

2

1

2

1

4

1

2

1

4

1

44

1

2

1

0

0

0+

+n

a

b

c

.

M PD Pn n= = −

−

−1

1 1 1

2 0 2

1 1 1

1 0 0

01

2

−

−

n

0

1 0 0

1

4

1

4

1

41

20

1

21

4

11

4

1

4

.


3. Call M1 the matrix of Example 1, and M2 the matrix of Exercise 2. Then x(2n) = (M2M1)n x(0)

and x(2n+1) = M1(M2M1)n x(0). We have

The characteristic polynomial of this matrix is so the eigenvalues are

λ1 = 1, λ3 = 0. Corresponding eigenvectors are e1 = [5 6 1]t, e2 = [–1 0 1]t,

and e3 = [1 –2 1]t. Thus,

Using the notation of Example 1 (recall a0 + b0 + c0 = 1), we obtain

and

a a b c

b a

n n

n n

2 1 0 0 0

2 1 0

2

3

1

6 42 4

1

3

1

6 42

+

+

= +⋅

− +( )

= −⋅

−− +( )=+

b c

c n

0 0

2 1

4

0.

a a b c

b

c

n n

n

n

2 0 0 0

2

2

5

12

1

6 42 4

1

21

12

1

6 4

= +⋅

− +( )

=

= −⋅ nn

a b c2 40 0 0− +( ) .

( )M M PD Pn n

2 11

5 1 1

6 0 2

1 1 1

1

= =

−

−

−

00 0

01

40

0 0 0

1

12

1

12

1

12

−

n

11

3

1

6

2

31

4

1

4

1

4−

λ21

4= ,

λ λ λ3 25

4

1

4− + ,

M M2 1

1

2

1

40

1

2

1

2

1

2

01

4

1

2

11

=

220

01

21

0 0 0

1

2

3

8

1

41

2

1

2

1

2

01

=

88

1

4

.


4. The characteristic polynomial of M is (λ – 1) so the eigenvalues are λ1 = 1 and

Corresponding eigenvectors are easily found to be e1 = [1 0]t and e2 = [1 –1]t.

From this point, the verification of Eq. (7) is in the text.

5. From Eq. (9), if b0 = .25 = , we get b1 , then b2 = , b3 = ,

and in general bn

= We will reach = .10 in 12 generations. According to

Eq. (8), under the controlled program the percentage would be in 12 generations, or

= .00006 = .006%.

6. P

D P

t

n

−

−

= + −

=

1 0

1 0

1

2

1

20 0

1

2

5

2

1

2

5

2xx

xx

( )

( ) 11

2

1

20 0 2

1 5

22

1 5

2

1 1+

−

+ +n ntt

n

n

PD P−

+

=

− +( ) +

− −(

1 0

11

2

1

23 5

1 5

4

1

23 5

xx( )

)) −

+

+

−

+

+ +

1 5

4

21 5

42

1 5

4

1

1

n

n n 11

1

2

1 5

4

1

2

1 5

4

1

2

1 5

4

+

+

−

+

+

n n

n

11

2

1 5

4

21 5

42

1 5

4

1 1

−

+

+

−

+ +

n

n n

11

2

1

23 5

1 5

4

1

23 5

1 5

4

1

− +( ) +

− −( ) −

+n n++

1

1

16384

1

214

2

20

2

8 + n.

1 5

11 10

2

11=

2 9

10 9

1

5=1 4

9 8

2

9=

1

4

λ21

2= .

λ −

,

1

2


Since all exponentiated terms above are less than one in absolute value, we have

7. From (13) we have that the probability that the limiting sibling-pairs will be type (A, AA)is

The proportion of A genes in the population at the outset is as follows: all the type (A, AA)genes, 2/3 of the type (A, Aa) genes, 1/3 the type (A, aa) genes, etc. … yielding

8. From an (A, AA) pair we get only (A, AA) pairs and similarly for (a, aa). From either(A, aa) or (a, AA) pairs we must get an Aa female, who will not mature. Thus no offspringwill come from such pairs. The transition matrix is then

9. For the first column of M we realize that parents of type (A, AA) can produce offspring onlyof that type, and similarly for the last column. The fifth column is like the second column,and follows the analysis in the text. For the middle two columns, say the third, note thatmale offspring from (A, aa) must be type a, and females are of type Aa, because of the waythe genes are inherited.

1 0 0 0

0 0 0 0

0 0 0 0

0 0 0 1

.

a b c d e0 0 0 0 02

3

1

3

2

3+ + + + .

a b c d e0 0 0 0 02

3

1

3

2

3+ + + + .

lim .( )

n

n

t

→∞=

xx1

20 0 0 0

1

2


EXERCISE SET 11.18

1. (a) The characteristic polynomial of L is λ2 – λ – 3/4, so the eigenvalues are λ1 = 3/2,λ2 = –1/2. The eigenvector corresponding to λ1 is [3 1]t.

(b)

(c)

2. For L in Eq. (4), we obtain

The determinant of this matrix is obtained most easily using cofactors along the first row.Each minor is a lower-triangular matrix. Thus, the characteristic polynomial of L is

(λ – a1)λn–1 – a2b1λn–2 – a3b1b2λ

n–3 – … – anb1b2

… bn–1.

λ

λ

λ

λI L

a a a a a

b

b

n n

− =

− − − ⋅⋅⋅ − −

− ⋅⋅⋅

− ⋅

−1 2 3 1

1

2

0 0 0

0 ⋅⋅ ⋅

⋅ ⋅ ⋅ −

−

0 0

0 0 0 1

bn λ

.

xx xx xx6 51

5857

285

855

287( ) ( ) ( )= =

=

L . λ ..

xx xx xx xx1 0 2 3100

50

175

50( ) ( ) ( ) (= =

=

L , , ))

( ) ( )

=

=

=

250

88

382

125

570

194 5

,

,xx xx11

.

725

3. From the solution in Exercise 2, we see that the derivative of the characteristic polynomialevaluated at λ1 is

nλ1n–1 – (n – 1)a1λn

1–2

– (n – 2)a2b1λ1n–3 – … – a

n–1b1b2… b

n–2.

Recalling that ai

≥ 0 and at least one ai

> 0, and bi

> 0, we see that the above is strictlygreater than

So the derivative is positive and λ1 is a simple root.

4. The product

is [c 0 0 … 0]t, where c is the first element of P–1x(0). But then P[c 0 0 … 0]t returns c times the first column of P which is cx1, x1 being the first eigenvector, i.e., thus the firstcolumn of P.

5. a1 is the average number of offspring produced in the first age period. a2b1 is the numberof offspring produced in the second period times the probability that the female will liveinto the second period, i.e., it is the expected number of offspring per female during thesecond period, and so on for all the periods. Thus, the sum of these, which is the netreproduction rate, is the expected number of offspring produced by a given female duringher expected lifetime.

6. If the population is eventually decreasing, then λ1 < 1 and the characteristic polynomial of L is positive for all λ > λ1, in particular for λ = 1. But the polynomial evaluated at 1

1 0 0 0

0 0 0 0

0 0 0 0

⋅ ⋅ ⋅

⋅ ⋅ ⋅

⋅ ⋅ ⋅

−

p

11 0xx( )

n a a b a b bn n nn nλ λ λ1

11 1

22 1 1

31 1 2

− − −− −− − − −( ⋅⋅⋅ ⋅ ⋅ ⋅ ))

≥ ≥⋅⋅⋅ −n

a b bn nλ11 1 0.


(which is 1 minus the net reproduction rate) is positive. Thus the net reproduction rate isless than one. Conversely, if the R < 1 we get that the characteristic polynomial at 1 ispositive; thus λ1 < 1 and the population is eventually decreasing. For R > 1, just switch allinequality symbols above.

7.

8. R = 0 + (.00024)(.99651) + … + (.00240)(.99651) … (.987)

= 1.49611.

9. Let λ be any eigenvalue of L, and write We know λn = rneinθ, so

Since q(λ is real, we can

ignore all the sine terms in the formula.

Thus

i.e., . Since q is a decreasing function, we have r ≤ λ1.q r( ) ≥ 1

1 21 2 12

( ) cos cos= = + + ⋅⋅⋅ +qa

r

a b

r

anλ θ θbb b b

rn

a

r

a b

r

a b b

n

n

1 2 12

1 2 1 1 2

−

≤ + + ⋅⋅⋅ +

cos θ

⋅⋅⋅⋅=−b

rq rn

n

1 ( )

1 1 1 2 1= = − + ⋅⋅⋅+⋅⋅⋅ −q

a

ri

a b b bn n( ) (cos sin )λ θ θrr

n i nn

(cos sin )θ θ−

λ = re r iiθ θ θ= +(cos sin ).

R = +

+

=0 41

23

1

2

1

419 8.


EXERCISE SET 11.19

1. (a) The characteristic polynomial of L is λ3 – 2λ – 3/8 = (λ – 3/2)[λ2 + (3/2)λ + 1/4], so

λ1 = 3/2. Thus h, the fraction harvested of each age group, is from Eq. (6),

so the yield is of the population. The eigenvector corresponding to λ1 = 3/2 is

[1 1/3 1/18]t; this is the age distribution vector after each harvest.

(b) From Eq. (10), the age distribution vector x1 is [1 1/2 1/8]t. Eq. (9) tells us that h1 = 1 – 1/(19/8) = 11/19, so we harvest 11/19 or 57.9% of the youngest age class.Since Lx1 = [19/8 1/2 1/8]t, the youngest class contains 79.2% of the population.Thus the yield is 57.9% of 79.2%, or 45.8% of the population.

2. The Leslie matrix of Example 1 has b1 = .845, b2 = .975, b3 = .965, etc. This, together withthe harvesting data from Eq. (13) and the formula of Eq. (5) yields

The total of the entries of Lx1 is 7.584. The proportion of sheep harvested is h1(Lx1)2 +h9(Lx1)9 = 1.51, or 19.9% of the population.

3. Using the L of Eq. (3) and the x1 of Eq. (10) we obtain for the first coordinate of Lx1,

a1 + a2b1 + a3b1b2 + … + anb1b2b3

… bn–1.

x

x

1

1

1 845 824 795 755 699 626 532 0 0 0 0= [ ]. . . . . . . .t

L == [ 2 090 845 824 795 755 699 626 532 418 0 0 0. . . . . . . . . ]] t.

33 13

%

12

3

1

3− =

729

The other coordinates of Lx1 are, respectively, b1, b1b2, b1b2b3, … , b1b2… b

n–1. Thus,

4. In this situation we have hI

≠ 0, and h1 = h2 = … = hI–1 = h

I+1 = … = hn

= 0. Eq. (4) thentakes the form

a1 + a2b1 + a3b1b2 + aIb1b2

… bI–1(1 – h

I) + a

I+1b1b2… b

I(1 – h

I)

+ … + anb1b2

… bn–1(1 – h

I) = 1.

Factoring out (1 – hI) we get:

(1 – hI)[a

Ib1b2

… bI–1 + a

I+1b1b2… b

I+ … + a

nb1b2

… bn–1]

= 1 – a1 – a2b1 – … – aI–1b1b2

… bI–2.

L

a a b a b b a b b bn n

xx xx1 1

1 2 1 3 1 2 1 2 1

− =

+ + + ⋅ ⋅ ⋅ + ⋅ ⋅ ⋅ − −−

=

−

1

0

0

0

1

0

0

0

R

.


So

5. Here hJ

= 1, hI

≠ 0, and all the other hk

are zero. Then Eq. (4) becomes

a1 + a2b1 + … + aI–1b1b2

… bI–2 + (1 – h

I)[a

Ib1b2

… bI–1 + … + a

J–1b1b2… b

J–2] = 1.

We solve for hI

to obtain

ha a b a b b b

a b bI

I I

I

=+ + ⋅ ⋅ ⋅ + ⋅ ⋅ ⋅ −

⋅− −1 2 1 1 1 2 2

1 2

1

(

⋅ ⋅ + ⋅ ⋅ ⋅ + ⋅ ⋅ ⋅+

=+ +

− −b a b b b

a a b

I n n1 1 2 1

1 2 1

1

⋅⋅ ⋅ ⋅ + ⋅ ⋅ ⋅ − + ⋅ ⋅ ⋅ +− − −a b b b a b b bI I I I1 1 2 2 1 2 11 ⋅⋅ ⋅ ⋅ + ⋅ ⋅ ⋅⋅ ⋅ ⋅ + ⋅ ⋅ ⋅

−

−

a b b b

a b b b

n n

I I

1 2 1

1 2 1 ++ ⋅ ⋅ ⋅

=−

⋅ ⋅ ⋅ + ⋅ ⋅ ⋅ +

−

−

a b b b

R

a b b b

n n

I I

1 2 1

1 2 1

1

aa b b bn n1 2 1⋅ ⋅ ⋅ −


ha a b a b b b

a b bI

I I

I

=+ + ⋅ ⋅ ⋅ + ⋅ ⋅ ⋅ −

⋅− −1 2 1 1 1 2 2

1 2

1

⋅ ⋅ + ⋅ ⋅ ⋅ + ⋅ ⋅ ⋅+

=+

− − −b a b b b

a a b

I J J1 1 1 2 1

1 2 1

1

++ ⋅ ⋅ ⋅ + ⋅ ⋅ ⋅ −⋅ ⋅ ⋅ +

− −

−

a b b b

a b b b

J J

I I

1 1 2 2

1 2 1

1

⋅⋅ ⋅ ⋅ + ⋅ ⋅ ⋅− −.

a b b bJ J1 1 2 1

EXERCISE SET 11.20

1. From Theorem 2, we compute

So the least-squares trigonometric polynomial is

2. From Theorem 3, we compute

and

So the least-squares trigonometric polynominal is

T T t

T

t

T

t

T

2 2

23

2 1

4

4 1

9

6 1

16

8+ + + +

ππ π π

cos cos cos cosππ

ππ π

t

T

T t

T

t

T

− + +sin sin sin2 2 1

2

4 1

3

6ππ πt

T

t

T+

1

4

8sin .

bT

tk t

Tdt

T

kk

Tsin .= = −∫

2 222

0

ππ

aT

tk t

Tdt

T

kk

T= =∫

2 222

2 20cos

ππ

aT

t dt TT

02 2

0

2 2

3= =∫ ,

π 2

34 2

4

93+ + +cos cost t tcos .

4 10

22

0

2

kb t kt dtk, ( ) sin .and = − =∫π

ππ

a t dt a t kt dtk02 2 2

0

2

0

21 2

3

1= − = = −∫π

π ππ

ππ

( ) , ( ) cosππ

∫ =

733

3. As in Exercise 1,

(Note the upper limit on second integral),

So the least-squares trigonometric polynomial is


So the least-square trigonometric polynomial is

2 4

1 3

2

3 5

3

5 7π πcos cos cos

−⋅

+⋅

+⋅

+ ⋅⋅⋅t t t

++− +

cos

( )( ).

nt

n n2 1 2 1

a t kt dtk

k = = −−

= −

∫ sin cos( )

1 1

2

4

4 1

4

0

2

2π π

π

π

(( )( ),

sin sin .

2 1 2 1

1 1

20

0

2

k k

b t kt dtk

− +

= =∫ππ

a t dt0 0

21 1

2

4= =∫π π

πsin .

1 1

2

2

32

2

154

π π π+ − −sin cos cos .t t t

a t kt dt

kk kt

k =

=−

∫1

1 1

1

0

2

π

π

πsin cos

[ sin sinn cos cos ]

[ (

t kt t

k

+

=−

+ −

0

2

1 1

10 1

π

π)) ]

( )

k

k

kk

− −

=−

−

1

2

1

0

2

1

if is odd

if isπ

even.

if

= =∫b kt t dtk

11

20ππ

sin sinkk

k

=

1

if 10 >.

a f t dt t dt0 00

21 1 2= = =∫∫π π π

ππ( ) sin



So the least-squares trigonometric polynomial is:

if n is even; the last term involves n – 1 if n is odd.

6. (a)

(b)

(c)

7. There are several cases to consider. First, the function 1 is orthogonal to all the othersbecause

cos sin .kt dt kt dt0

2

0

20

π π∫ ∫= =

sin sincos

kt kt dtt

dt= =−

=∫ ∫20

2

0

2 1 2

2

π ππ ..

cos coscos

kt kt dtt

dt= =+

=∫ ∫20

2

0

2 1 2

2

π ππ ..

1 20

2= =∫ dt

ππ .

T T t

T

t

T4

8 1

2

2 1

6

6 1

10

12 2 2 2

cos cos cos− + +π

π π 00 1

2

22

π πt

T n

nt

T( )cos+ ⋅⋅⋅ +

aT

f t dtT

t dtT

T t dtTT T

T

T

0 0 0

2 2 2

2

12

12

= = + − =∫ ∫ ∫( ) ( ) ..

cos ( ) cosaT

tk t

Tdt

TT t

k t

Tk

T

T

T= + −∫

2 2 2 20

12

12

π π∫∫

= − − =

dt

T

k

k

T

k

k4

41 1

0

8

2

2 22

ππ

(( ) )

( )

if is even

22

2 2 2

if is oddk

bT

tk t

Tdt

TTk

= + −

.

cos (π

ttk t

Tdt

T

TT)cos .1

2

12 2

00 ∫∫ =

π


Then consider cos mt cos nt dt = 0 if m ≠ n. Similarly sin mt sin nt dt = 0

if m ≠ n. Finally cos mt sin nt dt = 0 for all values of m and n.

8. is defined for all in the interval immediately implies

and f is defined in the interval [0, T]. is a continuous function of for

in the interval since there it is the composition of two continuous functions, namely

f and Theorem 2 tells us that the trigonometric polynomial that minimizes

But then by making the change of variable we obtain Theorem 3 immediately.tT

=τ

2π

fT

g dτπ

τ τ2

2

−

( ) has coefficients00

2

0

21

2

1

π

π

πτπ π

τ

∫

∫=

=a fT

kr dr b fT

k kcos and220

2

πτ

π

∫ sin .kr d

g( )τg( )τ τπ

=2Τ

.

[ , ]0 2πτ

τfTτ

π2

0 ≤2

≤ ,Τ Ττ

π

[0, 2 ] because 0π τ π≤ ≤ 2τfTτ

π2

0

2π∫

0

2π∫0

2π∫


EXERCISE SET 11.21

1. (a) Equation (2) is

and Equation (3) is c1 + c2 + c3 = 1. These equations can be written in combinedmatrix form as

This system has the unique solution c1 = 1/5, c2 = 2/5, and c3 = 2/5. Because thesecoefficients are all nonnegative, it follows that v is a convex combination of the vectorsv1, v2, and v3.

(b) As in part (a) the system for c1, c2 and c3 is

which has the unique solution c1 = 2/5, c2 = 4/5, and c3 = –1/5. Because one of thesecoefficients is negative, it follows that v is not a convex combination of the vectors v1,v2, and v3.

1 3 4

1 5 2

1 1 1

2

4

1

1

2

3

=

c

c

c

1 3 4

1 5 2

1 1 1

3

3

1

1

2

3

=

c

c

c

.

c c c1 2 31

1

3

5

4

2

3

3

+

+

=

737

1. (c) As in part (a) the system for c1, c2 and c3 is

which has the unique solution c1 = 2/5, c2 = 3/5, and c3 = 0. Because these coefficients are all nonnegative, it follows that v is a convex combination of the vectorsv1, v2, and v3.

(d) As in part (a) the system for c1, c2 and c3 is

which has the unique solution c1 = 4/15, c2 = 6/15, and c3 = 5/15. Because thesecoefficients are all nonnegative, it follows that v is a convex combination of the vectorsv1, v2, and v3.

2. For both triangulations the number of triangles, m, is equal to 7; the number of vertexpoints, n, is equal to 7; and the number of boundary vertex points, k, is equal to 5. Equation(7), m = 2n – 2 – k, becomes 7 = 2(7) – 2 – 5, or 7 = 7.

3. Combining everything that is given in the statement of the problem, we obtain:

w = Mv + b = M(c1v1 + c2v2 + c3v3) + (c1 + c2 + c3)b

= c1(Mv1 + b) + c2(Mv2 + b) + c3(Mv3 + b) = c1w1 + c2w2 + c3w3.

4. A triangulation of the points in Figure 11.21.3 in which the points v3, v5, and v6 form thevertices of a single triangle is given in diagram (a), and a triangulation in which the pointsv2, v5, and v7 do not form the vertices of a single triangle is given in diagram (b):

3 2 3

3 2 0

1 1 1

1

0

1

1

2

3

−−

=

c

c

c

3 2 3

3 2 0

1 1 1

0

0

1

1

2

3

−−

=

c

c

c


5. (a) Let Then the three matrix equations Mvi+ b =

wi, i =1, 2, 3, can be written as the six scalar equations

m11 + m12 + b1 = 4

m21 + m22 + b2 = 3

2m11 + 3m12 + b1 = 9

2m21 + 3m22 + b2 = 5

2m11 + m12 + b1 = 5

2m21 + m22 + b2 = 3

The first, third, and fifth equations can be written in matrix form as

and the second, fourth, and sixth equations as

1 1 1

2 3 1

2 1 1

3

5

3

21

22

2

=

m

m

b

.

1 1 1

2 3 1

2 1 1

4

9

5

11

12

1

=

m

m

b

.

Mm m

m m

b

b=

=

11 12

21 22

1

2and bb .

v1 v2 v1 v2

v7 v7v6 v6

v4 v4

v3 v3

v5 v5

(a) (b)



The first system has the solution m11 = 1, m12 = 2, b1 = 1 and the second

system has the solution m21 = 0, m22 = 1, b2 = 2. Thus we obtain

(b) As in part (a), we are led to the following two linear systems:

Solving these two linear systems leads to

(c) As in part (a), we are led to the following two linear systems:

and


(d) As in part (a), we are led to the following two linear systems:

0 2 1

2 2 1

4 2 1

5 2

711

12

1− −

=m

m

b

22

7 2−

M =

=

−

1 0

0 1

2

3and b .

−

=−2 1 1

3 5 1

1 0 1

221

22

2

m

m

b

22

3−

.

−

=2 1 1

3 5 1

1 0 1

0

511

12

1

m

m

b 33

M =−

=

3 1

1 1

0

1and b .

−

=−2 2 1

0 0 1

2 1 1

8

0

5

11

12

1

m

m

b

−

and

2 2 1

0 0 1

2 1 1

21

22

2

m

m

b

=

1

1

4

.

=

1

2bb .

M =

1 2

0 1and

and


6. (a) From Section 3.1, a point v lies on the line segment connecting a and b if v =a + t(b – a) for some t with 0 ≤ t ≤ 1. This can be written as v = (1 – t) a + tb orv = c1a + c2b where c1 = 1 – t and c2 = t. We note that c1 and c2 are nonnegativenumbers such that c1 + c2 = 1.

(b) If c1 + c2 = 0, then c1 = c2 = 0 and so c1a + c2b = 0 and we are finished. If c1 + c2 > 0, set

d1 = c1/(c1 + c2) and d2 = c2/(c1 + c2). Then d1 and d2 both lie in the interval [0, 1] and

d1 + d2 = 1. Consequently, the vector w = d1a + d2b lies on the line segment connecting

the vectors a and b. The vector (c1 + c2)w then lies on the line segment connecting the

origin and the vector w since 0 < c1 + c2 ≤ 1. Because this line segment is contained in

the triangle connecting the origin and the tips of the vectors a and b, it follows that the

vector (c1 + c2)w, or c1a + c2b, is also contained in that triangle.

(c) Because c1, c2, and c3 are nonnegative numbers such that c1 + c2 + c3 = 1, it follows

that 0 ≤ c1 + c2 ≤ 1. As per the hint, set a = v1 – v3 and b = v2 – v3. Then from part (b)

the vector w = c1a + c2b lies in the triangle connecting the origin and the tips of the

vectors a and b. The vector w + v3 then lies on the displaced triangle connecting the

vectors v3, a + v3(= v1), and b + v3(= v2). Finally,

w + v3 = c1a + c2b + v3 = c1(v1 – v3) + c2(v2 – v3) + v3

= c1v1 + c2v2 + (1 – c1 – c2)v3 = c1v1 + c2v2 + c3v3.

7. (a) The vertices v1, v2, and v3 of a triangle can be written as the convex combinations

v1 = (1)v1 + (0)v2 + (0)v3, v2 = (0)v1 + (1)v2 + (0)v3, and v3 = (0)v1 + (0)v2 + (1)v3.

In each of these cases, precisely two of the coefficients are zero and one coefficient is

one.

(b) If, for example, v lies on the side of the triangle determined by the vectors v1 and v2then from Exercise 6(a) we must have that v = c1v1 + c2v2 + (0)v3 where c1 + c2 = 1.Thus at least one of the coefficients, in this example c3, must equal zero.

M =

=

−

1 2 1

2 0

1 2

1and bb .

0 2 1

2 2 1

4 2 1

1

321

22

2− −

=−m

m

b −−

9

.



(c) From part (b), if at least one of the coefficients in the convex combination is zero,then the vector must lie on one of the sides of the triangle. Consequently, none of thecoefficients can be zero if the vector lies in the interior of the triangle.

8. Consider the vertex v1 of the triangle and its opposite side determined by the vectors v2 andv3. The midpoint vm of this opposite side is (v2 + v3)/2 and the point on the line segmentfrom v1 to vm that is two-thirds of the distance to vm is given by

1

3

2

3

1

3

2

3 2

1

3

1

3

1

31 12 3

1 2vv vv vvvv vv

vv vv vv+ = ++

= + +m 33.

linear algebra

Documents

Transcript of linear algebra