1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0...

12
1.1. THE LORENTZ TRANSFORMATION 41 v 3 c. Proof. lim v 1 ,v 2 c v 3 = 2c 2 = c. And also lim v 2 c = v 1 +c 1+ v 1 c = c(v 1 +c) v 1 +c = c. lim c→∞ v 3 = v 1 + v 2 which establishes the addition theorem for velocities in the non- relativistic limit. 1.1.5 Minkowski Metric The modern standard representation of the Lorentz transformation is different from both the ones we have established in subsection 1.1.3. Let us review the scalar product of the SpaceTime vector x ıct with itself (Eq. (1.30)): ( x ıct ) x ıct = ( x ıct ) γ -ı v c γ ı v c γ γ 1 0 0 1 γ ı v c γ -ı v c γ γ x ıct = ( x ıct ) γ 2 - v 2 c 2 γ 2 0 0 - v 2 c 2 γ 2 + γ 2 x ıct = ( x ıct ) 1 0 0 1 x ıct = x 2 - c 2 t 2 (1.43) where we have used the usual 3-dimensional Euclidean metric for the scalar product, a unit matrix 11 . But it is possible to write the same scalar product for real-valued coordinate axes by changing the form of the metric 12 . Obviously, ( x ıct ) 10 01 x ıct = ( x ct ) 1 0 0 -1 x ct where a new non-unity metric for the scalar product has been intro- duced. Moreover, as we have seen earlier, for the conservation of the scalar product x 2 - c 2 t 2 = x 2 - c 2 t 2 (Eq. (1.32)) the global sign is irrelevant; what matters is only the relative sign between spatial and 11 in the present case 2-dimensional. 12 Likewise, the scalar product over two vectors in a vector space when changing from orthogonal to non-orthogonal axes can be conserved by introducing an accompanying change of the metric.

Transcript of 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0...

Page 1: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

1.1. THE LORENTZ TRANSFORMATION 41

• v3 ≤ c.Proof. lim

v1,v2→cv3 = 2c

2 = c. And also limv2→c

= v1+c

1+v1c

= c(v1+c)v1+c = c.

• limc→∞

v3 = v1 + v2

which establishes the addition theorem for velocities in the non-relativistic limit.

1.1.5 Minkowski Metric

The modern standard representation of the Lorentz transformation isdifferent from both the ones we have established in subsection 1.1.3.Let us review the scalar product of the SpaceTime vector

(x′

ıct′

)with

itself (Eq. (1.30)):(x′ ıct′

) ( x′

ıct′

)=

(x ıct

) ( γ −ıvcγıvc γ γ

) (1 00 1

) (γ ıvcγ−ıvc γ γ

) (xıct

)

=(x ıct

)(γ2 − v2

c2γ2 0

0 −v2

c2γ2 + γ2

) (xıct

)

=(x ıct

) ( 1 00 1

) (xıct

)= x2 − c2t2 (1.43)

where we have used the usual 3-dimensional Euclidean metric for thescalar product, a unit matrix11.

But it is possible to write the same scalar product for real-valuedcoordinate axes by changing the form of the metric12. Obviously,

(x ıct

) ( 1 0

0 1

) (x

ıct

)=(x ct

) ( 1 0

0 −1

) (x

ct

)

where a new non-unity metric for the scalar product has been intro-duced. Moreover, as we have seen earlier, for the conservation of thescalar product x2 − c2t2 = x′2 − c2t′2 (Eq. (1.32)) the global sign isirrelevant; what matters is only the relative sign between spatial and

11in the present case 2-dimensional.12Likewise, the scalar product over two vectors in a vector space when changing from orthogonal to non-orthogonal

axes can be conserved by introducing an accompanying change of the metric.

Page 2: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

42 CHAPTER 1. SPECIAL THEORY OF RELATIVITY

time coordinates. So we are free to choose the metric according to

(ct x

) ( 1 0

0 −1

) (ct

x

)= c2t2 − x2 (1.44)

where the “minus” sign is on the spatial coordinate instead. This metrichas become the modern standard13. We will call

g =

1 0 0 0

0 −1 0 0

0 0 −1 0

0 0 0 −1

(1.45)

modern Minkowski metric, for the four dimensions of SpaceTime,and use it from here onward. Moreover, in the new basis of Space-Time (where the time coordinate also becomes the first coordinate)the Lorentz transformation matrix changes. The reader can easily ver-ify the equivalence of the two representations:

(ıct′

x′

)=

(γ −ıvcγıvcγ γ

) (ıct

x

)with complex axes (1.46)

(ct′

x′

)=

(γ −v

−vcγ γ

) (ct

x

)with real axes (1.47)

From now on we use the designation Λ(v) :=

(γ −v

−vcγ γ

)for

Lorentz boosts.

1.1.6 Lorentz transformation in Terms of Rapidity

There is a convenient way of expressing the Lorentz boost which isuseful in the context of combined Lorentz transformations and theLorentz group. If we define the “rapidity”

13It has been the standard for at least 50 years. A landmark text that still uses the old unit metric is Bethe andSalpeter, “Quantum Mechanics of One- and Two-Electron Atoms”.

Page 3: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

1.1. THE LORENTZ TRANSFORMATION 43

Φx := arctanh(vc

)(1.48)

then it follows thatv

c= tanh Φx

v2

c2=

cosh2 Φx − 1

cosh2 Φx

γ = cosh Φxv

cγ = sinh Φx (1.49)

and so we obtain for the modern representation of the Lorentz trans-formation(

γ −vcγ

−vcγ γ

) (ct

x

)=

(cosh Φx − sinh Φx

− sinh Φx cosh Φx

) (ct

x

)(1.50)

It is then a straightforward exercise to prove the following identity:

Λ(Φx1)Λ(Φx2) = Λ(Φx)

withΦx1 + Φx2 = Φx (1.51)

Eq. (1.51) shows that the double boost occurring, e.g., in the derivationof the velocity addition theorem can — using the rapidity parameter —be written conveniently as a new Lorentz transformation where the newrapidity Φx simply is the sum of the two original rapidities. This re-sembles the formal situation for the addition of velocities in Newtonianmechanics using the Galilei transformation.

Page 4: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

44 CHAPTER 1. SPECIAL THEORY OF RELATIVITY

1.2 Four-Vectors in SpaceTime

1.2.1 Inverse Lorentz Transformation

If we want to represent the Lorentz boost from frame K’ into frame K,we need the inverse Lorentz transformation matrix. It can be derivedstraightforwardly from Eq. (1.47) to be

(ct

x

)=

(γ v

cγvcγ γ

) (ct′

x′

). (1.52)

Therefore, Λ−1(v) = Λ(−v). This result is physically intuitive, sincethe origins of K and K’ propagate in opposite directions relative tothe respective other frame. It is easily verified that Λ−1(v) Λ(v) =

Λ(v) Λ−1(v) = 114 for the four-dimensional case. We are now in theposition to introduce general four-vectors in Minkowski SpaceTime.

1.2.2 Four-Vectors (Co- and Contravariant)

1.2.2.1 Position four-vector

We have already given the time coordinate in SpaceTime physical di-mension of length (through the multiplication with the speed of light),and so, with x0 = ct, it is suggested to introduce a position four-vector as

x :=

x0

x1

x2

x3

With components {xµ} (1.53)

where the x0 is the time-like component and xk, k ∈ {1, . . . , 3} arethe cartesian space-like components14. Then we can write the Lorentzboost also as

x′ = ΛK→K’ x (1.54)14In the present case of the position vector, the time-like component is actually time itself and the space-like

components represent space itself.

Page 5: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

1.2. FOUR-VECTORS IN SPACETIME 45

or, using component notation,

x′ν =

3∑

µ=0

Λνµ xµ ∀ν ∈ {0, . . . , 3} (1.55)

This is the Minkowski-space equivalent of the transformation law of avector, a three-tensor of rank 1, in real space! Here we transform afour-vector in Minkowski space.

From now on, we will use Einstein summation convention whichis defined as a sum in Minkowski space (or coordinate space) overduplicate indices in any given term. Then,

x′ν = Λνµ xµ. (1.56)

From the above it immediately follows that∂x′ν∂xµ

= Λνµ. (1.57)

Conversely, from Eq. (1.56), we have(Λ−1)κνx′ν =

(Λ−1)κν

Λνµ xµ

= δκµ xµ

= xκ(Λ−1)µνx′ν = xµ. (1.58)

and so∂xµ∂x′ν

=(Λ−1)µν. (1.59)

1.2.2.2 General Contra- and Covariant Four-Vectors

Let us now take a first look at fields in special relativity. Be ϕ(x) ascalar differentiable field with x the position four-vector in frame K.We regard the derivative with respect to coordinates in K’, i.e.,

∂ϕ(x)

∂x′µ=∂ϕ(x)

∂xκ

∂xκ∂x′µ

(1.60)

Page 6: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

46 CHAPTER 1. SPECIAL THEORY OF RELATIVITY

since ϕ (x(x′)), where in the second equality the chain rule and Einsteinsummation have been used. This means that we can write for thedifferential operator

∂x′µ=

∂xκ

∂xκ∂x′µ

=∂

∂xκ

(Λ−1)κµ

(1.61)

where we have used Eq. (1.59). If we compare this result with Eq.(1.56), we see a difference: The components of the position four-vectorare transformed from K to K’ via the transformation matrix Λ, but thecomponents of the derivative vector with respect to position transform— also from K to K’ — via the inverse transformation matrix Λ−1! Thisimplies that there are, generally speaking, two types of four-vectors inrelativity theory15:

1. The contravariant components of four-vectors transform fromK to K’ as to the above Lorentz transformation. They are, byconvention, written with upper indices, aµ.

2. The covariant components of four-vectors transform from K toK’ as to the inverse Lorentz transformation. They are converselywritten with lower indices, bν.

Let us now consider the scalar product of a covariant four-vector witha contravariant four-vector in Minkowski SpaceTime. For consistencywith general matrix algebra, we will here consider covariant vectorsas row vectors, i.e., x′µ = xν

(Λ−1)νµand always sum over repeated

upper and lower indices. Then

b′µaµ′ = bν

(Λ−1)νµ

Λµκ a

κ

= bν δνκ a

κ (1.62)= bν a

ν = bµ aµ. (1.63)

15The reader may argue that this situation actually already exists in non-relativistic physics for position andgradient vectors, for example. This is true, but for a Euklidean metric there is no difference between the contra-and covariant components of a vector!

Page 7: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

1.2. FOUR-VECTORS IN SPACETIME 47

We find that the scalar product of a covariant with a contravariantvector is invariant under Lorentz transformation. bν a

ν is thereforecalled a Lorentz scalar. It can without difficulties be shown thataµ′b′µ = aνbν, and so the scalar product of any contravariant with any

covariant four-vector is also a Lorentz scalar.The result in Eq. (1.63) for general four-vectors will be of utmost

importance in the construction of relativistic theories.

1.2.2.3 Relationship Between Contra- and Covariant Four-Vectors

In Eq. (1.44) had agreed that the scalar product involves the “modernmetric”, {gµν}. It shall by definition have the property gµν = gµν. Fromthis it directly follows that

(gg)κµ = gµν gνκ = δκµ (1.64)

where δκµ is the usual Kronecker delta symbol.Let us now see how contra- and covariant components of a four-

vector are related to each other. The expression bν aν is a Lorentz

scalar. So is bν aν. Since these two Lorentz scalars are made up of thesame four-vectors, they should be identical. We now suppose that

bν aν = bµ gµν gνκ a

κ (1.65)

which can now simply be calculated:

bµ gµν gνκ a

κ = bµ δµκ a

κ = bµ aµ = bν a

ν (1.66)

This shows that we have the following relationships between co- andcontravariant indices of a given four-vector:

bν = bµ gµν (1.67)

aν = gνκ aκ (1.68)

In other words, contravariant and covariant vectors of the same typediffer in their signs on the space-like components. For the case of the

Page 8: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

48 CHAPTER 1. SPECIAL THEORY OF RELATIVITY

position four-vector we have explicitly

xµ=0 = ct (1.69)xµ=1 = x (1.70)xµ=0 = ct (1.71)xµ=1 = −x (1.72)

Choosing the contravariant vector components to correspond to non-relativistic notation is a plain matter of convention. This means thatthe previous formulation of the position four-vector in the usual non-covariant notation will now be changed. We choose the same four-vector to be defined by its contravariant components (upper indices),as

x :=

x0

x1

x2

x3

With components {xµ} (1.73)

As a known example, consider the scalar product of the positionvector with itself. Obviously,

x′µ xµ′ = xµ

′x′µ = xν xν = xν x

ν (1.74)(ct′)2 − (x′)2 − (y′)2 − (z′)2 = ct2 − x2 − y2 − z2

which is just identical to Eq. (1.44), confirming the logical consistencyof the formalism.

The generalization of these concepts to rank-n four-tensors is straight-forward and will be useful for the reformulation of electrodynamics andin the framework of general relativity, as well as in many other fieldsof physics.

Page 9: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

1.2. FOUR-VECTORS IN SPACETIME 49

1.2.3 SpaceTime Diagrams

The phenomenological consequences of the Lorentz tranformation canbe conveniently visualized through a technique already introduced byMinkowski: SpaceTime diagrams. A reduction to two dimensions –time and one spatial dimension – is sufficient for many purposes.

Figure 1.8:ct

x

eventct

x

0

0

S

An event is represented by a point in SpaceTimewith coordinates {ct0, x0}.

Figure 1.9:

ct

x

S

x1

worldlineAn observer/object at rest at position x1 is repre-sented by a worldline in the SpaceTime diagram.

Page 10: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

50 CHAPTER 1. SPECIAL THEORY OF RELATIVITY

Figure 1.10:

ct

xx1x0

S

worldlineof moving object

θ

ct

ct0

1

An object moving in x-direction with velocity v incoordinates of S. This velocity follows from graphicalobservation: tanϑ = x1−x0

ct1−ct0 = v(t1−t0)c(t1−t0) = v

c. So ϑ =

arctan(vc

).

Since 1 ≥ vc≥ 0 we have

max(arctan

(vc

))=

max(ϑ) = π4.

Figure 1.11:ct

xx1x0

ct1

light cone

S

E

E’Emission of a light pulse as event at {ct0, x0} andpropagation of the light pulse in frame S. EventE cannot be causally connected with an eventat {ct0, x0}, because information cannot propa-gate faster than with c. Event E ′ can be causallyconnected with an event at {ct0, x0} because itis inside the light cone.

As a simple illustration for the scenario with event E, imagine adistant observer appearing within δt = εt around t0 at xE > x1. Theobserver cannot see the light pulse because he has “disappeared” beforethe light pulse can reach his position.

1.2.4 Space-, Light-, and Time-Like Four-Vectors

With the establishments of the preceding subsections we can revisitfour-vectors in the context of SpaceTime diagrams and come to a coupleof interesting conclusions.

Page 11: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

1.2. FOUR-VECTORS IN SPACETIME 51

Figure 1.12:

dx =cdt0

1dx =dx

{x }(1)µ

{dx }µ

{x }(2)µ

ct

x

S

{

{

SpaceTime distance dx

Two events (1) and (2) and their SpaceTime distancerepresented by vectors in a two-dimensional vectorspace.

From Fig. (1.12) we infer that

dx0 = x0(1)− x0(2) = c(t1 − t2) = cdt

dx1 = x1(1)− x1(2) = x1 − x2 = dx.

16 The scalar product of the four-vector dx with itself is then

D := (ds)2 = dxµdxµ = dxµ gµν dxν

=(c(t1 − t2) x1 − x2

) ( 1 0

0 −1

) (c(t1 − t2)

(x1 − x2)

)

=(c(t1 − t2) x1 − x2

) ( c(t1 − t2)

−(x1 − x2)

)(1.75)

= c2(t1 − t2)2 − (x1 − x2)2. (1.76)

D is a Lorentz scalar (it is the scalar product of a contra- and a co-variant four-vector), i.e., all of our following conclusions are Lorentzinvariant. We distinguish three general cases:

D = 0 From this it follows that c2(t1− t2)2 = (x1−x2)2 which means thatds is on a light cone.Four-vectors v with vµvµ = 0 are called “light-like” four-vectors.

16The notation dx0 here means dxµ=0.

Page 12: 1.1. THE LORENTZ TRANSFORMATION 41 3dirac.ups-tlse.fr/fleig/courses/Nuclear/lecture3.pdf · x 0 {ct0 with itself (Eq. (1.30)): x 0 {ct0 x 0 {ct0 = x {ct {v c {v c 1 0 0 1 {v c {v

52 CHAPTER 1. SPECIAL THEORY OF RELATIVITY

D < 0 This would correspond to the case shown in Fig. (1.12) if we wouldassume the events to be the points chosen for illustration. Then,if we suppose that for the Lorentz-transformed spatial componentsxk′(1)− xk′(2) = 0 ∀k ∈ {1, . . . , 3}, then⇒ D = c2(t′1− t′2)2 ≥ 0

which is in contradiction with the assumption. This means thatgiven D < 0 there is no reference frame K’ in which the two eventsoccur at the same position and chronologically.On the other hand, if t′1 − t′2 = 0 ⇒ D = −(x′1 − x′2)2 ≤ 0.Therefore, a reference frame K’ exists such that the two eventsoccur simultaneously but at different positions. D is ensuinglya “space-like” interval17. Note that in this case event 1 is outsidethe light cone based in event 2.Four-vectors v with vµvµ < 0 are called “space-like” four-vectors.

D > 0 In this case, if t′1 − t′2 = 0, i.e., in a frame S’ the two events occursimultaneously, then ⇒ D = −(x′1 − x′2)2 ≤ 0 contradicts theassumption and the two events cannot occur simultaneously in anyK’.However, if xk′(1)− xk′(2) = 0→ D = c2(t′1 − t′2)2 > 0 is possiblewhich means that there exists a frame K’ in which the two eventsoccur at the same point in space and chronologically. Theycan be causally connected (inside the respective light cone).Four-vectors v with vµvµ > 0 are called “time-like” four-vectors.

17to be distinguished from the space-like components of a four-vector which is quite a different thing