Relativistic Waves and Quantum Fields: Andreas Brandhuber
Relativistic Waves and Quantum Fields: Andreas Brandhuber
Andreas Brandhuber
Department of Physics
Queen Mary, University of London
Mile End Road, London, E1 4NS
United Kingdom
email: [email protected]
Abstract
Lecture notes for the course Relativistic Waves and Quantum Fields MSci 4242/QMUL
PHY-415. (These are preliminary notes and may contain typos, so use with care!)
Contents
1
2 CONTENTS
2.18 Neutrinos . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
The fundamental challenge of every physical theory is to describe Nature and its phenom-
ena in as much detail as possible. Furthermore, this should be possible for the microcosm
and the macrocosm i.e. we are looking for theories that are valid at short distance scales
and long distance scales.
The deficiencies of the classical viewpoint are most pronounced in the physics of the
microcosm and in the subnuclear regime. Many phenomena of molecular physics, atomic
physics, nuclear physics and elementary particle physics simply cannot be explained using
ideas from classical physics. Also the inclusion of Quantum Mechanics (QM) alone is often
insufficient, to explain physical problems of the microcosm. As was realised in the last
century, only the inclusion of Einstein’s Theory of Special Relativity (SR) leads to the
desired success.
Today Quantum Field Theory (QFT) provides the unified framework to describe all
particles and its interactions (forces) that we observe in Nature, including electromag-
netism, the weak and the strong nuclear force, and probably gravity. QFT was born out
of the attempt to combine the rules of Quantum Mechanics with the principles of Special
Relativity and field theory. The prime example for the success of QFT is Quantum Elec-
trodynamics (QED), which with its extremely accurate predictions of physical quantities,
like the anomalous magnetic moment of the electron, is also called ”the jewel of physics”.
3
4 CHAPTER 0. WHY QUANTUM FIELD THEORY?
Chapter 1
In this chapter we want to review and introduce basic concepts of Quantum Mechanics and
Special Relativity. At this stage we treat these two topics separately before we attempt
to unify them in Chapter 2 trying to preserve most features of usual non-relativistic QM.
This will lead only to partial success since a couple of conceptual problems will arise.
As was understood in the last century, the main reason for these problems is that the
inclusion of SR implies the existence of anti-particles and leads to a multi-particle theory
in which probability is not preserved, because particles can be destroyed and created
(death and birth of particles). These issues can be addressed properly if we adopt the
formalism of Quantum Field Theory, which will be developed from Chapter 3 on.
In this section I will remind you of the most important axioms of Quantum Mechanics
(QM). I expect that you are familiar with most of these facts, but it is good to repeat
them to see which ones we will be forced to give up once we include SR.
Axioms of QM:
• The state of the system is represented by a wave function |Ψi (also called a
vector in Hilbert space). Note that Ψ = Ψ(qi , si , t) is a function of the coordinates
5
6 CHAPTER 1. QUANTUM MECHANICS AND SPECIAL RELATIVITY
of individual particles qi , some internal degrees of freedom like spin si and, if we are
in the Schödinger picture also on time t (if you are not familiar with this concept
either wait until I will review it later in the course or check your favourite book
on QM). Most importantly Ψ itself has no direct physical meaning whatsover; only
expression like the probability density ρ = |Ψ|2 ≥ 0.
Z X
hAi = hΨ|A|Ψi
b = d3 xΨ∗ AΨ
b = |an |2 λn . (1.1.1)
n
∂Ψ∗
Z Z
d d ∗ ∂Ψ
hΨ|Ψi = Ψ Ψ= ( Ψ + Ψ∗ Ψ)
dt dt ∂t ∂t
Z
i b ∗ i b
= ((− HΨ) Ψ + Ψ∗ (− HΨ))
~ ~
Z
i b ∗ Ψ − Ψ∗ HΨ) i b
= ((HΨ) b = (hHΨ|Ψi − hΨ|HΨi)
b = 0 . (1.1.2)
~ ~
1.1. QUANTUM MECHANICS 7
Therefore, if
[A,
b H]
b = 0, (1.1.5)
then
dhAi
= 0, (1.1.6)
dt
1
i.e. the QM operator A
b corresponding to the observable A obeys ∂ A/∂t
b = 0.
8 CHAPTER 1. QUANTUM MECHANICS AND SPECIAL RELATIVITY
Translational Invariance
First consider a single particle. If ~x is the position vector, then a translation is the
operation
~x → ~x0 = ~x + ~a . (1.1.7)
If H
b is invariant then
b x0 ) = H(~
H(~ b x + ~a) = H(~
b x) . (1.1.8)
For an infinitesimal displacement we can make a Taylor expansion2
b x + ~a) ∼
H(~ = H(~ ~ H(~
b x) + ~a · ∇ b x) , (1.1.9)
b x + ~a) − H(~
0 = H(~ ~ H(~
b x) = ~a · ∇ b x) . (1.1.10)
In general for the momentum operator P~ and any other operator O(~
b x) we have
b
where we have suppressed the explicit ~x and t dependence. Since eqn. (1.1.11) is true for
arbitrary wavefunctions Ψ
[P~ , O(~
x)] = −i~∇~ O(~
b x) . (1.1.12)
b b
In particular for O
b=H
b
[P~ , H] ~H
= −i~∇ b. (1.1.13)
b b
Now,
~H
0 = −i~~a · ∇ ~ , H]
b = ~a · [P
b b (1.1.14)
and since this is true for an arbitrary displacement vector ~a we find
[P~ , H] = 0. (1.1.15)
b b
~i
dhP
We conclude that momentum is a conserved quantity dt
= 0 if the H
b is translationally
invariant.
Take e.g.
2
b =−~ ∇
H ~ 2 + V (~x) (1.1.16)
2m
2
The symbol ∼
= indicates that we only expand to first order in ~a and suppress all higher order terms.
1.1. QUANTUM MECHANICS 9
which is only true for a (trivial) constant potential, i.e. for a free particle. Thus, the
momentum of a free particle is conserved in QM in the sense
dhP~ i
= 0. (1.1.20)
dt
Consider now a two particle system (easily generalised to N particles). If the two
particles have position vectors ~x1 and ~x2 , the invariance condition for the translation of
the system through ~a reads
H(~
b x1 , ~x2 ) = H(~
b x1 + ~a, ~x2 + ~a) . (1.1.21)
P~ = P~ 1 + P~ 2 , (1.1.24)
b b b
where
P~ 1 = −i~∇
~ 1 and P
~ 2 = −i~∇
~2. (1.1.25)
b b
and so
[P~ , O(~ ~1+∇
x1 , ~x2 )] = −i~(∇ ~ 2 )O(~
b x1 , ~x2 ) . (1.1.27)
b b
~1+∇
0 = −i~~a · (∇ ~ 2 )H ~ , H]
b = ~a · [P
b b . (1.1.28)
Since this must be true for arbitrary translation vector ~a, we have
[P~ , H] = 0, (1.1.29)
b b
~i
dhP
and total momentum is conserved in the sense of QM i.e. dt
= 0.
Rotational Invariance
Take spherical polar coordinates and take the axis of rotation to be the z-axis. Spec-
ify the position vector ~x in spherical polar coordinates (r, θ, φ) of the point. Then the
symmetry operation ~x → ~x0 corresponding to a rotation by an angle α about the z−axis
is
(r, θ, φ) → (r0 , θ0 , φ0 ) = (r, θ, φ + α) . (1.1.30)
For the Hamiltonian to be invariant under rotations about the z-axis
b x0 ) = H(~
H(~ b x)
b 0 , θ0 , φ0 ) = H(r,
⇐⇒ H(r b θ, φ)
⇐⇒ H(r,
b θ, φ + α) = H(r, b θ, φ) . (1.1.31)
For an infinitesimal rotation
b θ, φ + α) ∼
H(r, b θ, φ) + α ∂ H(r,
= H(r, b θ, φ) , (1.1.32)
∂φ
and, hence, for invariance of H
b
0 = b θ, φ + α) − H(r,
H(r, b θ, φ) = α ∂ H(r,
b θ, φ)
∂φ
∂ b
−→ H(r, θ, φ) = 0 . (1.1.33)
∂φ
The z−component of the orbital AM operator written in spherical coordinates is3
bz = −i~ ∂
L (1.1.34)
∂φ
3
In cartesian coordinates ~x = (x, y, z): L
b z = (~x ~ )z = −i~(x ∂ − y ∂ ) . The other components, L
b×P
b bx
∂y ∂x
and Lb y , can be obtained by cyclic permutation of (x, y, z).
1.1. QUANTUM MECHANICS 11
b = −i~[ ∂ , O]
b = −i~( ∂ (OΨ) b ∂ Ψ) = −i~ ∂ O Ψ .
b
[L
bz , O]Ψ b −O (1.1.35)
∂φ ∂φ ∂φ ∂φ
b = −i~ ∂ O ,
b
[L
bz , O] (1.1.36)
∂φ
b = −i~ ∂H
b
[L
bz , H] . (1.1.37)
∂φ
If the Hamiltonian is invariant under rotation about the z-axis, we now conclude that
[L
bz , H]
b = 0. (1.1.38)
We can define angles φx and φy analogous to φz ≡ φ for rotations about the x and y-axis.
If H
b is also invariant under rotations about the x and y-axis we will conclude that
[L
bx , H]
b = [L
by , H]
b = [L
bz , H] ~b H]
b = 0 i.e. [L, b = 0. (1.1.39)
~2 ~ 2
H=−
b ∇ + V (|~x|) . (1.1.41)
2m
~ 2 = ∇·
Because ∇ ~ ∇~ and |~x| are scalars, so is the Hamiltonian and orbital AM is conserved.
H
b = H(r
b 1 , θ1 , φ1 ; r2 , θ2 , φ2 ) . (1.1.42)
12 CHAPTER 1. QUANTUM MECHANICS AND SPECIAL RELATIVITY
The invariance condition for rotation of the system by an angle α about the z-axis is
H(r
b 1 , θ1 , φ1 ; r2 , θ2 , φ2 ) = H(r
b 1 , θ1 , φ1 + α; r2 , θ2 , φ2 + α) , (1.1.43)
0 = b 1 , θ1 , φ1 + α; r2 , θ2 , φ2 + α) − H(r
H(r b 1 , θ1 , φ1 ; r2 , θ2 , φ2 )
= α(∂φ1 + ∂φ2 )H b
−→ (∂φ1 + ∂φ2 )H
b = 0. (1.1.45)
The z-components of the orbital AM operator for the two particles are
b1z = −i~∂φ1 , L
L b2z = −i~∂φ2 . (1.1.46)
L
bz = L
b1z + L
b2z . (1.1.47)
[L b = −i~(∂φ1 + ∂φ2 )O
bz , O] b, (1.1.48)
and in particular
[L b = −i~(∂φ1 + ∂φ2 )H
bz , H] b. (1.1.49)
If the Hamiltonian is invariant under rotations about the z-axis, we now conclude that
[L
bz , H]
b = 0. (1.1.50)
[L
bx , L
by ] = i~L
b z , [L
bx , L
by ] = i~L
b z , [L
bx , L
by ] = i~L
bz . (1.1.53)
1.2. SPECIAL RELATIVITY 13
An atom with atomic number Z with Coulomb forces between the nucleus and the
electrons and between the electrons is an example of a system with the necessary rotational
invariance for conservation of AM. In this case:
Z Z Z
~2 X ~ 2 X Ze2 X e2
H=−
b ∇i − + , (1.1.54)
2m i=1 i=1
4π0 |~xi | i,j=1,i<j 4π0 |~xi − ~xj |
where ∇~ i acts on the coordinates of the i-th electron, and ~xi is the position vector of
the i-th electron relative to the nucleus. As before, ∇ ~ 2i and |~xi | are scalars (rotationally
invariant) and so is |~xi − ~xj |, so that H
b is rotationally invariant.
J~ = L
b ~b b ~
+S (1.1.55)
that commutes with the Hamiltonian
b~ b
[J, H] = 0 (1.1.56)
b is rotationally invariant, and then hJz i and hJ~2 i are constants of motion.
if H
For the atomic (non-relativistic) Hamiltonian above, hLz i and hL ~ 2 i are also constants
of motion, because the spin does not appear explicitly in the Hamiltion. This is a mani-
festation of the fact that spin is an effect of Special Relativity as we will see later in the
course. However, if we include the Spin-Orbit interaction due to Relativistic effects
1 1 dV ~b b ~,
H
b Spin−Orbit = L·S (1.1.57)
2m2e c2 r dr
Topics we are covering in this section include: Lorentz transformations, 4-vectors, covari-
ance, contravariance, forminvariance, Lorentz and Poincare group, Maxwell equations as
an example of a relativistic wave equations.
1. The speed of light c in vacuum is absolut. It is the same in all inertial frames (these
are reference frames moving at constant velocity with respect to each other).
2. Principle of Relativity. The laws of physics are the same in all inertial frames. Or
in other words there is no preferred reference frame and, in particular, there is no
absolute time.
Two observers in two different inertial frames K and K 0 using coordinate systems
(t, x, y, z) and (t0 , x0 , y 0 , z 0 ) should agree on the predicted results of all experiments. In
other words the form of all dynamical equations should be invariant (Forminvariance).
You are all familiar with Lorentz transformations (LT), say a boost in the x direction:
ct − vx/c
ct0 = p = γ(ct − βx)
1 − v 2 /c2
x − vt
x0 = p = γ(−βct + x)
1 − v 2 /c2
y0 = y
z0 = z (1.2.1)
p
with β = v/c and γ = 1/ 1 − v 2 /c2 . This LT can be neatly rewritten in matrix form as
0
ct γ −γβ 0 0 ct
x0 −γβ γ 0 0 x .
0 = (1.2.2)
y 0 0 1 0 y
z0 0 0 0 1 z
Note that general LTs are linear transformations of the coordinates. If we were to consider
general relativity we would have to relax this condition and consider general non-linear
transformations.
This explains why the speed of light is constant in two frames K and K 0 . Assume that
a spherical light wave starts at t = 0 expanding from the origin x = y = z = 0. Then for
t > 0 the equation c2 t2 − x2 − y 2 − z 2 describes a spherical wave front with radius r = ct
i.e. it is expanding with the speed of light. Obviously, using the identity derived above
this holds true also in the frame K 0 and actually for any other frame, hence the speed of
light is consant.
In order to deal with this one introduces a metric tensor η, which is a 4-by-4 matrix, and
two types of 4-vectors, covariant ones and contravariant ones.
because
1 0 0 0 ct ct
3
0 −1 0 0 −x = x = xµ .
X
η µν xν =
0 0 −1 0 −y y (1.2.12)
ν=0
0 0 0 −1 −z z
1 0 0 0 1 0 0 0
0 −1 0 0 0 −1 0 0
µν
The metric tensor ηµν = 0 0 −1 0 and the inverse metric η = 0 0 −1 0
0 0 0 −1 0 0 0 −1
are used to raise and lower indices:
X 3
η µν xν = xµ ,
ν=0
X 3
ηµν xν = xµ . (1.2.13)
ν=0
Finally, one introduces the Einstein summation convention in order to avoid writing the
boring sums over and over again: repeated upper and lower index pairs are automatically
summed over (unless stated otherwise). Note that this only makes sense if one index
is upper and the other is lower, otherwise one gets expressions that do not transform
properly under LTs.
Examples
c2 t2 − x2 − y 2 − z 2 = xµ xµ = xµ xµ = η µν xµ xν (1.2.15)
µ µν
x = η xν (1.2.16)
ν
xµ = ηµν x (1.2.17)
x0µ = Λµ ν xν , (1.2.18)
Therefore,
x0α = Λα β xβ with Λα β = ηαµ Λµ ν η νβ . (1.2.20)
Now the Lorentz transformations are those Λ’s that leave xµ xµ invariant. Note that a
priori Λ has 16 components which should be reduced to 6. Let’s see how this works. We
require that the norm of a 4-vector xµ is Lorentz invariant. So we have
which implies
ηµν Λµ α Λν β = ηαβ . (1.2.22)
This is a matrix equation for Λ with 16 components. But only 10 components are inde-
pendent because both sides of the equation are symmetric under exchange of α and β.
Hence the number of independent components of Λ is reduced to 16 − 10 = 6. This is
just the right number to incorporate 3 boosts and 3 spatial rotations!
T 0µ1 ...µn ν1 ...νm = Λµ1 α1 . . . Λµn αn Λν1 β1 . . . Λνm βm T α1 ...αn β1 ...βm . (1.2.23)
18 CHAPTER 1. QUANTUM MECHANICS AND SPECIAL RELATIVITY
This is the general transformation law for tensors that transform covariantly, however we
usually do not account tensors with more than two indices (unless we consider General
Relativity).
It is also important to know how differentials and partial derivatives transform under
LTs.
From
x0µ = Λµ ν xν (1.2.24)
we learn that
dx0µ = Λµ ν dxν , (1.2.25)
so dx0µ transforms like a contra-variant vector. It is also known that for general coordinate
transformations we have
∂x0µ ν
dx0µ = dx , (1.2.26)
∂xν
0µ
where for the special case of LTs ∂x∂xν
= Λµ ν .
where we call Kx the generator of the boost in the x-direction. (Check for yourself
that exp(−iωKx ) gives back the finite Lorentz transformation.) Similarly we find the
generators for boosts in the y and z direction.
0 0 −i 0 0 0 0 −i
0 0 0 0
, Kz = 0 0 0 0 .
Ky = −i 0 0 0 0 0 0 0 (1.2.32)
0 0 0 0 −i 0 0 0
where Jx is the generator for rotations around the x axis. Similarly we find the generators
for rotations around the y and z axis.
0 0 0 0 0 0 0 0
, Jz = 0 0 −i 0 .
0 0 0 i
Jy =
0 0 0 0 0 i 0 0 (1.2.34)
0 −i 0 0 0 0 0 0
The boost and rotation generators obey the following set of commutation relations.
Note that the first line is just the standard SU (2) algebra for angular momentum opera-
tors.
This commutator algebra reveals a more interesting structure if we introduce the linear
combinations
~ = 1 J~ + iK
N ~
2
~ = 1
M J~ − iK
~ . (1.2.36)
2
Note that these are two 3-vectors where each component of the 3-vectors is a 4-by-4
matrix. These two sets of matrix generators obey the following commutator algebra
Until now we have considered a particular representation of the operators Kx,y,z and
Jx,y,z in terms of 4-by-4 matrices which are the generators of boosts and rotations acting
on 4-vectors (and tensors by multiple action of Λs). But this algebra could be represented
by differential operators (as we are used to from QM) or different sets of 4-by-4 matrices
(as we will find later for the spinors which are spin 1/2 particles). All these have one
thing in common: they obey the same algebra. So let us now consider the operators
abstractly and assume that the only thing we know about them is that they obey the
Lorentz algebra (1.2.37).
This is a bonus section and will NOT be part of the exam. A nice exposition of the
material in this and the previous section can be found in the book by Ryder [4].
So far we have only considered rotations and generalised rotations i.e. boosts, but
you might have wondered what has happened to translations in space and time that
we discussed in the context of non-relativistic QM. We will remedy this situation now
and introduce the Poincaré Group which is the natural extension of the Lorentz group
including translations in space-time:
x0µ = Λµ ν xν + aµ (1.2.38)
The first part of this transformation are just the LTs, which are generated by Jx,y,z
and Kx,y,z and the second, inhomogenous, part are the translations generated by the
4-momentum pµ .
Notably, this set of operators obeys the same algebra (1.2.35) that we found in the
last section for the generators Jx,y,z and Kx,y,z in matrix form.
These differential operators are defined through their action on functions, while the
matrices act directly on the coordinates which appear as arguments of spacetime depen-
dent functions. So these two viewpoints are naturally related, e.g. consider an infinitesi-
mal boost in the x-direction. Recalling
Λµ ν = δ µ ν + iωx (Kx )µ ν
x0µ = xµ + iωx (Kx )µ ν xν , (1.2.40)
where the argument x of f (x) stands for a point in spacetime i.e. a 4-vector.
22 CHAPTER 1. QUANTUM MECHANICS AND SPECIAL RELATIVITY
Another common form of the algebra of the K 0 s and J 0 s is written in terms of the
anti-symmetric tensor M
cµν which is defined as follows:
cij = −M
M cji = ijk Jbk
ci0 = −M
M c0i = −K bi (1.2.42)
Including translations generated by pbµ = i∂µ one finds the additional commutation rela-
tions
[b
pµ , pbν ] = 0 , (1.2.44)
h i
cρσ = i (ηµρ pbσ − ηµσ pbρ ) .
pbµ , M (1.2.45)
This is the Poincaré algebra (the Poincaré group is obtained by exponentiating the gen-
erators). A mathematical problem with the Lorentz group is that it does not have finite
dimensional unitary representations; the representations we listed in the last section are
finite dimensional but non-unitary! The problem comes from the boosts, whose generators
are not Hermitian and hence their exponentiation gives non-unitary matrices or opera-
tors. But in QM we seek unitary finite dimensional representations and here the Poincaré
group comes to the rescue, since it does have finite dimensional unitary representations
corresponding to particles with various spins.
This result allows us to answer with mathematical rigour the following fundamental
question: What is a particle?
The mathematicians answer: An irreducible representation of the Poincaré group.
In order to classify representations we have to look for operators that commute with
b2
all operators of the Poincaré group. (This is the analogue to the operator J~ for the
angular momentum operators.) It turns out that there exist exactly two such operators,
which are also called Casimir operators of the Poincaré algebra:
which for a momentum eigenstate becomes p2 = E 2 − p~2 = m2 , so this is just the mass
b2
squared. Now interestingly J~ is not a Casimir since it does not commute with boosts K.
~
b
To constructe the second Casimir one introduces the Pauli-Lubanski vector Wcµ
cµ = − 1 µνρσ M
W cνρ Pbσ
2
cµ Pbµ = 0 ,
W (1.2.47)
so that
C2 = W cµ .
cµ W (1.2.48)
1.2. SPECIAL RELATIVITY 23
As an example consider a massive, spin s particle in its rest frame where pµ = (m, ~0).
Now in the rest frame the orbital angular momentum L ~ = 0 so that the total angular
momentum J~ = L ~ +S~ = S.
~ For Wµ we find
1
W0 = − 0ijk J ij P k = 0
2
1 m m
Wi = − iµνρ J µν P ρ = − iµν0 J µν = − ijk J jk = −mJi (1.2.49)
2 2 2
and, hence,
~ 2 = −m2 J~2 = −m2 S
C2 = Wµ W µ = (W0 )2 − W ~ 2 = −m2 s(s + 1) . (1.2.50)
So we find that a relativistic particle is characterised by its mass m and its spin s.
~ in terms of which B
Let us introduce the 4-vector potential Aµ = (φ, A) ~ =∇
~ ×A
~ and
~
~ = − ∂ A − ∇φ.
~
E ∂t
This implies the 2nd and 3rd Maxwell equation since
~ ·B
∇ ~ =∇
~ · (∇
~ × A)
~ = 0,
~ ~ ~
~ × (− ∂ A − ∇φ)
∇ ~ = −∇ ~ × ∂A = − ∂B . (1.2.52)
∂t ∂t ∂t
Now we define the anti-symmetric electro-magnetic fieldstrength tensor F µν (remember
∂
∂ µ = ( ∂t ~
, −∇):
F µν = −F νµ = ∂ µ Aν − ∂ ν Aµ
~
∂A
F 0i = ∂ 0 Ai − ∂ i A0 = ( ~ i = −Ei
+ ∇φ)
∂t
F ij = ∂ i Aj − ∂ j Ai = −ijk Bk . (1.2.53)
24 CHAPTER 1. QUANTUM MECHANICS AND SPECIAL RELATIVITY
So
0 −E1 −E2 −E3
E1 0 −B3 B2
F µν = (1.2.54)
E2 B3 0 −B1
E3 −B2 B1 0
Under Lorentz transformations
F µν → F 0µν = Λµ α Λν β F αβ (1.2.55)
which can also be written as a matrix equation F → F 0 = Λ · F · ΛT . The second and the
third Maxwell equation are encoded in the so-called Bianchi identity
∂ µ F νρ + ∂ ν F ρµ + ∂ ρ F µν = 0, (1.2.56)
while the first and the fourth Maxwell equation are encoded in the equation
∂µ F µν = j ν (1.2.57)
with j ν = (ρ, ~j). This is an example of an equation that is forminvariant under Lorentz
transformations, i.e. it transforms covariantly under LTs.
In this gauge
∂µ F µν = ∂µ (∂ µ Aν − ∂ ν Aµ ) = Aν − ∂ ν (∂µ Aµ ) = j ν (1.2.60)
becomes
Aν = j ν . (1.2.61)
Furthermore, for j ν = 0 (in vacuo) we obtain
Aν , (1.2.62)
which is a relativistic wave equation for massless spin 1 particles called photons. The
equation implies in particular that the waves/photons are travelling at the speed of light
as any massless particle does (independent of its spin).
Chapter 2
~2
p
In non-relativistic QM, the free Hamiltonian H = E = 2m
is quantised by the substitution
∂ ~
H → i~ , p~ → −i~∇ (2.1.1)
∂t
to give the Schrödinger equation
∂Ψ ~2 ~ 2
i~ =− ∇ Ψ. (2.1.2)
∂t 2m
25
26 CHAPTER 2. RELATIVISTIC QUANTUM MECHANICS
Now because of the negative sign between the two terms in ρ, the probability density can
both take positive and negative values (in contrast to non-relativistic QM where ρ = |Ψ|2
is positive definite)! This is an absurd and nonsensical result for a probability density!!
Junk the KG equation for the moment and try harder. Schrödinger was the first to
write down this relativistic wave equation, but discarded it for a different reason; the
spectrum is not bounded from below.
b = i~ ∂Ψ
HΨ (2.2.1)
∂t
Integrating the probability density over a volume V bounded by the surface S we find V ∂ρ
1 3
R
∂t d x =
d 3 ~ ~ 3 ~ ~
R R R
dt V ρd x = − V ∇ · jd x = − S j · dS. This implies that probability cannot be created or destroyed;
it can only flow from one point to another.
2.2. THE DIRAC EQUATION 27
∂
Since the equation is linear in ∂t
, Lorentz covariance suggests it should be linear also
in the ∂x∂ i , i = 1, 2, 3.
So we write
H
b = c~ p~ + βmc2
α·b
= −i~c~α·∇~ + βmc2 , (2.2.3)
where αi and β are coefficients to be determined. More explicitly this equation can be
written
1 ∂ 2 ∂ 3 ∂
b = −i~c α
H 1
+α 2
+α 3
+ βmc2 . (2.2.4)
∂x ∂x ∂x
Now we determine the coefficients αi and β by requiring that this linear operator ”squares”
to the KG operator
H ~ 2 + m2 c4 .
b 2 = −~2 c2 ∇ (2.2.5)
We find
∂2 2 2
b2 2 2 1 2 2 2 ∂ 3 2 ∂
H = −~ c (α ) + (α ) + (α ) + β 2 m 2 c4
∂(x1 )2 ∂(x2 )2 ∂(x3 )2
∂
−i~mc3 (α1 β + βα1 ) 1 + . . .
∂x
∂2
2 2 1 2 2 1
−~ c (α α + α α ) 1 2 + . . . . (2.2.6)
∂x ∂x
(αi )2 = I , i = 1, 2, 3
αi αj + αj αi = 0 , i 6= j
β2 = I
αi β + βαi = 0 , i = 1, 2, 3 , (2.2.7)
where I denotes a unit matrix (if a subscript is added it denotes the dimensionality, e.g.
I2 denotes a 2 × 2 unit matrix).
It is obviously NOT possible to solve those equations if the coefficients are simply
complex numbers. So let us assume that they are N ×N matrices. With some (guess)work
it can be shown that the smallest value of N for which eq. (2.2.7) can be solved is N = 4.
28 CHAPTER 2. RELATIVISTIC QUANTUM MECHANICS
This implies that the Dirac wave function is a 4-component column vector
Ψ1 (x)
Ψ2 (x)
Ψ(x) = Ψ3 (x)
(2.2.8)
Ψ4 (x)
where x ≡ (x0 , ~x) and the Dirac equation becomes a matrix equation
∂Ψ ~ + βmc2 )Ψ .
i~ = HΨ
b = (c~ p~ + βmc2 )Ψ = (−i~c~
α·b α·∇ (2.2.9)
∂t
This is a set of 4 first order linear differential equations to determine Ψ1 , . . . , Ψ4 .
A particular set of solutions of (2.2.7) for the 4 × 4 matrices αi , β can be written with
the help of the 2 × 2 Pauli matrices σ i , i = 1, 2, 3,
1 0 1 2 0 −i 3 1 0
σ = , σ = , σ = (2.3.1)
1 0 i 0 0 −1
We may satisfy the first two lines in eq. (2.2.7) by taking αi to be the 4 × 4 matrices
i 0 σi
α = , i = 1, 2, 3 . (2.3.3)
σi 0
Now we may satisfy the remaining two lines in eq. (2.2.7) by taking
I2 0
β= . (2.3.4)
0 −I2
(αi )† = αi , β † = β . (2.3.5)
∂Ψ ~ + mc2 βΨ
i~ = −i~~
α · ∇Ψ (2.4.1)
∂t
and its Hermitian conjugate is
∂Ψ† ~ †·α
−i~ = i~∇Ψ ~ + mc2 Ψ† β . (2.4.2)
∂t
Now take Ψ† ×(Dirac eqn.) and (Hermitian conjugate eqn.)×Ψ and subtract the two
to obtain:
∂(Ψ† )
† ∂Ψ
i~ Ψ + Ψ = −i~c Ψ† α ~ + ∇(Ψ
~ · ∇Ψ ~ †) · α
~Ψ . (2.4.3)
∂t ∂t
Dividing this equation by i~ we obtain a Continuity Equation
∂ρ ~ ~
+∇·J =0 (2.4.4)
∂t
with the positive definite probability density given by
4
X 4
X
†
ρ=ΨΨ= Ψ∗k Ψk = |Ψk |2 > 0 , (2.4.5)
k=1 k=1
∂Ψ
i~ = βmc2 Ψ
∂t
∂Ψ mc2
⇒ = −i βΨ . (2.5.1)
∂t ~
30 CHAPTER 2. RELATIVISTIC QUANTUM MECHANICS
I2 0
Taking β = the four equations for the components of Ψ turn into
0 −I2
∂Ψ1 mc2
= −i Ψ1
∂t ~
∂Ψ2 mc2
= −i Ψ2
∂t ~
∂Ψ3 mc2
= +i Ψ3
∂t ~
∂Ψ4 mc2
= +i Ψ4 (2.5.2)
∂t ~
mc2
⇒ Ψ1 = c1 e−i ~
t
e.t.c. (2.5.3)
where c1 is an arbitrary constant.
By acting with the Hamiltonian operator H b = i~ ∂ we find that the first term in the
∂t
2
solution (2.5.5) carries positive energy (+mc ) whereas the second term carries negative
energy (−mc2 ).
i~∂t Ψ = HΨ
b with H
b = c~ p~ + βmc2
α·b (2.6.1)
2.7. THE COVARIANT FORM OF THE DIRAC EQUATION 31
~
p~ = −i~∇.
with b
[H, ~ = i~c(~
b S] α×b
p~) (2.6.4)
and
[H, ~ = −i~c(~
b L] α×b
p~) , (2.6.5)
b~
thus, [H,
b J]=0. So J~ is a conserved quantity which we interpret as the total angular
momentum.
0 0 0 −1
1
⇒ we are describing spin 2
particles (and anti-particles).
γ µ ≡ γ 0 , ~γ = γ 0 , γ 1 , γ 2 , γ 3 ,
(2.7.6)
i~/
∂ Ψ = mcΨ (2.7.7)
or
p/bΨ = mcΨ (2.7.8)
⇒ (/
pb − mc) Ψ = 0 (2.7.9)
where pbµ = i~∇µ .
Using the properties of the αi and β we may show that the gamma-matrices obey the
following anti-commutation identity
{γ µ , γ ν } ≡ γ µ γ ν + γ ν γ µ = 2g µν I4 , (2.8.1)
1 0 0 0
0 −1 0 0
with g µν = .
0 0 −1 0
0 0 0 −1
In particular, (γ 0 )2 = I4 , (γ i )2 = −I4 , i = 1, 2, 3.
In the previous sections we have introduced the covariant form of the Dirac equation
without actually justifying that name. Let us clarify this point now.
The Dirac wave function has four components but it does not transform like a
4-vector. Nevertheless we expect the transformation to be linear.
2. In the primed system Ψ0 (x0 ) should obey an equation that has the same form as the
Dirac equation in the un-primed system.
µ ∂
ie
γ − m Ψ0 (x0 ) = 0 , (2.9.3)
∂x0µ
or
Ψ(x) = S −1 (Λ)Ψ0 (x0 ) = S −1 (Λ)Ψ0 (Λ · x) . (2.9.5)
Combining eqns. (2.9.1) and (2.9.5) we can find an equation for Ψ0 (x0 )
iSγ µ S −1 ∂µ − m Ψ0 (x0 ) = 0 .
(2.9.7)
∂x0ν ∂
Now ∂µ = ∂xµ ∂x0ν
= Λν µ ∂x∂0ν = Λν µ ∂ν0 and, hence, we get
Λν µ Sγ µ S −1 = γ ν (2.9.9)
then we would have proven form invariance (or commonly called convariance) of the Dirac
equation! We can write this condition also as
S −1 γ ν S = Λν µ γ µ . (2.9.10)
Λν µ = δ ν µ − iδω ν µ , (2.9.11)
and write the transformation matrix acting on the Dirac spinor also in infinitesimal form
S = I4 − iδσ . (2.9.12)
Its inverse is
S −1 = I4 + iδσ . (2.9.13)
Plugging these expressions into (2.9.10) we find
Solutions for this equation are easy to find and well known. For ω ν µ = Kx,y,z ; Jx,y,z
we find σ(Kx ), σ(Ky ), . . . , σ(Jz ) with
i 0 i i 0 σi
σ(Ki ) = γ ,γ = , (2.9.15)
4 2 σi 0
i j k 1 σi 0 1
σ(Ji ) = ijk γ , γ = = Σi . (2.9.16)
8 2 0 σi 2
Note that the σ(Ki ) and σ(Ji ) obey the same commutation relations as the Ki and
Ji . They correspond to a particular representation of the Lorentz group called the spinor
2.9. THE DIRAC EQUATION AND LORENTZ TRANSFORMATIONS 35
representation. The σ(Ki ) and σ(Ji ) correspond to the vector representation which acts
by generalised rotations on Lorentz 4-vectors.
Finite LTs for Dirac spinors are again obtained by exponentiating the infinitesimal
LTs:
S = exp(−iδσ) , (2.9.17)
so that under a LT
Ψ → SΨ . (2.9.18)
S = exp(−iωσ(Kz ))
0 ω2 σz
= exp ω
σ
2 z
0
cosh(ω/2) 0 sinh(ω/2) 0
0 cosh(ω/2) 0 − sinh(ω/2)
=
sinh(ω/2)
(2.9.21)
0 cosh(ω/2) 0
0 − sinh(ω/2) 0 cosh(ω/2)
Acting with this S on a plane wave solution with p~ = (0, 0, 0) we should obtain a plane
wave solutionqwith p~ = (0, 0, pz ). In order to show this it is helpful to use the identities
and sinh(ω/2) = √ pz
p
cosh(ω/2) = E+m 2m
with E = m2 + p2z . (Homework)
2m(E+m)
36 CHAPTER 2. RELATIVISTIC QUANTUM MECHANICS
From now on we will work in natural units ~ = c = 1. We look for plane wave solutions
of the Dirac equation of the form
∓ip·x φ
Ψ=e (2.10.1)
χ
where φ are the upper two components and χ the lower two components of Ψ and p · x =
Et − p~ · ~x, with E > 0.
The factor e−ip·x gives solutions with positive energy E and momentum p~, and the
factor e+ip·x gives solutions with negative energy −E and momentum −~p. Substituting
back into the Dirac equation
~
∂Ψ
−i~
α · ∇ + βm Ψ = i , (2.10.2)
∂t
and using ∂t Ψ = ∓iEΨ and ∂x Ψ = ±ipx Ψ e.t.c., we obtain
α · p~ + βm)Ψ = i(∓iE)Ψ
(−i(±i)~
→ (±~α · p~ + βm)Ψ = ±EΨ . (2.10.3)
If we use the standard representation for the αi and β from Section 2.4 we obtain
mI ±~σ · p~ φ φ
= ±E (2.10.4)
±~σ · p~ −mI χ χ
(m ∓ E)φ ± ~σ · p~χ = 0
±~σ · p~φ − (m ± E)χ = 0 . (2.10.5)
We will construct the solutions in such a way that they have a straighforward p~ = 0 limit.
(m − E)φ + ~σ · p~χ = 0
~σ · p~φ − (m + E)χ = 0 . (2.10.6)
Note that the first equation in (2.10.6) only gives the on-(mass)shell condition E 2 =
p~2 + m2 .
1 0
We can write φ in terms of φ1 = and φ2 = .
0 1
where
√
φs
U (p, s) = E+m σ ·~
~ p (2.10.10)
(E+m) s
φ
is a positive energy Dirac spinor. In the last expression a convenient normalization factor
has been introduced.
It can be checked that the first equation in (2.10.6) is automatically satisfied by using
the identity (~σ · p~)2 = p~2 I2 .
(m + E)φ − ~σ · p~χ = 0
−~σ · p~φ − (m − E)χ = 0 . (2.10.11)
Note that the second equation in (2.10.11) only gives the on-(mass)shell condition E 2 =
p~2 + m2 .
0 1
We can write χ in terms of χ1 = and χ2 = .
1 0
where
√ σ ·~
~ p
(E+m) s
χ
V (p, s) = E+m (2.10.15)
χs
is a negative energy Dirac spinor. In the last expression a convenient normalization factor
has been introduced.
Interpretation
To find the physical interpretation for the four independent solutions we consider the
rest frame p~ = 0. Then:
1
√ √
φ1 0
U (p, 1) = 2m = 2m
0 0
0
0
√ √
φ2 1
U (p, 2) = 2m = 2m
0 0
0
0
√ √
0 0
V (p, 1) = 2m = 2m
χ1 0
1
0
√ √
0 0
V (p, 2) = 2m = 2m (2.10.16)
χ2 1
0
operator becomes
1~
J~ = L
~ +S~=S ~= Σ
1 2
σ
2 3
0
⇒ Sz = 1 (2.10.17)
0 σ
2 3
Thus, U (p, 1) and U (p, 2) are positive energy solutions with Sz eigenvalues sz = +1/2
and sz = −1/2 respectively, whereas V (p, 1) and V (p, 2) are negative energy solutions
with Sz eigenvalues sz = −1/2 and sz = +1/2 respectively.
In general, U and V are the Lorentz boosts of these solutions to a frame where p~ 6= 0.
Interpret the negative energy solutions later.
We will often make use of the adjoint spinors which are defined as
U (p, s) ≡ U † (p, s)γ 0 , V (p, s) ≡ V † (p, s)γ 0 . (2.11.3)
By taking the Hermitian adjoint of the equation obeyed by U and V we find
U (p, s)(p/ − m) = 0 , V (p, s)(p
/ + m) = 0 . (2.11.4)
One may also check directly (using again (~σ · p~)2 = p~2 I2 ) that
U † (p, s)U (p, s) = V † (p, s)V (p, s) = 2E , s = 1, 2 , (2.11.5)
and
U (p, s)U (p, s) = 2m , V (p, s)V (p, s) = −2m , s = 1, 2 . (2.11.6)
40 CHAPTER 2. RELATIVISTIC QUANTUM MECHANICS
Since the Dirac equation has negative energy solutions, why do positive energy electrons
not radiate energy and fall into a negative energy state? Dirac: Negative energy states
are completely filled and the Pauli exclusion principle (which applies to fermions) forbids
the transition. Consequently, the Vacuum is a state with all positive energy states empty
but all negative energy states filled.
The absence of a spin-up electron of energy −|E| and momentum −~p is equivalent to
the presence of a spin-down positron of energy +|E| and momentum +~p. (Think about
time running backwards or the arrow in a Feynman diagram reversed)
Thus, the electron wavefunction eip·x V (p, s) corresponding to energy −E and momen-
tum −~p describes a positron of energy +E and momentum +~p. Also, V (p, 1) and V (p, 2)
which describe spin down and spin up negative energy electrons must describe spin up
and spin down positrons.
In general the infinite negative charge of the vacuum produces no effect because the
distribution of charge is homogeneous.
However, consider the effect of a positive energy electron with charge −|e| on the
vacuum. It repels the negative energy electrons and electrically polarises the vacuum.
Thus the physical charge −|e| seen by a test charge at a large distance from the electron
is numerically smaller than the bare charge −|e0 |, i.e. |e| < |e0 |.
2.14. CHARGE CONJUGATION SYMMETRY C 41
However, if the test charge comes very close it will see the bare charge −|e0 |. For
S-wave electrons (l = 0) in an atom, the proton sees a charge numerically greater than
the ordinary electric charge |e|. Note that for l > 0 the wavefunction vanishes at the
origin and the proton feels a numerically smaller charge. This effect leads to measurable
shifts of the energy levels of atoms.
C : Ψ → ΨC (2.14.1)
which turns a positive energy electron wavefunction (e− ) into a negative energy wave
function (e+ ) with the same momentum and spin state. If Ψ = e−ip·x U (p, s) then ΨC =
eip·x V (p, s). The required operation turns out to be
C : Ψ → ΨC = Cγ 0 Ψ∗ , (2.14.2)
Proof: We claim that the Dirac equation is charge conjugation invariant. The Dirac
equation may be written as
(iγ µ ∂µ − m)Ψ = 0, (2.14.3)
taking the complex conjugate gives
(−iCγ 0 (γ µ )∗ ∂µ − m)Ψ∗ = 0
→ (iγ µ (Cγ 0 ))∂µ − mcγ 0 )Ψ∗ = 0
→ (iγ µ ∂µ − m)ΨC = 0 , (2.14.5)
where we have used the identity Cγ 0 (γ µ )∗ = −γ µ (Cγ 0 ). This shows that ΨC obeys the
same equation as Ψ.
42 CHAPTER 2. RELATIVISTIC QUANTUM MECHANICS
The Dirac equation is also invariant under reflection of space coordinates in the origin
P : Ψ → Ψ0 = P Ψ (2.15.2)
T : t → t0 = −t , ~x → ~x0 = ~x , (2.16.1)
T : Ψ → Ψ0 = T ψ ∗ (2.16.2)
~ = ~x × p~, and
p
This is as expected for time reversal, since p~ = m~v / 1 − ~v 2 /c2 and L
~ → −L
thus under time reversal p~ → −~p and L ~ and in particular Lz → −Lz . We assume
that this applies to any AM operator, so that in particular Sz → −Sz . In this case, Ψ0
obeys the same equation as Ψ with ∂t replaced by −∂t .
2
Since P also transforms the space time coordinates this operation should be written more properly
as Ψ(t, ~x) → Ψ0 (t0 , ~x0 ) = Ψ0 (t, −~x) = P Ψ(t, ~x). Hence Ψ0 (t, ~x) = P Ψ(t, −~x); a similar comment applies
to time reversal T .
2.17. DIRAC COVARIANTS 43
It is important in the study of the Weak Interactions to know the properties of objects
like Ψγ µ Ψ, Ψγ µ γ5 Ψ, etc, where we introduced
0 1 2 3 0 I
γ5 ≡ iγ γ γ γ = . (2.17.1)
I 0
We list here the behaviour of some of the Dirac covariants under Lorentz transforma-
tions, P, C:
Covariant LT’s P C
ΨΨ scalar +ΨΨ −ΨΨ
Ψγ5 Ψ pseudoscalar −Ψγ5 Ψ −Ψγ5 Ψ
Ψγ µ Ψ 4-vector +Ψγ 0 Ψ +Ψγ µ Ψ
−Ψγ i Ψ
Ψγ µ γ5 Ψ (pseudo) 4-vector −Ψγ 0 γ5 Ψ −Ψγ µ γ5 Ψ
+Ψγ i γ5 Ψ
Thus,
Ψγ 0 Ψ → Ψγ 0 γ 0 γ 0 Ψ = Ψγ 0 Ψ (2.17.3)
Ψγ i Ψ → Ψγ 0 γ i γ 0 Ψ
= −Ψ(γ 0 )2 γ i Ψ
= −Ψγ i Ψ (2.17.4)
which is invariant under both P and C because the two negative signs for the U γ i U
cancel for space inversion. Thus the Electromagnetic interactions are both space reflection
invariant and charge conjugation invariant.
The corresponding probability amplitude for the Weak Interaction has a factor
The term
U γµ γ5 U U γ µ U
3
X
= U γ 0 γ5 U U γ 0 U − U γ i γ5 U U γ i U (2.17.7)
i=1
Note, that in a general theory C, P and T are not preserved, but the combination of
the three transformations CPT is always a symmetry.
2.18 Neutrinos
Some modification of RQM is needed in the physically important case of massless spin-1/2
particles — Neutrinos (with todays experimental evidence of Neutrino oscillations this is
not quite true, nevertheless it is a very good approximation.)
2.18. NEUTRINOS 45
Experimental observation shows that whereas an e− can have Helicity +1/2 or −1/2,
a Neutrino (which is massless) can only have Helicity −1/2 and an Anti-Neutrino can
only have Helicity +1/2.
Thus, whereas we need four degrees of freedom to describe the 2 spin states of an
electron or positron, we need only 2 degrees of freedom to describe the spin states of the
neutrino and anti-neutrino. We need to discard 2 spin states of the Dirac particle.
Now we return to the Dirac equation for a positive energy solution of energy E and
momentum p~. However, we choose a different representation of the Dirac matrices (and
hence a different representation of the gamma-matrices). This does not effect the physics
but makes the proof much easier. It may be checked that
0 I ~σ 0
β= , α
~= (2.18.2)
I 0 0 −~σ
also obey the Dirac Algebra (2.2.7).
−ip·xφ
For Ψ = e α · p~ + βm)Ψ = EΨ (see Section
the Dirac equation reduces to (~
χ
2.11), from which we get
~σ · p~ mI φ φ
=E , (2.18.3)
mI −~σ · p~ χ χ
which gives the coupled equations
~σ · p~φ + mχ = Eφ
mφ − ~σ · p~χ = Eχ . (2.18.4)
Now taking m = 0 for a massless neutrino decouples the two equations,
~σ · p~φ = Eφ
~σ · p~χ = −Eχ , (2.18.5)
and since E = |~p| for m = 0,
~σ · p~ 1
φ = φ
2|~p| 2
~σ · p~ 1
χ = − χ. (2.18.6)
2|~p| 2
46 CHAPTER 2. RELATIVISTIC QUANTUM MECHANICS
Thus the upper 2 components of Ψ describe Helicity 1/2, and the lower two describe
helicity −1/2 when Ψ has positive energy. To obtain an appropriate Ψ to describe a
Neutrino we perform a projection that removes the upper 2 components.
1 0 0
If we form 2
(I−γ5 )
= , then we may use it to project the upper 2 components
0 I
and leave the Helicity −1/2 components. Thus,
1
ΨL ≡ (I − γ5 )Ψ , (2.18.9)
2
with Ψ a positive energy spinor, may be used to describe the neutrino.
If instead we start from a negative energy solution Ψ, the from Section 2.11
α · p~ + βm)Ψ = −EΨ .
(−~ (2.18.10)
φ
For Ψ = eip·x we then have
χ
−~σ · p~ mI φ φ
= −E , (2.18.11)
mI ~σ · p~ χ χ
which gives the coupled equations
~σ · p~φ = Eφ
~σ · p~χ = −Eχ , (2.18.13)
This is the same as for the positive energy solution. Thus, the upper 2 components of Ψ
still describe Helicity +1/2 and the lower 2 components describe Helicity −1/2. To obtain
an appropriate Ψ to describe an Anti-Neutrino with Helicity +1/2 we need a negative
energy state with Helicity −1/2.
could give negative values (we have renamed the wavefuntion Ψ by φ).
∂2 ~ 2 + m2 )φ = 0
( −∇ (2.19.2)
∂t2
has positive energy solutions
ρ = |N |2 (±2E) . (2.19.5)
Thus, negative probabilities come from negative energy solutions. These are (as usual)
the problem.
We need an interpretation for the negative energy solutions of the KG equation. Dirac
Hole theory will NOT work for the spin-0 Bosons described by the KG equation, because
they do not obey the Dirac exclusion principle to give a filled negative energy sea.
Feynman gave an alternative way of interpreting negative energy solutions which works
for both bosons and fermions!
48 CHAPTER 2. RELATIVISTIC QUANTUM MECHANICS
In Feynman diagrams, which are the rules of calculating scattering and decay am-
plitudes in RQM, when Anti-Particles are involved we draw lines for negative energy
particles propagating backwards in time and use Feynman’s interpretation. E.g. for elec-
tromagnetic Electron-Positron scattering via photon exchange there are two diagrams
that contribute:
pµ → pµ + qAµ , (2.20.1)
~ = (φ, A)
Aµ ≡ (A0 , A) ~ (2.20.2)
The free particle Dirac equation is (iγ · ∂ − m)Ψ = 0. Making the above substitution
γ · ∂ → γ · ∂ − iqγ · A (2.20.5)
or
(i∂/ − m)Ψ = −qA
/. (2.20.7)
2.20. DIRAC EQUATION IN AN ELECTROMAGNETIC FIELD 49
It is sometimes convenient to write the equation in terms of the Dirac matrices i.e. in
Hamiltonian form. Begin with the equation
∂ ~ − m)Ψ = −q(A0 γ 0 − A
~ · ~γ )Ψ
(iγ 0 + i~γ · ∇ (2.20.8)
∂t
and multiply from left with γ 0 = β. Since (γ 0 )2 = I and γ 0 γ i = ββαi = αi we obtain
∂Ψ ~ ~
i = (−i∇ + q A) · α + βm Ψ − qA0 Ψ
∂t
∂Ψ ~
→ i = α · Π + βm Ψ − qA0 Ψ , (2.20.9)
b
∂t
where
~b = −i∇
Π ~ + qA
~ =b ~.
p~ + q A (2.20.10)
Gauge Invariance
This can be fixed if we transform the Dirac wavefunction by a spacetime dependent phase
factor
Ψ → eiα(x) Ψ = Ψ̃ . (2.20.12)
So we find
If we now require that the 2nd term cancels the 4th term in the last line of that equation,
which simply implies
α(x) = qχ(x) , (2.20.14)
then we get
eiqχ(x) (i∂/ + qA
/ − m) Ψ = 0 . (2.20.15)
So up to an irrelevant overall phase factor we have recovered the original Dirac equation!
Hence the Dirac equation is forminvariance under gauge transformations:
Aµ → õ = Aµ + ∂µ χ(x) ,
Ψ → Ψ̃ = eiqχ(x) Ψ . (2.20.16)
50 CHAPTER 2. RELATIVISTIC QUANTUM MECHANICS
Charge Conjugation
It is interesting to see what happens to the Dirac equation coupled to the EM field
under charge conjugation:
C : Ψ → ΨC = Cγ 0 Ψ∗ , (2.20.17)
with C = iγ 2 γ 0 .
i.e. under charge conjugation the charge of the particle flips sign. (Proof in homework)
In the non-relativistic limit the rest mass mc2 is the largest energy in the problem (since
|~v |2 << c2 ) and we can write for a positive energy solution
−imt φ
Ψ=e (2.21.1)
χ
where φ and χ vary slowly with time and will be called large and small components for
reasons that will become clear in a moment.
Substituting in the Dirac equation (with the Dirac representation for β and αi , which
is more appropriat for studying non-relativistic limits) in an electromagnetic field
!
~b
−imt φ −imt ∂t φ −imt (−qA0 + m)I ~σ · Π φ
me + ie =e ,
χ ∂t χ ~
~σ · Π
b
(−qA0 − m)I χ
(2.21.2)
+imt
multiplying with e and subtracting the first term on the left hand side from both sides
we obtain !
~
0
i∂t φ −qA I ~σ · Π φ
b
= . (2.21.3)
i∂t χ ~ (−qA − 2m)I
~σ · Π
b 0 χ
~b
~σ · Π
χ∼ φ (2.21.5)
2m
2.21. THE MAGNETIC MOMENT OF THE ELECTRON 51
~b = b
where Π ~ Hence, for momenta and EM fields small compared to the rest mass
p~ + q A.
χ << φ . (2.21.6)
~b ,
i∂t φ = −qA0 φ + ~σ · Πχ (2.21.7)
~b 2
(~σ · Π)
0
⇒ i∂t φ = −qA φ + φ. (2.21.8)
2m
~b 2 φ = (~σ · Π)(~
(~σ · Π) ~b σ · Π)φ
~b = Π
~b · ΠIφ
~b + i~σ · (Π
~b × Π)φ
~b . (2.21.12)
Now (~ = 1),
~b Π)φ
(Π× ~b = (b ~
p~ +q A)×( ~ = (−i∇+q
p~ +q A)φ
b ~ ~
A)×(−i ~
∇+q ~ = −(∇+iq
A)φ ~ ~
A)×( ~
∇+iq ~ .
A)φ
(2.21.13)
The x component of this expression is
and hence
~b × Π)φ
(Π ~b = −iq Bφ
~ . (2.21.15)
52 CHAPTER 2. RELATIVISTIC QUANTUM MECHANICS
Now
~b 2 φ = Π
(~σ · Π) ~b · Πφ
~b + q~σ · Bφ
~ , (2.21.16)
hence, the non-relativistic limit of the Dirac equation in an EM field (also called Pauli
equation) may be written as
!
∂φ (p
b~ + q ~ 2 q~σ · B
A) ~
i = −qA0 + + φ (2.21.17)
∂t 2m 2m
we can interpret the last term in eqn. (2.21.18) as a potential energy −~µspin · B ~ due to
the spin magnetic moment of the electron in an external magnetic field. Thus the spin
magnetic moment is
~
qS ~
qS
~µspin = − ≡ −g (2.21.20)
m 2m
where g is the so-called gyromagnetic ration. Hence, the Dirac equation predicts g = 2
whereas classically we would expect g = 1. This prediction was confirmed experimentally
and is one of the spectacular successes of the Dirac equation! Including radiative correction
from Quantum Electrodynamics (QED) yields a more precise value of g = 2(1.0011 . . .)
which agrees up to nine digits after the dot with experiment!
∂Ψ
HΨ α·b
b = (~ p~ + βm + V (r))Ψ = i , (2.22.1)
∂t
for positive energy solutions with energy eigenvalue E > 0 we make an separation ansatz
so that
i∂t Ψ = EΨ (2.22.3)
which gives a time independent equation
α·b
(~ p~ + βm + V (r))Ψ = EΨ . (2.22.4)
Zα e2
V (r) = − , α= . (2.22.5)
r 4π
The total AM operator J~ commutes with (~ p~ + βm) as in section 2.7. Also J~ commutes
α ·b
with V (r) because V (r) is independent of Spin and as in section 1.3, orbital AM operator
~ commutes with V (r). Thus [J,
L ~ H]
b = 0.
l
The problem can be solved using simultaneous eigenstates ψj,m of J~2 , Jz and the parity
operator P (which takes ~x → −~x). The corresponding quantum numbers are j(j + 1), m
and (−1)l , where l is orbital angular momentum.
For the spin-1/2 electron, the allowed values of j are j = l ± 1/2. The gory details of
the calculation can be found in section 2.3.2 of [6], with the result for the energy levels
1 Z 2 α2 1 Z 4 α4
1 3 6
En,j = me 1 − − − + O((Zα) ) (2.22.6)
2 n2 2 n3 j + 1/2 4n
This result predicts correctly the splitting of the energy levels with the same princi-
ple quantum number n but different j (Fine Splitting); It does not predict the observed
splitting of energy levels with the same n and j but different parity (−1)l (Lamb Shift).
This requires the quantization of the EM field Aµ and, hence, the use of Quantum Elec-
trodynamics (QED). Other important quantum corrections are discussed in [6].
[1] J. Bjorken and S. Drell, Relativistic Quantum Mechanics and Relativistic Quantum
Fields, McGraw-Hill.
[2] M.E. Peskin and D.V. Schröder, An Introduction to Quantum Field Theory, Addison-
Wesley.
[3] S. Weinberg, The Quantum Theory of Fields, Vol 1, Cambridge University Press.
[7] I.J. Aitchison and A.J. Hey, Gauge Theories in Particle Physics, Dover Publications.
[8] F. Mandl and G. Shaw, Quantum Field Theory, John Wiley & Sons.
55