Chapter 2
Chapter 2
GYRODYNAMICS
Introduction to the dynamics of rigid bodies
Introduction. Though Newton wrote on many topics—and may well have given
thought to the odd behavior of tops—I am not aware that he committed any
of that thought to writing. But by Euler was active in the field, and it
has continued to bedevil the thought of mathematical physicists. “Extended
rigid bodies” are classical abstractions—alien both to relativity and to quantum
mechanics—which are revealed to our dynamical imaginations not so much by
commonplace Nature as by, in Maxwell’s phrase, the “toys of Youth.” That such
toys behave “strangely” is evident to the most casual observer, but the detailed
theory of their behavior has become notorious for being elusive, surprising
and difficult at every turn. Its formulation has required and inspired work
of wonderful genius: it has taught us much of great worth, and clearly has
much to teach us still.
Early in my own education as a physicist I discovered that I could not
understand—or, when I could understand, remained unpersuaded by—the
“elementary explanations” of the behavior of tops & gyros which are abundant
in the literature. So I fell into the habit of avoiding the field, waiting for
the day when I could give to it the time and attention it obviously required
and deserved. I became aware that my experience was far from atypical:
according to Goldstein it was in fact a similar experience that motivated Klein
& Sommerfeld to write their 4-volume classic, Theorie des Kreisels (–).
In November I had occasion to consult my Classical Mechanics II
students concerning what topic we should take up to finish out the term. It
was with feelings of mixed excitement and dread that I heard their suggestion
that we turn out attention to the theory of tops. The following material takes
as its point of departure the class notes that were written on that occasion.
2 Gyrodynamics
where pi = miẋ xi . P = M Ẋ
X is the total linear momentum of the system and is
for isolated systems conserved, whatever may be the nature of the intra-system
interactions. In this notation (2) has become
T =T0 + T (6)
T= 1
2 miṙr2i = kinetic energy relative to the center of mass
i
T0 = 12 M Ẋ 2 = 1
2M P
2
= kinetic energy of the center of mass
aij ≡ |x
xi − xj | = |rri − r j |
ȧij = 0 (7)
and where the constant vectors r 0i record the initial and enduring design of
the rigid assemblage: r 0i = xi (0) − X (0). Three degrees of freedom enter
into the specification of X , and (in 3 -dimensional space) three more into the
specification of R.
Chasle’s equation (8) amounts to the assertion that in a rigid assemblage
r i = R r 0i (10)
xi = X + R r0i
we obtain finally
ẋ X + Arri = Ẋ
xi = Ẋ ω × ri)
X + (ω (12)
1
See E. T. Whittaker, Analytical Dynamics (4th edition ), page 4.
4 Gyrodynamics
where
I≡ mi BTi Bi
i
r 2 − r1 r 1 − r 1 r2 − r 1 r3
= mi − r 2 r1 r 2 − r2 r2 − r 2 r3 (14)
i − r 3 r1 − r 3 r2 r − r3 r3 i
2
Look finally to the total angular momentum of a rigid body. For any
multi-particle system one has
J= xi × pi = mi xi × ẋ
xi
i i
The expression on the right develops into a sum of four terms, of which two—
the “cross terms”—vanish in consequence of the fundamental constraint (3).
We are left with
= Lorbital + Lintrinsic (15)
with
angular momentum of the
Lorbital = X ×M Ẋ
X = X ×P P = center of mass, relative to
the coordinate origin
angular momentum relative to
Lintrinsic = mi r i × ṙri =
the center of mass
i
The intrinsic angular momentum of rigid multi -particle systems is called spin:
one has
Lintrinsic −→ S = mi r i × Arri
i
= mi r i × (ω
ω × ri)
i
=− mi r i × (rri × ω )
i
= mi BTi Bi ω
i
= Iω (16)
J =L+S (17)
T = 12 S T I –1 S provided det I
= 0 (18.1)
= 12 ω · S (18.2)
2. Nature & properties of the moment of inertia matrix. Let the distribution
function ρ(x) describe how some material of interest (mass, let us say) is
distributed along the real line. One writes
M = m = x0 ρ(x) dx
(0)
: 0th moment
M X = m(1) = x1 ρ(x) dx : 1st moment
m = x2 ρ(x) dx
(2)
: 2nd moment
..
.
to define the “moments” of the distribution.2 From the set of all moments one
can construct the “moment generating function”
∞
ϕ(k) ≡ 1 n (n)
n! (ik) m = eikx ρ(x) dx
n=0
Evidently
µ(0) = M
the center of mass relative to the
µ(1) = m(1) − Xm(0) = 0 :
center of mass resides at the origin
All of which carries over straightforwardly to higher-dimensional situations. In
three dimensions we have
M= x) d3 x
ρ(x : solitary 0th moment
x1
MXX= x2 ρ(x x) d3 x : vector of 1st moments
x3
r1 r 1 r 1 r 2 r 1 r 3
M= r2 r1 r2 r2 r2 r3 ρ(rr) d3 r : matrix of centered 2nd moments
r3 r 1 r 3 r2 r3 r 3
and are placed now in position to recognize that the moment of inertia matrix
is an object assembled from centered second moment data:4
I = (trace M)·U − M (19)
Remarkably, the low- order moment data built into the designs of M , X and I
is central to the dynamical theory of rigid bodies, but the moments of higher
order are (in most contexts) utterly irrelevant: distinct rigid bodies can be
expected to move identically if they have identical 0th , 1st and 2nd order
moments.
The moment of inertia matrix I is manifestly real and symmetric
: IT = I.
We
are assured,
therefore, that the eigenvalues of I (call them
I1 , I 2 , I3 or
A, B, C
) are real , and the associated eigenvectors (call them e 1 2 3 or
, e , e
a, b, c ) are—or can always be taken to be—orthogonal : ei· ej = δij . We are
assured, moreover, that I can in every case be diagonalized by a suitably chosen
rotation matrix :
I1 0 0 1 0 0
R T I R = 0 I2 0 with R T R = 0 1 0
0 0 I3 0 0 1
3
I have here allowed myself to write ρ(r) where I should more properly have
written something like ρ̃(r) ≡ ρ(X + r). Similarly, I will later write ρ(rr) when
x) that was given and ρ(X
actually it was ρ(x X + r ) that is intended.
4
Since the symbol I is busy, I have here had to use U to represent the 3 × 3
identity matrix.
8 Gyrodynamics
The orthonormal triple e1 , e2 , e3 serves to define the “principal axes” of the
rigid body. With respect to the “principal coordinate frame” that has its origin
at the center of mass and coordinate axes parallel to the principal axes one has
I1 0 0 1 1
0 I2 0 0 = I1 0
0 0 I3 0 0
I1 0 0 0 0
0 I2 0 1 = I2 1
0 0 I3 0 0
I1 0 0 0 0
0 I2 0 0 = I3 0
0 0 I3 1 1
If r1 , r2 , r3 refer to the principal frame, then the continuous version of (14)
supplies
2
I1 0 0 r2 + r32 0 0
0 I2 0 = 0 r12 + r32 0 ρ(rr)d 3 r (20)
0 0 I3 0 0 r1 + r22
2
More interestingly,
2 2
I1 + I2 = r1 + r2 + 2r3 ρ(rr)d r
2 2 3
r1 + r22 ρ(rr)d 3 r = I3
..
.
etc.
which is to say:
I3 I2
I1
Moment of inertia with respect to an axis 9
This occurance of the triangle inequality is, in view of (20), not at all surprising,
for if α, β and γ are any positive numbers then
i=α+β
j =β+γ
k =γ+α
5
I am indebted to Tom Wieting for this observation.
10 Gyrodynamics
n
λn x
dm to the axial line. The question now before us: how does that scalar moment
of inertia relate to the matrix-valued construction I ?
We confront first a simple geometrical problem: how to describe the length
of the normal dropped from a point to a line? To describe a line through the
n (λ variable, n a fixed unit vector). The condition that the
origin we write λn
vector p(λ) ≡ x − λn n · p(λ) = 0) enforces λ = n · x. The
n be normal to the line (n
length of the normal dropped from x to the line can therefore be described
r2 (x x − (n
x) = [x n · x)nn]·· [x n · x)n
x − (n n]
= x · x − (x
x · n)(nn · x)
= x · (U − N ) x (22.1)
= n · [(trace X)·U − X ] n (22.2)
Equation (23) indicates how moments with respect to axes (numbers of the
type I0 ) can be extracted from the data written into I. One can also proceed
in the reverse direction. Suppose, for example, we were to set
1
n = 0
0
We would then have
measured value of I0 = I11
More generally, we have, for each selected n,
n) = n1 n1 I11 + 2n1 n2 I12 + 2n1 n3 I13
measured value of I0 (n
+ n2 n2 I22 + 2n2 n3 I23
+ n2 n2 I22
Given six well-chosen instances of that equation, we would have enough
information to compute all the Iij by straightforward linear algebra.6
6
Is there an computationally optimal way to select n1 , n2 , n3 , n4 , n5 , n6 ?
12 Gyrodynamics
changed. The issue before us: How is I final related to I initial ? Working from
(14), i.e., from (compare (19))
r1 r 1 r 1 r2 r1 r3
I= mi r2 · U − W i with W ≡ r2 r1 r 2 r2 r2 r3
i r3 r 1 r 3 r2 r3 r3
The effect of body rotation—as was remarked already at (10)— can be described
r 0i −→ r i = R r 0i : all i
which induces
0
Wαβ ≡ rα0 rβ0
−→ Wαβ ≡ rα rβ = Rαµ rµ0 rν0 Rβν = Rαµ Wµν
0
Rβν
Ṙ = A R with AT = −A
so we have
İ = A I − I A (26)
S = Iω
S = I ω̇
Ṡ ω + ω× Iω − Iω× ω
= I ω̇
ω + ω ×S
S (27)
X + mir̈ri
= miẌ (28)
P
= Ṗ (29)
where the 3rd law has killed the interactive force terms, and the constraint (3)
has served to kill the relative acceleration terms. Application of
xi × = X + r i )×
(X (30)
i i
14 Gyrodynamics
where the final term actually vanishes if—as we will assume—the interactive
interactive
forces are central: F ij (rri − r j ). On the other hand, application of
(30) to the right side of (28) was already seen at (15) to give Lorbital + Lintrinsic .
So we have, for any centrally interactive N -body system,
N orbital + N intrinsic = L̇
Lorbital + L̇
Lintrinsic
where
N orbital ≡ X × F total impressed
N intrinsic ≡ r i × F iimpressed
i
N intrinsic = L̇
Lintrinsic : refers to motion relative to the center of mass
Now impose the assumption of rigidity upon our N -body system, and
emphasize that we have done so by notational adjustment: Lintrinsic
→ S = I ω .
Drawing upon (27) we then have
S
N intrinsic = Ṡ
= I ω̇
ω + ω ×Iω (31)
Here N intrinsic , ω and the integrals that assign instantaneous meaning to I all
refer to the space frame, a (generally non-inertial) translated copy of the inertial
lab frame. Several circumstances limit the utility of this result:
• The value of N intrinsic will, in the general case, depend upon both the
location and the orientation of the rigid body, and may even depend upon
its instantaneous state of motion (as would happen if the body had been
assembled from charged particles and were placed in a magnetic field).
This circumstance introduces an element of circularity into the discussion:
one must know the net effect of all past motion to understand what (31)
has to say about present motion.
• Even in the simplest case N intrinsic = 0 an awkward time-dependence lurks
in the design of I, which changes moment to moment as the body rotates.
a frame that is fixed in the body. If I 0 ≡ I(0) refers to the initial orientation
of the body, and I ≡ I(t) to its evolved orientation, then by (25) we have
I = R I 0 R –1 (32)
where R ≡ R (t) refers to the rotational that has been accomplished during the
interval in question. In this notation (31)—after multiplication on the left by
R –1 —becomes
N 0 = I 0 R –1 ω̇
ω + R –1ω × R I 0 R –1 ω with N 0 ≡ R –1N
= I 0 · R –1 ω̇
ω + R –1ω × R · I 0ω 0 with ω 0 ≡ R –1 ω
R A R –1 = (R a)× (33)
7
See again page 3.
8
On pages 27–35 of gyrodynamics (/) and again on page 92 below
I discuss in detail how Coriolis and centrifugal forces—universal symptoms of
non-inertiality—conspire to achieve this remarkable result.
16 Gyrodynamics
Computation9 shows the vector ω 0×I 0 ω 0 to be, in the general case, a fairly
intricate object. It is, however, entirely natural to identify the body frame with
the principal axis frame
I11 I12 I13 I1 0 0
• I22 I23 −−−−−−−−−−−−−−−−−−−−−−→ 0 I2 0
passage to principal axis frame
• • I33 0 0 I3
Major simplifications are then achieved: one is led from (34) by quick
calculation to the so-called Euler equations
N10 = I10 ω̇10 + (I30 − I20 ) ω30 ω20
N20 = I20 ω̇20 + (I10 − I30 ) ω10 ω30 (35)
N30 = I30 ω̇30 + (I20 − I10 ) ω20 ω10
We have here a coupled system of three non-linear first-order differential
conditions on three unknown functions ω 0 (t).
Suppose for the moment that equations (35) have been solved . How does
knowledge of ω 0 (t) determine the rotation matrix R(t) by means of which we—
if not riding on the body but watching it from a position at rest with respect
to the space frame—propose to understand the perceived motion of the rigid
body? We have
Ṙ R –1 = A = ω ×
= (R ω 0 )×
ω 0 ×)R –1
= R (ω by lemma (33)
giving
Ṙ = R A0 with (we may assume) R (0) = U (36.1)
Equivalently
t
R (t) = U + R (τ )A0 (τ ) dτ (36.2)
0
which can, in principle, be solved by iteration. But except in the simplest of
cases we can expect the solution of (35) to be very difficult, and the solution of
(36.1) to be also very difficult.
are not independent, but are subject by R R T=U to six constraints. To deal with
that circumstance we will bring into play the “method of Lagrange multipliers,”
following the lead of Saletan & Cromer.10
Backing up to the equation that led to (13), we have
T= 1
2 miṙri· ṙri
i
= 1
2 mi (Ṙ r i0 )··(Ṙ r i0 )
i
1
= 2 Ṙµα Mαβ Ṙµα
µ,α,β
0
Mαβ ≡ 0 0
mi riα riβ
i
= 1
2 tr Ṙ M 0 Ṙ T
α,β
whence
L( Ṙ, R, Ȧ
•,A• ) =
1
2 tr Ṙ M 0 Ṙ T − U(R) + tr A
• (R R − U)
T
d ∂ − ∂ L = (R̈ M 0 )αβ + ∂U/∂Rαβ − 2(R A
• )αβ = 0
dt ∂ Ṙαβ ∂Rαβ
d ∂ − ∂ L = (R T R − U)αβ = 0
dt ∂ λ̇αβ ∂λαβ
which in matrix notation (after multiplying the first equation by R –1 on the left)
become
R –1 R̈ M 0 + R –1 ∂U/∂R = 2A
•
R TR − U = O
Returning with R –1 = R T (information supplied by the second equation) to the
10
E. J. Saletan & A. H. Cromer, Theoretical Mechanics (), pages 144–146.
See also J. V. José & E. J. Saletan, Classical Dynamics: A Contemporary
Approach (1998), pages 514–519.
18 Gyrodynamics
from which all reference to the Lagrange multipliers has been eliminated. This
second-order matrix equation is (by antisymmetry) equivalent to a system of
three scalar equations, and will now be shown to comprise no more nor less
than a matrix formulation of the Euler equations (35).
Let us first of all agree to write
(∂U/∂R)T R − R T (∂U/∂R) ≡ N 0
Ȧ0 = Ṙ TA R + R TȦ R + R TA Ṙ
= R T (−A2 + Ȧ + A2 )R
= R –1 Ȧ R
R T R̈ = Ȧ0 + A0A0
M 0 = 12 (trI 0 )·U − I 0
giving finally
N 0 = (trI 0 )·Ȧ0 − (Ȧ0 I 0 + I 0Ȧ0 ) − (A0 A0 I 0 − I 0A0 A0 ) (39)
The claim now is that (39) stands to Euler’s equations (35) in precisely the
11
See again (19) on page 7.
Euler angles 19
0 −ω30 ω20 I10 0 0
A0 = ω30 0 −ω10 and I =
0 0 I20 0
−ω20 ω10 0 0 0 I30
and discover that the expression on the right side of (39) can be written
0 −[I30 ω̇30 +(I20 −I10 ) ω20 ω10 ] [I20 ω̇20 +(I10 −I30 ) ω10 ω30 ]
[I30 ω̇30 +(I20 −I10 ) ω20 ω10 ] 0 −[I10 ω̇10 +(I30 −I20 ) ω30 ω20 ]
−[I20 ω̇20 +(I10 −I30 ) ω10 ω30 ] [I10 ω̇10 +(I30 −I20 ) ω30 ω20 ] 0
0 −N30 N20
N30 0 −N10
−N20 N10 0
we see that the Lagrangian formalism has led us to what is in effect the dual
of Euler’s system of equations.
provides of the elements of O(2). This was first accomplished by Euler,12 who
observed (see the following figure) that if one
• rotates through an appropriate angle φ about the 3-axis, then
• rotates through an appropriate angle θ about the new 1-axis, then
• rotates through an appropriate angle ψ about the newest 3-axis
one can bring any frame into coincidence with any other frame.13 Reading from
12
L. Euler (–) retained an interest in dynamics—particularly the
dynamics of rigid bodies—throughout his professional career, but his papers on
the subject were written mainly between and , when he was attached
to the court of Frederick the Great, in Berlin. I suspect it was his physical
work which stimulated the invention of the Euler angles, though they are of
independent mathematical interest and importance.
13
We assume, of course, that the two frames share the same origin and are
similarly handed.
20 Gyrodynamics
2
φ
1
1
ψ
≡ E(φ, θ, ψ) r
Euler angles 21
serves to describe how the coordinates r relative to the rotated red frame of a
fixed point P are related to its coordinates r relative to the black frame:
x
x
= E(φ + π, − θ, ψ + π) (40.2)
x = Rx
x0
where now
R = E –1 (φ, θ, ψ)
22 Gyrodynamics
y
y0
x x0
Explicitly
cos φ − sin φ 0 1 0 0 cos ψ − sin ψ 0
R = sin φ cos φ 0 0 cos θ − sin θ sin ψ cos ψ 0
0 0 1 0 sin θ cos θ 0 0 1
= Ṙ 3 (φ)·R 1 (θ)·R 3 (ψ) + R 3 (φ)· Ṙ 1 (θ)·R 3 (ψ) + R 3 (φ)·R 1 (θ)· Ṙ 3 (ψ)
· R 3T (ψ)·R 1T (θ)·R 3T (φ)
= Ṙ 3 (φ)·R 3T (φ) + R 3 (φ)· Ṙ 1 (θ)·R 1T (θ)·R 3T (φ)
+ R 3 (φ)·R 1 (θ)· Ṙ 3 (ψ)·R 3T (ψ)·R 1T (θ)·R 3T (φ)
and similarly
1
Ṙ 1 (θ)·R 1T (θ) = θ̇ 0 ×
0
0
Ṙ 3 (ψ)·R 3T (ψ) = ψ̇ 0 ×
1
Drawing upon lemma (33) we therefore have the deceptively suggestive formal
statement
φ + θ̇θ + ψ̇
ω = φ̇ ψ (41.1)
where
0 1 0
φ ≡ φ̇ 0 ,
φ̇ θ̇θ ≡ θ̇ R 3 (φ) 0 , ψ ≡ ψ̇ R 3 (φ) R 1 (θ) 0
ψ̇
1 0 1
24 Gyrodynamics
give
0 cos φ sin θ sin φ
φ = φ̇ 0 ,
φ̇ θ̇θ = θ̇ sin φ , ψ = ψ̇ − sin θ cos φ
ψ̇ (41.2)
1 0 cos θ
whence finally
θ̇ cos φ + ψ̇ sin θ sin φ
ω= θ̇ sin φ − ψ̇ sin θ cos φ (42)
φ̇ + ψ̇ cos θ
Which the αkλ -functions latent in (42) obviously do not satisfy: it is not possible
α. Right at the heart of 3-dimensional rotational kinematics lives
to write ω = α̇
a non-integrability condition. Contrast this with the 2-dimensional situation,
where if
cos θ sin θ
R=
− sin θ cos θ
then
0 θ
Ṙ R T = d
, which by dualization becomes simply ω = d
dt θ
dt −θ 0
We found at (13) that the intrinsic (or rotationial) kinetic energy can be
described
T = 12 ω T I ω
which by (25) becomes
T = 12 ω T R I 0 R –1 ω
= 12 ω 0 T I 0 ω 0 with ω 0 ≡ R –1 ω (43)
One could evaluate ω 0 by Mathematica -assisted brute force. Or one could work
Lagrangian formalism using Euler angles 25
A0 = R –1A R
= R –1 Ṙ R –1 R = R –1 Ṙ
8. Lagrangian formalism using Euler angles. It follows clearly from results now
in hand that the rotational dynamics of a rigid body can be considered to
devolve from a Lagrangian of—if we exercise our option to identify the body
frame with the principal axis frame—the form
which precisely reproduces the third of the Euler equations (35). Certain linear
combinations of the remaining Lagrange equations
d ∂ − ∂ L=0 (46.2)
dt ∂ θ̇ ∂ θ
d ∂ − ∂ L=0 (46.3)
dt ∂ φ̇ ∂φ
serve to reproduce first and second of the Euler equations.
One can understand the relative complexity the Lagrange equations (46)
on grounds
that they undertake
to accomplish more than Euler equations.
Solutions φ(t), θ(t), ψ(t) of (46) serve in themselves to describe how the body
gyrates, while solutions ω (t) of (35) leave us—as we saw at (36)—one awkward
integration away from such explicit information.
26 Gyrodynamics
If I10 = I20 then the rigid body (or “top”) is said to be symmetrical,14
and the Lagrangian (45) assumes the form
L = 12 I10 (φ̇2 sin2 θ + θ̇2 ) + 12 I30 (φ̇ cos θ + ψ̇)2 − U(φ, θ, ψ) (47)
Had we instead set (not I10 = I20 but, say) I10 = I30 then the simplification of
(45) would have been masked or disguised, while the simplified Euler equations
would be precisely similar to (48). It is, within the Lagrangian formalism, as
a mere convenience, and without real loss of generality, that one identifies the
3-axis of the principal axis frame with the symmetry axis of a symmetrical top.
If I10 = I20 = I30 then the Lagrangian simplifies still further, to become
N 0 = I10 ω̇
ω0 (50)
We will return to discussion of some of the remarkably rich physics that arises
in these important special cases; i.e., to a discussion of the solutions of the
associated equations of motion.
9. Free motion of a rigid body. Working first in the space frame (since it is from
a position at rest with respect to the almost-inertial laboratory that we expect
to view our gyrating rigid objects), we return to (31) and, setting N = 0, obtain
S = I ω̇
Ṡ ω + ω×I ω = 0 (51)
according to which
Trivially,
So also is S 2 ≡ S·S a constant of the free motion (53)
14
The term conveys no information about the actual shape of the top.
Free motion of a rigid body 27
For a free rigid body the rotational energy is all kinetic, and as we saw at
(18.2) can be described T = 12 ω · S . To prepare for a proof that Ṫ = 0 I digress
to establish the following
Explicitly
W –1 · W m i W n j Amn = &ikj M k p ap
which in index-free notation becomes
15
See classical electrodynamics (/), pages 172/3.
28 Gyrodynamics
Ṫ = 12 ( ω̇
ω · S + ω · Ṡ
S)
ω·S
= 12 ω̇ S =0
by (51): Ṡ
But—again by (51)—
ω = −I –1 (ω
ω̇ ω × I ω)
= −I –1 (ω
ω ×)I –1 · I 2ω
But the symmetry of I implies that of I –1 , so we can use lemma (56.3) to obtain
= −(det I)–1 (I ω ) × I 2ω (57)
giving
Ṫ = 12 (det I)–1 (I 2 ω × I ω )·· I ω
But the triple scalar product
a1 b1 c1
a × b)·· c = a2
(a b2 c2 = 0 unless a, b and c are linearly independent
a3 b3 c3
Which in the preceding equation is clearly not the case. So we have
Ṫ = 0 : T is a constant of the free motion (58)
1 d
2 dt
ω 0· ω 0
ω 2 = ω̇
= (det I 0 )–1 (I 02 ω 0 × I 0 ω 0 )·· ω 0
Electing to work in the principal axis frame, where I 0 is diagonal, and agreeing
to omit all 0 superscripts for the duration of this argument, we therefore have
2
I1 ω1 I1 ω1 ω1
= 1 I22 ω2 I2 ω2 ω2
I1 I2 I3 2
I3 ω3 I3 ω3 ω3
I12 I22 I32
det J
= ω1 ω2 ω3 with J ≡ I1 I2 I3 (59)
I1 I2 I3
1 1 1
which gives
ω12 α1
ω22 = J –1 α2
ω32 α3
(I2 − I3 ) −(I2 + I3 )(I2 − I3 ) I2 I3 (I2 − I3 )
J –1 = (det J )–1 · (I3 − I1 ) −(I3 + I1 )(I3 − I1 ) I3 I1 (I3 − I1 )
(I1 − I2 ) −(I1 + I2 )(I1 − I2 ) I1 I2 (I1 − I2 )
with
etc. ≡ α1 −(I2 +I3 )α2 +I2 I3 α3 α1 −(I2 +I3 )α2 +I2 I3 α3 α1 −(I2 +I3 )α2 +I2 I3 α3
with
2(I2 + I3 )T − S 2
λ1 =
I2 I3
2(I3 + I1 )T − S 2
λ2 =
I3 I1
2(I1 + I2 )T − S 2
λ3 =
I1 I2
which, it will be noticed, are assembled from frame-independent
• system parameters and
• constants of the free motion
Returning with this information to (59) we come at last to the statement
ω dω = (λ1 − ω 2 )(λ2 − ω 2 )(λ3 − ω 2 )
dt
which can be used to compute
16
My derivation of (60) has been freely adapted from the discussion that
can be found in §137 of E. J. Routh, Advanced Dynamics of a System of Rigid
Bodies (6th edition ).
Free motion of a rigid body 31
Tmost = S2 (61.2)
2Ismallest
The intermediate ellipses were produced by incrementing the energy
in equal steps.
Tcritical = S2 (61.2)
2Iintermediate
the curves that enveloped the major axis have fused, to produce an ×
at the puncture-point of the intermediate principal axis.
0
10. Poinsot’s construction. The preceding constructions live in S -space: they
tell us—on the assumption that the values of I1 , I2 , I3 are known, and that the
values of S and T have been prescribed—where S 0 is free to roam, but not how
in time it elects to do so. We have
defines what I will call the T-ellipse: its center is pinned to the origin of ω -space,
and its axes—of lengths
2T/I1 2T/I2 2T/I3
d ω
Poinsot’s construction proceeds now from two key observations, the first
of which is that
∇ω T(ω
ω) = I ω = S
In words: S stands normal to the plane which tangentially kisses the T-ellipsoid
at ω . The planes thus constructed at various times t will, by the invariance of
S , be parallel . Poinsot observed further that those planes are in fact identical,
for the normal distance from origin to plane
ω⊥ = ω · Ŝ
S = 2T/S = constant of free gyro motion
18
That is why we took the trouble to introduce ρ -space !
Poinsot’s construction 37
ρ⊥
ρ
invariant plane
It will prove useful to observe in this connection that if the physical ellipsoid
M = d · 43 πa1 a2 a3
19
See Problem 2-1.
38 Gyrodynamics
—the implication being that the physical ellipsoid and the associated Poinsot
ellipsoid have distinct but qualitatively similar figures. During the first half of
the 19th Century close study of the geometry of polhode and herpolhode appears
to have been a flourishing industry. Many wonderful facts were discovered,
most of which are now forgotten, but some of which can be recovered from
the old textbooks. Webster reports, for example,20 that while polhodes are
invariably reentrant, herpolhodes are usually not, and never possess inflection
points: their name (from ´ρπιν = to creep like a snake) is therefore somewhat
misleading. The old literature provides elaborate figures produced by laborious
hand calculation. It would be amusing—possibly instructive—to use modern
computer resources to recreate some of that material, to produce animated
images of rolling Poinsot ellipsoids, etc.
I must emphasize that Poinsot’s construction pertains to the gyrodynamics
of free bodies. The application of torques would, in general, cause both S and
T to become time-dependent. The formerly “invariant plane” would begin to
move, to wobble, and the center of the inertial ellipsoid to rise and fall with
respect to that plane: the whole construction would become “seasick,” and
rapidly lose its utility.
11. First look at the free gyration of a symmetric top. By “symmetric” we refer
here not to the shape of the body itself, but to the shape of its only dynamically
relevant feature—the inertia ellipsoid (or—reciprocally—the energy ellipsoid in
spin space) . . . though in practice most rigid bodies that are symmetric in the
above sense are axially symmetric also in their spatial form. I will occasionally
allow myself to call such bodies “tops.”
We are obliged at the outset to distinguish (see Figure 13) two principal
classes of axially symmetric tops:
oblate prolate
A ≡ I1 > I2 = I3 ≡ B B ≡ I1 = I2 > I3 ≡ A
Standing at the interface between those two classes is the essentially trivial class
spherical
A ≡ I1 = I2 = I3
of fully symmetric tops. In the presence of symmetry the Euler equations (35)
simplify: we have
0
I1 ω̇1 =0
I3 ω̇2 + (I1 − I3 )ω1 ω3 = 0 : oblate case
I3 ω̇3 − (I1 − I3 )ω1 ω2 = 0
20
A. G. Webster, The Dynamics of Particles and of Rigid Bodies (2nd edition
; Dover reprint ), page 264; see also J. B. Hart, “Incorrect herpolhodes
in textbooks,” AJP 37, 1064 (1969).
First look at the free gyration of a symmetric top 39
B B B B
A A
Figure 13: Symmetric tops come in two flavors. At the top are
triangle diagrams of the sort first encountered on page 8, and below
are oblate/prolate Poinsot ellipsoids (figures of revolution) typical
of that symmetry class.
and 0
I1 ω̇1 − (I1 − I3 )ω3 ω2 = 0
I1 ω̇2 + (I1 − I3 )ω3 ω1 = 0 : prolate case
I3 ω̇3 =0
which share the same abstract structure, together with
0
I1 ω̇1 =0
I1 ω̇2 = 0 : spherical case
I1 ω̇3 =0
We conclude (in a phrase that describes not the motion of the top itself, but
the motion of the vector ω 0 (t) in ω 0 -space) that
giving
det J
d
dt ω2 = ω1 ω2 ω3 = 0 : ω 2 is conserved (69)
I1 I2 I3
For tops-in-general ω and ω 2 are both time-dependent (unless ω happens to be
an eigenvector of I, in which case both are constant: see again (54)), but for all
symmetric tops ω 2 becomes constant, though ω typically continues to wander.
21
H. Goldstein, Classical Mechanics (2nd edition ), page 212.
Detailed account of the free gyration of a symmetric top 41
12. Detailed account of the free gyration of a symmetric top. Knowledge of ω 0 (t),
or even of ω (t), leaves one still an integration away from a description of the
motion R(t) of the physical top itself. To gain the latter kind of understanding
we look to the motion of the Euler angles that serve to describe the relation of
the body frame to the space frame.
We know that S is conserved, and will (without real loss of generality)
look to those motions with the property that S is aligned with the 3 -axis of the
space frame:
0
S = 0 (70)
S
It is also without loss of generality that we will identify the symmetry axis of
the top with the 30 -axis of the body frame . . . which is to say: we will stipulate
42 Gyrodynamics
the symmetry of the top by setting I1 = I2 . The Lagrangian then becomes (see
again (47) on page 26)
0 0 A
and obtain
B(θ̇ sin φ sin ψ + θ̇ cos ψ)
= B(θ̇ sin φ cos ψ − θ̇ sin ψ)
A(φ̇ cos θ + ψ̇)
Evidently
S1 cos φ + S2 sin φ = B θ̇
0 S sin θ0 sin ψ
S 0 = R –1 0 = S sin θ0 cos ψ
S S cos θ0
From the last of the equations (73) we see that the first of the Lagrange
equations (72) can be formulated
which when brought to the preceding equation tells us that the second Lagrange
equation can be written
The equation
B φ̇ cos θ − A(φ̇ cos θ + ψ̇) = 0
has therefore become B φ̇ − S cos θ = 0, which supplies
φ̇ = S : constant (76)
B
Returning with this information to (75) we obtain
effective figure of the symmetric top), as I now show: Returning to (71) with
descriptions of B φ̇ and A(φ̇ cos θ + ψ̇) that were developed just above, we obtain
T = L = 1 1 sin2 θ0 + 1 cos2 θ0 S 2 (79.1)
2 B A
= 1 S −
2 1 1 − 1 S 2 sin2 θ
0
2A 2 A B
= 1 S 2 + 1 1 − 1 S 2 cos2 θ0 (79.2)
2B 2 A B
giving
tan θ0 = 1 − 2T / 2T − 1 (80)
A S2 S2 B
It is gratifying to note in this connection that (61) supplies
1 2T 1 : oblate case
B S2 A
1 2T 1 : prolate case
A S2 B
so the parenthetic expressions under the radical have in all cases the same sign:
we are at (80) never asked
√ to take the square root of a negative number. Making
use now of cos2 = 1/ 1 + tan2 we obtain finally
Ωψ = B − A S 2T − 1 / 1 − 1
AB S2 B A B
=S 1 − 1 2T − 1 (81)
A B S2 B
where again—for reasons just stated—the expression under the radical is in all
cases non-negative.
I enter now upon a series of elementary remarks that culminate in a
celebrated geometrical interpretation of the R(t) implicit in (78):
• From T = 12 ω ·S = 12 ωS cos α and the established facts that for a symmetric
free top not only T and S but also ω are constants of the motion, we see
that for such a top the angle S ∠ ω (we have named it α) is invariant. We
have
2T/S
cos α = (82)
ω
which in the notation of Figure 11 becomes simply cos α = d/ω
• Bringing θ̇ = 0 to the description (42) of ω , we find
ω
φ̇
θ
ψ̇
It follows in any event from (78) that
1
cos2 θ0 + 1
sin2 θ0
cos α = A B
(85)
1
A2 cos2 θ0 + 1
B2 sin2 θ0
Evidently α → 0 as A → B
• The angle ω ∠(symmetry axis)—call it β—can be obtained from
ω30
cos β =
ω
φ̇ cos θ0 + ψ̇
= by (44)
ω
S
A cos θ0
= by (75) and (84)
S A12 cos2 θ0 + B12 sin2 θ0
= 1 (86)
!
A 2
1+ B tan2 θ0
√
Drawing again upon cos = 1/ 1 + tan2 , we have
tan β = ± B
A
tan θ0 (87)
β
θ0 α
invariant plane
space frame
α θ0
β
invariant plane
space frame
• From the figure, which was designed to make plain the meanings of the
invariant angles θ0 , α and β, we read
"
β−α : oblate cases
θ0 = (88)
β+α : prolate cases
Detailed account of the free gyration of a symmetric top 47
from which, after simplifications, we are led back again to precisely (85).
• At (79.2) we had an equation which by (76) and (77) can be written
2T/S = φ̇ + ψ̇ cos θ0
φ̇ + ψ̇ cos θ0
cos α = (89.1)
ω
This equation is structurally reminiscent of an equation
ψ̇ + φ̇ cos θ0
cos β = (89.2)
ω
sin2 β
φ̇2 = ω 2 (90.2)
sin2θ0
• Evidently
which provides the basis for the claim—developed in the following figures—
that the body cone (with vertex angle β) rolls without slipping on the
stationary space cone (vertex angle α).
48 Gyrodynamics
r sin α
r sin β
r
α β
dφ · r sin α = dψ · r sin β
But that is precisely the upshot of (91). The lower figure provides
another representation of the same principle, and makes clear the
fact that rotation of the body cone causes its center to advance
around the space cone in that same sense. The red axis in the
upper figure represents the symmetry axis of the top.
Detailed account of the free gyration of a symmetric top 49
S
ω
30
10
20
ω S 30
10
20
Figure 18: Alternative representations of the body cone rolling
without slipping on the space cone (prolate case above, oblate case
below). The rolling body cone controls the motion of the symmetry
axis of the body—the 30 -axis. The uniform rotation (about that
axis) is controlled by the Ω-vector that was introduced on page 40:
it is retrograde in the prolate case, prograde in the oblate case.
Instability of spin about the intermediate axis 51
B B
A 0< A−B 1
B
B B B − 12 A
0< 1
A B
B B
0< A 1
A B
13. Instability of spin about the intermediate axis.22 Let the Euler equations (35)
of a free asymmetric top be written
0
I1 ω̇1 − (I2 − I3 )ω2 ω3 = 0
I2 ω̇2 − (I3 − I1 )ω3 ω1 = 0
I1 ω̇3 − (I1 − I2 )ω1 ω2 = 0
22
My primary source here has been the discussion presented by S. T. Thornton
& J. B. Marion in §11.12 of their Classical Dynamics of Particles and Systems,
(5th edition ).
52 Gyrodynamics
Assume I1 > I2 > I3 and agree for the purposes of this discussion to omit the 0 s.
Write
ω1
ω = 2 : 2 and 3 infinitesimal
3
to express our presumption that ω lies in the immediate neighborhood of the
principal axis of greatest moment. In leading order (i.e., after abandoning terms
of 2nd order) we have
I1 ω̇1 = 0 ⇒ ω1 is constant
I2 ˙2 − (I3 − I1 )ω1 3 = 0
I1 ˙3 − (I1 − I2 )ω1 2 = 0
we would have been led by a similar argument to the conclusion that 1 and 3
both move like
−iΩ2 t (I2 − I3 )(I2 − I1 )
χ(t) = P e+iΩ2 t
+ Qe with Ω2 ≡ ω2
I3 I1
while
1
ω = 2 : 1 and 2 infinitesimal
ω3
leads to
−iΩ3 t (I3 − I1 )(I3 − I2 )
χ(t) = P e +iΩ3 t
+ Qe with Ω3 ≡ ω3
I1 I2
= 2π/Ω1 + · · ·
with
(A − B)(A − C)
Ω1 ≡ ω 1
BC
—in precise agreement with the result obtained by simpler means on page 52.
Formulæ appropriate to cases in which S 2 falls on the red interval in the
preceding figure (cases, that is to say, in which 2TC S 2 < 2TB and ω 0
circulates about the principal axis of least moment) can be obtained from the
preceding formulae by A C interchange.
Return now to (93) and set S 2 = 2TB to obtain
1
2π
τ1 → τ 2 = 4 ABC 1 dφ (94)
(A − B)(B − C)2T 0 1 − p2 sin2 φ
with
(B − C)(A − B)
p2 = =1
(A − B)(B − C)
τ1 (S 2 ) = F1 (S 2 ) · EllipticK[ p21 (S 2 )]
≡ 1 − q (σ )
2 2
Instability of spin about the intermediate axis 55
EllipticK[ p2 ] ≡ EllipticK[ 1 − q 2 ]
= Q + 14 (Q − 1)q 2 + 64
9
(Q − 76 )q 4 + 256
25
(Q − 37
30 )q + · · ·
6
where
Q ≡ log(4/q) : blows up logrithmically as q ↓ 0
2
The results now in hand could be used to compute τ1 (Scritical + σ 2 ), and by
A C one could without labor obtain a description of τ3 (Scritical − σ 2 ). For
2
Suppose, for example, we set T = 1 and assign to the principal moments the
values A = 4, B = 83 and C = 2 that were used to construct the figures on
pages 32–34: then 2TA = 8, 2TB = 16 3 = 5.333, 2TC = 4 and Mathematica
constructs the graph of τ (S 2 ; 4, 16
3 , 2, 1) presented here as Figure 19.
80
60
40
20
5 6 7 8
23
See E. Jahnke & F. Emde, Tables of Functions (), page 73; J. Spanier
& K. B. Oldham, An Atlas of Functions (), page 612.
56 Gyrodynamics
r N
14. Symmetric tops with a fixed point. Astrophysical reality presents many
examples of semi-rigid bodies rotating semi-freely in intertial space, but we are
seldom inclined to call such objects “tops,” as has been my practice. The tops
of playroom experience derive much of their fascination from the fact that—
inevitably—they are spun in uniform gravitational fields, and are supported
. . . which is to say: they are not free.
Discussions of the precession of tops such as are found in introductory
textbooks24 standardly proceed from diagrams resembling Figure 20. Writing
F = F vertical + F horizontal
x3
x03
x02
x2
ψ
φ x01
x1
line of nodes
where m refers to the total mass of the top, where the final term on the right
is a potential enegy term that was absent from the theory of free rigid rotators,
and where it is to be understood that in the present instance25
A = (center of mass value)
B = (center of mass value) + mh2
pψ − Aφ̇ cos θ
ψ̇ =
A
which when brought to the first equation gives
pφ − pψ cos θ
φ̇ = (97.1)
B sin2 θ
whence
pψ pφ − pψ cos θ
ψ̇ = − cos θ (97.2)
A B sin2 θ
25
This follows directly from (14) if one makes the replacements
r1 → r1
r2 → r2
r3 → r3 + h
and uses (3) to eliminate terms of the form r hρ(ρρ) dr1 dr2 dr3 .
Symmetric tops with a fixed point 59
Note that if θ(t) were known then we could in principle use (97) to figure out
φ(t) and ψ(t).
From the design (95) of L it follows that total energy of the spinning top
is conserved:
E = 12 B(φ̇2 sin2 θ + θ̇2 ) + 12 A(φ̇ cos θ + ψ̇)2 + mgh cos θ = constant (98)
But 12 A(φ̇ cos θ + ψ̇)2 = 12 p2ψ /A was seen at (96) to be conserved all by itself, so
we have conservation of
At this point it becomes natural to mimic methods borrowed from the mechanics
of one-dimensional conservative systems, writing (for example)
dθ =
B E − V(θ)
2
dt
⇓
θ
transit time θ → θ = 1 dϑ
θ 2
B E − V(ϑ)
5000
4000
3000
2000
1000
Π Π
2
For many purposes it is, however, very useful to notice that (99) can, by a
change of variables θ → u ≡ cos θ, be brought to the form
u̇2 = (α − βu)(1 − u2 ) − (p − qu)2
= f (u) : cubic in u (100)
and to proceed under the presumption that physically self-consistent values
have been assigned to α ≡ 2E/B, β ≡ 2mgh/B, p ≡ pφ /B and q ≡ pψ /B.
From
f (u) ∼ βu3 for u large
from
f (±1) = −(p ∓ q)2 < 0
(we agree to exclude temporarily the exceptional cases p ∓ q = 0) and from the
fact that for our results to admit of physical interpretation it must be the case
that −π < θ < π (−1 < u < +1), we conclude that in physically realistic cases
f (u) must be of the form graphed in the following figure:
-1 1
-1
-2
-3
p − qu
ψ̇ = (B/A)q − u
1 − u2
p − qu
φ̇ = (101)
1 − u2
• as it spins and precesses it nutates under the control of θ(t), the detailed
motion of which is, by
θ = arccos u
again under the conrol of u(t), which oscillates back and forth—periodically
but non-sinusoidally—between the turning points u1 and u2 .
Looking to (101) we see that φ̇ vanishes at u = u ≡ p/q. If u lies between
u1 and u2 then φ̇ reverses sign as u proceeds u1 → u2 and the symmetry axis
of the top traces a looping curve such as appears at the top of the following
figure. If—exceptionally—u = u2 we get cusps (middle of the figure),26 while
if u > u2 then φ̇ retains the same sign as u proceeds u1 → u2 and we get the
undulating curve shown at the bottom of the figure.
The elementary theory sketched on pages 56–57 provided no indication of
the nutation exhibited by real tops. We are in position now to recognize that
a top will display nutation-free precession if and only if the turning points u1
and u2 are coincident: u1 = u2 = u0 , where u0 marks the point at which the
effective potential
(p − qu)2
V (u) ≡ V(arccos u) = + βu
1 − u2
and look for the root of the quartic numerator that lies on the physical interval
[−1, +1 ]. We note in this connection that the numerator is quadratic in (p−qu),
26
Though mathematically exceptional, such cusps are in fact observed if one
spins up a top and then “drops” it with (initially) φ̇ = θ̇ = 0.
62 Gyrodynamics
(1 − u20 ) ! "
p − qu0 = q ± q 2 − 2βu0
2u0
so in leading order
q pψ A
φ̇0+ = = = ω
u0 B cos θ0 B cos θ0 spin
β mgh mgh
φ̇0− = = =
2q pψ A ωspin
It is interesting that φ̇0− is (in leading order) θ0 -independent; i.e., that steady
precession can occur at any tilt. This may account for why it is that—according
to Thornton & Marion 22 that “it is the slower of the two [steady] precessional
velocities that is usually observed.” In any event (to rephrase in more physical
terms a point established just above), steady precession of neither sort can
occur unless
4Bmgh cos θ0
ωspin >
A2
Much more could be said about the physics of toy tops, a subject which has
first charmed, then challenged, many of the greatest classical theorists. Here I
quote the 25 -year- old Maxwell, writing in :27
“To those who study the progress of exact science, the common
spinning-top is a symbol of the labours and the perplexities of men
who had successfully threaded the mazes of the planetary motions.
The mathematicians of the last age, searching through nature for
problems worthy of their analysis, found in this toy of their youth,
ample occupation for their highest mathematical powers. . . We find
Euler and D’Alembert devoting their talent and their patience to the
establishment of the laws of the rotation of solid bodies. Lagrange
has incorporated his own analysis of the problem with his general
27
See pages 246–262 in Volume I of W. D. Niven (editor), The Scieintific
Papers of James Clerk Maxwell ().
64 Gyrodynamics
treatment of mechanics, and since his time Poinsôt has brought the
subject under the power of a more searching analysis than that of the
calculus, in which ideas take the place of symbols, and
intelligible propositions supersede equations.” Maxwell continues
with a reference to “. . . the top which I have the honour to spin
before the Society. . . ”
But for further particulars and finer details I must refer my reader to §5 -7 in the
1st and 2nd editions ( and ) of Goldstein, and to additional references
cited there.
S (θ)
δ 2θ
ω B A
Writing
we have # $ # $2 # $2
A+B 2
S1 − 2 + S2 = A−B
2
The implication is that the vectors S (θ) all lie on the circle shown in the
preceding figure. On two occasions S (θ) and ω are parallel:
2θ = 12 π + δ
with
Σ1 ≡ 12 (A − B) sin 2ψ sin θ
Σ2 ≡ 12 [A sin2 ψ + B cos2 ψ − C ] sin 2θ
Σ3 ≡ (A sin2 ψ + B cos2 ψ) sin2 θ + C cos2 θ
Our assignment is to describe the Σ (ψ, θ)-vector, which R(φ) serves simply to
twirl about the 3-axis (the ω -axis), with these consequences:
giving
2 (a − b) sin 2ψ] sin θ
1
Σ1 = [
Σ2 = [ 2 (a + b) − 2 (a − b) cos 2ψ] sin θ cos θ
1 1
A>B>C
A S3 C
can be written
S12 + S22 + (S3 − C − k)2 = k 2 (104)
From
dk(ψ) (A − B)(A − C)(B − C) sin 2ψ
=
dψ (a sin2 ψ + b cos2 ψ)2
we learn that
It is on the basis of (104) that I have constructed Figure 26, and from the
figure we discover that half the story remains untold , for the figure appears to
indicate that I possesses (in addition to the isolated eigenvalue C ) a continuum
of eigenvalues lying between B and A. The point to notice is that we obtained
(104) by eliminating θ between the following two equations:
2 a+b a−b 2
S12 + S22 = a−b
4 sin 2ψ + 4 − 4 cos 2ψ cos 2
θ sin2 θ
a+b a−b 2 2
S3 = 4 − 4 cos 2ψ sin θ + C
If, on the other hand, we had undertaken to eliminate ψ we would have obtained
with
a+b 2
= (θ) ≡ 2 − ab sin2 θ
2
= A+B
2 − C − (A − C)(B − C) sin2 θ
68 Gyrodynamics
From (105) we are led to Figure 27, which again tells only half—the
other half—of the story. It is only by conflating those figures—by taking (104)
and (105) in combination—that we obtain a description of the set of points to
which parameters φ, θ and ψ can be simultaneously assigned, a description of
the curious region to which S (φ, θ, ψ) is necessarily confined.
The ω -axis punctures that “crescent of revolution” at only three points,
and those mark the eigenvalues of I.
It is evident from Figure 28 that
1
[(greatest eigenvalue) − (least eigenvalue) ]
δ = arcsin 12 (106)
2 [(greatest eigenvalue) + (least eigenvalue) ]
y = Mx
x : M a real 3 × 3 symmetric matrix
16. Theory of celts. Footballs, hardboiled eggs, tippy tops . . . all behave in
counterintuitive ways when spun, and each has generated a literature.30 Here I
propose to discuss only one of those curiosities. The story begins in the British
Museum, where one day in the s the physicist G. T. Walker had reason to
examine that museum’s collection of “celts”—smooth axhead-like stones found
in abundance at paleolithic sites all over Europe and the British Isles—and
chanced to notice that many of them, while they spun easily in one direction,
first wobbled and then reversed course when spun in the opposite direction.
28
Such an equation relates stress to strain in elastic media, polarization to
electric field strength in dielectric media, etc.: the list could be very greatly
extended.
29
Culmann (–) was a German professor of civil engineering who
is remembered today mainly for his contributions—some of which had been
anticipated by Maxwell—to “graphical statics.” Mohr (–) taught civil
engineering first in Stuttgart and then (from until his retirement in ) in
Dresden. He was said by his student A. Föppl (who himself figures importantly
in the history of electrodynamics, and whose texts influenced the development
of the young Einstein) to have been an outstanding teacher: a tall, proud and
taciturn man who spoke and wrote with simplicity, clarity and conciseness.
“Mohr’s stress circle”—the contribution for which he is today remembered—
provided the basis for his theory of stress failure; for an account of something
called the “Coulomb-Mohr fracture criterion” see (for example) C. C. Mei,
Mathematical Analysis in Engineering (), p. 150. For an electrodynamical
application of Mohr’s idea—having nothing at all to do either with tops or with
fracture—see my classical electrodynamnics (), p. 127.
30
A fairly extensive bibliography—which is, however, by no means complete
—begins at page 146 in gyrodynamics (/).
Theory of celts 71
We are led to look at the bottom of the x = 0 cross-section of the celt, where
z(y) = −c 1 − (y/b)2
31
Quarterly Journal of Pure & Applied Mathematics 28 (1896), pages 175–
184.
32
“The rigid body dynamics of unidirectional spin,” Proceedings of the Royal
Society (London) 405A, 265 (1986).
72 Gyrodynamics
K ≡ d (slope) = 1 d arctan dz = z
3
ds
1 + (z ) dy
2 dy [1 + (z )2 ] 2
which in the instance at hand supplies
Figure 30: The formerly homogeneous celt has now been “loaded”
in such a way as to preserve the location of the center of mass, and
to preserve also the z-axis as a principal axis, but to slew the other
principal axes with respect to the associated geometrical axes.
px2 + qy 2 = k 2
34
Note that a > b entails p < q.
35
Mathematica confirms that the eigenvalues of M(ψ) are {p, q} and that
det M(ψ) = pq for all values of ψ.
74 Gyrodynamics
0.5
-1 -0.5 0.5 1
-0.5
-1
2
P x2 + 2Rxy + Qy 2 + z = 1
c
which in the near neighborhood of the point of static support (i.e., for small x
and y) becomes
= − c 1 − 12 (P x2 + 2Rxy + Qy 2 ) − · · · (109)
Importance will attach in the dynamical theory to the unit normal vector
at the point to contact (see Figure 32). To obtain a description of that vector,
Theory of celts 75
ϕ(x, y, z) ≡ 12 c (P x2 + 2Rxy + Qy 2 ) − z
ϕ(x, y, z) = c
We then have
c(P x + Ry)
∇ϕ = c(Rx + Qy)
−1
giving
∇ϕ
n=
∇ϕ|
|∇
c(P x + Ry)
= 1 c(Rx + Qy)
1 + c (P x + Ry)2 + c 2 (Rx + Qy)2
2
−1
where the abandoned terms are of higher than second order in x and y.
76 Gyrodynamics
In that same leading approximation the vector that extends from the center of
mass to the instantaneous contact point becomes
x
r0 = y (111)
−c
so we have
m(v̇v 0 + ω 0 × v 0 ) = mg n0 + f 0
The motion of ω 0 is described by Euler’s equation
ω 0 + ω 0 × I 0 ω 0 = r 0 ×ff 0
I 0ω̇
36
It is principally because Hoyle works in 2nd order that his equations are
relatively so complicated, and the significance of his results so hard to grasp
intuitively.
Theory of celts 77
Here
A 0 0
I0 = 0 B 0 (112.3)
0 0 C
n = 0 because the table is (by assumption) flat, giving
Additionally we have ṅ
n0 + ω 0 × n 0 = 0
ṅ (112.4)
And finally, because we assume the celt rolls without slipping, we have
v0 + ω 0 × r0 = 0 (112.5)
Equations (112) provide the physical basis of Walker’s theory (also of Hoyle’s).
In working out the consequences of these equations let us now agree, as a matter
of typographic convenience, to drop the 0 s.
a × b)× c = (cc · a)bb − (cc · b)a
Cross (112.4) into the unit vector n and use (a a
n ×n
to obtain ṅ n· ω )n
n + (n n − (n
n· n)ω
ω = 0 or
n ×n
ω = ṅ n + ω n with ω ≡ (n
n· ω ) (113.1)
= magnitude of ω
v = r × (ṅ
n ×n
n + ω n) (113.2)
ω = n̈
ω̇ n ×n n + ω ṅ
n
v̇v = ṙr × (ṅ
n ×n
n + ω n) + r × (n̈
n ×n
n + ω ṅ
n)
—can be used to turn (112.2) into an equation involving only n, r and their
derivatives. I postpone that substitutional exercise.
The unit normal n, though fixed with respect to the table frame, moves
relative to the body frame (in which we are now working), and Walker takes
that apparent motion to be the indicator of what the celt is doing. Hoyle, on
the other hand, elects to watch the motion of r (i.e., of x and y, which refer
37
Here we accept Walker’s intuition-based assertion that ω will be constant
in leading order, that its temporal variation will be a higher-order effect. It
was in an effort to avoid such ad noc assertions that Hoyle worked in second
order.
78 Gyrodynamics
ṙr = J ṅ
n
38
The 4 factors have been introduced to identify terms that we imagine to
be “small,” and to provide Mathematica with means to identify and discard
second order terms as they arise: at the end of the day we will set 4 = 1.
Theory of celts 79
0 = + n̈1 [B + mc 2 ]
+ ṅ1 [−mcρ ω]
+ ṅ2 [+(A + B − C)ω + 2mc 2 ω − mc β ω]
+ n1 [(−A + C − mc 2 + mc α)ω 2 − mg(c − α)]
+ n2 [mρ(g + cω 2 )]
0 = + n̈2 [A + mc 2 ]
+ ṅ1 [−(A + B − C)ω − 2mc 2 ω + mc α ω]
+ ṅ2 [+mcρ ω]
+ n1 [mρ(g + cω 2 )]
+ n2 [(−B + C − mc 2 + mcβ)ω 2 − mg(c − β)]
M n̈
n + (S + A) ṅ
n + Kn = O (114)
where
B+mc 2 0
M=
0 A+mc 2
− mc ρ ω
1
2 mc (α−β) ω
S= 1
2 mc (α−β) ω + mc ρ ω
(115)
0 +[A+B−C+2mc2 − 12 mc(α+β)] ω
A=
−[A+B−C+2mc2 − 12 mc(α+β)] ω 0
(−A+C−mc 2 +mc α) ω 2 −mg(c−α) mρ(g+c ω 2 )
K=
mρ(g+c ω )2
(−B+C−mc 2 +mcβ) ω 2 −mg(c−β)
The later four parameters are hidden in the designs of α, β and ρ. In the absence
of misalignment we have ρ = 0. The matrices M, S and K are symmetric; A is
antisymmetric.
Notice that reversing the sign of what Hoyle calls the “spin” (ω → −ω)
sends
M n̈
n + (S + A) ṅ
n + Kn = O
↓
M n̈
n − (S + A) ṅ
n + Kn = O
Which is to say: Celts spun or are described by distinct equations of motion,
already in first- order theory. The chirality of celt dynamics is thus made
immediately apparent.
The Walker/Gray argument becomes at this point a cleverly executed
exercise in stability theory. To expose the elegance of their idea without the
distraction of notational clutter, let us write
m1 0
M=
0 m2
−ρσ s
S =
s ρσ
(116)
0 a
A =
−a 0
k1 ρκ
K=
ρκ k2
which entails
0 = det[−Ω2 M + iΩ( S + A) + K ]
≡ (K − ρ2 κ2 ) − (µ − ρ2 s2 )Ω2 + M Ω2
+ ρ i(σ · ∆k − 2sκ)Ω − iσ · ∆m Ω3
-2 2
-1.5
39
It is tempting but would be incorrect to call the Ω’s “eigenvalues”—though
they are, like eigenvalues, roots of a polynomial—and it would for that same
reason be incorrect to call the ν ’s—which are in general not orthogonal—
“eigenvectors.”
82 Gyrodynamics
Go to a point in parameter space where Γfast and Γslow are both positive. Both
of the exponentials in (123) then diminish as t increases. Such motion is stable.
Notice now that
s ≡ ω · 12 mc(α − β)
σ = ω · ρ mc
—which enter as factors into the construction of both Γ ’s—reverse sign when
the celt is spun in the opposite direction: ω → − ω. That sign reversal is
passed on to the Γ ’s, with the consequence that the exponentials blow up: the
purported motion (123) has been revealed to be unstable: the small-amplitude
theory has declared itself irrelevant to the subsequent physics.
To pursue that physics one would have to wrestle with the equations of
motion in their full non-linear complexity (which, so far as I am aware, has
never been attempted). The most variable to watch (because most informative)
would be not n but ω , for which one expects to obtain motion ω (t) of the form
small large oscillations small
small −→ large oscillations −→ small
large small large, but of opposite sign
Hoyle has remarked in this connection that so far as concerns the spin of the
celt (angular momentum with respect to its center of mass) we have (in the
space frame)
d
dt S = r ×f
f
d
= mrr × [ dt v − gn]
from which it follows in particular that
d
d
dt (normal component of S ) = n · dt S
n·(rr ×
= mn d
dt v )
= −m dt
d
v ·(n
n × r)
The stable solutions of (123) die exponentially, which is to say: they lose
energy—an effect we normally attribute to dissipation. Odd in the present
instance, since no dissipation mechanism was built into the model . The
resolution of this little paradox must lie in the circumstance that energy leaks
into higher-order aspects of the motion which the first- order theory is powerless
to take into account. . . just as the first- order theory is powerless to temper the
seeming “explosions” in the unstable case.41
I draw attention finally to the fact that it is the S -term in (115) that
accounts for celtic chirality. One has
0 s 0 a
lim S = : resembles A=
ρ↓0 s 0 −a 0
where both s and a depend linearly upon ω. But in the 0th -order theory one
encounters only s2 and a2 , both of which are insensitive to the sign of ω. In
first order one encounteres s (also σ) and a2 : A is again blameless.
It follows that
I2 I2
ϕ̇1 = + α̇ dϕ1 = + dα
I1 + I2 I1 + I2
⇐⇒ (125)
I1
I1
ϕ̇2 = − α̇ dϕ2 = − dα
I1 + I2 I1 + I2
from which (124) are readily recovered as corollaries. Suppose now that the
device has been programed so as to cause α to increase/decrease periodically,
and to synchronously flex its elbows in such a way as to make
large when α̇ > 0 : “ inhaling”
J1 ≡ I2 /(I1 + I2 )
small when α̇ < 0 : “exhaling”
J1
α
42
This last problem has been addressed in a profound way by A. Shapere &
F. Wilczek in “Gauge kinematics of deformable bodies,” AJP 57, 514 (1989).
The paper appears also as §8.3 in Geometric Phases in Physics (), which
they edited, and provides the basis of the discussion which begins on page 89
below.
43
Actually, spin conservation. Without essential loss of generality we will
assume that initially—and therefore for all time, in the continued absence of
impressed torques—S S = 0.
86 Gyrodynamics
ϕ1
ϕ2
2Π 4Π
Figure 36: The green curve (short dashes) describes the motion of
I1 , the solid red curve the motion of I2 , the blue curve (long dashes)
the motion of α. Notice that I2 is dominant when α is increasing,
I1 is dominant when α is decreasing. In constructing the figure I
have set a = 2, b = ω = αmax = 1.
2Π 4Π 6Π
which yields a ϕ1 -advance per period (equal necessarily to the ϕ2 -advance per
period) given by
π
∆ϕ = 4 (b/a)αmax
Notice that t enters into the preceding integral only via the dimensionless
product ωt: it is for this reason that ∆ϕ is independent of the time τ = 2π/ω
that it takes for the device to complete a stroke, a deformation cycle.
Deformable bodies 89
β β
β β
Identifying the frame attached to {α, β}initial with a fixed reference frame in
physical space (the space frame), we introduce R (t) to describe the relationship
of that frame to the frame carried by the deformed blob after it has been
transported along C to the shape {α(t), β(t)} (see Figure 40). The oriention of
the deformed blob is determined by physical principle (conservation of angular
momentum), but the way it wears its frame is arbitrary. Our problem is to find
some way to distinguish what’s physical from what’s merely conventional.
From R (t)T R(t) = I it follows familiarly that it is always possible to write
Ṙ = A R with AT = −A
t
Now, the solution of ṙ(t) = a(t)r(t) (i.e., of r(t) = r0 + 0
a(t )r(t ) dt ) is easily
seen to be
t
r(t) = exp a(t )dt · r0 (128)
0
Deformable bodies 91
d d
dt r(t) = a(t)r(t) =⇒ dτ R(τ ) = A(τ )R(τ )
of Ṙ = A R . Shapere & Wilczek interpret this to mean that the R(t) that
results from transport along a “curve in shape space” depends not at all upon
temporal specifics of the process, but only upon the geometry of the curve C.
Equation (129) makes clear also that specification of A(t) is sufficient in
principle to determine R(t).
Our further progress will be facilitated by some notational adjustment. Let
us write α1 = α, α2 = β, and let us recognize that R(t) means R(α1 (t), α2 (t))
which we will abbreviate R(α(t)). Then
R ;i ≡ R ,i − Ai R (131)
We then have
QR ;i = Q R ,i − QAi R
= (R̂ ,i − Q ,i R) − Q Ai R
= (R̂ ,i − Q ,i Q –1 R̂) − Q Ai Q –1 R̂
and insist upon
= R̂ ,i − Âi R̂
t
48
This is obtained by iteration of R(t) = R0 + 0 A(t )R(t ) dt , and gives
back (128) when all A(t) -matrices commute with one another. Here P is the
“chronological ordering” operator, the characteristic action of which becomes
evident from
A(t1 )A(t2 ) : t1 t2
P[A(t1 )A(t2 )] =
A(t2 )A(t1 ) : t2 t1
Deformable bodies 93
the precise meaning of which depends upon the structure assigned to the gauge
fields Ai (α), which are constrained only by (132): it is here—by contrived
specification of Ai (α)—that we will have an opportunity to slip some physics
into this formal scheme. The matrices R(α + dα) and R(α + dα) will, in general,
be distinct. If, however, they are identical then we say that R(α) −→ R(α +dα)
has proceeded by parallel transport , and can write
If the parallel transport is along a curve α(t) in shape space—here t might but
need not signify time—then we have R(t + dt) = R(t) + Ai (t)R(t)α̇i dt or
49
Said another way, we have
It is from the circumstance that both matrices on the left side of the second
equation attach to the same point in shape space that R ;i (α) acquires its
superior transformation properties.
94 Gyrodynamics
α(t)
α(s)
α(0)
Figure 41: Two curves inscribed on shape space that are coincident
except in the localized neighborhood of a pimple, where they differ
only infinitesimally.
and to distribute such pimples along the length of the curve we have only to
write
t t s
δR = P exp A(τ ) dτ · δA(s) · P exp A(τ ) dτ ds
0 s 0
But
δA(s) = Aj (s) · δ α̇j (s) + α̇i (s) · δAi (s)
∂Ai j
= Aj · ds
d
δαj (s) + α̇i (s) · δα (s)
∂αj
so t t s
δR = P exp etc. · Aj ds d
δαj · P exp etc. ds
0 s 0
t t s
∂Ai i
+ P exp etc. · j
α̇ · P exp etc. δαj (s)ds
0 s ∂α 0
where it is understood that the factors between dots are to be evaluated at s.
The first term we integrate by parts to obtain (after noting that by assumption
δαi (s) vanishes at the endpoints: δαi (0) = δαi (t) = 0 )
t t s
∂Aj i
− P exp etc. − Ai ȧ Aj +
i
α̇ + Aj Ai ȧ P exp
i
etc. δαj (s)ds
0 s ∂αi 0
Deformable bodies 95
giving finally
t t s
δR = δα (s) P exp
j
etc. · Fji · P exp etc. α̇i (s)ds (137)
0 s 0
and52
∂i Âj − ∂j Âi = Q(∂i Aj − ∂j Ai )Q –1 + same stuff!
50
“On Einstein’s λ transformations,” Phys. Rev. 103, 780 (1956). For my
immediate source, see pages 134–137 in Chapter 2 of classical dynamics
(/).
51
Note in this connection that if the matrices Ai and Q were number-valued
instead of matrix-valued, then (132) would read
Âi = Ai + ∂i Q
Fij = ∂i Aj − ∂j Ai = F̂ij
We conclude (see the preceding figure) that transport of I around a finite closed
curve C—a cycle of shapes—has a rotational consequence that can be described
RC ≡ P exp A(τ ) dτ I = Fij (α) dαi ∧ dαj
C R
and that (since the initial and final reference frames are identical ) RC is gauge
invariant.53
We have now to pour some physics into the mathematical vessel that we
have been at such pains to construct. Erect an inertial frame at (let us say) the
center of mass of a (let us say) spinless system of particles:
S= mi r i × ṙri = 0
i
53
Recall from electrodynamics that
A · ds = (Ai,j − Aj,i ) dxi ∧ dxj
∂R
Imagine now that we have—in some continuous but otherwise arbitrary way—
associated a frame (origin coincident with that of the space frame) with every
possible configuration of the system. At time t the system finds itself in some
specific configuration, to which we have associated a frame. We agree to write
r i to describe the position of mi relative to that momentary frame, and
r i = R(t)rri
giving
Ω = −I –1 S
whence54
B = Bij with Bij = 2ijk (I –1S )k
At this point we
• can (but are under no obligation to) identify the particles mi with the
component parts of our deformable blob;
• can (but are under no obligation to) take our configuration-associated
frames to be principal axis frames.
Whatever our position with respect to the exercise of those options, we
• extract from the physics of the system a description of (compare (135))
B = Bi α̇i whence of Bi (which will be defined not everywhere in shape
space, but only where it is needed: on the curve pursued by the system),
with the aid of which we play the parallel transport game. Should the
system ever revisit a point in shape space we will be able to announce
whether it has experienced a net rotation as a result of its dynamical
zero -spin adventures (contortions).
54
It becomes clear only at this point why Shapere & Wilczek look to the
iterative solution of Ṙ = R B rather than (which is more common) of Ṙ = A R.
The resulting formalism is literally the transpose of that described on pages
90–96. Transposition entails reversal of the chronological ordering.
98 Gyrodynamics
S
N intrinsic = Ṡ
= I ω̇
ω + ω ×Iω
when referred to the space frame (generally non-inertial translated copy of the
inertial lab frame), on page 15 we drew attention to the fact that those equations
read
N 0 = I 0 ω̇
ω 0 + ω 0 ×I0 ω 0
—which is to say: they preserve their structure—when referred to the wobbly
body frame. I want now to discuss how this remarkable fact comes about.
Let a wobbly red frame which shares the origin of—but be in a state of
arbitrary rotation with respect to—the space frame, and write
to describe the relationship between the red and black coordinates of any given
point. Immediately
If the space frame were inertial (which generally it is not) then to describe
the dynamics of a single particle we would write F = mr̈r, which when referred
to the wobbly frame becomes
F = m r̈r + 2U ṙr + ( U̇ + U 2 )rr
with F ≡ WF
F and m ≡ m. The preceding equation is often written
with
which is not the result ( WS S = S ) that one might have anticipated. We note
also in this connection that, by appeal once again to the lemma,
W I W –1 = − mi W (rri ×)W –1 W (rri ×)W –1 = − mi (rri ×)2 ≡ I (145)
i i
N ≡ WN N
= WṠS
= Ṡ + İ Ω + I Ω̇
S Ω − ẆS
S
S + İ Ω + I Ω̇
= Ṡ Ω − Ẇ W –1 · WS
S
S + İ Ω + I Ω̇
= Ṡ Ω + Ω × (S
S + I Ω)
S + Ω ×S
= (Ṡ S ) + ( [ İ Ω + I Ω̇
Ω ] + Ω×IΩ)
100 Gyrodynamics
N − (Ω
Ω ×S
S + İ Ω ) − (I Ω̇
Ω + Ω × I Ω ) = Ṡ
S (146)
But the expressions in braces are familiar already from (143). The striking
implication is that (146) can be written
N + N Coriolis + N centrifugal = Ṡ
S (148)
where
N Coriolis ≡ r i ×F
F iCoriolis (149.1)
i
N centrifugal ≡ F icentrifugal
r i ×F (149.2)
i
with respect to which all particles are at rest: ṙri = 0 (all i). From (147.1) and
!
S ≡ mir i × ṙri it then follows that (Ω
Ω × S + İ Ω ) and Ṡ
S both vanish: (146)
therefore becomes
N − (I Ω̇
Ω + Ω×IΩ) = 0 (150.1)
which are usually written
N = I Ω̇
Ω + Ω×IΩ (150.2)
and called the Euler equations. Equations (150.1) serve at once to
illuminate and to deepen the mystery that motivaated this discussion, for they
establish that Euler’s equations might most properly be expressed
Notice that if the system were deformable (which is to say: not rigid) then
it would be impossible to select a frame with respect to which all ṙri terms vanish:
one would be forced to work with some instance of (148). The implication is
that it should be possible to get from (148) to the Liouville equations,56 which
are used by astrophysicists to study the rotational dynamics of stars, planets
and astroids. I must admit, however, that I do not at present know how to do
so.
56
See Problem 10 in Chapter 5 in H. Goldstein’s Classical Mechanics (2nd or
rd
3 editions).