Lecture Notes
Lecture Notes
i
Contents
-1 Introduction 1
-1.1 Group theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
-1.1.1 Polyhedra . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
-1.1.2 Colouring . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
-1.1.3 Polynomial roots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
-1.1.4 Cryptography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
0 Preliminaries 6
0.1 Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
0.2 Maps a.k.a. Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
0.3 Relations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
0.4 Integers, or: Rem(a)inders from Arithmetic . . . . . . . . . . . . . . . . . . . 13
0.4.1 Division . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
0.4.2 Congruence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
0.5 Complex numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
0.6 Matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.3 Subgroups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
1.3.1 Basic groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
1.3.2 Matrix groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
1.3.3 Cyclic groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
1.3.4 Dihedral groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
1.3.5 Permutation groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
1.3.6 Generating new subgroups . . . . . . . . . . . . . . . . . . . . . . . . 34
1.4 Guises of the same group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
1.5 Group Actions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
1.5.1 Group actions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
1.5.2 Concrete examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
1.5.3 Abstract examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
1.5.4 New actions from old . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
1.6 Orders . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
1.6.1 Order of a group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
1.6.2 Order of an element . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
2 Families of groups 51
2.1 Congruence groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
2.1.1 Z/nZ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
2.1.2 U(n) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
2.2 Cyclic groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
2.2.1 Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
2.2.2 Fundamental Theorem of Cyclic Groups . . . . . . . . . . . . . . . . 58
2.2.3 Cyclicity of U(n) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
2.3 Dihedral groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
2.3.1 Definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
2.3.2 Order of a dihedral group . . . . . . . . . . . . . . . . . . . . . . . . . 64
2.3.3 Subgroups of dihedral groups . . . . . . . . . . . . . . . . . . . . . . 66
2.3.4 The center of a group . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
2.3.5 Generators and relations . . . . . . . . . . . . . . . . . . . . . . . . . . 68
2.3.6 Infinite dihedral group . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
2.4 Symmetric groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
2.4.1 Fundamentals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
2.4.2 Order of the symmetric group . . . . . . . . . . . . . . . . . . . . . . 72
2.4.3 Composing and inverting permutations . . . . . . . . . . . . . . . . . 73
2.4.4 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
2.4.5 Cayley’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
2.4.6 Cycles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
2.4.7 Inverting and composing cycles . . . . . . . . . . . . . . . . . . . . . 79
2.4.8 Permutations in terms of cycles . . . . . . . . . . . . . . . . . . . . . . 80
2.4.9 Cycle Decomposition Theorem . . . . . . . . . . . . . . . . . . . . . . 82
2.4.10 Cycle type . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
iii
Contents
iv
Chapter -1
Introduction
1
Chapter -1. Introduction
-1.1.1 Polyhedra
Consider a cube. We cannot easily “reflect” it in three-dimensional space without the aid
of a mirror, but we may rotate it in a few different ways while maintaining its shape:
• Rotating by 90◦ , 180◦ , or 270◦ about an axis through the midpoints of two opposing
edges.
• Rotating by 180◦ about an axis through the midpoints of two opposing edges.
• Rotating by 120◦ or 240◦ about a “grand diagonal”, an axis through two diagonally
opposing vertices.
You can see a visualization here. Observe that with each type of rotation, the faces of the
cube are permuted in different ways! Something that group theory studies is what these
symmetries each change and what they preserve, as well as how they interact with each
other.
-1.1.2 Colouring
Suppose we have a bunch of beads of various colors. How many necklaces can be made
out of those beads? Basic permutation results aren’t enough here: the “starting bead”
doesn’t matter but the order of the beads matter! This is a type of symmetry—“rotating”
the necklace preserves the order of the beads. Similarly, we may also “flip” the necklace
and introduce another type of symmetry.
2
Chapter -1. Introduction
Example -1.1.1 — For a more tractable example, suppose we want to make a necklace
out of four beads that are red, yellow, green, and blue respectively. The only distinct
necklaces are the following:
Simply listing the possibilities becomes infeasible when we introduce more colors and/or
beads. Group theory to the rescue: Burnside’s lemma can be used to count items featuring
symmetries like this by counting everything equivalent up to symmetry as the same. We
will learn about this sometime in November.
b2 − 3ac
1 k
rk = − b+ξ C+ (k = 0, 1, 2)
3a ξk C
where s p
3 b2 − 3ac ± (b2 − 3ac)2 − 4(2b3 − 9abc + 27a2 d)3
C=
2
and √
−1 + 3i
ξ=
2
. . . you’re not expected to know this offhand.
And there is one for quartics as well:
3
Chapter -1. Introduction
But mathematicians got those formulas in the 1500s, and they started looking at quintics.
They got stuck. For two hundred years. The question becomes: is the formula just really
complicated or is it straight up impossible?
Finally in 1799 Ruffini came up with a partial proof that yes, some quintic polynomials
just don’t have a solutions in nice formulas like those. Abel finished up the proof some
years after. Later Galois and Cayley developed criteria so that we know precisely which
polynomials are solvable and which ones aren’t. The tools they developed along the
way evolved into what we now call Galois theory, which, among other things, is used to
investigate the behaviour and relationship of roots of polynomials.
This will be covered in more detail in MAT401, so that’s something to look forward to.
-1.1.4 Cryptography
The art of secret messages is another inspiration to the formalization we see in group
theory.
The main idea of secret messages is to convert a message to gibberish is a reversible way.
The idea is someone who knows the secret, should be able to make sense of the gibberish.
But someone who does not know the secret should not be able to decipher the gibberish.
Ideally, you want the ability to have secret communication with many people (often people
who don’t necessarily know you) without setting up a system of secrets each time. In other
words, there have to be many possible secrets. One should be unable to narrow down
which secrets a particular person might use. A secret for us is an identifier of the exact
process of gibberishizing you re using. You want random person to be unable to try out all
secrets on your gibberish and find out your message.
Thus, we take a large ‘group’ or set of reversible transformations.
Example -1.1.2 — You may have seen the RSA cryptosystem in MAT246 (and a much
simpler proof of its mechanism can be had with group theory!). The idea is:
• Take a large modulus m which is a product of two primes p and q. Publish m
while keeping p and q secret.
• Choose an encryption key e and give it to the person with whom you wish to
communicate.
• Compute the decryption key d using e, p, q and use it to decrypt the messages
encrypted with e.
Given a fixed m, we can pick many different encryption keys e1 , . . . , en to give to dif-
ferent people—which improves the safety of communication while keeping decryption
nice and simple. These valid encryption keys are derived from the structure of the
group behind the RSA cryptosystem, and we will talk about this group in a few weeks.
4
Chapter -1. Introduction
Exercise 1. What does the set of all valid encryption keys look like here?
Such a system allows for multiple layers of security. For example, healthcare data may
be encrypted multiple times while being sent to different agencies, so that identifiable
information is obscured and at each step, an agency only knows information essential to
their operation. When the processed data is sent all the way back to the hospital, we may
apply decryption schemes along the way and retrieve information for each patient.
5
Chapter 0
Preliminaries
0.1 Sets
A set is a collection of things under consideration, and a subset is a collection of some of
those things—including potentially all of them as well as none of them. To write down a
set, you can either write down all its elements:
∅
Some common sets we will be making use of are:
• N = {1, 2, 3, . . . }: set of natural numbers (sometimes including 0),
• Z = {. . . , −1, 0, 1, 2, . . . }: set of integers,
p
• Q= q
: p, q ∈ Z, q ̸= 0 : set of rational numbers,
6
Chapter 0. Preliminaries
Given two subsets A and B of a set X, here are the most important ways to form new
subsets of X.
The union of two sets is the set of elements contained in at least one of them. That is,
A ∪ B = {x : x ∈ A or x ∈ B (or both!)}.
The intersection of two sets is the set of elements contained in both of them. That is,
A ∩ B = {x : x ∈ A and x ∈ B}.
Two sets are disjoint if their intersection is empty. Visually, disjoint sets don’t overlap.
A∪B X A∩B X
A B A B
The complement of a subset is the set of things not in it. More precisely,
A or Ac = {x ∈ X : x ̸∈ A}.
The relative complement of one set, A, in another set, B, is the set of things in B that are not
in A. That is,
B − A or B \ A = {x ∈ B : x ̸∈ A}.
A X A−B X
A B A B
Exercise 2. Write A ∩ Ac = ∅ in plain language (using the word “disjoint”), and then
prove it.
Exercise 3. Let A ⊆ X. Show that the “complement of A” is the same thing as the
“relative complement of A in X”.
7
Chapter 0. Preliminaries
A partition of a set is a collection of subsets that divide it up. Formally, Ai ⊆ X for all i, and
[
X= Ai ,
i
and Ai ∩ Aj = ∅ for all i ̸= j, then we say the Ai ’s partition X (also: form a partition of X).
Example 0.1.1 — The sets {1, 2} and {3, 4} partition the set {1, 2, 3, 4}, but the sets
{1, 2, 3} and {1, 2, 4} do not.
−3 −2 −1 0 1 2 3
The Cartesian product of two sets A and B is the set of ordered pairs (a, b) where a is in A
and b is in B. That is,
A × B = {(a, b) : a ∈ A, b ∈ B}.
f : R → R,
f(x) = x2 .
8
Chapter 0. Preliminaries
Example 0.2.1 — The squaring map on the set {0, 1, 2, 3} can be written as
0 1 2 3
0 1 4 9
in two-line notation.
Given sets X, Y and a map f : X → Y, here are the most important objects derived from f.
The image of a subset A ⊆ X under f is the set of all the values f(a) where a ranges just
over A.
f(A) = {y : y = f(a) for some a in A}.
Example 0.2.2 — Let f(x) = x2 + 1 from R to R. Then im f = [1, ∞) while f([−2, 1)) =
[1, 5].
The graph of f is the set of pairs (x, f(x)) as x ranges over X, formally
f is injective or one-to-one (or even just 1-1) if it doesn’t send different inputs to the same
output.
Exercise 6. Suppose f is injective, and f(x) = f(y). What can you deduce about x and
y?
f is surjective or onto if the image equals the codomain. Surjectivity only makes sense if you
specify a codomain!
Exercise 7. Suppose f is surjective, and y is in Y. What can you deduce about f and
X?
9
Chapter 0. Preliminaries
x x x
Given another map g : Y → Z, g composed with f (or g after f) is the map obtained by
applying f and then g. That is, (g ◦ f)(x) = g(f(x)).
Exercise 8.
X Y Z
f g z1
x1 y1
x2 z2
x3 y2 z3
y3 z4
x4 z5
Exercise 9. Show that the composition of two injective functions is injective. Do the
same with “injective” replaced by “surjective”.
Take care not to confuse this notation with the inverse function f−1 : The preimage is a set,
while f−1 is a function. We may talk about the preimage of sets under any function, but
only bijective ones have an inverse.
Exercise 10. What are the preimages of each element in Z under g in the previous
example? What about g ◦ f?
10
Chapter 0. Preliminaries
Example 0.2.4 —
X X
f
a a
b b
c c
d d
Probably the most important self-map is the identity map x 7→ x, sometimes explicitly
denoted id or idX .
Self-maps are interesting because they can be iterated. Given f : X → X, the nth iterate of f
is defined as f composed with itself n times, denoted fn . For convenience, we set f0 = idX .
Exercise 13. For the function f in the previous example, write out fn for n = 0, . . . , 6.
What do you notice?
0.3 Relations
Given a set X, a relation on X is, formally, a subset R of X × X (the set of pairs (x, y) with
x and y in X). For any x, y in X, we say x is related to y (but not necessarily vice versa) if
(x, y) is in R, denoted xRy.
R is reflexive if all elements are related to themselves. That is, xRx for all x.
R is symmetric if the relation goes both ways. That is, if xRy then yRx as well (for all x and
y).
11
Chapter 0. Preliminaries
R is antisymmetric if no two distinct elements are mutually related. That is, if xRy and yRx,
then x = y.
R is transitive if you can “remove the middleman” in a chain of relations. That is, if xRy
and yRz, then xRz.
A relation that is reflexive, symmetric, and transitive is called an equivalence relation.
Equivalence relations are denoted ∼ instead of R.
“Our human condition is such that [the relation x loves y] is, alas, neither reflexive,
symmetric, nor transitive.”
—Seth Warner, Modern Algebra
Exercise 14.
Fill out the properties of the following relations.
x, y are people R? S? T?
“x loves y”
“x is aware of y”
“x and y were married at some point”
“x is an ancestor of y”
“x looks like y (Think about the Ship of Theseus paradox!)”
“x is not younger than y”
“x has been to the same school as y”
“x is born in the same year as y”
Exercise 15. When is a relation possibly symmetric, transitive, but not reflexive?
Given an equivalence relation ∼, an equivalence class is a complete set of elements that are
all related to one another.
The equivalence class of x is denoted [x] = {y ∈ X : x ∼ y} and every equivalence class has
this form.
Exercise 16. Show that any two elements in [x] are related.
Exercise 18. Revisit Exercise 12 by showing that the relation “x ∼ y if f(x) = f(y)” is
an equivalence relation. What are the equivalence classes?
12
Chapter 0. Preliminaries
42 = 4 · 9 + 6.
When r = 0, we say b divides a and write b | a. A number is prime if it has just two
divisors. Divisibility is a transitive and reflexive relation on Z. It is neither symmetric nor
antisymmetric, but if a | b and b | a then a = b or a = −b.
A number is prime if it has just two divisors.
A common divisor of two numbers is a number dividing them both. The greatest common
divisor or gcd of two numbers is just that—the biggest of the common divisors. The gcd
has the wonderful property that if c | a and c | b then c | gcd(a, b). Two numbers are
coprime if gcd(a, b) = 1.
Dually, a common multiple of two numbers is a number they both divide. The least common
multiple or lcm of two numbers is just that—the smallest of the common multiples. Like
the gcd, the lcm has the wonderful property that if a | m and b | m then lcm(a, b) | m.
The lcm and the gcd are related by the formula
13
Chapter 0. Preliminaries
0.4.2 Congruence
Fix an integer m, called the modulus. Say a ≡ b (mod m) if and only if m | a − b.
Congruence modulo m is an equivalence relation on Z. [Check this!] When m is clear from
context, the equivalence class of a is denoted [a], while the set of equivalence classes is
variously denoted
Z/m or Z/(m) or Z/mZ or Zm . In this course, we use Z/mZ.
The set Z/mZ inherits addition, subtraction, and multiplication from Z, meaning that you
can add, subtract, and multiply equivalence classes (of the same modulus).
In other words, one defines
[a] + [b] = [a + b] and [a] · [b] = [ab]
and then checks that these operations are well-defined.
Example 0.4.2 — Z/24Z has twenty-four elements, [0], [1], [2], . . . , [23].
The set of complex numbers can be visualized as the complex (or Argand) plane:
14
Chapter 0. Preliminaries
ℑ{z}
2i
i 2+i
ℜ{z}
−2 −1 1 2
−i
3
−1 − 2
i
−2i
z = x − iy = x + iy = z
1 z
Exercise 19. If z ̸= 0, show that z
= |z|2
.
We can also divide complex numbers by getting rid of the imaginary part in the denomina-
tor:
a + ib (a + ib)(c − id) ac + bd bc − ad
= 2 2
= 2 2
+i 2
c + id c +d c +d c + d2
The argument of z is the counterclockwise angle, in radians, from the positive real axis to
the line segment connecting z and the origin.
15
Chapter 0. Preliminaries
ℑ{z}
−2 + 2i
2i
| − 2 + 2i| i
−2 −1 1 2
−i
−2i
z = reiθ
which can be obtained by using the Taylor series for the exponential, and recognizing the
Taylor series for sine and cosine.
Euler’s identity allows us to easily convert between the polar and Cartesian coordinates.
Polar coordinates are most useful for multiplication, division, and exponentiation owing
to identities we know about exponentiation:
The unit circle is the set of complex numbers with modulus 1. These points form a circle on
the complex plane.
ℑ{z}
i
ℜ{z}
−1 1
−i
16
Chapter 0. Preliminaries
Exercise 20. Show that the nth roots of unity all have the form e2πik/n for some k in Z.
ℑ{z}
1 ℜ{z}
0.6 Matrices
An m-by-n matrix over R is an array of real numbers with m rows and n columns.
We typically use capital letters (A, B, C, . . . ) for matrices, and lowercase letters (a, b, c, . . . )
for their entries, subscripted by row and then column.
The set of all m-by-n matrices is denoted Mm×n (R).
Note, some people write Mm×n (R) as Rm×n . This is fine, but beware—R2×2 ̸= R4 !
Given an m-by-n matrix A = (ai,j ) and an n-by-p matrix B = (bk,l ), their product AB is the
m-by-p matrix of dot products of the rows of A with the columns of B. Explicitly,
X
n
(AB)i,j = ai,k bk,j .
k=1
4 0 5
17
Chapter 0. Preliminaries
The transpose of an m-by-n matrix A is the n-by-m matrix whose rows are the columns of
A. That is, (AT )i,j = aj,i for all i, j. Just flip it over its diagonal:
T 1 4
1 2 3
= 2 5 .
4 5 6
3 6
A square matrix is a matrix with the same number of rows as columns. If A is an n-by-n
square matrix, an inverse of A is a matrix B such that AB = BA = I. Not every matrix has
an inverse—just consider
0 0
A=
0 0
—but when an inverse exists, it’s unique, and we denote it A−1 . A matrix whose inverse
exists is called invertible.
Finally, the determinant of a square matrix A = (aij ) is defined as follows. For a 1-by-1
matrix,
det a = a,
and for a larger matrix,
X
n
det A = (−1)i+j ai,j det A
e i,j (2)
j=1
18
Chapter 0. Preliminaries
You’re also allowed to expand down any column; the formula for column expansion has the
same shape as (2) but this time the sum is over i (the rows) and it’s j (the column) that’s
fixed.
Recall that the determinant is multiplicative:
19
Chapter 1
In other words, a ⋆ b is the image of the element (a, b) under the function ⋆. In more other
words, a ⋆ b = ⋆(a, b).
Recall that the elements of S × S are ordered pairs (a, b) with a and b both in S. A function
⋆ : S × S → S has to map each ordered pair somewhere. If a and b are distinct, then the
ordered pairs (a, b) and (b, a) are distinct, too. There’s no reason why ⋆ should send
distinct ordered pairs to the same place. Thus, we generally do not have
a ⋆ b = b ⋆ a. (1.1)
20
Chapter 1. Groups and subgroups
− −2 −1 0 1 2 3
−2 0 −1 −2 −3 −4 −5
−1 1 0 −1 −2 −3 −4
0 2 1 0 −1 −2 −3
1 3 2 1 0 −1 −2
2 4 3 2 1 0 −1
3 5 4 3 2 1 0
As with matrices, Cayley tables are indexed by row and then column. The (a, b)-entry in
the Cayley table is a ⋆ b.
In any Cayley table, the elements should appear in the same order down the right as across
the top. When S is finite, Cayley tables can be used to completely describe (hence define)
binary operations.
Example 1.1.7 (Rock–Paper–Scissors) — Consider the set M = {r, p, s}, where the
elements stand for rock, paper, and scissors. Define x ⋆ y to be the winner of the match
if x ̸= y, and define x ⋆ x = x when it’s a tie. Thus, for example, r ⋆ p = p and p ⋆ s = s.
21
Chapter 1. Groups and subgroups
⋆ r p s
r p
p s
s
1.1.2 Associativity
Question — In Rock–Paper–Scissors, what is the value of
r ⋆ p ⋆ s?
1+2+3+4+5
as
1 + ((2 + (3 + 4)) + 5).
However, non-associativity of subtraction is the reason we (should) never write
1−2−3−4−5
even though, in this case, most people would argue (vehemently and to the death) that it’s
−13 because they’re reading it left to right. But what about
Exercise 23. Explain every “no” in the table above. (That is, find a counterexample).
23
Chapter 1. Groups and subgroups
in this course.
Definition 1.1.9 — A composition law is an associative binary operation.
Definition 1.1.10 — A group is a set G with a composition law ⋆ (called its group
operation) and a distinguished element e satisfying these two axioms:
Identity: a ⋆ e = e ⋆ a = a for all a in G
Inversion: for each a in G there exists b in G such that a ⋆ b = b ⋆ a = e
We denote this (G, ⋆, e). Often we simply write (G, ⋆) or even G and let the rest be
implied.
Intuitively, the Identity axiom says “You Can Do Nothing” while the Inversion axiom says
“You Can Undo Anything”—these axioms give us the structure of the invertible symmetries
perspective on groups that we discussed in Week 1.
Any element satisfying the Identity axiom is called an identity; any element satisfying the
Inversion axiom is called an inverse.
Aside. The traditional definition of a group (which you may see in your textbooks) says that a
group is a set G with an operation ⋆ satisfying these four axioms:
Associativity: ⋆ is associative
The traditional definition is unsatisfactory for a few technical reasons. First, the Closure axiom
becomes redundant once you ask ⋆ to be a binary operation. Second, the Associativity axiom
is moreso a property of the operation than of the elements. Third, the traditional definition
doesn’t clarify that the ‘e’ that’s asserted to exist in the Identity axiom is the same ‘e’ that
appears in the Inversion axiom.
24
Chapter 1. Groups and subgroups
Exercise 25. Show that for any a in G, there is a unique element b such that a ⋆ b =
b ⋆ a = e. [That is, show that if b ′ is another inverse for a, then b ′ = b.]
Remark 1.1.11. This unique element is called the inverse of a and denoted a−1 .
a⋆c=b⋆c
a=b
c⋆a=c⋆b
a=b
Note the importance of the sides of the expressions we are working with: We do not have in
general that c ⋆ a = b ⋆ c implies a = b [When do we have this?].
The definition of groups is actually a little stronger than we require. In fact, we can weaken
the axioms so that we only check one side of the equalities.
Exercise 28. Let G be a set with an associative binary operation ⋆ and a distinguished
element e satisfying these two axioms:
• a ⋆ e = a for all a in G (Axiom of Right Identity)
• for each a in G there exists b in G such that a ⋆ b = e (Axiom of Right Inversion)
These are like the group axioms, except they’re only required to hold “on one side”. In
this exercise you will prove that any structure (G, ⋆, e) satisfying these weaker axioms
is actually already a group.
a) Prove that G has the right-cancellation property: a ⋆ c = b ⋆ c implies a = b.
b) An idempotent is an element i such that i⋆i = i. Show that e is the only idempotent
in G. How is this related to left-cancellation?
c) Show that every right inverse is a left inverse.
d) Show that e is a left identity.
e) Explain why we are done.
25
Chapter 1. Groups and subgroups
1.1.6 Notation
Composition laws have lots of notations, like ⋆, ∗, ◦, ·, ×, ⊗, +, ⊕, ... But when we’re
dealing with a single group, there’s only one composition law involved—so we can get
away with not writing it at all. (It’s also kind of annoying to write ⋆ all the time.) This is
called the multiplicative notation.
If the composition law is commutative, the group is called abelian. Some people write the
composition law in abelian groups using a plus sign (+), but we’ll stick to the multiplicative
notation except in very concrete cases, like Z/nZ under addition.
Indeed, we will use more notation inspired by those found in multiplication and addition:
Definition 1.1.12 — Let G be a group with identity element e and let g ∈ G. For each
integer n, define gn as follows:
if n > 0, put
gn = g · . . . · g
| {z }
n times
if n < 0, put
gn = (g−1 )−n
and if n = 0, put
g0 = e.
aabbbcdddd = a2 b3 cd4 .
a + b + c + b − a = 2b + c.
To summarize,
notation multiplicative additive
operation a · b or ab a+b
identity e or 1 0
inverses a−1 −a
powers an na
Note in particular that we will use multiplicative notation for function composition.
Multiplicative notation is convenient as it behaves largely the same way multiplication
does.
26
Chapter 1. Groups and subgroups
Proof. Exercise.
1.2 Examples
1.2.1 Basic groups
Wherever addition is defined, we may have a group; the set in question must contain zero
(the additive identity) and be closed under negation (to form additive inverses).
Thus N is not a group under addition, but Z, Q, R, and C are.
Similarly, wherever multiplication is defined, if the set contains 1 (the multiplicative
identity) and is closed under reciprocation (to form multiplicative inverses), then we have
a group. Using a superscript × to denote the set of “multiplicatively invertible” elements,
we find that Q× , R× , and C× are Q\{0}, R\{0}, and C\{0} respectively, and they are all groups
under multiplication.
27
Chapter 1. Groups and subgroups
so neither [3] nor [4] are invertible. (If they were, say [3][a] = [1], then we’d have
so [5] is invertible.
By restricting our attention to invertible elements, we obtain:
(Z/nZ)×
Example 1.2.1 — (Z/6Z)× = {[1], [5]} and (Z/8Z)× = {[1], [3], [5], [7]}.
Exercise 31. What are the invertible elements of Z/nZ? What are the invertible
elements of Z/pZ if p is a prime?
We can also consider matrices with entries in Z/mZ. However, det A ̸= [0] is no longer
enough—we need det A to be invertible modulo m. That is,
28
Chapter 1. Groups and subgroups
G×G→G
(e, e) 7→ e
Exercise 32. Check that the trivial group is a group. What is its Cayley table?
Similarly, in the additive group modulo n, every element can be written as a sum of [1]’s.
Therefore,
Z/nZ = ⟨[1]⟩.
The group µn is the set of complex nth roots of unity under multiplication. That is,
µn = {z ∈ C : zn = 1}.
Since the nth roots of unity are of the form e2πik/n , we may write
µn = ⟨e2πi/n ⟩.
∗
multiple in additive notation
29
Chapter 1. Groups and subgroups
and reflections
Let r denote the one-fifth clockwise turn and let f denote the flip over the vertical axis.
Then every rotation is a power of r.
We can also express every reflection in terms of just f and r. For example, to produce a
flip across the orange axis, first flip across the red axis (f) then turn two-fifths clockwise
(r2 ), yielding fr2 .
f r2
30
Chapter 1. Groups and subgroups
1.3 Subgroups
Definition 1.3.1 — A group H is a subgroup of a group G if H is a subset of G and
the group operation on H is the same as the operation on G. The subgroup relation is
written
H≤G
meaning “H is a subgroup of G”.
Proof. The ‘only if’ (forward implication) is easy. For the converse, we show that the three
properties (i), (ii), and (iii) hold, albeit in a different order.
Start with (ii). Since H is non-empty, there is some element a in H. By hypothesis, aa−1 ∈ H.
But aa−1 = e, so e ∈ H.
Next, (iii). Let a ∈ H. By (ii) and the hypothesis, ea−1 ∈ H. But ea−1 = a−1 , so a−1 ∈ H.
Finally, we show (i). Let a, b ∈ H. By (iii) and the hypothesis, a(b−1 )−1 ∈ H. But
(b−1 )−1 = b, so ab ∈ H.
31
Chapter 1. Groups and subgroups
Z ≤ Q ≤ R ≤ C,
because the difference of two integers (resp. rationals, reals) is an integer (resp. rational,
real). (Of course, 0 ∈ Z.)
Similarly,
Q× ≤ R× ≤ C× ,
because the quotient of two nonzero rational (resp. real) numbers is rational (resp. real).
(Also, 1 ∈ Q× .)
To show that SLn ≤ GLn , just note that I ∈ SLn because det I = 1, and if A, B ∈ SLn , then
so AB−1 ∈ SLn .
Exercise 34. Prove that On is a matrix group. Conclude that SOn is a matrix group.
32
Chapter 1. Groups and subgroups
is a matrix group.
⟨gk ⟩ ≤ G.
For example, we saw that Z under addition forms a cyclic group generated by 1. Thus,
⟨k⟩ ≤ Z for every integer k. In particular, the set of even numbers is a subgroup of Z.
∗
Also see Klein Four.
33
Chapter 1. Groups and subgroups
◦ e ρ σ τ
e
ρ
σ
τ
We’ll talk more about symmetric groups and permutation groups in weeks 5 and 6.
⟨S⟩ = {g±1 ±1
1 . . . gk : k ≥ 0 and gi ∈ S}.
Proposition 1.3.3
Let S ⊆ G. Then ⟨S⟩ ≤ G.
Proof. ⟨S⟩ is non-empty, because we can always form the empty product to get e. And if
a, b ∈ S then
a = gϵ1 1 . . . gϵkk and b = hδ11 . . . hδl l ,
where gi , hj ∈ S and ϵi , δj ∈ {1, −1}. Thus
All k + l terms are in S and all the exponents are 1 or −1, so ab−1 ∈ ⟨S⟩.
Remark 1.3.4. ⟨S⟩ is in fact the smallest subgroup containing S—any subgroup containing
S must contain all elements of the form g±1 ±1
1 . . . gk , and so it must contain ⟨S⟩.
If S is a singleton (i.e. n = 1), say S = {g}, then ⟨S⟩ = ⟨g⟩ is called the cyclic subgroup
generated by g. Of course, the whole group G is cyclic iff G = ⟨g⟩ for some g in G.
34
Chapter 1. Groups and subgroups
⟨ 21 , 13 ⟩ = { n2 + m
3
: n, m ∈ Z}
n m 3n + 2m
+ = .
2 3 6
Since 3(−3) + 2(5) = 1, this subgroup is actually cyclic—every element is an integer
multiple of 61 .
⟨2, 3⟩ = {2n 3m : n, m ∈ Z}
is the subgroup of fractions whose numerator and denominator (in lowest terms) are
divisible by 2 and 3 only. For example,
2 256 1
6, , , ∈ ⟨2, 3⟩
3 243 1024
but 5 is not.
Exercise 38. Show that the group in the above example cannot be cyclic. (That is,
show that there is no g ∈ Q× such that ⟨g⟩ = ⟨2, 3⟩.
35
Chapter 1. Groups and subgroups
Question — When are two mathematical objects the same? In other words, when can
we interchange one thing with the other in a question?
As it turns out, the answer depends on how we define sameness. For example, consider
the two sets S = {a, b, c, d, e} and T = {1, 2, 3, 4, 5}. Of course, the elements have different
names, so they’re different in that manner. However, they do have the same size, and we
may view these two sets as interchangeable in set-theoretic contexts.
Exercise 39. Consider the two questions “how many three-letter words can you make
from the letters in S (with repeats)?” and “how many functions are there from {1, 2, 3}
to T ?”. Why are these two questions in fact the same question?
The notion of “interchangeable” sets allows us to answer questions concerning finite sets
by just answering questions about sets {1, 2, 3, ..., n}.∗
When we introduce additional structure on the object in question, the notion of “sameness”
also expands to include these structure. For example, if we are to ask questions about
a vector space V, we know these questions have the same answers as with the vector
space W if there is an invertible linear transformation T : V → W—linear transformations
preserve vector addition and scalar multiplication. This allows us to reduce the study of all
finite dimensional vector spaces to just the study of Fn , where F is the corresponding field.
Returning to groups, we may ask when two groups are the same. An understanding of
this type will allow us to answer questions about groups more easily, by treating many
groups as guises of other groups that we’ve already studied.
Definition 1.4.1 — Given two groups (G, ⋆) and (H, ◦), we say G and H are isomorphic
if there is a bijection ϕ : G → H such that for all a, b ∈ G, we have
Remark 1.4.2. The operation on the left-hand side is done in G (before f is applied) whereas
the operation on the right-hand side is done in H (after f is applied). In the future we will
drop the operation entirely and write
ϕ(ab) = ϕ(a)ϕ(b)
36
Chapter 1. Groups and subgroups
∼ is an equivalence relation.
Exercise 40. Show that =
Exercise 41. Show that R× is isomorphic to GL1 (R). (Is the latter group abelian?)
A visual way to check isomorphisms of small groups is to draw both Cayley tables, colour
each element of one group uniquely, and colour the corresponding elements in the other
group the same colour. If the resulting patterns match, then the groups are isomorphic.
For example, writing ω = e2πi/3 , we can consider Z/3Z and µ3 = {1, ω, ω2 }:
Exercise 42. Show that the groups listed at the beginning of this section are isomor-
phic.
37
Chapter 1. Groups and subgroups
Say two puzzle pieces are rotation-equivalent if you can rotate one so it looks like the other.
How many puzzle pieces are there up to rotation-equivalence?
The answer is 6. Each puzzle piece has one of the following shapes:
Mathematically, we just determined the orbits of the action of the group C4 on the set of
puzzle pieces.
38
Chapter 1. Groups and subgroups
Proposition 1.5.2
Let G ↷ X. Then “x ∼ y iff g . x = y for some g in G” is an equivalence relation.
hg . x = h . g . x = h . y = z.
Definition 1.5.3 — The equivalence classes of the relation above are called orbits. The
orbit of x is denoted OrbG (x) or Gx, and the set of equivalence classes is denoted X/G.
That is,
OrbG (x) = Gx = {g . x : g ∈ G}
and
X/G = {Gx : x ∈ X}.
Remark 1.5.4. Note that, since orbits are equivalence classes, the set of all orbits form a
partition of X.
Example 1.5.5 — Consider again the action of C4 on the set of puzzle pieces. There
are 6 orbits, indicated by colour. Some orbits are small (like those of and ) while
others are large (like those of , , and ). Note that the total number of puzzle pieces
is the sum
1 + 4 + 4 + 2 + 4 + 1 = 16
because the orbits partition the set.
39
Chapter 1. Groups and subgroups
Remark 1.5.6. A right group action is just like a left group action except that the function
goes from X × G to X and is denoted (x, g) 7→ x . g. The difference stems from flipping the
compatibility axiom into (x ↷ g) ↷ h = x ↷ gh—the action by gh on x is performed g
and then h, as opposed to h and then g (as in function composition).
The other notations are flipped as well: we write X ↶ G to mean G acts on X on the right
(“right-acts”), and the orbit space of a right action is denoted G\X.
Below we will introduce many types of group actions for future reference.
Example 1.5.8 — The group µ2 acts on any group G by inversion: for all x in G,
1.x = x
−1 . x = x−1
Since the identity and involutions are self-inverse, while other elements are not, the
orbits are either pairs {x, x−1 } where x ̸= x−1 or singletons {x} where x = x−1 (including
{e}).
0 . g = g0 = e ̸= g in general, and
Example 1.5.10 — Dn acts on the n-gon (more precisely, parts of the n-gon) by flips
and rotations on the right: we agreed that e.g. fr should be interpreted as first flip, then
rotate.
Example 1.5.11 — Consider the equilateral triangle with colored vertices. Now we
have its rotations
40
Chapter 1. Groups and subgroups
and reflections
We may say that D3 acts on the red, green, and blue vertices of the triangle, such that
rotation by 120◦ clockwise sends the red vertex to the (original position of the) green
vertex. We may also say that Dn acts on the (unlabelled) edges of the triangle. We
may even say that Dn acts on the set of six possible configurations of the equilateral
triangle.
The orbit of any vertex is the whole set of vertices; the same holds for any edge, mutatis
mutandis.
Exercise 45. For n ≥ 4 an n-gon has 12 n(n − 3) diagonals. How many orbits of those
are there?
Example 1.5.12 — Given a fixed origin, O3 (R) acts on objects in the three-dimensional
space by rotations and reflections about that origin. SO3 (R) acts on them by rotations
only.
Exercise 46. Show that Dn is a subgroup of O2 (R) using their geometric definitions.
Example 1.5.13 — SX ↷ X for any set X in the obvious way: σ . x = σ(x). In particular,
Sn acts on {1, ..., n}.
x 7→ ax + b (a ̸= 0)
acts on R in the obvious way. There is only one orbit—for any two real numbers u and
v, the affine map x 7→ x + v − u sends u to v.
Exercise 47. For which pairs of distinct points (u1 , u2 ) and (v1 , v2 ) does there exist an
affine map sending ui to vi ?
41
Chapter 1. Groups and subgroups
Example 1.5.15 — Assume for simplicity that we can identify musical notes with
their fundamental frequencies.
On a standard modern 88-key piano,∗ the A above middle C sounds the frequency 440
Hz, and the ratio between the frequencies f0 < f1 of two successive keys (white–black,
black–white, or white–white) is
f1 √
12
= 2 = 1.059...
f0
Thus the frequencies of the 88 keys (starting four octaves below 440) ranges from 27.5
Hz to just over 4186 Hz.†
For reference, the human hearing range is commonly given as 20 Hz to 20000 Hz,
which, in musical terms, ranges from a fifth below the bottommost piano-note to two
octaves and a minor third above the topmost piano key.
A unifying feature of nearly all musical traditions is the observation that two frequen-
cies sound “the same” when the ratio between them is an integer power of 2. This is
called octave equivalence.‡
So, let
X = {440 · 2k/12 : k ∈ Z}
be the “idealized” Western musical scale, whose elements we’ll refer to as pitches, and
let G = ⟨2⟩ be the cyclic subgroup of R× generated by 2. Then G ↷ X by transposition
by octaves, and the orbit space X/G has exactly twelve elements (called pitch classes):
Example 1.5.16 — They say you’re supposed to “rotate your mattress” every so often
to prevent it from sagging. Certain models can (and therefore must) also be “flipped”.
Together, these physical manoeuvres give an action of D2 on the ideal mattress (only
42
Chapter 1. Groups and subgroups
theoretical).
Unfortunately, D2 is not cyclic. That means you can’t “cycle” the mattress through all
possible configurations by repeating a single, easy-to-remember action. Consequently,
mattress companies have devised complicated mattress-flipping schemes detailing
when and how to flip your mattress for maximum performance.
Exercise 48. Show that orbits are G-invariant, and that every G-invariant subset is a
union of (zero or more) orbits.
Example 1.5.21 (Subgroup action) — Let H ≤ G. Then H acts on X the same way G
does.
Example 1.5.22 (Function action) — Let Y be a set and let Y X denote the collection of
all functions X → Y. Then G acts on Y X on the right by f . g = (x 7→ f(g . x)).
43
Chapter 1. Groups and subgroups
Exercise 49. Show that the function action is indeed an action—that it satisfies the
identity and compatibility axioms.
Exercise 50 (Opposite action). Let x . g = g−1 . x for all x in X and all g in G. This turns
the left action of G on X into an equivalent right action. In terms of the original (left)
action, what is the orbit of x under the opposite (right) action? Show that the opposite
of the opposite is the original.
1.6 Orders
1.6.1 Order of a group
With all the structures on G, we would like to count it.
Definition 1.6.1 — The order of a group G, denoted o(G) or |G|, is the number of
elements in G. If that number is infinite, we say the group has infinite order, and we
write o(G) = ∞.
g . h = gh.
We also claim that the orbits all have the same size. Indeed, given an orbit xH, consider the
function H → xH defined by h 7→ xh. This function is
• injective: if h1 and h2 map to the same place, then xh1 = xh2 , so by left-cancellation,
h1 = h2 , and
• surjective: if y ∈ xH, then y ∈ OrbH (x), so y = xh for some h in H, so h maps to y.
Thus h 7→ xh is a bijection, so H and xH have the same size. In particular, every orbit has
size o(H).
Since orbits are equivalence classes, they partition G. All the orbits have the same size
o(H). Since G is finite, o(G) is a multiple of o(H) i.e. o(H) divides o(G).
During the course of the proof, several very important concepts came up.
• the group action—H acting on G by right multiplication—is called the right subgroup
action of H on G,
• the orbits—the sets of the form xH—are called left cosets (of H (in G)),
• the set of orbits—the set of left cosets of H in G—is denoted G/H, and
• the number of orbits—the size of the set G/H—is called the index (of H (in G)),
denoted [G : H].
Remark 1.6.4. Note that only in the end of the proof did we use finiteness of G. The rest of
the proof of Lagrange’s theorem shows that
o(G) = [G : H]o(H)
o(G)
[G : H] = .
o(H)
Clearly, the cosets partition Z/24Z. Each coset has 6 elements (the order of H) and
there are 4 cosets in total (the index of H). And indeed, 6 times 4 is 24.
45
Chapter 1. Groups and subgroups
Exercise 51. Show that if a group G has a finite subgroup of finite index, then G is
finite.
Exercise 52. Give an example of an infinite group with an infinite subgroup of infinite
index.
Exercise 53. Show that [Z : ⟨n⟩] = n for each n > 0. What is [Z : ⟨0⟩]?
Exercise 54. Let G = Z and let H = ⟨n⟩. In the notation from the proof of Lagrange’s
theorem, show that a ∼ b if and only if a ≡ b (mod n). What does this prove about
the sets Z/⟨n⟩ and Z/nZ?
Remark 1.6.6. Note that we often denote the subgroup H = ⟨n⟩ by nZ, hence the Z/nZ
notation.
Remark 1.6.7. Note that in this case, our G/H is itself a group under gH + g ′ H = (g + g ′ )H.
This is not true in general—we will explore when G/H is a group in the future when we
talk about normal subgroups.
* !+
1 2 3
Exercise 55. Let G = S3 and H = . Can you put a group structure on
2 1 3
G/H using the group operation from G?
Example 1.6.9 — The identity element is the only element of order 1. That is, o(g) = 1
iff g = e.
46
Chapter 1. Groups and subgroups
[a] [0] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11]
o([a]) 1 12 6 4 3 12 2 12 3 4 6 12
gm = gqn+r = (gn )q gr = eq gr = gr .
Corollary 1.6.13
Let n = o(g) be finite. Then gi = gj iff i ≡ j (mod n).
The following fundamental Theorem connects the two meanings of the word “order”.
Theorem 1.6.14
The order of an element is equal to the order of the cyclic subgroup it generates.
Proof. Suppose g has infinite order. Then there is no nonzero k such that gk = e, and it
follows that ⟨g⟩ is infinite.
So, suppose g has finite order n. We want to show two things: first, that
and second, that the set on the right-hand side actually has n elements.
First: The elements of ⟨g⟩ are, by definition, integer powers of g. Given gk , let r be the
remainder of k when it’s divided by n. Then k ≡ r (mod n) so, by the Corollary, gk = gr .
Note that 0 ≤ r < n, so every power of g equals something in this list.
Second: To show that this list has no repeats, suppose gi = gj for some 0 ≤ i ≤ j < n. Then
i ≡ j (mod n), again by the Corollary. But i and j are both less than n, so i = j.
Putting it all together, we conclude that the elements of ⟨g⟩ are just the n powers e, g, g2 , . . . ,
gn−1 . Therefore o(⟨g⟩) = n = o(g).
The relationship between o(x), o(y), and o(xy) is complicated. One might hope that
o(xy) = o(x)o(y)
—but that’s false in general. For example, o(xx−1 ) = 1 regardless of what o(x) is.
48
Chapter 1. Groups and subgroups
Proposition 1.6.16
Let g be an element of finite order. Then
o(g)
o(gk ) =
gcd(o(g), k)
n km k
| = m.
d d d
n k n
Since and are coprime, by the magic of number theory, | m.
d d d
[2]2 = [4], [2]3 = [8], [2]4 = [5], [2]5 = [10], [2]6 = [9],
[2]7 = [7], [2]8 = [3], [2]9 = [6], and [2]10 = [1].
We can easily compute the order of any other element now. For example,
10 10
o([5]) = o([2]4 ) = = = 5,
gcd(10, 4) 2
10 10
o([7]) = o([2]7 ) = = = 10,
gcd(10, 7) 1
and
10 10
o([10]) = o([2]5 ) = = = 2.
gcd(10, 5) 5
49
Chapter 1. Groups and subgroups
Proposition 1.6.18
If x and y commute (that is, xy = yx) and have coprime orders, then o(xy) = o(x)o(y).
because (xy)k = e and xm = e. By the Division lemma, n = o(y) | mk. Since gcd(n, m) = 1,
we have n | k. By symmetry, m | k.
Now m = o(x) and n = o(y) both divide k = o(xy). Since they’re coprime, their product
divides o(xy) as well. Therefore o(xy) = o(x)o(y).
Exercise 61. Show that o(xy) = o(yx) regardless of whether x and y commute.
50
Chapter 2
Families of groups
Z/nZ = ⟨[1]⟩.
But [1] is not the only generator! [−1] = [n − 1] also works, and in concrete cases we can
find many others:
and
⟨[8]⟩ = {[8], [4], [0]} ⪇ Z/12Z
so [7] generates Z/12Z while [8] does not.
Exercise 62. Can you guess, in general, how to tell whether [a] generates Z/nZ?
Proposition 2.1.2
[a] generates Z/nZ iff gcd(a, n) = 1.
Remark 2.1.3. By the Lemma in the handout, gcd(a, n) is independent of the representative—
if [a] = [b], then gcd(a, n) = gcd(b, n).
51
Chapter 2. Families of groups
Proof. First of all, [a] generates Z/nZ iff o([a]) = n. [Why?] In particular, since Z/nZ =
⟨[1]⟩, we have o([1]) = n. Expressing [a] = a[1] as a “power” of the generator, we can
appeal to the order formula:
o([1]) n
o([a]) = = .
gcd(o([1]), a) gcd(n, a)
Example 2.1.4 — The generators of Z/12Z are [1], [5], [7], and [11]. The other classes(’s
representatives) are divisible by 2 or 3.
The number of generators of Z/nZ is therefore φ(n), as defined in the GCD and φ handout.
2.1.2 U(n)
Aside from cyclic groups (like the additive groups Z/nZ) the most important finite abelian
groups are the multiplicative groups U(n). In some sense, studying these particular groups
tells you everything you need to know about finite abelian groups.
Recall that U(n) a.k.a. (Z/nZ)× is the set of invertible residue classes modulo n. As
promised, we’re going to explain
1. how to tell if a given class is invertible, and
2. how to find the inverse of an invertible class.
Definition 2.1.5 — Fix an integer n. A residue class [a] is invertible if there exists [b]
such that [a][b] = [1].
Example 2.1.6 — In Z/16Z, [5] and [11] are invertible because [5][13] = [65] = [1] and
[11][3] = [33] = [1]. We have [5]−1 = [13] and [11]−1 = [3].
How do we tell apart invertible and non-invertible classes? Let’s start with a simple
criterion for the latter.
Lemma 2.1.7
If [a][b] = [0] but [a], [b] ̸= [0], then neither [a] nor [b] is invertible modulo n.
52
Chapter 2. Families of groups
Proof. Suppose [b] had an inverse, say [c]. Then [b][c] = [1]. But [a][b] = [0] by hypothesis.
Since multiplication is associative,
Example 2.1.8 — Continuing in Z/16Z, neither [4] nor [8] are invertible because
[4][8] = [32] = [0].
As you may have guessed from the last two examples, coprimality with n has something
to do with invertibility.
Lemma 2.1.9
If gcd(a, n) > 1 then [a] is not invertible modulo n.
Proof. Let d be a nontrivial common divisor of a and n. Then the integer b = n/d is not
divisible by n. However, ab = an/d is divisible by n, because the quotient ab/n is the
integer a/d. Thus,
[a][b] = [ab] = [0]
so [a] is not invertible.
Theorem 2.1.10
If gcd(a, n) = 1 then [a] is invertible modulo n.
f : Z/nZ → Z/nZ
[x] 7→ [a][x].
If [a][x] = [a][y], then n divides ax − ay = a(x − y). But since n and a are coprime, that
means n divides x − y, and so [x] = [y]. Thus f is injective.
But! An injection from a finite set to itself must be bijective (by the pigeonhole principle).
In particular, [1] is in the image of f. In other words, there exists some [b] in Z/nZ such
that [a][b] = [1]. That means [a] is invertible!
53
Chapter 2. Families of groups
Second proof of the Theorem. Since a and n are coprime, by “Bézout’s Euclidean algorithm,”
there exist integers s and t such that
as + nt = 1.
as ≡ 1 (mod n)
Example 2.1.11 — Let’s compute the inverse of [123] modulo 1024, if it exists. First,
we run the Euclidean algorithm to compute gcd(1024, 123).
1024 = 8 · 123 + 40
123 = 3 · 40 + 3
40 = 13 · 3 + 1
Thus gcd(1024, 123) = 1, so [123] is invertible modulo 1024. To find its inverse, we
work backwards:
1 = 40 − 13 · 3
= 40 − 13 · (123 − 3 · 40) = 40 · 40 − 13 · 123
= 40 · (1024 − 8 · 123) − 13 · 123 = 40 · 1024 − 333 · 123
With these theorems in hand, it’s now clear that the order of the multiplicative group of
integers modulo n is the totient of n:
o(U(n)) = φ(n).
Example 2.1.12 —
U(7) = {[1], [2], [3], [4], [5], [6]}
and
U(12) = {[1], [5], [7], [11]}.
54
Chapter 2. Families of groups
Proof. By Exercise 57, ao(G) = e in any group G. Thus [a]φ(n) = [1] in U(n).
Remark 2.1.15. Compare this proof with the very long proofs you may have seen in
MAT246 or MAT315!
Example 2.2.1 —
The trivial group is cyclic.
Z = ⟨1⟩ is infinite cyclic.
Z/nZ = ⟨[1]⟩ is finite cyclic of order n.
µn = ⟨e2πi/n ⟩ is also finite cyclic of order n.
Let G be any group and let h ∈ G. Then H = ⟨h⟩ is a cyclic subgroup of G of order
o(g).
⟨2⟩ ≤ Q× is the subgroup of powers of 2:
1 1 1
..., , , , 1, 2, 4, 8, . . .
8 4 2
∗
multiple in additive notation
55
Chapter 2. Families of groups
√ √
⟨1 + 2⟩ ≤ R× is the subgroup of powers of 1 + 2:
√ √ √ √
. . . , 3 − 2 2, −1 + 2, 1, 1 + 2, 3 + 2 2, . . .
√ √
Exercise 63. Prove that every integer power of 1 + 2 has the form a + b 2 for some
integers a and b.
Definition 2.2.2 — A generator for µn is called primitive nth root of unity, while a
(representative of a) generator for U(n) is called a primitive root modulo n.
Example 2.2.3 — Consider some “small” √ groups of roots of unity. In particular, let
(2πi)/8 1 (2πi)/12 1
ζ=e = 2 (1 + i) and ξ = e
√ = 2 ( 3 + i). Then
µ8 = ⟨ζ⟩ = {1, ζ, ζ2 , ζ3 , ζ4 , ζ5 , ζ6 , ζ7 }
= {1, ζ, i, ζ3 , −1, ζ5 , −i, ζ7 }
and
ℑ{z}
i ξ2 ζ
4
ξ ξ
ζ3
ξ5 ℜ{z}
1
−1
ξ11
ξ 7 ζ7
ξ10
ζ5 ξ8 −i
The only nontrivial proper subgroups of µ8 are µ4 = {1, i, −1, −i} and its subgroup
µ2 = {1, −1}. On the other hand, µ12 has several additional subgroups aside from these,
like µ3 = {1, ξ4 , ξ8 } and µ6 = {1, ξ2 , ξ4 , ξ6 , ξ8 , ξ10 }. However,
µ8 ≰ µ12 .
56
Chapter 2. Families of groups
Exercise 64. Show that a complex number ζ can be a primitive nth root of unity for
at most one positive integer n. What is n in terms of ζ?
Here’s the most basic criterion for determining whether a finite group is cyclic.
Proposition 2.2.4
A finite group of order n is cyclic iff it has an element of order n.
Proof. Exercise.
Exercise 65. In the Proposition, why is it necessary that the group be finite?
Example 2.2.5 (A non-cyclic group) — U(8) = {[1], [3], [5], [7]} is not cyclic, as every
element squares to the identity.
Example 2.2.6 (Another non-cyclic group) — The dihedral group Dn has order 2n,
but no element has such large order, because rotations have order at most n, and flips
are involutions. Thus Dn is not cyclic.
Example 2.2.7 (Yet another non-cyclic group) — Fix n > 1. Let G be the set of subsets
of {1, . . . , n} under the △ operation. In Homework 1, you proved G is a group. Today,
we prove G is not cyclic. We have
A2 = A△A = (A \ A) ∪ (A \ A) = ∅ ∪ ∅ = ∅
There is always at least one group of any given finite order n, namely Cn , the cyclic group
of order n. We say “the”, because...
Proposition 2.2.8
Any two cyclic groups of the same order are isomorphic.
57
Chapter 2. Families of groups
Proof. Let G = ⟨g⟩ and H = ⟨h⟩ be cyclic groups of order n and define f : G → H by
f(gi ) = hi .
Before we can say anything about f, we have to check that it’s actually well-defined—that
the proposed value on an element is independent of the exponent. So, if gi = gj then i ≡ j
(mod n), so j = i + mn. Thus
hj = hi+mn = hi (hn )m = hi
Exercise 66. Show that any two infinite cyclic groups are isomorphic.
Lemma 2.2.9
Let G = ⟨g⟩ be a cyclic group and let H ≤ G have finite index k. Then H = ⟨gk ⟩.
Proof. First, we prove there exists some positive integer d such that gd ∈ H.
• If H is trivial, then G must be finite, so we may take d = o(G).
• If H is nontrivial, then H contains some nonidentity element gm (as every element of
G looks like this), so we may take d = |m|.
Next, let d be the least positive integer such that gd ∈ H. We claim gm ∈ H iff d | m
(cf. Division Lemma).
(⇐) If d | m, then gm = (gd )m/d ∈ H because gd ∈ H.
(⇒) If gm ∈ H, then m = qd + r for some 0 ≤ r < d, and gr = gm (gd )−q ∈ H. Since d was
least, we get r = 0, so d | m.
58
Chapter 2. Families of groups
Exercise 67. Exactly one situation is not covered by the Lemma. What is it?
Corollary 2.2.10
Let G = ⟨g⟩ be cyclic. If H1 and H2 have the same index in G, then H1 = H2 .
Example 2.2.12 — Suppose G = ⟨g⟩ has order n. For each d | n, the subgroup ⟨gd ⟩
has order n/d and index d, because
n n
o(⟨gd ⟩) = o(gd ) = =
gcd(n, d) d
and
o(G) n
[G : ⟨gd ⟩] = = = d.
o(⟨gd ⟩) n/d
Example 2.2.13 — Suppose G = ⟨g⟩ has order ∞. For each d ̸= 0, the subgroup ⟨gd ⟩
has order ∞ and index d. What about d = 0?
59
Chapter 2. Families of groups
The Lemma, its Corollary, and the two Examples show that a cyclic group has exactly one
subgroup of every possible index.∗ What’s interesting is that the converse is true (at least in
the finite case). In fact:
Theorem 2.2.14
Let G be a finite group. If G has at most one subgroup of each index (equivalently,
order), then G is cyclic.
We will leave the proof for later. In the meantime, let’s see an application of the FToCG.
Example 2.2.15 — Recall Bézout’s theorem: if gcd(a, b) = d then there exist integers
s and t such that
as + bt = d.
We proved Bézout’s theorem by way of the Extended Euclidean algorithm. Using the
fact that every subgroup of Z is cyclic, we can give another proof of Bézout’s theorem.
Consider ⟨a, b⟩ which is the subgroup of Z generated by a and b; its elements are
integer combinations of a and b.
Since Z = ⟨1⟩ is cyclic, its subgroup ⟨a, b⟩ must be cyclic as well, so
⟨a, b⟩ = ⟨d⟩
for some integer d (a “power” of the generator 1). Since ⟨d⟩ = ⟨−d⟩, we may take d to
be positive.
Now, since a, b ∈ ⟨d⟩ we have d | a and d | b. To prove that d = gcd(a, b) we use
the other inclusion: d ∈ ⟨a, b⟩ so there exist integers n and m such that d = an + bm.
Then if c | a and c | b then c | an + bm. In (other) words, if c divides a and b then c
divides every integer combination of a and b. In particular, every common divisor of
a and b divides d. Since d divides a and b, d is the greatest common divisor of a and
b.
∗
Gallian calls this result the Fundamental Theorem of Cyclic Groups, but he only proves it for finite cyclic
groups, and states it in terms of orders instead of indices.
60
Chapter 2. Families of groups
Remark 2.2.17. We will prove only the “if” direction, leaving the “only if” for PS3.
Proof. U(2) is the trivial group, which is cyclic; and U(4) = {[1], [3]} = ⟨[3]⟩ is cyclic, too.
Let p be an odd prime. We will postpone the cyclicity of U(p) to a homework problem.
For now, let’s assume this fact.
Next up, let k > 1. To show U(pk ) is cyclic, we will take the product of two elements of
orders pk−1 and p − 1 to obtain an element of order pk−1 (p − 1) = o(U(pk )).
The first element is [p + 1]. Observe that, for any n ≥ 1, the binomial theorem says
n n n n−1 n n−2 n
(p + 1) = p + p + p ··· + p+1 (*)
1 2 n−1
where
n n! n(n − 1)(n − 2) . . . (n − i + 1)
= = .
i i!(n − i)! i(i − 1)(i − 2) . . . 1
Choosing n = pk−1 , we see
that each term in (*)—except the last one—is divisible by p
k
pm pm pm −1
[because i = i i−1 .] Thus
k−1
[p + 1]p = [1] in U(pk )
Finally, for U(2pk ), let g be an odd primitive root modulo pk . [This is possible precisely
because p is odd—the class of g modulo pk therefore contains representatives of both
parities.] We claim U(2pk ) = ⟨[g]⟩. Indeed, gcd(g, 2pk ) = 1 because gcd(g, pk ) = 1 and
gcd(g, 2) = 1, so [g] is invertible. Note that the elements of ⟨[g]⟩ are distinct modulo 2pk
because the same is true modulo pk . Since φ(2pk ) = φ(pk ), we have ⟨[g]⟩ = U(2pk ).
61
Chapter 2. Families of groups
Exercise 70. Show that U(2k ) is not cyclic for k ≥ 3 by exhibiting “too many” sub-
groups of order 2.
f fr
f fr
f fr
62
Chapter 2. Families of groups
Equivalent definition
While the “two-involutions” definition is very useful theoretically, the “geometric” defi-
nition is very useful in practice. The following Proposition describes how to canonically
switch back and forth between these two perspectives.
Proposition 2.3.2
Let G be a dihedral group with generating involutions a and b. Then G is generated
by the flip f = a and the rotation r = fb, which are distinct and satisfy frf = r−1 .
Conversely, if a group G is generated by two distinct elements f and r such that
o(f) = 2 and frf = r−1 , then G is dihedral with generating involutions f and fr.
Moreover, passing from one description the another and back again leaves us where
we started.
Proof. We just showed the backward direction. For the forward, we have to show that if
f = a and r = ab then G = ⟨f, r⟩ and f ̸= r and frf = r−1 .
Plainly,
⟨a, ab⟩ ≤ ⟨a, b⟩;
and since a(ab) = b, we have b ∈ ⟨a, ab⟩, so
and
(f, r) −→ (f, fr) −→ (f, ffr) = (f, r).
63
Chapter 2. Families of groups
The identity
frf = r−1
is called the fundamental relation between flip and rotation.
f r f
r−1
rf = fr−1 .
It follows that
rk f = fr−k
for all integers k. This enables us to simplify long strings of r’s and f’s—we can move any
rotation past f, at the cost of inverting it.
Theorem 2.3.3
If o(r) is finite, then o(G) = 2o(r).
Proof. Let n = o(r). The fundamental relation between flip and rotation allows us to write
every element of G as either rk (a rotation) or frk (a flip) for some integer k. Thus
which means o(G) ≤ 2n. (We don’t know if these are all distinct—the list could contain
repeats.)
For i, j between 0 and n − 1, we have
which can only happen if i = j (since 0 ≤ i, j < n). Thus, in our list above, the flips are
distinct from each other, and the rotations are distinct from each other.
Yet are flips and rotations distinct from each other? We know that f ̸= r, but what about fri
and rj ? We have
fri = rj ⇐⇒ f = rj−i ⇐⇒ f ∈ ⟨r⟩.
So to prove that fri ̸= rj , we just have to show that f ̸∈ ⟨r⟩.
Suppose to the contrary that f ∈ ⟨r⟩. Then f commutes with r, so rf = fr. But, by the
fundamental relation, rf = fr−1 . Together, these imply r = r−1 , so r2 = e. Thus r has order
1 or 2. Since f ∈ ⟨r⟩ has order 2, r must have order 2. However, the 2-element cyclic group
⟨r⟩ can only have one element of order 2. Therefore f = r, a contradiction.
We use the notation Dn to denote the dihedral group with a rotation of order n. By the
theorem, o(Dn ) = 2n.
Just like with cyclic groups, we also speak of Dn as the dihedral group of order 2n, because
there is only ever one such group:
Proposition 2.3.5
Any two dihedral groups of the same order are isomorphic.
Not-really-dihedral groups
While it’s difficult to draw regular n-gons for n = 1 and 2, that hasn’t stopped crazy
mathematicians group theorists from talking about their symmetry groups, D1 and D2 .
Proposition 2.3.6
Dn is abelian iff n = 1 or 2.
65
Chapter 2. Families of groups
Proof. If Dn is abelian, then r and f commute; conversely, if r and f commute, then so does
every pair of elements in the group they generate.
As we saw at the end of the proof of Theorem 2.3.3, fr = rf is equivalent to r2 = e, which
is equivalent to n = 1 or 2.
These two “smallest” dihedral groups are so boring and so unlike their non-abelian bigger
siblings that for the rest of this class we will assume n ≥ 3.
Theorem 2.3.7
Let G be a dihedral group and let H ≤ G. Then H is either
c) cyclic: ⟨rk ⟩ where k = 12 [G : H], or
d) dihedral: ⟨rk , fri ⟩ where k = [G : H] and 0 ≤ i < k.
We will explore the proof of this (at least in the finite dihedral case) in Problem Set 3.
Example 2.3.8 — The subgroup of all rotations in Dn is ⟨r⟩. Its order is n, so its index
is 2. Subgroups of the rotation subgroup are cyclic subgroups of the whole group.
66
Chapter 2. Families of groups
Exercise 75. Show that Z(G) ≤ G, with equality if and only if G is abelian.
Example 2.3.11 (Z(Dn )) — Let ri be a rotation. Since rotations commute with rota-
tions, let frj be a flip. Then
(ri )(frj ) = frj−i
whereas
(frj )(ri ) = frj+i .
These are equal if and only if r−i = ri , i.e. ri is an involution. Geometrically, the only
involutive rotation is the 180-degree turn, possibly only when n is even. Algebraically,
r−i = ri implies i ≡ −i (mod n), and if n | 2i then n must be even and i must be n/2.
Either way, rn/2 is the only rotation that commutes with a flip. In particular, no flip
commutes with r when n > 2, so we conclude
⟨e⟩ if n is odd
Z(Dn ) =
⟨rn/2 ⟩ if n is even.
When a group’s center is trivial—like Dn ’s is when n is odd—we say the group is centerless.
" #
a b
Example 2.3.12 (Z(GL2 (R))) — Let A = ∈ Z(GL2 (R)). Since A commutes
c d
" #
0 1
with every invertible matrix, A must commute with the particular matrix B =
1 0
67
Chapter 2. Families of groups
with a ̸= 0. After verifying that these so-called “scalar matrices” do commute with
everything else, we conclude that Z(GL2 (R)) = {aI : a ̸= 0}.
68
Chapter 2. Families of groups
A convenient way to encode this information about Dn is to write down its presentation:
G = ⟨S | R⟩
Exercise 78. Suppose G = ⟨g⟩ is cyclic of order n. Write down a presentation for G.
Well, it doesn’t quite make sense to say “r∞ = e”. But it does make sense to say o(r) = ∞,
which means r satisfies no nontrivial relation of the form rn = e. The other relations—
f2 = e and frf = r−1 —have no dependence on n, so they remain. It seems reasonable,
therefore, to define the infinite dihedral group as
Proposition 2.3.13
The group D∞ is a dihedral group of infinite order.
69
Chapter 2. Families of groups
To interpret D∞ geometrically, consider this sequence of nested n-gons, depicted here for
3 ≤ n ≤ 12:
Pointwise, as n → ∞, the sides of the n-gons converge to the black line, and the vertices
converge to the black dots. The rotational symmetry becomes a translational symmetry—
we can shift the black line, left or right, any integer number of side-lengths. The vertical
reflection symmetry remains present all the while—we can flip the “∞-gon” across the
middle, or indeed across the perpendicular to any vertex or midpoint.
70
Chapter 2. Families of groups
is infinite dihedral.
SX .
∗
i.e. bijective self-maps
71
Chapter 2. Families of groups
Example 2.4.4 — The identity map ϵ fixes everything, and is the only element in any
symmetric group to do so.
Example (Example 1.5.13) — SX ↷ X for any set X in the obvious way: σ . x = σ(x).
In particular, Sn acts on {1, ..., n}.
What are the orbits of this action?
Proof. Let X = {x1 , . . . , xn }. To write down a permutation of X, we just have to fill in the
bottom row of !
x1 x2 x3 x4 x5
in such a way that each xi appears exactly once (here, n = 5 for concreteness).
Clearly, there are n possible things to put in the first column (under x1 ). Write something
down, like x3 : !
x1 x2 x3 x4 x5
x3
72
Chapter 2. Families of groups
Every permutation of X can be obtained this way. At each stage, the number of choices de-
creases by 1, from n at the start to 1 at the end, so there are n(n − 1) . . . 1 = n! permutations
altogether.
n 1 2 3 4 5 6 7 8 9 10
n! 1 2 6 24 120 720 5040 40320 362880 3628800
Exercise 80. Let X be an infinite set. Show that Sym(X) is infinite. [Hint: Let
x1 , x2 , x3 , . . . be infinitely many distinct elements of X and let x0 be another; define
σi : X → X to do nothing except swap x0 and xi .]
Perhaps a more intuitive way to compose permutations in two-line notation is to (i) stack
them vertically and (ii) rearrange the columns so that the top one’s bottom line matches
the bottom one’s top line; then (iii) the composite permutation is just the mapping defined
73
Chapter 2. Families of groups
The inverse of a permutation in two-line notation is obtained by just (i) interchanging the
lines and (ii) sorting the columns so the top line matches the original.
!−1 ! !
a b c x y z c x a y z b a b c x y z
= =
c x a y z b a b c x y z c z a b x y
The order of a permutation is its order as an element of the symmetric group it lives in.
2.4.4 Examples
Example 2.4.6 — Let X = {1, 2, 3, 4}. The functions σ, τ, and ϵ, defined for all i by
σ(i) = 5 − i, τ(i) ≡ i − 1 (mod 4), and ϵ(i) = i, are permutations of X, hence elements
of S4 . In two-line notation, they are written
! !
1 2 3 4 1 2 3 4
σ= τ=
4 3 2 1 4 1 2 3
!
1 2 3 4
ϵ=
1 2 3 4
74
Chapter 2. Families of groups
can be expressed as
!
. . . −2 −1 0 1 2 3 4 5 ...
g=
. . . −4 1 −2 3 0 5 2 7 ...
1
5 2
4 3
75
Chapter 2. Families of groups
· 1 2 3 4 5 6
1 1 2 3 4 5 6
2 2 4 6 1 3 5
3 3 6 2 5 1 4
4 4 1 5 2 6 3
5 5 3 1 6 4 2
6 6 5 4 3 2 1
Each row can be interpreted as a permutation of the first row, so that e.g. 2 and 3 define
the permutations
! !
1 2 3 4 5 6 1 2 3 4 5 6
and
2 4 6 1 3 5 3 6 2 5 1 4
Proof. [This was previously covered in Tutorial 3 Question 2, but we include the proof for
completeness’ sake.]
Fix a group G. For every g ∈ G, we call the map in Example 2.4.9
fg : G → G, fg (x) = gx.
Tutorial 3 Question 2b gives a proof this is a bijection, but knowing G is a group, the proof
could be simplified greatly:
76
Chapter 2. Families of groups
This is an injection because fg = fh only if fg (e) = g = h = fh (e). Thus we may restrict the
codomain to F(G), and automatically get that F is a bijection G → F(G).
Now for any g, h, x ∈ G, we also have
· g1 g2 g3 g4
g1 g1 g2 g3 g4
g2 g2 g1 g4 g3
g3 g3 g4 g1 g2
g4 g4 g3 g2 g1
Then our four permutations are
! !
1 2 3 4 1 2 3 4
σ1 = ϵ = , σ2 = ,
1 2 3 4 2 1 4 3
! !
1 2 3 4 1 2 3 4
σ3 = , σ4 =
3 4 1 2 4 3 2 1
2.4.6 Cycles
What do the following permutations (from above) have in common?
! ! !
1 2 3 4 1 2 3 4 5 1 2 3 4 5 6
τ= , ρ= , and “3” =
4 1 2 3 2 3 4 5 1 3 6 2 5 1 4
77
Chapter 2. Families of groups
If we write i → j to mean σ(i) = j then we can visualize the action of each of these
permutations on their respective letter-sets:
τ : 1 → 4 → 3 → 2 → 1 → ...
ρ : 1 → 2 → 3 → 4 → 5 → 1 → ...
“3” : 1 → 3 → 2 → 6 → 4 → 5 → 1 → . . .
All the elements are in one loop! Permutations which act cyclically like this have a special
name.
Definition 2.4.12 — Let k ≥ 2. A k-cycle, denoted (a1 a2 . . . ak ) where the ai ’s are
distinct, is a permutation that moves ai to ai+1 for 1 ≤ i < k, moves ak to a1 , and fixes
everything else.
An infinite cycle is a permutation denoted
(. . . a−1 a0 a1 a2 . . . )
“3” = (1 3 2 6 4 5)
Remark 2.4.13. Cycles can be written in multiple ways, namely by “rotating” their innards:
(a1 a2 . . . ak ) = (a2 a3 . . . ak a1 ) = . . . = (ak a1 . . . ak−1 ).
These are all equally valid, but if it makes sense to do so, we prefer to write the smallest
letter first.
Exercise 81. Show that σ is a transposition iff σ is a 2-cycle.
Theorem 2.4.14
k-cycles have order k.
Proof. Let σ = (a1 a2 . . . ak ) be a k-cycle. Then σk is the identity map, because it “moves”
each ai to ai and fixes everything else. But if 1 < i < k, then σi moves a1 to ai+1 , so σi is
not the identity map. Thus o(σ) = k.
78
Chapter 2. Families of groups
Example 2.4.16 — The composition of any cycle with its inverse is the identity map,
which is not a cycle.
(1 2)(3 4)
is the permutation !
1 2 3 4
2 1 4 3
79
Chapter 2. Families of groups
(1 2 3)(2 3 4 5)
is the permutation !
1 2 3 4 5
2 1 4 5 3
which is not a cycle.
is not a cycle. We see that σ swaps 1 and 2 and permutes 3, 4, and 5 cyclically. A cycle is
supposed to move things in one loop, but σ moves things in two loops. So although σ isn’t
a cycle, it is a product of the cycles
(1 2) and (3 4 5)
in the sense that
σ = (1 2)(3 4 5) = (3 4 5)(1 2).
Definition 2.4.19 — Two permutations are disjoint if they do not move any of the
same elements. In other words, σ and τ are disjoint if for all x in X, σ(x) ̸= x implies
τ(x) = x.
Example 2.4.20 — The permutations (1 2) and (3 4) are disjoint, while the permuta-
tions (1 2) and (2 3) are not.
Observe that (1 2)(3 4) = (3 4)(1 2) are the same permutation. On the other hand,
!
1 2 3
(1 2)(2 3) = = (1 2 3)
2 3 1
while !
1 2 3
(2 3)(1 2) = = (1 3 2)
3 1 2
so (1 2)(2 3) ̸= (2 3)(1 2) are not the same permutation.
Lemma 2.4.21
Disjoint permutations commute.
Case 3: σ(x) = x and τ(x) ̸= x. This is just Case 2 with the roles of σ and τ reversed.
Case 4: σ(x) ̸= x and τ(x) ̸= x. Impossible, because σ and τ are disjoint!
With this terminology in hand, we can now state the most important theorem on permuta-
tions:
Proposition 2.4.23
Let X be a set and let σ ∈ SX . Define x ∼ y iff σk (x) = y for some integer k. Then ∼ is
an equivalence relation on X.
σk . x = σk (x).
Now when σ is a permutation, we claim that ∼ and ∼0 define the same relation.
If x ∼ y, we have σk (x) = y = σ0 (y), so x ∼0 y.
82
Chapter 2. Families of groups
The orbit of a particular x ∈ X under this action is denoted ⟨σ⟩x. We call this the orbit of x
under σ∗ , and the set of all orbits the orbits of σ.
Lemma 2.4.24
Let σ ∈ SX and let x ∈ X. Then
Example 2.4.25 — The identity permutation ϵ is the empty product. It generates the
trivial group, which acts on X by
ϵ . x = x.
The orbits of ϵ are the singleton subsets ⟨ϵ⟩x = {x}.
1 7→ 2 7→ 3 7→ 4 7→ 5 7→ 1, 6 7→ 6, 7 7→ 7.
∗
Instead of “⟨σ⟩”.
83
Chapter 2. Families of groups
(1 2 4)(3 6 5)
(1 3 2 6 4 5).
Example 2.4.29 — On N,
n + 1 n is odd
f(n) =
n − 1 n is even
!
1 2 3 4 5 6 7 8 9 ...
=
2 1 4 3 6 5 8 7 10 . . .
= (1 2)(3 4)(5 6)(7 8) . . .
n
be the set of all 2n th roots of unity, i.e. ζ in C such that ζ2 = 1 for some n ≥ 0. Then
κ(z) = z3 is a permutation of µ2∞ . [Exercise.] Let ζ = e2πi/8 and ξ = e2πi/16 be primitive
8th and 16th roots of unity, respectively. Then
!
1 −1 i −i ζ ζ3 ζ5 ζ7 ξ . . .
κ=
1 −1 −i i ζ3 ζ ζ7 ζ5 ξ3 . . .
84
Chapter 2. Families of groups
is the product of infinitely many finite cycles of increasing length. [The precise cycle
structure of κ is related to the order of [3] in U(2n ).]
Exercise 84. Show that σ fixes x iff ⟨σ⟩x = {x}. Such orbits are called trivial.
Definition 2.4.31 — A cycle is a permutation with just one nontrivial orbit. If that
orbit has size l, the cycle is called an l-cycle and l is called the length.
Remark 2.4.32. ∞-cycles are also called chains because they aren’t really “cycles” in the
literal sense.∗
Remark 2.4.33. Trivial orbits, i.e. orbits of size 1, a.k.a. fixed points, are also called “1-cycles”
(another oxymoron).
We now have everything we need to properly state and prove the Cycle Decomposition
Theorem.
Remark 2.4.35. Note that the product notation makes sense because disjoint cycles com-
mute.
∗
But if we’re being picky, isn’t the term “infinite cyclic group” literal nonsense as well?
85
Chapter 2. Families of groups
Proof. Pick one xi from each nontrivial orbit of σ and define σi : X → X by the formula
σ(x) if x ∈ ⟨σ⟩xi
σi (x) =
x otherwise.
You should check that each σi is a bijection (one-to-one and onto). In fact, each σi is a cycle,
because ⟨σ⟩xi is its only nontrivial orbit.
This identity also shows that σi and σj are disjoint when i and j are distinct, because
i ̸= j ⇐⇒ xi ̸= xj ⇐⇒ xi ̸∼ xj ⇐⇒ ⟨σ⟩xi ∩ ⟨σ⟩xj = ∅.
To prove that Y
σi = σ
i
we evaluate both sides at an arbitrary x in X. So, let x ∈ X.
• If σ fixes x, then the σ-orbit of x is trivial, so x ̸∼ xi for any i. Thus σi (x) = x for all i,
and we have Y
σi (x) = x = σ(x).
i
• If σ moves x, then the σ-orbit of x is nontrivial, so x ∼ xi0 for some i0 . Since the xi ’s
are one per orbit, x ̸∼ xi for any other i ̸= i0 . Thus σi0 (x) = σ(x), while σi (x) = x.
Since disjoint permutations commute [and since composition is associative], we have
Y Y
σi (x) = σi0 σi (x) = σi0 (x) = σ(x).
i i̸=i0
Q
Thus σ = i σi is the desired decomposition of σ as a product of disjoint cycles.
Q
To show uniqueness, suppose σ = j τj where τj ∈ SX are pairwise disjoint cycles. Since
Q i ) ̸= xi , there must exist a unique j = ψ(i) such that τj (xi ) ̸= xi . [Exists because
σ(x
j τj = σ; unique because τj are Q disjoint.] Since τj is a cycle, it has only one nontrivial
class, which must be ⟨τj ⟩xi . Since j τj = σ, it follows that τj (x) = σ(x) for all x ∼ xi . But
σ(x) = σi (x) for these same x! Therefore, τj = σi . The desired bijection is then φ = ψ−1 .
Remark 2.4.37. When X is finite, say of order n, cycle types are written as n-tuples
(c1 , c2 , c3 , . . . , cn ). When X is infinite, we separate c∞ from the rest of the list with a
semicolon: (c1 , c2 , c3 , . . . ; c∞ ).
86
Chapter 2. Families of groups
“2” = (1 2 4)(3 6 5)
and
“3” = (1 3 2 6 4 5)
in Sym(U(7)) are (0, 0, 2, 0, 0, 0) and (0, 0, 0, 0, 0, 1), respectively.
on Z is (0, 0, 0, . . . ; 2).
87
Chapter 2. Families of groups
Proposition 2.4.42
When X is finite, then so is every orbit, and we have
X
lcl = n.
l≥1
Proposition 2.4.43
Let G be a group and let a, b ∈ G. If a and b commute and have finite order, then
Theorem 2.4.45
If σ and τ are disjoint permutations of finite order, then
Proof. Write n = o(σ) and m = o(τ). By the Proposition, we just have to prove that
lcm(n, m) ≤ o(στ).
88
Chapter 2. Families of groups
Corollary 2.4.46
Suppose σ has cycle type (c1 , c2 , . . . ; c∞ ). If c∞ = 0 and cl = 0 for all but finitely many
l, then
o(σ) = lcm{l : cl > 0}.
where the σl,i are pairwise disjoint cycles of lengths l. By the preceding Theorem, the order
of σ is
Example 2.4.47 —
o (1 2)(3 4 5) = lcm(2, 3) = 6
and
o (1 4 3)(2 7)(5 8) = lcm{3, 2, 2} = 6.
Example 2.4.48 —
89
Chapter 2. Families of groups
Exercise 90. Suppose σ has cycle type (c1 , c2 , . . . ; c∞ ). Show that o(σ) = ∞ iff c∞ > 0
or cl ̸= 0 for infinitely many l.
Proposition 2.4.49
Any k-cycle (a1 . . . ak ) can be written as a product of transpositions
Proof. It suffices to check the right hand side: If i ̸= aj for some j, the right hand fixes i. If
i = aj for j ̸= k, then the right hand side sends it to
aj 7→ aj+1 .
If i = ak , the right hand side sends it to
ak 7→ ak−1 7→ · · · 7→ a1 .
So the right hand side evaluates to the cycle (a1 . . . ak ).
Corollary 2.4.50
Any permutation in Sn can be written as a product of transpositions. That is, Sn is
generated by the transpositions in Sn .
Proof. Exercise.
Proposition 2.4.51
Sn is generated by the transpositions
90
Chapter 2. Families of groups
Theorem 2.4.52
Sn is generated by the n-cycle σ = (1 2 . . . n) and the transposition τ = (1 2).
Induction tells us σ and τ generate all transpositions in Proposition 2.4.51, so they together
generate Sn .
Exercise 91. Give an example of an n-cycle and a transposition that do not together
generate Sn .
You can find a number of other generating sets of Sn , but these—the set of all transpositions,
the set of adjacent transpositions, and the set {(1 2 . . . n), (1 2)}—are the most commonly
seen.
Exercise 92. For n ≥ 3, show that Sn must be generated by at least two elements.
[That is, Sn is not cyclic.]
91
Chapter 3
3.1.1 Recall...
Let H be a subgroup of G.
A left coset of H in G is a subset of the A right coset of H in G is a subset of the
form form
aH = {ah : h ∈ H} Ha = {ha : h ∈ H}
where a ∈ G. where a ∈ G.
Left cosets are precisely the equivalence classes of the left coset relation† : a ∼ b iff b−1 a ∈ H
iff aH = bH. The set of equivalence classes (a.k.a. left cosets) is denoted G/H.
Exercise 93. The space of right cosets of H in G is denoted H\G. What is the analogous
equivalence relation, Ha = Hb iff ... ∈ H?
Remark 3.1.1. When we talk about “cosets” without specifying left or right, we always
mean left cosets.
The significance of these relations being equivalence relations is that two cosets (of the same
type) are either identical or disjoint—cosets cannot partially overlap.
†
Originating from the orbits of the right action of H on G by left multiplication.
92
Chapter 3. Quotients and Morphisms
Finally, we record the trivial but highly useful observation that aH = H if (and only if)
a ∈ H. In particular, eH = H.
The notation for the coset space sometimes produces familiar objects.
Z/nZ!
Example 3.1.3 — Let G = Z and H = 10Z. Consider the cosets 2 + 10Z and 3 + 10Z.
We have
which looks just like the addition [2] + [3] = [5] in Z/10Z.
Example 3.1.4 — Let G = R and H = Z. Consider the cosets −0.7 + Z and 2.5 + Z.
We have
∗
After all, we already interpret aH that way.
93
Chapter 3. Quotients and Morphisms
which looks just like the addition {−0.7} + {2.5} = {1.8} in R/Z.
Notice that, in both examples, the sum of two cosets is again a coset: the “coset of the
sum”.
√3.1.5√— Let G = R and H = ⟨−1⟩. Consider the cosets πH = {π, −π} and
×
Example
√
2H = { 2, − 2}. We have
√ √ √
(πH)( 2H) = {π, −π}{ 2, − 2}
√ √
= {π 2, −π 2}
√
= π 2H.
Example 3.1.6 — Let G = R× and H = (0, ∞). Consider the cosets 1H = H and
−1H = (−∞, 0). We have
Notice that, in both examples, the product of two cosets is again a coset—the “coset of the
product”.
and again we see that the product of cosets is the “coset of the product”.
Our next and final example serves to dispel the hasty conclusion that multiplying cosets
always works out.
94
Chapter 3. Quotients and Morphisms
Example 3.1.8 — Let G = Dn for n > 2 and H = ⟨f⟩ = {e, f}. Consider the cosets
rH = {r, rf} = {r, fr−1 } and frH = {fr, frf} = {fr, r−1 }. We have
Lemma 3.1.9
Suppose (aH)(bH) = cH. Then c ∼ ab, i.e., cH = abH.
Proof. Certainly a ∈ aH and b ∈ bH, so ab ∈ (aH)(bH). But if (aH)(bH) = cH, then there
exists h in H such that ab = ch, in which case cH = chH = abH.
In other words, if the product of cosets, (aH)(bH), is a coset of H, then it must be the “coset
of the product”, abH. So, when do we have the equality (aH)(bH) = abH?
Lemma 3.1.10
If b−1 hb ∈ H for all h in H, then (aH)(bH) = abH.
Proof. We always have (aH)(bH) ⊇ abH because abh = (ae)(bh) for all h ∈ H. Therefore,
the equality (aH)(bH) = abH holds if and only if
95
Chapter 3. Quotients and Morphisms
Theorem 3.1.13
Let H ⊴ G. Then G/H is a group under coset multiplication, called the quotient group of
G by H.
Proof. By the Lemmas, coset multiplication is a binary operation on G/H. It’s associative
because
(aH)(bH) (cH) = (abH)(cH) = (ab)cH
= a(bc)H = (aH)(bcH) = (aH) (bH)(cH)
so it’s a composition law. The identity is H itself: for all aH in G/H,
(aH)H = aH;
and the inverse of aH is a−1 H, because
(aH)(a−1 H) = aa−1 H = eH = H.
96
Chapter 3. Quotients and Morphisms
Since H is always clear from context∗ it’s common to write a, instead of aH, for a typical
element of the quotient group G/H.
Taking a quotient is kind of like ignoring the information encoded by the subgroup. In
Z/10Z, we’re ignoring everything but the ones digits. In R/Z, we’re ignoring the integer
parts and just focusing on the fractional parts. In R× /⟨−1⟩, we’re ignoring signs: −x = x
for all x. And in R× /(0, ∞), we’re ignoring magnitude: 10100 = 10−100 = 1.
o(D6 ) 12
o(D6 /⟨r2 ⟩) = [D6 : ⟨r2 ⟩] = = =4
o(⟨r ⟩)
2 3
∗
It’s right there in the notation G/H!
97
Chapter 3. Quotients and Morphisms
Taking the quotient of a finite group by a (proper) nontrivial normal subgroup always
yields a smaller (nontrivial) group. Although quotients may be “slippery” to work with,
they are, in a certain sense, simpler.
Proposition 3.1.18
The normal subgroups of Dn are Dn , every subgroup of ⟨r⟩, and, just when n is even,
∼ 1, Dn /⟨rk ⟩ =
⟨r2 , f⟩ and ⟨r2 , fr⟩. The corresponding quotients are Dn /Dn = ∼ Dk for each
∼ ∼
k | n, and Dn /⟨r , f⟩ = Dn /⟨r , fr⟩ = C2 .
2 2
98
Chapter 3. Quotients and Morphisms
Example 3.1.19 —
V = ⟨(1 2)(3 4), (1 3)(2 4)⟩ = {ϵ, (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}
Exercise 99 (PS1XQ3b). Show that every proper subgroup of the quaternion sub-
group Q = ⟨i, j, k⟩ is normal in Q.∗ What are the possible quotients of Q?
∗
Recall that i, j, k are the unit quaternions satisfying i2 = j2 = k2 = ijk = −1.
Example 3.1.21 — Automorphisms of the form cg (x) = gxg−1 are called inner; auto-
morphisms not of this form are called outer.∗ Inner automorphisms constitute a normal
subgroup of Aut(G) [Why?]; the quotient is denoted Out(G).
∗
Some jokesters like to abbreviate “outer automorphism” to “outermorphism”.
99
Chapter 3. Quotients and Morphisms
1. Right cosets?
Let’s start with 1.
Had we worked with right cosets instead of left ones, Lemma 2 would have said (Ha)(Hb) =
Hab if aha−1 ∈ H for all h in H, because
h1 ah2 b = h1 ah2 a−1 ab = (h1 )(ah2 a−1 ) ab.
| {z }
∈H
And in order for the multiplication to have worked for all cosets, we would have required
H to satisfy
aha−1 ∈ H for all h in H, for all a in G
or, more succinctly,
aHa−1 ⊆ H for all a in G.
This is equivalent to normality as previously defined, because (a−1 )−1 = a.
So, using right cosets instead of left ones doesn’t change the required “normality” condition.
But, does it change the group?
Exercise 102. Show that the converse holds, namely, that if Ha = aH for all a in G,
then H is normal.
So, using right cosets instead of left ones doesn’t change the group elements. And it doesn’t
change the group operation, either, as the following Corollary shows.
Corollary 3.1.23
If H ⊴ G, then (aH)(bH) = (Ha)(Hb) for all a, b in G.
2. Normality—needed?
Next, 2. Is normality necessary? Well, if we want the product of any two cosets to be a
coset, then H has to be normal.
Lemma 3.1.24
Let H ≤ G. If (aH)(bH) ∈ G/H for all a, b in G, then H ⊴ G.
Proof. We know from last lecture that if (aH)(bH) is a coset, then it must be the coset abH.
In particular, for all a in G,
(aH)(a−1 H) = aa−1 H = eH = H.
3. Normality—what?
Finally, 3. What’s normality all about?
The key idea is conjugacy.
In general, any expression involving g on one side and g−1 on the other is an instance of
conjugation. For example, x and gxg−1 (as well as g−1 xg) are conjugate elements. Similarly,
H and gHg−1 are conjugate subgroups.
Informally, conjugacy can be seen as a “change in perspective”, and, loosely speaking,
normal subgroups are the ones which look the same from all “points of view”.
γ−1 σ γ
2 7−→ 4 7−→ 5 7−→ 1
101
Chapter 3. Quotients and Morphisms
which looks exactly like σ = (1 2 3)(4 5) but with the letters replaced by their images
under γ.
Proposition 3.1.26
For all k-cycles and all γ’s,
Example 3.1.27 — In Dn ,
frf−1 = frf = r−1 .
In other words, observing a clockwise rotation from behind makes it look like a
counterclockwise rotation.
Similarly,
rfr−1 = fr−2 ,
which means that observing a flip with your head tilted makes it look like a flip over a
tilted axis.
102
Chapter 3. Quotients and Morphisms
Example 3.1.28 (Change of Basis) — Let V be a vector space with two bases v1 , . . . , vn
and w1 , . . . , wn . Every vector x in V can be uniquely written as a linear combination of
basis vectors:
x = a 1 v1 + a 2 v2 + · · · + a n vn
where a1 , . . . , an are scalars. In other words, x corresponds to the vector (a1 . . . an )T
with respect to the basis v1 , . . . , vn . To express x in terms of the basis w1 , . . . , wn , we
first express each vi :
X
n
vi = bij wj
j=1
Then
X X X X X X
n n n n n
! n
x= a i vi = ai bij wj = ai bij wj =: aj′ wj
i=1 i=1 j=1 j=1 i=1 j=1
so that x corresponds to the vector (a1′ . . . an′ )T with respect to the basis w1 , . . . , wn .
Thus, “change of coordinates” is given by the formula
a1′ b11 . . . bn1 a1
. . ... .. ..
.. = ..
. .
′
an b1n . . . bnn an
103
Chapter 3. Quotients and Morphisms
For example, if we wish to go from the “default” basis v1 = (1, 0), v2 = (0, 1) to the
“custom” basis w1 = (1, −1), w2 = (1, 2), we can use the change of basis matrix:
" #
1 2 −1
B= .
3 1 1
The duck’s beak lies at the tip of the vector 4v1 + 2v2 , which is (4 2)T w.r.t. “default”
coordinates. Where’s the duck’s beak w.r.t. “custom” coordinates?
" # ! ! !
1 2 −1 4 1 8−2 2
= =
3 1 1 2 3 4+2 2
Indeed, one can check directly that 4v1 + 2v2 = 2w1 + 2w2 . So far, so good.
Now imagine we have a linear operator∗ T on V, given by some matrix Told w.r.t. the
first basis v1 , . . . , vn . To represent T completely in terms of the second basis w1 , . . . , wn ,
we need a matrix Tnew which
(1) accepts input in new coordinates,
(2) changes it back to old coordinates,
(3) performs the operation of Told , and
(4) returns an answer in new coordinates.
In other words,
Tnew = BTold B−1
is the conjugate of Told by B.
104
Chapter 3. Quotients and Morphisms
For example, let T be a 90-degree counterclockwise rotation. In the “default” basis, the
matrix of T is " # " #
cos π2 −sin π2 0 −1
Told = = .
sin π2 cos π2 1 0
We can express T entirely in the “custom” basis by conjugating Told by B:
" #" #" # " #
1 2 −1 0 −1 1 1 1 1 −5
Tnew = BTold B−1 = = .
3 1 1 1 0 −1 2 3 2 −1
∗
“Linear operator” is to “endomorphism” as “linear transformation” is to “homomorphism”.
Thus, revolving the duck about the origin moves the tip of its beak to
" # ! ! !
1 1 −5 2 1 2 − 10 −8/3
= = .
3 2 −1 2 3 4−2 2/3
Conjugacy as automorphism
For each g in G, the function cg (x) = gxg−1 is an automorphism of G. [This is PS2Q2.]
If G is abelian, then cg (x) = gxg−1 = gg−1 x = ex = x for all x in G, so each cg is the
identity map—conjugation doesn’t do anything. So, conjugation is only interesting in the
nonabelian setting.
Like any automorphism, conjugation moves subgroups to other subgroups. [Why?] Since
every subgroup contains e, the distinct conjugates of a given subgroup always overlap.∗
∗
Do not confuse conjugates and cosets. The distinct cosets of a given subgroup never overlap.
105
Chapter 3. Quotients and Morphisms
r f fr3
r f r 2 r2
4
r r5
e e r3
fr fr2 fr fr5
fr4 fr2
r f fr
e
2
r fr2
r3 fr3
The next Proposition proves that the final picture is accurate—normal subgroups are
self-conjugate.
Proposition 3.1.29
Let H ⊴ G. Then gHg−1 = H for all g in G.
Exercise 103. Let H ≤ G and let f ∈ Aut(G) such that f(H) ≤ H. Is it necessarily true
that f(H) = H?
106
Chapter 3. Quotients and Morphisms
Theorem 3.1.30
Let G be a group and let H ≤ G. The following are equivalent.
1. H ⊴ G.
2. gH = Hg for all g in G.
3. G/H is a group under coset multiplication.
4. H is self-conjugate: gHg−1 = H for all g in G.
107