REPEATED STRATEGIC GAME
Consider the prisoner’s dilemma game with possible actions $C_i$ for $P_i$ cooperating with the other player
and $D_i$ for $P_i$ defecting against the other player. (Earlier, these actions were called quiet and fink, respectively.)
The payoff matrix for the game is assumed to be as follows:
\[
\begin{array}{c|cc}
 & C_2 & D_2 \\ \hline
C_1 & (2, 2) & (0, 3) \\
D_1 & (3, 0) & (1, 1)
\end{array}
\]
We want to consider repeated play of this game, either finitely or infinitely many times. To simplify the
situation, we consider the players making simultaneous moves with the current move unknown to the other
player. This is defined formally on page 206. We use a game graph rather than a game tree to represent this
game. See Figure 1.
FIGURE 1. Game graph for the repeated prisoner’s dilemma: at each stage the edges are labeled by the four action profiles (C, C), (C, D), (D, C), and (D, D).
Let $a^{(t)} = (a_1^{(t)}, a_2^{(t)})$ be the action profile at the $t^{th}$ stage. The one-step payoff is assumed to depend
only on the action profile at that stage, $u_i(a^{(t)})$. There is a discount factor $0 < \delta < 1$ to bring this quantity
back to an equivalent value at the first stage, $\delta^{t-1} u_i(a^{(t)})$. For a finitely repeated game of $T$ stages (finite
horizon), the total payoff for $P_i$ is
\[
U_i(a^{(1)}, \dots, a^{(T)}) = u_i(a^{(1)}) + \delta\, u_i(a^{(2)}) + \cdots + \delta^{T-1} u_i(a^{(T)})
= \sum_{t=1}^{T} \delta^{t-1} u_i(a^{(t)}).
\]
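Since this is just a geometrically weighted finite sum, it is straightforward to evaluate numerically. The following Python sketch (the names PAYOFF and discounted_payoff are ours, not from the book) computes the finite-horizon payoff for the matrix above.

```python
# A minimal sketch (not from the notes): the payoff matrix above and the
# finite-horizon discounted payoff U_i = sum_{t=1}^T delta^{t-1} u_i(a^{(t)}).
PAYOFF = {
    ("C", "C"): (2, 2),
    ("C", "D"): (0, 3),
    ("D", "C"): (3, 0),
    ("D", "D"): (1, 1),
}

def discounted_payoff(profiles, delta, player):
    """Discounted total payoff for `player` (0 or 1) over a finite list of action profiles."""
    return sum(delta ** t * PAYOFF[a][player] for t, a in enumerate(profiles))

# Three stages of mutual cooperation at delta = 0.9: 2 + 2(0.9) + 2(0.81) = 5.42
print(discounted_payoff([("C", "C")] * 3, 0.9, player=0))
```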
There are a couple of ways to understand the discounting. If $r > 0$ is an interest rate, then capital $V_1$ at
the first stage is worth $V_t = (1+r)^{t-1} V_1$ at the $t^{th}$ stage ($t-1$ steps later). Thus, the value of $V_t$ measured at the
first stage is $V_t/(1+r)^{t-1}$. In this context, the discount factor is $\delta = 1/(1+r)$. If the payoff is not money but
satisfaction, then $\delta$ measures the extent to which the player wants rewards now, i.e., how impatient the player
is. See the book for further explanation.
For a finitely repeated prisoner’s dilemma game with payoffs as above, at the last stage both players
optimize their payoff by selecting $D_i$. Given this choice, the choice that optimizes the payoff at stage
$T-1$ is again $D_i$. By backward induction, both players will select $D$ at each stage. See Section 14.4.
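The step driving the backward induction is that $D$ strictly dominates $C$ in the one-shot game. A minimal check (the dictionary payoff_p1 is our own encoding of the first coordinates of the matrix above):

```python
# Sketch of the observation driving the backward induction: in the one-shot
# game, D gives P1 a strictly higher payoff than C against either action of P2.
payoff_p1 = {("C", "C"): 2, ("C", "D"): 0, ("D", "C"): 3, ("D", "D"): 1}
for other in ("C", "D"):
    better = payoff_p1[("D", other)] > payoff_p1[("C", other)]
    print(f"against {other}: D beats C? {better}")  # True in both cases
```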
INFINITELY REPEATED GAMES (INFINITE HORIZON)
For the rest of this section, we consider an infinitely repeated game starting at stage one (infinite horizon).
The discounted payoff for player Pi is given by
\[
U_i(\{a^{(t)}\}_{t=1}^{\infty}) = \sum_{t=1}^{\infty} \delta^{t-1} u_i(a^{(t)}).
\]
If $\{w_t\}_{t=1}^{\infty}$ is the stream of payoffs (for one of the players), then the discounted sum is
\[
U(\{w_t\}_{t=1}^{\infty}) = \sum_{t=1}^{\infty} \delta^{t-1} w_t.
\]
If all the payoffs are the same value, $w_t = c$ for all $t$, then
\[
U(\{c\}_{t=1}^{\infty}) = \sum_{t=1}^{\infty} \delta^{t-1} c
= c \sum_{k=0}^{\infty} \delta^{k}
= \frac{c}{1-\delta},
\quad\text{so}\quad
c = (1-\delta)\, U(\{c\}_{t=1}^{\infty}).
\]
For this reason, the quantity
\[
\tilde{U}(\{w_t\}_{t=1}^{\infty}) = (1-\delta)\, U(\{w_t\}_{t=1}^{\infty})
\]
is called the discounted average. The quantity $\tilde{U}(\{w_t\}_{t=1}^{\infty})$ is such that if the same payoff is repeated
infinitely many times, then that same payoff is returned by $\tilde{U}$. Applying this to actions, the quantity
\[
\tilde{U}_i(\{a^{(t)}\}_{t=1}^{\infty}) = (1-\delta)\, U_i(\{a^{(t)}\}_{t=1}^{\infty})
\]
is the discounted average payoff of the action stream.
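A quick numerical sanity check of the constant-stream property (a sketch with our own function name; the infinite sum is approximated by a long truncation):

```python
# Sketch (names are ours): the discounted average of a truncated stream.
# For a long constant stream w_t = c, the value returned is approximately c.
def discounted_average(stream, delta):
    """(1 - delta) * sum_t delta^(t-1) w_t over a finite list approximating the stream."""
    return (1 - delta) * sum(delta ** t * w for t, w in enumerate(stream))

print(discounted_average([2] * 10_000, delta=0.9))  # approximately 2.0
```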
SOME NASH EQUILIBRIUM STRATEGIES
We describe some strategies as reactions to action profiles that have gone before. We only describe
situations where both players use the same rules to define their strategies. In describing the strategy for Pi ,
we let j be the other player. Thus, if i = 1 then j = 2, and if i = 2 then j = 1. We then describe a manner
in which to understand these strategies in terms of a modified game graph.
Defection Strategy. In this strategy, both players select D in response to any history of actions. It is easy
to check that this is a Nash equilibrium.
Grim Trigger Strategy. (page 426) The strategy for $P_i$ is given by
\[
s_i(a^{(1)}, \dots, a^{(t-1)}) =
\begin{cases}
C_i & \text{if } t = 1 \text{ or } a_j^{(\ell)} = C \text{ for all } 1 \le \ell \le t-1, \\
D_i & \text{if } a_j^{(\ell)} = D \text{ for some } 1 \le \ell \le t-1.
\end{cases}
\]
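As a sketch, this rule can be written as a function of the opponent's past actions (the function name is ours):

```python
# Sketch (function name is ours): grim trigger as a rule on the opponent's history.
def grim_trigger(opponent_history):
    """Cooperate at t = 1 and as long as the opponent has always cooperated."""
    return "C" if all(a == "C" for a in opponent_history) else "D"

print(grim_trigger([]))               # "C": first stage
print(grim_trigger(["C", "C"]))       # "C": no defection observed
print(grim_trigger(["C", "D", "C"]))  # "D": a single defection triggers D forever
```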
We next describe this strategy in terms of states of the two players. The states are defined
so that the action of the strategy for player $P_i$ depends only on the state of $P_i$. These states can be used to
determine a new game tree that has a vertex at each stage for each pair of states of the two players.
For the grim trigger strategy, there are two states for $P_i$:
\begin{align*}
C_i &= \{t = 1\} \cup \{(a^{(1)}, \dots, a^{(t-1)}) : a_j^{(\ell)} = C_j \text{ for all } 1 \le \ell \le t-1\}, \\
D_i &= \{(a^{(1)}, \dots, a^{(t-1)}) : a_j^{(\ell)} = D_j \text{ for some } 1 \le \ell \le t-1\}.
\end{align*}
The strategy of Pi is to select Ci if the state is Ci and to select Di if the state is Di . The transitions between
the states depend only on the action of the other player at the last stage. This situation can be represented
by the game tree in Figure 2.
FIGURE 2. Game tree for grim trigger: each vertex is a pair of states for the two players, and the edges are labeled by the action profiles $(C_1, C_2)$, $(C_1, D_2)$, $(D_1, C_2)$, and $(D_1, D_2)$.
As given in the book, rather than giving a game tree, it is easier to give a figure presenting the transitions
and states (of only one player). See Figure 3.
FIGURE 3. States and transitions for grim trigger: state $C_i$ (play $C_i$) remains at $C_i$ when $P_j$ plays $C_j$ and moves to $D_i$ (play $D_i$) when $P_j$ plays $D_j$; state $D_i$ is absorbing (transition $*$).
We next check that if both players use the grim trigger strategy the result is a Nash equilibrium. Since we
start in state (C1 , C2 ), applying the strategy will keep both players in the same states. The one step payoff
at each stage is 2. Assume that P2 maintains the strategy and P1 deviates at stage T by selecting D1 . Then,
P2 selects C2 for t = T and selects D2 for t > T . The greatest payoff for P1 results from selecting D1 for
t > T . Thus, if P1 selects D1 for t = T , then the greatest payoff from that stage onward is
\[
3\delta^{T} + \delta^{T+1} + \delta^{T+2} + \cdots
= 3\delta^{T} + \delta^{T+1}\left(1 + \delta + \delta^{2} + \cdots\right)
= 3\delta^{T} + \frac{\delta^{T+1}}{1-\delta}.
\]
If $P_1$ plays the original strategy, the payoff from the $T^{th}$ stage onward is
\[
2\delta^{T} + 2\delta^{T+1} + 2\delta^{T+2} + \cdots = \frac{2\delta^{T}}{1-\delta}.
\]
Therefore, the grim trigger strategy is a Nash equilibrium provided that
\begin{align*}
\frac{2\delta^{T}}{1-\delta} &\ge 3\delta^{T} + \frac{\delta^{T+1}}{1-\delta}, \\
2 &\ge 3(1-\delta) + \delta = 3 - 2\delta, \\
2\delta &\ge 1, \\
\delta &\ge \tfrac{1}{2}.
\end{align*}
This shows that if both players are patient enough so that $\delta \ge 1/2$, then the grim trigger strategy is a Nash
equilibrium.
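A numerical check of this threshold (a sketch; the common factor $\delta^{T}$ is divided out of both payoffs):

```python
# Sketch: the two payoffs from stage T onward, with the common factor delta^T
# divided out, so the comparison matches the displayed inequality.
for delta in (0.3, 0.5, 0.7):
    stay = 2 / (1 - delta)             # follow grim trigger: 2/(1 - delta)
    deviate = 3 + delta / (1 - delta)  # best deviation: 3 + delta/(1 - delta)
    print(f"delta = {delta}: stay = {stay:.3f}, deviate = {deviate:.3f}")
# Staying is at least as good exactly when delta >= 1/2.
```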
Tit-for-tat Strategy. (page 427, Section 14.7.3) We describe this strategy in terms of states of the players.
For the tit-for-tat strategy, there are two states for $P_i$ that depend only on the action of $P_j$ in the last period:
\begin{align*}
C_i &= \{t = 1\} \cup \{(a^{(1)}, \dots, a^{(t-1)}) : a_j^{(t-1)} = C_j\}, \\
D_i &= \{(a^{(1)}, \dots, a^{(t-1)}) : a_j^{(t-1)} = D_j\}.
\end{align*}
The transitions between states are given in Figure 4.
FIGURE 4. States and transitions for tit-for-tat: state $C_i$ remains at $C_i$ on $C_j$ and moves to $D_i$ on $D_j$; state $D_i$ moves back to $C_i$ on $C_j$ and remains at $D_i$ on $D_j$.
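A short simulation sketch (names are ours) shows the two behaviors used below: mutual tit-for-tat stays at $(C, C)$, and alternates once the players are out of phase:

```python
# Sketch (names are ours): both players follow tit-for-tat, so each stage simply
# swaps the previous action profile (each player copies the other's last action).
def simulate_tit_for_tat(first_profile, rounds=6):
    profiles = [first_profile]
    for _ in range(rounds - 1):
        a1, a2 = profiles[-1]
        profiles.append((a2, a1))  # each player repeats the other's last action
    return profiles

print(simulate_tit_for_tat(("C", "C")))  # stays at (C, C) forever
print(simulate_tit_for_tat(("D", "C")))  # alternates (D, C), (C, D), (D, C), ...
```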
We next check that the tit-for-tat strategy by both players is also a Nash equilibrium for $\delta \ge 1/2$. Assume
that $P_2$ maintains the strategy and $P_1$ deviates by selecting $D_1$ at the $T^{th}$ stage. The payoff for the original
strategy starting at the $T^{th}$ stage is
\[
\frac{2\delta^{T}}{1-\delta}.
\]
The other possibilities for actions by $P_1$ include (a) $D_1$ for $t \ge T$, (b) alternating $D_1$ and $C_1$ forever, and
(c) $D_1$ for $k$ times and then $C_1$. (The latter returns $P_2$ to the original state, so it is enough to calculate this
segment of the payoffs. Note that the book ignores the last case.) We check these three cases in turn.
(a) If $P_1$ uses $D_1$ for $t \ge T$, then $P_2$ uses $C_2$ for $t = T$ and then $D_2$ for $t > T$. The payoff for these choices
is
\[
3\delta^{T} + \delta^{T+1} + \delta^{T+2} + \cdots = 3\delta^{T} + \frac{\delta^{T+1}}{1-\delta}.
\]
For tit-for-tat to be a Nash equilibrium, we need
\begin{align*}
\frac{2\delta^{T}}{1-\delta} &\ge 3\delta^{T} + \frac{\delta^{T+1}}{1-\delta}, \\
2 &\ge 3(1-\delta) + \delta = 3 - 2\delta, \\
2\delta &\ge 1, \\
\delta &\ge \tfrac{1}{2}.
\end{align*}
(b) If $P_1$ alternates $D_1$ and $C_1$, then $P_2$ alternates $C_2$ and $D_2$. The payoff for $P_1$ is
\[
3\delta^{T} + (0)\delta^{T+1} + 3\delta^{T+2} + \cdots
= 3\delta^{T}\left(1 + \delta^{2} + \delta^{4} + \cdots\right)
= \frac{3\delta^{T}}{1-\delta^{2}}.
\]
In order for tit-for-tat to be a Nash equilibrium, we need
\begin{align*}
\frac{2\delta^{T}}{1-\delta} &\ge \frac{3\delta^{T}}{1-\delta^{2}}, \\
2(1+\delta) &\ge 3, \\
2\delta &\ge 1, \\
\delta &\ge \tfrac{1}{2}.
\end{align*}
We get the same condition on $\delta$ as in case (a).
(c) If $P_1$ selects $D_1$ for $k$ stages and then $C_1$, then $P_2$ will select $C_2$ and then $D_2$ for $k$ stages. At the end,
$P_2$ is back in state $C_2$. The payoffs for these $k+1$ stages of the original strategy and the deviation are
\[
2\delta^{T} + \cdots + 2\delta^{T+k}
\quad\text{and}\quad
3\delta^{T} + \delta^{T+1} + \cdots + \delta^{T+k-1} + (0)\delta^{T+k}.
\]
Thus, we need
\begin{align*}
2\delta^{T} + \cdots + 2\delta^{T+k} &\ge 3\delta^{T} + \delta^{T+1} + \cdots + \delta^{T+k-1}, \quad\text{or} \\
-1 + \delta + \cdots + \delta^{k-1} + 2\delta^{k} &\ge 0.
\end{align*}
If $\delta \ge 1/2$, then
\begin{align*}
2\delta^{k} + \delta^{k-1} + \cdots + \delta - 1
&\ge 2\left(\tfrac{1}{2}\right)^{k} + \left(\tfrac{1}{2}\right)^{k-1} + \cdots + \tfrac{1}{2} - 1 \\
&= 2\left(\tfrac{1}{2}\right)^{k-1} + \left(\tfrac{1}{2}\right)^{k-2} + \cdots + \tfrac{1}{2} - 1 \\
&\;\;\vdots \\
&= 2\left(\tfrac{1}{2}\right) - 1 = 0,
\end{align*}
where each step combines the two leading terms using $2(1/2)^{k} = (1/2)^{k-1}$.
Thus, the condition is satisfied. This checks all the possible deviations, so the tit-for-tat strategy is a Nash
equilibrium for δ ≥ 1/2.
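A numerical check of the case (c) inequality (a sketch; at $\delta = 1/2$ the margin is exactly $0$ for every $k$, in agreement with the telescoping computation above):

```python
# Sketch: the case (c) quantity -1 + delta + ... + delta^(k-1) + 2 delta^k.
# At delta = 1/2 it is exactly 0 for every k, matching the telescoping argument.
def case_c_margin(delta, k):
    return -1 + sum(delta ** ell for ell in range(1, k)) + 2 * delta ** k

for k in (1, 2, 5, 10):
    print(k, case_c_margin(0.5, k), case_c_margin(0.6, k) > 0)
```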
Limited punishment Strategy. (Section 14.7.2) In this strategy, each player has $k+1$ states for some
$k \ge 2$. For $P_i$, starting in state $P_{i,0}$, if the other player selects $D_j$, then there is a transition to $P_{i,1}$, then a
transition to $P_{i,2}$, \dots, $P_{i,k}$, and then back to $P_{i,0}$. The transitions from $P_{i,\ell}$ for $1 \le \ell \le k$ do not depend
on the actions of either player. For the limited punishment strategy, the actions of $P_i$ are $C_i$ in state $P_{i,0}$ and
$D_i$ in states $P_{i,\ell}$ for $1 \le \ell \le k$. See Figure 5 for the case of $k = 2$. See the book for the case of $k = 3$.
FIGURE 5. States and transitions for limited punishment with $k = 2$: state $P_{i,0}$ (play $C_i$) moves to $P_{i,1}$ (play $D_i$) when $P_j$ plays $D_j$; the transitions from $P_{i,1}$ to $P_{i,2}$ and from $P_{i,2}$ back to $P_{i,0}$ occur regardless of the actions.
If $P_1$ selects $D_1$ at some stage, then $P_2$ will select $C_2$ and then $D_2$ for the next $k$ stages. The maximum payoff
for $P_1$ is obtained by selecting $D_1$ for all of these $k+1$ stages. The payoffs for $P_1$ are $2 + 2\delta + \cdots + 2\delta^{k}$ for
the limited punishment strategy, which results in all $C$ for both players, and $3 + \delta + \cdots + \delta^{k}$ for the deviation.
Therefore, we need
\begin{align*}
3 + \delta + \cdots + \delta^{k} &\le 2 + 2\delta + \cdots + 2\delta^{k}, \\
1 &\le \delta + \cdots + \delta^{k} = \delta\,\frac{1-\delta^{k}}{1-\delta}, \\
1 - \delta &\le \delta - \delta^{k+1}, \quad\text{and} \\
g_k(\delta) = 1 - 2\delta + \delta^{k+1} &\le 0.
\end{align*}
To check that this is true for $\delta$ large enough, we use calculus:
\begin{align*}
g_k(1) &= 0, \\
g_k\!\left(\tfrac{1}{2}\right) &= 1 - 1 + \left(\tfrac{1}{2}\right)^{k+1} > 0, \\
g_k'(\delta) &= -2 + (k+1)\delta^{k}, \quad\text{and} \\
g_k'(1) &= -2 + k + 1 > 0 \quad\text{since } k \ge 2.
\end{align*}
There is only one $\bar{\delta}$ in $(0, 1)$ such that $g_k'(\bar{\delta}) = 0$:
\[
\bar{\delta}^{k} = \frac{2}{k+1}, \qquad \bar{\delta} = \left(\frac{2}{k+1}\right)^{1/k}.
\]
Therefore, there is a $\tfrac{1}{2} \le \delta_k^{*} \le \bar{\delta} < 1$ such that $g_k(\delta) \le 0$ for $\delta_k^{*} \le \delta < 1$. For this range of $\delta$, the limited
punishment strategy is a Nash equilibrium.
The book mentions that δ2∗ ≈ 0.62 and δ3∗ ≈ 0.55.
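These thresholds can be verified numerically by finding the root of $g_k$ in $(1/2, 1)$. A bisection sketch (function names are ours):

```python
# Sketch (names are ours): locate delta_k^* as the root of
# g_k(delta) = 1 - 2 delta + delta^(k+1) in (1/2, 1) by bisection.
def g(delta, k):
    return 1 - 2 * delta + delta ** (k + 1)

def delta_star(k, tol=1e-10):
    lo, hi = 0.5, 0.999999  # g > 0 at lo; g < 0 at hi since g(1) = 0 and g'(1) > 0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if g(mid, k) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

print(round(delta_star(2), 4))  # 0.618, matching the quoted 0.62
print(round(delta_star(3), 4))  # 0.5437, close to the quoted 0.55
```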
Existence of many Nash equilibria. The book states that it is possible to realize many different payoffs
with Nash equilibria. See Theorem 435.1. In particular, there are uncountably many different payoffs for
different Nash equilibria.
SUBGAME PERFECT EQUILIBRIA: SECTIONS 14.9 & 14.10
The following is a criterion for a subgame perfect equilibrium.
Definition 1. One deviation property: No player can increase her payoff by changing her action at the start
of any subgame in which she is the first mover, given the other players’ strategy and the rest of her own
strategy.
The point is that deviations need only be checked one stage at a time.
Proposition (438.1). A strategy profile in an infinitely repeated game with discount factor $0 < \delta < 1$ is a subgame
perfect equilibrium iff it satisfies the one deviation property.
Defection Strategy. This is obviously a subgame perfect equilibrium since the same choice is made at every
vertex and it is a Nash equilibrium.
Grim Trigger Strategy. (Section 14.10.1) This is not subgame perfect as given. Starting at the pair of states
$(C_1, D_2)$, it is not a Nash equilibrium. Since $P_2$ is playing the grim trigger in state $D_2$, she will pick $D_2$ at every
stage. Player $P_1$ will play $C_1$ at the first stage of the subgame and then $D_1$ at every stage after that. The payoff for $P_1$ is
\[
0 + \delta + \delta^{2} + \cdots.
\]
However, if $P_1$ changes to always playing $D_1$, then the payoff is
\[
1 + \delta + \delta^{2} + \cdots,
\]
which is larger. Therefore, this is not a Nash equilibrium on the subgame with root pair of states $(C_1, D_2)$.
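A one-line numerical comparison of the two streams (a sketch; any $0 < \delta < 1$ gives the same conclusion):

```python
# Sketch: payoffs for P1 in the subgame rooted at (C1, D2); the deviation to
# all-D beats following grim trigger for every 0 < delta < 1.
delta = 0.9
follow = 0 + delta / (1 - delta)   # stream 0, 1, 1, 1, ...
deviate = 1 / (1 - delta)          # stream 1, 1, 1, 1, ...
print(follow, deviate, deviate > follow)  # approximately 9.0, 10.0, True
```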
A slight modification leads to a subgame perfect equilibrium. Keep the states the same, but make a
transition from Ci to Di if the action of either player is D. See Figure 6. This gives a subgame perfect
equilibrium for δ ≥ 1/2.
FIGURE 6. States and transitions for the modified grim trigger: state $C_i$ remains at $C_i$ when both players play $C$, moves to $D_i$ when either player plays $D$, and state $D_i$ is absorbing.
Limited punishment Strategy. (Section 14.10.2) This can also be modified to make a subgame perfect
equilibrium: Make the transition from Pi,0 to Pi,1 when either player takes the action D. The rest is the
same.
Tit-for-tat Strategy. (Section 14.10.3) The four combinations of states for the two players are (C1 , C2 ),
(C1 , D2 ), (D1 , C2 ), and (D1 , D2 ). We need to check that the strategy is a Nash equilibrium on a subgame
starting at any of these four state profiles.
(i) (C1 , C2 ): The analysis we gave to show that it was a Nash equilibrium applies and shows that it is
true for δ ≥ 1/2.
(ii) $(C_1, D_2)$: If both players adhere to the strategy, then the actions will be
\[
(C_1, D_2),\ (D_1, C_2),\ (C_1, D_2),\ \dots,
\]
with payoff
\[
0 + 3\delta + (0)\delta^{2} + 3\delta^{3} + \cdots
= 3\delta\left(1 + \delta^{2} + \delta^{4} + \cdots\right)
= \frac{3\delta}{1-\delta^{2}}.
\]
If $P_1$ instead starts by selecting $D_1$, then the actions will be
\[
(D_1, D_2),\ (D_1, D_2),\ \dots
\]
with payoff
\[
1 + \delta + \delta^{2} + \cdots = \frac{1}{1-\delta}.
\]
So we need
\begin{align*}
\frac{3\delta}{1-\delta^{2}} &\ge \frac{1}{1-\delta}, \\
3\delta &\ge 1 + \delta, \\
2\delta &\ge 1, \\
\delta &\ge \tfrac{1}{2}.
\end{align*}
(iii) $(D_1, C_2)$: If both players adhere to the strategy, then the actions will be
\[
(D_1, C_2),\ (C_1, D_2),\ (D_1, C_2),\ \dots,
\]
with payoff
\[
3 + (0)\delta + 3\delta^{2} + (0)\delta^{3} + \cdots
= 3\left(1 + \delta^{2} + \delta^{4} + \cdots\right)
= \frac{3}{1-\delta^{2}}.
\]
If $P_1$ instead starts by selecting $C_1$, then the actions will be
\[
(C_1, C_2),\ (C_1, C_2),\ \dots
\]
with payoff
\[
2 + 2\delta + 2\delta^{2} + \cdots = \frac{2}{1-\delta}.
\]
So we need
\begin{align*}
\frac{3}{1-\delta^{2}} &\ge \frac{2}{1-\delta}, \\
3 &\ge 2 + 2\delta, \\
1 &\ge 2\delta, \\
\delta &\le \tfrac{1}{2}.
\end{align*}
(iv) $(D_1, D_2)$: If both players adhere to the strategy, then the actions will be
\[
(D_1, D_2),\ (D_1, D_2),\ (D_1, D_2),\ \dots,
\]
with payoff
\[
1 + \delta + \delta^{2} + \cdots = \frac{1}{1-\delta}.
\]
If $P_1$ instead starts by selecting $C_1$, then the actions will be
\[
(C_1, D_2),\ (D_1, C_2),\ \dots
\]
with payoff
\[
0 + 3\delta + (0)\delta^{2} + 3\delta^{3} + \cdots
= 3\delta\left(1 + \delta^{2} + \delta^{4} + \cdots\right)
= \frac{3\delta}{1-\delta^{2}}.
\]
So we need
\begin{align*}
\frac{1}{1-\delta} &\ge \frac{3\delta}{1-\delta^{2}}, \\
1 + \delta &\ge 3\delta, \\
1 &\ge 2\delta, \\
\delta &\le \tfrac{1}{2}.
\end{align*}
For all four of these conditions to hold, we need $\delta = 1/2$. Thus, tit-for-tat is a subgame perfect equilibrium only for the single value $\delta = 1/2$.
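The four comparisons can be checked numerically in one place (a sketch; at $\delta = 1/2$ each holds with equality):

```python
# Sketch: the four subgame payoff comparisons for tit-for-tat evaluated directly.
def conditions(d):
    return (
        2 / (1 - d) >= 3 + d / (1 - d),      # (i)   root (C1, C2), cf. case (a)
        3 * d / (1 - d**2) >= 1 / (1 - d),   # (ii)  root (C1, D2)
        3 / (1 - d**2) >= 2 / (1 - d),       # (iii) root (D1, C2)
        1 / (1 - d) >= 3 * d / (1 - d**2),   # (iv)  root (D1, D2)
    )

for d in (0.4, 0.5, 0.6):
    print(d, all(conditions(d)))  # only d = 0.5 satisfies all four, with equality
```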