Discrete Math Notes
Discrete Math Notes
Version 221106
Contents
To access the video links you need to first log into https://vle.york.ac.uk/webapps/login/
with your University of York username and password.
Part 1:
Lecture notes: Chapter 1, Exercises 1, Coursework Assignment 1
Part 2:
Lecture notes: Chapter 2, Exercises 2, Coursework Assignment 2
Part 3:
Lecture notes: Chapter 3, Exercises 3, Coursework Assignment 3
Part 4:
Lecture notes: Chapter 4, Exercises 4, Coursework Assignment 4
The text is accompanied by videos and Excel files, which explain various ideas, examples and
arguments. The videos can be accessed by links like the one above.
M.Capinski and T.Zastawniak, Mathematics for Finance, 2nd ed., Springer 2011.
3
Chapter 1
In this chapter we explore the simplest option pricing models. We assume that there is a
single trading period, from the present date (time 0) to a fixed future time T . We exam-
ine mathematical models for the behaviour of the prices of one or more underlying stocks,
beginning with the simplest case of a single stock whose price at time T takes one of just
two possible values. This analysis is extended to a model with two stocks, models with three
possible prices for each stock, and finally to a general pricing model with d > 1 stocks, each
with m possible outcomes for its price at T.
The principal economic requirement, arising from the assumption that the financial mar-
kets being modelled are efficient is that, provided all market participants share the same
information about the random evolution of the stock price, an investor in this market should
not be able to make a profit without incurring some level of risk. This assumption is cast in
mathematical terms and underlies the methods we develop to establish how to determine the
value of financial instruments whose price depends only on the price of the underlying stock
at time T : the so-called European derivative securities, whose study is the principal topic of
this and the next two chapters.
4
Assets
Assume we are given some asset, which is usually thought of as a stock and is customarily
referred to, in defiance of proper grammatical usage, as the underlying since its price will
determine the prices of the securities we wish to study.
The current price S(0) > 0 of the underlying is known, while its future price S(1) is not,
but we can consider all possible future prices and the probabilities of attaining them. This
is performed formally by first choosing a non-empty set Ω, called the sample space. The
elements of this set are related to outcomes of some random experiment (either specific,
tossing a coin, or quite vague, the future state of the economy) and we call them scenarios
or states. In these lecture notes Ω will always be a finite set.
Just for the current section we take a two-element set Ω = {u, d} and we let the future
price of our asset be a function
In general, a function defined on Ω is called a random variable. Since S(1) takes just two
values, so does the return
S(1) − S(0)
KS = ,
S(0)
which determines the price
S(1) = S(0)(1 + KS ).
We shall define S(1) by first deciding the shape of KS . Assume that
KS (u) = U,
KS (d) = D,
where
−1 < D < U
to avoid negative prices. Then
S(1, u) = S(0)(1 + U ),
S(1, d) = S(0)(1 + D).
It is convenient to introduce the following notational convention: S u = S(1, u), S d = S(1, d),
which gives the random variable of stock prices as
Su S(0)(1 + U )
S(1) = = .
Sd S(0)(1 + D)
The probabilities are given by prescribing a number p from the interval (0, 1) so that
P ({u}) = p,
P ({d}) = 1 − p.
5
Here we are careful with notation since P assigns numbers to subsets of Ω, but the following
simplified notation is quite useful: P (ω) = P ({ω}) for any ω ∈ Ω. Furthermore, the proba-
bility P is assumed to be zero for the empty set and 1 for the whole of Ω, and so it is defined
for all subsets of Ω, called events, satisfying additionally the so-called additivity property:
P (A ∪ B) = P (A) + P (B)
for any A, B ⊂ Ω such that A ∩ B = ∅. (The choice of such A, B in our case is rather
limited!)
We also assume that there is an accompanying non-random asset, called the money
market account, with deterministic prices A(0), A(1), where
A(1) = A(0)(1 + R)
Investment
Suppose we have at our disposal a certain sum of money, which we invest by purchasing a
portfolio of x units of the above risky asset and y units of the money market account. We
build a portfolio (x, y) with initial value given by
that is,
xS(0)(1 + U ) + yA(0)(1 + R)
V(x,y) (1) = .
xS(0)(1 + D) + yA(0)(1 + R)
We assume that the market is frictionless: we do not impose any restrictions on the
numbers x, y, so that unlimited short-selling is allowed: at time 0 we can borrow a share,
sell it, and purchase some other asset, while at time 1 we buy the share back to return it
to the owner. The assets are assumed to be arbitrarily divisible, meaning that x, y can
take arbitrary real values. We do not impose any bounds on x, y, thus assuming unlimited
liquidity in the market. Finally, there are no transaction costs involved in trading, that is, the
same stock price applies to long (buy: x > 0) and short (sell: x < 0) positions. Moreover,
risk-free investment (y > 0} and borrowing (y < 0) both use the same rate of return R.
6
Our most important modelling assumption is the No Arbitrage Principle (NAP), which
asserts that trading cannot yield riskless profits. In general, the form of this assumption
depends crucially on the class of trading strategies that we allow in the market. In the
present context, with no intervening trading dates, this is trivial, since a ‘strategy’ consists
of a single portfolio chosen at time 0.
Definition 1.1
A portfolio (x, y), chosen at time 0, is called an arbitrage opportunity if
For our choice of Ω, condition (3) reduces to saying that we have a strict inequality for
at least one of the two possible scenarios.
Assumption 1.2 (No Arbitrage Principle)
This has an immediate consequence for the relationship between the returns on the risky
and riskless assets:
Theorem 1.3
The No Arbitrage Principle implies that
D < R < U.
Proof
If R ≤ D, that is,
S(0)(1 + R) ≤ S d < S u ,
we take x = 1 and y = −S(0)/A(0). At time 1
which is strictly positive in the ‘up’ state, that is, with probability p > 0.
If R ≥ U , that is,
S d < S u ≤ S(0)(1 + R),
7
we take the opposite position: x = −1 and y = S(0)/A(0). Then
Proof
Suppose (x, y) is a portfolio with zero initial value. By the definition of portfolio value
so
S(0)
y = −x .
A(0)
We compute the terminal value with y as determined above:
under the given assumption that D < R < U. If x < 0, the above signs are reversed
and they remain opposites. So (x, y) cannot be an arbitrage opportunity for any choice
of x.
Definition 1.5
Suppose that an asset X has value X(1) at time 1. Its discounted value (discounted
8
to time 0) is given by
X̃(1) = X(1)(1 + R)−1 .
Moreover, for the price X(0) at time 0 we define the discounted value simply as
X̃(0) = X(0).
and define the discounted stock price process S̃ = {S̃(0), S̃(1)}. Clearly, Ã(1) = Ã(0) =
A(0). Similarly, the discounted value process becomes
yielding
so the change of the discounted portfolio value results exclusively from the change of the
discounted value of the risky asset.
where we use the notation f + (ω) = max{0, f (ω)} for any function f on Ω and ω ∈ Ω. (The
reader should take care not to confuse the return KS on the stock S with the strike price K
of the option.) The effect of holding a call option is the right (without an obligation) to buy
the underlying at the expiry (or exercise) time 1 for a price that cannot exceed K since the
option will pay the difference if the asset is more expensive. If the underlying can be bought
in the market for less than K at time 1, the option is simply not exercised.
9
The right to sell the asset for at least K at the expiry time 1 is called a European put
option and its payoff is of the form
K − S(1) if S(1) < K
P (1) =
0 otherwise
= (K − S(1))+ .
To avoid trivial cases, we assume that the strike price K satisfies
S(0)(1 + D) < K < S(0)(1 + U ).
Then it is clear from the No Arbitrage Principle that options cannot be issued for free at
time 0 since the payoffs are always non-negative and have positive probability of being strictly
positive.
Finding ‘rational’ (that is, arbitrage free) prices C(0) (resp. P (0)) for a call (resp. put)
option in a given market model is one of the major tasks of mathematical finance. The initial
price C(0) (resp. P (0)) is called the option premium.
Example 1.6
Let U = 0.3, D = −0.2, and R = 0.2. Moreover, let S(0) = 100 and A(0) = 100.
We consider a put and a call option with expiry time 1 and strike price K = 110. Since
S u = 130 and S u = 80, the call payoff will be
u
S − K in the u scenario 20 in the u scenario
C(1) = = ,
0 in the d scenario 0 in the d scenario
We now look for a portfolio (x, y) (consisting of the underlying asset and a position in
the money market account) with the same value at time 1 as the option,
V(x,y) (1) = C(1).
It is called a replicating portfolio. This gives a pair of simultaneous equations
xS(0)(1 + U ) + yA(0)(1 + R) = S(0)(1 + U ) − K,
xS(0)(1 + D) + yA(0)(1 + R) = 0,
which are easily solved:
S(0)(1 + U ) − K
x= ,
S(0)(U − D)
(1 + D)(S(0)(1 + U ) − K)
y=− .
A(0)(U − D)(1 + R)
10
Example 1.7
Given the data from Example 1.6, let us compute the replicating portfolio (x, y) for
the call option. The above simultaneous equations become
Video
We claim that the price of the option should be the initial value of the replicating portfolio.
This is based on the intuition that two portfolios with identical values at time 1 should have
the same values at time 0. The justification involves the No Arbitrage Principle: in the case
of different initial values, at time 0 we buy the cheaper asset and sell the more expensive
one, and at time 1 we keep the profit. Doing this requires trading in options, which leads us
to extend the notion of a portfolio in the market extended by the option.
We extend the notion of a portfolio by including the position in options: the extended
portfolio is now a triple (x, y, z), where x represents the number of units of stock S, y the
risk-free position (the money market account A), and z the number of units of the call option
C held (z can be negative, which corresponds to taking a short position by writing and selling
z options).
The definition of the portfolio value is also extended in a natural way:
11
Bear in mind that the second equation asserts equality of random variables! This allows us
to formulate our key assumption in a similar way as before.
Definition 1.8
We say that (x, y, z) is an (extended) arbitrage opportunity if
Later on we shall always assume that the notion of a portfolio is similarly extended to
cover all securities traded in the marker.
Theorem 1.10
The No Arbitrage Principle implies that the price of a European call with strike price K
is the value at time 0 of the replicating portfolio (x, y), that is,
Proof
Suppose that C(0) > xS(0) + yA(0). Then, if we sell the expensive asset (option)
and buy the cheap one (portfolio (x, y)), we obtain a positive balance C(0) − (xS(0) +
yA(0)) at time 0 To construct an arbitrage opportunity we invest this amount in the
money market account in addition to yA(0) already held, so our extended portfolio is
1
(x0 , y 0 , z 0 ) = (x, (C(0) − xS(0)) , −1)
A(0)
12
At time 1, in each scenario the value of this portfolio is
since by replication C(1) = xS(1) + yA(0)(1 + R). But C(0) − (xS(0) + yA(0)) > 0
by our assumption, so we have a strictly positive outcome in each scenario at zero
initial cost, more than is needed to contradict the No Arbitrage Principle.
On the other hand, if C(0) < xS(0) + yA(0), we can take the opposite position
to construct an arbitrage opportunity.
Example 1.11
With the data given in Example 1.6 for the market model and a call option, we com-
puted the replicating portfolio in Example 1.7 to be
2 4
(x, y) = ,− .
5 15
Then, according to Theorem 1.10, we can find the time 0 call price as
2 4 40
C(0) = xS(0) + yA(0) = × 100 − × 100 = ≈ 13.33.
5 15 3
Video
Remark 1.12
The fact that the probabilities of the stock movements do not appear in the pricing
formula is a paradox. One might expect that a call option where the probability of
the up movement of the stock is high should cost more as it is more likely to be
exercised, but this is not the case. This surprising effect arises from the fact that the
No Arbitrage Principle and replication do not directly involve the probabilities of up or
down movements of the stock.
Example 1.13
An example of placing bets is quite illuminating. To determine the pay-out ratios a
bookmaker follows the proportion of bets placed, and not the ‘true’ probabilities based
on his own perceptions of likelihoods. So if 1000 bets are on team A and 2000 on
13
team B, then the ratios are 3/1 for A and 1.5/1 for B. The bookmaker is safe and
cannot lose money.
On the other hand, in casinos the pay-outs for roulette depend on the probabilities
being 34/1 since there are 34 equally likely fields. The house here can go bankrupt.
In reality, in both cases the ratios are slightly unfavourable to the players, so the
bookmaker has a steady guaranteed income and the casino has an income in the long
run.
H(1) = h(S(1))
for some payoff function h : (0, ∞) → R. (In fact, any random variable X : Ω → R can be
written in such form.) We simply write h(S u ) = X(u) and h(S d ) = X(d), and extend h to
the whole of R arbitrarily. (Here we make use of the fact that S u 6= S d .)
Example 1.14
For example, the call option has the payoff function of the form
xS u + yA(0)(1 + R) = h(S u ),
xS d + yA(0)(1 + R) = h(S d ).
14
Solving this linear system for x and y, we get
h(S u ) − h(S d )
x= ,
Su − Sd
h(S ) u u d
h(S u ) − xS u S u −S d
(S u − S d ) − h(SS u)−h(S
−S d
) u
S
y= =
A(0)(1 + R) A(0)(1 + R)
1 h(S )S − h(S u )S d
d u
= .
A(0)(1 + R) Su − Sd
Definition 1.15
The delta ∆H of the derivative security H is the underlying asset position in the
replicating portfolio, given by
h(S u ) − h(S d )
∆H = .
Su − Sd
h(S u ) − h(S d )
x= ,
S(0)(U − D)
(1 + U )h(S d ) − (1 + D)h(S u )
y= .
A(0)(U − D)(1 + R)
We claim that the derivative price H(0) is the present value of this portfolio, namely H(0) =
V(xH ,yH ) (0) = xH S(0) + yH A(0), which becomes
The proof of the following theorem is exactly the same as in the case of the call options
above. We need only replace C(1) by H(1) = h(S(1)). The details are left to the reader.
Theorem 1.16
The No Arbitrage Principle implies that the price of the security with payoff H(1) =
h(S(1)) has the form (1.1).
Example 1.17
Considering the same data and the same call option as in Examples 1.6 and 1.7, we
15
can now use formula (1.1) to compute the time 0 call price once again. With
16
Definition 1.18
Let Q be a probability on Ω = {u, d} with values q = Q(u) and 1 − q = Q(d). We
say that Q us a risk-neutral probability on Ω = {u, d} if 0 < q < 1 and
Example 1.20
Numerical computations for option pricing using the risk-neutral probability in a single-
step binomial model can be found in the accompanying Excel file. The input data and
call option are the same as in Examples 1.6, 1.7, 1.11, and 1.17
Remark 1.21
Expectation can be defined for any random variable X : Ω → R and for any probability
on Ω. In the case when the probability used is the original probability P (called
physical or real probability, in contrast to the risk-neutral one) the subscript P is
often dropped, so that
Consider the discounted underlying asset price and compute the risk-neutral expectation
S(0)(1 + U ) S(0)(1 + D)
EQ (S(1))
e = q + (1 − q)
1+R 1+R
R − D S(0)(1 + U ) U − R S(0)(1 + D)
= +
U −D 1+R U −D 1+R
= S(0) = S(0).
e
17
Saying that EQ (S(1))
e = S(0)
e means that the sequence S(0),
e S(1)
e is a (one-step) martingale
under Q. Accordingly, we often call Q a martingale probability.
The reason why Q is referred to as ‘risk-neutral’ becomes clear when we calculate the
expected return on the stock under this probability: it turns out to be equal to the return R
on the risk-free asset.
Theorem 1.22
The expected return under Q on the underlying is equal to the risk-free return if and
only if Q is a martingale probability:
EQ (KS ) = R.
Proof
R−D U −R
EQ (KS ) = qU + (1 − q)D = U+ D = R.
U −D U −D
R−D
On the other hand, if qU + (1 − q)D = R, then solving this for q gives q = U −D
.
i = 1, 2. In general, we do not expect both returns to agree with the ‘up’ and ‘down’
meaning of the elements of Ω = {u, d}. For instance, we may have K1 (u) > K1 (d),
18
but K2 (u) < K2 (d). However we always assume distinct values: w the natural notation
Ki (u) = Ui , Ki (d) = Di for i = 1, 2, we assume that U1 6= D1 and U2 =
6 D2 .
Example 1.23
There are real numbers a 6= 0, and b such that K1 = aK2 + b. To show this we need
to solve the following two equations for a and b:
U1 = aU2 + b,
D1 = aD2 + b.
This gives
U1 − D1
a= ,
U2 − D2
D1 U2 − D2 U1
b= .
U2 − D2
We can interpret this result in terms of the covariance of K1 and K2 . Given a random
variable X, we obtain the centred (mean-zero) random variable Xc = X − E(X) by subtract-
ing its expectation. So the covariance of X and Y can be written as cov(X, Y ) = E(Xc Yc ).
Normalising this, we obtain the correlation coefficient
cov(X, Y )
ρX,Y = ,
σX σY
since σX = σXc and similarly for Y. Note that we always have |ρX,Y | ≤ 1. The linear
relationship between K1 and K2 shows that they are perfectly correlated, that is |ρK1 ,K2 | = 1.
We can regard S2 as a derivative security with S1 as the underlying; a function h such
that S2 (1) = h(S1 (1)) can be easily found.
Denote by Q1 the risk-neutral probability determined by S1 , that is Q1 = (q1 , 1 − q1 ) with
R − D1
q1 = .
U1 − D1
We know that the no-arbitrage price of the derivative security S2 is given by
S2 (0) = EQ1 (S̃2 (1)). (1.4)
Introduce the risk-neutral probability Q2 given by S2 , that is, with
R − D2
q2 = ,
U2 − D2
and by the martingale property for S̃2 (i.e. the equality of expectations) we have
S2 (0) = EQ2 (S̃2 (1)). (1.5)
19
From (1.4) and (1.5) we get
This gives q1 U2 + (1 − q1 )D2 = R, which implies q1 = q2 . We have thus proved the following
result.
Theorem 1.24
No-arbitrage implies Q1 = Q2 .
Proof
Seeking a candidate for arbitrage, suppose that for some portfolio we have
and
V(x,y,z) (1) = xS1 (1) + yA(1) + zS2 (1) ≥ 0.
Take the expectation with respect to Q1
EQ1 (V(x,y,z) (1)) = xEQ1 (S1 (1)) + yA(1) + zEQ1 (S2 (1))
= xEQ1 (S1 (1)) + yA(1) + zEQ2 (S2 (1))
= xS1 (0)(1 + R) + yA(0)(1 + R) + zS2 (0)(1 + R)
= 0.
In general, a non-negative random variable with zero expectation must be zero almost
surely. Here this expectation is a weighted average of non-negative numbers with
positive weights, so each of the numbers must be zero, and arbitrage is excluded.
Example 1.26
Consider two risky assets with up/down returns U1 = 20%, D1 = −10% and U2 =
30%, D2 = −20%, respectively. Suppose that R = 10%. We can compute the
20
risk-neutral probability of the ‘up’ scenario for each these two risky assets:
R − D1 0.1 − (−0.1) 2
q1 = = = ,
U1 − D1 0.2 − (−0.1) 3
R − D2 0.1 − (−0.2) 3
q2 = = = .
U2 − D2 0.3 − (−0.2) 5
Remark 1.27
What the above results show is that the binomial model does not allow sufficient
freedom to include two arbitrary stocks in the model: under the No Arbitrage Principle,
we find that their risk-neutral probabilities are identical and the stocks are perfectly
correlated. Thus, in order to construct models which reflect real markets more closely
we are forced to increase their complexity. First we shall increase the number of possible
random outcomes.
with pu , pm , pd ∈ (0, 1) such that pu + pm + pd = 1, assuming that Ω = {u, m, d}. The the
expectation of a random variable X on Ω takes the form
We assume that the three returns are different, and without loss of generality we may further
assume D < M < U.
21
A derivative security will be a random variable H(1) = h(S(1)) as in the binomial model,
and again we note that any random variable X : Ω → R is of this form by defining h on the
three (different!) values of S(1) to agree with those of H(1) and extending it arbitrarily.
If R is outside the range of stock returns, the model makes no sense since it is always
preferable to either invest risk free or in the stock; in other words, the model admits arbitrage
and, as in the binomial case, the converse is also true.
Theorem 1.28
No-arbitrage is equivalent to D < R < U.
Proof
The proof of the implication from the left to right is the same as in the binomial case
(Theorem 1.3).
Conversely, a candidate (x, y) for an arbitrage opportunity must satisfy
S(0)
V(x,y) (0) = 0, xS(0) + yA(0) = 0, y = −x .
A(0)
with the opposite inequalities when x < 0. Hence (x, y) cannot be an arbitrage: if
R lies between U and D, then the three values above cannot all have the same sign
when x 6= 0, and they are all zero when x = 0. Therefore the model is arbitrage free.
In real life stock prices change in a discrete way, quite frequently, but the movements
are limited to the so-called ticks. So for example, given S(0) = 30 the possible values after
one time step are 30.50 or 29.50, say. However, the price may not change at all, hence
introducing the medium return is quite realistic.
Unfortunately, this leads to mathematical complications since replication (using only risk-
free investment and a single underlying) of random payoffs at time 1 is no longer always
possible. To see this, we try to replicate a random payoff H(1) with three values H u , H m , H d ,
22
which requires finding a portfolio (x, y) with V(x,y) (1) = H(1), that is,
xS u + yA(0)(1 + R) = H u ,
xS m + yA(0)(1 + R) = H m ,
xS d + yA(0)(1 + R) = H d ,
obtaining a system of three equations in two variables which in general has no solution. Thus
typically a replicating portfolio for H(1) does not exist.
The holder of such a portfolio is protected against the option payoff. Super-replication is
satisfactory from the point of view of the trader who sold the option, for whom the option
payoff is a liability. This is a source of risk and super-replication hedges (i.e. eliminates) this
risk completely. Whatever happens, the trader will have sufficient funds to cover the amounts
due to the option holder. Therefore, the party selling the option would willingly accept the
following price:
Definition 1.30
The super-replication option price H super is the minimal initial value of a super-
replicating portfolio:
Example 1.31
Let S(0) = 30, U = 20%, M = 10%, D = −10%, R = 5%, and H = (S(1) − 32)+
(a call option with strike price 32). The least expensive super-replicating portfolio is
x = 0.4444, y = −11.4286 and the super-replication price is H super = 1.9048.
For details, see the Excel file, in which the Solver has been used to compute the
solution.
We should also analyse the problem from the point of view of the option buyer. Suppose
that the buyer wishes raise cash to buy the option by selling a portfolio (x, y) at time 0.
23
Then, at time 1, the buyer will receive the option payoff H(1) and will have to pay V(x,y) (1)
to close the short position in the portfolio. There will be no risk to the buyer as long as
V(x,y) (1) ≤ H(1). This leads to the following definition of a sub-replicating portfolio.
Definition 1.32
A portfolio (x, y) sub-replicates the option with payoff H(1) if
The highest amount V(x,y) (0) that can be raised by selling a sub-replicating portfolio to
buy the option is the so-called sub-replication price, formally defined as follows.
Definition 1.33
The sub-replication option price H sub is the maximal initial value of a sub-replicating
portfolio:
H sub = sup {V(x,y) (0) : V(x,y) (1) ≤ H(1)}.
(x,y)∈R2
Example 1.34
Let S(0) = 30, U = 20%, M = 10%, D = −10%, R = 5%, and H = (S(1) − 32)+
(a call option with strike price 32). The most expensive sub-replicating portfolio is
x = 0.1667, y = −4.2857 and the sub-replication price is H sub = 0.7143.
For details, see the Excel file, in which the Solver has been used to compute the
solution.
Remark 1.35
In fact, H sub is a maximum and, similarly, H super is a minimum (and not just a
supremum or infimum) since we are optimising the linear function V(x,y) (0) of x, y over
a closed subset of the plane, so that the supremum and infimum are attained at some
elements of the set.
24
The next Proposition justifies the terminology.
Proposition 1.37
If a ∈ (H sub , H super ), then taking H(0) = a does not lead to an (extended) arbitrage
opportunity.
Proof
Let (x, y, z) be such that
and
−xS(1) − yA(1) ≤ H(1),
hence (−x, −y) is sub-replicating. Thus
contradicting our assumption that V(x,y,z) (0) = 0. Similarly, when z = −1, we find
that (x, y) is super-replicating, yielding the contradiction V(x,y) (0) ≥ H super > H(0).
Next, consider the possibility of using risk-neutral probabilities, which in the binomial
model served as an alternative to replication for pricing. We use Q to denote a probability
measure on the three-element sample space Ω = {u, m, d}. It can be represented by a triple
qu , qm , qd ≥ 0 with qu + qm + qd = 1. The definition of a risk-neutral probability is similar to
that for two-element sample space.
Definition 1.38
We say that Q is a risk-neutral probability on Ω = {u, m, d} if qu , qm , qd ∈ (0, 1),
qu + qm + qd = 1. (1.6)
25
and
S(0) = (1 + R)−1 (qu S u + qm S m + qd S d ). (1.7)
qu U + qm M + qd D = R. (1.8)
We have two linear equations (1.7) and (1.8) with three unknowns qu , qm , qd . Such a system
may have many solutions, potentially giving infinitely many risk-neutral probabilities. Of
course, we still have to identify which of these solutions take all their values in the interval
(0, 1). The next proposition will show that in a pricing model consistent with the No Arbitrage
Principle there are always some that do.
Example 1.39
Proposition 1.40
If the trinomial pricing model is consistent with the No Arbitrage Principle, then there
is a probability Q that is non-zero on each scenario u, m, d such that the underlying
stock prices satisfy
S(0) = EQ [S̃(1)].
Proof
To find such a probability note that the absence of arbitrage (Theorem 1.28) ensures
that S d < S(0)(1 + R) < S u , or, equivalently, S̃ d < S(0) < S̃ u . Now, either S̃ m >
S(0) or S̃ m ≤ S(0). In the first case let A = 21 (S̃ u + S̃ m ) and let B = S̃ d ; in
the second case take A = S u and B = 12 (S̃ m + S̃ d ). In both cases we obtain B <
S(0) < A, so there is a c ∈ (0, 1) such that S(0) = cA + (1 − c)B. Hence either
S(0) = c 21 (S̃ u + S̃ m ) + (1 − c)S̃ d or S(0) = cS̃ u + (1 − c) 12 (S̃ m + S̃ d ). In each case
we can find the required probability measure Q = (qu , qm , qd ): in the first, we take
qu = qm = 2c , qd = 1 − c, and in the second case we use qu = c, qm = qd = 12 (1 − c).
26
In the binomial model the risk-neutral probability is unique and gives the price of any
derivative security as the discounted expectation of the payoff with respect to the risk-
neutral probability. Since here we may have many such probabilities, we cannot expect the
same result, but we now show that the risk-neutral probabilities determine a range of option
prices consistent with the No Arbitrage Principle. Let
Note that I is an interval since the set of risk-neutral probabilities is obviously convex and
the map
(qu , qm , qd ) → (1 + R)−1 EQ (H(1))
is linear.
Proposition 1.41
Each number in the interval I is a price of H consistent with the No Arbitrage Principle.
Proof
Let a ∈ I, write H(0) = a, so that H(0) = (1 + R)−1 EQ (H(1)) for some risk-neutral
probability Q. Consider a portfolio (x, y, z) in the extended market with V(x,y,z) (0) = 0
and V(x,y,z) (1) ≥ 0. Compute the expectation of discounted V(x,y,z) (1) with respect
to Q:
EQ ((1 + R)−1 V(x,y,z) (1)) = xEQ ((1 + R)−1 S(1)) + yA(0) + zEQ ((1 + R)−1 H(1))
= xS(0) + yA(0) + zH(0) = 0.
But zero expectation of a non-negative discrete random variable forces this random
variable to be zero, hence V(x,y,z) (1) = 0.
I = (H sub , H super ).
Proof
First, we show that I is contained in [H sub , H super ]. We assume A(0) = 1 without loss
of generality.
27
Suppose the (x, y) is a sub-replicating portfolio, that is,
Take any risk-neutral probability Q and compute the expectation of both sides:
which implies
xS(0) + y ≤ (1 + R)−1 EQ (H(1)).
Take the supremum over all such portfolios
so that
H sub ≤ (1 + R)−1 EQ (H(1)).
In the same manner we can prove that H super ≥ (1 + R)−1 EQ (H(1)) by considering
super-replicating portfolios (x, y) such that
which implies
xS(0) + y ≥ (1 + R)−1 EQ (H(1)).
Taking the infimum over all super-replicating portfolios (x, y) we conclude that
A = {(S(1, e ωi )) : i = 1, 2, 3} ⊂ R2 .
e ωi ), H(1,
Let C denote the closed convex hull of A, that is, the set of all, not necessarily strict,
convex combinations of these points. Since A has three elements, the closed set C is
either a triangle or a line segment in R2 . Since D < M < U , the three values S(1, e ωi )
for i = 1, 2, 3 are different from one another. This means that if C is a triangle, then
none of its sides is vertical, and if C is a line segment, then it is not vertical either.
28
First, suppose that the three points of A are not collinear, so that C is a triangle
2
in R . The No Arbitrage Principle ensures that S(0) lies strictly between the smallest
and largest value of S(1,
e ωi ) for i = 1, 2, 3. This follows since S(0)(1 + D) < S(0)(1 +
R) < S(0)(1 + U ). Thus the intersection of the triangle C with the vertical line
through (S(0), 0) is a non-empty line segment with endpoints (S(0), a), (S(0), b), say.
Clearly, (a, b) = {u : (S(0), u) ∈ int(C)}, where int(C) denotes the interior of C.
Moreover, we can express any (S(0), u) in int(C) as a strictly convex combination of
the points of A, that is,
3
X 3
X
(S(0), u) = ci (S(1,
e ωi ), H(1,
e ωi )), where ci > 0, ci = 1.
i=1 i=1
Now observe that, since the triangle C has no vertical edges, there is a non-vertical
straight line l with gradient x and intercept y passing through the point (S(0), b) such
that A is contained in the half-plane below this line. In fact, this is a line containing
one side of the triangle (see Figure 1.1, where Pi = (S(1, e ωi ), H(1,
e ωi ))). In other
words, H(1)
e ≤ xS(1)
e + y or, equivalently,
which means that (x, y) is a super-replicating portfolio. Because the line passes through
(S(0), b), we have
b = xS(0) + y ≥ H super .
Similarly, there is also a non-vertical straight line, again containing one side of the
triangle, with gradient x0 and intercept y 0 that passes through (S(0), a) such that A is
29
contained in the half-plane above this line (in Figure 1.1 this would be the line passing
through P1 and P3 ), so that H(1)
e ≥ x0 S(1)
e + y 0 . It follows that
so that (x0 , y 0 ) is a sub-replicating portfolio. Since the line passes through (S(0), a),
we have
a = x0 S(0) + y 0 ≤ H sub .
Hence we have shown that (a, b) contains (H sub , H super ), so that, I =
(H , H super ). This completes the proof in the case when C is a triangle.
sub
Finally, if C were a line segment, we already know that it would not be ver-
tical, so the line would have intercept x and gradient y. Since all three points
Pi = (S(1,
e ωi ), H(1,
e ωi )) would lie on this line, we would have xS(1)
e + y = H(1),
e so
xS(1) + y = H(1),
contrary to the assumption that H sub < H super . Hence C cannot, in fact, be a line
segment in these circumstances.
30
Remark 1.43
It should be clear from the above proof that this result also holds for any single-step,
single-stock model with n ≥ 3 possible outcomes at time 1. The convex hull is then
a polygon with at most n vertices in the plane, and otherwise the proof is essentially
the same.
Proof
In the definitions of H sub and H super we can take min and max since the sets are
closed. So there are portfolios (xsub , y sub ) sub-replicating H(1) and (xsuper , y super )
super-replicating H(1) such that
hence
(xsub − xsuper )S(0) = y super − y sub .
Moreover,
xsub S̃(1) + y sub ≤ H̃(1) ≤ xsuper S̃(1) + y super ,
so
(xsub − xsuper )S̃(1) ≤ y super − y sub = (xsub − xsuper )S(0).
Hence if xsub − xsuper > 0, then S̃(1) ≤ S(0), that is, U, M, D ≤ R, contradicting
the No Arbitrage Principle. Similarly, if xsub − xsuper < 0, then S̃(1) ≥ S(0), that
is, U, M, D ≥ R, also contradicting the No Arbitrage Principle. It follows that the
two portfolios are the same, (xsub , y sub ) = (xsuper , y super ), and thus form a replicating
portfolio.
This shows the existence of a replicating portfolio when H sub = H super . Next we show
that such a portfolio is unique.
Proposition 1.45
The replicating portfolio is unique.
31
Proof
Suppose that (x1 , y1 ) and (x2 , y2 ) replicate the claim H. This means hat
x1 S(0) + y1 = x2 S(0) + y2 ,
so
(x1 − x2 )S̃(1) = (y2 − y1 ) = (x1 − x2 )S(0).
As a result, x1 = x2 because S(1) is a random variable with at least two distinct
values. This, then, implies that y1 = y2 .
Finally we observe that in the current circumstances the claim H has a single price H(0)
consistent with the No Arbitrage Principle.
Proposition 1.46
The existence of a replicating portfolio for H implies that there is a unique price H(0)
that does not allow an extended arbitrage opportunity, and this price is given by
H(0) = EQ (H̃(1))
Proof
If a portfolio (x, y) replicating the claim H exists, then the price H(0) of the claim
must be equal to the initial value of the portfolio,
H(0) = xS(0) + y,
or else an extended arbitrage opportunity would exist. This proves the uniqueness
of H(0).
Now let Q be a risk-neutral probability, so that S(0) = EQ (S̃(1)). Since the
replicating portfolio satisfies xS̃(1) + y = H̃(1), we have
Going back to the proof of Theorem 1.42, this can be seen very clearly. When C is a line
32
segment (that is, the three points of A are collinear), the intersection of C and the vertical
line through (S(0), 0) is a single point. This point has coordinates (S(0), H(0)), where H(0)
is the unique price of the contingent claim.
We shall show in Theorem 1.63 that the existence of a replicating portfolio for every
derivative security ensures not only that the price of each derivative security is unique, but
also implies uniqueness for the risk-neutral probability.
33
Definition 1.47
A market model is complete if every random payoff H(1) can be replicated.
We now seek a risk-neutral probability so that both discounted stock prices satisfy the
martingale condition, that is,
EQ (K1 ) = R,
EQ (K2 ) = R,
which can be written as
qu U1 + qm M1 + qd D1 = R,
(1.9)
qu U2 + qm M2 + qd D2 = R.
A system of three linear equations (the third one is qu + qm + qd = 1) with three variables
can be solved, apart from some singular cases. However, the question for us is whether the
solution represents a genuine probability measure, that is, if the three numbers qu , qm , qd
belong to the interval (0, 1). We illustrate this with examples.
Example 1.48
For an example where Q is not a genuine probability, consider stocks S1 , S2 with
U1 = 30%, M1 = 5%, D1 = −20%, U2 = −10%, M2 = 0%, D2 = 15%, R = 5%.
Solving the above equations, we find qu = 1, qm = −1, qd = 1.
This model allows arbitrage: If S1 (0) = 100, S2 (0) = 50, A(0) = 1, the portfolio
x1 = −0.5, x2 = −2, y = 150 is an arbitrage opportunity: V(x1 ,x2 ,y) (0) = 0, while
V(x1 ,x2 ,y) (1) ≥ 2.5 in each scenario. Note that here the two stocks do not go ‘up’
together.
The calculations can be found in the Excel file.
This example illustrates the following First Fundamental Theorem of Asset Pricing, which
will be proved more generally in the next section.
Theorem 1.49
When Ω is finite, there is no arbitrage if and only if there exists a risk-neutral probability
measure, with positive probabilities for each scenario.
Let H(1) = (S1 (1) − X)+ , which is a call option on stock 1. Let R = 5%, and let
S1 (0) = 30, A(0) = 1, X = 32, U1 = 20%, M1 = 10%, D1 = −10%. Suppose
now that S2 (0) = 30 and the movements of this stock are given by U2 = −5%,
M2 = 15%, D2 = 10%. We can find a replicating strategy, which provides the option
34
price H(0) = 1.6402. To do this we simply solve the matrix equation
36 28.5 1.05 x1 4
33 25.5 1.05 x2 = 1
27 33 1.05 y 0
and compute H(0) = x1 S1 (0) + x2 S2 (0) + y. See the Excel file for details.
There is a unique risk-neutral probability Q and
This seemingly positive result requires some critical thinking. We have obtained a unique
price of H, but this was obtained as a result of completing the market by introducing a second
underlying security. The option price therefore depends on this choice, which contradicts
common sense. Why should some other stock influence the price of a call option written
on stock 1? So, in fact, we still have a range of prices, this time related to the variety of
securities which can be chosen to complete the market. To see a possible use of the above
ideas suppose that a derivative security H written on S1 is traded, its market prices available.
Now we can choose H as the additional security, making the market complete, and we may
price other derivatives using the unique risk-neutral probability. In this way a statement, often
found in the financial literature, that ‘the market chooses the risk-neutral probability’, gains
substance.
Example 1.51
With U2 = 10%, M2 = 5%, D2 = −20% and the remaining data as before, the
replication problem can be solved with H(0) = −1.2698 which is nonsense (a negative
price for a positive payoff security gives arbitrage). See Excel file.
Here there is no risk-neutral probability: the equations (1.9) have the unique
solution qm = 2, qu = −0.83 which does not yield a probability measure. See Excel
file.
As with the first fundamental theorem, this result will be formulated and proved in a more
35
general setting in the next section. Our two simple examples (binomial and trinomial models)
serve to illustrate the main features of the general theory we now proceed to develop.
X: {ω1 , . . . , ωM } → R,
The random vector of prices (S1 (1), . . . , Sd (1)) consists of d random variables, each
taking at most M different values. In fact, we shall assume that altogether M different
values of this random vector appear at time 1. This ensures that all M elements of Ω are
‘needed’. As a consequence of this natural non-degeneracy assumption, in the same way as
in the binomial or trinomial model, any random variable X: Ω → R can be written in the
form X = h(S1 (1), . . . , Sd (1)) for some function h: Rd → R.
These data constitute a finite single-step market model. The notion of a portfolio
must be adjusted to incorporate d stocks: it is a (d + 1)-vector (x, y) = (x1 , . . . , xd , y)
whose coordinates indicate the stock holdings and risk-free investment, respectively. The
initial value of the portfolio (x, y) is
d
X
V(x,y) (0) = xi Si (0) + yA(0),
i=1
36
Without loss of generality, we can take A(0) = 1 for simplicity. Then A(1) = 1 + R, where R
is the riskless rate.
Note that fixing the initial value V (0) and the vector x = (x1 , . . . , xd ) of stock holdings
of a portfolio automatically determines the bond holding y. This is true, in particular, for
arbitrage opportunities, whose definition extends as follows to the current setting.
Definition 1.53
An arbitrage opportunity is a portfolio (x, y) = (x1 , . . . , xd , y) such that
The first fundamental theorem of asset pricing, which we now formulate and prove in this
setting, gives an equivalent condition for the absence of arbitrage.
q1 , . . . , qM ∈ (0, 1),
M
X
qj = 1,
j=1
and
EQ (Si (1)) = (1 + R)Si (0)
for all i = 1, . . . , d, where EQ denotes the expectation with respect to Q, and R
denotes the riskless interest rate. Such Q is often called a (single-step) martingale
probability for the stock price processes {Si : i = 1, . . . , d}.
37
Theorem 1.55 (First Fundamental Theorem: single step)
In a finite single-step market model there is no arbitrage opportunity if and only if there
exists a risk-neutral probability measure.
Proof
Recall that we assume A(0) = 1. First suppose that there is a risk-neutral prob-
ability Q. Consider any portfolio (x, y) = (x1 , . . . , xd , y) with V(x,y) (0) = 0 and
V(x,y) (1) ≥ 0 (otherwise this portfolio could not be an arbitrage opportunity). Com-
pute
d
X
EQ (V(x,y) (1)) = EQ xi Si (1, ωj ) + y(1 + R)
i=1
d
X
= xi EQ (Si (1)) + y(1 + R)
i=1
d
X
= (1 + R) xi Si (0) + y(1 + R)
i=1
= (1 + R)V(x,y) (0) = 0.
This implies that V(x,y) (1) = 0 with probability 1 since zero expectation of a non-
negative random variable forces this random variable to be zero with probability 1. It
follows that there is no arbitrage opportunity.
For the converse implication, suppose that there is no arbitrage opportunity. De-
fine the discounted gains of a portfolio (x, y) = (x1 , . . . , xd , y) to be the random
variable
X d S (1)
i
Gx = xi − Si (0) .
i=1
1+R
Note that y does not appear since the discounted gains of a risk-free position are zero.
The set
W = {Gx : x ∈ Rd }
of discounted portfolio gains is a vector space. It can be identified with a subspace
of RM by writing Gx = (Gx (ω1 ), . . . , Gx (ωM )).
We claim that if Gx (ω) ≥ 0 for all ω, then Gx (ω) = 0 for all ω. To see this,
take x = (x1 , . . . , xd ) such that Gx (ω) ≥ 0 for all ω. Ensure that the portfolio (x, y)
has zero initial wealth V(x,y) (0) = 0 by supplementing the xi by a suitable y. The
38
discounted final value of this portfolio is
d
1 X
Ve(x,y) (1) = xi Si (1) + y
1 + R i=1
d
1 X
= xi Si (1) + y − V (0)
1 + R i=1
d
X h S (1) i
i
= xi − Si (0)
i=1
1+R
= Gx ≥ 0.
We shall show that there is a vector Q in A with positive coordinates that is orthog-
onal to the subspace W. The coordinates of such a vector give the required probabili-
ties qj since orthogonality means that the Euclidean inner product of (q1 , . . . , qM ) and
(Gx (ω1 ), . . . , Gx (ωM )) vanishes for each Gx in W , that is,
M
X
EQ (Gx ) = qj Gx (ωj ) = 0
j=1
39
theorem!). Since Gx ≥ 0 implies Gx =P0, it follows that A∩W = ∅. Now Lemma 1.56
provides a z = (z1 , ..., zM ) such that M j=1 zj aj > 0 for all a = (a1 , ..., aM ) ∈ A and
PM
j=1 zj Gx (ωj ) = 0 for all Gx in W. It follows that zj > 0 for each j = 1, . . . , M
since we can take (a1 , ..., aM ) in A with aj = 1 (and so with the remaining coordinates
necessarily zero). Hence, for each j = 1, . . . , M, we can take
1
qj = PM zj > 0,
i=1 zi
PM
so that j=1 qj = 1 and
M
X
qj Gx (ωj ) = 0
j=1
Remark 1.57
We have shown that for finite single-step models the existence of a risk-neutral proba-
bility is equivalent to the absence of arbitrage in the model. We shall show later that
this remains true for multi-step finite market models, in general. Our proof will depend
heavily on the single-step case, and crucially on the finiteness of the market model,
that is, on the sample space Ω being finite.
40
Any random variable H is a derivative security in our market model since its value is
completely determined by the price vector S(1) and it can be written in the form H = h(S(1))
for some h : Rd → R, which is a consequence of the non-degeneracy assumption made at
the beginning of this section. In particular, for any ω ∈ Ω, the random variable 1{ω} is a
derivative security.
We now show that a complete market model is characterized by the existence of a unique
risk-neutral probability.
Theorem 1.59 (Second Fundamental Theorem: single step)
A single-step arbitrage-free market model is complete if and only if there is exactly one
risk-neutral probability.
Proof
No arbitrage implies the existence of a martingale probability, so we need to show that
we cannot have more than one such probability in a complete market. Suppose we
have two martingale probabilities Q1 6= Q2 . For any ω ∈ Ω, the random variable 1{ω}
is a derivative security. Since the market model is complete, there exists a replicating
portfolio (x, y) such that
so that
Q1 (ω) = Q2 (ω).
But ω ∈ Ω was arbitrary, so the probabilities Q1 and Q2 coincide.
For the converse implication, assume Q is the unique risk-neutral probability and
suppose there is a derivative H which cannot be replicated. Our goal will be to
construct a martingale probability Q1 different from Q.
Consider the set of values at time 1 of all portfolios:
41
{ω1 , . . . , ωM }. The subspace W contains the vector 1 = (1, . . . , 1) because if x = 0
1
and y = 1+R then V(x,y) (1) = 1 for all scenarios. Now equip RM with the inner
PM
product hw, viQ = j=1 wj vj Q(ωj ) and use the following version of Lemma 1.56
with the subspace W and the compact convex set A = {H}.
Lemma 1.60
Suppose A ⊂ Rn is convex and compact, and W is a vector subspace of Rn
disjoint from A. Then there is a z ∈ Rn such that hz, aiQ > 0 for all a ∈ A,
and hz, wiQ = 0 for all w ∈ W, where hx, yiQ = di=1 xi yi qi , where qi > 0 for
P
all i = 1, . . . , d, is an inner product in Rn .
42
The final identity follows because Si (1) belongs to W (take xi = 1, xk = 0 for k 6= i,
and y = 0, which gives V(x,y) (1) = Si (1)), so that hZ, Si (1)iQ = 0. This completes
the proof.
Consider a derivative security with payoff H(1) and time 0 price H(0). The notion of an
extended portfolio and extended arbitrage opportunity can be adapted to the general single-
step setting as follows. An extended portfolio is a triple (x, y, z), where x = (x1 , . . . , xd )
represents the positions in the d stocks, y the risk-free position (the money market account
A), and z the number of units of the derivative security H. The values of the extended
portfolio at times 0 and 1 are
d
X
V(x,y,z) (0) = xi Si (0) + yA(0) + zH(0),
i=1
d
X
V(x,y,z) (1) = xi Si (1) + yA(1) + zH(1).
i=1
Definition 1.61
We say that and extended portfolio (x, y, z) is an (extended) arbitrage opportunity
if
We can now apply the fundamental theorems to option pricing in a single-step model in
which there is no extended arbitrage opportunity. Suppose the payoff H(1) can be replicated
by a portfolio (x, y) = (x1 , . . . , xd , y), that is,
It follows that
H(0) = V(x,y) (0)
since otherwise we would have an extended arbitrage opportunity by a similar argument as
in the proof of Theorem 1.10. The expectation with respect to a risk-neutral probability Q
43
(which exists by the first fundamental theorem) gives
where we have used the martingale property for all stock prices. We obtain
1
H(0) = EQ (H(1)).
1+R
If the model is complete, then every derivative security can be replicated, which means that
the price H(0) of every derivative security is given by this formula with expectation taken
under the unique (by the second fundamental theorem) risk-neutral probability Q.
Example 1.62
In a trinomial model, consider a basket option with payoff
Let S1 (0) = S2 (0) = 60 and R = 5%, and let U1 = 20%, M1 = 10%, D1 = −10%
and U2 = −5%, M2 = 15%, D2 = 10%. There is a unique risk-neutral probability
7 1 4
Q = 18 , 6 , 9 , and the option price is H(0) ≈ 65.56.
Finally, we note that the sub- and super-replicating prices coincide in complete models.
Theorem 1.63
Completeness implies that H sub = H super .
Proof
If (x, y) = (x1 , . . . , xd , y) is a replicating portfolio, V(x,y) (1) = H(1), and (x, y) is any
super-replicating portfolio, V(x,y) (1) ≥ H(1), then
44
The No Arbitrage Principle implies
But the replicating price belongs to the set of all super-replicating prices,
Proof
Consider the set
A − W = {a − w : a ∈ A, w ∈ W }.
Since A and W are disjoint, 0 ∈ / A − W.
Next, we note that A − W is convex since both A and W are. In more detail,
if x, y ∈ A − W, x = a − v, y = b − w, then with α ∈ [0, 1], αx + (1 − α)y =
[αa + (1 − α)b] − [αv + (1 − α)w] ∈ A − W. Thus A − W is convex.
Consider the set B = {|x| : x ∈ A − W }. We claim that it is closed. To show this
45
consider a convergent sequence bn ∈ B with bn → b. Then there exists a sequence
xn ∈ A − W such that |xn | = bn , and we can write xn = an − wn for some an ∈ A and
wn ∈ W. The sequence an has a convergent subsequence ank → a since A is compact.
The sequence wnk is bounded since otherwise |ank − wnk | = bnk would go to infinity.
So selecting a further subsequence gives a convergent sequence wnkm → w. Now by
the continuity of x 7→ |x|, we have |a − w| = b, so b ∈ B. Hence B is closed.
Take z to be the element of A − W closest to 0, so that for any other element c
of A − W we have |c| ≥ |z| > 0 (recall that 0 ∈ / A − W ). The existence of z follows
from the fact that the set B = {|x| : x ∈ A − W } is closed, does not contain 0, and
so has the smallest positive element, which is the Euclidean norm of some element of
A − W , which we take as z.
Consider the triangle (in Rn ) with vertices 0, z, c for any c in A − W . We shall
prove that hz, ci ≥ hz, zi = |z|2 > 0. The idea is best explained by a picture, see
Figure 1.2. We take a vector y parallel to c such that z and y − z are perpendicular.
Then
|z| |c|
hz, ci = |z| |c| cos α = |z| |c| = |z|2 ,
|y| |y|
so it is sufficient to show that |c| ≥ |y|. Suppose that |c| < |y|. The line segment
joining c and z is contained in the convex set A−W since both z and c belong to A−W .
But the line intersects the interior of a ball with the centre at 0 and radius |z| . So this
segment contains a point with distance to 0 strictly smaller than |z|, a contradiction
with the definition of z.
This gives hz, a − wi = hz, ai − hz, wi ≥ hz, zi for any a ∈ A and w ∈ W.
However, W is a vector space, so nw ∈ W , hence hz, ai − nhz, wi ≥ hz, zi for
each n, which is impossible unless hz, wi = 0. This, in turn, yields hz, ai ≥ hz, zi > 0,
completing the proof.
y
0 c
46
Lemma 1.60
Suppose A ⊂ Rn is convex and compact, and W is a vector subspace of Rn disjoint
from A. Then there is a z 0 ∈ Rn such that hz 0 , aiQ > 0 for all a ∈ A, and hz 0 , wiQ = 0
for all w ∈ W, where
X n
hx, yiQ = xi yi qi
i=1
Proof
By Lemma 1.56, we have z ∈ Rn with ni=1 zi ai > 0, ni=1 zi wi = 0 for any a =
P P
∈ A and w =P(w1 , . . . , wn ) ∈ W. Take zi0 = zi /qi , which satisfies the
(a1 , . . . , an ) P
claims since i=1 zi0 yi qi = i=1 zi yi for any yi .
47
Chapter 2
Having described single-step pricing models, we are ready to consider models with a finite
number of consecutive trading dates. We shall see that these models behave essentially as a
succession of one-step models, so that our earlier analysis can be applied repeatedly to yield
explicit pricing formulae.
For a simple binomial model we now describe how the use of risk-neutral probabilities
implies the martingale property of discounted stock prices and thus of the discounted value
process of any self-financing strategy. In this model pricing techniques for European options
using replicating portfolios or risk-neutral probabilities are compared and an explicit pricing
formula for the European call is then derived.
48
The probability on Ω will be defined by multiplying probabilities for single coin tosses. De-
noting the probability of an up movement in each single step by p, we have
P (uu) = p2 ,
P (ud) = P (du) = p(1 − p),
P (dd) = (1 − p)2 ,
using the simplified notation P (ω) rather than P ({ω}). Given the probabilities of single-
element sets, we can find the probability of any subset of Ω as
X
P (A) = P (ω).
ω∈A
The value of Kn is assumed to depend only on the nth element of ω and it acts in the same
way as in the single-step binomial model, with K1 determined by the first step, K2 by the
second. So
K1 (uu) = K1 (ud) = U,
K1 (du) = K1 (dd) = D,
K2 (uu) = K2 (du) = U,
K2 (ud) = K2 (dd) = D.
S(1) = S(0)(1 + K1 ),
S(2) = S(1)(1 + K2 ) = S(0)(1 + K1 )(1 + K2 ).
49
We introduce convenient self-explanatory notation where a superscript determines both the
number of steps and the position on the tree:
u
S = S(0)(1 + U ),
S(1) =
S d = S(0)(1 + D),
uu
S = S(0)(1 + U )2 ,
S(2) = S ud = S du = S(0)(1 + U )(1 + D),
dd
S = S(0)(1 + D)2 .
Similar notation will also be used below for other process.
We also assume a risk-free asset, represented by the money market account determined
by riskless return R. We have
Let S(0) = 100, U = 20%, D = −10%, and p = 0.6. Then we have a concrete
recombining (i.e. such that S ud = S du ) tree of stock prices:
S(0) S(1) S(2)
144
0.6
120
0.6
0.4
100 108
0.6
0.4
90
0.4
81
Taking A(0) = 100 and R = 5%, we have A(1) = 105 and A(2) = 110.25.
Trading strategies
Example 2.2
Continuing Example 2.1, suppose we have at our disposal a cash amount of 50 and
wish to invest in the market by buying one share of stock. For the first portfolio the
stock and risk-free positions will be denoted x(1) and y(1) respectively. This is slightly
surprising as one might expect to indicate time by 0 here, that is, the time when this
portfolio is created. However, it is an established notational tradition to label the
portfolio by the number of the time step over which the portfolio is held (step 1 here).
50
Taking x(1) = 1, with V (0) = 50 we have a short risk-free position
The next portfolio is denoted by (xu (2), y u (2)). We decide to reduce our stock position
to 0.8, which determines the risk-free component since we assume no inflow or outflow
of funds, so
67.5 = 0.8 × 120 + y u (2) × 105
to give y u (2) = −0.285. We can find the final values of our investment:
Case 2: ‘down’
In this case the value of the portfolio becomes
With cheap stock we decide to buy more, so we set xd (2) = 1.2. This implies
51
with the self-financing condition
representing the fact that at the rebalancing moment n = 1, when (x(2), y(2)) are selected,
we only use the funds available then, and we use them all. We then say that (x, y) is a
self-financing strategy.
Pricing
As in the single-step model, the basis for pricing is the No Arbitrage Principle. We start with
the definition of an arbitrage strategy in the current context.
Definition 2.3
We say that (x, y) = {(x(n), y(n))}n=1,2 is an arbitrage strategy if
(2) V(x,y) (1) = x(2)S(1) + y(2)A(1) with probability 1 (that is, (x, y) is a self-
financing strategy),
A derivative security of European type has payoff of the form H = h(S(2)) for some
real-valued function h defined on [0, ∞). Such a security is called path-independent. A
path-dependent version has payoff H = h(S(1), S(2)) for some h : [0, ∞) × [0, ∞) → R.
The prices of such a derivative security form a sequence H(n), n = 0, 1, 2, with H(2) = H.
As for the single-step model, we now formulate an extended version of the No Arbitrage
Principle, which allows us to determine these derivative security prices.
First, we need to adapt the concept of extended strategy, by which we shall now
understand a triple (x, y, z), where (x, y) = {(x(n), y(n))}n=1,2 is a two-step strategy as
above describing the positions in stock and the money market, and z ∈ R is a position in the
derivative security H.
52
Remark 2.5
Note that, in contrast to x(n) and y(n), the position z in the derivative security H
does not depend on the time step n. Once a position z in H is created at the time 0,
the same position z is held up to the final time 2. We say that a static position is
held in H.
Definition 2.6
We say that (x, y, z) is an extended arbitrage strategy whenever
1. V(x,y,z) (0) = 0,
Example 2.8
Let S(0) = 100, U = 20%, D = −10%, and R = 5%. Consider a call option with
exercise price K = 100 at time 2. The payoff of the option is
(S(0)(1 + U )2 − K)+ = 44 if ω = uu,
+
C(2, ω) = (S(0)(1 + U )(1 + D) − K) = 8 if ω = ud or ω = du,
(S(0)(1 + D)2 − K)+ = 0 if ω = dd.
53
It is a path-independent derivative security, by which we mean that
We work out the replicating strategy (x, y) by moving backwards in time, following
the scheme depicted in Figure 2.1. Replication means that the value of the replicating
strategy at time 2 should match the option price, V(x,y) (2) = C(2). Thus, in the ‘up’
case we get the system
with solution
xu (2) = 1,
y u (2) = −0.907,
to find
xu (2) = 0.296,
y u (2) = −0.218.
By the self-financing condition for the replicating strategy, we get the values if the
strategy at time 1:
so
x(1) = 0.698,
y(1) = −0.562,
54
resulting in
V (0) = x(1)S(0) + y(1)A(0) = 13.605.
This, then, gives the option price
by the extended No Arbitrage Principle. To see this requires just a moment of reflection.
Suppose that C(0) < 13.605 (bearing in mind the rounding error), in which case we
buy the option and sell the strategy investing the difference risk free. Replication at
the exercise time ensures that the value of the strategy will cancel with the option
payoff and a profit will remain. In case of the opposite inequality, we take the opposite
action.
In a similar manner, we can also get the option prices at time 1 by the extended No
Arbitrage Principle for the single-step subtrees with root at node u and d, respectively:
C u = V u = 24.762,
C d = V d = 3.810.
uu uu uu
u u u
ud ud ud
d d d
dd dd dd
Note that the values of the strategy constructed above form a recombining tree, but
this does not have to be the case if the payoffs of the derivative security for ud and du are
different (i.e. a path-dependent security).
Recall that in the single-step model a convenient alternative was to use risk-neutral prob-
abilities to compute the price of a derivative security. This can readily be done at each step
of the above procedure, but our goal is to give the story extra flavour by developing some
theory, which can be generalised to more general settings.
55
2.2 Partitions and information
For our next topic it is in fact better to extend the time scale to three steps. This gives the
corresponding sample space as
P (uuu) = p3 ,
P (uud) = P (udu) = P (duu) = p2 (1 − p),
P (udd) = P (dud) = P (ddu) = p(1 − p)2 ,
P (ddd) = (1 − p)3 .
We have to modify the definition of K1 , K2 and add another return K3 with value U
for triples ending with u and value D for triples ending with d, so that Kn depends only
on the nth element of ω, just as in the two-step binomial model. However, the notion of
independence is now somewhat more involved, as clarified by the following proposition.
Proposition 2.9
Proof
The verification for each pair of indices is the same as in the two-step case. To verify
the above identity for the triple of indices, note that by definition both sides are 0
unless each xi takes one of the two values U, D. There are 8 possible cases, grouped
according to the number of times U occurs: for U U U we have p3 on the left by the
definition of P, and this equals the RHS. Similarly for U U D, U DU or DU U we obtain
p2 (1 − p) on each side, for U DD, DU D, DDU it is p(1 − p)2 , and for DDD we
obtain (1 − p)3 .
144
0.6
0.4
120 129.6
0.6 0.6
0.4
100 108
0.6
0.4 0.4
90 97.2
0.6
0.4
81
0.4
72.9
Example 2.11
Consider the following random variable
H = (S(3) − 100)+
which is nothing but a call payoff with strike K = 100 and exercise time 3. If we own
such a security, we might be interested in the expected payoff, so we compute
Next we analyse the future possible developments and their impact on our perception. It
is important to emphasise that we are now performing a ‘what if’ analysis. We do not travel
in time, but consider hypothetically all possible future scenarios with the purpose of revising
our point of view and possibly responding with some appropriate action.
Position at step 1
Case ‘up’
Let us prepare ourselves for the case of an increase in the stock price after one step. This
means that the collection of available scenarios will be reduced to the triples beginning with u,
the set of which we denote by Bu = {uuu, uud, udu, udd}. Knowledge that the first step is
‘up’ means that this set will play the role of Ω and a probability is produced by adjusting the
57
original probabilities to ensure that the new probability of Bu is 1. For ω ∈ Bu we put
P (ω)
Pu (ω) = .
P (Bu )
Observe that u is the first element of each ω ∈ Bu and, according to the definition of P, we
have P (Bu ) = p and this is the first factor producing P (ω), so it will cancel. For instance
P (A ∩ Bu )
Pu (A) = = P (A|Bu ),
P (Bu )
The sum in this example is the conditional expectation of H given Bu . We introduce the
following two alternative pieces of notation for the conditional expectation:
X
E(H|Bu ) = E(H|S(1) = 120) = H(ω)Pu (ω) = 40.416.
ω∈Bu
Case ‘down’
In the case of a down move at the first step we introduce, similarly, the set Bd of all still
remaining scenarios (all those beginning with a d) and the adjusted probabilities for ω ∈ Bd :
P (ω)
Pd (ω) = .
P (Bd )
P (A ∩ Bd )
Pd (A) = = P (A|Bd ),
P (Bd )
58
and the expectation of H in this situation is
P1 = {Bu , Bd }.
We put together these two cases defining a random variable with two competing notations,
equivalent since the partition P1 is fully determined by S(1),
40.416 if ω ∈ Bu ,
E(H|P1 )(ω) = E(H|S(1))(ω) =
10.656 if ω ∈ Bd .
Position at step 2
Consider all possible price movements in the first two steps. There are four cases, which
can be described by specifying a partition
Each of these is equipped with a probability defined in a similar way as before, for example,
for any ω ∈ Bud ,
P (ω)
Pud (ω) = .
P (Bud )
Observe the effect of the cancellation mentioned above and notice for example that P (Bud ) =
p(1 − p), so Pud (udu) = p and Pud (udd) = 1 − p, and in each case we have a familiar single-
step binomial tree.
59
As before, we compute the expected value of H in each of these cases, that is, the
conditional expectations
E(H|Buu ) = 55.52,
E(H|Bud ) = 17.76,
E(H|Bdu ) = 17.76,
E(H|Bdd ) = 0,
hence
55.52 if ω ∈ Buu ,
E(H|P2 )(ω) = 17.76 if ω ∈ Bud ∪ Bdu ,
0 if ω ∈ Bdd .
Remark 2.14
We can see that the conditional expectation given Bud is equal to the conditional
expectation given Bdu . This means that we could employ the alternative, more intuitive
notation E(H|S(2)). To this end note that the partition generated by the values of
S(2) is different from the partition related to the history of the movements over the
first two steps, introduced above. The partition generated by S(2) consists of just
three elements: Buu , Bud ∪ Bdu , Bdd , and the random variable E(H|S(2)) takes three
values. The fact that the values E(H|Bud ), E(H|Bdu ) coincide gives the equivalence
of these two approaches in this instance, but this does not have to be the case in
general.
The actual values of S(2) are irrelevant for defining the conditional expectation
since they do not appear in the computations. What matters is the partition related
to these values.
Position at step 3
Given the knowledge of all three steps, we will know the actual payoff. The above analysis
can formally be performed for the partition
P3 = {Buuu , . . . , Bddd }
of Ω into eight single-element parts with no randomness and the obvious conclusion that
E(H|P3 ) = H.
60
expectation of X given B is defined as
X
E(X|B) = X(ω)P (ω|B),
ω∈Ω
1
where P (ω|B) = P (B) P (ω) for ω ∈ B and P (ω|B) = 0 otherwise.
Given a partition P = {Bi }, the above definition applied to each Bi gives a
function defined for each ω in Ω, constant on each Bi ,
E(X|P) : Ω → R
by assigning
Bi 3 ω → E(X|Bi ).
This random variable is called the conditional expectation of X with respect to the
partition {Bi }.
The properties of E(X|P) are discussed in a general setting in the next chapter. Here we
continue the analysis of binomial trees.
Example 2.16
Now we consider the 4-step binomial tree.
Su Sd
EQ (S̃(1)) = q + (1 − q) = S(0).
1+R 1+R
61
In the three-step model the probability Q is defined on subsets of the 8-element probability
space Ω = {uuu, uud, . . . , ddd} in the same way as P was defined by means of p. For
example, Q(udu) = q 2 (1 − q) and then Q({ω : S(1)(ω) = S u }) = Q(Bu ) = q. Taking
a trivial partition P0 = {Ω} and noting that conditional expectation with respect to P0
coincides with the expectation, we have
1
EQ (S̃(1)|P0 ) = EQ (S̃(1)|Ω) = (q S̃ u + (1 − q)S̃ d ) = EQ (S̃(1)) = S(0).
Q(Ω)
EQ (S̃(2)|P1 ) = S̃(1).
The next step is to find EQ (S̃(3)|P2 ). We just consider one of the four cases in detail:
1 1
EQ (S̃(3)|Buu ) = (q 3 S uuu + q 2 (1 − q)S uud )
Q(Buu ) (1 + R)3
1
= (qS uu (1 + U ) + (1 − q)S uu (1 + D)) since Q(Buu ) = q 2
(1 + R)3
1
= S uu (1 + R) since qU + (1 − q)D = R
(1 + R)3
1
= 2
S uu ,
(1 + R)
62
Filtration
Observe that the partitions P0 , P1 , P2 , P3 considered above are such that for each n = 1, 2, 3
and each B ∈ Pn there is an A ∈ Pn−1 satisfying B ⊂ A. This represents the fact that A
carries more information than B. As time passes, we gain more information.
Example 2.17
For example, Bud ⊂ Bu . If we happen to be in Bu , we know that the first step was
‘up’, but have no knowledge of the second (or third) step. However, if we have landed
in Bud , we have more information: we know not only that the first step was ‘up’, but
also that the second step was ‘down’ (but still have no knowledge of the third step).
Definition 2.19
A sequence (M (n))n=0,...,N of random variables defined on Ω is a martingale for the
filtration (Pn )n=0,...,N and probability P if
EP (M (n)|Pn−1 ) = M (n − 1).
We can conclude that the discounted stock prices follow a martingale with respect to Q
for n = 0, 1, 2, 3. Later we shall see that of course this is true for any number of steps.
V (0) − x(1)S(0)
y(1) = ,
A(0)
V(x,y) (1) − x(2)S(1)
y(2) = ,
A(1)
V(x,y) (2) − x(3)S(2)
y(3) = ,
A(2)
63
where we should remember that the last two lines represent equalities between random vari-
ables.
Proposition 2.20
The discounted strategy values Ṽ(x,y) (n) form a martingale with respect to the filtration
P0 , P1 , P2 , P3 and risk-neutral probability Q.
Proof
First
1
EQ (Ṽ(x,y) (1)|P0 ) = q [x(1)S u + y(1)A(1)]
1+R
+ (1 − q) x(1)S d + y(1)A(1)
1
= x(1)[qS(0)(1 + U ) + (1 − q)S(0)(1 + D)]
1+R
+ y(1)A(0)
= x(1)S(0) + y(1)A(0)
= V (0).
having inserted y(2). In addition to the single-step martingale property, the key points
here are that xu (2) is constant on Bu , with the same value in each of the two cases,
and the self-financing property, as captured by the form of the risk-free component.
Clearly, similar analysis can be performed for Bd and in all cases at time 2. So, for
n = 0, 1, 2 we have shown that
64
Corollary 2.21
As an immediate consequence, we have the martingale property of the process of option
prices since they are values of the replicating strategy.
The final property of the discounted value process we shall need is that for n = 0, 1, 2, 3,
so that
1
EQ (Ṽ(x,y) (2)) = [qV u + (1 − q)V d ]
1+R
= EQ (Ṽ(x,y) (1))
= V (0).
65
We introduce the sample space for the N -step model exactly as it was done for n = 2, 3.
Namely, we take Ω = {u, d}N , the set of all sequences of length N consisting of characters
u, d (such sequences will be called paths). In keeping with the above cases, the probability
of a path ω in Ω will be
P (ω) = pk (1 − p)N −k ,
where k is the number of occurrences of u and N − k the number of occurrences of d
N
in ω. Note that there are k such paths, so the probability of the set of such paths is
N
p (1 − p)N −k .
k
k
For each n = 1, . . . , N the binomial splittings induce a partition of Ω into 2n disjoint sets:
Pn = {Buu...u , ..., Bdd...d }, where each set in the partition has an n-tuple subscript consisting
of u’s and d’s, so that in each set all paths ω have their first n entries in common. We can see
(as is obvious in the earlier examples) that for any n = 1, . . . , N , each set in the partition Pn
is a disjoint union of two sets in Pn+1 , just like for N = 3 we have Bu = Buu ∪ Bud . The
sequence of partitions P0 , P1 , . . . , PN is clearly a filtration.
Fix two numbers −1 < D < U to represent the random returns in each step defined by
U with probability p,
Kn =
D with probability 1 − p.
Again, Kn acts only on the nth entry of the sequence defining the path ω. We note that
these returns are independent, in the sense described in the next remark.
Remark 2.22
This is again easy to see from the definition of the Kn (see also Proposition 2.9).
for n = 1, . . . , N. This means, in particular, that the value of S(n, ω) for a path ω ∈ Ω
depends only on the number of ‘up’ and ‘down’ movements (that is, the number of u and d
symbols) in the first n steps of the path ω, but not on the actual order of these movements
(symbols). In this case we say that prices follow a recombinant binomial tree.
In addition to holding stock, we can invest in the money market account, which we take
as a security manufactured by a series of single risk-free deposits, with return R per period,
66
assumed constant throughout. So we have a sequence of numbers, known in advance, with
A(0) given (we shall frequently take A(0) = 1 for simplicity), and
for n = 1, . . . , N. To avoid arbitrage we assume that D < R < U , as we did in the single-step
case.
As in the cases of one, two, or three steps, the risk-neutral probability Q is obtained
by replacing p by
R−D
q=
U −D
in each step.
The key property observed for a three-step model can easily be extended.
Theorem 2.23
The sequence of discounted stock prices in the binomial model is a martingale with
respect to the filtration P0 , P1 , . . . , PN and risk-neutral probability Q.
Proof
The key argument, repeated in each case above, is concerned with examining the
two next step prices at each node and computing their expectation exactly as for
a single step. This can be summarised in the following way: for any B ∈ Pn at
n = 0, 1, 2, . . . , N − 1,
67
Theorem 2.24
In the binomial model the discounted prices of a derivative security with payoff H(N )
are given by the recursive relations
H̃(n − 1) = EQ (H̃(n)|Pn−1 )
for n = 1, 2, . . . , N , that is, the discounted option price process H̃(n) is a martingale
with respect to the filtration P0 , P1 , . . . , PN and risk-neutral probability Q.
In particular, the initial price is given by
Proof
Take any n = 1, . . . , N and consider the single-step submodels at each node at time
n − 1. The formula in Theorem 1.19 applies for each such single step model, and can
be written as
H(n − 1) = EQ (1 + R)−1 H(n)|Pn−1 .
Example 2.25
We can use Theorem 2.24 to price a European call expiring at time 2 in a two-step
binomial model. Let S(0) = 100, U = 10%, D = −10%, and R = 5%, and let
the strike price for the call be K = 90. Then H u (1) = 24.29, H d (1) = 6.43,
H(0) = 18.88. See the Excel file for details.
68
where Y is a random variable giving the number of upward movements in N steps along each
path, which has the binomial distribution
N k
Q{Y = k} = q (1 − q)N −k for k = 0, 1, . . . , N
k
under the risk-neutral probability. Hence
H(0) = EQ (1 + R)−N H(N )
N
−N
X N k
q (1 − q)N −k h S(0)(1 + U )k (1 + D)N −k .
= (1 + R)
k=0
k
In particular, for the European call option with strike price K this summation begins
at the first integer k for which the payoff is non-zero:
N
−N
X N k
H(0) = (1 + R) q (1 − q)N −k (S(0)(1 + U )k (1 + D)N −k − K),
k=l
k
where
l = min{k : S(0)(1 + U )k (1 + D)N −k > K}.
This formula can be written in a more concise form. First, we write it as
N
−N
X N k
H(0) = (1 + R) S(0) q (1 − q)N −k (1 + U )k (1 + D)N −k
k=l
k
N
−N
X N k
− (1 + R) K q (1 − q)N −k .
k=l
k
69
Write
1+U
q1 = q
1+R
so that
1 + R − q − qU (1 − q) + (R − qU )
1 − q1 = =
1+R 1+R
and, since qU + (1 − q)D = R, we have
1+D
1 − q1 = (1 − q) .
1+R
The first term above then takes the form
N
X N k
S(0) q1 (1 − q1 )N −k = S(0)(1 − Fq1 (l − 1))
k=l
k
This is the Cox–Ross–Rubinstein (CRR) formula for the initial price of a European
call option. (Many texts prefer to state this formula in terms of the complementary
binomial distribution Ψ, where
N
X Nk
Ψ(l, N, r) = 1 − Fr (l − 1) = rk (1 − r)N −k
k=l
k
The coefficient at S(0) has a clear meaning: it is the probability that the option will be
exercised, computed by means of the modified probability q1 .
In the second term the exercise price is discounted, which is logical since the exercise
price K applies at time N, but the formula is concerned with the price at time 0. The coef-
ficient at the discounted K gives the martingale probability that the option will be exercised.
Example 2.27
In the Excel file we use the CRR formula to price a call option. Let N = 90, S(0) = 100,
U = 0.06%, D = −0.02%, and R = 0.02%, and let the strike price be K = 100.
Then l = 23 and the CRR formula gives C(0) = 1.7837.
70
2.5 Delta hedging
Example 2.28
Let us put ourselves in a position of an option writer who has sold a call with exercise
time N = 3 for the no-arbitrage price resulting from the binomial model with U =
20%, D = −10%, S(0) = 100 with R = 10%, with an exercise price of K = 105. The
price can easily be computed using the CRR formula: C(0) = 23.31.
We are exposed to some risk. For instance, if the stock keeps going up in each period,
we will have to pay the holder of the option the amount
Of course, if the stock goes down we will keep the amount 23.31 we have cashed when
selling the option.
To make our position safe we embark on hedging, which boils down to replicating
the option by means of a suitable strategy. However, we do not have to construct the
whole tree of replicating portfolios by working backwards from the payoff. This would
be simple in the case considered here, but for large trees, with many time instants, it
could become unduly complicated. Instead, we work step-by-step from the beginning,
following the path of actual stock changes and not bothering to analyse all scenarios.
Our first portfolio requires finding the call prices at time 1, for which we again use
the CRR formula:
33.94
C(1) =
9.04
and then computing the delta coefficient, (that is, the ratio of one-step price changes
in option and stock) which, as we have seen in the single-step case, gives our stock
position x(1) in the replicating portfolio:
33.94 − 9.04
x(1) = = 0.83,
120 − 90
meaning that we have to buy this fraction of a share. This enables us to complete the
portfolio by taking
y(1) = 23.31 − 0.83 × 100 = −59.70,
which means borrowing this amount of money to have zero initial balance.
Now suppose the stock goes up. The new delta based on two option prices at time 2
is
48.55 − 14.91
xu (2) = = 0.93,
144 − 108
which means we have to purchase more stock (where we use our notational convention,
exceptionally adding the time variable for greater clarity). The corresponding money
71
market position is
y ud (3) = −55.45.
At the next time instant the option will be exercised. Suppose the stock goes up. We
have S udu = 129.60. We owe the option holder the amount 129.60 − 105 = 24.60. We
owe the bank (negative money market position) 55.45 × 1.13 = 73. 80. So, altogether
we have to pay 98. 40. We own 0.76 of a share worth 129.60, which we sell and get
0.76 × 129.60 = 98.40 as required to cover the payment.
It is easy to see that by following any other path and performing the required tasks
similarly, the cash balance will be zero after closing all positions irrespectively of the
path, which is no surprise since pricing was based on replication, and here we have just
tested the scheme.
This video shows delta hedging the option for a different scenario.
Remark 2.29
It is worth noting that the stock holding x(1) is not the coefficient at S(0) in the CRR
formula.
Example 2.30
Example 2.31
%, D = −5%, and R = 5%, and consider the arithmetic Asian call option with
72
exercise time 3 and payoff
+
S(1) + S(2) + S(3)
H(4) = −K ,
3
This video shows how to compute the option prices at all nodes of the tree.
This video shows how to hedge the option along a given scenario (delta hedging).
In this video we find the entire replicating strategy (for all scenarios). Here is
the accompanying Excel file.
73
Chapter 3
Building on our experience with binomial models, we now formulate a general market model
with finite sample space and finitely many discrete time steps. Beginning with a single-stock
model, we review the definitions of basic concepts encountered for binomial models, such as
partitions of the sample space, conditional expectations with respect to partitions or random
variables, and filtrations and martingales, and derive some of their properties. We then
describe predictable self-financing trading strategies and formulate the No Arbitrage Principle
for general finite models. Finally, moving on to a multi-stock model, the two Fundamental
Theorems of Asset Pricing, describing the absence of arbitrage and market completeness
in terms of the existence and uniqueness of a risk-neutral probability, are proved for finite
multi-step models.
Definition 3.1
For any non-empty B ⊂ Ω, we define a new probability PB (ω) = P (ω)/P (B) if ω ∈ B
and 0 otherwise. Thus PB (B) = 1, and for any A ⊂ Ω,
P (A ∩ B)
PB (A) = .
P (B)
74
Definition 3.2
As in Definition 2.15, for any random variable X and any non-empty event B ⊂ Ω,
the conditional expectation of X given B is defined by
X
E(X|B) = X(ω)PB (ω).
ω∈Ω
so that
1
E(X|B) = E(1B X). (3.1)
P (B)
Recall Definition 2.13: a partition of Ω is a (finite) family P = {Bi } of pairwise disjoint
subsets of Ω whose union is Ω.
Definition 3.3
Applying the above construction to each Bi gives a random variable E(X|P) : Ω → R
with constant value
1
E(X|Bi ) = E(1Bi X)
P (Bi )
on Bi for each i. We say that E(X|P) is the conditional expectation of X with
respect to the partition P. Gluing these together, we have
k
X 1
E(X|P) = E(1Bi X)1Bi . (3.2)
i=1
P (Bi )
75
Remark 3.4
A partition P can be expanded to a field F by forming all possible (finite!) unions of
elements of P and adding the empty set:
( k )
[
F = {∅} ∪ Bj : B1 , . . . , Bk ∈ P .
j=1
This procedure can be reversed. The minimal non-empty elements of a field F are its
atoms, which together form a partition of Ω, yielding the original field by the above
expansion.
Example 3.5
Consider the probability space
Ω = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9}
Definition 3.6
If X is an arbitrary random variable on Ω and Y is a random variable with possible
values y1 , y2 , . . . , yk such that the sets Bi = {Y = yi } are non-empty for all i, then
the conditional expectation E(X|Y ) of X given Y is the conditional expectation
of X with respect to the partition
PY = {B1 , . . . , Bk },
that is,
E(X|Y ) = E(X|PY )
for i = 1, 2, . . . , k .
76
Remark 3.7
The idea is that we decompose Ω into pairwise disjoint parts Bi on which Y is constant.
It follows that
1
E(X|Y ) = E(X|Bi ) = E(1{Y =yi } X)
P ({Y = yi })
on Bi for each i = 1, 2, . . . , k.
Example 3.8
Let
Ω = {1, 2, 3, 4, 5, 6, 7, 8, 9}
with uniform probability
1
P (ω) = for each ω ∈ Ω,
9
and let X, Y : Ω → R be random variables such that
Example 3.9
The random variable Z : Ω → R defined on the same probability space as in Exam-
ple 3.8 by
Z(ω) = |ω − 5| for each ω ∈ Ω
defines the same partition as the random variable Y in Example 3.8,
PZ = PY .
Therefore
E(X|Y ) = E(X|Z).
77
3.2 Properties of conditional expectation
First, note that taking the conditional expectation is a linear operation.
Proposition 3.10
For any random variables X, Y , any numbers a, b ∈ R and any partition P, we have
Proof
This is evident from the definition and linearity of the usual expectation:
X 1
E(aX + bY |P) = E(1Bi (aX + bY ))1Bi
i
P (B i )
X 1 X 1
= a E(1Bi X)1Bi + b E(1Bi Y )1Bi
i
P (B i ) i
P (Bi )
= aE(X|P) + bE(Y |P).
where P = {Bi } .
Proof
The video illustrates the idea behind this proof with a concrete choice of a
probability space Ω, random variable X, and partition P.
78
With P = {Bi }, we have
X
E(E(X|P)) = E(X|Bi )P (Bi ) definition of expectation
i
X 1
= E(X1Bi )P (Bi ) definition of conditional expectation
i
P (Bi )
!
X
= E X 1Bi additivity of expectation
i
X
= E(X) since P is a partition, so 1Bn = 1,
n
as required.
Remark 3.12
This result can be interpreted as follows: the average of averages is the same as the
overall average.
First consider an example using the random variables S(1) and S(2) specified for the
stock prices in the binomial model. The partition P1 defined by S(1) consists of two
sets Bu , Bd , and the partition P2 determined by S(1) and S(2) of four Buu , Bud ,
Bdu , Bdd (as defined in the previous chapter). Notice that Bu = Buu ∪ Bud and
Bd = Bdu ∪ Bdd . We say that P2 is finer than P1 . In practice this means that S(2)
together with S(1) carry more information than just S(1).
Definition 3.14
Given two partitions P and P0 , we say P0 is finer than P if each element of P can be
represented as a union of some elements of P0 . Equivalently, we say that P is coarser
than P0 .
Consider partitions P and P0 such that P0 is finer than P. Then the following formula
holds
E(E(X|P0 )|P) = E(X|P)
This is known as the tower property or the iterated expectations property.
79
Proof
Let P = {Bi } and P0 = {Cj } . Since P0 is finer than P, for each i, j we have either
Cj ⊂ Bi , in which case 1Cj 1Bi = 1Cj , or Cj ∩ Bi = ∅, in which case 1Cj 1Bi = 0. It
follows that for each i
!
X 1
E (E(X|P0 )1Bi ) = E E(1Cj X)1Cj 1Bi
j
P (Cj )
X 1
= E E(1Cj X)1Cj
j:C ⊂B
P (C j )
j i
X 1
= E(1Cj X)E 1Cj
j:Cj ⊂Bi
P (Cj )
X X
= E(1Cj X) = E 1Cj X = E(1Bi X).
j:Cj ⊂Bi j:Cj ⊂Bi
The above calculation resembles (and extends) the proof of Proposition 3.11. It follows
that
X 1
E(E(X|P0 )|P) = E(E(X|P0 )1Bi )1Bi
i
P (Bi )
X 1
= E(1Bi X)1Bi = E(X|P).
i
P (Bi )
Proposition 3.16
Suppose that P is a finer partition than the partition PY defined by a random variable Y .
Then
E(Y X|P) = Y E(X|P).
Remark 3.17
This property is often referred to as ‘taking out what is known’. The partition P
being a finer than PY means that if we know which event in P has occurred, then we
know the value of Y .
80
Proof
For any event Bi in P the random variable Y is constant on Bi . Hence E(Y X1Bi )1Bi =
E(X1Bi )Y 1Bi , so
X 1
E(Y X|P) = E(Y X1Bi )1Bi
i
P (Bi )
X 1
= Y E(X1Bi )1Bi = Y E(X|P).
i
P (B i )
Proposition 3.18
If X and Y are independent random variables, then
Proof
Let PY = {Bi } be the partition generated by Y . Then the random variables X and 1Bi
are independent, so E(X1Bi ) = E(X)E(1Bi ) for each i. It follows that
X 1 X 1
E(X|Y ) = E(X1Bi )1Bi = E(X)E(1Bi )1Bi
i
P (Bi ) i
P (Bi )
X 1 X
= E(X) E(1Bi )1Bi = E(X) 1Bi = E(X),
i
P (Bi ) i
Consider two random variables X and Y , where X takes values {x1 , . . . , xk } and Y
takes values {y1 , . . . , yl }. First construct the partition PX defined by X, then decom-
pose each element Bi = {X = xi } according to the values taken by Y on Bi :
l
[
Bi = Bij , Bij = Bi ∩ {Y = yj }.
j=1
81
We can proceed in this way, introducing more random variables. Let X1 , . . . , XN
be a sequence of random variables and denote by Pk the partition defined by
X1 , . . . , Xk .f or any k = 1, . . . , N . We also take P0 = {Ω} to be the trivial par-
tition. Thus, we have a sequence of partitions P0 , P1 , . . . , PN with Pk finer than Pk−1
for each k = 1, . . . , N .
The tower property has the following consequence: for any j, k = 0, 1, . . . , N
such that j < k,
E(E(X|Pk )|Pj ) = E(X|Pj )
so if we denote Y (i) = E(X|Pi ) for i = 0, . . . , N , then
This motivates the general definition a martingale with respect of a filtration. In applica-
tions the role of a martingale will usually be played by the stock prices S(n) or by the value
process V (n) of a given trading strategy. In our applications the time horizon N will remain
finite.
Definition 3.20
A sequence of partitions P0 , . . . , PN of Ω such that Pk is finer than Pk−1 for each
k = 1, . . . , N is called a filtration.
Definition 3.21
A sequence M (0), . . . , M (N ) of random variables is a martingale with respect to a
filtration P0 , . . . , PN if
E(M (k)|Pj ) = M (j)
for any j, k = 0, 1, . . . , N such that j < k.
In terms of ‘information’ carried by a filtration, this property can be read as follows: given
some (partial) information, the expected value of some quantity (price) in the future is the
value of the quantity at that time. In other words, the values M (n) encapsulate all the
predictive power of the information available at time n.
If M (n) is a martingale with respect to Pn then Pn is a finer partition than PM (n)
since M (n) is the conditional expectation with respect to Pn . We then say that M (n) is
Pn -measurable.
82
Proposition 3.22
Proof
This follows from the tower property. Let m > n. Then
after m − n steps.
Proposition 3.23
Let X be a random variable and let P0 , . . . , PN be a filtration. Then the sequence
is a martingale.
Proof
For the sequence E(X|Pn ) to be a martingale, it must satisfy the condition
for n < m, which holds by the tower property because Pm is a finer partition than Pn .
Proof
If M (0), M (1), . . . , M (N ) is a martingale, then M (n) = E(M (n+1)|Pn ) for each n =
0, . . . , N − 1. Applying expectation to both sides of this equality, we have
E(M (n)) = E(E(M (n + 1)|Pn )) = E(M (n + 1))
83
for each n, as required.
Example 3.25
This video shows an example where the reverse implication does not hold, hence
E(M (0)) = E(M (1)) = · · · = E(M (N )) does not necessarily mean that
M (0), M (1), . . . , M (N ) is a martingale.
Definition 3.26
We say that a sequence M (0), M (1), . . . , M (N ) of random variables is a submartin-
gale (respectively, a supermartingale) with respect to a filtration P0 , P1 , . . . , PN if
the partition of M (n) is coarser than Pn and
for each n = 0, 1, . . . , N − 1.
The condition that the partition of M (n) should be coarser than Pn , i.e. that M (n)
is measurable with respect to Pn , simply means that M (n) is known at time n. The
partition Pn represents our knowledge at time n, and if Pn is known, then M (n) is known.
In defining martingales we did not need to state explicitly the condition that the partition
of M (n) should be coarser than Pn because it follows directly from the identity E(M (n +
1)|Pn ) = M (n).
Definition 3.27
We can also define an increasing sequence of fields (Fn )n≤N , again called a filtration,
where F0 = {∅, Ω} and for n ≥ 1, the field Fn is generated by the first n elements
of the sequence, that is, by M (1), . . . , M (n). The field generated by a single random
variable X is of the form FX = {X −1 (B) : B ⊂ R, B a Borel set}, and we take Fn
to be the smallest field containing FM (k) for all k = 1, . . . , n. In this way we ensure
that (M (n))n≤N is adapted to (Fn )n≤N , which means that for all n, M (n) is Fn -
measurable.
In applications, where the sequence of random variables in question is often the sequence
{S(n) : n = 0, 1, . . . , N } of stock prices, it will be convenient to assume that FN = F. The
latter is 2Ω , so to ensure equality we have to reduce Ω if the atoms of FN are not single-
element sets by picking one element in each to form the new sample space, again denoted
by Ω for simplicity.
84
3.4 Trading strategies and arbitrage
We return to the case when the martingales are related to stock prices. Assume that
S(0), S(1), . . . , S(N ) is a sequence of stock prices for all trading dates in some pricing
model with time horizon N. By a (trading) strategy we mean a sequence of portfolios. At
each step our decision on how to adjust the previous portfolio depends on the information
available at this time. We are ready for the general definitions.
Definition 3.28
A (trading) strategy is a sequence of pairs of random variables showing, for n =
1, 2, . . . , N, the number of shares x(n) of stock and the number y(n) of money market
account units chosen at time n − 1 and held until time n, and such that (x(n), y(n)) is
Fn−1 -measurable, for each n = 1, 2, . . . , N . In general, a sequence of random variables
satisfying this measurability requirement is called predictable.
Definition 3.29
The value process of a strategy is a sequence defined for n = 1, . . . , N by
The next definition summarises the requirement that the funds available at a given time
are used for building the next portfolio, that is, no inflows or outflows of funds are allowed.
Definition 3.30
A strategy (x, y) is self-financing whenever
for all n = 1, . . . , N − 1.
for n = 1, . . . , N. We can write the self-financing condition in the following equivalent form,
which will be useful later.
85
Proposition 3.31
A strategy is self-financing if and only if
for each n = 1, . . . , N .
Proof
This is immediate since the strategy is self-financing if an only if V(x,y) (n − 1) =
x(n)S(n − 1) + y(n)A(n − 1) for each n = 2, . . . , N .
Given the initial wealth, all that matters are the decisions concerning the number of shares
held, with the money market position following as a consequence of self-financing.
Proposition 3.32
Proof
Given V (0) and x(1), let
1
y(1) = (V (0) − x(1)S(0)).
A(0)
Then, for each n = 1, . . . , N − 1, let
1
y(n + 1) = y(n) + (x(n) − x(n + 1))S(n).
A(n)
Observe that y(n + 1) is Fn -measurable since all the ingredients on the right are.
Moreover,
86
Example 3.33
In a 3-step binomial three model with
and
S(0) = 100, A(0) = 1
take the predictable process
and let
V (0) = 10.5.
This video shows how to construct the unique predictable process y(1), y(2), y(3)
such that (x, y) is a self-financing strategy with
Next, we focus on a key property of the discounted values of a strategy. For the multi-
step binomial model, we described in the previous chapter how to construct a probability Q
on the sample space Ω built for that model such that the discounted stock prices form a
Q-martingale. This provides the basis for the assumption made in the following theorem.
In the next section we shall show that this assumption is equivalent to the No Arbitrage
Principle for general finite models.
Theorem 3.34
Suppose that there is a probability Q such that EQ (S̃(n + 1)|Fn ) = S̃(n) for all
n = 0, 1, . . . , N − 1. Then the discounted values of a self-financing strategy form a
martingale with respect to Q.
Proof
By definition,
1
Ve(x,y) (n) = V(x,y) (n).
A(n)
87
Since S(n)
e = S(n)/A(n) is a martingale, that is, EQ (S(n
e + 1)|Fn ) = S(n)
e for each
n = 0, 1, . . . , N − 1, we can proceed as follows:
Let us emphasise that this result does not use any particular model of stock prices. We only
used the martingale property of the discounted stock prices together with the self-financing
and predictability conditions.
We conclude the section by giving a general multi-step version of the No Arbitrage Prin-
ciple. First we define what is meant by arbitrage opportunities in our multi-step single-stock
model based on a finite sample space Ω.
Definition 3.35
A predictable strategy (x(n), y(n)), n = 0, 1, . . . , N, is an arbitrage opportunity if
(4) there is an n = 1, . . . , N such that V(x,y) (n) > 0 with positive probability.
88
Remark 3.37
As we have seen before, for extended markets the definition of arbitrage strategy has
to be modified to include positions in the relevant derivative securities.
The next proposition provides an apparently weaker, but in fact equivalent, version of the
notion of arbitrage, which is useful in some applications.
Proposition 3.38
Proof
This video illustrates the main idea of the proof in the simple case when N = 2.
Suppose we have a strategy (x, y) satisfying the assumptions of the proposition, and
suppose that it does not satisfy Definition 3.35, so its value is negative, V(x,y) (n) < 0
on B for some n = 1, . . . , N and some B ∈ Pn . Suppose that n is the largest
integer with this property. Clearly, n < N by condition (3) in the Proposition. In any
scenario ω ∈ B the strategy (x, y) generates some profit at time n + 1 since the value
becomes non-negative after being negative at n. We shall exploit this to construct an
arbitrage opportunity in the sense of Definition 3.35.
We construct a new strategy (x∗ , y ∗ ) as follows. We begin by constructing x∗ .
We do not invest in stock on any trading date other than n:
At time n we employ the stock position in original strategy for any scenario in B and
do not invest in stock in scenarios outside B:
∗ 0 on Ω \ B.
x (n + 1) =
x(n + 1) on B.
89
Clearly, the process x∗ (k), k = 1, . . . , N, is predictable. We know (Proposition 3.32)
that there is a unique predictable sequence y ∗ (k), k = 1, . . . , N, such that (x∗ , y ∗ ) is
self-financing and V(x∗ ,y∗ ) (0) = V(x,y) (0) = 0. We have
y ∗ (1) = Ṽ(x,y) (0) − x(1)S̃(0)
and
y ∗ (k) = y ∗ (k − 1) + (x∗ (k − 1) − x∗ (k))S̃(k − 1) for each k = 2, . . . , N.
It follows that
y ∗ (k) = 0 for each k = 1, . . . , n.
Next,
y ∗ (n + 1) = 0 on Ω \ B
and
y ∗ (n + 1) = y ∗ (n) + (x∗ (n) − x∗ (n + 1))S̃(n)
= −x(n + 1)S̃(n) on B,
so
V(x∗ ,y∗ ) (n + 1) = 0 on Ω \ B
and
Ṽ(x∗ ,y∗ ) (n + 1) = x∗ (n + 1)S̃(n + 1) + y ∗ (n + 1)
= x(n + 1)S̃(n + 1) − x(n + 1)S̃(n)
= Ṽ(x,y) (n + 1) − y(n + 1) − x(n + 1)S̃(n) on B.
From the self-financing condition we have
y(n + 1) + x(n + 1)S̃(n) = V(x,y) (n).
Inserting this in the above expression, we further obtain
Ṽ(x∗ ,y∗ ) (n + 1) = Ṽ(x,y) (n + 1) − Ṽ(x,y) (n) > 0 on B
since V(x,y) (n + 1) ≥ 0 and V(x,y) (n) < 0 on B.
Next, for each k = n + 2, . . . , N , we have x∗ (k) = 0, that is, all wealth in the
strategy is in the money market account at these time steps. This implies that
Ṽ(x∗ ,y∗ ) (N ) = Ṽ(x∗ ,y∗ ) (n + 1).
Since we have shown that Ṽ(x∗ ,y∗ ) (n − 1) ≥ 0 with probability 1 and Ṽ(x∗ ,y∗ ) (n − 1) > 0
with positive probability, this proves that (x∗ , y ∗ ) is an arbitrage opportunity (in the
sense of Definition 3.35). By the No Arbitrage Principle, this is impossible, completing
the proof.
90
3.5 General multi-step model
The multi-step models we have discussed so far all involve a market with a single underlying
stock. We now analyse derivative securities defined on an underlying market containing several
risky assets (stocks) and a single money market account. As we saw for trinomial models,
where market completeness requires two underlying stocks, pricing and hedging derivatives in
such a model proceeds much like in the single-stock case, and it is often useful to treat the
vector of stock prices as a single entity. With this proviso, much of theory we have developed
so far will generalise to this more general setting quite simply, and we begin by reviewing the
main concepts.
We still assume that time runs in a discrete way, just as in the binomial model. With h
being the length of the time step, we use n = 0, 1, . . . , N to indicate the time instant nh.
The basic market consists of the money market account and d risky assets (stocks) with prices
S1 (n), . . . , Sd (n), which comprise the components of the Rd -valued random variable S(n)
and are discrete strictly positive random variables defined on the finite sample space Ω =
{ω1 , . . . , ωM }. The dynamics of the single risk-free asset (the money market account) is again
determined by the risk-free rate R, which is assumed constant, so A(n) = A(0)(1 + R)n . A
portfolio is a vector (x, y) = (x1 , . . . , xd , y) with coordinates representing positions in the
corresponding securities. A strategy is a sequence of portfolios
(x(n), y(n)) = (x1 (n), . . . , xd (n), y(n)), n = 1, . . . , N.
These are Rd+1 -valued random vectors, except for the initial portfolio, which is determined
at time 0, so is a deterministic vector. We shall write (x, y) for this sequence.
Later, this market will be extended by adding derivative securities, and the portfolio
vectors will contain additional coordinates zk , reflecting the positions in these derivatives.
The value of a strategy at time n is
d
X
V(x,y) (n) = xj (n)Sj (n) + y(n)A(n), for n = 1, 2, . . . , N,
j=1
d
X
V(x,y) (0) = xj (1)Sj (0) + y(1)A(0).
j=1
We shall again assume throughout that the strategies we consider are predictable, so
the random vectors (x(n + 1), y(n + 1)) are Fn -measurable. This condition reflects the
convention that the portfolio at time n + 1 is constructed at time n using the funds and the
information about stock prices available at that time.
The key ideas developed for the single-stock model also apply in this more general model.
A strategy is self-financing if
d
X
V(x,y) (n) = xj (n + 1)Sj (n) + y(n + 1)A(n) for n = 0, 1, . . . , N − 1.
j=1
91
We state the version of the No Arbitrage Principle which applies in the model as follows.
Definition 3.39
A predictable strategy (x, y) is an arbitrage opportunity in the underlying market if
its value process satisfies
(4) there is an n = 1, . . . , N such that V(x,y) (n) > 0 with positive probability.
For the market extended by adding some derivative securities, the definition is similar;
cf. Definitions 1.8 and 1.61.
Assumption 3.40 (No Arbitrage Principle)
As a consequence, we prove the following property, known as the law of one price.
Theorem 3.41 (Law of One Price)
H(N ) = H 0 (N ),
H(n) = H 0 (n).
Proof
This is routine: if for one ω0 , at some time n0 we have H(n0 , ω0 ) < H 0 (n0 , ω0 ), say,
then we do nothing in scenarios other than ω0 , and in scenario ω0 do nothing up to
time n0 , and then take the position xH (n0 , ω0 ) = 1, xH 0 (n0 , ω0 ) = −1 in the two
securities, and y(n0 , ω0 ) = H 0 (n0 , ω0 ) − H(n0 , ω0 ) in the money market to obtain an
arbitrage opportunity.
In particular, put-call parity can be obtained as a consequence of the law of one price.
92
Proposition 3.42 (Put-call parity)
For any n = 0, 1, . . . , N, we have the following relation between the call and put values
Proof
Since the call and put payoffs are
It follows that
C(N ) − P (N ) = S(N ) − K.
The law of one price implies that
Remark 3.43
Note that put-call parity does not require using any particular model and can be justified
directly by the No Arbitrage Principle via the Law of One Price.
In the First Fundamental Theorem below we shall prove that the No Arbitrage Principle is
equivalent to the existence of a martingale probability, that is, a probability Q with Q(ω) > 0
for all ω in Ω and such that for each j = 1, . . . , d, the discounted stock price process
{Sej (n) : n = 0, 1, . . . , N } is a Q-martingale.
The following definition is both intuitive and useful.
Definition 3.44
The gains process G(x,y) = (G(x,y) (n))n ≤ N generated by the strategy (x, y) is defined
by
G(x,y) (0) = 0,
n X
X d
G(x,y) (n) = xj (k)∆Sj (k) + y(k)∆A(k) for n = 1, . . . , N.
k=1 j=1
En equivalent condition for self financing can be formulated in terms of the gains process.
93
Proposition 3.45
for each n = 1, . . . , N.
Proof
The proof is virtually the same as that of Proposition 3.31.
Remark 3.46
Observe that
G(x,y) (n) = V(x,y) (n) − V(x,y) (0)
for any self-financing strategy (x, y).
In self-financing predictable strategies the risk-free positions are determined by the stock
positions and the initial value of the strategy, as already observed in the case of a single
stock, see Proposition 3.32.
Theorem 3.47
Given a predictable sequence x(n) = (x1 (n), . . . , xd (n)), n = 1, . . . , N, and initial
wealth V (0), there is a unique predictable sequence y(n), n = 1, . . . , N, such that
(x, y) is a self-financing strategy and V(x,y) (0) = V (0).
Proof
This is proved by induction. First, let
which ensures that V(x,y) (0) = V (0). Thus y(1) is non-random, as is necessary for
y(n) to be predictable. We have therefore determined V(x,y) (1) uniquely,
and the initial induction step is complete. Next, for the induction step, suppose that
Fn−1 -measurable y(n) has already been constructed for some n = 1, . . . , N − 1. This
gives
V(x,y) (n) = x1 (n)S1 (n) + · · · + xd (n)Sd (n) + y(n)A(n),
94
which is Fn -measurable (since so are S1 (n), . . . , Sd (n)). We define y(n + 1) in such a
way that the self-financing condition is guaranteed:
Definition 3.48
The discounted gains process will be denoted by
n X
X d n
X
G
ex (n) = xj (i)∆Sej (i) = hx(i), ∆S(i)i,
e
i=1 j=1 i=1
where we use the notation ha, bi = dj=1 aj bj for the usual inner product of vectors
P
95
Proof
This is an immediate consequence of Proposition 3.45.
Definition 3.50
A strictly positive adapted process is called a numeraire.
Example 3.51
A numeraire can be regarded as the price process of a benchmark security against which
the price changes of other securities are compared.
Proposition 3.52
Given an adapted sequence Z(n) > 0 for n = 0, 1, . . . , N (that is, a numeraire), with
(x, y) and (S, A) as above, the strategy (x,y) is self-financing for the price process
(S, A) if and only if it is self-financing for the price process (ZS1 , . . . , ZSd , ZA).
Proof
Write V(x,y) (n) = hx(n), S(n)i , where, by abuse of notation, we write
S = (S1 , ..., Sd , A) and x = (x1 , ..., xd , y) as vectors in Rd+1 . Dropping the subscript
(x, y) for simplicity, the self-financing condition becomes
96
The last proposition means that the self-financing property of trading strategies is invari-
ant under a change of numeraire. It also shows that the converse of Theorem 3.49 holds.
Indeed, if a strategy is self-financing for the discounted prices, then it is also self-financing for
the original prices since the two price processes differ by multiplication by a strictly positive
adapted process (in fact, a deterministic one).
To discuss martingale properties in general we introduce a filtration of fields Fn as before.
Then we have the familiar definition.
Definition 3.53
A probability Q is called risk-neutral (or a martingale probability) if the discounted
stock prices are martingales
We shall prove that under a risk-neutral probability Ve(x,y) (n) is a martingale as well, thus
generalising Theorem 3.34. To this end we formulate the appropriate theorem using general
notation since it can be used in various applications.
Theorem 3.54
If Mj (n) are martingales and Hj (n), j = 1, . . . , d, are predictable processes, then the
process X(n) defined by
X d
n X n
X
X(n) = X(0) + Hj (k)∆Mj (k) = X(0) + hH(k), ∆M(k)i,
k=1 j=1 k=1
This right-hand side of the above equation is sometimes called a martingale transform
or discrete stochastic integral.
Proof
First note that
d
X
X(n + 1) − X(n) = Hj (n + 1)∆Mj (n + 1)
j=1
d
X
= Hj (n + 1) [Mj (n + 1) − Mj (n)] .
j=1
97
The theorem is formulated for any probability space, hence we use the general symbol
for expectation. We take the conditional expectation of the above equality to get
E(X(n + 1) − X(n)|Fn )
X d
= E Hj (n + 1) [Mj (n + 1) − Mj (n)] Fn
j=1
d
X
= E Hj (n + 1) [Mj (n + 1) − Mj (n)] Fn
j=1
d
X
= Hj (n + 1)E(Mj (n + 1) − Mj (n)|Fn )
j=1
= 0,
where we have used the property of conditional expectations referred to as ‘taking out
what is known’. The process H is predictable, so the value at time n + 1 is measurable
with respect to Fn . The final result E(X(n + 1) − X(n)|Fn ) = 0 proves that X is a
martingale.
Proof
Fix an k = 0, . . . , N − 1, take any A ∈ Fk , and put
1A for n = k + 1,
H(n) =
0 otherwise.
98
By assumption,
N
X
E H(n)∆M (n) = E(1A ∆M (k + 1))
n=1
= E(1A [M (k + 1) − M (k)]) = 0.
path dependent if H(N ) = h(S(0), S(1), . . . , S(N )), where the market model
has time set T = 0, 1, . . . , N , and h : R(N +1)d → R.
Recall that the sample space Ω = {ω1 , . . . , ωM } is finite, we assume that P is defined
on F = 2Ω , and that we take P (ω) > 0 for all ω. In this setting, to ensure the equivalence
99
P ∼ Q, any risk-neutral probability must have Q(ω) > 0 for all ω. It is worth mentioning
that Q is defined on the field 2Ω due to the standing assumption that FN = F.
Proof
Assume first the existence of a risk-neutral probability Q. Take any predictable strategy
(x, y). By Theorem 3.54, its discounted value process Ve(x,y) is a martingale under Q.
Suppose that V(x,y) (0) = 0 and V(x,y) (n) ≥ 0 for each n, which are necessary conditions
for an arbitrage strategy. Then by the martingale property of Ve(x,y) under Q, for each n
we have
EQ (Ve(x,y) (n)) = EQ (Ve(x,y) (0)) = EQ (V(x,y) (0)) = 0,
so after multiplying both sides by (1 + R)n
X
EQ (V(x,y) (n)) = Q(ω)V(x,y) (n, ω) = 0.
ω∈Ω
Since all terms in the sum are non-negative and Q(ω) > 0 for each ω ∈ Ω, it follows
that V(x,y) (n, ω) = 0 for each ω ∈ Ω, and this shows that it is impossible to construct
an arbitrage opportunity.
For the converse implication, assume the absence of arbitrage. Let x =
{(x1 (n), . . . , xd (n)) : n = 1, . . . , N } be a predictable sequence. The probability space
Ω = {ω1 , . . . , ωM } is finite, so a random variable can be regarded as an element of
the Euclidean vector space RM . This means that the discounted gains process G ex has
the form
Gex (n) = (G ex (n, ωM )) ∈ RM
ex (n, ω1 ), . . . , G
for each n, and
ex (N ) : x is predictable} ⊂ RM
W = {G
is a vector subspace of RM since it is clearly closed under linear combinations.
We claim that if G ex (N, ω) ≥ 0 for all ω, then G ex (N, ω) = 0 for all ω. Take
V (0) = 0 and complete x with y(n) so that (x, y) is a self-financing predictable
strategy such that Ve(x,y) (0) = V (0) (see Theorem 3.47). By Theorem 3.49, Ve(x,y) (n) =
V (0) + Gex (n), and so G ex (N, ω) = 0 for all ω, or else there would be an arbitrage
opportunity by Proposition 3.38.
100
To conclude the proof of the theorem, recall that we are working under the No
Arbitrage Principle and that we have identified W as a vector subspace of RM . We
can apply the Separation Lemma (Lemma 1.56), which says that if A ⊂ RM is convex
and compact and W is a vector subspace of RM disjoint from A, then there exists a
z = (z1 , . . . , zM ) ∈ RM such that hz, ai > 0 for all a ∈ A, and hz, wi = 0 for all
w ∈ W (with h·, ·i denoting the Euclidean inner product in RM ). For our application
we take
n M
X o
A = a = (q1 , . . . , qM ) : qi = 1, qi ≥ 0
i=1
so
M
X
Q(ωi )G
ex (N, ωi ) = EQ (G
ex (N )) = 0.
i=1
But the stock holdings x were taken as an arbitrary predictable (vector-valued) process,
and we can fix j = 1, . . . , d and take x(n) = (0, . . . , 0, xj (n), 0 . . . , 0) for each n =
1, . . . , N, where xj (n) is a real-valued predictable process. Then the above identity
reduces to X N
EQ xj (n)∆Sj (n) = 0.
e
n=1
By Theorem 3.55, the Sej (n) are martingales for j = 1, . . . d since the xj (n) are arbitrary
predictable processes.
101
Remark 3.59
The First Fundamental Theorem describes the ‘economic’ no-arbitrage axiom in purely
mathematical terms, namely the existence of an equivalent martingale measure. Our
proof contains the crucial additional assumption that the sample space Ω has finitely
many elements (and we can therefore give each of them positive probabilities). This
naturally begs the question whether this restriction can be removed. In discrete-time
models this does turn out to be the case, although the proof uses considerably more
advanced concepts in functional analysis, which are beyond our scope (see R. J. Elliott
and P. E. Kopp, Mathematics of Financial Markets, Springer 2005).
Even in the relatively simple setting of a finite market model, the above theorem does not
tell us whether or not the risk-neutral probability Q is unique. In fact, we have already seen
in the one-step trinomial model that this need not be the case. Our next result characterises
the pricing models with unique risk-neutral probability.
The basic market of underlying securities consists of d stocks and the risk-free asset.
The extended market includes some specified additional derivative securities.
Definition 3.61
A derivative security can be replicated if there exists a self-financing predictable strat-
egy (x, y) such that
H(N ) = V(x,y) (N ).
Definition 3.62
The market is complete if every derivative security can be replicated.
102
Theorem 3.63
An arbitrage-free market model is complete if and only if there is exactly one risk-neutral
probability Q ∼ P.
Proof
This proof is very similar to that in the single-period case. As in the First Fundamental
Theorem we shall make crucial use of the finiteness of Ω.
First, assume the No Arbitrage Principle and completeness. A risk-neutral prob-
ability exists by the First Fundamental Theorem, so all we have to do is to prove
uniqueness. Suppose we have two risk-neutral probabilities Q1 6= Q2 . Fix A in FN and
consider a derivative security with payoff
H(N ) = 1A .
By completeness, there exists a self-financing predictable strategy (x, y) such that
V(x,y) (N ) = 1A .
Taking expectations with respect to each risk-neutral measure, we can see that
EQi (V(x,y) (N )) = EQi (1A ) = Qi (A) i = 1, 2.
But the discounted values of the strategy form a martingale under each Qi , so the
expectation is constant in time
(1 + R)−N EQi (V(x,y) (N )) = EQi (Ve(x,y) (N )) = V(x,y) (0).
The right-hand side does not depend on i, so Q1 (A) = Q2 (A) (both are equal to
V(x,y) (0)(1 + R)N ). Hence Q1 = Q2 , as A ∈ FN was arbitrary.
For the converse implication, assume that there exists a unique risk-neutral proba-
bility Q ∼ P. By the First Fundamental Theorem, there are no arbitrage opportunities.
To prove completeness, suppose that a certain derivative security with payoff H(N )
cannot be replicated, and show that this allows us to construct another risk-neutral
probability, which would be a contradiction. Consider the set of real random variables
defined on Ω by
W = {V(x,y) (N ) : (x, y) is self-financing and predictable}.
This is a vector subspace of RM with H(N ) ∈ / W. Investing risk free, we can obtain a
deterministic final value so 1 = (1, . . . , 1) ∈ W. Equip RM with the inner product
M
X
hz, wiQ = zi wi Q(ωi ).
i=1
103
By Lemma 1.60 with A = {H(N )}, there exists a z ∈ RM such that hz, H(N )iQ > 0
and hz, wiQ = 0 for each w ∈ W , which means that z 6= 0 and hz, 1iQ = 0. Write
a = maxi=1,...,M |zi | > 0, and define a new probability
zi
Q1 (ωi ) = 1 + Q(ωi ) for i = 1, . . . , M.
2a
We have proved (see the proof of the single-period Second Fundamental Theorem)
that Q1 is a probability equivalent to P (since Q1 (ωi ) > 0 for each i = 1, . . . , M ) and
Q1 6= Q.
It remains to show that Q1 is a risk-neutral probability. To this end take a
predictable sequence x(n), and use it to create a self-financing strategy (x,y) with
initial value V (0) = 0 (see Theorem 3.47). We use the fact that the discounted values
of the strategy form a Q-martingale, so the expectation is constant in time:
M
X zi e
EQ1 (Ve(x,y) (N )) = Q(ωi )(1 + )V(x,y) (N )
i=1
2a
M M
X 1 X
= Q(ωi )Ve(x,y) (N ) + zi V(x,y) (N )Q(ωi )
i=1
2a(1 + R)N i=1
1
= EQ (Ve(x,y) (N )) + hz, V(x,y) (N )iQ
2a(1 + R)N
1
= V (0) + hz, V(x,y) (N )iQ
2a(1 + R)N
= 0,
Example 3.64
We take the single-step trinomial model with
U = 30%, M = 20%, D = −10%
and
R = 10%.
104
This video constructs several risk neutral probabilities in this setting, as well as
an example of a derivative security which cannot be replicated.
for each n = 0, 1, . . . , N .
Proof
To see this, note that if the equality failed at any time, an arbitrage opportunity could
be constructed. The investment would be started at that time, the option sold and
the portfolio bought (or vice versa). The profit would be maintained since at the final
moment the option payoff is identical to the portfolio value.
Example 3.66
In Example 3.64 we have seen that the derivative security with payoff
10 in the u scenario
H(1) = 0 in the m scenario
10 in the d scenario
cannot be replicated.
The video shows another way to verify this, based on Theorem 3.65.
Then, employing the fact that the stock prices and also the values of strategies are
105
martingales with respect to the unique risk-neutral probability, we obtain
so
1
H(n) = EQ (H(N )|Fn ).
(1 + R)N −n
In particular (for n = 0),
H(0) = (1 + R)−N EQ (H(N )).
Forward contracts
One of the simplest examples of a derivative security is a forward contract. This contract
is a binding agreement between two parties to buy or sell the underlying asset at a given
future time N for a specified delivery price K. The party committed to buying the asset holds
a long position, and the party committed to selling the asset holds a short position. The
payoff at time N for the long position is
Hlf (N ) = S(N ) − K
and brings profit if S(N ) > K. For the short position the payoff is
Hsf (N ) = K − S(N ).
H
e lf (n) = EQ (H e ) − K|F
e lf (N )|Fn ) = EQ (S(N e n ) = S(n)
e − K(1 + R)−N .
When discounting is removed, we obtain a formula for the value of a forward contract at
time n. However, this formula can be proved without referring to any model.
106
Proposition 3.67
The No Arbitrage Principle implies that the time n price of a long forward contract
with delivery price K is
Proof
We build a strategy (x, y) such that V(x,y) (N ) = S(N ) − K by taking x(n) = 1,
1
y(n) = − A(0) K(1 + R)−N for each n = 0, . . . , N (this is a static strategy, that is,
the positions are the same for each n). Hence, by the No Arbitrage Principle, at each
time n we must have Hlf (n) = V(x,y) (n). This means that
Hlf (n) = V(x,y) (n) = S(n) + y(n)A(n) = S(n) − K(1 + R)−(N −n) .
Definition 3.68
We define the forward price F (n, N ) as the delivery price that gives zero value to a
contract entered into at time n, with delivery at time N.
Corollary 3.69
No-arbitrage implies that the forward price at time n = 0, . . . , N is
Proof
From H(n) = 0 we get the result at once.
Example 3.70
Let
S(0) = 100
R = 5%
and consider a forward contract exchanged at time 0 with delivery time
N = 10.
Suppose that the long position holder would like to close it at time n = 6.
The video shows how closing the long forward position can be archived.
107
Futures
In addition to stock prices, markets also quote so-called futures prices. The principal difference
between them and forwards is that futures are quoted on an exchange, whereas forwards are
traded ‘over-the-counter’ (OTC), by mutual agreement between the two parties concerned.
The exchange seeks to minimise the risk of default, and investors are required to hold a
margin account with the exchange, which must be adjusted daily to reflect the fluctuations
in futures prices.
We restrict our attention to discrete-time situations.
Definition 3.71
Use f (n, N ) to denote the futures price at time n = 0, 1, . . . , N − 1 for delivery
at time N of an underlying asset S(n). This provides a series of derivative securities
called futures defined by means of the following conditions:
f (N, N ) = S(N ).
f (n, N ) − f (n − 1, N ), n = 1, . . . , N.
f (n − 1, N ) − f (n, N ), n = 1, . . . , N.
The cash flows, called marking-to-market, are credited/debited to the margin ac-
count.
Theorem 3.72
If the interest rate is constant, then
f (0, N ) = F (0, N ).
Proof
We claim that the payoff S(N ) − K of a long forward contract with K = f (0, N ) and
delivery time N can be replicated at zero initial cost in a market with futures contracts.
108
If this is so, then K must be equal to the forward price F (0, N ) or else an arbitrage
opportunity would arise. This means that f (0, N ) = F (0, N ).
For N = 1 the claim is obvious. Let us verify it for N = 2. Consider a portfolio
consisting of the following two futures contracts:
1
1+R
long futures initiated at time 0,
R
1+R
long futures initiated at time 1.
It means that this portfolio of futures replicates the payoff S(2) − f (0, 2), proving the
claim for N = 2.
It remains to extend the above argument to verify the claim for an arbitrary
N > 0. In this case, consider the following portfolio consisting of N positions in
futures contracts:
1
(1+R)N −1
long futures initiated at time 0,
R
(1+R)N −1
long futures initiated at time 1,
R
(1+R)N −2
long futures initiated at time 2,
.. ..
. .
R
(1+R)2
long futures initiated at time N − 2,
R
1+R
long futures initiated at time N − 1.
It is left to the reader to verify that the cash flow of this portfolio invested risk free up
to time N does indeed replicate the payoff S(N ) − f (0, N ).
Remark 3.73
The key point in this argument is that the future interest rate R is known in advance,
which allows us to specify the size of the futures positions initiated at times 0, 1, . . . , n.
109
Example 3.74
The video illustrates the proof of Theorem 3.72 by replicating a forward contract
with futures in this scenario.
Example 3.76
A bottom straddle consists of long positions in both a call and a put with common
strike K and expiry date N. This is a simple example of a combination, using options
of different types on the same underlying. We buy a call and a put, both with strike
K and expiry N. The payoff is |S(N ) − K|, against an initial outlay of C(0) + P (0).
Thus the holder will lose money if the price of the stock stays close to K but can
gain substantially if the final price differs significantly from K. This investment thus
110
suits an investor who expects large price changes, by taking options with strike close
to S(0).
Path-dependent options
Here we provide a brief glimpse of the variety of derivatives that can be constructed from the
basic building blocks we have analysed
Recall, that a European derivative security H is path-dependent if
This video shows pricing and replicating an down-and-in call. Here is the ac-
companying Excel file.
111
Example 3.78
Lookback options come in several guises, using the maximum Mlb or minimum mlb of
the underlying stock price over a preset ’lookback period’. We have the floating strike
variety, where the strike becomes a random variable. Thus the lookback call allows
one to buy the stock at the minimum value achieved in the lookback period, hence
the payoff is (S(N ) − mlb )+ . For the put the holder uses the maximum instead, with
payoff (Mlb − S(N ))+ . There are also fixed strike versions, with payoffs (Mlb − K)+
and (K − mlb )+ , and even partial lookbacks, using a preset fraction λMlb or λmlb
instead. Clearly, all these are designed to ‘eliminate regret’ over lost opportunities, and
their premia will reflect this.
Example 3.79
The Asian call option, in an N -step model has payoff H(N ) = (A(N ) − K)+ ,
where A(N ) is either the arithmetic average N1 N
P
QN i=1 S(i) or the geometric average
1/n
[ i=1 S(i)] of the stock prices, together with a fixed strike K. Clearly, these are
path-dependent options. They are usually traded as of European type, to avoid early
exercise occurring before the averaging period is completed, thus losing the protection
it can provide.
To see why one may wish to use such options in certain situations, consider a
fixed strike K = 1.5, N = 6 and A(N ) as the arithmetic average. Suppose that the
price evolution of the underlying S is given by
n 1 2 3 4 5 6
S(n) 1.5 1.6 1.7 1.8 1.6 1.4
Then H(6) = 0.1, whereas for a European call C(6) = 0. The Asian option may be
preferred by a company which regularly buys some asset and wants to hedge against
high prices, for example, to guard against price spikes near the trading horizon. It can
buy a series of vanilla call options with different maturities or a single Asian call on the
average price. The latter is usually satisfactory from the point of view of securing the
company’s business.
Pricing formulae have been extended to cover many other exotic options, including
choosers, where the option holder can decide at an intermediate time 0 < N1 < N whether
he wishes to treat his option as a put or a call, and compound options, where the underlying
is not a stock but an option with earlier expiry.
112
Chapter 4
American options
As we recall, a European option confers the right to a random payoff at a prescribed time N
in the future. By contrast, an American derivative security allows the holder to exercise the
corresponding right at any time n ≤ N , so for an American call we can obtain the payoff
(S(n) − K)+ on demand at any time n ≤ N. When exercised, such an option is terminated.
The payoff depends on the values of S(n) for all n up to the moment of exercise, hence a
representation of the payoff similar to that of the European case is not possible in general.
Even in the simple binomial setting, a direct analogue of the CRR formula is not possible
as the payoff depends on the holder’s choice of the exercise date. This requires so-called
stopping times to describe the optimal exercise time for the option.
We discuss optimal stopping times in a general discrete-time model with finitely many
trading dates, where we assume that we are given a finite probability space (Ω, F, P ) equipped
with a filtration of fields Fn (see Definition 3.27) generated by a process S(n) representing
the underlying security prices. We illustrate the resulting pricing techniques for single-stock
models numerically throughout in the binomial three model where, as usual, the space Ω is
taken in the form {u, d}N , that is, an element ω is an N -tuple of symbols u, d corresponding
to the returns U, D of the price of the underlying.
4.1 Pricing
Consider the binomial model and assume that the option holder’s choice of exercise date is
made in order to maximise the amount received. At each time she faces the choice between
exercising immediately and postponing this till later. The sum of money given by the payoff
can readily be seen at each time, in each scenario, being a known function of the current
stock price. Valuing the alternative, that is, the choice not to exercise poses a problem and
depends on assumptions about the future behaviour of the stock. This makes it natural to
seek to solve the pricing problem by means of backward induction, while taking into account
the additional choice of whether to exercise or not at each node of the binomial tree. The
method is best illustrated through an example which is sufficiently complex to reveal the
113
various possible scenarios while remaining very simple computationally. Thus, all notions and
theorems in this chapter will be illustrated by a concrete example of a single-stock model on
a 5-step binomial tree.
Example 4.1
We begin with the specification of the dynamics of the underlying security. The initial
price is S(0) = 100, the returns are U = 15%, D = −10%, so the tree of the prices
is as shown in Figure 4.1.
Consider a put option with expiry date N = 5 and exercise price K = 100. Since
the American option can be exercised at any time prior to the expiry, it is necessary to
compute the immediate payoff of the option at each node of the tree. We denote it
by I(n) = (K − S(n))+ . The results are shown in Figure 4.2.
These calculations can be found in the Excel file.
Pricing will be performed in a similar fashion as for European claims, starting from the
expiry time and moving backwards. The value of the American put at time n will be denoted
by P (n), so
P (5) = I(5).
114
Example 4.2
We continue Example 4.1, and our first serious goal is to compute the prices at time
n = 4. At this time the holder of the option has a choice between exercising and
waiting till the final moment.
The decision about ‘waiting’ now depends on the value of the European option
with exercise date one step later. This can easily be computed by means of replication,
or, equivalently, by using the risk-neutral probability q. Suppose the risk-free return is
R = 5%, so q = UR−D −D
= 0.6. At each state at time n = 4 we compute the discounted
expected value of the payoff available at time 5, obtaining the numbers as in Figure 4.3.
In the two lowest states at time n = 4 it is better to exercise immediately rather
than wait and the values of the option are the corresponding payoffs (of course we mean
here the positive numbers, zeros are irrelevant from this point of view). In the middle
state ‘waiting’ has positive value while the immediate payoff is zero. Figure 4.4 shows
the results. The highlighted umbers are those where immediate exercise prevailed.
See the Excel file for details.
We can summarise the above in a concise way. The value of the European put at time 4
115
is the random variable
1 1
EQ (I(5)|F4 ) = EQ (P (5)|F4 ).
1+R 1+R
Therefore, taking the benefit of immediate exercise, if profitable,
1
P (4) = max I(4), EQ (P (5)|F4 ) .
1+R
It is convenient to divide both sides by (1 + R)4 , which will lead to the discounted prices
satisfying n o
P (4) = max I(4), EQ (P (5)|F4 ) .
e e e
The prices P (4) computed above are the true market values of the American put, encapsulat-
ing the rights belonging to the holder: exercise immediately or wait and keep the option. As
such they represent the possible cash flow at time 4, since the option can be sold. Therefore
they form a basis for the next (backward) induction step, moving from time 4 to 3. Applying
the same argument to this step, we obtain
n o
Pe(3) = max I(3),
e EQ (Pe(4)|F3 ) .
Example 4.3
We continue Example 4.2. The numerical results for P (3) are shown in Figure 4.5.
Note that in the DDU state the exercise payoff is 6.85, which is lower than the value
of ‘waiting’.
See the Excel file.
In general we have shown how to develop the following backwards induction procedure,
which has the form
Pe(5) = I(5),
e
n o
Pe(n − 1) = max I(n e − 1), EQ (Pe(n)|Fn−1 )
116
for the discounted values.
Example 4.4
This allows us to complete Example 4.3. The resulting in the numbers given in Fig-
ure 4.6, where in particular we find the time 0 price of the American put in question:
P (0) = 4.51. All numbers here are rounded to two decimal points.
See the Excel file for the full computation details.
Y (N ) = Z(N ),
Y (n − 1) = max{Z(n − 1), E(Y (n)|Fn−1 )}.
The notion is defined in a general setting, but in applications we shall use it for binomial
trees, with expectation with respect to the risk-neutral probability.
So, as we have seen, Y (n) = Pe(n) is the Snell envelope of Z(n) = I(n),e n = 1, . . . , 5
with respect to the risk-neutral probability Q and the filtration generated by the partitions of
the probability space of scenarios in a binomial tree.
Directly from the definition, we can see that Y (n − 1) is Fn−1 -measurable (that is, Y is
adapted), and
Y (n − 1) ≥ E(Y (n)|Fn−1 ),
(these two conditions by definition mean that Y is a supermartingale), as well as
Y (n − 1) ≥ Z(n − 1),
117
so the Snell envelope Y of Z is a supermartingale dominating Z. This turns out to be the
key to characterising Snell envelopes.
Theorem 4.6
The Snell envelope Y of Z is the smallest supermartingale dominating Z. In other
words, if T (n) is a supermartingale satisfying T (n) ≥ Z(n), then Y (n) ≤ T (n).
Proof
We have already observed that Y is a supermartingale and that it dominates Z. Given
any supermartingale T with T (n) ≥ Z(n) for all n, we have, in particular,
T (N ) ≥ Z(N ) = Y (N ).
We shall prove by backward induction that this property holds for all n. Suppose, for
the induction hypothesis, that this is true for some k, that is, T (k) ≥ Y (k). This
immediately implies
E(T (k)|Fk−1 ) ≥ E(Y (k)|Fk−1 ).
The sequence T is a supermartingale, so
T (k − 1) ≥ E(T (k)|Fk−1 ).
T (k − 1) ≥ E(Y (k)|Fk−1 ).
(the last equality is the definition of the Snell envelope), which completes the induction
step and so the proof.
Y (n) = Pe(n)
Z(n) = I(n).
e
118
Example 4.7
In the Excel file the values of Z(n) and Y (n) are computed for the American put in
our 5-step numerical example (see Example 4.4).
Proposition 4.8
The prices CA (n) of an American call and CE (n) a European call on the same un-
derlying S(n) and with the same delivery date N and strike price K are equal to one
another,
CA (n) = CE (n)
for each n = 0, . . . , N .
Remark 4.9
Here we consider an underlying asset that pays no dividends. The equality between
American and European call prices would, in general, fail otherwise.
Proof
Observe that
1 +
CE (n) = EQ (S(N ) − K)
(1 + R)N −n
1
≥ N −n
(EQ (S(N )) − K)+
(1 + R)
+
K
= S(n) −
(1 + R)N −n
≥ (S(n) − K)+
CA (N ) = CE (N ) = (S(N ) − K)+ .
Suppose that
CA (n) = CE (n)
119
for some n = 1, . . . , N . Then
1
+
CA (n − 1) = max (S(n − 1) − K) , EQ (CA (n))
1+R
+ 1
= max (S(n − 1) − K) , EQ (CE (n))
1+R
max (S(n − 1) − K)+ , CE (n − 1)
=
= CE (n − 1),
Continuing Example 4.7, suppose we bought the American put for P (0) = 4.51. The
problem we are facing at all times is the decision whether to exercise this option or
not. (Refer to Figure 4.6 for the option prices computed at each stage.)
Consider first the strategy of exercising at the earliest possible time when the
option price is equal to the available payoff.
Of course, we do not exercise at time 0 since the payoff is 0. Let Bu (resp. Bd ) be
the set of all paths beginning with u (resp. d), and similarly define Buu , Bud , Bdu , Bdd ,
and so on.
Suppose the stock goes down in the first step, that is, consider ω ∈ Bd . We
exercise the option at time 1, cashing 10, which, as we saw, is higher than the
expected profit from waiting.
Suppose the stock goes up in the first step, so let ω ∈ Bu . Here we distinguish
three cases:
We have defined a random variable τ assigning to each ω the time when we exercise
120
the option:
1 if ω ∈ Bd ,
τ (ω) = 4 if ω ∈ Buddd ,
5 otherwise.
The condition means that the decision to stop at time n will be based on the information
available at that moment. This is even clearer in the following equivalent formulation.
Proposition 4.12
Proof
Suppose the condition holds for τ and fix n ≤ N. Since Fk ⊂ Fn for all k ≤ n, it
follows that n
[
{ω : τ (ω) ≤ n} = {ω : τ (ω) = k} ∈ Fn .
k=1
since Fk ⊂ Fn when k ≤ n.
Example 4.13
Going back to Example 4.10, we can define a natural modification of the process of
the option values related to early exercise. These values fluctuate with time when
we observe them along a given scenario. For example, if ω = udddu, we have the
following sequence of prices P (n, ω), n = 0, 1, 2, 3, 4, 5 :
In this scenario our strategy tells us to exercise at time 4. Imagine that we keep the
121
money we have cashed, so the sequence is modified to become
For this particular ω, we have τ (ω) = 4, and we’ve left the sequence unchanged for
n ≤ 4, replacing the subsequent values by the value at time 4, so for n ≥ 4 we have
P (n, ω) = P (τ (ω), ω).
In particular, Xτ (N ) = X(τ ). The stopped process Xτ inherits all the important proper-
ties of X.
Proposition 4.15
Proof
First we observe that
n
X
Xτ (n) = X(0) + 1{j≤τ } (X(j) − X(j − 1)) (4.1)
j=1
for each n = 0, 1, . . . , N .
The sequence 1{j≤τ } is predictable since
{j ≤ τ } = Ω \ {j > τ } = Ω \ {τ ≤ j − 1} ∈ Fj−1 .
Hence Xτ is adapted since all the terms on the right in (4.1) are Fn -measurable. So
if X is a martingale, then Xτ is a martingale transform, so Xτ is also a martingale by
Proposition 3.54.
122
Next, again using the representation of Xτ (n) given in (4.1), we compute the
conditional expectation of the increment:
where we used the ‘take out what is known’ property of the conditional expectation
(1{n+1≤τ } is Fn -measurable). Therefore the random variables E(Xτ (n+1)−Xτ (n)|Fn )
and E(X(n + 1) − X(n)|Fn ) have the same sign: they are either both greater than
or equal to zero, or both less than or equal to zero. Hence if X is a supermartingale,
that is,
E(X(n + 1) − X(n)|Fn ) = E(X(n + 1)|Fn ) − X(n) ≤ 0
then Xτ is a supermartingale.
Remark 4.16
A submartingale is an Fn -adapted sequence satisfying E(X(n + 1)|Fn ) ≥ X(n) for
each n. The above proof also shows that if X is a submartingale, then the stopped
process has the same property.
We seek a criterion for optimality of the exercising strategy. Since the sum of money
generated by such strategies can be random (in our special strategy the decision and the
outcome depends on ω), their comparison is difficult. Random variables are functions, and
functions are rarely comparable. For this reason we need to associate a single number with
each exercising strategy.
A natural candidate for an optimality criterion is to maximise the mathematical expecta-
tion of the payoff obtained at the exercise time. If the moment at which we exercise is denoted
by τ, the money received in a particular scenario ω is the payoff I(τ (ω)). These amounts of
money emerge at different time instants, so we should discount them to make them compa-
rable (or, equivalently, invest them in the money market account to hold until time N ). We
choose the risk-neutral probability Q for the purpose of computing the expectation.
Example 4.17
Returning to Example 4.13, to find the expected value of all discounted payments,
note that we receive 10 for ω ∈ Bd , 16.17 for ω ∈ Buddd , and 3.59 for ω = uuddd,
ω = ududd, or ω = uddud, so that
1 10 16.17 3.59
EQ τ I(τ ) = (1−q) +q(1−q)3 4
+3q 2 (1−q)3 = 4.51,
(1 + R) 1+R (1 + R) (1 + R)5
(4.2)
which, remarkably, is the money we paid for the option. (We show the results up to
123
two decimal points, so the equality is approximate. But we shall prove that the equality
is exact, in general.)
As we shall see, this is the best we can get. Such a claim could be verified numerically
in our example since we have to deal with a finite number of possible strategies, but we will
prove it in general.
First we give a formal definition of an optimal exercise strategy.
Definition 4.18
A stopping time τ is optimal if
Analysing exercising strategies in general, consider again the discounted payoff Z(n) =
I(n) of the American put option. The Snell envelope Y of Z gives the value of our option
e
at each trading date. If it satisfies Y (n) > Z(n), then it is not reasonable to exercise at
time n, since we would be replacing a security worth Y (n) by a smaller amount Z(n). So,
a sensible exercise time τ should satisfy the condition Y (τ ) = Z(τ ). If such an opportunity
emerges, then from the economic point of view we should grab it as soon as possible due to
the time value of money. Below we prove that this intuition gives a correct solution.
Proposition 4.19
The random variable τmin defined by
Proof
First note that τmin is a stopping time since
n−1
\
{τmin = n} = {Y (k) > Z(k)} ∩ {Y (n) = Z(n)}
k=0
and each of the sets on the right belongs to Fn since the random variables involved
are suitably measurable. For instance, for k < n,
124
We will show that the stopped Snell envelope Yτmin is a martingale. Using the
representation (4.1) of the stopped process, we have
If {τmin ≥ n + 1} then Y (n) > Z(n). By the definition of the Snell envelope,
We insert this on the right-hand side above and compute the conditional expectation
the initial value being non-random. In particular, E(Yτmin (N )) = Yτmin (0), but by the
definition of τmin , we have E(Yτmin (N )) = E(Zτmin (N )). Putting all this together, we
get Y (0) = E(Zτmin (N )) as claimed in (4.4).
Finally, we show that τmin is optimal. Take any stopping time ν. We know that Yν
is a supermartingale, so the expectation decreases with time, hence Y (0) = Yν (0) ≥
E(Yν (N )). By the definition of the stopped process, E(Yν (N )) = E(Y (ν)), which is
greater than or equal to E(Z(ν)) by the definition of the Snell envelope. So
But (4.4) holds, so Y (0) is equal to one of the expectations on the right. Therefore,
we must have equality, that is, (4.3) holds, which completes the proof.
Remark 4.20
Optimality of the early exercise strategy follows from simple economic considerations. If
the option holder chooses not to exercise when the payoff is higher than the expectation
of the future possibilities, then she effectively chooses an asset which is worth less than
the payoff, so suffers a loss (which is the profit of the other party, the seller of the
option). This will be seen clearly in the hedging argument presented below.
Putting together (4.4) and optimality of τmin (which means that (4.3) holds), we have
125
the following important conclusion concerning the price of the option:
(Y (0) is the price P (0) if we consider the Snell envelope of a sequence of put option payoffs).
Analysing the optimal exercise strategy in our leading example, path by path, we can see
that before we exercise, the prices follow a martingale. After we exercise (i.e. stop), the
sequence becomes constant, so it is obviously a martingale. Thus we have found a simple
and elegant criterion for optimality, however with little practical meaning since verification of
the martingale condition of the stopped process is not straightforward.
Theorem 4.21
A stopping time τ is optimal if and only if Z(τ ) = Y (τ ) and Yτ (n) is a martingale.
Proof
Suppose that Z(τ ) = Y (τ ) and Yτ (n) is a martingale, which implies (by the constant
expectation property) that
the last equality holding by the definition of the stopped process. The last equality
holds because Y (τ ) = Z(τ ) by our hypothesis. Hence Formula (4.5) means that
E(Z(τ )) = max{E(Z(ν) : ν is a stopping time}, so τ is optimal as claimed.
Conversely, if τ is optimal, then by (4.3) and (4.5) we have Y (0) = E(Z(τ )). By
the definition of the Snell envelope, we have E(Z(τ )) ≤ E(Y (τ )). But we know that
Yτ is a supermartingale, so E(Y (τ )) = E(Yτ (N )) ≤ Y (0). Putting these together, we
get
Y (0) = E(Z(τ )) ≤ E(Y (τ )) = E(Yτ (N )) ≤ Y (0),
so E(Z(τ )) = E(Y (τ )). In addition, we know that Z(τ ) ≤ Y (τ ), so Z(τ ) = Y (τ ).
It remains to prove that Yτ is a martingale. To this end it is sufficient to show
that Yτ (n) = E(Y (τ )|Fn ). We know that Yτ is a supermartingale, hence Yτ (n) ≥
E(Yτ (N )|Fn ) = E(Y (τ )|Fn ). Then to show that the random variables Yτ (n) and
E(Y (τ )|Fn ) are equal, it is sufficient to show that their expectations are equal:
The right-hand side is E(Y (τ )) by the tower property The left-hand side is the same,
126
as the following chain of inequalities shows:
Example 4.22
In the Excel file we check each single-step subtree for the martingale property of the
discounted price process Y (y) in our 5-step numerical example (see Example 4.17).
The stopped process Yτ is computed in the Excel file along each path in the 5-step
binomial tree (in the file we use the notation 1 = u, 0 = d, the yellow fields indicate
the payoff at the stopping time τ , blue fields indicate the nodes where Y satisfies the
martingale property, and purple fields correspond to nodes such that τ ≤ n, where the
Yτ (n) = Y (τ ).
Example 4.23
In the binomial model with
consider an American put with expiry time 4 and strike price K = 100.
The video shows how to price the option and compute the optimal exercise
time τmin . Here is the Excel file used in the video.
4.3 Hedging
Example 4.24
We keep working within the framework of our 5-step binomial example; see Exam-
ple 4.22. Suppose we have written and sold the American put, where S(0) = 100,
U = 15%, D = −10%, K = 100, R = 5%. Having cashed the price of 4.51, we are
exposed to some risk.
127
To make our position secure we can construct a replicating strategy based on
taking a position in the underlying and completing the portfolio with a position in the
money market account. Our initial position x(1) in stock is determined by the delta
(see Definition 1.15), computed as
Pu − Pd 1.23 − 10 ∼
x(1) = u d
= = −0.350644,
S −S 115 − 90
where superscripts denote the prices resulting from the up and down movements of the
stock in the first step. This number is negative, which means short selling the stock.
The money market cash position y(1) is then
and the value of this portfolio agrees with the option price,
An interesting path is uddd. Here at time 4 we have the payoff 16.17 with the
value of waiting for the final, time 5, payoff being 11.40, so again the option holder
128
exercises. Our position allows us to settle similarly as before: we have (see the Excel
file)
66.13 × (1 + 5%) − 0.64 × 84 = 16.17.
By inserting 0 in J37, the same amount is provided by the book-keeping formula in
Q37.
Example 4.25
In this video we continue Example 4.23 to compute the hedging strategy. The
Excel file used in the video is included.
for n = 1, 2, . . . , N .
129
Proof
To find A(1), which, as an F0 -measurable random variable, must be a constant, we
take the expectation on both sides of the target relation:
to find
we take the conditional expectation of this target identity with respect to F1 , recalling
that A is to be predictable, to get
This was an educated guess based on the desired properties, and it is time for a
rigorous argument. First note that A is increasing since the supermartingale property
of Y gives
E(Y (n) − Y (n − 1)|Fn−1 ) ≤ 0.
130
Next, we use induction to prove predictability: A(1) is constant, so F0 -measurable,
and if A(n − 1) is Fn−2 -measurable, it automatically is Fn−1 -measurable. With the
second term on the right of (4.6) also being Fn−1 -measurable, we find that A(n) is
Fn−1 -measurable.
It remains to see that M is a martingale:
E(M (n)|Fn−1 )
= E([A(n) + Y (n)]|Fn−1 )
= A(n) + E(Y (n)|Fn−1 )
= A(n − 1) − E([Y (n) − Y (n − 1)]|Fn−1 ) + E(Y (n)|Fn−1 )
= A(n − 1) + Y (n − 1)
= M (n − 1).
For uniqueness, suppose on the contrary that we have another such decomposition,
with Y (n) = M 0 (n) − A0 (n), so that
Then
M (n) − M 0 (n) = A(n) − A0 (n),
and note that for n = 0 we have
since
A(0) = A0 (0) = 0.
Next, take the conditional expectation on both sides with respect to Fn−1 :
which gives
M (n − 1) − M 0 (n − 1) = A(n) − A0 (n).
So, the equality M (n − 1) = M 0 (n − 1) implies thatA(n) = A0 (n), which yields
M (n) = M 0 (n). By induction on n, this proves uniqueness.
We can see that the role of A is to add to Y what is needed to obtain a martingale
(A(n) ≥ 0 since it starts from 0 and is non-decreasing), which justifies the commonly used
term compensator: the increasing process A ‘makes up for’ the lack of the martingale
131
property in Y.
Corollary 4.27
If X is a submartingale, then it can be written uniquely as
Proof
It is sufficient to note that Y = −X is a supermartingale. The Doob decomposition
of Y gives Y = M − A, so −Y = −M + A, and so −M is the desired martingale for
the decomposition of X.
Going back to the hedging problem, in the notation for the put option we consider the
Doob decomposition of the sequence of discounted option prices Y (n) = Pe(n), so
for some martingale M and increasing predictable A. In the binomial model (or in general, in
any complete model) we can replicate any Fn -measurable random variable given at any time n.
We choose to replicate M (N ), that means we find a self-financing strategy (x(n), y(n)), n =
1, . . . , N, such that
Ve(x.y) (N ) = M (N ).
When writing and selling the option, we receive the price P (0) = Y (0) = V(x,y) (0), so we
can start such a hedging strategy. The process Ve(x,y) (n) is a martingale, so
This implies
Pe(n) = Ve(x,y) (n) − A(n).
The value of our strategy for all n makes our position secure since the value of the strategy
dominates the option payoff at all times
The holder of the option may exercise if I(n) > 0, but this is not sufficient since if P (n) >
I(n), then he is better off by selling the option.
If P (n) = I(n) and A(n) = 0, the inequality Ṽ(x,y) (n) ≥ I(n) ˜ is then an equality and
the financial result for the writer is zero if the option is exercised.
If A(n) > 0, the inequality becomes strict, Ṽ(x,y) (n) > I(n), ˜ which means additional
profit, equal to the value of the compensator, for the writer if the option is exercised.
132
Therefore the following exercise strategy is of interest: we do not exercise as long as Y is
a martingale. If A becomes non-zero, the process Y ceases to be a martingale, so we should
exercise just before that instant. At a given trading date we are able to tell whether A is
going to become positive at the next step since A is predictable. Of course, if A never gets
positive, being zero up to time N , the exercise time has to be the trading horizon:
N if A(N ) = 0,
τmax =
min{n : A(n + 1) 6= 0} otherwise.
As it turns out, this exercise strategy is optimal, but not very relevant from the practical
point of view.
Theorem 4.28
The stopping time τmax is optimal.
Proof
First, directly from the definition, τmax is a stopping time since the set {τmax = n} is
determined by A(0), A(1), . . . , A(n + 1), which are Fn -measurable.
Next, again by the definition, A(τmax ) = 0, so Yτmax (n) = Mτmax (n), and Yτmax is
a martingale.
According to Theorem 4.21, to complete the proof it is sufficient to show that
Y (τmax ) = Z(τmax ). Fix an arbitrary ω and denote by j the time instant τmax (ω).
Insert the Doob decomposition into the definition of the Snell envelope
In practical terms, the irrelevance of τmax follows from its comparison with a much simpler
strategy based on τmin . For exercise to be truly profitable the payoff should be strictly greater
than the value of waiting. In this case, if τmin (ω) = n, then
133
and, moreover, Y (k) > Z(k) for all k ≤ n − 1, which implies Y (k) = E(Y (k + 1)|Fk ).
Consequently,
This gives τmax (ω) = n and so here, with strict inequalities above, both stopping times
coincide: τmax = τmin . This also proves that τmax ≥ τmin .
Example 4.29
See the Excel file for a numerical example, in the same setting as Example 4.24.
Example 4.30
Continuing Example 4.25, this video shows how to construct the martingale and
compensator by a different method than presented above. The optima stopping
time τmax is also computed. Here is the Excel file used in the video.
134