Decision Theory Notes
Decision Theory Notes
CHAPTER
C OMPARING D ECISIONS U NDER
U NCERTAINTY AND R ISK
1.1 Introduction
In this chapter we try to compare different decisions when the outcomes of
the decisions are not completely under the decision maker’s control. In partic-
ular, we consider situations in which the environment of the decision maker
is indifferent to the decision taken by the decision maker. So, the material that
we consider in this chapter does not naturally hold when the decision maker’s
decision is being scrutinized by a competitor, who will take a decision which
will maximize the competitor’s benefit, possibly to the detriment of the origi-
nal decision maker.
Consider for example, a furniture manufacturer who manufactures wood-
en tables and chairs, and whose objective is to maximize the contribution to
profits from these two products. Tables require 3 m2 of wood and 3 labor-
hours to produce and earn a contribution of Rs.1000 per unit. Chairs on the
other hand require 2m2 of wood and 1 labor-hour to produce and earn a con-
tribution of Rs.600 per unit. The manufacturer has a supplier of wood, who
promises to supplies her up to 2000 m2 of wood per month on demand at a
cost of Rs.150 per m2 . In addition to wood, she requires other material, such
as paint, varnish, and fabric to create the tables and chairs. These are abun-
dantly available as long as she orders an adequate quantity well in advance.
She has 10 people working for her on fixed salary, each of whom puts in eight
1
2 CHAPTER 1. COMPARING DECISIONS
hours of labor each working day. She assumes a total of 20 working days in a
month.
If the supplier does indeed supply her with 2000 m2 of wood each month,
and the people turn up for work each day, then she can use a linear program-
ming model to optimize her product mix. This mix is to produce 400 tables
and 400 chairs each month and earns a contribution of Rs.6,30,000. If anyone
suggests an alternate product mix, of say 500 tables and 100 chairs, she can
convince herself that her product mix is better simply by computing the con-
tribution from the other product mix and observing that it is lower than that
for her product mix.
In a realistic scenario, the manufacturer must face uncertainties that are
not resolved at the time of planning. For example, the supplier of wood may
not be able to supply the required amount of wood for her manufacturing
process. She may face absenteeism among her employees. As a simplified
situation, assume that her supplier supplies her with either 2000 m2 of wood
a month, or if the supplier’s stocks are low, only 1500 m2 of wood. In some
months her employees put in 1600 labor-hours per month, and in others when
absenteeism is high they put in 1200 labor hours. Table 1.1 shows the optimal
product mixes for the manufacturer if she knew the conditions that she would
face the following month accurately when determining her product mix.
scenarios. Such a table in which the payoffs for each strategy-scenario pair What is a payoff
are listed is called a payoff table. table?
Scenarios
Strategy Prod. Mix Order I II III IV
A (400,400) Tables first Rs.340000 Rs.220000 Rs.265000 Rs.220000
B (400,400) Chairs first Rs.340000 Rs.266667 Rs.248333 Rs.248333
C (133,800) Tables first Rs.313333 Rs.313333 Rs.238333 Rs.238333
D (133,800) Chairs first Rs.313333 Rs.313333 Rs.225000 Rs.225000
E (500, 0) Rs.275000 Rs.220000 Rs.275000 Rs.220000
F (300,300) Tables first Rs.255000 Rs.255000 Rs.255000 Rs.255000
G (300,300) Chairs first Rs.255000 Rs.255000 Rs.255000 Rs.255000
Notice from the payoff table that each strategy is evaluated in terms of not
one but four payoffs, corresponding to the four scenarios that may arise. So
comparing two strategies is not in terms of comparing two numbers, but in
terms of comparing two vectors of numbers. This is not easy; for example,
Strategy A is better off than Strategy B in Scenario 3, but is worse off in Scenar-
ios 2 and 4. So the task of comparing strategies is the task of combining the
entries of the vectors of numbers corresponding to the strategies into single
numbers which can then be compared. While doing this, an idea about the
relative likelihood of the different scenarios is helpful. However, such an idea
may or may not be available. In this context, it is useful to realize that uncer-
tain decision making situations arise in a spectrum. On one end of the spec- Levels of uncer-
trum are those situations in which decision scenarios either occur completely tainty.
at random, or due to processes which have so many contributing factors that
it is not realistic to assume that we will have enough information to ascer-
tain the likelihood of different scenarios. Such decision making situations
are called situations of (deep) uncertainty. Such situations arise for example,
when one wants to predict the weather, say seven days in advance. On the
other end of the spectrum, there are situations in which the random process
giving rise to the scenarios are understood perfectly, and simple probabilistic
methods are useful to determine the likelihood of scenarios. Such situations
occur, for example, when one is betting on the throw of fair dice. These sit-
uations are called situations of stochastic uncertainty or risk. Most business
scenarios fall somewhere in between; some but not all the important deter-
minants of a phenomenon are known, and collecting enough data about the
determinants will allow the decision maker to have a rough idea about the
likelihood of particular scenarios. These are situations in which management
4 CHAPTER 1. COMPARING DECISIONS
tools such as market research become useful. In any scientific endeavor, the
idea is to start from the deep uncertainty end of the spectrum and move the
situation to the risk end of the spectrum through a better understanding of
the scenario and/or data collection.
The techniques used to make decisions about situations in different parts
of the spectrum are different, and are in general a mixture of techniques used
for deep uncertainty and risk. In the remainder of the chapter we explain
some of the techniques used for situations in the two ends of the spectrum.
Scenarios Payoff
Strategy I II III IV assigned
A Rs.340000 Rs.220000 Rs.265000 Rs.220000 Rs.340000
B Rs.340000 Rs.266667 Rs.248333 Rs.248333 Rs.340000
C Rs.313333 Rs.313333 Rs.238333 Rs.238333 Rs.313333
D Rs.313333 Rs.313333 Rs.225000 Rs.225000 Rs.313333
E Rs.275000 Rs.220000 Rs.275000 Rs.220000 Rs.275000
F Rs.255000 Rs.255000 Rs.255000 Rs.255000 Rs.255000
G Rs.255000 Rs.255000 Rs.255000 Rs.255000 Rs.255000
1.2. DECISION MAKING UNDER DEEP UNCERTAINTY 5
The decision maker then proceeds to choose that strategy for which the
payoff that they have assigned is the highest. In this example, they will be
indifferent between choosing either Strategy A or Strategy B.
Over time, decision makers who choose to follow the maximax strategy
can make large payoffs if observed scenarios are favorable in the long run.
However, if they are not, then such decision makers are liable to lose large
payoffs, and unless they have enough money in reserve, can be wiped out of
the market.
Scenarios Payoff
Strategy I II III IV assigned
A Rs.340000 Rs.220000 Rs.265000 Rs.220000 Rs.220000
B Rs.340000 Rs.266667 Rs.248333 Rs.248333 Rs.248333
C Rs.313333 Rs.313333 Rs.238333 Rs.238333 Rs.238333
D Rs.313333 Rs.313333 Rs.225000 Rs.225000 Rs.225000
E Rs.275000 Rs.220000 Rs.275000 Rs.220000 Rs.220000
F Rs.255000 Rs.255000 Rs.255000 Rs.255000 Rs.255000
G Rs.255000 Rs.255000 Rs.255000 Rs.255000 Rs.255000
for which the payoff that they have assigned is the highest. In this example,
they will be indifferent between choosing either Strategy F or Strategy G.
Over time, decision makers who choose to follow the maximin strategy will
make small payoffs in each period. They will tend to avoid any strategy that
yields a negative payoff in the worst case.
6 CHAPTER 1. COMPARING DECISIONS
Scenarios Maximum
Strategy I II III IV Regret
A Rs.0 Rs.93333 Rs.10000 Rs.35000 Rs.93333
B Rs.0 Rs.46666 Rs.26667 Rs.6667 Rs.46666
C Rs.26667 Rs.0 Rs.36667 Rs.16667 Rs.36667
D Rs.26667 Rs.0 Rs.50000 Rs.30000 Rs.50000
E Rs.65000 Rs.93333 Rs.0 Rs.35000 Rs.93333
F Rs.85000 Rs.58333 Rs.20000 Rs.0 Rs.85000
G Rs.85000 Rs.58333 Rs.20000 Rs.0 Rs.85000
For each decision strategy therefore, there are four regret values, one cor-
responding to each scenario. Since the decision maker does not have any
information about the likelihood of any of the scenarios being realized, the
decision maker chooses the largest of these regrets to represent the regret cor-
responding to a particular strategy. This value is shown in the last column
of Table 1.5 and gives the maximum amount by which the payoff from a par-
ticular strategy will be off from the optimal payoff if that strategy is adopted.
1.3. DECISION MAKING UNDER RISK 7
Obviously, the decision maker would like to minimize the maximum regret,
and chooses that strategy for which the value of the maximum regret is the
minimum. In this example therefore, the decision maker will choose Strategy
C if they adopt the min-max regret approach.
If one adopts a decision making approach using the min-max regret strat-
egy then one can incorporate a reason for sub-optimality of a decision which
is not accounted for in the other two approaches. Consider for example the re-
gret associated with Strategy F in Scenario I. In Strategy F the decision maker
decides to produce 300 tables and 300 chairs. In Scenario I, the amounts of
wood and labor available are 2000 m2 and 1600 labor-hours respectively, so
that even after producing the 600 articles, she has 500 m2 of wood and 400
labor-hours remaining unutilized. Her regret in this case is therefore due to
a loss in opportunity to create more tables and chairs. (She cannot produce
more of tables and chairs, since the other material that she needs for further
production, like paint, varnish, and fabric are not available with her.) This op-
portunity loss factor is not incorporated in either the maximax approach or
the maximin approach.
0.4×Rs.340000+0.3×Rs.220000+0.1×Rs.265000+0.2×Rs.220000 = Rs.272500.
8 CHAPTER 1. COMPARING DECISIONS
Similar calculations show that the expected payoffs of the other six strategies
are Rs.290500, Rs.290833, Rs.286833, Rs.247500, Rs.255000, and Rs.255000 re-
spectively. The decision maker chooses the scenario that maximizes expected
payoff, i.e., chooses Strategy C whose expected payoff is Rs.290833.
When taking decision under risk, the most common strategy used is that
of maximizing expected payoffs (or minimizing expected costs for cost mini-
mization problems). A part of the reason for doing so is that expected values
is one of the best known measures for handling uncertain situations, and is
thus well-understood by all parties involved in decision making.
Now the regret for Strategy A under the j -th scenario is (m j −a j ) while that
for Strategy B is (m j − b j ). So the expected regret E R(A) for Strategy A is
k
X k
X
E R(A) = (m j − a j )p j = m j p j − E P (A)
j =1 j =1
Since E P (A) > E P (B ), it follows that E R(A) < E R(B ) which implies that if the
decision maker’s objective was to minimize expected regret, they would have
chosen Strategy A over Strategy B.
1.4. PROBLEMS 9
So we see that the strategy which has the highest expected value has the
lowest expected regret, and hence decisions using the expected value criterion
match those using the expected regret criterion. Since expected values are
easier to understand and explain than expected regrets, the expected regret
criterion is not used in practice.
1.4 Problems
Problem 1.1: A wholesale shop sells crates of fresh (unprocessed) milk. Each
crate consists of 20 bottles. The shop procures crates at Rs.100 per crate, and
sells it at Rs.180 per crate. From past experience, the shop assesses the prob-
abilities of the daily demand for milk as follows:
Demand
(crates) Probability
1 0.1
2 0.1
3 0.3
4 0.3
5 0.2
b. If the shop uses the maximax criterion to make decisions, how many crates
of milk should they stock each day? How many crates should they stock if
they use the maximin criterion?
c. If the shop uses the maximum regret criterion to make decisions, how
many crates of milk should they stock each day? What would be the value
of the maximum regret for the decision that the shop takes?
d. If the shop uses the expected value criterion to make decisions, how many
crates of milk should they stock each day? What would be the expected
profit for the decision that the shop takes?
c. What is the maximum regret associated with the decision to submit a bid
for Rs.125000?
e. If SciTool wants to use the most likely scenario approach, what decision
should it take?
f. How much payoff should SciTool expect if they decide to submit a bid for
Rs.120000?
g. If SciTool wants to minimize the expected regret associated with their de-
cision, what decision should it take? What will be the value of the expected
regret for their decision?
Option 1: He can distill all the batches of the chemical at a cost of Rs.100 per
batch, and thus have no substandard batches sent to the customer.
Option 2: He can use a chemical reagent to test the quality of each batch at
a cost of Rs.20 per batch, and distill only those batches that are classi-
fied by the test as substandard at the cost of Rs.100 per batch. However
this test is inaccurate. While “good” batches never get classified as sub-
standard by the test, 5% of substandard batches get classified as “good”
and are not distilled. These batches when sent to a customer have to be
replaced at a cost of Rs.400 per batch replaced.
Option 3: He can use a specific gravity test which costs Rs.10 per batch, and
distill only those batches that are classified by the test as substandard at
the cost of Rs.100 per batch. This test is also not accurate. 10% of the
substandard batches get classified as “good” and have to be replaced
12 CHAPTER 1. COMPARING DECISIONS
later (at a cost of Rs.400 per batch). The test also classifies 10% of the
good batches as “substandard”.
Problem 1.5: Tara owns a company that manufactures a fruit product called
CrunchyBites. A packet of this product contains 400gms of dry fruits and
100gms of sugar syrup, and is very popular among young children. It sells
for Rs.11 per packet in the market.
It is the end of the month and Tara has to decide on her purchasing and
manufacturing strategy for the next month. Her supplier for dry fruits is not
reliable; there is a 60% chance that he will supply 4000kg of dry fruits for a
month and a 40% chance that he will supply 6000kg. He supplies dry fruits at
Rs.10 per kilo. As per contract, Tara has to buy the full quantity of dry fruits
that the supplier supplies. She can buy any quantity of dry fruits from the
open market as she likes, but at Rs.15 per kilo.
Tara’s syrup supplier can supply any quantity of syrup that she wants. If
she orders syrup now, before the dry fruits supplier has supplied the dry fruits,
the syrup supplier charges Rs.5 per kilo of syrup. However, if she delays her
order till after her dry fruit supplier supplies dry fruits, the charge goes up to
Rs.7 per kilo of syrup.
Tara’s packing and related overheads come to Rs.2 per pack. She can man-
ufacture a maximum of 15000 packets of CrunchyBites in a month.
The monthly market demand for CrunchyBites is the following:
Strategy 1: Buy 1500 kilos of syrup before the dry fruits supplier supplies fruits.
If the dry fruits supplier supplies 4000kg of fruits, then make and
sell 10000 packets of CrunchyBites; and if he supplies 6000kg, then
make and sell 15000 packets of CrunchyBites.
Strategy 2: Buy 1000 kilos of syrup before the dry fruits supplier supplies fruits.
If the dry fruits supplier supplies 4000kg of fruits, then make and
sell 10000 packets of CrunchyBites; and if he supplies 6000kg, then
make and sell 15000 packets of CrunchyBites buying 500 kilos of
syrup at the higher price.
Strategy 3: Buy 1500 kilos of syrup before the dry fruits supplier supplies fruits.
Make and sell 15000 packets of CrunchyBites, buying 2000kg of
dry fruits from the open market in case the dry fruits supplier sup-
plies 4000kg of fruits.
Strategy 4: Buy 1000 kilos of syrup before the dry fruits supplier supplies fruits.
Regardless of the amount of fruits that the dry fruits supplier sup-
plies, make and sell 10000 packets of CrunchyBites.
a. What is the payoff from each of the strategies if the dry fruits supplier
supplies 4000kgs of dry fruits and the market demand for CrunchyBites
is 15000 packets?
d. Given Tara’s strategies, what is the maximum regret (or opportunity loss)
if Tara decides to follow Strategy 1?
e. Given Tara’s strategies, what is the expected regret (or opportunity loss) if
Tara decides to follow Strategy 1?
f. What is the minimum chance of the demand being 15000 packets for which
your answer for part (c) remains unchanged?
g. Tara decides to insist that her dry fruit supplier supply her with 6000 kilos
of dry fruits every month. For this, the dry fruit supplier can increase his
price of dry fruits. What would be the maximum rate (Rs. per kilo of dry
fruits) that she should be prepared to pay?
2
CHAPTER
M ULTISTAGE D ECISION M AKING
U NDER R ISK
2.1 Introduction
Consider the example provided in Section 1.1. In the example, the decision
maker has to make her decision in one time point; the product mix that she
needs to aim for, and whether to produce tables first, or whether to produce
chairs first. In the majority of decision making scenarios, the decision mak-
ing is more complex. It involves making multiple decisions at different points
in time, and decisions made earlier have an effect on the payoffs of decisions
made later. In this chapter we study such multi-stage decision making situa-
tions.
We make two assumptions in this chapter. The first is that these situations Assumptions.
are of stochastic uncertainty, and each uncertain event is associated with a
probability of encountering it. The second assumption is that we evaluate dif-
ferent strategies of the decision maker in terms of the expected value criterion
described in Section 1.3. Although these assumptions are generally accepted
in practice, it is possible to consider situations in which they are not appro-
priate. The methods for dealing with such violations are not covered in this
chapter.
In order to understand multi-stage decision making processes, consider
the following decision problem faced by the production manager of a com-
pany X. The company produces a product which is seeing large increases in
1
2 CHAPTER 2. DECISION TREES
At the end of the life-cycle of the product, the production manager will
find himself in one of the following scenarios.
Scenario I: No proposal was made, and the production staff worked over-
time. (Additional profit was Rs.5 lakhs.)
2.2. REPRESENTING MULTI-STAGE DECISION PROBLEMS 3
Scenario II: No proposal was made, company Y’s service was used, and com-
pany Y delivered reliably. (Additional profit was Rs.12 lakhs.)
Scenario III: No proposal was made, company Y’s service was used, and com-
pany Y delivered erratically. (Loss of profit was Rs.4 lakhs.)
Scenario IV: A proposal was made and accepted, so that the production fa-
cility was augmented. (Additional profit was Rs.14 lakhs.)
Scenario V: A proposal was made and rejected, and the production staff work-
ed overtime. (Additional profit was Rs.5 lakhs.)
Scenario VI: A proposal was made and rejected, company Y’s service was used,
and company Y delivered reliably. (Additional profit was Rs.10 lakhs.)
Scenario VII: A proposal was made and rejected, company Y’s service was
used, and company Y delivered erratically. (Loss of profit was Rs.5 lakhs.)
Note that every decision scenario will not be observed for each of the de-
cision strategies. For example, if the production manager decides to adopt
strategy B, he will only see one of Scenarios B and C. If we construct a payoff
table for this problem, the table will be
Scenario
Strategy I II III IV V VI VII
A 5
B 12 −4
C 14 5
D 14 10 −5
sequence of decisions and events that make up the realization of the effects of
a decision strategy is represented as a path in the diagram from the present to
the future.
A diagram that shows all possible realizations in a decision problem is one
which starts at a single point on the left corresponding to the present situation
(before the decision making process starts) and then fans out to the right, with
paths representing different possible realizations. For the decision problem
that we consider, this diagram is shown in Figure 2.1. Each scenario represents
Pay
overtime Payoff:
Rs.5 lakhs
Company Y is
reliable Payoff:
Rs.12 lakhs
a b
Approach
Company Y
Payoff:
Company Y is – Rs.4 lakhs
erratic
Proposal is
approved Payoff:
Rs.14 lakhs
Pay
overtime Payoff:
c
Send proposal Rs.5 lakhs
to the board
Company Y is
d reliable Payoff:
Proposal is
rejected Rs.10 lakhs
e
Approach
Company Y
Payoff:
Company Y is – Rs.5 lakhs
erratic
a state at the end of the problem reached by a path (i.e., a realization) from the
left to the right. The payoff to be achieved in the scenario is also marked in the
diagram.
At every junction in the tree (represented by letters from ‘a’ through ‘e’)
there is a possibility of choosing one of multiple realizations. There is how-
ever a difference in the way paths are followed at each junction. In junctions
‘a’ and ‘d’ it is up to the decision maker, the production manager in this case,
2.2. REPRESENTING MULTI-STAGE DECISION PROBLEMS 5
to choose which of the paths he wishes to take. In the other three junctions,
the choice of paths is through a random process, and the decision maker can-
not guide the choice. In the problem situations that we consider, the choice
of paths happen with pre-specified probabilities. If we add these bits of infor-
mation, i.e., the distinction between junctions where the decision maker can
choose paths and where he cannot, and the probabilities with which paths
are chosen at junctions where the decision maker cannot choose the paths,
we obtain an enhanced version of the diagram in Figure 2.1 called a decision
tree. By convention, we represent junctions in which the decision maker can What is a deci-
choose paths with squares and call them decision nodes, while the other junc- sion tree?
tions are represented by circles and are called chance nodes or event nodes.
The probabilities of paths being chosen at each chance node is also added to
the diagram to form the decision tree. Figure 2.2 shows the decision tree for
the production manager’s problem.
Pay
overtime Payoff:
Rs.5 lakhs
Company Y is
reliable Payoff:
p = 0.8 Rs.12 lakhs
a b
Approach
Company Y
p = 0.2 Payoff:
Company Y is – Rs.4 lakhs
erratic
Proposal is
approved Payoff:
p = 0.7 Rs.14 lakhs
Pay
overtime Payoff:
c
Send proposal Rs.5 lakhs
to the board
p = 0.3 Company Y is
d reliable Payoff:
Proposal is
rejected p = 0.8 Rs.10 lakhs
e
Approach
Company Y
p = 0.2 Payoff:
Company Y is – Rs.5 lakhs
erratic
stage decision problem with all data for the problem represented conveniently.
In the next section we will see how to analyze decision trees and choose an op-
timal decision strategy.
is Rs.7 lakhs (the maximum of Rs.5 lakhs and Rs.7 lakhs). This allows us to
compute the expected payoff at chance node ‘c’ as Rs.(14 × 0.7 + 7 × 0.3) lakhs
or Rs.11.9 lakhs. Having computed the expected payoff at node ‘c’, we can
combine this information with the expected payoff at node ‘b’ and the given
payoff at the other node to ascertain that at node ‘a’, the optimal strategy is
to send the proposal of capacity augmentation to the board. And combining
the optimal decisions at decision nodes ‘a’ and ‘c’, we come to the conclusion
that the optimal strategy for the production manager is strategy C, i.e., to send
in a proposal for capacity augmentation to the board, and in case the board
rejects the proposal, to approach company Y with a subcontracting offer. The
expected payoff for this strategy is Rs.11.9 lakhs.
At this point we should be clear about out interpretation of the expected
payoff value of Rs.11.9 lakhs. This figure is an expected value. So this means
that if the production manager faced the same decision problem a large num-
ber (tending to an infinite number) of times, applied strategy C every time,
and computed the average of all the payoffs he received, the average payoff
would be Rs.111.9 lakhs. Every time he takes the decision, he stands to earn
Rs.14 lakhs with probability 0.7, Rs.10 lakhs with probability 0.3 × 0.8 = 0.24,
and lose Rs.5 lakhs with probability 0.3 × 0.2 = 0.06.
since there are two decision nodes ‘a’ and ‘d’ in the decision tree in Figure 2.2,
any change in the optimal strategy will be reflected as a change in the opti-
mal decision at one or both of these decision nodes. Now let us suppose that
submitting a proposal of capacity
£ a expansion to the board remains the opti-
a
¤
mal decision at node ‘a’ if p ∈ p l , p h , and approaching company Y remains
the optimal decision at node ‘d’ if p ∈ p ld , p hd . Then the range of p for which
£ ¤
happens, the optimal decision at node ‘a’ remains optimal when r ∈ [0, 1].
Therefore, strategy C remains unchanged when the probability of company
Y acting reliably is in the range [0.667,1].
It is important to reiterate a few points at this stage. First, when perform-
ing sensitivity analysis, the objective is to find the range of probability values
for which a complete decision strategy remains optimal. Hence it is not suf-
ficient to account only for changes at any one decision node. Note that the
decision at a decision node can change only when the payoffs for at least one
of the alternatives at that node changes. So if a probability value changes, then
decisions “downstream” to the position where that probability value occurs in
the decision tree do not change. Also, any decision “upstream” to that posi-
tion can change only if there are no intermediate decision nodes that filter out
the effect of the change (by choosing a decision alternative which does not lie
on the same path.)
Note that the advise of the expert is worth something to the production
manager before he takes the decision of whether or not to approach company
Y. After the decision has been taken the expert’s advice is of no practical use
since the production manager cannot take advantage of the advice to make
better decisions. Also note that since the expert is always right, and company
Y delivers reliably 80% of the time, the expert will say that company Y will de-
liver reliably with a probability of 0.8. However, in a given situation they will
respond with a definite YES or a definite NO, and not make probabilistic state-
This is an im- ments. Also note that since the expert is always right, the decision maker will
portant point to not second-guess the expert, and will take the expert’s decision as the truth
keep in mind. while taking decisions. (This point seems trivial in the present case, but will
be more consequential when we consider fallible experts.)
We know that without the expert’s advice, the optimal strategy for the pro-
duction manager is strategy C which yields an expected payoff of Rs.11.9 lakhs
(see Section 2.3). We also know that in case the expert says that company Y will
deliver reliably, the manager does not need to consider the option of paying
overtime, since the payoff from approaching company Y will be more than
that from paying overtime. Additionally we know that if the expert says that
company Y will not deliver reliably, then the manager does not need to con-
sider the option of approaching them, since the payoff from paying overtime
is higher. So when there is an option of asking the expert, the decision tree for
the decision process is the one shown in Figure 2.3.
Solving the decision tree we find that the expected payoff if the decision
maker decides to consult the expert is Rs.12.5 lakhs. In this diagram, the op-
timal strategy is to consult the expert (at decision node ’a’) and to approach
the board with a proposal no matter what the expert says (at decision nodes
‘c’ and ‘d’). If the board rejects the proposal, they should approach company Y
if the expert says that they will be reliable in their deliveries, and pay overtime
otherwise. The reason that the expert’s advice earns the production manager
a higher expected payoff is that the expert prevents the production manager
from ending up in scenarios in which he would lose money.
Since the expected payoff with access to the expert is Rs.12.5 lakhs while
that without access to the expert is Rs.11.9 lakhs, the worth of the expert’s
opinion is calculated to be Rs.12.5 lakhs - Rs.11.9 lakhs = Rs.0.6 lakhs. This
figure is an expected payoff figure obtained by assuming perfect information
What is EVPI? from the expert. So this figure is called the Expected Value of Perfect Informa-
tion (EVPI).
2.5. VALUE OF INFORMATION 11
Approach
Company Y Payoff:
Rs.12 lakhs
a Expert says that
Y is reliable Proposal is
c approved Payoff:
p = 0.8 Rs.14 lakhs
p = 0.7
e
Approach board
with proposal
p = 0.3 Payoff:
Proposal is rejected Rs.10 lakhs
b (approach Y)
Ask the expert
Pay
overtime Payoff:
Rs.5 lakhs
p = 0.2 Proposal is
d approved Payoff:
Expert says that
Y is erratic p = 0.7 Rs.14 lakhs
f
Approach board
with proposal
p = 0.3 Payoff:
Proposal is rejected Rs.5 lakhs
(pay overtime)
Next consider an imperfect expert. The imperfection of the expert can lead
the decision maker to two types of mistakes. The first type of mistake is when
an expert states in error that company Y will deliver reliably when it actually
delivers erratically. In such situations, if the production manager follows the
expert, then he makes a monetary loss. The second type of mistake is when
the expert erroneously states that company Y will be erratic in their delivery
when they actually deliver reliably. In this situation, if the production man-
ager follows the expert’s advice, then he will prefer paying overtime to ap-
proaching company Y and will settle for a lower payoff, thus incurring op-
portunity losses. Both these errors clearly do not occur for a perfect expert,
simply because such experts are infallible. These errors are also the reason
why an imperfect expert’s advice cannot be worth more than that of a perfect
expert.
12 CHAPTER 2. DECISION TREES
Why should one take advice from an imperfect expert when we know that
an imperfect expert makes mistakes and can advise a decision maker into sit-
uations in which the decision maker suffers monetary or opportunity losses?
One takes such advice because when the imperfect expert is correct in their
decision, they can prevent a decision maker from taking decisions that lead to
losses. One expects the latter type of situations to be more frequent than situ-
ations in which the expert makes mistakes, and in the expected value sense, a
decision maker is better off with an imperfect expert’s advice than without it.
It is also this reason that suggests that the decision maker follows the expert’s
advice even though there is a chance that the expert had made a mistake in
judgement. The decision tree with the option of asking an imperfect expert is
shown in Figure 2.4. Note that we have not determined the probability with
which such an expert will say that company Y will deliver reliably.
In order to compute the worth of an imperfect expert’s advice, we need to
find out the probability p with which the expert will say that company Y will
deliver reliably. Data on the expert’s past decisions can help us find the value
of p using Bayes’ rule. Consider for instance in our example, we have an expert
who when she says that company Y will deliver reliably is correct 90% of the
time, and when she says that they will deliver erratically is correct 80% of the
time. The joint probability of company Y delivering reliably and the expert
saying that they will is 0.9p and the joint probability of company Y delivering
reliably and the expert saying that they will not is (1 − 0.8)(1 − p). So in terms
of p, the probability that company Y will deliver reliably is 0.9p + 0.2(1 − p) =
0.7p +0.2. We know this value is actually 0.8, so that 0.7p −0.2 = 0.8 or p = 6/7.
This means that such an expert will say that company Y will deliver reliably
6/7-th of the time.
Solving the decision tree we find that the expected payoff if the decision
maker decides to consult the expert is Rs.12.2 lakhs. The optimal strategy for
the production manager in this situation is to ask the expert at node ‘a’ and
submit a proposal to the board at nodes ‘e’, ‘f’, ‘g’, and ‘h’. If the board rejects
the proposal, the production manager should approach company Y if the ex-
pert says that they are reliable, and pay overtime otherwise. In the diagram,
the expert’s erroneous advice causes the production manager to face a possi-
ble monetary loss at chance node ‘k’, and an opportunity loss at chance nodes
‘l’ and ‘m’.
Since the expected payoff with access to the imperfect expert is Rs.12.2
lakhs while that without access to the expert is Rs.11.9 lakhs, the worth of
the expert’s opinion is calculated to be Rs.12.2 lakhs - Rs.11.9 lakhs = Rs.0.3
What is EVSI? lakhs. As with EVPI, this figure is an expected payoff figure. In practice, since
imperfect information is most often obtained through sampling studies, this
expected value is called Expected Value of Sample Information (EVSI).
2.6. PROBLEMS 13
Expert is right
Payoff Rs. 12 Lakhs
Do not ask the Expected Payoff Probability
expert Rs. 11.9 Lakhs 0.9
Approach Y
d
Expert is wrong
Expert says Y is Payoff Rs. -4 Lakhs
reliable Probability 0.1
a Probability p c
Proposal is
approved
Payoff Rs. 14 Lakhs
Probability 0.7
Approach board
with proposal e
Expert is right
Payoff Rs. 10 Lakhs
Proposal is Probability
rejected, 0.9
b approach Y
Ask the expert f
Probability 0.3
Expert is wrong
Pay overtime Payoff Rs. -5 Lakhs
Payoff Rs. 5
Probability 0.1
Lakhs
Expert says Y is
Proposal is
erratic
g approved
Payoff Rs. 14 Lakhs
Probability 1-p
Probability 0.7
h
Approach board
with proposal Proposal is
rejected, pay
overtime
Payoff Rs. 5 Lakhs
Probability 0.3
2.6 Problems
Problem 2.1: A complex airborne navigating system incorporates a sub-ass-
embly which unrolls a map of the flight plan synchronously with the move-
ment of the aeroplane. This sub-assembly is bought on very good terms from
a subcontractor, but is not always in perfect adjustment on delivery. The sub-
assemblies can be readjusted on delivery to guarantee accuracy at a cost of
$220 per sub-assembly. It is not however, possible to distinguish visually those
assemblies that need adjustment.
Alternatively, the sub-assemblies can each be tested electronically to see
if they need adjustment at a cost of $48 per sub-assembly tested. Past experi-
ence shows that about 40 percent of those supplied are defective; the proba-
bility of the test indicating a bad adjustment when the sub-assembly is faulty
is 0.8, while the probability that the test indicates a good adjustment when the
sub-assembly is properly adjusted is 0.9. If the adjustment is not made and
the sub-assembly is found to be faulty when the system has its final check,
the cost of subsequent rectification will be $600.
Draw up a decision tree to show the alternatives open to the purchaser and
use it to determine his appropriate course of action.
Problem 2.2: A company prospecting for minerals divides its exploration area
into ten plots, intending to drill to a depth of 300 feet near the center of each
plot. Geological data suggest that the ten plots are either wholly within a large
mineral field discovered in a neighbouring area or wholly outside the field,
and that there is a 50:50 chance of either. Drilling to 300 feet within the field
would give a 50% chance of a strike, whereas outside the field there would be
virtually no chance of a strike.
On striking minerals the total operating profit can be expected to be $50
million, excluding the cost of exploratory drilling, the cost of which is $100,000
per hole.
After each hole has been drilled a decision is made whether or not to con-
tinue drilling with the next hole. The criterion used for this decision is whether
the expected drilling cost, not including the holes already drilled, exceeds the
expected operating profit.
1. What is the probability that the plots lie within the field, given that no
successful holes have been drilled?
2. How many unsuccessful holes will the company drill before abandoning
the search, and what is the expected drilling cost before the operation
starts?
2.6. PROBLEMS 15
Problem 2.3: There is 60% chance that there are oil-bearing rocks under a
piece of land. With current technology, if a region contains oil-bearing rocks,
there is 80% chance that if an oil and gas company drills a well in that region,
they will hit oil. It requires 1 million dollars to drill a well, and the revenue
earned from the oil extracted is 3 million dollars. The oil and gas company
wants to maximize profits, and follows the expected value criterion to decide
whether to drill a well in that piece of land.
a. Will the company drill a well in that piece of land? What is the expected
value of their best decision?
The company has the option of drilling zero, one, or two wells in that piece of
land. If they decide to drill wells, their decision to drill a second well depends
on the outcome of their drilling the first well.
c. Assume that the company drills a well and the well that they drill hits oil.
How does this fact revise the chance of the presence of oil-bearing rocks
under that piece of land? How does it affect the chance of the company
hitting oil under that piece of land?
d. Assume that the company drills a well and the well that they drill DOES
NOT hit oil. How does this fact revise the chance of the presence of oil-
bearing rocks under that piece of land? How does it affect the chance of
the company hitting oil under that piece of land?
e. Under this policy of deciding on drilling the second well depending on the
result of drilling the first well, what is their expected profit?
The company can make widgets by one of the two processes A and B. Pro-
cess A involves a fixed cost of Rs.10,000 and a variable cost of Rs.40 per widget
16 CHAPTER 2. DECISION TREES
a. Which of the two processes is the better choice for the company if it wants
to minimize expected manufacturing cost?
b. Within what ranges of probabilities will the choice that you suggest in part
(a) remain the better choice?
Problem 2.5: The Alpha-Omega Company is unsure about its market share
for a particular product. The market share could be 10%, 15%, or 35%. The
company’s initial guess about the chances of the market share being these
values are 0.20, 0.35, and 0.45 respectively. The size of the market is 200. If the
company decides to manufacture the product using process A, then it incurs
a fixed cost of Rs.25,000 and a variable cost of Rs.1200 per unit. If it manufac-
tures the product using process B, then it incurs a fixed cost of Rs.50,000 and
a variable cost of Rs.400 per unit.
a. Write down costs of the two processes under different market share situa-
tions.
b. Which of these two processes should the company use under the expected
value criterion? What would be the expected value of cost incurred?
c. The company is quite sure about the 0.20 probability of their market share
being 10% but are unsure of the other two probability values. Within what
range of probability values for the market share being 15% would their de-
cision in Part (b) remain optimal?
Problem 2.6: Consider the ABC Company of Problem 2.4. The company de-
cides to hire the services of a market research firm to have a better idea of its
actual market share. The market research firm charges a flat fee of Rs.100 re-
gardless of the sample size and an additional charge of Rs.10 per respondent
sampled.
b. What is the maximum number of respondents that ABC Company can ask
the market research firm to sample?
2.6. PROBLEMS 17
c. If the market research firm samples 5 respondents and 2 out of the 5 sam-
pled say that they would buy the widgets manufactured by ABC Company,
which of the two processes should ABC Company use to manufacture wid-
gets if they desire to minimize the expected cost of manufacturing wid-
gets? What is their expected manufacturing cost using this process?
b. If 2 out of the 18 say that they use Alpha-Omega’s product, which of these
two processes should the company use?
c. If 3 out of the 18 say that they use Alpha-Omega’s product, which of these
two processes should the company use?
d. Compute the EVSI if the company asks the market research agency to con-
tact 18 customers. What is the expected value of the company’s net gain
from this sampling survey?