Abstract
Complex decision support systems often consist of component modules which, encoding the judgements of panels of domain experts, describe a particular sub-domain of the overall system. Ideally these modules need to be pasted together to provide a comprehensive picture of the whole process. The challenge of building such an integrated system is that, whilst the overall qualitative features are common knowledge to all, the explicit forecasts and their associated uncertainties are only expressed individually by each panel, resulting from its own analysis. The structure of the integrated system therefore needs to facilitate the coherent piecing together of these separate evaluations. If such a system is not available there is a serious danger that this might drive decision makers to incoherent and so indefensible policy choices. In this paper we develop a graphically based framework which embeds a set of conditions, consisting of the agreement usually made in practice of certain probability and utility models, that, if satisfied in a given context, are sufficient to ensure the composite system is truly coherent. Furthermore, we develop new message passing algorithms entailing the transmission of expected utility scores between the panels, that enable the uncertainties within each module to be fully accounted for in the evaluation of the available alternatives in these composite systems.
Similar content being viewed by others
References
Abbas, A. E. (2010). General decompositions of multiattribute utility functions with partial utility independence. Journal of MultiCriteria Decision Analysis, 17, 37–59.
Ackermann, F. (1996). Participants’ perceptions on the role of facilitators using group decision support systems. Group Decision and Negotiation, 5, 93–112.
Anacleto, O., & Queen, C. M. (2013). Dynamic chain graph models for high-dimensional time series. In Technical Report. Department of Statistics, Open University.
Barclay, L. M., Smith, J. Q., Thwaites, P. A., & Nicholson, A. (2013). The dynamic chain event graph. In Technical Report. CRISM, The University of Warwick.
Braziunas, D., & Boutilier, C. (2006). Preference elicitation and generalized additive utility. Proceedings of the National Conference on Artificial Intelligence, 21, 1573–1576.
Cooke, R. M. (1991). Experts in uncertainty: Opinion and subjective probability in science. New York: Oxford University Press.
Cowell, R. G., Dawid, P. A., Lauritzen, S. L., & Spiegelhalter, D. J. (1999). Probabilistic networks and experts systems. New York: Springer.
Dawid, A. P. (1979). Conditional independence in statistical theory. Journal of the Royal Statistical Society. Series B (Methodological), 1–31.
De, S., & Faria, A. E. (2011). Dynamic spatial Bayesian models for radioactivity deposition. Journal of Time Series Analysis, 32, 607–617.
Ehrhardt, J. (1997). The RODOS system: Decision support for nuclear off-site emergency management in Europe. Radiation Protection Dosimetry, 73, 35–40.
Ehrhardt, J., Pasler-Sauer, J., Schule, O., Benz, G., Rafat, M., & Richter, J. (1993). Development of RODOS, a comprehensive decision support system for nuclear emergencies in Europe—an overview. Radiation Protection Dosimetry, 50, 195–203.
Faria, A. E., & Smith, J. Q. (1997). Conditionally externally Bayesian pooling operators in chain graphs. The Annals of Statistics, 25, 1740–1761.
Fishburn, P. C. (1967). Interdependence and additivity in multivariate, unidimensional expected utility theory. International Economic Review, 8, 335–342.
French, S., Ranyard, D. C., & Smith, J. Q. (1995). Uncertainty in RODOS. In Technical report, School of Computer Science, University of Leeds.
French, S. (1997). Uncertainty modelling, data assimilation and decision support for management of off-site nuclear emergencies. Radiation Protection Dosimetry, 73, 11–15.
French, S. (2011). Aggregating expert judgement. Revista de la Real Academia de Ciencias Exactas, Fisicas y Naturales. Serie A. Matematicas, 105, 181–206.
French, S., Maule, J., & Papamichail, N. (2009). Decision behaviour, analysis and support. New York: Cambridge University Press.
French, S., & Rios, Insua D. (2000). Statistical decision theory. London: Arnold.
Gómez M. (2004). Real-world applications of influence diagrams. In Advances in Bayesian networks (pp. 161-180). Berlin: Springer.
Jensen, F. V., & Nielsen, T. D. (2013). Probabilistic decision graphs for optimization under uncertainty. Annals of Operations Research, 204, 223–248.
Keeney, R. L., & Raiffa, H. (1993). Decision with multiple objectives: Preferences and value trade-offs. New York: Cambridge University Press.
Lauritzen, S. L. (1992). Propagation of probabilities, means and variances in mixed graphical association models. Journal of the American Statistical Association, 87, 1098–1108.
Leonelli, M., & Smith, J. Q. (2013a). Dynamic uncertainty handling for coherent decision making in nuclear emergency response. In Proceedings of the winter meeting of the ANS, (to appear).
Leonelli, M., & Smith, J. Q. (2013b). Using graphical models and multi-attribute utility theory for probabilistic uncertainty handling in large system, with application to the nuclear emergency management. In Data engineering workshops (ICDEW), pp. 181–192.
Leonelli, M., Riccomagno, E., & Smith, J. Q. (2015). Using computer algebra to symbolically evaluate discrete influence diagrams. In Technical Report (pp. 15–07). CRISM, The University of Warwick.
Little, R. J. A., & Rubin, D. B. (2014). Statistical analysis with missing data. Hoboken: Wiley.
O’ Hagan, A. (2006). Bayesian analysis of computer code outputs: A tutorial. Reliability Engineering and Safety Systems, 91, 1290–1300.
Oliver, R. M., & Smith, J. Q. (1990). Influence diagrams, belief nets and decision analysis. New York: Wiley.
Pearl, J. (1988). Probabilistic reasoning in intelligent systems: Networks of plausible inference. San Francisco: Morgan Kaufmann.
Phillips, L. D. (1984). A theory of requisite decision models. Acta Psychologica, 56, 29–48.
Queen, C. M., & Smith, J. Q. (1993). Multiregression dynamic models. Journal of the Royal Statistical Society, Series B, 55, 849–870.
Richter, K., Gering, F., & Müller, H. (2002). Data assimilation for assessing radioactive contamination of feed and foodstuffs. In Development and application of computer techniques to environmental studies, WIT Press, pp. 393–402.
Shafer, G. (1976). A mathematical theory of evidence. Princeton: Princeton University Press.
Smith, J. Q., Barons, M. J., & Leonelli, M. (2015). Coherent frameworks for statistical inference serving integrating decision support systems. arXiv preprint arXiv:1507.07394.
Smith, J. Q. (1989a). Influence diagrams for Bayesian decision analysis. European Journal of Operational Research, 40, 363–376.
Smith, J. Q. (1989b). Influence diagrams for statistical modelling. The Annals of Statistics, 17, 654–672.
Smith, J. Q. (1994). Plausible bayesian games. In Bayesian Statistics 5 (pp. 387–406). Clarendon Press.
Smith, J. Q. (2010). Bayesian decision analysis: Principles and practice. Cambridge: Cambridge University Press.
Smith, J. Q., & Anderson, P. E. (2008). Conditional independence and chain event graphs. Artificial Intelligence, 172, 42–68.
Smith, J. Q., & Papamichail, N. (1999). Fast Bayes and the dynamic junction forest. Artifical Intelligence, 107, 99–124.
Von Neumann, L. J., & Morgenstern, O. (1947). Theory of games and economic behavior. Princeton: Princeton University Press.
Von Winterfeldt, D., & Edwards, W. (1986). Decision analysis and behavioral research. Cambridge: Cambridge University Press.
Wakker, P. P. (2008). Explaining the characteristics of the power (CRRA) utility family. Health Economics, 17, 1329–1344.
Walley, P. (1996). Measures of uncertainty in expert systems. Artificial Intelligence, 83, 1–58.
West, M., & Harrison, J. (1997). Bayesian forecasting and dynamic models. New York: Springer.
Wisse, B., Bedford, T., & Quigley, J. (2008). Expert judgement combination using moment methods. Reliability Engineering and System Safety, 93, 675–686.
Zheng, D., Leung, J. K. C., & Lee, B. (2009). Online update of model state and parameters of a Monte Carlo atmospheric dispersion model by using ensemble Kalman filter. Atmospheric Environment, 43, 2005–2011.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1: Proof of Theorem 1
We develop this proof via backward induction both through the vertices of the DAG and through time. For the purpose of this proof define for \(t=T\)
and note that \(\bar{u}^{T,1}\equiv \bar{u}^T\).
First, without any loss of generality, fix a policy \(d^T\). Then start the backward induction from \(\varvec{Y}_n(T)\), which, by construction, is a leaf of the time slice DAG at time T. For a leaf, \(\varvec{Y}_i(T)\) say, it follows from (4) that \(\tilde{u}_{T,i}=u^{\mathscr {G}}_i(\varvec{r}_{A_i})\) and note that consequently \(u^{\mathscr {G}}_i\) is a function of \(\varvec{Y}_n(T)\) only through \(\tilde{u}_{T,n}\). Therefore \(\tilde{u}_{T,n}\) can then be simply marginalized as in Eq. (5) to obtain \(\bar{u}_{T,n}\). Furthermore
Now consider \(\varvec{Y}_{n-1}(T)\). The vertex associated with this random vector in the time slice DAG is either the father of \(\varvec{Y}_{n}(T)\) or a leaf of the DAG. In the latter case, since by construction \(n-1\in U\), the exact same method followed for \(\varvec{Y}_n(T)\) can be applied to \(\varvec{Y}_{n-1}(T)\), and thus
If on the other hand \(\varvec{Y}_{n-1}(T)\) is the father of \(\varvec{Y}_n(T)\), then by construction \(\varvec{Y}_{n-1}(T)\) has only one son. Thus from Eq. (4) \(\bar{u}_{T,n}\equiv \tilde{u}_{T,n-1}\) and Eq. (22) is a function of \(\varvec{Y}_{n-1}(T)\) only through \(\bar{u}_{T,n}\). In order to deduce \(\bar{u}^{T,n-1}\) only \(\tilde{u}_{T,n-1}\) has to be marginalized with respect to \(f_{T,n-1}\) and therefore
We can note from Eqs. (23) and (24) that \(\bar{u}^{T,n-1}\) consists of the linear combination of two summations: the first over the leaves of the graphs with index j smaller than \(n-1\) of utility terms \(u^{\mathscr {G}}_j\); the second over the indices j bigger or equal than \(n-1\) of the terms \(\bar{u}_{T,j}\) such that the father of \(\varvec{Y}_j(T)\) has an index smaller than \(n-1\) in the time slice DAG. So for example in Eq. (23) the second summation is over both n and \(n-1\) since the associated vertices are both leaves of the graphs. On the other hand in Eq. (24) there is no term \(\bar{u}_{T,n}\) since its father has index \(n-1\). More generally, for \(j\in [n]\), \(\bar{u}^{T,j}\) can be written as the linear combination of the following two summations:
-
The first over the indices i in \(Le(\mathscr {G})\cap [j{-1}]\) of \(u^{\mathscr {G}}_i\);
-
The second over the indices k in \(B_j=\{k\ge j: F_k<j\}\) of \(\bar{u}_{T,k}\), where \(F_k\) is the index of the father of \(\varvec{Y}_k^\text {T}\).
Therefore, for a \(j\in [n]\), we have that
where \(Dn_k\) is the set of the indices of the descendants of \(\varvec{Y}^\text {T}_k\). In particular for \(\varvec{Y}_2(T)\) we can write Eq. (25) as
since, by the connectedness of the time slice DAG, \(\varvec{Y}_1(T)\) is the father of all the vertices whose father’s index is not \([n]{\setminus }\{1\}\). It then follows that Eq. (26) corresponds to \(\tilde{u}_{T,1}\), as defined in Eq. (4), and therefore \(\bar{u}^T\) can be written as in Eq. (3). Thus Theorem 1 holds for time T.
Now, since \(\varvec{Y}_1(T)\) is the unique root of the time slice DAG, if \(i,j\in S_1\), then
Suppose that any vertex \(\varvec{Y}_j(T)\), for \(j\in S_1\), is either connected by a path to one only leaf of the DAG or is a leaf of the graph itself. Because of the identity in Eq. (27) and because of the algebraic form of Eq. (26), which consists of a linear combination of the terms \(\bar{u}_{T,j}\), for \(j\in S_1\), we can deduce that Eq. (6) holds for the last time slice. Now, consider the case where one vertex \(\varvec{Y}_j(T)\) with index in \(S_1\) is connected to more than one leaf. Equation (4) guarantees the existence of a vertex \(\varvec{Y}_i(T),\, i>j\), connected to both \(\varvec{Y}_j(T)\) and the above mentioned leaves, such that \(\tilde{u}_{T,i}\) can be written as a linear combination of terms \(\bar{u}_{T,k}\), for which each of these terms is a function of one of the leaves only. It therefore follows that Eq. (6) also holds in this case.
Therefore Eq. (6) guarantees that \(\bar{u}^{T,1}\) can be written as a linear combination of terms involving only variables in the same ancestral set. Since also the probability factorisation does not change as formalised in Proposition 1, the exact same recursions we explicated at time T can then be followed at time \(T-1\) by substituting \(u^{\mathscr {G}}_i\) with \(\hat{u}_{T-1,i},\, i\in Le(\mathscr {G})\), in Eqs. (21)–(25) and by changing the time index. This then also holds for any time slice t, \(1\le t\le T-1\), since \(\bar{u}^{t,1}\) will be again a linear combination of terms \(\hat{u}_{t-1,i},\, i\in Le(\mathscr {G})\), and the probability density function factorizes as in Proposition 1.
Appendix 2: Proof of Theorem 2
To prove Theorem 2 we proceed as follows:
-
We relate the lines of the pseudo-code of Algorithm 3.1 to the Eqs. (3)–(6) of Theorem 1 and their variations which include optimization steps in Eqs. (7) and (8);
-
We then show that each panel and the SB have sufficient information to perform the steps of the algorithm they are responsible for;
-
We conclude by showing that the optimization steps, which in the algorithm correspond to lines (8) and (15), are able to identify optimal decisions using only combinations of quantities individual panels are able to calculate.
We start with the first two bullets. Line (1) describes the backward induction step over the time index, t, while line (2) does the same over the index of the vertices of the graph, i. Now note that in lines (5)–(7), Panel \(G_i:\tilde{u}_{t,i}\) using Eq. (4). Each panel has enough information to do this, since line (10) guarantees that the scores are communicated to the panels overseeing father vertices and line (14) denotes the fact that the SB transmits \(\hat{u}_{t,i}\) to the appropriate panels. The functions \(\tilde{u}_{t,i}\) are then sent to the SB, who performs an optimization step in line (8) and communicates the result back to the panel. We address the validity of this step below.
Since the \(SB:u^*_{t,i}\longrightarrow G_i\), each panel is able to compute \(\bar{u}^*_{t,i}\) (lines 10–11) following Eq. (8). As noted before, if i is not the root of the DAG, \(\bar{u}^*_{t,i}\) is sent to the appropriate panel, whilst if \(i=1\), as specified by the if statement in line (9), \(\longrightarrow SB\). For each time slice with time index \(t\ne 1\) lines (13)–(14) compute \(\hat{u}_{t,i}\), as in Eq. (6). These are sent to the appropriate panels, which can then continue the backward inductive process from the time slice with a lower time index. If on the other hand \(t=1\), then the expected utility is a function of the initial decision space \(\mathscr {D}(0)\) only. The SB can then perform a final optimization step over this space and thus conclude the algorithm (line 15).
We now address the optimization steps. The influence on the scores associated with time slices with index bigger than t of a decision space \(\mathscr {D}_i(t)\) are included, by construction, only in the terms \(\hat{u}_{t,k}\), where k is either the index of a descendant \(\varvec{Y}_k(t)\) of \(\varvec{Y}_i(t)\) or \(k=i\). Further note that the same decision space \(\mathscr {D}_i(t)\) can affect the scores of terms including descendants of \(\varvec{Y}_i(t)\) at the same time point. Thus the whole contribution of \(\mathscr {D}_i(t)\) is summarized within \(\tilde{u}_{t,i}\), as it can be seen by recursively using Eqs. (4) and (5).
Now, as specified by Eq. (7), the optimization step over \(\mathscr {D}_i(t)\) is performed by maximizing \(\tilde{u}_{t,i}\), which carries all the information concerning this decision space. More specifically, no other term is an explicit function of \(\mathscr {D}_i(t)\) at this stage of the algorithm, as guaranteed by Eqs. (1). Finally, Structural Assumption 2 guarantees that all the elements that appears as arguments of \(\tilde{u}_{t,i}\) are observed and therefore known at the time the decision associated to this decision space needs to be made.
Rights and permissions
About this article
Cite this article
Leonelli, M., Smith, J.Q. Bayesian decision support for complex systems with many distributed experts. Ann Oper Res 235, 517–542 (2015). https://doi.org/10.1007/s10479-015-1957-7
Published:
Issue date:
DOI: https://doi.org/10.1007/s10479-015-1957-7