Mack Chain Ladder Sloma
Mack Chain Ladder Sloma
Model of Reserving
with Robust Estimation
by Przemyslaw Sloma
ABSTRACT
In this paper we consider the problem of stochastic claims reserv-
ing in the framework of development factor models (DFM). More
precisely, we provide the generalized Mack chain-ladder (GMCL)
model that expands the approaches of Mack (1993; 1994; 1999),
Saito (2009) and Murphy, Bardis, and Majidi (2012). Our general
flexible tool of reserving provides the solution to the one of the
major challenges of day-to-day actuarial practice, which is quantify-
ing the variability of reserves in the case where different methods of
selecting loss developments factors (LDFs) are applied. We develop
the theoretical background to estimate the conditional mean square
error of prediction (MSEP) of claims reserves that is consistent with
actuarial practice in selecting the LDFs.
Moreover, we present an example of GMCL’s application in
which we indicate how to bridge the estimation of parameters in
the chain-ladder framework with the robust estimation techniques.
Finally, we show how our approach can be used in validation of the
reserve risk evaluation in the Solvency 2 context.
KEYWORDS
Non-life insurance, stochastic claims reserving,
Mack chain ladder, robust estimation solvency 2
(ratio of square root of MSEP of ultimate loss over type stochastic reserving methods to determine the
ultimate loss) in order to derive the MSEP estimators economic capital corresponding to the reserve risk.
of a new approach. It turns out that in general these The bootstrap method allows estimation of a whole
approximations are highly inappropriate (see exam- claims reserves distribution via resampling tech-
ple in Section 6.3). niques and Monte Carlo simulations. It seems to
We think that, in some simple cases (no curve be crucial for non-life insurance companies to be
fitting for LDFs, for example) the approximations able to validate the results given by the industrial
mentioned above are the consequences of bad under- software where we do not have access to the code
standing of the main formula for estimation of MSEP and when the number of shortcuts may be applied.
in Mack (1993). Moreover, the approximations used Our approach can be used to validate the estimation
by actuaries and actuarial software developers could of the first two moments of the loss distribution in
be avoided by using the more appropriate existing the case where selection of development factors was
models. One such model was proposed by Mack employed and the different weights in estimation of
(1999). To our knowledge, this was the first study that chain-ladder factors and volatility parameters were
showed how to measure the uncertainty of reserves used (see Section 6.3 for more details).
in the situation when an actuary selects the LDFs. Second, our general Mack chain-ladder (GMCL)
In our opinion, the important results obtained in this model can be used to construct the proxy solutions
paper are not always used in practice because, instead to overcome the limits of Mack’s (1999) approach,
of explicit formula, the recursive equation is given i.e., the use of the same weights for parameters esti-
there for estimation of MSEP of ultimate loss. mation (see discussion in Section 3.6). This means
Mack (1999) is an important paper that allows that, for the methods where we eliminate the consid-
the fully understanding of the MSEP formula and erable number of observations, we reduce as well the
avoid the inappropriate approximation when it is not data for variability of the reserves. This mechanically
necessary. We summarize the details of this method impacts the estimation of MSEP of loss liabilities,
in Section 3. One of the major limitations of this which in such cases is generally underestimated. We
method is the underestimation the MSEP of ultimate propose then the possible solution to overcome this
loss in the case where the number of excluding kind of difficulty (see Section 6.1 for more details).
data is important. We discuss this topic in detail in Finally, the third and really important application
Section 3.5. One possible solution to overcome this from a practical point of view consists of bridg-
difficulty is to extend the existing approach proposed ing the point estimation of chain-ladder parameters
by Mack (1999). with the theory of robust statistics. As mentioned
Therefore, we propose a general approach for above, the point estimators of chain-ladder factors
stochastic claims reserving in the framework of chain- can be obtained in the linear regression framework
ladder model, extending the model proposed by Mack by applying the weighted least squares procedure. It
(1999) and Murphy, Bardis, and Majidi (2012). This is well known that the OLS estimators are fragile to
extension is three fold. First, our general tool has a the outliers. That is why we propose using the robust
educational role and makes it possible to validate the techniques of estimation such as: M-estimators,
results from other approaches. More precisely, our Lp-estimators, etc (see Section 6.2).
general formula for estimation of MSEP of outstand- The reminder of this paper is organized as follows.
ing loss liabilities can be used to fully understand the In Section 2 we present our notations and definitions.
Mack (1993), Mack (1994), and Mack (1999) model. We review in Section 3 the MCL and its main limi-
Furthermore, under new solvency requirements of tations. In Section 4, we present the (GMCL) and
Solvency II, insurance companies use the bootstrap- the main results are derived in Section 5. Finally,
2. Notations and definitions We use the term claims reserves to describe the pre-
diction of the outstanding loss liabilities. Hence, let
2.1. Run-off triangle
R̂i and R denote the claims reserves for accident year i,
Let Ci,j denote the random variables (cumulative R̂i = Ĉi,I – Ci,I–i+1, i ∈ {1, . . . , I}, and the total claims
payments, inccured, reported claims numbers, etc.) reserves for aggregated accident years, R̂ = ∑ i=1 I
R̂i,
for accident year i ∈ {1, . . . , I} until development year respectively, where Ĉi,I is a predictor for Ci,I.
j ∈ {1, . . . , J}, where the accident year is referred
to as the year in which an event triggering insurance 2.3. (Conditional) mean square error
claims occurs. We assume that Ci,j are random vari- of prediction (MSEP)
ables observable for calendar years i + j ≤ I + 1 and As already stated above, finding suitable pre
non-observable (to be predicted) for calendar years diction of ultimate loss is rather the beginning of
i + j > I + 1. The observable Ci,j are represented by the process of reserving, and insurers need to assess
the so-called run-off trapezoids (I > J) or run-off tri- the variability of these amounts. We are interested then
angles (I = J ). Table 1 gives an example of a typical in the quantification of the prediction uncertainty of
run-off triangle. In order to simplify our notation, the ultimate loss, i.e., Ĉi,I and ∑ i=1
I
Ĉi,I, (or equivalently
we assume that I = J (run-off triangle). However, all of claims reserves, i.e., R̂ i and R̂ = ∑ i=1 I
R̂i). For that,
the results we present here can be easily extended to we have to choose an appropriate risk measure which
the case when the last accident year for which data determines a conception of measuring the “distance”
is available is greater than the last development year, between the prediction and the actual outcomes.
i.e., I > J (run-off trapezoid). In this paper, following the actuarial literature, we
quantify the prediction uncertainty using the most
2.2. Outstanding reserves popular such measure, the so-called mean-square
Let Ri et R denote the outstanding claims liabilities error of prediction (MSEP).
for accident year i ∈ {1, . . . , I},
( )
2
msepCˆiI DI (CiI ) = E Cˆ iI − CiI DI , (2.3)
Table 1. Run-off triangle (I = J )
2
Development Year j I I ˆ I
Accident
msep ˆ iI DI ∑ C
i =1 iI = E ∑ C − ∑ CiI DI ,
∑ i =1 i =1
I iI
Year i 1 2 3 4 j ... J C
i =1
1
(2.4)
2 Ci,j
(observations)
3 where
I–j
I–2 Ci,j DI = {Ci, j : i + j ≤ I + 1} , (2.5)
(to be predicted)
I–1
I denote the claims data available at time t = I.
The parameters fk are often called loss devel- σˆ 2I −1 := min ( σˆ 4I − 2 σˆ 2I −3 , min ( σˆ 2I −3 , σˆ 2I − 2 )) . (3.5)
opment factors (LDF), link ratios or age-to-age
factors. Remark 3.1. The parameter α determines the
2. There exist constants s k2 > 0 such that for all different ways of estimation of fk . For the sake of
1 ≤ i ≤ I and 1 ≤ k ≤ I – 1 we have simplicity, let us assume that wi,j = 1 for all i, j.
We present below the possible choices of α and their This implies, together with the fact that fˆk are
interpretation. uncorrelated, that Ĉi,I is unbiased estimator of
E(Ci,I DI).
1. If α = 1 we get the classical chain ladder estimate
v. The expected values of the estimator
of fk
I −1
I −k I −k Cˆ i,I = Ci,I +1−i i ∏ fˆk ,
∑ Ci,k Fi,k ∑ Ci,k +1 k = I +1− i
fˆk = i =1
I −k = i =1
I −k , for 1 ≤ k ≤ I − 1.
for the ultimate claims amount and of the true ulti-
∑ Ci,k ∑ Ci,k
i =1 i =1 mate claims amount Ci,I are equal, i.e., E(Ĉi,I) =
E(Ci,I), 2 ≤ i ≤ I.
2. If α = 0 we get the model for which the estimators
The proof is postponed to Appendix A.3.
of the age-to-age factors fk are the straightforward
average of the observed individual development
3.4. Estimators of conditional MSEP
factors Fi,j defined via (3.1), i.e.,
in MCL model
1 I −k 3.4.1. Single accident years
fˆk = ∑ Fi,k , for 1 ≤ k ≤ I − 1.
I − k i =1 Under assumptions of the MCL model we have the
following estimator for the conditional estimation
3. If α = 2 we get the model for which the estima- error of a single accident year i ∈ {2, . . . , I}:
tors of the age-to-age factors fk are the results of
ˆ ( Ci , I )
msep
an ordinary regression of {Ci,k+1}i∈{1,...,I–k–1} against Ci,I DI
Here we choose from Blumsohn and Laufer (2009) (5) Median: fˆk are computed as an arithmetic aver-
several such methods where estimates are com- age of individual link ratios Fi,j in the way to
puted as different averages using varying weights obtain the sample median. More precisely, we
and varying number of accident years: all/3/5-years put wi,j = 1 or wi,j = 0 in the way that the estima-
weighted average and all excluding higher and lower tors of the age-to-age factors fk are given by
(AEHL) factor average. We consider as well other
popular methods in actuarial practice based on sam- fˆk = median {Fi,k : i ∈{1, . . . , I − k }} .
ple median.
More precisely, for RAA run-off triangle (see The median denotes the sample median that
Appendix B, Section B.8), we apply the MCL model for the sample X1, . . . , Xn is computed by
from Mack (1999) with the following parameters.
median { X i : i ∈{1, . . . , n}}
For all five methods described below we choose
α = 0 ( fˆk arithmetic averages of Fi,k) and we compute X n +1 if n is odd
the estimators fˆk and ŝ 2k according to formula (3.3) 2
and (3.4)–(3.5), respectively. This allows us to com- := X + X .
n n
2 +1
2
pare the results with sample Median method which otherwise
2
is rather consistent with straightforward average of
development factors (see method number (5) below) where X(k) denotes the k – th order statistics of the
(1) ALL AV: fˆk are computed as arithmetic average sample X1, . . . ,Xn.
of all individual link ratios Fi,j. More precisely,
Remark 3.2. In the case where there is only
we define the weights in the following way:
one observation in estimation of parameter sk (odd
wi,j = 1 for all i, j.
number of data in sample median computation) we
(2) AEHL: fˆk are computed as arithmetic average of choose the additional Fi,k factor in order to have two
all individual link ratios, excluding the highest observations and be able to apply the formula (3.4).
and the lowest values of Fi,j. More precisely,
we define the weights in the following way: In Table 2, we present the estimation of total amount
for fixed j, wi,j = 0 for i such that Fi,j = F(I–j),j and of claims reserves R̂ as well as the value of estimators
Fi,j = F(1),j, where F(k),j for k = 1, . . . , I – j denotes of aggregated MSEP(R̂). Recall that R̂ : = ∑i=1 I
R̂ i,
the order statistics of Fi,j. For remaining indices i, where R̂i := Ĉi,I – Ci,I–i+1. We observe that to obtain R̂ it
for fixed j, we take wi,j = 1 is enough to have the estimators Ci,I of ultimate claims
(3) 5 Years AV: fˆk are computed as an arithmetic Ĉi,I for all accident year i. In consequence MSEP(R̂) =
I
average of individual link ratios Fi,j from five MSEP(∑ i=1 Ĉi,I) and we use the formula (3.7) to esti-
latest accidents years. More precisely, we define mate this quantity. We compute as well the coefficient
the weights in the following way: wi,j = 1 for of variation of R̂, given by CV(R̂) = R̂/MSEP(R̂)1/2.
i = I – j, . . . , I – j – 4. For remaining indices i, The last two lines of Table 2 indicate the relative
for fixed j, we take wi,j = 0 proportion of R̂ and MSEP(R̂)1/2, for each of five
methods considered, in comparison to the ALL AV
(4) 3 Years AV: fˆk are computed as an arithmetic
method which is the reference method in our example.
average of individual link ratios Fi,j from three
latest accidents years. More precisely, we define
the weights in the following way: wi,j = 1 for
3.6. Limits of MCL method
i = I – j, . . . , I – j – 2. For remaining indices i, As can be seen in Table 2, the four last methods
for fixed j, we take wi,j = 0 (columns (2)–(5)) reduce significantly the estimation
Table 2. Estimation of total amount of outstanding loss liabilities (R̂), value of estimator
of aggregated MSEP(R̂)1/2 and coefficient of variation CV(R̂), for five methods
alpha=0
ALL AV AEHL 5 Years AV 3 Years AV Median
Item/method (1) (2) (3) (4) (5)
R̂ 93 643 65 868 75 886 68 645 54 059
MSEP(R̂) 1/2
92 549 21 015 27 486 29 493 14 786
CV(R̂) 99% 32% 36% 43% 27%
(1)/(1) (2)/(1) (3)/(1) (4)/(1) (5)/(1)
R̂ (%) 100% 70% 81% 73% 58%
MSEP(R̂)1/2 (%) 100% 23% 30% 32% 16%
of MSEP(R̂)1/2 comparing to the first method ALL AV. for the expectation of fˆk and Var( fˆkBk) are not correct.
For the methods (3), (4) and (5), this is mainly due Regarding the sample median method, the derivation
to the elimination of relatively significant number of of Var( fˆkBk) requires the computation of the moments
development factors from estimation especially for of order statistics (see Jeng 2010) and those are
the first development years which correspond to the strongly related to the distribution of Fi,j. To over-
columns of the run-off triangle. This phenomena is come these difficulties we propose two solutions:
especially seen in the case of sample median method the simple Proxy method (see Section 6.1) and the
in which, for each development factor we keep at more complex one based on a robust estimation (see
most two of link ratios Fi,j in estimation of fk. From Section 6.2). The first approach is programmed to
statistical point of view, this is clearly not enough avoid artificial volatility increase and it is based on
to perform the robust estimation. As a consequence, all link ratios in estimation of volatility parameters sk
this kind of methods reduce unnaturally the variabil- (scale parameters in linear regression). The second
ity of reserves. This could be dangerous for exam- method consists on developing an approach that allows
ple in terms of evaluation of the economical capital using any robust estimators of fk (location) and sk
for reserve risk required by the new Solvency II (scale) parameters.
regime.
Beyond the limits stated above, there are some
incoherences with application of weights wi,k for 4. General Mack chain-ladder
the AEHL and sample median methods. Indeed, the model
weights wi,k should be Ci,k measurable random vari-
ables in order to be able to derive the main results of 4.1. Model assumptions
MCL approach (see, for example, Proposition A.2). Before stating the main assumptions of our general
Although for the method 5 Year AV and 3 Years AV approach, let us assume that functions gd,j:[0, ∞) →
we can fix the weights without knowing the infor [0, ∞) are Borel measurable. Let di,j be the non-
mation DI (knowing all observation in the run-off
negative random variables defined by, di,j: = gd,j(Ci,j).
triangle, see (5)), this is not a case for the AEHL and
Our model is formalized by the following
sample median methods. The reason is that we need
assumptions:
to know the observation Fi,k in order to specify the
corresponding weights for those two methods. That (GMCL.1) There exist constants fk > 0 such that
is why the weights wi,k are not Ci,k measurable but
rather DI-measurable. This means that the formula E ( Fi,k Ci,1 , . . . , Ci,k ) = fk .
(GMCL.2) There exist constants s 2k > 0 such that where Ik represents the number of weights di,k
for all 1 ≤ i ≤ I and 1 ≤ k ≤ I – 1 we have different from 0, namely, Ik := card{i:di,k ≠ 0}.
In the analogue way to (3.4) we define
σk
2
if δ i,k ≠ 0 a.s.,
Var ( Fi,k Ci,1 , . . . , Ci,k ) = δ i,k σˆ 2I −1 = min ( σˆ 4I − 2 σˆ 2I −3 , min ( σˆ 2I −3 , σˆ 2I − 2 )) . (4.5)
∞ if δ i,k = 0 a.s.,
Proposition 4.1.
(4.1)
(i) The estimators fˆk given in (4.2) are unbiased and
uncorrelated.
where a.s. means almost surely.
(ii) For k = 1, . . . , I – 1, if di,k = γi,k for all i, then
(GMCL.3) The accident years (Ci,1, . . . , Ci,J)1≤i≤I
the estimators fˆk of fk have the minimal variance
are independent
among all unbiased estimators of fk which are
We observe that from the above assumptions the the weighted average of the observed develop-
main difference between MCL and GMCL lies in ment factors Fi,k.
the variance assumption. This modification allows For k = 1, . . . , I – 1, if di,k ≠ γi,k, for some i,
us to introduce different weights in estimation of the then the relative efficiency of s.e. ( fˆkg≠dBk) with
parameters fk and sk. respect to s.e.( fˆkg=dBk), i.e., the ratio
I − k I − k γ 2j,k
It becomes obvious from assumption (GMCL.2) ∑ δ i,k ∑ i 1{δ }
j, k ≠ 0
σk2
j =1 δ j,k
that in order to compute correctly the variance E [ σˆ 2k − σ 2k ] = E .
i =1
− 1
Ik − 1
2
of fˆk (see Proposition A.2 in Appendix) we have I −k
∑ γ j,k
to assume that j =1
{if δ i, j = 0 then γ i, j = 0} . (4.3) (iv) Under the model assumptions (GMCL.1) and
(GMCL.3) we have
• Given the information DI, the variance parameters
s 2k are estimated by E (Ci,I DI ) = Ci,I +1−i fi,I +1−i i . . . i fI −1 .
∑ γ l,k
i=2
for the ultimate claims amount and of the true
ultimate claims amount Ci,I are equal, i.e., l =1
E(Ĉi,I) = E(Ci,I), 2 ≤ i ≤ I. I −k
( γ l,k )2 i 1
i ∑ δ l,k {δ j,k ≠ 0} , (5.4)
The proof of this Proposition is postponed to the l =1
Appendix A.4.
where fˆj and ŝ 2j are defined in (4.2) and (4.4)–(4.5),
Remark 4.1.
respectively.
• If we set γi,j = di,j = wi,jC αi,j, for α ∈ {0,1,2}, in (4.2)
and (4.4) we get the assumptions of MCL model
from Mack (1999) (see also Mack (1993), Mack 6. Applications of GMCL model
(1994) and Saito 2009).
• If we put γi,j = di,j = wi,jC αi,jj , for αj ∈ , in (4.2) In our numerical example in Section 3.5 we have
and (4.4) we get the stochastic chain-ladder model seen that the assumption about the same weights in
from Murphy, Bardis and Majidi (2012). estimation of parameters sk and fk yields for some
methods to an artificial reduction of variability of
reserves amounts (refer to Table 2). To overcome
5. Main results this difficulty, we introduced the different weights
5.1. Single accident years γi,j and di,j in computation of fˆk and ŝk, respectively.
In the following application we indicate how one can
Result 5.1 (Conditional MSEP estimator for a possibly estimate the weights γi,j and di,j and we point
single accident year). out some other interesting applications.
ˆ (Ci,I ) = Cˆ i,I ( ) i ( Γˆ )
2
msep i, I + ∆ˆ i,I , (5.1)
6.1. Method proxy for factors selection
Ci,I DI
∆ˆ i,I =
J −1 σˆ 2k I −k
( γ j,k )2 i 1 all five methods presented, we take α = β = 0. We turn
∑
k
2
i ∑ δ j,k {δ j,k ≠ 0} , (5.3)
back to our numerical example from Section 3.5 and
k = I − i +1 I −k j =1
∑ γ j,k we evaluate the same estimators for the already-
j =1
presented five methods with the only difference in
and fˆj and ŝ 2j are given in (4.2) and (4.4)–(4.5), weights of sk estimation. More precisely:
respectively.
ALL AV: The ŝ 2k are estimated with w i,jd = 1 for
(1)
all i, j. Parameters fˆk are computed as a arithmetic
5.2. Aggregation over prior accident year average of all individual link ratios Fi,j. More
Result 5.2 (Conditional MSEP estimator for precisely, we define the weights in the following
aggregated years). way: w γi,j = 1 for all i, j.
ˆ) and CV(R̂)
Table 3. Estimators of R̂, MSEP (R
alpha = 0
ALL AV AEHL 5 Years AV 3 Years AV Median
Item/method (1) (2) (3) (4) (5)
R̂ 93 643 65 868 75 886 68 645 54 059
MSEP(R̂ ) 1/2
92 549 88 105 101 643 113 904 105 786
CV(R̂) 99% 134% 134% 166% 196%
(1)/(1) (2)/(1) (3)/(1) (4)/(1) (5)/(1)
R̂ (%) 100% 70% 81% 73% 58%
MSEP(R̂) 1/2
(%) 100% 95% 110% 123% 114%
6.2. Robust estimation in GMCL model and Ṽar( f˜k) respectively. These two quantities can
be derived by numerous techniques described in
From the previous two numerical examples (MCL
the literature, such as: M-estimation, Lp estimation,
vs. GMCL results), we observe that, in general, the first
etc. (Huber and Ronchetti 2009) or trimmed mean
approach underestimates and second overestimates
(Jeng 2010).
the MSEP of claims reserves (see Tables 2 and 3).
Step 2. For every k = 1, . . . , I – 1, we find αk
In this section we present an intermediate solution
by solving the following equation fˆk = f˜k, where fˆk is
for our general problem that allows us to evaluate the
given in equation (4.2), namely
estimation of MSEP of reserves in case of develop-
ment factor selection. This go-between solution is I −k
based on the robust statistics in estimation of model ∑ Ciα,k Fi,k
k
Table 4. Estimation of f˜k and s.e(f˜k) using LAD technique corresponding to sample median method
k 1 2 3 4 5 6 7 8 9
f˜k 4,2597 1,5992 1,1635 1,1657 1,1318 1,0335 1,0333 1,0180 1,0092
s.e(f˜k) 1,7974 0,1686 0,1411 0,0270 0,0472 0,0251 0,0065 0,0129 —
CV(f˜k) 42,2% 10,5% 12,1% 2,3% 4,2% 2,4% 0,6% 1,3% —
estimation by means of trimmed mean estimators k = 4, and k = 8 the parameters αk and βk need to be
(Jeng 2010). specified in different way. This could be done using
To apply the above fitting algorithm for sample any other approach that is being judged appropriate
median method, we use the LAD (least absolute by the actuary performing estimation. In our case,
deviation) estimation procedure. The theoretical we put α3 = β3 = α6 = β6 = α8 = β8 = 0 to have from
framework of LAD is presented in Appendix D. one hand the optimal properties (see Proposition 4.1)
The values of f˜k are given by computing the sample but also to be consistent with our choice of α = 0
median from Fi,k as described in Section 3.5. The in our two previous numerical applications (see
standard errors of f˜k are obtained via bootstrap tech- Sections 3.5 and 6.1).
niques. In Table 4 we present the numerical values The estimation of parameters αj and βj are stated
of f˜k, s.e(f˜k) := Ṽar( f˜k)1/2 and CV( f˜k) := s.e( f˜k)/f˜k. Note in Table 5. The values of α̂ j and β̂ j for which we arbi-
that the last value of s.e( f˜k) cannot be estimated from trarily put 0 are indicated with bold font characters.
the data for the reasons discussed in the case of sI–1 The MSEP and claims reserves amount estimators
estimation in Section 3.2 (only one observation avail- are stated in Table 6. Observe that the robust estima-
able). This why we do not fit the parameter βI–1 via tion is a good compromise between the method with
equation (6.2), but we put βI–1 := αI – 1. the same weights (see Section 3.5) and the method
The standard deviations of f˜k from Table 4 were where we use all link ratios in sk estimation (see
obtained by using the rq function integrated in free proxy method in Section 6.1).
R software. Note that the value of f˜k given by this
function are slightly different from those presented 6.3. Validation of results
in Section 3.2. This is probably due to the optimiza- from reserving softwares
tion algorithm that is used in R. Given that these dif- The next interesting and extremely important
ferences are insignificant, we decided to present in application of our GMCL model is the possibility of
Table 4 the same numerical values of the estimators
f˜k as given in Section 3.2. The corresponding R code
is available on request from the author. Table 6. Median method with robust estimation
the solutions of (6.1) and (6.2) are not always avail- Item/method MCL Robust Proxy
able. For instance, in our example, there is no solu- R̂ 54 059 63 165 54 059
tion of equation (6.1) for k = 3,4 and no solution of MSEP(R̂)1/2 14 786 40 312 105 786
CV(R̂) 27% 64% 196%
equation (6.2) for k = 8. This means that for k = 3,
validating the results from industry reserving soft- results we used 100 000 simulations. We begin our
ware. The stochastic chain-ladder type methods are analysis with the classic chain-ladder method in
used to evaluate the economic risk capital required which the estimators of fk are the all volume weighted
by Solvency II for so-called reserve risk. In fact, this average and are consistent with Mack (1993). More
capital requirement for reserve risk is computed as precisely, with the hypothesis of the MCL method
the 99.5th percentile (value at risk) of run-off result with α = 1, we compare the estimate of MSEP
distribution (profit/loss on reserves over one year). obtained by these two techniques: bootstrap from
This means that Solvency II defines the reserve risk ResQ and explicit formula given in MCL approach.
in one-year time horizon, which is different from the The corresponding numerical values are respectively:
standard approach considering the distribution of the 27 150 (see ResQ(Boot) (3) in Table 7) and 26 909
ultimate cost of claims. (see ResQ(MCL) (4) in Table 7). We observe a good
However, one of the methods to derive the one-year convergence for bootstrap (the relative error is less
reserve risk is based on simple scaling of ultimate than 1%). We consider now the different estimator
view. This technique is based on using the results of fk computed as a simple arithmetic average of
of Merz and Wüthrich (2008), which is currently individual link ratios Fi,j. This is equivalent to taking
a popular methodology throughout the market and α = 0 in the MCL framework. In that case, we
taken from the latest technical literature on this topic. observe that the estimates of MSEP for both methods
The empirical loss distribution in ultimate view is become divergent: 75 656 (see ResQ(Mack) (2) in
often derived by using the bootstrap techniques and Table 7) and 58 475 (see ResQ(Boot) (1) in Table 7).
Monte Carlo simulations. The first technique is used This is due to the fact that the ResQ(Mack) method
to evaluate the estimation error and the second to is obtained by approximation based on the MCL for-
approximate the process variance. This kind of boot- mula with α = 1. In fact, according to the technical
strap approach is also available in ResQ software, documentation, the ResQ estimates of parameters
which is used worldwide within the P&C insurance fk and sk in the bootstrap approach are of the form
market. The question is how to validate the results (up to multiplicative constant for bias reduction):
from bootstrap method provided by reserving tools 1 1
fˆk = ∑I–k
i=1Fi,k and s k =
2
∑I–k ˆ 2
i=1Ci,k(Fi,k – fk) .
such as ResQ. One of the possible solutions is to I−k I − k −1
compare the estimation of the first two moments It is easily seen that these estimators are consistent
of loss distribution from bootstrapping (based on with our general approach with α = 0 and β = 1 (see
simulations) with the estimators of reserves and (4.2) and (4.4) in Section 4). The MSEP estimator
MSEP of reserves obtained by the explicit formulas. is equal to 59 065 (see GMCL (5) in Table 7). This
For the sake of simplicity, we assume that there is shows that GMCL method allows one to validate
no factors selection (all weights wi,j are fixed to 1). the results and detect the incoherences. Effectively,
We use the RAA run-off triangle and we present the the choice of estimators in ResQ for the case α = 0
numerical results in Table 7. For all bootstrapping is not optimal in sense of Proposition 4.1. It remains
Table 7. Comparison of ResQ estimators of R̂ and MSEP(R̂) with MCL and GMCL models
alpha = 0,
alpha = 0 alpha = 1 beta = 1
ResQ(Boot) ResQ(Mack) ResQ(Boot) MCL GMCL
Item/method (1) (2) (3) (4) (5)
R̂ 93 630 93 643 52 204 52 135 93 643
MSEP(R̂)1/2 58 475 75 656 27 150 26 909 59 065
∑ I–1
l=I+1–iVar(Ci,l+1DI)∏ k=l+1 f k can be estimated via the
( E (C )
I–1 2
DI ) − Cˆ i,I
2
( )
2
estimate of Var(Fi,lCi,1,...,Ci,l). Indeed, this is achieved = Ci2,I +1−i fI +1−i i . . . i fI −1 − fˆI +1−i i . . . i fˆI −1 . (A.4)
Ci2,l Cˆ i2,l
if we estimate E DI by .
α
wi,l i Ci,l wi,l i Cˆ iα,l As can be easily seen, this expression cannot be
However, in Murphy, Bardis, and Majidi (2012), estimated by replacing fk with fˆk. In order to esti-
the authors used different approach based on normal mate the right hand side of (A.4) we use the same
approximation. approach as in Mack (1993), Mack (1994). Saito
Note that in the Section 6.3 we obtained that the (2009) followed the same technique of estimation.
above estimator of Var(Ci,l+1DI) is consistent with that However, in Murphy, Bardis, and Majidi (2012) we
provided by bootstrap technique from ResQ software. can find different approach which was also presented
This is shown for the particular run-off triangle in Buchwalder et al. (2006a). It is worth noting that
and the assumption that Ci,j are gamma-distributed in the paper of Mack, Quarg, and Braun (2006) the
random variables. It would be interesting to perform authors criticised the approaches of Buchwalder
the extensive simulation study in order to examine et al. (2006a) and showed that the estimate of estima-
the exactitude of this estimate with other data and tion error form Mack (1993) is hard to be improved
probability distributions. (see also Buchwalder et al. 2006b). As the answer for
Ci2,l Cˆ i2,l the criticism of Mack on article of Buchwalder et al.
We apply now Lemma A.1 with Eˆ DI =
δ i,l δˆ i,l (2006a), the authors provided the bounds for estima-
and by replacing the unknown parameters fk et s 2k tion error and claimed that the Mack estimator, in
with their estimators fˆk and ŝ k2. Together with the some particular cases, is closed to these bounds (see
equality Ĉi,l = CI+1–i∏ l–1 ˆ Wüthrich, Merz, and Bühlmann 2008). This should
k= I+1–i f k (see Proposition 4.1 (v))
we conclude be confirmed by performing the extensive simulation
study to quantify the different approaches of error
I −1
Ci2,l I −1 estimation in stochastic Chain-Ladder framework.
Var (Ci,I DI ) = ∑ δ DI σ l2 ∏ f k2
E
l = I +1− i i,l k =l +1 We define,
I −1
Cˆ i2,l 2 I −1 ˆ 2 F = fI +1−i i . . . i fI −1 − fˆI +1−i i . . . i fˆI −1
= ∑ ˆ σˆ l ∏ f k
l = I +1− i δ i,l k = l +1
= SI +1−i + . . . + SI −1 , (A.5)
I −1
σˆ 2 l −1 I −1
= ∑ l Ci2,I +1−i ∏ fˆ k2 i ∏ fˆ k2
ˆ
l = I +1− i δ k = I +1− i k = l +1 with
i,l
I −1
σˆ l2 fˆ l2 I −1 ˆ 2
= Ci2,I +1−i ∑ ∏ fk Sk = fˆI +1−i i . . . i fˆk −1 fk fk +1 i . . . i fI −1
l = I +1− i δˆ k = I +1− i
i,l
F 2 = ( SI +1−i + . . . + SI −1 )2
We now turn to the second summand of the
I −1 I −1 I −1
expression (A.1). Because of Proposition 4.1 (iv) = ∑ Sk2 + 2 ∑ ∑ S j Sk . (A.7)
and (v) we have, k = I +1− i k = I +1− i j < k
Bk = {Ci, j : i + j ≤ I + 1, k ≤ j} ⊂ DI .
2
I I
= E ∑ Cˆ iI − ∑ CiI DI
i =1 i =1
Then, we can estimate F 2 by
I I I 2
= Var ∑ Ci,I DI + E ∑ Ci,I DI − ∑ Cˆ i,I
I −1 I −1 Var ( fˆk Bk ) i =1 i =1 i =1
2 = fˆ l2
F ∏ ∑ .
l = I +1− i k = I +1− i fˆ k2
The independence of accident years yields
I 2
E C D − I Cˆ
I −k
γ 2 ∑ i,I I ∑ i,I
∑δ j, k
i 1{δ ≠ 0}
j, k
i =1 i =1
Var ( fˆk Bk ) = σ 2 j =1 j, k
. (A.8) I 2
= ∑ ( E (Ci,I DI ) − Cˆ i,I )
k I −k 2
γ
∑ j,k i =1
j =1
I
= ∑ ( E (Ci,I DI ) − Cˆ i,I )( E (C j,I DI ) − Cˆ j,I )
The proof of Proposition A.2 is postponed to i, j
Appendix A.5.
Finally, using (A.4) and Proposition A.2 we estimate Taking together
E(Ci,I DI) – Ĉi,I)2 by I I
I ˆ D ∑ Ci ,I = ∑ msep
msep ˆ D (CiI )
∑
i =1
CiI I
i =1 i=2 CiI I
I −k
γ2
∑ δ j,k i 1{δ ≠ 0} I
I −1
σˆ 2 j, k
C 2
i,I +1− i fˆ I2+1−i i . . . i fˆ I2−1 ∑ ˆ 2k
j =1 j, k
I −k 2
+ ∑ 2 i Ci,I +1−iC j,I +1− j Fi Fj ,
k = I +1− i f k γ 2≤i ≤ j ≤ I
∑ j,k
j =1
with
I −k
γ2
∑ δ j,k i 1{δ ≠ 0}
Fi = fI +1−i i . . . i fI −1 − fˆI +1−i i . . . i fˆI −1 =
I −1
I −1
σˆ 2 j, k
∑ Ski ,
= Cˆ 2i,I ∑ 2k
j =1 j, k
. k = I +1− i
ˆ
k = I +1− i f k
I −k
γ
2
∑ j,k
j =1 where
This completes the proof of Result 5.1. Ski = fˆI +1−i i . . . i fˆk −1 ( fk − fˆk ) fk +1 i . . . i fI −1 .
I −k I −k I −k
I −1Var ( fˆk Bk ) = ∑ δ i,k F i2,k − 2 ∑ δ i,k Fi,k i fˆk + ∑ δ i,k fˆk2 .
Fi Fj = ∑ ( fˆI +1−i i . . . i fˆI −1 ) i =1 i =1 i =1
k = I +1− i fˆ k2
Since di,k are s(Ci,k) measurable, we have
i ( fˆ I +1− j i . . . i fˆI −1 ).
I −k
E (( I − k − 1) i σˆ 2k Bk ) = ∑ δ i,k E ( F i2,k Bk )
We finally conclude, from Proposition A.3 i =1
I −k
I − 2 ∑ δ i,k E ( Fi,k i fˆk Bk )
∑
2 i Ci,I +1−iC j,I +1− j Fi Fj
i =1
2≤i < j ≤ I
I −k
+ ∑ δ i,k E ( fˆ k2 Bk ).
I I I −1
σˆ 2k fˆ k2 I −k γ l2,k i 1{δ j,k ≠ 0}
= ∑ Cˆ i,I ∑ Cˆ j,I ∑ ∑
i =1
2 2
i .
I −k
γ δ l,k
∑
i=2 j = i +1 k = I − i +1 l =1
l =1
l, k
In the following derivation we use s(Ci,k)-
measurability of γi,k and definition of fˆk from (4.2).
Furthermore, the assumption GMCL.3 implies that Fi,k
A.3. Proof of Proposition 3.1
and Fj,k are independent for i ≠ j. From assumption
i. See Theorem 2 p. 215 in Mack (1993). GMCL.1 and GMCL.2 we easily see that E(F 2i,kBk) =
ii. See discussion on p. 112, Corollary on p. 141 σ 2k
+ f k2 . Taking together,
and Appendix B on p. 140 in Mack (1994). δ i,k
iii. See Appendix E on p. 151 in Mack (1994).
1 I −k
iv.
See Theorem 1 p. 215 and discussion after (
E Fi,k i fˆk Bk = ) I −k ∑ γ j,k i E ( Fi,k i Fj,k Bk )
the proof of Theorem 2 on page 216 in Mack ∑ γ l,k j =1
l =1
(1993).
I −k
γ i E F2 B + γ
v. see Appendix C p. 142 in Mack (1994):
1 i,k
( i,k k ) ∑ j,k
= I −k
j ≠i
A.4. Proof of Proposition 4.1 ∑ γ l,k i E ( F B ) i E ( F B )
i, k k j, k k
l =1
(i), (iv) and (v), see proofs of (i), (iv) and (v) respec-
1 σ 2k 2
I −k
2
tively in Proposition 3.1. = I −k γ i,k i δ + f k + ∑ γ j,k f k
(ii). The first part of the statement regarding to ∑ γ l,k i, k j ≠i
l =1
the minimal variance of parameters fk can be
easily derived from the proof of (ii) in Propo- 1 σ 2k I −k
= I −k γ i, k i
δ i,k
+ f 2
k ∑ γ j,k
sition 3.1. The rest of the proof is easily seen ∑ γ l,k j=i
i =1 δ i,k i =1
∑ γ i,k
l
δ i,l
i =1
2
Ci,l
γ I −k 2 = σ l2 E DI + f l2Var [Ci,l DI ].
∑δ j, k δ
i,l
I −k
+ ∑ δ i,k σ 2k
j =1 j, k
2 + f 2
k (A.11)
i =1 I −k
∑ γ j,k
j =1 We multiply the both sides by ∏ I–1 k=l+1 f k with the con-
2
I −k
i =1
∑ γ j,k
2
I −1
C i,l I −1
j =1 = ∑ σ l2 E δ DI ∏ f k2
l = I +1− i i,l k =l +1
I − k I − k γ j,k
2
∑ δ ∑
I −1 I −1
∑ Var [Ci ,l DI ] f l2 ∏ f k2Var (Ci ,I DI )
i, k
i =1 j =1 δ j, k +
= ( I − k − 1) σ 2k + σ 2k 2 − 1 . (A.10) l = I +1− i k =l
I −k
∑ γ j,k I −2 I −1
j =1 + ∑ Var (Ci ,l +1 DI ) ∏ f k2
l = I +1− i k = I +1
Finally 2
I −1
Ci,l I −1
= ∑ σ l2 E DI ∏ f k2
E ( σˆ 2k − σ 2k ) = E [ E [( σˆ 2k − σ 2k ) Bk ]] l = I +1− i δ i,l k =l +1
I −1
I − k I − k γ 2j,k + Var [Ci,I +1−i DI ]
∑ δ i,k ∑ ∏ f k2
σk j =1 δ j, k
2 k = I +1− i
= E i =1
− 1 .
I − k − 1 I −k
2
I −1 I −1
∑ γ j,k + ∑ Var [Ci,l DI ]∏ f k2 . (A.12)
j =1 l = I + 2−i k =l
Since Var[Ci,I+1-i DI] = 0 and from the fact that A.5.4. Proof of Proposition A.3
We find the estimator Fi Fj in the similar way to
I −2 I −1 I −1 I −1 2
the estimator F (see proof of Proposition A.1).
∑ Var (Ci,l +1 DI ) ∏ f k2 = ∑ Var [Ci ,l DI ] ∏ f k2 ,
l = I +1− i k = l +1 l = I + 2− i k =l
that we approximate S k2 and Sj Sk by varying and aver- i fˆI2+1−i i . . . i fˆk2−iVar ( fˆk Bk ) fˆk2+1 i . . . i fˆI2−1
aging as little data as possible so that as many values
Ci,k from data observed are kept fixed. Due to Propo- I −1 Var ( fˆk Bk ) ˆ
= ∑ fI +1− j i . . . i fˆI −i
sition 4.1 (i) we have E( fˆk – fk) = 0 and therefore k = I +1− i fˆk2
E(Sj SkBk) = 0 for j < k because all fr, r < k, are scalars
under Bk. Since E(( fk – fˆk)2Bk) = Var( fˆkBk) we obtain i fˆ I2+1−i i . . . i fˆk2−1 i fˆk2 i fˆk2+1 i . . . i fˆI2−1
from (A.6)
I −1 Var ( fˆk Bk ) ˆ
= ∑ ( fI +1− j i . . . i fˆI −1 )
E ( S 2k Bk ) = fˆI2+1−i i . . . i fˆk2−i Var ( fˆk Bk ) f k2+1 i . . . i f I2−1 . k = I +1− i fˆk2
with ∑ I–1
k=I+1-i E(S k Bk) and the unknown parameters
2
I −1 Var ( fˆk Bk )
∑ fˆI2+1−i i . . . i fˆk2−1 fˆk2 fˆk2+1 i . . . i fˆI2−1 .
k = I +1− i fˆk2
I −k
γ F
∑
j =1 j,k j,k
Var ( fˆk Bk ) = Var I − k Bk
∑ γ j,k
j =1
I −k
∑ γ 2j,kVar ( Fi,k Bk ) i 1{δ j,k ≠ 0}
j =1
= I −k 2 ,
γ
∑ j,k
j =1
B. Data
We present in Table B.8 the triangle of RAA data analysed in Mack (1994) and Murphy, Bardis, and
Majidi (2012).
D. LAD estimator
{ } {∑ }
n n m
min ∑ ei = min yi − ∑ xijβ j ,
β β
The least absolute deviation (LAD) method or L1 i =1 i =1 j
, k bi, k ∑ ai, k ci, k , and all coefficients are positive, the second derivative
i =1 i =1
of h is strictly positive. This means that first deriva-
with ai,k := Ci,k, bi,k := γ 2i,k/(∑ i=1
I–k
γi,k)2, ci,k := 1/(I – k – 1) tive of h is increasing function. Together with the pre-
( fˆk – Fi,k) and Nk := I – k. In the sequel, without the vious facts it implies that h has an absolute minimum.
loss of generality, we omit the index k corresponding In consequence, the equation (23) has zero, one or
to the column of run-off triangle. Thus, we consider two solutions.
the function In the case where two solutions of opposite sign
exist, the actuary should decide which one cor
N N
h (β ) := ∑ ai−β bi ∑ aiβci , responds better to the considered line of business.
i =1 i =1 In fact, as mentioned in Murphy, Bardis, and Majidi
(2012), the choice of negative solution does not
with ai ≥ 0, 0 ≤ bi ≤ 1 and ci ≥ 0. We rewrite the
seem to be unreasonable in some situations. This
function h as follows:
issue is out of scope of this paper. In our numerical
N N example the solution is determined by Excel tool
h (β ) := ∑ ∑ ( ai−β bi ) ( aβj c j ) . called solver.
i =1 j =1
Reserving Method (Mack and Murphy Revisited),” ASTIN Merz, M., and M. V. Wüthrich, “Modelling the Claims Develop-
Bulletin 36: 2, 2006a, pp. 521–542. ment Result for Solvency Purposes,” Conference Paper, ASTIN
Buchwalder, M., H. Bühlmann, M. Merz, and M. V. Wüthrich, Colloquium, Manchester, July 2008.
“The Mean Square Error of Prediction in the Chain Ladder Murphy, D. M., “Unbiased Loss Development Factors,” Insurance
Reserving Method—Final Remark,” ASTIN Bulletin 36, 2006b, Mathematics and Economics 18: 3, 1996, p. 228.
pp. 553–553. Murphy, D., M. Bardis, and A. Majidi, “A Family of Chain-Ladder
Huber, P. J., and E. M. Ronchetti, Robust Statistics, 2009. Factor Models for Selected Link Ratios,” Variance 6, 2012,
Jeng, H.-W., “On Small Samples and the Use of Robust Estimators pp.143–160.
in Loss Reserving,” Casualty Actuarial Society E-Forum, Saito, S., “Generalisation of Mack’s Formula for Claims Reserv-
Fall 2010. ing with Arbitrary Exponents for the Variance Assumption,”
Mack, T., “Distribution-Free Calculation of the Standard Error of Journal of Mathematics for Industry 1, 2009, pp. 7–15.
Chain Ladder Reserve Estimates,” AST1N Bulletin 23, 1993, Sloma, P. Contribution to the Weak Convergence of Empirical
pp. 213–222. Copula Process. Contribution to the Stochastic Claims Reserv-
Mack, T., “Measuring the Variability of Chain Ladder Reserve ing in General Insurance. PhD thesis, Université Pierre et Marie
Estimates,” Casualty Actuarial Society Forum 1, 1994, Currie, 2014.
pp. 101–182. Sloma, P., “General Model for Measuring the Uncertainty of the
Mack, T., “The Standard Error of Chain Ladder Reserve Estimates Claims Development Result (CDR),” Proceedings of the Actu-
Recursive Calculation and Inclusion of Tail Factor,” ASTIN arial and Financial Mathematics Conference, February 10–11,
Bulletin 29: 2, 1999, pp. 361–366. 2011.
Mack, T., G. Quarg, and C. Braun, “The Mean Square Error of Pre- Wüthrich, M. V., M. Merz, and H. Bühlmann, “Bounds on the
diction in the Chain Ladder Reserving Method: A Comment,” Estimation Error in the Chain Ladder Method,” Scandinavian
ASTIN Bulletin 36: 2, 2006, pp. 543–552. Actuarial Journal 2008: 4, 2008, pp. 283–300.