Overdispersed-Poisson Model in Claims Reserving: Closed Tool for One-Year Volatility in GLM Framework

Strascia, Stefano Cavastracci; Tripodi, Agostino

doi:10.3390/risks6040139

Open AccessFeature PaperArticle

Overdispersed-Poisson Model in Claims Reserving: Closed Tool for One-Year Volatility in GLM Framework

by

Stefano Cavastracci Strascia

^*,† and

Agostino Tripodi

^†

IVASS, Prudential Supervision, 00187 Rome, Italy

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Risks 2018, 6(4), 139; https://0-doi-org.brum.beds.ac.uk/10.3390/risks6040139

Submission received: 14 September 2018 / Revised: 22 November 2018 / Accepted: 25 November 2018 / Published: 5 December 2018

Download Versions Notes

Abstract

:

The aim of this paper is to carry out a closed tool to estimate the one-year volatility of the claims reserve, calculated through the generalized linear models (GLM), notably the overdispersed- Poisson model. Up to now, this one-year volatility has been estimated through the well-known bootstrap methodology that demands the use of the Monte Carlo method with a re-reserving technique. Nonetheless, this method is time consuming under the calculation point of view; therefore, approximation techniques are often used in practice, such as an emergence pattern based on the link between the one-year volatility—resulting from the Merz–Wüthrich method—and the ultimate volatility—resulting from the Mack method.

Keywords:

claims reserving; prediction error; claims development result; one year view

1. Introduction

About 10 years ago, in Italy, the use of the generalized linear model (GLM)—to estimate the claims reserve began to spread out both in the academic world and in the insurance market; in 2006, some excellent specialized series of lectures on this subject were sponsored by the Concentric Company and held by Richard Verrall of the London Cass Business School, one of the main developers of this implementation—at the stock-exchange offices in Milan. One obstacle, as to the ability to acquire such models in the Italian actuarial practice, was the need to include information about the claim number in this kind of estimate, as we had already done for years with deterministic methodologies.

Today, rather than a higher flexibility in order to better these models inherent predicting ability, a general development of derivative models mostly featuring a different theoretical background and explained in a series of papers issued in later years—can be observed. These models, which have tried to overcome GLM’s own limits, can be classified into four categories:

(a): GLMs including families different from exponential class without distribution restrictions GLZ Venter (2007);
(b): Antonio and Beirlant (2008) GLMM (generalized linear mixed model) that allow for overcoming the hypothesis of independence among payments of claims occurring in the same generation but in different years—processed with stochastic simulation techniques;
(c): Bjökwall et al. (2011) GLMs with smoothing effects;
(d): Hudecovà and Pešta (2013) GEE (generalized estimating equations) implementation, where the connection among payments of the same accident year is made through a closed tool.

However, up to now, their use has remained confined to the academic environment for different reasons: GLZ mainly improves the historical data fitting; the GLMMs—based on a semiparametric regression model—could present heavy computational cost in a professional environment; the GLMs with a smoothing effect always pose the risk to distort the information included in companies data; GEEs are possibly among the most interesting ones. Nevertheless, experts who have studied them have formulated neither an exact theory about the starting correlation estimates to be given to the algorithm of optimisation of the parameters input—which is strictly dependent on them—nor an exhaustive definition of the error prediction formulae.

Back to the GLMs, however, it is important to remark that, in 2016, the Casualty Actuarial Society has issued a substantial monograph Taylor and McGuire (2016) about these models, focused on the diagnostics and modification of the regression patterns. This work, on the contrary, is aimed at deriving the volatility of a particular GLM with Over Dispersed Poisson (ODP) distribution through a closed tool, in a one-year horizon framework. This volatility is particularly relevant to calculate the requirement for the reserve risk capital of the internal models, which so far has been calculated through simulation techniques such as the bootstrapping used in two phases—the so called re-reserving. The authors have drawn inspiration from an essay of Merz and Wüthrich (2015), where the volatility estimate for the overall accident years is calculated in the ODP cross-classified model, through the propagation error in physics. This result has been adapted to the GLM model, this way getting a one-year volatility formulae both for each and for the total of the accident years. The evidence we have talked about will be introduced by a description of the GLM application to the loss-reserving problem, with the addition of useful numerical examples.

On a general basis, the claim reimbursements that have not yet been paid at the end of the financial year imply the claims reserve. The nature of such a balance-sheet item estimate is a major risk source for non-life undertakings, due to the problems that its potential underestimation can bring about. In order to get a proper quantification, actuarial methodologies have already been an integral part of the specific estimation process for a long time. This is often due to the lacking of the case by case assessment of each claim file adopted by companies to calculate the ultimate cost representation for the long tail branches. Despite, in comparison with the traditional methods, stochastic methodologies are less ready-to-use, they have several advantages: they are based on explicit and coherent statistical hypothesis; they get the ad hoc adjustments and discretionary estimates to the minimum; beyond the very accurate best estimate of reserve, they provide confidence intervals of the reserve itself in line with fixed probability levels. Notably, through these methodologies, it is always possible to get to an estimate of the first order-moments mean and the second-order moments variance of the reserve distribution. Of course, also the overall probability distribution can be deducted, either through analytical methods—if further appropriate hypothesis are adopted—or through simulation techniques. The use of stochastic methods has been consolidated through the Solvency II project, by reaching a co-ordinated target in a probabilistic key (best estimate added to risk margin) as a prescribed requisite to estimate the claims reserve and the reserve risk capital. Indeed, in such a framework, an exact definition of best estimate, risk margin and reserve risk capital can only be provided by the application of a stochastic model of estimate to the historical time-series of the claims. Here, the risk margin is additional and aimed at clearly quantifying the risk capital yield according to the uncertainty level of the cash flows to come.

In stochastic estimates—beyond the financial kind of uncertainty, linked to the investments yields and to the legal aspects connected with the paying-off delay—three kinds of risk must be taken into account: model risk, estimation risk, and process risk. Model risk means the risk that an unfitting model could be used to represent the phenomenon; the estimation risk is linked to the volatility of the estimator used in order to infer on the model parameters; the process risk is linked to the variance of the phenomenon under scrutiny.

In order to create a connection with the previous practice, many of the stochastic models for the reserving have been built by widening the traditional deterministic techniques, particularly the well-known chain ladder methodology, based on the development of the cumulative payments. Keeping this in mind, we need to emphasise that some of the most used stochastic methods—Mack and ODP just allow to make automatic estimates of the reserve and only apply when the basic chain ladder hypothesis are met. The claims reserving working party of 2002 British actuaries has spotted nothing short of 26 qualitative factors to be taken into account in the claims reserving. Nonetheless, this limit is more neglectable in determining the capital requisite because it is function of a volatility quantification. As to the estimate of cash flows of future payments which have already happened, and to estimate the different kinds of risk that have to be taken into account in the risk margin assessment, this work uses the stochastic models evolution included in the GLM class. It is known that such models allow for using different distributions for the response variable and the explicative variable’s parameters which are estimated to be linked to the response variable. Therefore, different traditional methods to estimate the claims reserve can be reviewed under this light; as we have already said, claims reserve estimates resulting from particular generalized linear methods indeed match with the ones resulting from deterministic methods to estimate the claims reserve, such as the chain ladder and the separation methods. Some works about the implementation of GLM to estimate the claims reserve are quoted in our bibliography Despeyroux et al. (2003); Englad and Verral (2001); Englad and Verral (2002); Gigante and Sigalotti (2004); Renshaw and Verrall (1998); and Taylor and McGuire (2004). As a general rule, in order to confine the model risk, as a first step, a wide range of models must be considered to pick the one that better fits data, according to a proper good adaptation quantification. In the reference framework—as a measure of goodness of fit to compare the different models—the log-likelihood is estimated in the case of distributions belonging to the exponential family of random variables—for instance gamma, poisson and inverse-Gaussian distribution. The extended quasi-likelihood function introduced by Nelder and Pregibon (1987) is used instead in the semi-parametric case where just the relation between the variance and the mathematical expectation of the response variable is specified.

Once the model has been chosen, the second step consists of approaching data by modifying the regression structure, for example by adding a particular explicative variable to point out an outlier, keeping as a target an optimal figure of the scaled deviance function (over degrees of freedom).

In Section 1, an overview about the use of the generalized linear models to estimate the claims reserve and all the themes connected to it, also with reference to the framework Solvency II, is provided. In Section 2, data organization in the run-off triangle is illustrated, short hints about the widely-used chain ladder method are provided and the notion of claims development result is introduced. In Section 3, the GLM model to estimate the claims reserve is illustrated. In Section 4, the relation to estimate the ultimate volatility through the GLM method is described. Section 5 is the main focus of this paper and explains how to get the algorithm that allows to estimate the one-year volatility through a closed tool in the GLM framework. Finally, in Section 6, a practical case is presented by using an Italian insurance company disguised run-off triangle.

2. Claims Reserve Estimation

In non-life undertakings, in order to estimate the claims reserve for accidents still to be paid generated by an insured risk portfolio at the end of the financial year, we generally make reference to the historical payments triangle, updated at the estimate date.

Notably, we assume that the observations concerning payments already made are connected to accidents happened in a limited previous time-framework; thus, sums paid for accidents happened or generated in previous years are available in this kind of diagram.

For each accident year, data are divided into development years, a variable which quantifies the claim payment year.

2.1. Data Organization

Given

Y_{i j}

as the paid sum, with j as the delay in payment for accidents happened in the i-th year, usually called incremental payment. These payments are usually represented in the so-called run-off triangle (see Table 1).

Given instead

C_{i, j} = \sum_{k = 0}^{j} Y_{i k}

as the cumulative payment, i.e., the sum paid-off for the i generation within the first J development years, the recursive relation

C_{i, j} = C_{i, j - 1} + Y_{i j}

with

j > 0

is effective. The ratio

F_{i, j - 1} = C_{i, j} / C_{i, j - 1}

, named link ratio, is the factor connecting the cumulative payment between two close development years—the

j - 1

and the j- for the same i generation. Assuming that the payment process of each generation will be surely over within J years, the overall cost of the i generation will be:

C_{i, J} = \sum_{k = 0}^{J} Y_{i k}

; writing again the overall cost in the sum of the two addends will make things clearer:

C_{i, J}^{(t)} = \underset{deterministic}{\underset{︸}{\sum_{k = 0}^{t - i} Y_{i k}}} + \underset{stochastic}{\underset{︸}{\sum_{k = t - i + 1}^{J} Y_{i k}}},

(1)

since in the t balance-sheet year the first addend is known for sure, while the second is subjected to randomness. Therefore, the claims reserve estimate for the i generation for the t balance-sheet year, concerns the random component of

C_{i, J}^{(t)}

, that is to say, we have:

R_{i}^{(t)} = \sum_{k = t - i + 1}^{J} Y_{i k} = C_{t - j, J}^{(t)} - C_{t - j, j}

. We will instead name

{\hat{R}}_{i}^{(t)} = \sum_{k = t - i + 1}^{J} {\hat{Y}}_{i k}

the claims reserve estimate, made at the t time, and

{\hat{R}}^{(t)} = \sum_{i + j > t} {\hat{Y}}_{i j}

the overall claims reserve for all generations. In the following passages, we will use t to make reference to the current date of estimate.

2.2. Chain Ladder Method: Basic Concept

The idea underlying the chain ladder method is that there is a proportion between the cumulative payments of two close development years, except for an erratic component with a null mean:

C_{i, j + 1}^{(t)} = C_{i, j} f_{j}^{(t)} + ϵ_{i j} i = 1, \dots, I - j - 1,

(2)

looking at Equation (2), we conclude that, in the chain ladder model, the cumulative payment are showed by a line through the origin for each j development year. If we assume the residuals variance is

V a r (ϵ_{i j}) = σ^{2} C_{i, j}

, the least square solution for the

f_{j}^{(t)}

estimate is:

{\hat{f}}_{j}^{(t)} = \frac{\sum_{k = 1}^{t - j - 1} C_{k, j + 1}}{\sum_{k = 1}^{t - j - 1} C_{k, j}} = \frac{\sum_{k = 1}^{t - j - 1} C_{k, j} F_{k, j}}{\sum_{k = 1}^{t - j - 1} C_{k, j}} j = 0, 1, \dots, J - 1,

(3)

which is the weighted average of all link ratios observed. This approach implies that the cumulative payments

C_{i_{1}, j}

and

C_{i_{2}, j}

for

i_{1} \neq i_{2}

are independent; each ratio j, beyond being independent from the i generation, must also have equal first two moments with a fixed j, thus the process of claim settlement must not have undergone structural changes in time. The ultimate cost

{\hat{C}}_{i, J}^{(t)}

estimate is calculated through the use of the factors

{\hat{f}}_{j}^{(t)}

:

{\hat{C}}_{i, J}^{(t)} = C_{i, t - i} \prod_{j = t - i}^{J - 1} {\hat{f}}_{j}^{(t)},

(4)

thus the claims reserve estimate is:

{\hat{R}}_{i}^{(t)} = {\hat{C}}_{i, J}^{(t)} - C_{i, t - i} .

(5)

2.3. The Claims Development Result

The CDR is the technical result of the evolution of the claim settlement process. In other words, it calculates if the claims reserve

R_{i}^{(t)}

—set aside in the generic t balance-sheet year, for the i generation—is enough to pay the claims

Y_{i, t - i + 1}

, between t and

t + 1

and to set aside the new claims reserve

R_{i}^{(t + 1)}

in

t + 1

formally:

C D R_{i, t + 1} = R_{i}^{(t)} - (Y_{i, t - i + 1} + R_{i}^{(t + 1)}) = C_{i, J}^{(t)} - C_{i, J}^{(t + 1)}

(6)

is a random variable if the observation moment is t, while it is a deterministic value if the observation moment is

t + 1

. In the risk estimate and solvency capital calculation framework, we are interested in t observation random variable, while, in the balance-sheet analysis framework, we are interested in the deterministic aspect observed in

t + 1

. Particularly, we have a loss if

C D R_{i, t + 1} < 0

, while we have a gain with a positive result.

3. Generalized Linear Models to Estimate the Claims Reserve

The GLM are a generally wide range of models in which it is possible to define and maximize the likelihood function while estimating the parameters. Assuming that for this function regularity conditions are respected, the parameter estimated through the maximum likelihood function method have got many properties, such as: consistency, asymptotic correctness and asymptotic normality. These properties allow for getting additional information about parameters and calculations about the goodness of fit. Furthermore, the same reserving estimates calculated through the traditional estimation methods can be replicated by using particular kinds of GLM. In our following calculations, with reference to a generic parameter

α

, we will use

\hat{α}

to make reference to its estimate and the

\tilde{α}

symbol to make reference to its corresponding estimator.

3.1. GLM Models Structure

In GLM models, the response variable is typically represented by observed payments

Y_{i j}

to estimate the claims reserve, while the accident year and the development year are used for the explicative variables. Notably, the explicative variables are used as qualitative factors and therefore coded through the dummy variables. For GLM models, the following properties are valid:

the $Y_{i j}$ are stochastically independent;
the density (or probability) function is in exponential family:

$f (y; θ_{i j}, ϕ) = exp \{\frac{ω_{i j}}{ϕ} [y θ_{i j} - b (θ_{i j})]\} c (y; θ_{i j}, ϕ),$

(7)

where $ω_{i j}$ is an indicated weight, $θ_{i j}$ is the prescribed parameter, $ϕ$ is the dispersion parameter independent from i, and j, and $b (.)$ and $c (.)$ are functions which identify the particular exponential family;
the moments can be generalized as follows:

$E [Y_{i j}] = g^{- 1} (x_{i j}^{⊤} β) = b^{^{'}} (θ_{i j}) a n d V a r [Y_{i j}] = \frac{ϕ}{ω_{i j}} b^{^{″}} (θ_{i j}) = \frac{ϕ}{ω_{i j}} V (μ_{i j}),$

(8)

where $x_{i j}$ is the column vector of the explicative variables, $β$ is the parameters vector and g is a continuous and invertible function which is called link function. In our following calculations, we will use h to make reference to the reverse of the link function, i.e., $h = g^{- 1}$ , while $V (μ_{i j}) = b^{^{″}} (b^{^{'} - 1} (μ_{i j}))$ is the so-called variance function.

We have indicated with X the design matrix, where the generic row is the vector which indicates the explicative variables for the matching response variable calculation, while

η = X β

is the linear predictor. Therefore,

g (.)

is the function that links each element of the

η_{i j} = x_{i j}^{⊤} β

linear predictor with

E [Y_{i j}] = h (η_{i j})

, i.e., with the mathematical expectation. For the regression parameters vector

β = {(c, a_{1}, \dots, a_{I}, b_{0}, \dots, b_{J})}^{⊤}

, the c parameter indicates a feature in common with all the observations—model intercept—, the

a_{1}, a_{2}, \dots, a_{I}

parameters are linked to the accident year, while the

b_{0}, b_{1}, \dots, b_{J}

parameters are connected to the payment development year. The model we have created this way will be over-parametrized, and, above all, defined unless an additive constant. To correct this problem, we will assume the link

a_{1} = b_{0} = 0

; indeed, such parameters are not included in the X matrix, thus the generic parameter

a_{i}

and

b_{j}

indicates the difference from the c intercept.

In GLM models, the

β

parameters estimate is calculated through the maximum likelihood method; this approach allows for calculating the maximum likelihood mathematical expectation estimates given

{\hat{μ}}_{i j} = h (η_{i j})

with

i + j > t

. The dispersion

ϕ

parameter estimate when it is not known can itself be calculated either through the maximum likelihood method or through consistent estimators—for instance, through the one based on Pearson estimator:

\hat{ϕ} = \frac{1}{n - p} \sum_{i + j \geq t} ω_{i j} \frac{{(y_{i j} - {\hat{μ}}_{i j})}^{2}}{V ({\hat{μ}}_{i j})},

(9)

where

n - p

is the number of the model freedom degrees n is the number of the observed data, p is the number of parameters to be estimated particularly; in this case, we have

n = \frac{I (I + 1)}{2}

,

p = I + J = 2 I - 1

and thus

n - p = \frac{I^{2} - 3 I + 2}{2}

.

Remark 1.

The regression structure can be altered by inserting other parameters linked to further explicative variables. We can pick them through preliminary analysis based on data and through inferential analysis to compare models based on adjustment to data validity indicators and on residuals analysis. Notably, a new parameter can be inserted as well, to observe particular interactions between the two variables’ accident year and development year, therefore corresponding to particular data.

3.2. Semi-Parametrical Models

As we have already said, to define the likelihood function, it is necessary to specify the analytic form of distribution of the response variable, while it is possible to define the quasi-likelihood by specifying only the relation between mean and variance as described by Wedderburn (1974)1:

K (y; β, ϕ) = \sum_{i + j \leq t} ω_{i j} \int_{y_{i j}}^{μ_{i j}} \frac{y_{i j} - s}{ϕ V (s)} d s .

(10)

This relation can be used to estimate the

β

parameters. The quasi-likelihood function includes properties that are similar to the likelihood function, therefore also the parameters we can get by maximizing the (10). In the over-dispersed Poisson model with logarithmic link-function instead, the ratio between mean and variance is the following:

E [Y_{i j}] = μ_{i j} = e^{c + a_{i} + b_{j}} and V a r [Y_{i j}] = ϕ V (μ_{i j}) = ϕ μ_{i j} .

(11)

By inserting Equations (11) into (10), we can get the expression of the quasi-likelihood function for the over-dispersed Poisson model:

K (y; β, ϕ) = \sum_{i + j \leq t} \frac{ω_{i j}}{ϕ} [y_{i j} log \frac{μ_{i j}}{y_{i j}} - μ_{i j} + y_{i j}],

(12)

the

\hat{β}

estimate is calculated by searching for the

β

values that maximise Equation (12). The optimization problem can be solved through the Gauss–Newton method.

3.3. Elements for the Observed Data Goodness of Fit

One method often used to estimate the observed data model goodness of fit is to analyse the generalized Pearson residuals. Through them, it is possible to analyse the presence of anomalous data or trends. The calculation formula of these residuals is:

r_{i j} = \frac{y_{i j} - {\hat{μ}}_{i j}}{\sqrt{V ({\hat{μ}}_{i j}) / ω_{i j}}} .

(13)

Usually, under the hypothesis of residual normality, it may happen that they are included in the critical values

\pm 1.96

. In the ODP case, with

ω_{i j} = 1

, the residuals become:

r_{i j} = \frac{y_{i j} - {\hat{μ}}_{i j}}{\sqrt{\hat{ϕ} {\hat{μ}}_{i j}}} .

(14)

In order to calculate the overall discrepancy between empirical and theoretical data, as a rule, we use Pearson statistics

χ^{2} = \sum_{i + j \leq t} ω_{i j} \frac{{(y_{i j} - {\hat{μ}}_{i j})}^{2}}{V ({\hat{μ}}_{i j})}

and the deviance:

D (\hat{μ}; y) = - 2 \sum_{i + j \leq t} ω_{i j} [y_{i j} ({\hat{θ}}_{i j} - θ_{i j}^{*}) - (b ({\hat{θ}}_{i j}) - b (θ_{i j}^{*}))],

(15)

with

{\hat{θ}}_{i j} = b^{^{'} - 1} ({\hat{μ}}_{i j})

and

θ_{i j}^{*} = b^{^{'} - 1} (y_{i j})

. In the quasi-likelihood case, the deviance (15) becomes:

D (\hat{μ}; y) = - 2 \hat{ϕ} K (y; β, ϕ)

.

4. The Claims Reserve Mean Square Error of Prediction

4.1. The General Case

A stochastic model for the claims reserving is a prediction method in which payments to come are modeled through the estimators that are a function of the observed data. Therefore, beyond the variability typical of any random variable—process variance—we also need to take into account the variability inherent in the model parameters estimate—estimation or parameter variance.

First of all, we take into account the random R variable, which indicates the claims reserve. By using a proper model for R, we define an

\tilde{R}

estimator carefully modeled on the observed data and we call mean square error prediction (MSEP)—the following quantity:

M S E P (\tilde{R}) = E [{(R - \tilde{R})}^{2}],

(16)

if

\tilde{R}

is a correct estimator for the R mean—i.e.,

E (R) = E (\tilde{R})

—it will be possible to get the following decomposition:

\begin{matrix} M S E P (\tilde{R}) & = & E [{(R - E (R) + E (R) - \tilde{R})}^{2}] \\ \approx & E [{(R - E (R))}^{2}] + E [{(\tilde{R} - E (R))}^{2}] \\ = & \underset{p r o c e s s}{\underset{︸}{V a r (R)}} + \underset{p a r a m e t e r}{\underset{︸}{V a r (\tilde{R})}} . \end{matrix}

(17)

During Equation (17) differentiation, the covariance term is canceled out because of the hypothesis of independence between past observations and future predictions.

4.2. GLMs Implementation in Claims Reserving

In Section 4.1, we have talked about the estimate of historical data distribution parameters. Of course, in order to assess future cash-flows, we need to make predictions and to take into account the prediction errors for the random elements of the lower triangle. To this end, we will assume that the observed data are the result of random variables being stochastically independent and with probability distributions belonging to the same parameters family. This being the hypothesis, on the basis of the estimates and the estimators that we have obtained from run-off data parameters, we can also calculate estimates of distribution and estimators for the random variables of the lower triangle.

Let’s assume

\hat{β} = {(\hat{c}, {\hat{a}}_{1}, \dots, {\hat{a}}_{I}, {\hat{b}}_{0}, \dots, {\hat{b}}_{J})}^{⊤}

to be the estimate and

\tilde{β} = {(\tilde{c}, {\tilde{a}}_{1}, \dots, {\tilde{a}}_{I}, {\tilde{b}}_{0}, \dots, {\tilde{b}}_{J})}^{⊤}

to be the estimator of the maximum likelihood, and

\hat{ϕ}

to be the estimate of the dispersion parameter. For the random variable

Y_{i j}

with

i + j > t

, we can estimate the mathematical expectation and the variance through:

\hat{E} [Y_{i j}] = {\hat{μ}}_{i j} = h ({\hat{η}}_{i j}) = h (\hat{c} + {\hat{a}}_{i} + {\hat{b}}_{j})

(18)

and \hat{V a r} [Y_{i j}] = \hat{ϕ} V ({\hat{μ}}_{i j}) .

(19)

Let’s assume instead

Y_{i j}

as the transform of the linear predictor

{\tilde{Y}}_{i j} = h ({\tilde{η}}_{i j}) = h (\tilde{c} + {\tilde{a}}_{i} + {\tilde{b}}_{j})

. In the ODP model with a logarithmic link function case, we have:

\begin{matrix} \hat{E} [Y_{i j}] & = & {\hat{μ}}_{i j} = e^{(\hat{c} + {\hat{a}}_{i} + {\hat{b}}_{j})} \\ and \hat{V a r} [Y_{i j}] & = & \hat{ϕ} {\hat{μ}}_{i j} . \end{matrix}

(20)

We can get the estimate of mathematical expectations of the claims reserve—under the hypothesis of stochastic independence—as sums of the previous ones including the whole group of indexes defining different quantities. We can apply the same to the estimators sums. We have:

\hat{E} [R_{i}] = \sum_{j = t - i + 1}^{J} {\hat{μ}}_{i j}, \hat{E} [R] = \sum_{i + j > t} {\hat{μ}}_{i j}, \tilde{E} [R_{i}] = \sum_{j = t - i + 1}^{J} {\tilde{μ}}_{i j} and \tilde{E} [R] = \sum_{i + j > t} {\tilde{μ}}_{i j} .

(21)

Predictions are given by the estimators observed values and they match with the mathematical expectations written above. To calculate the prediction errors instead, we make reference to some asymptotic results about the maximum likelihood estimators of parameters in GLMs. Notably, if

ϕ

is known and if the regularity conditions of the likelihood function are respected, the maximum likelihood estimators satisfy the properties of consistency and asymptotic normality. Therefore, the mathematical expectation of the distribution can be approximated through the

\hat{β}

estimate, while the variance–covariance matrix can be estimated through the inverse of the Fisher information matrix:

I (\hat{β}) = - E [{\frac{\partial^{2} \tilde{l}}{\partial β_{h} \partial β_{j}}|}_{β = \hat{β}}],

(22)

where

\tilde{l}

is the r.v. we get by replacing the observations

y_{i j}

—that are the results of the r.v.

Y_{i j}

—in the expression of the log-likelihood function l. If

ϕ

is not known, previous results will be valid all the same if we replace with its own consistent estimate; thus, the estimator

\tilde{β} = {(\tilde{c}, {\tilde{a}}_{1}, \dots, {\tilde{a}}_{I}, {\tilde{b}}_{0}, \dots, {\tilde{b}}_{J})}^{⊤}

will be generally consistent and asymptotically normal:

\tilde{β} \sim N (\hat{β}, I^{- 1} (\hat{β}))

, where

I^{- 1} (\hat{β})

is indeed the estimate of the variance–covariance matrix of the

\tilde{β}

estimator, particularly:

\hat{V a r} (\hat{β}) = [\begin{matrix} \hat{V a r} (\hat{c}) & \dots & \hat{C o v} (\hat{c}, {\hat{a}}_{i}) & \dots & \hat{C o v} (\hat{c}, {\hat{b}}_{j}) & \dots \\ ⋮ & ⋮ & ⋮ \\ \hat{C o v} ({\hat{a}}_{i}, \hat{c}) & \dots & \hat{V a r} ({\hat{a}}_{i}) & \dots & \hat{C o v} ({\hat{a}}_{i}, {\hat{b}}_{j}) & \dots \\ ⋮ & ⋮ & ⋮ \\ \hat{C o v} ({\hat{b}}_{j}, \hat{c}) & \dots & \hat{C o v} ({\hat{b}}_{j}, {\hat{a}}_{i}) & \dots & \hat{V a r} ({\hat{b}}_{j}) & \dots \\ ⋮ & ⋮ & ⋮ \end{matrix}] = I^{- 1} (\hat{β}) .

(23)

When the mean of a random variable and the one of the estimator match, the MSEP is the sum of the process variance and the parameter variance (see Equation (17)). In the GLM model case, the two predicted values match only in the case of the identity link function, while in the unspecified case we can use the following Taylor first-order approximation:

E [{\tilde{Y}}_{i j}] = E [h ({\tilde{η}}_{i j})] \approx h (η_{i j}) + h^{^{'}} (η_{i j}) E [{\tilde{η}}_{i j} - η_{i j}]

(24)

as the estimator

{\tilde{η}}_{i j}

is asymptomatically correct,

{\tilde{η}}_{i j} \to η_{i j}

, we can conclude that

E [{\tilde{Y}}_{i j}] \approx E [Y_{i j}]

; therefore, as for the (MSEP), we get:

M S E P ({\tilde{Y}}_{i j}) = V a r (Y_{i j}) + E [{({\tilde{Y}}_{i j} - E (Y_{i j}))}^{2}] \approx V a r (Y_{i j}) + V a r ({\tilde{Y}}_{i j}),

(25)

with a similar procedure; as for variance, we get:

V a r ({\tilde{Y}}_{i j}) = V a r [h ({\tilde{η}}_{i j})] \approx {[h^{^{'}} ({\hat{η}}_{i j})]}^{2} V a r ({\tilde{η}}_{i j}),

(26)

the linear predictor variance estimate

{\tilde{η}}_{i j}

can be calculated through the variance–covariance matrix (23), and, in particular, we have:

\hat{V a r} ({\tilde{η}}_{i j}) = \hat{V a r} (\tilde{c}) + \hat{V a r} ({\tilde{a}}_{i}) + \hat{V a r} ({\tilde{b}}_{j}) + 2 [\hat{C o v} (\tilde{c}, {\tilde{a}}_{i}) + \hat{C o v} ({\tilde{a}}_{i}, {\tilde{b}}_{j}) + \hat{C o v} (\tilde{c}, {\tilde{b}}_{j})]

(27)

or in its compact form:

\hat{V a r} ({\tilde{η}}_{i j}) = x_{i j}^{⊤} \hat{V a r} ({\tilde{β}}_{i j}) x_{i j},

(28)

where

x_{i j}

is the dummy variables vector, variables used to code accident and development year.

Finally, as for the MSEP, we get the following formula:

\hat{M S E P} ({\tilde{Y}}_{i j}) = \hat{ϕ} {\hat{μ}}_{i j} + {[h^{^{'}} ({\hat{η}}_{i j})]}^{2} \hat{V a r} ({\hat{η}}_{i j}) .

(29)

By following a similar procedure, we can calculate the MSEP of the claims reserve for the i accident year, i.e.,

{\tilde{R}}_{i} = \sum_{j = t - i + 1}^{J} {\tilde{Y}}_{i j}

, that, on the basis of Equation (17), is calculated as the process variance

V a r (R_{i})

plus the parameter variance

V a r ({\tilde{R}}_{i})

sum, i.e.,:

M S E P ({\tilde{R}}_{i}) = V a r (R_{i}) + V a r ({\tilde{R}}_{i}) .

(30)

As for the process variance estimate, we have:

\hat{V a r} (R_{i}) = \sum_{j = t - i + 1}^{J} \hat{V a r} (Y_{i j}) = \hat{ϕ} \sum_{j = t - i + 1}^{J} {\hat{μ}}_{i j},

(31)

while as for the parameter variance estimate:

\hat{V a r} ({\tilde{R}}_{i}) = \sum_{j = t - i + 1}^{J} \hat{V a r} ({\tilde{Y}}_{i j}) + \sum_{\begin{matrix} j_{1}, j_{2} = t - i + 1 \\ j_{1} \neq j_{2} \end{matrix}}^{J} \hat{C o v} ({\tilde{Y}}_{i, j_{1}}, {\tilde{Y}}_{i, j_{2}}),

(32)

where the estimate of the covariances among incremental payments can be calculated in the following way:

\hat{C o v} ({\tilde{Y}}_{i, j_{1}}, {\tilde{Y}}_{i, j_{2}}) = h^{^{'}} ({\hat{η}}_{i, j_{1}}) \cdot h^{^{'}} ({\hat{η}}_{i, j_{2}}) \cdot \hat{C o v} ({\tilde{η}}_{i, j_{1}}, {\tilde{η}}_{i, j_{2}}),

(33)

while for linear predictor the covariance estimation is:

\hat{C o v} ({\tilde{η}}_{i, j_{1}}, {\tilde{η}}_{i, j_{2}}) = x_{i, j_{1}}^{⊤} \hat{V a r} ({\tilde{β}}_{i j}) x_{i, j_{2}} .

(34)

By inserting (26) and (33) in (32), we get:

\hat{V a r} ({\tilde{R}}_{i}) = \sum_{j = t - i + 1}^{J} {[h^{^{'}} ({\hat{η}}_{i j})]}^{2} V a r ({\tilde{η}}_{i j}) + \sum_{\begin{matrix} j_{1}, j_{2} = t - i + 1 \\ j_{1} \neq j_{2} \end{matrix}}^{J} h^{^{'}} ({\hat{η}}_{i, j_{1}}) \cdot h^{^{'}} ({\hat{η}}_{i, j_{2}}) \cdot \hat{C o v} ({\tilde{η}}_{i, j_{1}}, {\tilde{η}}_{i, j_{2}}),

(35)

in the case of the logarithmic link function, Equation (35) can be written in the following way:

\hat{V a r} ({\tilde{R}}_{i}) = \sum_{j = t - i + 1}^{J} {\hat{μ}}_{i j}^{2} V a r ({\tilde{η}}_{i j}) + \sum_{\begin{matrix} j_{1}, j_{2} = t - i + 1 \\ j_{1} \neq j_{2} \end{matrix}}^{J} {\hat{μ}}_{i, j_{1}} {\hat{μ}}_{i, j_{2}} \hat{C o v} ({\tilde{η}}_{i, j_{1}}, {\tilde{η}}_{i, j_{2}}) .

(36)

For the claims reserve total amount, we have instead

\hat{M S E P} (\tilde{R}) = \hat{V a r} (R) + \hat{V a r} (\tilde{R})

where:

\hat{V a r} (R) = \sum_{i + j > t} \hat{V a r} (Y_{i j})

(37)

and

\hat{V a r} (\tilde{R}) = \sum_{i + j > t} h^{^{'}} {({\hat{η}}_{i j})}^{2} \hat{V a r} ({\tilde{η}}_{i j}) + \sum_{\begin{matrix} i_{1} + j_{1} > t \\ i_{2} + j_{2} > t \\ (i_{1}, j_{1}) \neq (i_{2}, j_{2}) \end{matrix}} h^{^{'}} ({\hat{η}}_{i 1, j_{1}}) \cdot h^{^{'}} ({\hat{η}}_{i 2, j_{2}}) \cdot \hat{C o v} ({\tilde{η}}_{i 1, j_{1}}, {\tilde{η}}_{i 2, j_{2}}),

(38)

and in the case of the logarithmic link function:

\hat{V a r} (\tilde{R}) = \sum_{i + j > t} {\hat{μ}}_{i j}^{2} \hat{V a r} ({\tilde{η}}_{i j}) + \sum_{\begin{matrix} i_{1} + j_{1} > t \\ i_{2} + j_{2} > t \\ (i_{1}, j_{1}) \neq (i_{2}, j_{2}) \end{matrix}} {\hat{μ}}_{i 1, j_{1}} {\hat{μ}}_{i 2, j_{2}} \hat{C o v} ({\tilde{η}}_{i 1, j_{1}}, {\tilde{η}}_{i 2, j_{2}}) .

(39)

Finally, for the ODP,2 it is possible to re-write the formulae to calculate the MSEP in the following more compact forms:

\begin{matrix} \hat{M S E P} ({\tilde{R}}_{i}) & = & \hat{ϕ} \sum_{j = t - i + 1}^{J} {\hat{μ}}_{i j} \\ + & \sum_{j = t - i + 1}^{J} {\hat{μ}}_{i j}^{2} x_{i j}^{⊤} \hat{V a r} (\tilde{β}) x_{i j} + \sum_{\begin{matrix} j_{1}, j_{2} = t - i + 1 \\ j_{1} \neq j_{2} \end{matrix}}^{J} {\hat{μ}}_{i j_{1}} {\hat{μ}}_{i j_{2}} x_{i_{1}, j_{1}}^{⊤} \hat{V a r} (\tilde{β}) x_{i_{2}, j_{2}} \end{matrix}

(40)

and

\begin{matrix} \hat{M S E P} (\tilde{R}) & = & \hat{ϕ} \sum_{i + j > t} {\hat{μ}}_{i j} \\ + & \sum_{i + j > t} {\hat{μ}}_{i j}^{2} x_{i j}^{⊤} \hat{V a r} (\tilde{β}) x_{i j} + \sum_{\begin{matrix} i_{1} + j_{1} > t \\ i_{2} + j_{2} > t \\ (i_{1}, j_{1}) \neq (i_{2}, j_{2}) \end{matrix}} {\hat{μ}}_{i j_{1}} {\hat{μ}}_{i j_{2}} x_{i_{1}, j_{1}}^{⊤} \hat{V a r} (\tilde{β}) x_{i_{2}, j_{2}} . \end{matrix}

(41)

5. One-Year Volatility for the Claims Development Results

5.1. Overall Accident Years Estimate

In order to get to a formula that allows for estimating the one-year volatility—i.e., the standard deviation from the CDR random variable as illustrated in Equation (6)—we have used the error propagation technique, as suggested by Röhr (2016), a technique which allows for calculating at the same time the uncertainty both about the parameters estimate and about the random process underlying the data. As we have already described in Section 2.3, the CDR is the technical result of the claims reserve development in a one-year time-framework. Given Equation (6) and the link ratios illustrated in Section 2.2, in the chain ladder framework, the

C D R_{i, t + 1}

is written in the following way:

{\hat{C D R}}_{i, t + 1} = {\hat{C}}_{i, J}^{(t)} - {\hat{C}}_{i, J}^{(t + 1)} = \underset{{\hat{C}}_{i, t - i + 1}^{(t)}}{\underset{︸}{C_{i, t - i} \cdot {\hat{f}}_{t - i}^{(t)}}} \cdot {\hat{f}}_{t - i + 1}^{(t)} \cdot \dots \cdot {\hat{f}}_{J - 1}^{(t)} - C_{i, t - i + 1} \cdot {\hat{f}}_{t - i + 1}^{(t + 1)} \cdot \dots \cdot {\hat{f}}_{J - 1}^{(t + 1)} .

(42)

It is therefore necessary to estimate the ultimate cost

{\hat{C}}_{i, J}^{(t + 1)}

at the

t + 1

time, but, as we have obtained the observations up to the t time, we can use the

{\hat{f}}_{j}^{(t + 1)}

estimates made on the basis of the observations in t. To this purpose, the best approach is to write (42) in the following way:

{\hat{C D R}}_{i, t + 1} = C_{i, t - i} (\prod_{j = t - i}^{J - 1} {\hat{f}}_{j}^{(t)}) (1 - \frac{C_{i, t - i + 1} / C_{i, t - i}}{{\hat{f}}_{t - i}^{(t)}} \prod_{j = t - i + 1}^{J - 1} \frac{{\hat{f}}_{j}^{(t + 1)}}{{\hat{f}}_{j}^{(t)}}) .

(43)

In a priori absence of data, the ratio between two link ratios linked to the same j development year and estimated in a two-years-in-a-raw balance sheet can be calculated through the credibility factor (see Merz and Wüthrich 2015):

\frac{{\hat{f}}_{j}^{(t + 1)}}{{\hat{f}}_{j}^{(t)}} = α_{j}^{(t)} \frac{C_{t - j, j + 1} / C_{t - j, j}}{{\hat{f}}_{j}^{(t)}} + (1 - α_{j}^{(t)}),

(44)

where

α_{j}^{(t)} = \frac{C_{t - j, j}}{\sum_{i = 1}^{t - j} C_{i j}}

is the credibility coefficient.

By using (44) for the link ratios, the CDR estimate (43) can be also written in the following way:

C D R_{i, t + 1} = \underset{ultimate \cos t in t}{\underset{︸}{{\hat{C}}_{i, J}^{(t)}}} - \underset{ultimate \cos t in t + 1, {\hat{C}}_{i, J}^{(t + 1)}}{\underset{︸}{{\hat{C}}_{i, J}^{(t)} \frac{C_{i, t - i + 1} / C_{i, t - i}}{{\hat{f}}_{t - i}^{(t)}} \prod_{j = t - i + 1}^{J - 1} (α_{j}^{(t)} \frac{C_{t - j, j + 1} / C_{t - j, j}}{{\hat{f}}_{j}^{(t)}} + (1 - α_{j}^{(t)}))}} .

(45)

As shown by Renshaw and Verrall (1998), between the link ratios, we have calculated through the chain ladder method and the parameters calculated through the GLM method there is the following ratio:

{\hat{f}}_{j}^{(t)} = 1 + \frac{e^{{\hat{b}}_{j + 1}}}{\sum_{k = 0}^{j} e^{{\hat{b}}_{k}}} = \frac{\sum_{k = 0}^{j + 1} e^{{\hat{b}}_{k}}}{\sum_{k = 0}^{j} e^{{\hat{b}}_{k}}} j = 0, \dots, J - 1,

(46)

on the (46) basis, as for the link ratios product, we have

\prod_{j = t - i}^{J - 1} {\hat{f}}_{j}^{(t)} = \frac{\sum_{k = 0}^{J} e^{{\hat{b}}_{k}}}{\sum_{k = 0}^{t - i} e^{{\hat{b}}_{k}}}

, thus the ultimate cost estimate can be defined as:

{\hat{C}}_{t - j, J}^{(t)} = C_{t - j, j} \frac{\sum_{k = 0}^{J} e^{{\hat{b}}_{k}}}{\sum_{k = 0}^{j} e^{{\hat{b}}_{k}}} j = 0, \dots, J - 1 .

(47)

If we make reference to the cumulative payment

C_{t - j, j + 1}

instead—that in t is a r.v.—we have:

\begin{matrix} C_{t - j, j + 1} & = & C_{t - j, j} + Y_{t - j, j + 1} \\ = & {\hat{C}}_{t - j, J}^{(t)} \frac{\sum_{k = 0}^{j} e^{{\hat{b}}_{k}}}{\sum_{k = 0}^{J} e^{{\hat{b}}_{k}}} + Y_{t - j, j + 1} \\ = & {\hat{C}}_{t - j, J}^{(t)} \frac{\sum_{k = 0}^{j + 1} e^{{\hat{b}}_{k}}}{\sum_{k = 0}^{J} e^{{\hat{b}}_{k}}} + Y_{t - j, j + 1} - {\hat{C}}_{t - j, J}^{(t)} \frac{e^{{\hat{b}}_{j + 1}}}{\sum_{k = 0}^{J} e^{{\hat{b}}_{k}}} \\ = & = {\hat{C}}_{t - j, J}^{(t)} \frac{\sum_{k = 0}^{j + 1} e^{{\hat{b}}_{k}}}{\sum_{k = 0}^{J} e^{{\hat{b}}_{k}}} + Y_{t - j, j + 1} - e^{\hat{c} + {\hat{a}}_{t - j} + {\hat{b}}_{j + 1}} . \end{matrix}

(48)

In calculating the last derivative, we have taken into account that

{\hat{C}}_{t - j, J}^{(t)} = \sum_{k = 0}^{J} {\hat{Y}}_{t - j, k} = e^{\hat{c} + {\hat{a}}_{t - j}} \sum_{k = 0}^{J} e^{{\hat{b}}_{k}}

. By adding and subtracting the hypothetical value

e^{c + a_{t - j} + b_{j + 1}}

to (48), we get:

C_{t - j, j + 1} = {\hat{C}}_{t - j, J}^{(t)} \frac{\sum_{k = 0}^{j + 1} e^{{\hat{b}}_{k}}}{\sum_{k = 0}^{J} e^{{\hat{b}}_{k}}} + \underset{ξ_{t - j, j + 1}}{\underset{︸}{Y_{t - j, j + 1} - e^{c + a_{t - j} + b_{j + 1}}}} + \underset{ζ_{t - j, j + 1}}{\underset{︸}{e^{c + a_{t - j} + b_{j + 1}} - e^{\hat{c} + {\hat{a}}_{t - j} + {\hat{b}}_{j + 1}}}} .

(49)

The cumulative paid sums

C_{t - j, j + 1}

of the following year can be broken up—as suggested in Röhr (2016)—through the

ξ_{t - j, j + 1}

residual linked to the process variance and the

ζ_{t - j, j + 1}

residual linked to the parameter variance.

Keeping in mind (46), (47) and (49), it is possible to get the following ratio:

\begin{matrix} \frac{C_{t - j, j + 1} / C_{t - j, j}}{{\hat{f}}_{j}^{(t)}} & = & \underset{= 1}{\underset{︸}{\frac{{\hat{C}}_{t - j, J}^{(t)}}{C_{t - j, j}} \frac{\sum_{k = 0}^{j + 1} e^{{\hat{b}}_{k}}}{\sum_{k = 0}^{J} e^{{\hat{b}}_{k}}} \frac{\sum_{k = 0}^{j} e^{{\hat{b}}_{k}}}{\sum_{k = 0}^{j + 1} e^{{\hat{b}}_{k}}}}} + (ξ_{t - j, j + 1} + ζ_{t - j, j + 1}) \frac{1}{C_{t - j, j}} \frac{\sum_{k = 0}^{j} e^{{\hat{b}}_{k}}}{\sum_{k = 0}^{j + 1} e^{{\hat{b}}_{k}}} \\ = & 1 + (ξ_{t - j, j + 1} + ζ_{t - j, j + 1}) \frac{1}{{\hat{C}}_{t - j, j + 1}^{(t)}} . \end{matrix}

(50)

Therefore, the ultimate cost in

(t + 1)

for the i accident year—that is the second addend of Equation (45)—can also be written in the following way:

{\hat{C}}_{i, J}^{(t + 1)} = {\hat{C}}_{i, J}^{(t)} (1 + \frac{ξ_{i, t - i + 1} + ζ_{i, t - i + 1}}{{\hat{C}}_{i, t - i + 1}^{(t)}}) \prod_{j = t - i + 1}^{J - 1} (1 + α_{j}^{(t)} \frac{ξ_{t - j, j + 1} + ζ_{t - j, j + 1}}{{\hat{C}}_{t - j, j + 1}^{(t)}}) .

(51)

As a consequence, the cost of the total of all generations, can be calculated by adding the cost of all the accident years:

\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t + 1)} = \sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)} (1 + \frac{ξ_{i, t - i + 1} + ζ_{i, t - i + 1}}{{\hat{C}}_{i, t - i + 1}^{(t)}}) \prod_{j = t - i + 1}^{J - 1} (1 + α_{j}^{(t)} \frac{ξ_{t - j, j + 1} + ζ_{t - j, j + 1}}{{\hat{C}}_{t - j, j + 1}^{(t)}}) .

(52)

Therefore, it is possible to consider the ultimate cost prediction in

t + 1

as a fluctuation of the prediction in the t time, where the

ξ_{t - j, j + 1}

and

ζ_{t - j, j + 1}

residuals represent the process innovation. A simulation approach would demand the

ξ_{t - j, j + 1}

simulation and the

ζ_{t - j, j + 1}

estimate through the bootstrap method.

Notice that the two residuals are independent as to the model basic assumptions. To determine a closed tool, we will use a different approach by taking into account Taylor’s expansion of such a fluctuation. As Step 1, we consider the fluctuation linked to the process variance, we calculate the derivatives3—making reference to the first residual—and we estimate them in 0—for every

ξ

and

ζ

—thus getting the following weights:

\begin{matrix} q_{k + 1}^{(t)} & = & \partial_{log Y_{t - k, k + 1}} {log (\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t + 1)})|}_{0} \\ = & {\frac{\partial_{log Y_{t - k, k + 1}} \sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t + 1)}}{\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t + 1)}}|}_{0} = {\frac{\partial_{log Y_{t - k, k + 1}} \sum_{i = t - k}^{I} {\hat{C}}_{i, J}^{(t + 1)}}{\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t + 1)}}|}_{0}, t - I \leq k \leq J - 1 . \end{matrix}

(53)

To make this calculation easier, we initially consider the first addend of the numerator of Equation (53) derivative development:

\begin{matrix} \partial_{log Y_{t - k, k + 1}} {{\hat{C}}_{t - k, J}^{(t + 1)}|}_{0} & = & {\partial_{log Y_{t - k, k + 1}} {\hat{C}}_{t - k, J}^{(t)} (1 + \frac{ξ_{t - k, k + 1} + ζ_{t - k, k + 1}}{{\hat{C}}_{t - k, k + 1}^{(t)}}) \prod_{j = k + 1}^{J - 1} (1 + α_{j}^{(t)} \frac{ξ_{t - j, j + 1} + ζ_{t - j, j + 1}}{{\hat{C}}_{t - j, j + 1}^{(t)}})|}_{0} \\ = & {{\hat{C}}_{t - k, J}^{(t)} (\partial_{log Y_{t - k, k + 1}} \frac{ξ_{t - k, k + 1}}{{\hat{C}}_{t - k, k + 1}^{(t)}}) \prod_{j = k + 1}^{J - 1} (1 + α_{j}^{(t)} \frac{ξ_{t - j, j + 1} + ζ_{t - j, j + 1}}{{\hat{C}}_{t - j, j + 1}^{(t)}})|}_{0} \\ = & {{\hat{C}}_{t - k, J}^{(t)} (\partial_{log Y_{t - k, k + 1}} \frac{Y_{t - k, k + 1}}{{\hat{C}}_{t - k, k + 1}^{(t)}}) \prod_{j = k + 1}^{J - 1} (1 + α_{j}^{(t)} \frac{ξ_{t - j, j + 1} + ζ_{t - j, j + 1}}{{\hat{C}}_{t - j, j + 1}^{(t)}})|}_{0} \\ = & {{\hat{C}}_{t - k, J}^{(t)} \frac{Y_{t - k, k + 1}}{{\hat{C}}_{t - k, k + 1}^{(t)}} \prod_{j = k + 1}^{J - 1} (1 + α_{j}^{(t)} \frac{ξ_{t - j, j + 1} + ζ_{t - j, j + 1}}{{\hat{C}}_{t - j, j + 1}^{(t)}})|}_{0} \\ = & {\hat{C}}_{t - k, J}^{(t)} \frac{e^{c + a_{t - k} + b_{k + 1}}}{{\hat{C}}_{t - k, k + 1}^{(t)}} f o r t - I \leq k \leq J - 2, \end{matrix}

(54)

and

\begin{matrix} \partial_{log Y_{t - J + 1, J}} {{\hat{C}}_{t - J + 1, J}^{(t + 1)}|}_{0} & = & {\partial_{log Y_{t - J + 1, J}} {\hat{C}}_{t - J + 1, J}^{(t)} (1 + \frac{ξ_{t - J + 1, J} + ζ_{t - J + 1, J}}{{\hat{C}}_{t - J + 1, J}^{(t)}})|}_{0} \\ = & {{\hat{C}}_{t - J + 1, J}^{(t)} (\partial_{log Y_{t - J + 1, J}} \frac{ξ_{t - J + 1, J}}{{\hat{C}}_{t - J + 1, J}^{(t)}})|}_{0} \\ = & {\partial_{log Y_{t - J + 1, J}} Y_{t - J + 1, J}|}_{0} \\ = & e^{c + a_{t - J + 1} + b_{J}} f o r k = J - 1 . \end{matrix}

(55)

By applying the same calculation also to the other addends, we get:

q_{k + 1}^{(t)} = \frac{\frac{e^{c + a_{t - k} + b_{k + 1}}}{{\hat{C}}_{t - k, k + 1}^{(t)}} ({\hat{C}}_{t - k, J}^{(t)} + α_{k}^{(t)} \sum_{i = t - k + 1}^{I} {\hat{C}}_{i, J}^{(t)})}{\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)}}, t - I \leq k \leq J - 1 .

(56)

In calculating this derivative, we have kept in mind that

\partial_{log x} f (x) = x \frac{\partial}{\partial x} f (x)

.

Considering that

\partial_{log x} log (f (x)) = \frac{x}{f (x)} \frac{\partial}{\partial x} f (x)

instead, it is possible to write:

\partial_{Y_{t - k, k + 1}} {\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t + 1)}|}_{0} = \frac{q_{k + 1}^{(t)}}{e^{c + a_{t - k} + b_{k + 1}}} \sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)} .

(57)

As for the

ζ

reminder, linked to the parameter error, we initially consider the derivative in relation to the estimate of the intercept parameter c:

\partial_{{\hat{c}}_{k + 1}} ζ_{t - k, k + 1} = \partial_{{\hat{c}}_{k + 1}} (e^{c + a_{t - k} + b_{k + 1}} - e^{\hat{c} + {\hat{a}}_{t - k} + {\hat{b}}_{k + 1}}) = - e^{\hat{c} + {\hat{a}}_{t - k} + {\hat{b}}_{k + 1}} .

(58)

Thus, deriving the ultimate cost

\sum_{i = t - k}^{I} {\hat{C}}_{i, J}^{(t + 1)}

with reference to

\hat{c}

, and estimating the derivative in 0—for each

ξ

and

ζ

, for the latter, we also consider the equality for every parameter (component-wise equality)—similarly to what we had already done to calculate Equations (53) and (57), we get:

{\partial_{{\hat{c}}_{k + 1}} \sum_{i = t - k}^{I} {\hat{C}}_{i, J}^{(t + 1)}|}_{0} = - \frac{e^{c + a_{t - k} + b_{k + 1}}}{{\hat{C}}_{t - k, k + 1}^{(t)}} ({\hat{C}}_{t - k, J}^{(t)} + α_{j}^{(t)} \sum_{i = t - k + 1}^{I} {\hat{C}}_{i, J}^{(t)}) = - q_{k + 1}^{(t)} \sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)}

(59)

and, in a very similar way, we get the derivative in comparison with the parameters

{\hat{a}}_{t - k}

and

{\hat{b}}_{k + 1}

:

\begin{matrix} {\partial_{{\hat{a}}_{t - k}} \sum_{i = t - k}^{I} {\hat{C}}_{i, J}^{(t + 1)}|}_{0} & = & - q_{k + 1}^{(t)} \sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)}, \\ {\partial_{{\hat{b}}_{k + 1}} \sum_{i = t - k}^{I} {\hat{C}}_{i, J}^{(t + 1)}|}_{0} & = & - q_{k + 1}^{(t)} \sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)} . \end{matrix}

(60)

By using the above-written derivatives, as to the ratio between the ultimate cost estimated in

t + 1

—which in t is random—and, in t, we get to the Taylor’s first-order approximation, which is:

\begin{matrix} \frac{\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t + 1)}}{\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)}} & = & \frac{\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)} (1 + \frac{ξ_{i, t - i + 1} + ζ_{i, t - i + 1}}{{\hat{C}}_{i, t - i + 1}^{(t)}}) \prod_{j = t - i + 1}^{J - 1} (1 + α_{j}^{(t)} \frac{ξ_{t - j, j + 1} + ζ_{t - j, j + 1}}{{\hat{C}}_{t - j, j + 1}^{(t)}})}{\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)}} \\ \approx & 1 + \sum_{k = t - I}^{J - 1} \frac{q_{k + 1}^{(t)}}{e^{c + a_{t - k} + b_{k + 1}}} ξ_{t - j, j + 1} - \sum_{k = t - I}^{J - 1} q_{k + 1}^{(t)} (c - \hat{c} + a_{t - k} - {\hat{a}}_{t - k} + b_{k + 1} - {\hat{b}}_{k + 1}) \\ = & 1 + \sum_{k = t - I}^{J - 1} \frac{q_{k + 1}^{(t)}}{μ_{t - k, k + 1}} ξ_{t - j, j + 1} - \sum_{k = t - I}^{J - 1} q_{k + 1}^{(t)} (η_{t - k, k + 1} - {\hat{η}}_{t - k, k + 1}) \\ = & 1 + \sum_{k = t - I}^{J - 1} \frac{q_{k + 1}^{(t)}}{μ_{t - k, k + 1}} ξ_{t - j, j + 1} - \sum_{k = t - I}^{J - 1} q_{k + 1}^{(t)} x_{t - k, k + 1}^{⊤} (β - \hat{β}) . \end{matrix}

(61)

Through using Formulae (45) and (61), taking the denominator—after changing its sign—from the first to the second member and by exploiting the assumption of independence among the incremental payments, for the square sum of the CDR, we get to the following first-order approximation:

\begin{matrix} {(\sum_{i = t - J + 1}^{I} {\hat{C D R}}_{i, t + 1})}^{2} & \approx & {(\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)})}^{2} [\sum_{k = t - I}^{J - 1} \frac{{(q_{k + 1}^{(t)})}^{2}}{μ_{t - k, k + 1}^{2}} ξ_{t - j, j + 1}^{2} \\ + & \sum_{k_{1} = t - I}^{J - 1} \sum_{k_{2} = t - I}^{J - 1} q_{k_{1} + 1}^{(t)} q_{k_{2} + 1}^{(t)} \cdot x_{t - k, k + 1}^{⊤} (β - \hat{β}) {(β - \hat{β})}^{T} x_{t - k, k + 1}] \\ = & {(\sum_{i = t - J + 1}^{I} {\hat{C}}_{i, J}^{(t)})}^{2} [\sum_{k = t - I}^{J - 1} \frac{{(q_{k + 1}^{(t)})}^{2}}{μ_{t - k, k + 1}^{2}} V a r (Y_{t - k, k + 1}) \\ + & \sum_{k_{1} = t - I}^{J - 1} \sum_{k_{2} = t - I}^{J - 1} q_{k_{1} + 1}^{(t)} q_{k_{2} + 1}^{(t)} \cdot x_{t - k, k + 1}^{⊤} V a r (β) x_{t - k, k + 1}] . \end{matrix}

(62)

In Equation (62), we have omitted the products between the

ξ

and

ζ

residuals because, as they are independent, they will cancel out in Equation (63).

By replacing the parameters estimates with the corresponding unknown values, we can get the MSEP of the the CDR—i.e., of the one-year loss—for the total of generations:

\begin{matrix} \hat{M S E P} (\sum_{i = t - J + 1}^{I} {\hat{C D R}}_{i, t + 1}) & = & {(\sum_{i = t - J + 1}^{I} \sum_{j = 0}^{J} {\hat{μ}}_{i j})}^{2} \\ \times & [\underset{p r o c e s s}{\underset{︸}{\hat{ϕ} \sum_{k = t - I}^{J - 1} \frac{{\hat{q}}_{k + 1}^{2}}{{\hat{μ}}_{t - k, k + 1}}}} + \underset{p a r a m e t e r}{\underset{︸}{{\hat{q}}^{⊤} X_{(t + 1)} \hat{V a r} (\hat{β}) X_{(t + 1)}^{⊤} \hat{q}}}], \end{matrix}

(63)

where

X_{(t + 1)}

—hat matrix—is the matrix that codes the accident and development years linked to the incremental payments of

t + 1

year through the dummy variables, while

\hat{q} = {\{{\hat{q}}_{k + 1}\}}_{k = t - I}^{J - 1}

is the vector of the

q_{k + 1}

weights estimate calculated in (56) and better specified by using the GLM language:

{\hat{q}}_{k + 1} = \frac{\frac{e^{\hat{c} + {\hat{a}}_{t - k} + {\hat{b}}_{k + 1}}}{\sum_{j = 0}^{k + 1} e^{\hat{c} + {\hat{a}}_{t - k} + {\hat{b}}_{j}}} (\sum_{j = 0}^{J} e^{\hat{c} + {\hat{a}}_{t - k} + {\hat{b}}_{j}} + α_{k}^{(t)} \sum_{i = t - k + 1}^{I} \sum_{j = 0}^{J} e^{\hat{c} + {\hat{a}}_{i} + {\hat{b}}_{j}})}{\sum_{i = t - J + 1}^{I} \sum_{j = 0}^{J} e^{\hat{c} + {\hat{a}}_{i} + {\hat{b}}_{j}}} .

(64)

Notice that, for

k = 0

, we have

t - k + 1 > I

; under this circumstance, the summation is null. Simplifying, (64) can also be written in an easier way, as explained in (65):

{\hat{q}}_{k + 1} = \frac{\frac{e^{{\hat{b}}_{k + 1}}}{\sum_{j = 0}^{k + 1} e^{{\hat{b}}_{j}}} (e^{{\hat{a}}_{t - k}} + α_{k}^{(t)} \sum_{i = t - k + 1}^{I} e^{{\hat{a}}_{i}})}{\sum_{i = t - J + 1}^{I} e^{{\hat{a}}_{i}}} .

(65)

5.2. One-Year Volatility for Accident Year

In this section, we will illustrate the calculation of the one-year volatility for an

i^{*}

fixed accident year. Starting from Equation (51), that we write again just for convenience,

{\hat{C}}_{i^{*}, J}^{(t + 1)} = {\hat{C}}_{i^{*}, J}^{(t)} (1 + \frac{ξ_{i^{*}, t - i^{*} + 1} + ζ_{i^{*}, t - i^{*} + 1}}{{\hat{C}}_{i^{*}, t - i^{*} + 1}^{(t)}}) \prod_{j = t - i^{*} + 1}^{J - 1} (1 + α_{j}^{(t)} \frac{ξ_{t - j, j + 1} + ζ_{t - j, j + 1}}{{\hat{C}}_{t - j, j + 1}^{(t)}})

(66)

and we proceed to get an approximation through a Taylor residuals expansion in series. Similarly to what we have done in the previous section, we calculate the partial derivative, at the point

ξ, ζ = 0

, of the ultimate cost estimated in

(t + 1)

:

s_{i^{*}, k + 1} = \partial_{log X_{t - k, k + 1}} {log (C_{i^{*}, J}^{(t + 1)})|}_{0} = {\frac{\partial_{log Y_{t - k, k + 1}} C_{i^{*}, J}^{(t + 1)}}{C_{i^{*}, J}^{(t + 1)}}|}_{0},

(67)

by considering first of all the case

i^{*} = t - k

, we get:

\begin{matrix} \partial_{log Y_{t - k, k + 1}} {C_{i^{*}, J}^{(t + 1)}|}_{0} & = & {{\hat{C}}_{i^{*}, J}^{(t)} \frac{Y_{i^{*}, t - i^{*} + 1}}{{\hat{C}}_{i^{*}, t - i^{*} + 1}^{(t)}} \prod_{j = t - i^{*} + 1}^{J - 1} (1 + α_{j}^{(t)} \frac{ξ_{t - j, j + 1} + ζ_{t - j, j + 1}}{{\hat{C}}_{t - j, j + 1}^{(t)}})|}_{0} \\ = & {\hat{C}}_{i^{*}, J}^{(t)} \underset{r_{t - i^{*} + 1}^{(t)}}{\underset{︸}{\frac{e^{c + a_{i^{*}} + b_{t - i^{*} + 1}}}{{\hat{C}}_{i^{*}, t - i^{*} + 1}^{(t)}}}} = {\hat{C}}_{i^{*}, J}^{(t)} r_{t - i^{*} + 1}^{(t)} i^{*} = t - k, \end{matrix}

(68)

while, for

i^{*} = 2

:

\begin{matrix} \partial_{log Y_{2, J}} {C_{2, J}^{(t + 1)}|}_{0} & = & {{\hat{C}}_{2, J}^{(t)} \frac{Y_{2, J}}{{\hat{C}}_{2, J}^{(t)}}|}_{0} \\ = & e^{c + a_{2} + b_{J}} \\ = & {\hat{C}}_{2, J}^{(t)} r_{J}^{(t)} i^{*} = 2; \end{matrix}

(69)

therefore, in the particular case

i^{*} = t - k

, we have that

s_{t - i^{*} + 1}^{(t)} = r_{t - i^{*} + 1}^{(t)}

, while for

i^{*} > t - k

the logarithmic (67) becomes:

\begin{matrix} \partial_{log Y_{t - k, k + 1}} {C_{i^{*}, J}^{(t + 1)}|}_{0} & = & {{\hat{C}}_{i^{*}, J}^{(t)} (1 + \frac{ξ_{i t - i^{*} + 1} + ζ_{i t - i^{*} + 1}}{{\hat{C}}_{i t - i^{*} + 1}^{(t)}}) α_{k}^{(t)} \frac{Y_{t - k, k + 1}}{{\hat{C}}_{t - k, k + 1}^{(t)}} \prod_{\begin{matrix} j = t - i^{*} + 1 \\ j \neq k \end{matrix}}^{J - 1} (1 + α_{j}^{(t)} \frac{ξ_{t - j, j + 1} + ζ_{t - j, j + 1}}{{\hat{C}}_{t - j, j + 1}^{(t)}})|}_{0} \\ = & {\hat{C}}_{i^{*}, J}^{(t)} α_{k}^{(t)} \frac{e^{c + a_{t - k} + b_{k + 1}}}{{\hat{C}}_{t - k, k + 1}^{(t)}} = {\hat{C}}_{i^{*}, J}^{(t)} \underset{s_{i^{*}, k + 1}^{(t)}}{\underset{︸}{α_{k}^{(t)} r_{k + 1}^{(t)}}} i^{*} > t - k \end{matrix}

(70)

and we have

s_{i^{*}, k + 1}^{(t)} = α_{k}^{(t)} r_{k + 1}^{(t)}

. In the particular case of the GLM estimate model, with logarithmic link function and ODP distribution, we get the following simplified form for the

r_{k + 1}^{(t)}

ratio:

r_{k + 1}^{(t)} = \frac{e^{b_{k + 1}}}{\sum_{j = 0}^{k + 1} e^{b_{j}}} = 1 - \frac{1}{f_{k}^{(t)}} k = 0, \dots, J - 1,

(71)

similarly to (57), keeping in mind that

\partial_{log x} log (f (x)) = \frac{x}{f (x)} \frac{\partial}{\partial x} f (x)

, we get:

{\partial_{Y_{t - k, k + 1}} {\hat{C}}_{i^{*}, J}^{(t + 1)}|}_{0} = \{\begin{matrix} \frac{{\hat{C}}_{i^{*}, J}^{(t)} r_{t - i^{*} + 1}^{(t)}}{e^{c + a_{i^{*}} + b_{t - i^{*} + 1}}}, & i^{*} = t - k, \\ \frac{{\hat{C}}_{i^{*}, J}^{(t)} α_{k}^{(t)} r_{k + 1}^{(t)}}{e^{c + a_{t - k} + b_{k + 1}}}, & i^{*} > t - k, \end{matrix}

(72)

equally, by taking the partial derivatives of the

ζ

residuals in comparison with the GLM model parameters written in Equations (58) and (59), we get the partial derivatives—at the

ξ, ζ = 0

point—of the ultimate cost

{\hat{C}}_{i^{*}, J}^{(t + 1)}

, for example in comparison with

{\hat{b}}_{k + 1}

:

\partial_{{\hat{b}}_{k + 1}} {{\hat{C}}_{i^{*}, J}^{(t + 1)}|}_{0} = \{\begin{matrix} - {\hat{C}}_{i^{*}, J}^{(t)} \frac{e^{c + a_{i^{*}} + b_{t - i^{*} + 1}}}{{\hat{C}}_{i^{*}, t - i^{*} + 1}^{(t)}}, & = & - {\hat{C}}_{i^{*}, J}^{(t)} r_{t - i^{*} + 1}^{(t)} & i^{*} = t - k, \\ - {\hat{C}}_{i^{*}, J}^{(t)} \frac{e^{c + a_{t - k} + b_{k + 1}}}{{\hat{C}}_{t - k, k + 1}^{(t)}} α_{k}^{(t)}, & = & - {\hat{C}}_{i^{*}, J}^{(t)} α_{k}^{(t)} r_{k + 1}^{(t)} & i^{*} > t - k . \end{matrix}

(73)

By using previous results, for the

i^{*}

generation, we get the following Taylor approximation for the ratio between the ultimate cost estimate done in

t + 1

and the one done in t:

\begin{matrix} \frac{{\hat{C}}_{i^{*}, J}^{(t + 1)}}{{\hat{C}}_{i^{*}, J}^{(t)}} & \approx & 1 + \sum_{k = t - i^{*}}^{J - 1} \frac{s_{i^{*}, k + 1}^{(t)}}{μ_{t - k, k + 1}} ξ_{t - k, k + 1} - \sum_{k = t - i^{*}}^{J - 1} s_{i^{*}, k + 1}^{(t)} (η_{t - k, k + 1} - {\hat{η}}_{t - k, k + 1}) \\ = & 1 + \sum_{k = t - i^{*}}^{J - 1} \frac{s_{i^{*}, k + 1}^{(t)}}{μ_{t - k, k + 1}} ξ_{t - k, k + 1} - \sum_{k = t - i^{*}}^{J - 1} s_{i^{*}, k + 1}^{(t)} x_{t - k, k + 1}^{⊤} (β - \hat{β}) . \end{matrix}

(74)

Therefore, in a way very similar to Equation (62), for the square claims’ development ratio, we get the following Taylor’s first-order expansion:

\begin{matrix} {\hat{C D R}}_{i^{*}, t + 1}^{2} & \approx & {({\hat{C}}_{i^{*}, J}^{(t)})}^{2} \{\sum_{k = t - i^{*}}^{J - 1} {(\frac{s_{i^{*}, k + 1}^{(t)}}{μ_{t - k, k + 1}} ξ_{t - k, k + 1})}^{2} \\ + & \sum_{k_{1} = t - i^{*}}^{J - 1} \sum_{k_{2} = t - i^{*}}^{J - 1} s_{i^{*}, k_{1} + 1}^{(t)} s_{i^{*}, k_{2} + 1}^{(t)} x_{t - k_{1}, k_{1} + 1}^{⊤} (β - \hat{β}) {(β - \hat{β})}^{⊤} x_{t - k_{2}, k_{2} + 1}\} \\ = & {({\hat{C}}_{i^{*}, J}^{(t)})}^{2} \{\sum_{k = t - i^{*}}^{J - 1} {(\frac{s_{i^{*}, k + 1}^{(t)}}{μ_{t - k, k + 1}})}^{2} V a r (Y_{t - k, k + 1}) \\ + & \sum_{k_{1} = t - i^{*}}^{J - 1} \sum_{k_{2} = t - i^{*}}^{J - 1} s_{i^{*}, k_{1} + 1}^{(t)} s_{i^{*}, k_{2} + 1}^{(t)} x_{t - k_{1}, k_{1} + 1}^{⊤} V a r (β) x_{t - k_{2}, k_{2} + 1}\} \end{matrix}

(75)

Finally, as for the MSEP estimate, we have the following expression:

\begin{matrix} \hat{M S E P} ({\hat{C D R}}_{i^{*}, t + 1}) & = & {(\sum_{j = 0}^{J} e^{\hat{c} + {\hat{a}}_{i^{*}} + {\hat{b}}_{j}})}^{2} \\ \times & (\hat{ϕ} \sum_{k = t - i^{*}}^{J - 1} \frac{{\hat{s}}_{i^{*}, k + 1}^{2}}{{\hat{μ}}_{t - k, k + 1}} + {\hat{s}}_{(i^{*})}^{⊤} X_{i^{*}, (t + 1)} \hat{V a r} (\hat{β}) X_{i^{*}, (t + 1)}^{⊤} {\hat{s}}_{(i^{*})}), \end{matrix}

(76)

where

{\hat{s}}_{(i^{*})}

is a length

i^{*} - 1

vector whose elements are:

{\hat{s}}_{i^{*}, k + 1} = \{\begin{matrix} {\hat{r}}_{t - i^{*} + 1} & t - k = i^{*} \\ {\hat{r}}_{k + 1} α_{k}^{(t)} & 2 \leq t - k < i^{*} \end{matrix} k = t - i^{*}, \dots, J - 1,

(77)

while

X_{i^{*}, (t + 1)}

is the projection matrix —hat matrix— that encodes the incremental payments of the

t + 1

following year for the

i^{*}

generations and the previous ones.

6. Numerical Investigation

In this section, we introduce a numerical application which compares the closed tool method (CT)—illustrated in this paper—with the well-known bootstrapping (BS) method—with re-reserving—whose results are found through Monte Carlo simulation. The triangle used for the comparison is the one represented in Table 2; they are the third party liability segment payments of an Italian company, for obvious privacy reasons, data have been disguised. In Table 3, we have copied the parameters and their standard error estimate calculated by applying the quasi-likelihood (12), while applying (9), we get

\hat{ϕ} = 410.8964

. In Table 4 and Table 5, there are the

{\hat{α}}_{k}

,

{\hat{q}}_{k + 1}

,

{\hat{μ}}_{t - k, k + 1}

,

{\hat{r}}_{k + 1}

and

{\hat{s}}_{i^{*}}

values resulting from the GLM model and it is necessary to apply Equations (63) and (76) formulae linked to the MSEP and CDR one-year closed tool estimate that is written on the CT column of Table 6. As we can see from results of Table 6, the proposed formulae to estimate the MSEP produce results similar to the ones calculated with simulation techniques—re-reserving—, bearing in mind that these results include the simulation error.

The ratio between one year and ultimate volatility makes evident the long tail nature of this general liability’s triangle. With regard to the new estimated parameters

α

’s, q’s, r’s and s’s, we can provide some interpretations:

the coefficients of credibility alpha’s quantify the weight, in terms of influence, of the accident year in the next development factor calculation (decreasing for the more recent accident year);
the q’s indicate the contribution to the overall volatility from the first development year to the latest development year and form a typical u-shape due to the level of the payment in the first year and to the small uncertainty for the oldest years for which residual payments relative to the still open claims over the ultimate cost are low;
the r’s stand for the weight of the k-th development year parameter above the first k parameters decreasing with the increase of development year;
the s’s involved directly in the accident year volatility is a function based on r interesting the next year development for this accident year plus the subsequent development year with credibility decreasing coefficients.

All of the results shown in this section are obtained by using R software; the code is reported in vignettes 1, 2 and 3. The package ChainLadder is requested and must be previously installed because it is not included in the default configuration. In order to replicate results in Table 3, Code 1 has to be run after the run-off triangle showed in Table 2 is uploaded and named Incremetal.Paid. Code 2 computes the outcome of Table 7 and the R Code 3 gives as outcome Table 4, Table 5 and Table 6.

Listing 1. Code for Table 3: Estimation GLM parameters.

Listing 2. Code for Table 7: Estimation rMSEP ultimate.

Listing 3. Code for Table 4, Table 5 and Table 6: Estimation rMSEP one-year.

7. Conclusions

In this work, after a short review about the claims reserve volatility in the GLM framework— calculated in an ultimate view—we have drawn the formulae to calculate the one-year volatilities of the specific ODP model. Nonetheless, while the ultimate view is drawn on the basis of the incremental payments under an indipendence general assumption, the second one year volatility is derived through the use of the chain ladder cumulative payments estimate. The consequence is the one-year volatility of a specific accident year is calculated also according to the previous accident year estimates, similarly to the Merz–Wüthrich model.

Therefore, these formulae could generally be a valid alternative to the Merz–Wüthrich formulae regarding volatility parameters, or undertaking specific parameters (USP), in order to calculate the Solvency Capital Requirement according to the Solvency II framework. Furthermore, these results could also be used to solve the potential distortions caused by approaches such as, for instance, the emergence pattern introduced by England and linked to the single accident year (Casualty Actuaries of Europe Fall Meeting 2009, Zurich). This approach has been mainly developed to overcome the cases where the bootstrapping is not appropriate, like, for example, due to triangle stability matters or computational reasons linked to the re-reserving.

To summarize, the emergence-pattern is based on the assumption that volatility at the ultimate level gradually emerges in time, so that, if a total volatility estimate can be determined, mechanisms that allow for making it emerge by using a specific pattern can also be created. In this case, we assume the Claims Development Result (CDR) to be a best estimate function; as a consequence, the CDR standard deviation is calculated through this relation, by estimating one factor for each of the observed accident years.

Basically, the calibration problem is dealt with by building the factors on the ratios between the CDR and the ultimate cost standard deviations for each accident year—followed by the proper equalizations to calibrate one pattern applicable to each accident year.

Indeed, the distortion can appear when the ultimate view volatilities are obtained by using the ODP—through a closed-tool or bootstrapping technique—but are then distributed in time by using factors drawn from the ratios between Merz–Wüthrich and Mack formulae, thus drawn from a different model. The results from this paper will make these approaches—typical of current practice in the internal models building—more coherent.

Author Contributions

These authors contributed equally to this work.

Funding

This research received no external funding.

Acknowledgments

We thank Riccardo Cesari, Milena Nocente and Valentino Pompili for helpful comments in a previous version of this paper. Stefano Cavastracci and Agostino Tripodi would like to remark that this article reflects the personal view of the authors and not necessarily that of IVASS.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

BS	Bootstrap
CDR	Claims Develpoment Result
CT	Closed Tool
GLM	Generalized Linear Model
MSEP	Mean Square Error of Prediction
rMSEP	$\sqrt{M S E P}$
USP	Undertaking Specific Parameters

References

Antonio, Katrien, and Jan Beirlant. 2008. Issues in claims reserving and credibility: A semiparametric approach with mixed models. Journal of Risk and Insurance 75: 643–76. [Google Scholar] [CrossRef]
Bjökwall, Susanna, Hössjer Ola, Ohlsson Esbjörn, and Richard Verrall. 2011. A generalized linear model with smoothing effects for claims reserving. Insurance: Mathematics and Economics 49: 27–37. [Google Scholar] [Green Version]
Despeyroux, Aurélie, Charles Levi, Christian Partrat, and Jerôme Vignancour. 2003. Techniques for valuation a general insurance company within the framework of IAS standards: Some proposals. Paper presented at the XXXIV International ASTIN Colloquium, Berlin, Germany, August 24–27. [Google Scholar]
England, Peter D., and Richard J. Verrall. 2001. A flexible framework for stochastic claims reserving. Proceedings of the Casualty Actuarial Society 88: 1–38. [Google Scholar]
England, Peter D., and Richard J. Verrall. 2002. Stochastic claims reserving in general insurance. British Actuarial Journal 8: 443–544. [Google Scholar] [CrossRef]
Gigante, Patrizia, and Luciano Sigalotti. 2004. Valutazione della riserva sinistri con i GLM nel contesto dei nuovi standard contabili. In Quaderni del Dipartimento di Matematica Applicata alle Scienze Economiche Statistiche e Attuariali "B. de Finetti". Trieste: Universitá di Trieste, vol. 6. [Google Scholar]
Hudecovà, Sarka, and Michal Pešta. 2013. Modeling Dependencies in Claims Reserving with GEE. Insurance: Mathematics and Economics 53: 786–94. [Google Scholar] [CrossRef]
Leong, Weng Kah, Shaun S. Wang, and Han Chen. 2014. Back-Testing the ODP Bootstrap of the Paid Chain-Ladder Model with Actual Historical Claims Data. Variance 8: 182–202. [Google Scholar]
Merz, Michael, and Mario V. Wüthrich. 2015. Stochastic Claims Reserving Manual: Advances in Dynamic Modeling. SSRN Manuscript 2649057. Geneva: Swiss Finance Institute. [Google Scholar]
Nelder, J. A., and D. Pregibon. 1987. An extended quasi-likelihood function. Biometrika 74: 221–32. [Google Scholar] [CrossRef]
Renshaw, A. E., and Richard Verrall. 1998. A stochastic model underlying the chain-ladder technique. British Actuarial Journal 4: 903–23. [Google Scholar] [CrossRef]
Röhr, Ancus. 2016. Chain Ladder and Error Propagation. ASTIN Bulletin 46: 1–38. [Google Scholar]
Taylor, Greg C., and Gráinne McGuire. 2004. Loss Reserving with GLMs: A Case Study. Research Paper 113. Centre for Actuarial Studies, University of Melbourn, Australia. Sydney: Institute of Actuaries, pp. 489–99. [Google Scholar]
Taylor, Greg C., and Gráinne McGuire. 2016. Stochastic Loss Reserving Using Generalized Linear Models. CAS Monograph No. 3. Arlington: Casualty Actuarial Society. [Google Scholar]
Venter, Gary G. 2007. Generalized Linear Models beyond the Exponential Family with Loss Reserve Applications. ASTIN Bulletin 37: 345–64. [Google Scholar] [CrossRef]
Wedderburn, R. W. M. 1971. Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method. Biometrika 61: 439–47. [Google Scholar]

1	Robert W. M. Wedderburn (1947–1975) could have become one of the most distinguished statistics experts of his time due to his early works in this field, but he died at the young age of 28 because of an anaphylactic shock caused by a wasp bite.
2	A useful reference about backtesting in relation to the use of the ultimate volatility of predictions is in Leong et al. (2014), winner of the Variance Prize.
3	The key to understanding how we have calculated these derivatives is that these residuals fluctuate around zero, the first exactly on average and the second only with the condition of a potential bias in the maximum likelihood estimate. Thus, the replacement of the empirical and the estimated figure with the hypothetical unknown value—respectively for $ξ$ and for $ζ$ —is possible.

Table 1. Run-off triangle of cumulative payment.

$i / j$	0	1	⋯	j	⋯	J
1	$Y_{10}$	$Y_{11}$	⋯	$Y_{1 j}$	⋯	$Y_{1 J}$
2	$Y_{20}$	$Y_{21}$	⋯
⋮	⋮
i	$Y_{i 0}$		$Y_{i j}$
⋮	⋮
I	$Y_{I 0}$

Table 2. Incremental payment used in the empirical application (,000).

i/j	0	1	2	3	4	5	6	7	8	9	10	11	12
1	22,603	39,938	35,073	25,549	20,031	17,593	14,930	15,004	10,319	8240	8104	6020	19,145
2	22,382	41,502	26,508	19,734	18,715	13,983	12,885	16,371	7921	7204	4428	12,897
3	25,355	45,707	33,062	24,232	16,765	13,180	11,639	8864	9994	6044	3954
4	26,830	52,347	37,324	23,590	18,248	13,895	13,142	11,119	9429	5057
5	26,868	62,313	33,772	22,925	16,341	12,419	12,646	9459	6658
6	28,470	56,097	41,672	24,843	22,818	18,787	16,947	14,942
7	26,170	55,362	39,026	26,817	22,881	19,663	19,395
8	24,101	58,520	38,749	22,449	16,008	12,506
9	22,714	48,707	28,970	18,798	13,369
10	19,973	38,262	23,298	14,819
11	17,252	36,994	24,361
12	17,591	30,074
13	16,907

Table 3. Estimation of GLM parameter using the data in Table 2.

Parameter	Estimate	Std. Error	Parameter	Estimate	Std. Error
$\hat{c}$	10.1263	0.0572
${\hat{a}}_{2}$	−0.0883	0.0620	${\hat{b}}_{1}$	0.7024	0.0468
${\hat{a}}_{3}$	−0.0715	0.0629	${\hat{b}}_{2}$	0.3132	0.0513
${\hat{a}}_{4}$	0.0155	0.0620	${\hat{b}}_{3}$	−0.0972	0.0579
${\hat{a}}_{5}$	0.0126	0.0628	${\hat{b}}_{4}$	−0.3241	0.0635
${\hat{a}}_{6}$	0.1579	0.0614	${\hat{b}}_{5}$	−0.5254	0.0703
${\hat{a}}_{7}$	0.1551	0.0627	${\hat{b}}_{6}$	−0.5737	0.0753
${\hat{a}}_{8}$	0.0425	0.0662	${\hat{b}}_{7}$	−0.6904	0.0843
${\hat{a}}_{9}$	−0.1261	0.0716	${\hat{b}}_{8}$	−1.0112	0.1051
${\hat{a}}_{10}$	−0.3171	0.0795	${\hat{b}}_{9}$	−1.2910	0.1317
${\hat{a}}_{11}$	−0.3326	0.0858	${\hat{b}}_{10}$	−1.4622	0.1643
${\hat{a}}_{12}$	−0.4592	0.1044	${\hat{b}}_{11}$	−-0.9285	0.1553
${\hat{a}}_{13}$	−0.3909	0.1660	${\hat{b}}_{12}$	−0.2665	0.1573

Table 4. Estimation of

α

, q and

μ

.

Table 4. Estimation of

α

, q and

μ

.

k	$13 - k$	${\hat{α}}_{k}^{(13)}$	${\hat{q}}_{k + 1}$	${\hat{μ}}_{t - k, k + 1}$	${\hat{r}}_{k + 1}^{(13)}$
0	13	0.0569	0.0415	34127.94	0.6687
1	12	0.0563	0.0192	21598.78	0.3118
2	11	0.0677	0.0127	16260.70	0.1714
3	10	0.0738	0.0097	13162.94	0.1202
4	9	0.0965	0.0094	13026.95	0.0895
5	8	0.1264	0.0108	14693.99	0.0786
6	7	0.1619	0.0115	14633.21	0.0653
7	6	0.1937	0.0096	10647.17	0.0453
8	5	0.2077	0.0075	6959.96	0.0331
9	4	0.2630	0.0078	5882.08	0.0271
10	3	0.3271	0.0158	9194.30	0.0442
11	2	0.4779	0.0412	17527.56	0.0789

Table 5. Estimation

s_{(i^{*}) .}

Table 5. Estimation

s_{(i^{*}) .}

${\hat{s}}_{2}$	${\hat{s}}_{3}$	${\hat{s}}_{4}$	${\hat{s}}_{5}$	${\hat{s}}_{6}$	${\hat{s}}_{7}$	${\hat{s}}_{8}$	${\hat{s}}_{9}$	${\hat{s}}_{10}$	${\hat{s}}_{11}$	${\hat{s}}_{12}$	${\hat{s}}_{13}$
0.0789	0.0442	0.0271	0.0331	0.0453	0.0653	0.0786	0.0895	0.1202	0.1714	0.3118	0.6687
	0.0377	0.0145	0.0071	0.0069	0.0088	0.0106	0.0099	0.0086	0.0089	0.0116	0.0176
		0.0377	0.0145	0.0071	0.0069	0.0088	0.0106	0.0099	0.0086	0.0089	0.0116
			0.0377	0.0145	0.0071	0.0069	0.0088	0.0106	0.0099	0.0086	0.0089
				0.0377	0.0145	0.0071	0.0069	0.0088	0.0106	0.0099	0.0086
					0.0377	0.0145	0.0071	0.0069	0.0088	0.0106	0.0099
						0.0377	0.0145	0.0071	0.0069	0.0088	0.0106
							0.0377	0.0145	0.0071	0.0069	0.0088
								0.0377	0.0145	0.0071	0.0069
									0.0377	0.0145	0.0071
										0.0377	0.0145
											0.0377

Table 6. Estimation of rMSEP one-year: bootstrapping vs. closed tool.

a.y.	$\hat{rMSEP} ({CDR}_{i^{*}, t + 1})$		$Δ$ %	$σ$ %		$Δ$	$\frac{\hat{rMSEP} ({CDR}_{i^{*}, t + 1})}{\hat{rMSEP} ({\hat{R}}_{i})}$
a.y.	BS	CT	$Δ$ %	BS	CT	$Δ$	BS	CT
1	0	0	-	-	-	-	-	-
2	3888	3870	−0.46%	22.12%	22.08%	−0.04%	100.00%	100.00%
3	3238	3234	−0.12%	11.96%	11.97%	0.01%	68.54%	68.52%
4	3083	3073	−0.32%	8.70%	8.69%	−0.01%	56.59%	56.47%
5	3242	3233	−0.28%	7.67%	7.66%	−0.01%	54.97%	54.98%
6	3980	3969	−0.28%	6.68%	6.67%	−0.01%	55.91%	55.72%
7	4477	4473	−0.09%	6.05%	6.05%	0.00%	56.53%	56.43%
8	4494	4490	−0.09%	5.56%	5.56%	0.00%	54.46%	54.53%
9	4319	4333	0.32%	5.31%	5.33%	0.02%	52.08%	52.24%
10	4535	4538	0.07%	5.64%	5.65%	0.01%	53.53%	53.50%
11	5705	5691	−0.25%	5.98%	5.97%	−0.01%	57.11%	56.98%
12	8364	8341	−0.27%	7.91%	7.90%	−0.01%	67.22%	67.34%
13	21,651	21,616	−0.16%	14.69%	14.69%	0.00%	86.09%	86.17%
Tot	38,603	38,578	−0.06%	4.56%	4.56%	0.00%	73.09%	73.18%

Table 7. Estimation of rMSEP ultimate: bootstrapping vs. closed tool.

a.y.	${\hat{R}}_{i}$		$Δ$ %	$\hat{rMSEP} ({\hat{R}}_{i})$		$Δ$ %
a.y.	BS	CT	$Δ$ %	BS	CT	$Δ$ %
1	0	0	0	0	-	-
2	17,573	17,528	−0.26%	3888	3870	−0.46%
3	27,068	27,018	−0.18%	4724	4720	−0.08%
4	35,429	35,356	−0.21%	5448	5442	−0.11%
5	42,295	42,212	−0.20%	5898	5880	−0.31%
6	59,560	59,463	−0.16%	7118	7123	0.07%
7	74,021	73,930	−0.12%	7920	7926	0.08%
8	80,879	80,752	−0.16%	8252	8234	−0.22%
9	81,354	81,245	−0.13%	8293	8295	0.02%
10	80,401	80,285	−0.14%	8472	8483	0.13%
11	95,412	95,309	−0.11%	9989	9988	−0.01%
12	105,715	105,579	−0.13%	12,443	12,386	−0.46%
13	147,336	147,172	−0.11%	25,149	25,085	−0.25%
Tot	847,041	845,851	−0.14%	52,813	52,714	−0.19%

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Strascia, S.C.; Tripodi, A. Overdispersed-Poisson Model in Claims Reserving: Closed Tool for One-Year Volatility in GLM Framework. Risks 2018, 6, 139. https://0-doi-org.brum.beds.ac.uk/10.3390/risks6040139

AMA Style

Strascia SC, Tripodi A. Overdispersed-Poisson Model in Claims Reserving: Closed Tool for One-Year Volatility in GLM Framework. Risks. 2018; 6(4):139. https://0-doi-org.brum.beds.ac.uk/10.3390/risks6040139

Chicago/Turabian Style

Strascia, Stefano Cavastracci, and Agostino Tripodi. 2018. "Overdispersed-Poisson Model in Claims Reserving: Closed Tool for One-Year Volatility in GLM Framework" Risks 6, no. 4: 139. https://0-doi-org.brum.beds.ac.uk/10.3390/risks6040139

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Overdispersed-Poisson Model in Claims Reserving: Closed Tool for One-Year Volatility in GLM Framework

Abstract

1. Introduction

2. Claims Reserve Estimation

2.1. Data Organization

2.2. Chain Ladder Method: Basic Concept

2.3. The Claims Development Result

3. Generalized Linear Models to Estimate the Claims Reserve

3.1. GLM Models Structure

3.2. Semi-Parametrical Models

3.3. Elements for the Observed Data Goodness of Fit

4. The Claims Reserve Mean Square Error of Prediction

4.1. The General Case

4.2. GLMs Implementation in Claims Reserving

5. One-Year Volatility for the Claims Development Results

5.1. Overall Accident Years Estimate

5.2. One-Year Volatility for Accident Year

6. Numerical Investigation

7. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI