14/06/2015 1 A dynamic parameterization modeling for the age-period- cohort mortality The case of...

49
03/25/22 1 A dynamic parameterization modeling for the age-period-cohort mortality The case of E&W males mortality experience 1841-2006
  • date post

    19-Dec-2015
  • Category

    Documents

  • view

    215
  • download

    0

Transcript of 14/06/2015 1 A dynamic parameterization modeling for the age-period- cohort mortality The case of...

04/18/23 1

A dynamic parameterization modeling for the age-period-cohort

mortality

The case of E&W males mortality experience

1841-2006

04/18/23 2

Two important, general approaches have been developed for modeling and forecasting mortality, in discrete-time:

The curve fitting approach

and

The principal component (PC) approach

04/18/23 3

The curve fitting approach involves fitting parametric curves to the cross-sectional mortality rates in age effects, and extrapolating the parameter estimates in time effects

As mentioned by many authors, in the curve fitting approach, the deficiency of this approach comes from the high parameter time series interdependencies which cause serious difficulties for forecasting and for estimating the forecast prediction-intervals.

Building a multivariate time series for the parameters is theoretically possible but seriously complicates the approach and can lead to computational intractability.

Recent developments in mortality modeling have tended to be extrapolative in nature, with the principal components (PC) approach being prominent. The PC approach involves computing PCs to the table of age-time-specific (log)-mortality rates, to obtain a linear transformation of the data in lower dimension and simplified structure, and extrapolating the PCs.

The main statistical tool used was least-squares estimation via singular value decomposition (SVD) of the matrix of the log age specific observed forces of mortality.

04/18/23 5

The most well known PC approach is the Lee - Carter (LC) model (1992) :

where denote the singular value and respective left and right singular vectors of

, ,log( )-tx t x x x t t xz m k

( ) ( )x tk s u x v t , ( ), ( )s u x v x

txz

04/18/23 6

Under the proposed approach, we build a GLM, modeling the actual number of deaths as independent realizations of Poisson random variables, for each calendar year independently, treating age as an explanatory variate, and using a log link function and (orthonormal) polynomial predictor structure in age effects:

or

, 1 , 11

ˆ expk

tx t j x jj

m L

ˆ expt tm L

The optimum degree k for the design matrix (orthonormal polynomials) can be determined by maximizing the Bayesian Information Criterion (BIC) :

where, l is the (maximum) value of the total likelihood for all the n-calendar years (with n*a observations and n*k parameters).

04/18/23 7

2 ( ) ( ) ( )BIC log l n k log n a

04/18/23 8

Next we can apply SVD to the random matrix B to extract the mortality dynamics in age-time effects. Then, the vector of the graduated log-central death rate, for all ages, in year t, inherits the structure of a dynamic mortality factor model:

where the random vector denotes the main age profile, is the matrix of the factor loadings, is the matrix of PC scores, and P is the matrix of eigenvectors.

rt tF P

rt tF P rt tb b b= -rt tb b b= -

ˆlog( )t tm A C F

A L C L P

Sparse Principal Component Analysis

Although PC analysis is a classic tool for analyzing multivariate data, by reducing the dimensionality of the data, one of the key shortcomings of PC analysis however is that these factors or PCs are linear combinations of all variables, that is, all factor coefficients (or loadings) are non-zero.

Additionally, if the input data consists of non-stationary time series, a single linear combination of all the time series can solely explain the variability, because simultaneous drifting of the series may register as correlations of the columns.

04/18/23 9

Further, in empirical studies with various mortality experiences, using PCA, results in factor loadings structures with significant values for both positive and negative loadings.

This undesirable feature gives high interdependence structure for different range of ages, explaining different mortality dynamics with the same factor, resulting in spurious interpretations and forecasting.

SPCA improves this problematic dependent structure by giving a better clustering of significant high factor loading values, seeking a trade-off between the two goals of expressive power (explaining most of the variance) and interpretability (making sure that the factors involve only a few variables).

Luss and Aspremont (2006), solves a penalized formulation of the problem (given a covariance matrix

find sparse factors which explains the maximum amount of variance):

Maximize subject to

Where λ is a scalar which defines the sparsity (higher values of λ gives more sparsity).

04/18/23 11

( ) 1 | | 1Tr A X X

( ) 1Tr X 0X kX S

kA S

Thus, applying SPCA to the random matrix B and keeping an optimum subset p(<k) of the SPCs, which explains the “majority” of the variance, leads to

Where denote the first p-columns of the matrix of sparse factor loadings and the SPC scores respectively, and is the error component which denotes the deviation of the model represented by the excluded SPCs.

04/18/23 12

ˆlog( )= p pt t tm A C F u

&p ptC F

tu

04/18/23 13

, , ,1

uˆlog( )= +p

tx x t i x i t t xi

m A b d F

1 , 11

k

x j x jj

A L

After some simple algebraically manipulations we obtain

,1

p

t i i ti

b m F

, , , 12

k

i x j i x jj

d e L

,0 1,i x im L e

The model now can be viewed as a member of the class of log-linear models, with age and time being the two main random effects (similar to the so-called association model of Goodman, 1991).

Optimum number of SPCs

The values are very informative for the number of components retained, since

1. The decomposition implies that the value can be viewed as mean indices of the factor loadings (or associations) for all the ages with the i-factor

2. The value is a measure of the importance (or a weight) for the i-factor in the construction of the time additive term

04/18/23 14

, ,i x i i xC m d

im

tb

im

Confidence intervals for the values can be constructed by bootstrapping techniques. Alternatively, if we just count the number the values change sign in the bootstrap procedure, then the rate of times where the sign is changed (say a%) will define a confidence coefficient or a confidence level (CL=1-a%) for each SPC. If the values do not change sign in the bootstrap procedure (at any given significant level), it is an indication of robustness and the significance of the particular factor.

Next, we can choose the ‘optimum’ value λ (the scalar which defines the sparsity in the SPCA), to be the value that maximizes the number of significant SPC, as defined by the CL criterion (at a given significance level).

04/18/23 15

im

im

Then, among these significant SPC, we can specify the key age groups by taking, for each age, the maximum association value (MAV), defined by:

It is desirable that the above method should produce distinct key age groups of neighbouring values, which belong to the same components. This desirable feature would lead to a clear picture for the mortality dynamics.

max,

{ }maxx i x

iC C

maxxC

Cohort effects

04/18/23 17

In the presence of cohort effects, the GLM estimates are marginal estimates of age and period effects. If we settle for the residual cohort effect conditional on the already estimated age and period effects we have:

In which the term is treated as

an offset in the rectangular DEBF.

04/18/23 18

2

, , , 1 , 11 1

log( ( )) log( )p k

tx tx x i x i t c j x ji j

E D R A C F b L- -= =

×= + + × +å å

, ,1

log( )p

tx x i x i ti

R A C F=

+ + ×å

04/18/23 19

In the same way with the age period effects we apply the eigenvalue decomposition to the covariance matrix of these conditional (cohort) GLM estimates, leading to graduated central mortality rates in age – period – cohort effects:

defined in the rectangular DEBF.

Forecast estimates become (area DCGF)

, , , , ,1 1

ˆlog( )=p q

tx x i x i t i x i t x t x xi i

m A C F D H

, , , , ,1 1

ˆlog( )=p q

tx x i x i t i x i t x t x xi i

m A C F D H

, , , , ,1 1

ˆˆ ˆ ˆˆlog( )=p q

t x x i x i t i x i t xi i

m A C F D H

Using the same arguments, the log-graduated central mortality rates, can be alternatively decomposed as a member of the class of log-linear models with three main random effects:

where denotes the independent main cohort effect:

for and

04/18/23 20

, , , , ,1 1

ˆˆlog( )=p q

ctx x t t x i x i t i x i t x t x x

i i

m A b c d F d H

t xc

,1

ˆq

ct x i i t x

i

c m H

,0 1,c c ci x im L e , , , 1

2

kc c ci x j i x j

j

d e L

04/18/23 21

ForecastingFor forecast purposes, for each time SPC, we advocate a specific class of dynamic linear regression (dlr) models under state space models and Kalman filter techniques:

for each calendar year t (the so-called regressor) and for each time related SPC (i=1,…,p), with the slope as stochastic time variable parameter (TVP) that follow a random walk (RW) process.

The cohort related PCs are modelled as independent RW plus noise time series model, since conditional on age-time effects, we may assume that the cohort related PCs are mean reverted stochastic processes

, , ,i t i i t i tF b t e

Confidence Intervals (CI) for the true mortality rates

There are several sources of randomness in the proposed model. The true value of the log central mortality rates, assuming the model specification is correct, is given by

Where e( ) denotes the errors in estimating the relevant quantity, is the residual error in fitting the model for age x and cohort t-x using a finite set of PC functions, and is the observation error which comes from the random variation of deaths in the Poisson distribution.

04/18/23 22

, , , , ,1

ˆ ˆ ˆlog( )p

t x x x i x i x i t i ti

m A e A C e C F e F

, , , , , ,1

ˆˆ ˆ ˆ eq

i x i x i t x i t x t x x t x xi

D e D H e H

,t x x

,et x x

Assuming independence among all those different sources of error the total variance is given by

04/18/23 23

, , ,,

2 2 2 2 2 2 2, ,

1 1

ˆ ˆt x x x i x i xi t

p p

E i x i te A e C e Ce Fi i

C F

, ,, ,,

2 2 2 2 2 2 2, , eˆ

1 1

ˆˆ ˆt x x t x xi x i xi t x

q q

i x i t xe D e De Hi i

D H

In order to generate an interval forecast, we can obtain the following 95% confidence interval estimate for :

Where

if

,t xm

, ,1.96 1.96

, , ,ˆ ˆˆ ˆ( ) ,E Et x x t x x

t x t x t xCI m m e m e

, , ,ˆ ˆˆ ˆ= expt x x x t x c xm a b x f t f f

,1

ˆ ˆp

px i x i

i

f b a

, , ,1

ˆ ˆp

pt x i x i t

i

f b b

,1

ˆ ˆp

px i x i

i

f b a

, , ,1

ˆ ˆp

pt x i x i t

i

f b b

, , ,1

ˆ ˆq

cc x i c i x

i

f b b

The construction of complete lifetableFrom various mortality investigations, it has been observed that the number of people surviving up to older ages has increased considerably. Crude mortality rates at these ages are usually lack the required quality demanded for the construction of complete lifetables.

Empirical observations shows that the curve of mortality rates, in logarithmic scale, presents a concave shape for the oldest ages. One of the most well known approach to capture this empirical evidence is the Coale-Kisker method (1990), where assumes that the exponential rate of mortality increase at the oldest ages is not constant but declines linearly.

04/18/23 25

The exponential rate of mortality increase is defined as

at age x, up to ultimate age ω.

We note that in Gompertz law, i.e.

A possible extension to Gompertz law is proposed if we assume for the slope term, b, follows an integrated random walk (IRW) time series model in order to capture the empirical patterns of mortality at oldest ages, such that the projected exponential rate of mortality increase is a linear function of the oldest ages. A more robust approach is if we apply this modelling structure to the age profile

04/18/23 26

1

ln xx

x

mk

m

xk b ln xm a b x

xA

04/18/23 27

The case of E&W

The data

E&W male mortality experience, calendar years 1841-2006, and individual ages 0,1,…,89.

The data are freely provided by the “Human Mortality Database” (www.mortality.org).

, ,c x c ii xT L

³= å

04/18/23 28

SDR, in age, time and cohort effects, with k=25 degree for the design matrix

SDR, in age, time and cohort effects, with p=7 time SPC

04/18/23 29

SDR, in age and cohort effects, with q=3 cohort PCs (final model)

04/18/23 30

Sparse eigenvalues, percentage variance explained, mean associations, CL criterion, with

sparse parameter λ=36, in age-time effects

04/18/23 31

SPC index % var CL

1 6,3E+01 8,7E-01 7,08E-02 99,9%

2 8,2E+00 1,1E-01 7,33E-02 99,9%

3 1,0E+00 1,45E-02 -1,0E-02 100%

4 5,1E-02 7,2E-04 1,07E-03 100%

5 6,4E-04 3,6E-06 7,53E-04 99,9%

6 2,9E-04 1,1E-06 5,97E-04 98,5%

7 6,1E-8 2,4E-9 -7,59E-6 99,9%

8 2,7E-12 1,0E-14 -6,41E-7 71,0%

i im

iim

Eigenvalues, percentage variance explained, mean associations, CL criterion, in age-cohort

effects

SPC index % var CL

1 1,6E-02 14% 4,2E-02 92%

2 2E-02 18% -3E-02 92%

3 2,9E-02 26% 6E-02 80%

4 1,6E-02 10% -5,2E-02 76%

5 9,1E-03 9% -1,8E-03 77%

04/18/23 32

imci

Model Components for the age-time effects

04/18/23 33

xA

tb

04/18/23 34

1( )v x

1( )t

1( )v x1( )v x1( )v x

1( )v x

1,xC

1,xC

1,tF

04/18/23 35

2,xC

2,tF

04/18/23 36

3,xC

3,tF

04/18/23 37

4,xC

4,tF

04/18/23 38

5,xC

5,tF

04/18/23 39

6,xC

6,tF

7,xC

7,tF

Model Components for the age-cohort effects

04/18/23 41

cxA

t xc

04/18/23 42

1,xD

1,ˆ

t xH

04/18/23 43

2,xD

2,ˆ

t xH

3,xD

3,ˆ

t xH

Observed-projected cohort life expectancies at birth (left curves) and observed-projected cohort expected age at death

given survival at age 65 (right curves), with associated bootstrap 95% percentile CIs,

04/18/23 45

04/18/23 46

Log central mortality rates, smoothed-predicted log central mortality rates and

associated CI, for various ages

04/18/23 47

x=20x=10

x=30

x=40

04/18/23 48

x=50 x=60

x=70 x=80

Thank you