---
title: "Insurance portfolio"
author: "Daphné Giorgi, Sarah Kaakai, Vincent Lemaire"
pkgdown:
as_is: true
output:
bookdown::pdf_document2:
toc: yes
toc_depth: 3
number_sections: yes
package: IBMPopSim
vignette: >
%\VignetteIndexEntry{Insurance portfolio}
%\VignetteEncoding{UTF-8}
\usepackage[utf8]{inputenc}
%\VignetteEngine{knitr::rmarkdown}
---
This vignette provides an example of usage of the package `IBMPopSim`, for simulating an heterogeneous life insurance portfolio.
See `vignette('IBMPopSim')` for a detailed presentation of the package.
# Example description
We consider an insurance portfolio composed of males policyholders, characterized by their age and risk class (for instance smokers vs non smokers).
Entries to the portfolio are modeled by Poissonian events, and individuals can die or exit the portfolio with an intensity depending on their age and risk class.
# Population creation
We start with an initial population of 10 000 males of age 65, distributed uniformly in each risk class.
The population data frame has thus the two (mandatory) columns:
- `birth` : Date of birth of the policyholders (here the initial time is $t_0=0$).
- `death`: Date of death (`NA` if alive).
and the column `risk_cls` corresponding to the policyholders risk class.
Since individuals are free to get or interrupt a life insurance at any time, in this type of population there are entry and exit events, hence we create the population with the flags 'entry and 'out' set to `TRUE`. This will create 2 additional columns in the population for recording entry and exit events. At creation, `entry` values are set to `NA` and `out` values are set to `FALSE`.
``` r
N <- 30000
pop_df <- data.frame("birth"=rep(-65,N),"death"=rep(NA,N),"risk_cls"= rep(1:2,each= N/2))
pop_init <- population(pop_df, entry=TRUE, out=TRUE)
```
# Model parameters
## Death intensity
Death intensities are calibrated on England and Wales (EW) males mortality data (source: Human Mortality Database), and forecasted over t=30 years by a Lee-Carter model with the package `StMoMo`.
Individuals in the risk class 1 are assumed to have mortality rates 20% higher that the forecasted mortality rates and individuals in the risk class 2 are assumed to have mortality rates 20% lower than the forecasted rates. The death intensity of a individual in risk class $i=1,2$ is thus the function
\begin{equation}
(\#eq:deathrates) d^i(t,a)= \alpha_i \sum_{k=0}^{29}\mathbf{1}_{\{k\leq t < k+1\}} d_k(a), \quad \alpha_1 = 1.2, \quad \alpha_2 = 0.8,
\end{equation}
and where $d_k(a)$ is the point estimate of the mortality rate for age $a$ and year $2017 + k$.
``` r
EWStMoMoMale <- StMoMoData(EWdata_hmd, series = "male")
#Fitting
LC <- lc()
ages.fit <- 65:100
years.fit <- 1950:2016
LCfitMale <- fit(LC, data = EWStMoMoMale, ages.fit = ages.fit, years.fit = years.fit)
## StMoMo: Start fitting with gnm
## Initialising
## Running start-up iterations..
## Running main iterations.....
## Done
## StMoMo: Finish fitting with gnm
```
``` r
t <- 30
LCforecastMale <- forecast(LCfitMale, h = t)
plot(LCforecastMale)
```
```{r, echo=FALSE, fig.align='center', out.width = '70%'}
knitr::include_graphics("insur_forecast-1.png")
```
Age and time dependent function can be created with the package function `?piecewise_xy`, with allows to define age-specific mortality rates, piecewise constant in time, such as $d^i$ in \@ref(eq:deathrates).
``` r
d_k <- apply(LCforecastMale$rates, 2, function(x) stepfun(66:100, x))
breaks <- 1:29
death_male <- piecewise_xy(breaks,d_k)
```
``` r
death_male(10,65) # Death rate at time t=10 (years 2027) and age 60.
## [1] 0.009082013
```
``` r
params <- list("death_male" = death_male, "alpha" = c(1.3,0.8))
```
## Exit events
Individuals exit the portfolio at a rate $\mu^{i}$, $i=1,2$ depending on their risk class.
``` r
params$mu <- c(0.001,0.06) # Exit event rate
```
## Entry events
Entry events occur at a Poissonian (constant) rate $\lambda$ (in average $\lambda$ individuals enter the population each year).
``` r
params$lambda <- 30000 # Entry events
```
# Events creation
## Death events creation
The vector parameter `alpha` will be transformed into a C++ vector (with index starting at 0) during the model creation, and `death_male` into a C++ function.
``` r
death_event <- mk_event_individual(
type = "death",
intensity_code = "result = alpha[I.risk_cls-1] * death_male(t,age(I, t));"
)
```
## Exit events creation
Each individual in the portfolio can exit the portfolio at a constant (individual) rate $\mu^i$ depending on their risk class.
In the presence of events of type `exit`, the population must have a characteristic named `out`, which is set to `FALSE` by default (see function '?population').
When an individual leaves the population, his characteristic `out` is set to `TRUE` and the date at which he exited the population is recorded in the column `death`.
``` r
exit_event <- mk_event_individual(
type = "exit",
intensity = "result = mu[I.risk_cls-1]; "
)
```
## Entry event creation
New policyholders enter the population at a constant rate $\lambda$ (on average, $\lambda$ individuals enter the portfolio each year). A new individual entering the population at age $a$ given by a uniform variable on $[65,70]$, and is in risk class 1 with probability $p$.
In the presence of events of type `entry`, the population must have a characteristic named `entry`, initially set to `NA` by default (see function '?population').
When an individual enters the population, his characteristic `entry` is set to the date $t$ at which he enters the population.
``` r
params$p <- 0.5
```
``` r
entry_event <- mk_event_poisson(
type = "entry",
intensity = "lambda",
kernel_code = "if (CUnif()
Note that entries can also occur at a rate $\lambda(t)$ depending on time. For more details, see documentation of `?mk_event_poisson_inhomogeneous`.
# Model creation
``` r
model <- mk_model(
characteristics = get_characteristics(pop_init), # Characteristics names and types
events = list(death_event,entry_event, exit_event), # Events list
parameters = params # Model parameters
)
summary(model)
## Events description:
## [[1]]
## Event class : individual
## Event type : death
## Event name : death
## Intensity code : 'result = alpha[I.risk_cls-1] * death_male(t,age(I, t));'
## Kernel code : ''
## [[2]]
## Event class : poisson
## Event type : entry
## Event name : entry
## Intensity code : 'lambda'
## Kernel code : 'if (CUnif()
```{r, echo=FALSE, fig.align='center', out.width = '70%'}
knitr::include_graphics("insur_pyramid-1.png")
```
In order to visualize both risk classes on the same age pyramid with `?plot.pyramid` a column `group_name` containing the individuals risk classes must be added to the age pyramid, and the colors representing each subgroup have to be specified.
``` r
colnames(pyr)[2]<- "group_name"
pyr$group_name <- as.character(pyr$group_name)
colors <- c("1"="#00AFBB","2"="#FC4E07")
plot(pyr,colors,age_breaks = as.integer(seq(1,length(age_grp)-1,by=2)))
```
```{r, echo=FALSE, fig.align='center', out.width = '70%'}
knitr::include_graphics("insur_pyr_group-1.png")
```
## Life tables
Death and exposure tables can be computed from the simulation, taking into account the censoring due to exit events.
Note that when individuals enter the population at different ages, the functions `?death_table` and `?exposure_table` do not take into account this right censoring.
### Risk class 1
Death and exposure data can be computed from the simulated portfolio. Below is an example with individuals in risk class 1. A Lee-Carter model is reestimated from the simulated data, and compared with the initial forecast.
``` r
age_grp <- 65:95
```
``` r
Dx <- death_table(sim_out$population[sim_out$population$risk_cls==1,],
ages = age_grp, period = 0:30)
Ex <- exposure_table(sim_out$population[sim_out$population$risk_cls==1,],
ages = age_grp, period = 0:30)
```
``` r
LC <- lc()
LCfitSim1 <- fit(LC, Dxt = Dx , Ext = Ex,ages=age_grp[-length(age_grp)])
```
```{r, echo=FALSE, fig.align='center', out.width = '70%'}
knitr::include_graphics("insur_LeeCarter_risk1-1.png")
```
### Global portfolio
Due to the mortality differential between risk class 1 and 2, there should be more individuals in risk class 2 at higher ages.
However, due to exit events, more individuals in risk class 1 exit the portfolio over time, leading to a higher proportion of individuals in risk class 1 at higher ages than expected when there are no exit events. Thus, mortality rates are closer to mortality rates in risk class 1 at higher ages.
**Computation of central mortality rates**
``` r
Dx_pop <- death_table(sim_out$population, ages = age_grp, period = 0:30)
Ex_pop <- exposure_table(sim_out$population, ages = age_grp, period = 0:30)
mx_pop <- Dx_pop/Ex_pop
```
```{r, echo=FALSE, fig.align='center', out.width = '70%'}
knitr::include_graphics("insur_mortality_rates-1.png")
```
**Evolution of portfolio mortality rates over time**
The decrease in mortality rates is slower in the portfolio than in the global population due to composition changes.
```{r, echo=FALSE, fig.align='center', out.width = '70%'}
knitr::include_graphics("insur_evol_mortality-1.png")
```
**Cohort mortality rates**
```{r, echo=FALSE, fig.align='center', out.width = '70%'}
knitr::include_graphics("insur_cohort-1.png")
```
# Different simulations with the same model
The initial population and model parameters can be modified without have to recompile the model, in order to simulate the population model with different input. The event bounds must be modified accordingly before running again the simulation.
An event can also be deactivated by setting the event bound to 0.
See `vignette('IBMPopSim_human_pop')` for several examples.