joint work with K. Antonio 1 and E.W. Frees 2 44th Actuarial Research Conference Madison, Wisconsin 30 Jul - 1 Aug 2009

Similar documents
Modeling. joint work with Jed Frees, U of Wisconsin - Madison. Travelers PASG (Predictive Analytics Study Group) Seminar Tuesday, 12 April 2016

Xiaoli Jin and Edward W. (Jed) Frees. August 6, 2013

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

Using micro-level automobile insurance data for macro-effects inference

Double Chain Ladder and Bornhutter-Ferguson

A Multivariate Analysis of Intercompany Loss Triangles

A Comprehensive, Non-Aggregated, Stochastic Approach to. Loss Development

Using micro-level automobile insurance data for macro-effects inference

1. You are given the following information about a stationary AR(2) model:

Statistical Analysis of Life Insurance Policy Termination and Survivorship

A multi-state approach and flexible payment distributions for microlevel reserving in general insurance

A Comprehensive, Non-Aggregated, Stochastic Approach to Loss Development

Risk Classification In Non-Life Insurance

APPROACHES TO VALIDATING METHODOLOGIES AND MODELS WITH INSURANCE APPLICATIONS

Dependent Loss Reserving Using Copulas

Individual Loss Reserving with the Multivariate Skew Normal Distribution

arxiv: v1 [q-fin.rm] 13 Dec 2016

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

From Double Chain Ladder To Double GLM

Validating the Double Chain Ladder Stochastic Claims Reserving Model

Actuarial Society of India EXAMINATIONS

PASS Sample Size Software

Estimation Procedure for Parametric Survival Distribution Without Covariates

Modelling Environmental Extremes

Stochastic Loss Reserving with Bayesian MCMC Models Revised March 31

Modelling Environmental Extremes

Loss reserving for individual claim-by-claim data

Dynamic Corporate Default Predictions Spot and Forward-Intensity Approaches

Longitudinal Modeling of Insurance Company Expenses

Analysis of truncated data with application to the operational risk estimation

SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS

RISK ADJUSTMENT FOR LOSS RESERVING BY A COST OF CAPITAL TECHNIQUE

Homework Problems Stat 479

Can we use kernel smoothing to estimate Value at Risk and Tail Value at Risk?

UPDATED IAA EDUCATION SYLLABUS

SOCIETY OF ACTUARIES Advanced Topics in General Insurance. Exam GIADV. Date: Thursday, May 1, 2014 Time: 2:00 p.m. 4:15 p.m.

A Comparison of Stochastic Loss Reserving Methods

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright

Risky Loss Distributions And Modeling the Loss Reserve Pay-out Tail

Motivation. Method. Results. Conclusions. Keywords.

The long road to enlightenment

Managing Systematic Mortality Risk in Life Annuities: An Application of Longevity Derivatives

Obtaining Predictive Distributions for Reserves Which Incorporate Expert Opinions R. Verrall A. Estimation of Policy Liabilities

Midterm Exam. b. What are the continuously compounded returns for the two stocks?

Log-Robust Portfolio Management

Estimating a Life Cycle Model with Unemployment and Human Capital Depreciation

Homework Problems Stat 479

Hierarchical Generalized Linear Models. Measurement Incorporated Hierarchical Linear Models Workshop

Multidimensional credibility: a Bayesian analysis. of policyholders holding multiple contracts

Practice Exam 1. Loss Amount Number of Losses

3.4 Copula approach for modeling default dependency. Two aspects of modeling the default times of several obligors

DRAFT. Half-Mack Stochastic Reserving. Frank Cuypers, Simone Dalessi. July 2013

Risk Margin Quantile Function Via Parametric and Non-Parametric Bayesian Quantile Regression

Simulation based claims reserving in general insurance

SOCIETY OF ACTUARIES/CASUALTY ACTUARIAL SOCIETY EXAM C CONSTRUCTION AND EVALUATION OF ACTUARIAL MODELS EXAM C SAMPLE QUESTIONS

Non parametric individual claims reserving

Multivariate Cox PH model with log-skew-normal frailties

Modeling dynamic diurnal patterns in high frequency financial data

Reserve Risk Modelling: Theoretical and Practical Aspects

Chapter 2 ( ) Fall 2012

Efficient Valuation of Large Variable Annuity Portfolios

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Modelling, Estimation and Hedging of Longevity Risk

Describing Uncertain Variables

Limit Theorems for the Empirical Distribution Function of Scaled Increments of Itô Semimartingales at high frequencies

Australian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model

REINSURANCE RATE-MAKING WITH PARAMETRIC AND NON-PARAMETRIC MODELS

Comments on Asset Allocation Strategies Based on Penalized Quantile Regression (Bonaccolto, Caporin & Paterlini)

BAYESIAN CLAIMS RESERVING* Enrique de Alba Instituto Tecnológico Autónomo de México (ITAM) Río Hondo No. 1 México, D.F MÉXICO

List of tables List of boxes List of screenshots Preface to the third edition Acknowledgements

Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods

Portfolio Optimization. Prof. Daniel P. Palomar

2017 IAA EDUCATION SYLLABUS

Homework Problems Stat 479

Multivariate longitudinal data analysis for actuarial applications

Logit with multiple alternatives

Contagion models with interacting default intensity processes

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

Survival Analysis APTS 2016/17 Preliminary material

UNIVERSITY OF OSLO. The Poisson model is a common model for claim frequency.

The Leveled Chain Ladder Model. for Stochastic Loss Reserving

Dependence Structure and Extreme Comovements in International Equity and Bond Markets

Smart Beta: Managing Diversification of Minimum Variance Portfolios

Exam STAM Practice Exam #1

Window Width Selection for L 2 Adjusted Quantile Regression

Content Added to the Updated IAA Education Syllabus

CMBS Default: A First Passage Time Approach

INTRODUCTION TO SURVIVAL ANALYSIS IN BUSINESS

Statistical Modeling Techniques for Reserve Ranges: A Simulation Approach

Back-Testing the ODP Bootstrap of the Paid Chain-Ladder Model with Actual Historical Claims Data

Clark. Outside of a few technical sections, this is a very process-oriented paper. Practice problems are key!

LTCI: a multi-state semi-markov model to describe the dependency process for elderly people

Simulation of Extreme Events in the Presence of Spatial Dependence

Non parametric individual claim reserving in insurance

Surrenders in a competing risks framework, application with the [FG99] model

Calibration of Interest Rates

Design of Engineering Experiments Part 9 Experiments with Random Factors

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Midas Margin Model SIX x-clear Ltd

City, University of London Institutional Repository

Transcription:

joint work with K. Antonio 1 and E.W. Frees 2 44th Actuarial Research Conference Madison, Wisconsin 30 Jul - 1 Aug 2009 University of Connecticut Storrs, Connecticut 1 U. of Amsterdam 2 U. of Wisconsin Madison page 1

Dynamics of claims Development present occurrence declaration payment settlement RBNS IBNR Calendar Time page 2

Synopsis Put focus on RBNS claims: Reported But Not Settled. Use micro-level data to predict future development of open claims. Develop a hierarchical model. for micro level. page 3

The data are from the General Insurance Association of Singapore. Observations are from one company over 10-year period: Jan 1993 Jul 2002. present moment in this case study is 25 Jul 2002. Policy file: characteristics of policyholder and vehicle insured age, gender, vehicle type, vehicle age,... Claims file: keeps track of each accident claim filed with the insurer linked to policy file, contains accident date. file: reports each payment made during observation period. linked to claims file, with payment date, size and type. page 4

The data A claim will have multiple payments during its run off. s may be: own damage (O) (including injury, property, fire, theft); injury (I) to a party other than the insured; property damage (P). Combinations of these types may also occur. Frees and Valdez (2008, JASA) summarized the many payments per claim into one single claim amount. page 5

The data Development of claim 7 Development of claim 9942 0 2000 6000 own injury property 0 4000 8000 closed open 0 2 4 6 8 10 12 0 2 4 6 Acc. Date 12/14/1999 Acc. Date 08/18/2001 Development of claim 21443 Development of claim 24076 0 4000 8000 12000 0 2000 6000 0 20 40 60 80 1 0 1 2 3 4 Acc. Date 04/25/1995 Acc. Date 01/04/1996 page 6

The data Arrival Year 1993 Arrival Year 1998 own injury property own injury property 0 20 40 60 80 100 120 Months since occurrence Arrival Year 2000 0 20 40 60 80 100 120 Months since occurrence own injury property 0 20 40 60 80 100 120 Months since occurrence page 7

A traditional actuarial display Run off triangle: aggregate claims per arrival year (AY) and development year (DY) combination. Run off triangle for property (P) payments: (in 000s, non cumulative) Arrival Development Year Year 1 2 3 4 5 6 7 8 9 10 1993 205.3 847.6 226.3 77.9 47.9 40.6 10.2 1.8 0.0 0.6 1994 1,081.3 1,750.4 534.7 153.8 73.0 51.1 16.2 37.3 5.8 1995 900.9 1,822.7 578.5 202.0 54.1 48.2 9.5 1.3 1996 1,272.8 1,816.9 583.7 255.2 44.2 24.1 11.4 1997 1,188.7 2,257.9 695.2 166.8 92.1 12.9 1998 1,235.4 3,250.0 649.9 211.2 74.1 1999 2,209.8 3,718.7 818.4 266.3 2000 2,662.5 3,487.0 762.7 2001 2,457.3 3,650.3 2002 673.7 Common statistical techniques: chain ladder, distributional, Bayesian, GLMs,... Modeling individual claims run-off is less developed in the literature. page 8

Micro level data: literature Suggestions from actuarial literature: England and Verrall (2002), Taylor and Campbell (2002), Taylor, McGuire, and Sullivan (2006). Some actuarial papers: Arjas (1989, ASTIN), Norberg (1993, ASTIN), Norberg (1999, ASTIN); Haastrup and Arjas (1996, ASTIN); Larsen (2007, ASTIN); Zhao, Zhou, and Wang (2009, IME). Statistical resource: Cook and Lawless (2007), Statistical analysis of recurrent events. page 9

Observable data structure total number of claims in the data set is n = 43, 729; N i, number of events in development period of claim i; T ij, time of event j, in months since the accident date (T i0 = 0 is accident date and T ini is settlement date); C i time of censoring; E ij type of event j. We distinguish: - event type 1: direct settlement without any payments; - event type 2: payment with settlement; - event type 3: payment without settlement. M ij type of payment for event j of claim i. P ijk size of payment of type k (k being own damage (O), injury (I) or property (P)) for event j of claim i. page 10

Timing of events, per event type Event 1: direct settlement Event 2: payment with settlement 0 5 10 15 20 25 30 35 0 1000 2000 3000 4000 0 20 40 60 80 100 0 20 40 60 80 100 Months since occurrence Min=0; Max=87.56 Months since occurrence Min=0; Max=103 Event 3: payment no settlement 0 5000 10000 15000 0 20 40 60 80 100 Months since occurrence Min=0; Max=111 page 11

Time of settlement, number of payments, times between payments Time of settlement Number of payments 0 500 1000 1500 2000 2500 0 5000 10000 15000 20000 0 20 40 60 80 100 1 2 3 4 5 6 7 8 Months since occurrence Min=0; Max=103 Min=1; Max=8 Time between payments (in months) 0 5000 10000 15000 20000 25000 0 20 40 60 80 100 page 12

s Number of payments per type: Claim Type (I) (O) (P) Number 1,417 (1.95%) 45,950 (63.3%) 21,775 (30%) (I,O) (I,P) (O,P) (O,I,P) Number 107 (0.147%) 319 (0.439%) 3017 (4.16%) 9 (0.012%) page 13

Distribution of payments 0 3 Pay_vI (<0) 0 80 Ln_Pay_vI 4000 3000 2000 1000 0 2 4 6 8 10 12 Pay_vINeg log(pay_vipos) Pay_vP (<0) Ln_Pay_vP 0 400 0 3000 30000 20000 10000 0 5 0 5 10 Pay_vPNeg log(pay_vppos) Pay_vO (<0) Ln_Pay_vO (Claim amount) 0 300 0 6000 150000 100000 50000 0 10 5 0 5 10 Pay_vONeg log(pay_vopos) Ln_Pay_vO (Loss amount) 0 6000 5 0 5 10 log(pay_vonoexpos) page 14

Model formulation A claim i (i = 1,..., n c ) is a combination of accident date ( AD i ); set of covariates C i ; development process X i : X i = ({E i (v), M i (v), P i (v)}) v [0,TiNi ]; Development process X i is a jump process. 3 building blocks are used: E i (t ij ) := E ij is the type of the jth event in the development of claim i, occurring at time t ij ; If this event includes a payment, its payment is given by M i (t ij ) := M ij ; Corresponding payment vector is P ij. page 15

Intensity modeling with single type of events at times t ij : N i ( τi ) L i = λ i (t ij ) exp λ i (u)du. 0 j=1 [0, τ i ] is the period of observation of subject i with τ i = min (T ini, C i ). λ i (t) is the event intensity (or hazard rate) at time t for subject i. For multitype events: each subject is at risk of m different types of recurrent events. Specify intensity function for each type of event (k = 1,..., m) with λ ik (t). page 16

How to specify the intensity functions λ 1 (t) (for event 1), λ 2 (t) (for event 2) and λ 3 (t) (for event 3)? Techniques from survival analysis: (k = 1, 2, 3) exponential: λ k (t) := λ k ; Weibull: λ k (t) := α k γ k t α k 1 e γ k t αk ; Cox model: λ k (t) := λ 0k (t) exp (z kβ k ); piecewise constant: λ k (t) = λ k1 for 0 t < t k1 λ k2 for t k1 t < t k2. λ kd for t kd 1 t < t kd. page 17

Hazard rates per event type Hazard Rate Type 1 Hazard Rate Type 2 h.gridw 0.00 0.01 0.02 0.03 0.04 0.05 0.06 const. Weibull piec. const. (12m) piec. const. (3m) h.gridw 0.00 0.01 0.02 0.03 0.04 0.05 0.06 const. Weibull piec. const. (12m) piec. const. (3m) 0 20 40 60 80 100 120 0 20 40 60 80 100 120 t.grid t.grid Hazard Rate Type 3 h.gridw 0.00 0.05 0.10 0.15 0.20 const. Weibull piec. const. (12m) piec. const. (3m) 0 20 40 60 80 100 120 t.grid page 18

M ij represents the combination of payments observed at t ij. 7 combinations are possible: I, O, P, (I, O), (I, P), (O, P) and (O, I, P). Claim type is modeled with multinomial logit model: Pr(M ij = m ij ) = with V ij,m = x ij β M,m. exp V ij,m 7 s=1 exp (V ij,s), Covariate information used in multinomial model: Type of vehicle, vehicle age, age of driver; Arrival Year, Development Year. page 19

Given M ij for the event at time t ij, P ij gives corresponding severities. For the sign of a payment, use: { 1 if P ijk > 0 I ijk = 0 if P ijk < 0, and s ijk = Pr(I ijk = 1). Use logistic regression to model the sign of P ijk : logit(s ijk ) = x ijβ S,k. Covariate information used in logistic models: Development year; Number of previous injury/own damage/property payments. page 20

Negative part of payments Burr regression: f P (p) = λβλ τp τ 1 (β + p τ ) λ+1, with τ ijk = exp (x ijk β P,k) with k for payment type. used for Property and Own Damage payments GB2 regression: f P (p) = α p αγ 1 1 β αγ 2 B(γ 1, γ 2 )(β α + p α ) γ 1+γ 2, with α 0, β, γ 1, γ 2 > 0, B(α 1, α 2 ) the usual beta function and β ijk = exp (x ij β P,k). used for Injury payments page 21

Positive part of payments Inspired by the histograms of the positive payments, we used a mixture of lognormal regression models: log (P) w 1 N 1 (µ 1, σ 2 1) + w 2 N 2 (µ 2, σ 2 2) + w 3 N 3 (µ 3, σ 2 3), where w 1, w 2 and w 3 are weights, specified as w 1 = w 2 = w 3 = exp (a) exp (a) + exp (b) + exp (c), exp (b) exp (a) + exp (b) + exp (c), exp (c) exp (a) + exp (b) + exp (c), and N i (µ i, σ 2 i ) is a normal distribution with mean µ i and variance σ 2 i. Covariate information is incorporated in the weights and parameters µ i and σ 2 i (i = 1, 2, 3). page 22

QQ plots on the negative payments empirical quantile 10 5 0 5 10 empirical quantile 10 8 6 4 2 6 4 2 0 2 4 6 8 10 8 6 4 2 Own damage theoretical quantile Injury theoretical quantile empirical quantile 8 6 4 2 0 2 4 8 6 4 2 0 2 4 Property theoretical quantile page 23

y y y Histograms of the positive payments - own damage Positive Own Damage (log scale) Positive Own Damage (log scale) 0.0 0.1 0.2 0.3 0.4 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14 x Sample: DY=1 x Sample: DY=2 Positive Own Damage (log scale) 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0 2 4 6 8 10 12 14 x Sample: DY>2 page 24

of RBNS claim reserves Step 1: simulate the next event s time interval Step 2: simulate the exact time of the next event Step 3: simulate the event type Step 4: simulate payment type Step 5: simulate payments Step 6: stop or continue, if necessary - depending on whether settled or not page 25

Resulting predictive distributions of reserves - by type Reserve Own Damage Reserve Injury 0 100 200 300 400 500 600 0 50 100 150 2.0e+07 1.0e+07 0.0e+00 5.0e+06 4e+06 6e+06 8e+06 1e+07 Reserve Property 0 50 100 150 7.0e+06 8.0e+06 9.0e+06 1.0e+07 1.1e+07 page 26

Concluding remarks Main idea: claims using statistics for recurrent events. The hope is to improve the prediction of reserves using detailed micro-level recorded information. the cost is the additional complexity in the modeling involved. Additional work to be done: comparing the results with traditional methods. Similar methodology to other areas of actuarial statistics e.g. recurrent episodes in workers compensation. page 27

Thank you! page 28