Where s the Beef Does the Mack Method produce an undernourished range of possible outcomes?

Similar documents
Obtaining Predictive Distributions for Reserves Which Incorporate Expert Opinions R. Verrall A. Estimation of Policy Liabilities

APPROACHES TO VALIDATING METHODOLOGIES AND MODELS WITH INSURANCE APPLICATIONS

Integrating Reserve Variability and ERM:

Developing a reserve range, from theory to practice. CAS Spring Meeting 22 May 2013 Vancouver, British Columbia

Reserve Risk Modelling: Theoretical and Practical Aspects

The Fundamentals of Reserve Variability: From Methods to Models Central States Actuarial Forum August 26-27, 2010

Jacob: What data do we use? Do we compile paid loss triangles for a line of business?

Reserving Risk and Solvency II

MVE051/MSG Lecture 7

Describing Uncertain Variables

Statistical Modeling Techniques for Reserve Ranges: A Simulation Approach

Frequency Distribution Models 1- Probability Density Function (PDF)

Session 178 TS, Stats for Health Actuaries. Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA. Presenter: Joan C. Barrett, FSA, MAAA

The histogram should resemble the uniform density, the mean should be close to 0.5, and the standard deviation should be close to 1/ 12 =

The Leveled Chain Ladder Model. for Stochastic Loss Reserving

GI ADV Model Solutions Fall 2016

Back-Testing the ODP Bootstrap of the Paid Chain-Ladder Model with Actual Historical Claims Data

STAT 157 HW1 Solutions

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology

CHAPTER 8. Confidence Interval Estimation Point and Interval Estimates

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

QQ PLOT Yunsi Wang, Tyler Steele, Eva Zhang Spring 2016

GN47: Stochastic Modelling of Economic Risks in Life Insurance

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Study Guide on Measuring the Variability of Chain-Ladder Reserve Estimates 1 G. Stolyarov II

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Jacob: The illustrative worksheet shows the values of the simulation parameters in the upper left section (Cells D5:F10). Is this for documentation?

The Analysis of All-Prior Data

Clark. Outside of a few technical sections, this is a very process-oriented paper. Practice problems are key!

UPDATED IAA EDUCATION SYLLABUS

The normal distribution is a theoretical model derived mathematically and not empirically.

Stochastic Loss Reserving with Bayesian MCMC Models Revised March 31

The following content is provided under a Creative Commons license. Your support

Maximum Likelihood Estimation

Lecture 6: Non Normal Distributions

Data Analysis and Statistical Methods Statistics 651

Statistics for Business and Economics

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 7.4-1

The Central Limit Theorem. Sec. 8.2: The Random Variable. it s Distribution. it s Distribution

FAV i R This paper is produced mechanically as part of FAViR. See for more information.

by Aurélie Reacfin s.a. March 2016

STA2601. Tutorial letter 105/2/2018. Applied Statistics II. Semester 2. Department of Statistics STA2601/105/2/2018 TRIAL EXAMINATION PAPER

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

Non parametric IBNER projection

Statistical Intervals (One sample) (Chs )

This homework assignment uses the material on pages ( A moving average ).

Analysis of 2x2 Cross-Over Designs using T-Tests for Non-Inferiority

1. Variability in estimates and CLT

Evidence from Large Indemnity and Medical Triangles

CHAPTER II LITERATURE STUDY

Gamma Distribution Fitting

Corporate Finance, Module 3: Common Stock Valuation. Illustrative Test Questions and Practice Problems. (The attached PDF file has better formatting.

Study Guide on Risk Margins for Unpaid Claims for SOA Exam GIADV G. Stolyarov II

Using Fat Tails to Model Gray Swans

starting on 5/1/1953 up until 2/1/2017.

Exam 7 High-Level Summaries 2018 Sitting. Stephen Roll, FCAS

Market Risk: FROM VALUE AT RISK TO STRESS TESTING. Agenda. Agenda (Cont.) Traditional Measures of Market Risk

Business Statistics 41000: Probability 3

Normal Probability Distributions

Statistics and Probability

AP STATISTICS FALL SEMESTSER FINAL EXAM STUDY GUIDE

Is a Binomial Process Bayesian?

DRAFT 2011 Exam 7 Advanced Techniques in Unpaid Claim Estimation, Insurance Company Valuation, and Enterprise Risk Management

ECON 214 Elements of Statistics for Economists 2016/2017

Appendix A. Selecting and Using Probability Distributions. In this appendix

On the Use of Stock Index Returns from Economic Scenario Generators in ERM Modeling

Probability Models.S2 Discrete Random Variables

Point Estimation. Some General Concepts of Point Estimation. Example. Estimator quality

Homework Problems Stat 479

Two-Sample T-Test for Non-Inferiority

NCSS Statistical Software. Reference Intervals

Lecture 9 - Sampling Distributions and the CLT

Models of Patterns. Lecture 3, SMMD 2005 Bob Stine

Presented at the 2012 SCEA/ISPA Joint Annual Conference and Training Workshop -

STAT Chapter 6: Sampling Distributions

Stochastic Analysis Of Long Term Multiple-Decrement Contracts

SOCIETY OF ACTUARIES Advanced Topics in General Insurance. Exam GIADV. Date: Thursday, May 1, 2014 Time: 2:00 p.m. 4:15 p.m.

Web Appendix. Are the effects of monetary policy shocks big or small? Olivier Coibion

A new -package for statistical modelling and forecasting in non-life insurance. María Dolores Martínez-Miranda Jens Perch Nielsen Richard Verrall

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

ECON Introductory Econometrics. Lecture 1: Introduction and Review of Statistics

Module 3: Sampling Distributions and the CLT Statistics (OA3102)

Midterm Exam III Review

Empirical Distribution Testing of Economic Scenario Generators

Calculating VaR. There are several approaches for calculating the Value at Risk figure. The most popular are the

STA Module 3B Discrete Random Variables

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Anatomy of Actuarial Methods of Loss Reserving

Contents Utility theory and insurance The individual risk model Collective risk models

Learning Objectives for Ch. 7

Board Finance Committee. November 15, 2017

arxiv: v1 [q-fin.rm] 13 Dec 2016

χ 2 distributions and confidence intervals for population variance

The topics in this section are related and necessary topics for both course objectives.

MUNICH CHAIN LADDER Closing the gap between paid and incurred IBNR estimates

XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING

Transcription:

Where s the Beef Does the Mack Method produce an undernourished range of possible outcomes? Daniel Murphy, FCAS, MAAA Trinostics LLC CLRS 2009

In the GIRO Working Party s simulation analysis, actual unpaid losses exceeded the Mack Method s 99 th percentile over 10% of the time GIRO 1 : The Mack Method tends to understate the chance of extreme adverse outcomes, even in situations where the underlying assumptions are perfectly satisfied. (2007) FSA 2 : Commonly used *stochastic+ methods are inadequate to cover the full range of reserving variability. (2009) 1. http://www.actuaries.org.uk/ data/assets/pdf_file/0010/31303/bhprize_gibson.pdf 2. http://www.actuaries.org.uk/ data/assets/pdf_file/0009/146664/sm20090223.pdf Trinostics LLC 2

Agenda Review GIRO WP simulation study Analyze theory, visualize causes Suggest improvements Trinostics LLC 3

GIRO WP simulated 10,000 10x10 triangles 1 st Trial Age 1 losses X 1 : independent, random samples from a lognormal distribution with mean = 1, var = 1 Age 2 losses X 2 : randomly developed from a shifted lognormal with mean = X 1 b 1 and var = X 1 σ 2. Similarly for age 3, 4,, 10 b 4.2890 2.0640 1.5020 1.2680 1.1500 1.0850 1.0480 1.0270 1.0150 σ 2 1 1 1 1 1 1 1 1 1 AY \ age 1 2 3 4 5 6 7 8 9 10 2001 $0.420 $2.873 $7.175 $14.295 $16.676 $18.732 $36.983 $38.045 $38.102 $38.118 2002 0.824 3.742 8.875 12.354 14.176 14.680 15.044 15.080 15.111 15.347 2003 0.353 1.118 2.058 2.961 3.795 4.070 4.284 4.360 4.472 4.517 2004 2.669 8.403 12.937 18.463 24.133 24.811 25.665 25.725 25.729 25.873 2005 0.930 5.056 11.421 13.749 15.209 21.361 21.592 21.618 21.862 22.754 2006 0.357 1.382 2.485 3.002 3.135 6.299 6.455 6.523 6.550 6.565 2007 1.061 4.392 8.382 11.093 13.844 14.495 14.519 14.574 14.964 14.965 2008 1.308 6.626 10.563 14.934 20.307 20.861 26.047 26.345 26.398 26.406 2009 1.142 5.685 10.377 18.663 22.144 25.626 26.687 27.036 27.248 27.250 2010 1.639 7.667 16.534 27.154 37.583 40.970 64.295 64.772 65.366 65.411 Sum 247.205 Paid 136.731 "Ac tual" Unpaid $110. 474 Trinostics LLC 4

Distribution of 10000 trials of actual unpaid claims Trinostics LLC 5

then ran 10,000 Chain Ladder, Mack Methods 1 st Trial Accident Year Current Diagonal Estimated VW ata LDF Estimated Ultimate Estimated Unpaid Simulated Actual Unpaid 2001 $ 38.118 1.0000 $ 38.118 $ - $ - 2002 15.111 1.0004 1.0004 15.118 0.006 0.235 2003 4.360 1.002 1.002 4.369 0.009 0.157 2004 25.665 1.021 1.023 26.255 0.590 0.208 2005 21.361 1.316 1.346 28.757 7.396 1.392 2006 3.135 1.131 1.522 4.772 1.637 3.430 2007 11.093 1.190 1.811 20.089 8.996 3.872 2008 10.563 1.423 2.578 27.227 16.664 15.844 2009 5.685 1.902 4.903 27.874 22.189 21.565 2010 1.639 4.334 21.249 34.827 33.188 63.772 Sum $ 136.731 $ 227.406 $ 90.675 $ 110.474 Mack se 39.085 50% 90% 99% Lognormal percentile $83.269 $141.333 $217.550 Percentile sufficient this trial? NO YES YES Trinostics LLC Y 6 Xb

Insufficiency over all trials was interesting! Percentile Simulated Percentile sufficient? Trial 50% 90% 99% Actual 50% 90% 99% 1 $ 83.269 $ 141.333 $ 217.550 $ 110.474 NO YES YES 2 78.329 120.532 171.279 140.618 NO NO YES 3 59.184 76.377 94.028 101.583 NO NO NO 9999 134.400 177.211 222.018 40.433 YES YES YES 10000 47.780 79.419 120.181 41.434 YES YES YES Percent insufficient 59.0% 25.6% 10.2% WP Table B-3 58.30% 24.55% 10.1% Trinostics LLC 7

How can the Mack 99% VAR be so far off? 1. Is the mean of the distribution too low? 2. Is the variance (MSE) of the distribution too low? 3. Is the lognormal the wrong distribution to use? 4. Something else? Trinostics LLC 8

Is the chain ladder mean unpaid loss too low on average? Accident Year Theoretical mean unpaid mean of actual (simulated) unpaid % difference mean of predicted unpaid % difference 2001 $ - $ - - 2002 0.340 0.351 3.4% 0.427 21.6% 2003 0.935 1.056 12.9% 1.021-3.3% 2004 1.945 1.890-2.8% 2.134 12.9% 2005 3.593 3.567-0.7% 3.598 0.9% 2006 6.122 6.216 1.5% 6.166-0.8% 2007 9.685 9.652-0.3% 9.610-0.4% 2008 14.129 14.155 0.2% 14.590 3.1% 2009 18.693 18.839 0.8% 18.981 0.8% 2010 21.982 22.284 1.4% 22.185-0.4% Sum $77.422 $ 78.011 0.8% $ 78.711 0.9% Chain ladder point estimate looks reasonably close to the actual mean value Not too low on average Trinostics LLC 9

Is the Mack Method standard error of unpaid loss too low on average? Accident Year Theoretical s.e. of unpaid s.e. of actual (simulated) unpaid s.e. of predicted unpaid % difference 2001 $ - $ - - 2002 NA 2.637 8.589 225.8% 2003 NA 6.200 9.149 47.6% 2004 NA 6.954 23.875 243.3% 2005 NA 8.198 9.761 19.1% 2006 NA 12.553 12.643 0.7% 2007 NA 14.163 13.348-5.8% 2008 NA 18.893 41.590 120.1% 2009 NA 23.853 29.272 22.7% 2010 NA 27.333 24.980-8.6% Sum NA $ 46.757 $ 117.950 152.3% Predicted variability does not appear too low on average Trinostics LLC 10

Frequency 0 100 200 300 400 500 600 Do Algorithm A unpaid losses follow the lognormal distribution? Histogram of "Actual" Unpaid pdf certainly appears to resemble a lognormal A popular Excel add-in says shifted lognormal is best Is there a better way to decide? 0 200 400 600 800 # trials= 10000 Trinostics LLC 11

Sample Quantiles 2 3 4 5 6 Do Algorithm A unpaid losses follow the lognormal distribution? Normal Q-Q Plot -4-2 0 2 4 Theoretical Quantiles Per Q-Q plot, unpaid loss distribution has fatter tails than lognormal Shapiro-Wilk test p-value is 0.006 Not lognormal Let s look behind the Chain Ladder / Mack Method Trinostics LLC 12

y = 24 mos 0 5 10 15 The CL method relates x and y values of loss First Trial s age 12-24 development AY 12 24 2001 $ 0.420 $ 2.873 2002 0.824 3.724 2003 0.353 1.118 2004 2.669 8.403 2008 2004 2005 0.930 5.056 2006 3.570 1.382 2007 1.061 4.392 2006 2010 2008 1.308 6.626 2009 1.142 5.685 2010 1.639? 0.0 0.5 1.0 1.5 2.0 2.5 3.0 x = 12 mos Trinostics LLC 13

y = 24 mos 0 5 10 15 Volume weighted ATA is the slope of the line that represents that relationship First Trial s age 12-24 development AY 12 24 2001 $ 0.420 $ 2.873 2002 0.824 3.724 2003 0.353 1.118 2004 2.669 8.403 2005 0.930 5.056 2006 3.570 1.382 2007 1.061 4.392 2008 1.308 6.626 2009 1.142 5.685 2010 1.639? vol wtd ata 4.334 slope of line 2004 2008 2006 2010 0.0 0.5 1.0 1.5 2.0 2.5 3.0 x = 12 mos Trinostics LLC 14

y = 24 mos 0 5 10 15 Chain ladder projection of AY 2010 is the point on the estimated regression line Point Estimate First Trial s age 12-24 development AY 12 24 2001 $ 0.420 $ 2.873 y = 4.334x 2002 0.824 3.724 2003 0.353 1.118 2004 2.669 8.403 2005 0.930 5.056 2006 3.570 1.382 vol wtd ata 4.334 2006 2008 2010 2004 2007 1.061 4.392 2008 1.308 6.626 2009 1.142 5.685 2010 1.639 7.103 0.0 0.5 1.0 1.5 2.0 2.5 3.0 x = 12 mos Trinostics LLC 15

y = 24 mos 0 5 10 15 Blind luck that Trial 1 s volume weighted ATA was so close to true slope First Trial s age 12-24 development AY 12 24 2001 $ 0.420 $ 2.873 2002 0.824 3.724 2003 0.353 1.118 2004 2.669 8.403 2005 0.930 5.056 2006 3.570 1.382 2007 1.061 4.392 2008 1.308 6.626 2009 1.142 5.685 2010 1.639? vol wtd ata 4.334 ( true value = 4.289) slope of line 2004 2008 2006 2010 0.0 0.5 1.0 1.5 2.0 2.5 3.0 x = 12 mos Trinostics LLC 16

y = 24 mos 0 5 10 15 Parameter risk reflects the fact that the slope was estimated from the data Parameter Risk First Trial s age 12-24 development AY 12 24 2001 $ 0.420 $ 2.873 y = (4.334 ± 2*.359)x 2002 0.824 3.724 2003 0.353 1.118 2004 2.669 8.403 2005 0.930 5.056 2006 3.570 1.382 vol wtd ata 4.334 2006 2008 2010 2004 2007 1.061 4.392 2008 1.308 6.626 2009 1.142 5.685 2010 1.639 7.103 0.0 0.5 1.0 1.5 2.0 2.5 3.0 The dashed lines define a twostandard-error region within which x = 12 mos the true line may fall Trinostics LLC 17

Wherever the mean line truly is, losses will vary noisily around that expected value First Trial s age 12-24 development Process Risk Trinostics LLC 18

y = 24 mos 0 5 10 15 Total risk reflects both parameter & process risk First Trial s age 12-24 development 2006 Total Risk y = (4.334 ± 2* (.359 2 +.844 2 ) )x 2008 2010 2004 The dotted lines define a two-standard-error region within which the next possible outcome may fall But that s not how 4.334, 0.359, 0.844 Algorithm A losses develop, is it? The statistics can be derived from Excel s LINEST output 0.0 0.5 1.0 1.5 2.0 2.5 3.0 x = 12 mos Trinostics LLC 19

But Algorithm A development is not symmetric First Trial s age 12-24 development Algorithm A process risk is skewed by virtue of lognormal assumption Trinostics LLC 20

If development were symmetric, 99% VAR insufficiency would drop from 10% to 2% Percent Insufficient 50% 90% 99% Base case 58.49% 24.77% 10.18% Link ratios ~ Normal 52.7% 13.7% 2.2% We can t change reality. What can we do? Trinostics LLC 21

We should reflect uncertainty of spread parameter σ Formulas for Development over a Single Period Model: Point estimate: Variance of link ratio: Variance of point estimate: Variance of prediction: Y=Xb + for some unknown σ Yˆ = Xbˆ where ˆ b = y x 2 Var ( ˆ) b = / x Var ( Yˆ) = X 2 2 / x Var + 2 2 2 ( pred( Y )) = X / x X GIRO b 4.2890 2.0640 1.5020 1.2680 1.1500 1.0850 1.0480 1.0270 1.0150 σ 2 1 1 1 1 1 1 1 1 1 Mack s formulas substitute the sample standard deviation s for σ in the above Practice understates estimated variability (Statistics 101) student-t distribution reflects spread uncertainty Trinostics LLC 22

Assuming spread is certain can significantly understate VAR, especially at mature ages Insufficiency of 99 th Percentile 18% 16% 14% 12% 10% 8% 6% 4% 2% 0% Insufficiency of 99% VAR of N(0,1) 9 8 7 6 5 4 3 2 1 Number of Data Points Normal w/ σ=s Student-t August 2009 Trinostics LLC 23

We can simulate development at each age rather than fit a distribution at the end Simulation can effectively and accurately replicate analytic results, including student-t based formulas Distributions at each age can be chained together to develop the distribution of estimated unpaid claims See for instance Gelman, Data Analysis Using Regression and Multilevel/Hierarchical Models August 2009 Trinostics LLC 24

But, of all the alternative scenarios investigated, the most important discovery was With more accident years, insufficiency can be reduced significantly, even when facing a stacked deck (skewed development) spread is assumed known (σ = s) lognormal fit at the end Percent Insufficient 50% 90% 99% 0. Base case 58.49% 24.77% 10.18% 1. 3-term parameter risk formula 58.51% 24.76% 10.17% 2. Link ratios ~ Normal 52.7% 13.7% 2.2% 3. Number of AY rows = 100 48.3% 11.4% 2.7% Trinostics LLC 25

AY\age 1 2 3 4 5 6 7 8 9 10 2001 0.42 2.87 7.18 14.29 16.68 18.73 36.98 38.04 38.10 38.12 2001 0.42 2.87 7.18 14.29 16.68 18.73 36.98 38.04 38.10 38.12 2001 0.42 2.87 7.18 14.29 16.68 18.73 36.98 38.04 38.10 38.12 2001 0.42 2.87 7.18 14.29 16.68 18.73 36.98 38.04 38.10 38.12 2002 0.82 3.74 8.87 12.35 14.18 14.68 15.04 15.08 15.11 2002 0.82 3.74 8.87 12.35 14.18 14.68 15.04 15.08 15.11 2002 0.82 3.74 8.87 12.35 14.18 14.68 15.04 15.08 15.11 2002 0.82 3.74 8.87 12.35 14.18 14.68 15.04 15.08 15.11 2003 0.35 1.12 2.06 2.96 3.79 4.07 4.28 4.36 2003 0.35 1.12 2.06 2.96 3.79 4.07 4.28 4.36 2003 0.35 1.12 2.06 2.96 3.79 4.07 4.28 4.36 2003 0.35 1.12 2.06 2.96 3.79 4.07 4.28 4.36 2004 2.67 8.40 12.94 18.46 24.13 24.81 25.66 2004 2.67 8.40 12.94 18.46 24.13 24.81 25.66 2004 2.67 8.40 12.94 18.46 24.13 24.81 25.66 2004 2.67 8.40 12.94 18.46 24.13 24.81 25.66 2005 0.93 5.06 11.42 13.75 15.21 21.36 2005 0.93 5.06 11.42 13.75 15.21 21.36 2005 0.93 5.06 11.42 13.75 15.21 21.36 2005 0.93 5.06 11.42 13.75 15.21 21.36 2006 0.36 1.38 2.49 3.00 3.14 2006 0.36 1.38 2.49 3.00 3.14 2006 0.36 1.38 2.49 3.00 3.14 2006 0.36 1.38 2.49 3.00 3.14 2007 1.06 4.39 8.38 11.09 2007 1.06 4.39 8.38 11.09 2007 1.06 4.39 8.38 11.09 2007 1.06 4.39 8.38 11.09 2008 1.31 6.63 10.56 2008 1.31 6.63 10.56 2008 1.31 6.63 10.56 2008 1.31 6.63 10.56 2009 1.14 5.68 2009 1.14 5.68 2009 1.14 5.68 2009 1.14 5.68 2010 1.64 2010 1.64 2010 1.64 2010 1.64 Conclusion t Reflect spread estimate uncertainty Particularly with small triangles (< 40 AYs) Adjust for degrees of freedom: t, chi-square distribution I P U Use analytic-equivalent simulation techniques Simulate hypothesized development Especially useful for complex models, even chain ladder Test for skewed development Are residuals normally distributed? If not, try log transformation, GLMs, Analyze triangles at detailed levels More valuable information, more accurate projections By policy, claim, accident month, region Trinostics LLC 26

Important advances in heart patient treatment have been made in the last ten years through more reliance on hard data and technical analysis and less reliance on expert opinion. Dr. Raymond Stephens, John Muir Medical Center Neurology, Heart of Gold, 2009 Trinostics LLC 27

Trinostics LLC is in the business of collaboration and education in the design and construction of valuable, transparent actuarial models Daniel Murphy, FCAS, MAAA dmurphy@trinostics.com Trinostics LLC 28