REINSURANCE RATE-MAKING WITH PARAMETRIC AND NON-PARAMETRIC MODELS

Similar documents
Case Study: Heavy-Tailed Distribution and Reinsurance Rate-making

QQ PLOT Yunsi Wang, Tyler Steele, Eva Zhang Spring 2016

An Improved Skewness Measure

A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims

Continuous random variables

Fitting the generalized Pareto distribution to commercial fire loss severity: evidence from Taiwan

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION

Measuring Financial Risk using Extreme Value Theory: evidence from Pakistan

FAV i R This paper is produced mechanically as part of FAViR. See for more information.

The extreme downside risk of the S P 500 stock index

Reliability and Risk Analysis. Survival and Reliability Function

Analysis of truncated data with application to the operational risk estimation

The Application of the Theory of Power Law Distributions to U.S. Wealth Accumulation INTRODUCTION DATA

Window Width Selection for L 2 Adjusted Quantile Regression

Modelling Environmental Extremes

Can we use kernel smoothing to estimate Value at Risk and Tail Value at Risk?

Financial Risk Forecasting Chapter 9 Extreme Value Theory

A New Hybrid Estimation Method for the Generalized Pareto Distribution

Folia Oeconomica Stetinensia DOI: /foli A COMPARISON OF TAIL BEHAVIOUR OF STOCK MARKET RETURNS

Modelling Environmental Extremes

Introduction to Algorithmic Trading Strategies Lecture 8

Introduction to Computational Finance and Financial Econometrics Descriptive Statistics

Interplay of Asymptotically Dependent Insurance Risks and Financial Risks

Normal Distribution. Definition A continuous rv X is said to have a normal distribution with. the pdf of X is

A Comparison Between Skew-logistic and Skew-normal Distributions

A Convenient Way of Generating Normal Random Variables Using Generalized Exponential Distribution

An Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1

The normal distribution is a theoretical model derived mathematically and not empirically.

Longitudinal Modeling of Insurance Company Expenses

Extreme Value Analysis for Partitioned Insurance Losses

Characterization of the Optimum

An Information Based Methodology for the Change Point Problem Under the Non-central Skew t Distribution with Applications.

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651

Mongolia s TOP-20 Index Risk Analysis, Pt. 3

Unit2: Probabilityanddistributions. 3. Normal distribution

STA 248 H1S Winter 2008 Assignment 1 Solutions

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Chapter 7: Point Estimation and Sampling Distributions

Models of Patterns. Lecture 3, SMMD 2005 Bob Stine

KURTOSIS OF THE LOGISTIC-EXPONENTIAL SURVIVAL DISTRIBUTION

A Two-Dimensional Risk Measure

John Cotter and Kevin Dowd

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Describing Uncertain Variables

Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -26 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Introduction to Population Modeling

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Modeling. joint work with Jed Frees, U of Wisconsin - Madison. Travelers PASG (Predictive Analytics Study Group) Seminar Tuesday, 12 April 2016

Statistics 431 Spring 2007 P. Shaman. Preliminaries

Lecture 6: Non Normal Distributions

NCSS Statistical Software. Reference Intervals

Absolute Return Volatility. JOHN COTTER* University College Dublin

Probability. An intro for calculus students P= Figure 1: A normal integral

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS

Probability Weighted Moments. Andrew Smith

Fat Tailed Distributions For Cost And Schedule Risks. presented by:

Chapter 5: Statistical Inference (in General)

DATA SUMMARIZATION AND VISUALIZATION

Modelling catastrophic risk in international equity markets: An extreme value approach. JOHN COTTER University College Dublin

Optimal reinsurance strategies

THE CHANGING SIZE DISTRIBUTION OF U.S. TRADE UNIONS AND ITS DESCRIPTION BY PARETO S DISTRIBUTION. John Pencavel. Mainz, June 2012

Debt Sustainability Risk Analysis with Analytica c

Price Impact and Optimal Execution Strategy

Process capability estimation for non normal quality characteristics: A comparison of Clements, Burr and Box Cox Methods

Market Risk Analysis Volume I

Introduction to R (2)

Leasing and Debt in Agriculture: A Quantile Regression Approach

Homework Problems Stat 479

A Two Dimensional Risk Measure

Extreme Values Modelling of Nairobi Securities Exchange Index

THE USE OF THE LOGNORMAL DISTRIBUTION IN ANALYZING INCOMES

Some Characteristics of Data

THRESHOLD PARAMETER OF THE EXPECTED LOSSES

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting

Lecture 2 Describing Data

Statistical Analysis of Data from the Stock Markets. UiO-STK4510 Autumn 2015

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright

An Insight Into Heavy-Tailed Distribution

Introduction Models for claim numbers and claim sizes

Folded- and Log-Folded-t Distributions as Models for Insurance Loss Data

Exam 2 Spring 2015 Statistics for Applications 4/9/2015

Much of what appears here comes from ideas presented in the book:

Continuous Distributions

Advanced Extremal Models for Operational Risk

Stochastic Analysis Of Long Term Multiple-Decrement Contracts

Operational Risk: Evidence, Estimates and Extreme Values from Austria

Analysis of the Oil Spills from Tanker Ships. Ringo Ching and T. L. Yip

David R. Clark. Presented at the: 2013 Enterprise Risk Management Symposium April 22-24, 2013

Value at risk might underestimate risk when risk bites. Just bootstrap it!

Testing for the martingale hypothesis in Asian stock prices: a wild bootstrap approach

In physics and engineering education, Fermi problems

ANALYSIS OF THE DISTRIBUTION OF INCOME IN RECENT YEARS IN THE CZECH REPUBLIC BY REGION

Symmetricity of the Sampling Distribution of CV r for Exponential Samples

1. You are given the following information about a stationary AR(2) model:

Chapter 3 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc.

Assessing Regime Switching Equity Return Models

CHAPTER TOPICS STATISTIK & PROBABILITAS. Copyright 2017 By. Ir. Arthur Daniel Limantara, MM, MT.

Transcription:

REINSURANCE RATE-MAKING WITH PARAMETRIC AND NON-PARAMETRIC MODELS By Siqi Chen, Madeleine Min Jing Leong, Yuan Yuan University of Illinois at Urbana-Champaign

1. Introduction Reinsurance contract is an insurance policy purchased by primary insurer from one or more reinsurance companies. It is important to insurance companies as they can transfer some high risk business with potential losses to reinsurance companies. This transaction carries a mean of risk management and has become more extensive in the global financial market. There are two reinsurance agreements: proportional and non-proportional coverage. Proportional coverage means the same proportion of premium and losses incurred by the policyholders are ceded to the reinsurer, and thus, rating consideration may not considered in this agreement. With non-proportional coverage, the reinsurer would receive reinsurance premium and pay a predefined portion of the claims incurred by the insurer. Excess of loss (XL) reinsurance, which the insurance coverage is paid by insurer to policyholders up to a maximum retention level and any amount exceeds the level will be recouped by the reinsurer. For instance, the insurer may insure a catastrophic risk with a policy limit up to $30 million, and buy a XL reinsurance with retention level of $15 million. In this case the reinsurer will need to pay $5 million to insurer for recovery if the loss is $20 million. However, this XL reinsurance is usually modeled using the concept of mean excess loss, without any assumptions of a parametric model. This non-parametric pricing approach may raise problem to reinsurance companies as it usually does not capture the tail behavior of the loss distribution. A visual technique of studying the tail behavior of the claim size is applied in this text. Lastly, a parametric model is introduced for better reinsurance rate-making. This model uses the concept of extreme value theory for finding the parameter in the pricing distribution. Net premiums for both parametric and non-parametric are computed and compared in the latter section. In this paper, valuation of parametric model and non-parametric model for insurance rate-making is performed. Sample insurance claims data from [4] is used throughout the text to give an idea of the claim data modeling problems faced by actuarial practitioner as well as to manifest the two different pricing methods. A histogram of the sample claims data is shown in next section for the purpose of glancing at the data. 2. Histogram of Sample Claims Data Histogram is one of the common graph to use for displaying the numeric data and illustrating some behaviors of the data. A histogram of the supported data is shown in Figure 1. Figure 1 shows that the claim data is right-skewed where the mean of the losses is larger than the median. In addition, there are few extremely high values (or outliers) that do not fall near to any other data points. Understanding these extreme loss values is important because these have the greatest impact on the total of losses for reinsurance companies. A statistical method will be applied later for viewing the tail behavior of the claim distribution. 1

Histogram of Loss Density 0e+00 1e-07 2e-07 3e-07 4e-07 5e-07 6e-07 1e+06 2e+06 3e+06 4e+06 5e+06 6e+06 7e+06 8e+06 Loss Figure 1: Histogram of sample claims data 3. Non-Parametric XL Model and Quantile Plotting As mentioned in the introduction, XL reinsurance pricing method relies on the concept of mean excess loss function. Let X be a random variable that represent the size of a claim made by policyholder and R be the retention or priority level of the reinsurance contract. Reinsurer is required to pay X R if only X > R. Actuaries working in reinsurance companies would pay more attention on the expected value of such claim, which is given by m(r) = E(X R X > R) where the expression is determined by mean excess function and is existed if E(x) <. Let x 1, x 2,..., x n be the claim data that are from a random sample of X 1, X 2,..., X n, the mean excess function m is often estimated at R = X n k,n, the (k+1)-th largest observation for k = 0, 1, 2,..., n 1 by the empirical excess of the k points that are higher than X n k,k [1]: ˆm k,n = 1 k k X n j+1,n X n k,n. j=1 2

Plot of (Empirical) Mean Excess of Sample Data Mean Excess k,n 400000 800000 1200000 1600000 1e+06 2e+06 3e+06 4e+06 5e+06 6e+06 7e+06 x n-k,n Figure 2: Plot of mean excess function m as a function of x n k,k of sample claims data The estimates of the mean excesses are illustrated in Figure 2 and they play crucial roles in the rating of XL insurance with a retention level R because the corresponding fair net premium (without any policy expenses and other administrative costs) is determined to be [4]: Π(R) = E[(X R) + ] To estimate the net premium, Π(R), the following equation of the empirical (non-parametric) estimator with a range of retention level from 0 to 7,875,000 (step = 125,000) is given by ˆ Π(R) = 1 n n (X n i+1,n R) + i=1 The non-parametric estimator of the net premium as a function of R is shown in Figure 3. 3

Non-Parametric Estimator of Premium Non-Parametric Estimator of Premium 0 500000 1000000 1500000 2000000 2500000 0e+00 2e+06 4e+06 6e+06 8e+06 Retention Level R Figure 3: Non-parametric estimator of the net premium as a function of the retention level R Although the calculation was done with for a wide range of retention levels, the results for high retention level would be more valuable to reinsurer as they are meant to transfer sizable risks (e.g. catastrophic losses). In other words, in order to value the net premium precisely, finding distributions that have similar tail behavior should be emphasized as well as taking account on parametric assumptions. In order to assist in the rating of XL reinsurance on the statistical side, quantile plotting is addressed to view the tail weight of the claim distribution. Random variables that tend to assign higher probabilities to larger observations are said to be heavier tailed. An definition of heavy-tailed distribution can be described as a heavy-tailed distribution has a tail which is heavier than any exponential tail: exp( λx) lim = 0, for any λ > 0 x F (R) in rectangular coordinate system, an exponential quantile plot of points have the coordinates of i ( log( n + 1 ), X n i+1,n) where the empirical distribution, X n i+1,n is the estimated quantile of Q(1 ), for several values of 1 i n+1 (0, 1)[1]. The exponential QQ-plot of the claim size data is constructed below in Figure 4. i n+1 4

Exponential Q-Q Plot for Sample Data Sample Quantiles 1e+06 2e+06 3e+06 4e+06 5e+06 6e+06 7e+06 0 1 2 3 4 5 Theoretical Quantiles Figure 4: Exponential quantile plot for sample claims data From the exponential QQ-plot, the horizontal axis shows the quantile of a standard exponential distribution and the vertical axis represents the empirical quantiles of the claim data. The empirical points in the plot bend upwards and exhibit a convex pattern indicating that the claim size distribution has a heavier tail than expected from a standard exponential distribution[2]. Since most of the reinsurance contracts have the purpose of transferring huge risks of losses, pricing actuaries should use a model that can capture the tail section of the claim distribution. Consequently, an exponential distribution may not be the accurate model of the reinsurance contract since the largest three observations do not fit so well in the exponential QQ-plot. Since exponential distribution is not accurate for the pricing model due to the skew errors in the QQ-plot, an assumption of using another distribution is made. In general, Pareto distribution, as a heavy-tailed distribution provides a good fit of a large claim size data. This statement can be 5

proved by looking at Pareto QQ-plot (Figure 5). The Pareto QQ-plot is more linear compared to the exponential QQ-plot which indicating a reasonable fit of the Pareto model to the tail of the claim sizes. Thus, further discussion on using a Pareto-type model with parametric assumption for reinsurance pricing is needed. Pareto Q-Q Plot for Sample Data Sample Quantiles 1e+06 2e+06 3e+06 4e+06 5e+06 6e+06 7e+06 1.0 1.5 2.0 2.5 3.0 3.5 4.0 Theoretical Quantiles Figure 5: Pareto quantile plot for sample claims data 4. Parametric XL Model: Pareto-type Distribution A scaled Pareto distribution for the sample data is considered and it has a survival probability function of F (x) = x C α, x > C for some large C > 0 and x is C times a strict Pareto random variable. The mean excess function of the random variable is found by e(t) = t α 1, t > 1 where α is the shape parameter of the strict Pareto distribution. α is using to be estimated using 6

the idea of extreme value theory. The estimate of α, which has a reciprocal relationship with a extreme value index (EVI): γ = 1 α > 0. A well-known estimator of the EVI is the Hill Estimator [3]. Combining the result of log-transformed data is an exponential distribution with the the mean excess function of the Pareto random variable, the mean excess value of the log-transformed data (Hill Estimator) [1] where ˆγ k,n = 1 ˆα k,n. ˆγ k,n = 1 k k ln X n j+1,n ln X n k,n j=1 However, the Hill estimator is based on k-th largest observation, which means for each choice of k there would be a different estimator. Plot of Hill Estimator as a Function of k Hill_Estimator 0.1 0.2 0.3 0.4 0.5 0 100 200 300 Figure 6: Plot of Hill Estimator as a function of k k The ways of finding optimal k using various statistical properties are developed by many statistical practitioners. One method of finding optimal k is shown by Danielsson et al. (1997) which k is determined by a two-step subsample bootstrap method through minimizing asymptotic meansquared error (AMSE)[5]. For a given size of n 1 < n, a resample of a n 1 size is drawn and the AMSE for each level of k 1 is calculated. Then, the level k 1,0(n 1 ) that minimizes AMSE for this bootstrap n 1 is found. The procedure for a smaller sample size of n 2 < n 1 is repeated, where n 2 = n12 n. This 7

procedure can find the level of k 2,0(n 2 ) that minimize AMSE for this bootstrap n 2. Lastly, the best choice of value k can be calculated using the equation given by ˆk 0 (n) = (k 1,0(n 1 )) 2 ( (lnk1,0(n 1 )) 2 k2,0 (n 2) (2 ln n 1 ln k1,0 (n 1)) 2 ) (ln n1 ln k 1,0(n 1))/ ln n 1 With the optimal choice of k = 95, the Hill estimator, ˆγ is computed to be 0.2711. Knowing the estimated EVI, the calculation of net premium under parametric assumption is clearly shown in [4] and is written as ˆΠ(R) = 1 1/ˆγ k,n 1 R k ( ) 1/ˆγk,n R n X n k,k The relationship of the net premium estimator with retention levels of R where R (0, 7875000) (incremental step = 125000) is illustrated in Figure 7. Parametric Estimator of Premium Parametric Estimator of Premium 0 500000 1000000 1500000 2000000 2500000 0e+00 2e+06 4e+06 6e+06 8e+06 Retention Level R Figure 7: Parametric estimator of the net premium as a function of the retention level R 5. Comparing Parametric and Non-parametric for Different Retention Levels Comparison between parametric and non-parametric estimators for 0 < R < 7875000 and R > 3000000 has been made. From Figure 8, the parametric estimator does not capture the behavior 8

of left tail part because of the emphasis on the right tail behavior, where the larger observations are located. At a high retention level (around 3 million to 4.5 million), two estimators provide close results. However, beyond 4.5 million retention level, a deviation from two estimators can be visualized. Due to the lack of sample data at high retention level, one would get a low quality result of the net premium estimation using non-parametric approach. On the other hand, the parametric estimator predicts a higher amount of premium because the parametric distribution tends to assign more weight on the tail. Using the technique of modeling larger losses data with a heavy-tailed distribution enables reinsurance companies to make a better forecast on the net premium. Comparison of Estimators of Premium Comparison of Estimators of Premium (R>3000000) Estimators of Premium 0 500000 1000000 1500000 2000000 Non-Parametric Parametric Estimators of Premium 0 50000 100000 150000 Non-Parametric Parametric 0e+00 2e+06 4e+06 6e+06 8e+06 3e+06 4e+06 5e+06 6e+06 7e+06 8e+06 Retention Level R Retention Level R Figure 8: Comparison of the net premium estimators 6. Conclusion An overview of loss modeling process and net premium calculation is done using two methods: non-parametric and parametric. An inspection of the tail behavior for the claims data should be constantly done in order to see if the pricing model is accurate to apply. This paper has provided a suggestion on using parametric model for reinsurance pricing when having insufficient large claims data. However, it may not be a final solution for reinsurers due to different sizes of claims with different risks involved in it. 9

References 1. Beirlant J., Matthys J. and Dierckx G. (2001). Heavy-Tailed Distributions and Rating. ASTIN Bulletin, 31, 37-58. 2. Beirlant J., Goegebeur Y., Teugels J., Segers J., Waal J. J. and Ferro C. (2006). Statistics of extremes: theory and applications. John Wiley & Sons. 11. 3. Hill B.M. (1975). A simple approach to inference about the tail of a distribution. Annals of Statistics. 3: 1163-1174. 4. University of Illinois (2016). Case study: heavy-tailed distribution and reinsurance ratemaking. 5. Danielsson J., De Haan L.,Peng L., De Vries C. G.. (2001). Using a bootstrap method to choose the sample fraction in tail index estimation. Journal of Multivariate Analysis. 76(2). 226-248. 10