Use of GLMs in a competitive market. Ji Yao and Simon Yeung, Advanced Pricing Techniques (APT) GIRO Working Party

Similar documents
Agenda. Current method disadvantages GLM background and advantages Study case analysis Applications. Actuaries Club of the Southwest

Bonus-malus systems 6.1 INTRODUCTION

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Keywords Akiake Information criterion, Automobile, Bonus-Malus, Exponential family, Linear regression, Residuals, Scaled deviance. I.

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting

London Market Pricing Framework

Predictive Modeling GLM and Price Elasticity Model. David Dou October 8 th, 2014

Modeling. joint work with Jed Frees, U of Wisconsin - Madison. Travelers PASG (Predictive Analytics Study Group) Seminar Tuesday, 12 April 2016

Loss Cost Modeling vs. Frequency and Severity Modeling

Free Market Pricing Issues

Institute of Actuaries of India Subject CT6 Statistical Methods

Session 5. Predictive Modeling in Life Insurance

An Actuarial Evaluation of the Insurance Limits Buying Decision

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

The Real World: Dealing With Parameter Risk. Alice Underwood Senior Vice President, Willis Re March 29, 2007

Probability Weighted Moments. Andrew Smith

M249 Diagnostic Quiz

Dynamic Risk Modelling

The impact of Solvency II on the Asset Management industry

Generalized Linear Models II: Applying GLMs in Practice Duncan Anderson MA FIA Watson Wyatt LLP W W W. W A T S O N W Y A T T.

Contents Utility theory and insurance The individual risk model Collective risk models

By-Peril Deductible Factors

GLM III - The Matrix Reloaded

Correlation and Diversification in Integrated Risk Models

2.1 Random variable, density function, enumerative density function and distribution function

Internal Model Industry Forum (IMIF) Workstream G: Dependencies and Diversification. 2 February Jonathan Bilbul Russell Ward

Integrating Reserve Variability and ERM:

CHAPTER 2 Describing Data: Numerical

Stochastic Analysis Of Long Term Multiple-Decrement Contracts

IFRS 17 beyond implementation, towards commercial implications

Bayesian Finance. Christa Cuchiero, Irene Klein, Josef Teichmann. Obergurgl 2017

SYLLABUS OF BASIC EDUCATION SPRING 2018 Construction and Evaluation of Actuarial Models Exam 4

Risk Business Capital Taskforce. Part 2 Risk Margins Actuarial Standards: 2.04 Solvency Standard & 3.04 Capital Adequacy Standard

UPDATED IAA EDUCATION SYLLABUS

Telematics and the natural evolution of pricing in motor insurance

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Session 55 PD, Individual Life Mortality Experience Study Results. Moderator: Cynthia MacDonald, FSA, CFA, MAAA

Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State University Tempe, AZ

Session 178 TS, Stats for Health Actuaries. Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA. Presenter: Joan C. Barrett, FSA, MAAA

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology

IOP 201-Q (Industrial Psychological Research) Tutorial 5

Statistical Analysis of Life Insurance Policy Termination and Survivorship

Study Guide on Risk Margins for Unpaid Claims for SOA Exam GIADV G. Stolyarov II

Consistent estimators for multilevel generalised linear models using an iterated bootstrap

INSTITUTE AND FACULTY OF ACTUARIES SUMMARY

Choice Probabilities. Logit Choice Probabilities Derivation. Choice Probabilities. Basic Econometrics in Transportation.

Ways of Estimating Extreme Percentiles for Capital Purposes. This is the framework we re discussing

Cambridge University Press Risk Modelling in General Insurance: From Principles to Practice Roger J. Gray and Susan M.

Risk Classification In Non-Life Insurance

Life 2008 Spring Meeting June 16-18, Session 67, IFRS 4 Phase II Valuation of Insurance Obligations Risk Margins

HANDBOOK OF. Market Risk CHRISTIAN SZYLAR WILEY

AGENT BASED MODELING FOR PREDICTING PROPERTY AND CASUALTY UNDERWRITING CYCLES Presenter: Gao Niu Supervisor: Dr. Jay Vadiveloo, Ph.D.

The Application of the Theory of Power Law Distributions to U.S. Wealth Accumulation INTRODUCTION DATA

PRE CONFERENCE WORKSHOP 3

Gamma Distribution Fitting

Effects of missing data in credit risk scoring. A comparative analysis of methods to gain robustness in presence of sparce data

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

Long Term Care Case Studies or LTC Can it be profitable? Dr. Wolfgang Droste, Gen Re

European option pricing under parameter uncertainty

Describing Uncertain Variables

Subject CS2A Risk Modelling and Survival Analysis Core Principles

Monetary Economics Measuring Asset Returns. Gerald P. Dwyer Fall 2015

Frequency Distribution Models 1- Probability Density Function (PDF)

CHAPTER II LITERATURE STUDY

Guidance paper on the use of internal models for risk and capital management purposes by insurers

Financial Risk Modelling for Insurers

Non parametric IBNER projection

the display, exploration and transformation of the data are demonstrated and biases typically encountered are highlighted.

Statistics (This summary is for chapters 17, 28, 29 and section G of chapter 19)

FINANCIAL SIMULATION MODELS IN GENERAL INSURANCE

Operational Risk Aggregation

Practical Aspects of Mortality Improvement Modeling

Idiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective

Window Width Selection for L 2 Adjusted Quantile Regression

Key Words: emerging markets, copulas, tail dependence, Value-at-Risk JEL Classification: C51, C52, C14, G17

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

APPROACHES TO VALIDATING METHODOLOGIES AND MODELS WITH INSURANCE APPLICATIONS

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright

Copula-Based Pairs Trading Strategy

PBR Regulatory Update and Implementation Challenges

Modelling Bank Loan LGD of Corporate and SME Segment

Alternative VaR Models

INSTITUTE OF ACTUARIES OF INDIA

Calibration and Parameter Risk Analysis for Gas Storage Models

ERM Sample Study Manual

EX-POST VERIFICATION OF PREDICTION MODELS OF WAGE DISTRIBUTIONS

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

Does my beta look big in this?

Perspectives on European vs. US Casualty Costing

Approximate Variance-Stabilizing Transformations for Gene-Expression Microarray Data

Why Pooling Works. CAJPA Spring Mujtaba Datoo Actuarial Practice Leader, Public Entities Aon Global Risk Consulting

2017 IAA EDUCATION SYLLABUS

Introduction Recently the importance of modelling dependent insurance and reinsurance risks has attracted the attention of actuarial practitioners and

MAKING CLAIMS APPLICATIONS OF PREDICTIVE ANALYTICS IN LONG-TERM CARE BY ROBERT EATON AND MISSY GORDON

Dynamic Replication of Non-Maturing Assets and Liabilities

Forecasting & Futurism

SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS

Notes on a California Perspective of the Dairy Margin Protection Program (DMPP)

UNIT 4 MATHEMATICAL METHODS

INSTITUTE OF ACTUARIES OF INDIA

Transcription:

Use of GLMs in a competitive market Ji Yao and Simon Yeung, Advanced Pricing Techniques (APT) GIRO Working Party 12 June 2013

About the presenters Dr. Ji Yao is a manager with Ernst & Young s EMEIA insurance risk and actuarial services practice. He has extensive first-hand experience in various modelling for pricing, including risk models, demand models and price optimisation, with a solid background in mathematics and statistics. He is the chair of the Advanced Pricing Techniques (APT) GIRO working party. JYao@uk.ey.com Simon Yeung is currently a senior manager at Grant Thornton. Prior to joining Grant Thornton, Simon was the head of motor pricing at Saga for 3 years. Before that, he was a reserving manager at RBS Insurance for 3 years. Before joining RBS Insurance he worked for London market insurers, reinsurers and commercial insurers for four and half years. He is a member of the Advanced Pricing Techniques (APT) GIRO working party. Simon.yeung@uk.gt.com 12 June 2013 2

Agenda Introduction Current market and uses of GLM Three overlooked facts of GLM and their implications Summary and Q&A 12 June 2013

Introduction Advanced Pricing Techniques (APT) GIRO working party was created in 2012 22 members working in three work streams GLM Telematics pricing Conversion/Elasticity modelling One workshop in GIRO 40 and one paper on GLM is being prepared We will focus on GLM in this presentation 12 June 2013 4

Current uses of GLM in the market Risk base pricing Cost plus approach Price optimisation 12 June 2013 5

MarketPerformance Looking in more detail at: Change in claim ratio and frequency over time What, if any, relationships can we derive between the two? How does this relate back to GLM modelling? Data used: Cross section of market (8 companies) Totalling 4.7bn earned premium in 2010 High level data taken from FSA returns 12 June 2013 6

Claim ratio over time Claim ratio 120% 110% 100% 90% 80% 70% 60% Claim ratio over time Steady rise in claim ratio from 2005 due to reasons such as: -aggregators -increase in BI claims -recession -claims farming Fall in claims ratio 2009-2010 due to rate increase Company 1 Company 2 Company 3 Company 4 Company 5 Company 6 Company 7 Company 8 Average 50% 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 Year 12 June 2013 7

Claim frequency over time 28 Claim frequency over time Claim frequency (%) 26 24 22 20 18 16 Some companies have seen their claims frequency fluctuate greatly over time On average claim frequency has been slowly reducing since 2007 Company 1 Company 2 Company 3 Company 4 Company 5 Company 6 Company 7 Company 8 14 Average 12 10 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 Year 12 June 2013 8

Correlation between Frequency and ULR Average Claims Ratio vs Average Claims Frequency 16.2 85% 16.0 Strong correlation (~74%) between Claims Ratio and Claims Frequency 80% Average Claims Frequency (%) 15.8 15.6 15.4 15.2 15.0 14.8 2008-2010 excluded due to recession and hike in petrol prices 2001 2002 2003 2004 2005 2006 2007 Year Market competition putting pressure on price can charge Insufficient data to accurately price the risks 1. Company data is only a sample 2. Unable to model using GLMs 75% 70% 65% 60% Average Claims Ratio Average claims frequency Average claims ratio 12 June 2013 9

Frequency vs Average Premium 16.2 355 16.0 350 Claims Frequency (%) 15.8 15.6 15.4 15.2 345 340 335 Average Premium ( ) Average Claims Frq excl. Company 2 and 8 Average Premium excl. Company 2 and 8 15.0 330 14.8 2001 2002 2003 2004 2005 2006 2007 Year 325 12 June 2013 10

Change in mix of business Fluctuations in frequency due to changes in mix of business Claims Frequency vs Claims Ratio (Company 2) Claims Frequency vs Claims Ratio (Company 8) 25 100% 30 100% Claims Frequency (%) 20 15 10 5 0 2001 2002 2003 2004 2005 2006 2007 Year 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% Claims Ratio Claims Frequency (%) 25 20 15 10 5 0 2001 2002 2003 2004 2005 2006 2007 Year 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% Claims Ratio Claims Frequency Claims Ratio 12 June 2013 11

Claim Frequency vs Average Premium Claim Frequency vs Average Premium over time (Company 2) Claim Frequency vsaverage Premium over time (Company 8) 330 25.00 450 30.00 Average Premium ( ) 325 320 315 310 305 300 295 290 285 20.00 15.00 10.00 5.00 Claim Frequency (%) Average Premium ( ) 400 350 300 250 200 150 100 50 25.00 20.00 15.00 10.00 5.00 Claim Frequency (%) 280 2001 2002 2003 2004 2005 2006 2007 0 0 2001 2002 2003 2004 2005 2006 2007 0.00 Year Year Company 2 Average Premium Company 8 Average Premium Company 2 Claims Frequency Company 8 Claims Frequency 12 June 2013 12

There are a wide range of quoted premium on the market, while GLMs are used as a standard pricing technique throughout market. Quotes for a 30 year old male with a clean license held for 10 years, for a 57 plate manual 1.6L ford focus style 5 door hatchback. Car is kept at home parked on the road, for social use only, approx 9000 annual mileage 12 June 2013 13

Quotes for a 40 year old married female with a clean license held for 15 years for a 59 diesel Golf GTD 2.0L 3 door hatchback. Car is kept at home and parked on a driveway for social use only, approx 7000 miles 12 June 2013 14

Current market and uses of GLM GLM is a standard approach for risk pricing and price optimisation Wide range of price for individual quote Wide range of performance for market player What causes the difference? 12 June 2013 15

GLM technical details A GLM consists of the following three components: 1. Random component Each component of Y is independent and is from one of the exponential family of distributions. 2. Systematic component A linear combination of the estimated parameters gives the linear predictor, η: 3. Link function The relationships between the random and systematic components is specified via a link function, g, such that: 4. Data The dataset that GLM trained on. 12 June 2013 16

Three overlooked facts of GLM 1. GLMs put either zero or full credibility into data 2. GLMs implicitly use median from the distribution of prediction 3. GLM results depend on the mixture of rating variables in the data 12 June 2013 17

Quiz 1: Average weight of yellow balls There is a bag of coloured balls. You sampled a few of them from the bag and obtained the following information: Colour Avgweight (kg) Yellow 6 Red 4 Average All 5 What is your estimation of the average weight of yellow balls? A) Use average of yellow balls ONLY 6kg B) Use average of ALL balls 5kg C) Blended average weight of yellow balls and non-yellow balls D) Other (with suggestions) 12 June 2013 18

GLM fact 1: GLMs put either zero or full credibility into data 12 June 2013 19

A gradual approach to include data is needed in modelling Sample 6 balls from the bag of yellow and red balls, and we obtained these weights: Colour Avg weight (kg) Yellow 4 Yellow 6 Yellow 8 Red 2 Red 4 Red 6 Would you suddenly change your view because of the additional six balls? Keep sampling and if we get 6 more identical balls as before: Colour Avg weight (kg) Yellow 4 Yellow 4 Yellow 6 Yellow 6 Yellow 8 Yellow 8 Red 2 Red 2 Red 4 Red 4 Red 6 Red 6 Testing the colour factor in a GLM shows that Yellow is not significantly different from Red at 95% confidence level (p-value=0.1336). Avg weight of yellow balls = 5 Testing the colour factor in a GLM shows Yellow is now significantly different from Red at 95% confidence level (p-value=0.0339). Avg weight of yellow balls = 6 12 June 2013 20

An important implication is GLMs tend to push relativities and hence price towards extreme levels 5kg 5.Xkg 6kg As the normal GLM practice is to calibrate the base rate after relativities are calculated, extreme relativities will result in more policies being priced at very low (or high) end Over-priced policies never get converted in a competitive market, so insurers are exposed to big underpricing risk Linked to the observed diversified quoted premium on the market 12 June 2013 21

Generalised linear mixed models (GLMMs) provide a potential solution GLMMs are an extension to GLM, in which the linear predictor contains random effects to allow for correlation of the data in addition to the usual fixed effects. It provides a convenient way of applying credibility blending within GLM. Random effect 12 June 2013 22

Quiz 2: Mean, median or mode? a question not only relevant to reserving or capital A pricing analysis gives a range of possible prices for a risk as shown in the table below: Price 400 20% 500 30% 600 20% 700 20% What is the price you will charge for the risk? A) Mode - 500 B) Median - 550 C) Mean - 560 Probability 800 10% 0% D) Other (with suggestions) 12 June 2013 23 35% 30% 25% 20% 15% 10% 5% 400 500 600 700 800

GLM fact 2: GLMs implicitly use median from the distribution of prediction The linear predictor ΣX i β i is asymptotically normally distributed as all β i are asymptotically multivariately normally distributed After the link function transformation, the prediction is no longer normally distributed. Take log link as an example: where Mean: Median: Mode: 12 June 2013 24

Link function is the dominant factor in shaping the distribution of prediction Consider a severity model with Gamma error structure. Results for different link functions: 0.10 0.08 0.06 0.04 0.02 0.00 60 500 1700 Log link 2900 4100 5300 6500 7700 8900 10100 11300 12500 0.0010 0.0008 0.0006 0.0004 0.0002 0.0000 Identical link 1 270 620 970 1320 1670 2020 2370 2720 3070 3420 3770 4120 4470 4820 Inverse link 100 2900 5700 8500 11300 14100 16900 19700 22500 25300 28100 30900 33700 36500 39300 42100 44900 941 2400 6121 153 2400 4647 1240 2400 37656 Lower bound Mean Upper bound These examples show that the upper and lower bounds could be very different, and the prediction is not always the mean of the distribution! 12 June 2013 25

Do GLMs systematically underestimate the cost? For a distribution skewed towards the left, usually it is the case that Mode<Median<Mean, so the median used by GLMs is always lower than the mean To use mean, the term needs to be better understood and calculated. The key difficulty is the correlation matrix between β i. 12 June 2013 26

GLM fact 3: GLM results depend on the mixture of rating variables in the data Driver Age Car Age Claim Old Old 0.2 Old New 0.3 Young Old 0.4 Young New 0.6 Driver Age Car Age Claim Old Old 0.2 Old Old 0.2 Old New 0.3 Young Old 0.4 Young New 0.6 Parameter Level1 Estimate StdErr Intercept 0.4286 0.565 age Old -0.2411 0.6061 age Young 0 0 carage New 0.1339 0.5836 carage Old 0 0 Scale 1 0 Parameter Level1 Estimate StdErr Intercept 0.4305 0.5594 age Old -0.2374 0.5827 age Young 0 0 carage New 0.1297 0.552 carage Old 0 0 Scale 1 0 12 June 2013 27

GLMs results are dragged toward the segment where there is more data 4 data points Driver Age Car Age Claim Prediction Old Old 0.2 0.1875 Old New 0.3 0.32143 Young Old 0.4 0.42857 Young New 0.6 0.5625 5 data points Driver Age Car Age Claim Prediction Old Old 0.2 0.19315 Old Old 0.2 0.19315 Old New 0.3 0.3229 Young Old 0.4 0.43053 Young New 0.6 0.56027 The dependency is not trivial. Some practical examples are: Quote based premium model vs. sale based premium model Modelled loss ratio for quotes vs. Sales Time testing 12 June 2013 28

With a view to future is the key to mitigate this issue GLM should be trained on expected future mixture of portfolio, rather than historical portfolio. Iterative modelling approach: Fit GLM Set price Feed weight into GLM Model conversion 12 June 2013 29

Summary Significant variation in underwriting performance and quoted premiums in the current motor market pose challenges on the pricing techniques used in business. As the standard pricing technique, GLMs are coming cross new issues in a highly competitive market: GLMs put either zero or full credibility into data GLMs implicitly use median from the distribution of prediction GLM results depend on the mixture of rating variables in the data Being able to understand and solve these issues could be one of the key ways to gain a competitive advantage in the market. 12 June 2013 30

Questions Comments Expressions of individual views by members of the Institute and Faculty of Actuaries and its staff are encouraged. The views expressed in this presentation are those of the presenter. 12 June 2013 31