Modeling of Claim Counts with k fold Cross-validation
|
|
- Sydney Bruce
- 5 years ago
- Views:
Transcription
1 Modeling of Claim Counts with k fold Cross-validation Alicja Wolny Dominiak 1 Abstract In the ratemaking process the ranking, which takes into account the number of claims generated by a policy in a given period of insurance, may be helpful For example, such a ranking allows to classify the newly concluded insurance policy to the appropriate tariffication group For this purpose, in this paper we analyze models applicable to the modelling of counter variables In the first part of the paper we present the classical Poisson regression and a modified regression model for data, where there is a large number of zeros in the values of the counter variable, which is a common situation in the insurance data In the second part we expand the classical Poisson regression by adding the random effect The goal is to avoid an unrealistic assumption that in every class all insurance policies are characterized by the same expected number of claims In the last part of the paper we propose to use k fold cross validation to identify the factors which influence the number of insurance claims the most Then, setting the parameters of the Poisson distribution, we create the ranking of polices using estimated parameters of the model, which give the smallest cross validation mean squared error In the paper we use a real world data set taken from literature For all computations we used a free software environment R Keywords: automobile insurance, ratemaking, risk classification, GLM, HGLM, ZIP models 1 Introduction Every person, when applying for an insurance policy, is assigned to a class, that is homogeneous in terms of the tariffication system One of the criteria used for assigning an individual to a certain class is the number of claims Thus it is insurance companies very important task to model the number of claims in a given insurance portfolio In the paper we propose a simple procedure for creating a ranking of insurance policies and also for classifying them due to the number of claims It allows a preliminary classification of a new policy to a group with an adequate premium level The very common choice of a method for modelling the number of claims is a regression model with the use of Poisson distribution (Poisson regression), which is a special case of a Generalized Linear Model (GLM) However the insurance portfolios have a very specific characteristic, ie for many policies there are no claims observed in the insurance history for a given period It means that data contains lots of zeros and, as a consequence, the Poisson regression may not give satisfactory results Therefore when creating the ranking, the GLM model and ZIP model (zero inflated Poisson) and the model with a random effect were considered The ranking creation procedure used a k fold cross-validation and furthermore the ranking was discretized due to a parameter λ We build many different models and then we use a 10-fold cross-validation in order to recognize which rating variables have an impact on the presence of zeros in the policies portfolios The data for the illustrative example has been taken from the literature [6] All the computations were conducted in R the free software environment The procedure for building a model with random effect and a cross validation technique have been written in R language 1 University of Economics in Katowice, Faculty of Economics, Department of Statistical and Mathematical Methods in Economics, ul Bogucicka 14, Katowice, alicjawolny-dominiak@uekatowicepl 1
2 2 Modelling the number of claims in the automobile insurance with the use of cross validation procedure The linear regression models are used for creating a raking of insurance policies due to the number of claims Generalized Linear Models (GLM) In these models we assume that the number of claims is a dependent variable Y that follows Poisson distribution and it depends on certain system of predictors [1]: P(Y i = y i ) = e λi λ yi i, i = 1,,n, (1) y i! where Y i is the number of claims for the ith insured person, y 1,,y n are independent and have equal variances, and the average number of claims is equal to the variance The λ i parameter is the expected number of claims and it depends on predictors X j,j = 1,,k, that describe the insured individual or vehicle, eg sex, age, engine capacity The logarithm is used as a link function: Thus we have that the expected value equals: logλ i = k β ji X ji (2) j=1 λ i = µ i = e k j=1 βjixji (3) We can see that for every linear combination of predictors the expected value of the number of claims is always positive The λ i parameter is adjusted with the use of d i exposition to risk factor for the ith policy This factor shows what part of the analysed period of time was covered by a given policy: When creating the ranking we used min λ i as a criterion λ i = d i e k j=1 βjixji (4) The independence assumption in the above model may not be fulfilled In that case the solution is to use a mixed model and introducing a random effect ν Hierarchical Generalized Linear Models (HGLM) In case of automobile insurance data the region or the vehicle model can be treated as a random effect ν Hierarchical generalized linear model (HGLM) with variable y u following the Poisson distribution, has a form: µ = E(y u) = e Xβ+ν, Var(y u) = φv(µ), ν = logu, where β = [β 1,,β I ], u = [u 1,,u K ] and X is the model matrix The distribution of the random effect may belong to the exponential dispersion family of distributions, eg the gamma distribution with parameter α: E(u) = ψ, (6) Var(u) = αv(ψ) The structural parameters of a model have a following interpretation: parameter β i, i = 1,,I, measures the influence of the ith predictor on the number of claims (which is equal for every category), parameter u k, k = 1,,K, measures the risk level for every category (which is different for every category) (5)
3 Zero inflated generalized linear model Another model used for modelling the number of claims is ZIP model, where counting response variable has many zero values This is exactly the case when modelling the number of counts Analysing different risk portfolios it can be noticed that for many policies there is no claim observed and if the claims occur their number is one, two or three and very rarely more In the ZIP model the independent variables Y i take zero values Y i 0 with the probability i or values from Poisson distribution Y i Pois(λ i ) with probability 1 i It can be written in a form [5]: { i +(1 i)e λi, if y i = 0 P(Y i = y i ) = (1 i) e λ iλ y i i y i!, if y i > 0 i = 1,,n (7) Thus in the ZIP model we have two parameters: λ i and i Both parameters, as in case of Poisson regression, are linked with predictor variables with the following link functions: ( ) i t log = γ ji Z ji, 1 i j=1 (8) k logλ i = β ji X ji, j=1 wherez 1,,Z l arethedependent variablesforthe firstequationandx 1,,X k forthesecondone The expected value and variance of the number of claims for the ith policy in the ZIP model are, respectively: E(Y i ) = λ i (1 i), D 2 (Y i ) = (1 i)(λ i iλ 2 i ) (9) Similarly to Poisson regression case, in the ZIP model we assume that the average number of claims equals the variance The solution to a problem when excessive dispersion occurs is the use of negative bimodal distribution [4] In order to unify the process of comparing presented models, the choice of the model for the number of claims and the choice of the combination of predictor variables that is responsible for generating zeros in the policies have been supported by statistical learning methods In general in these methods we assume we are given a training data set D = {(x i,y i ),i = 1,,N}, where x i,y i R Moreover we assume that data is iid (independent and identically distributed) and it has been taken from the population with a multidimensional distribution defined by an unknown density function: p(x, y) = p(x)p(y x) (10) The task is to search a given set of functions H = {f(x, ) : Ω}, where is a model parameters vector, andto find the best element Using the model f(x, ) H, which isalwaysa simplified equivalent of the analysed phenomenon, we accept some errors that are just the consequence of taking theoretical values instead of real values for response variable These errors (for a given observation) are measured by so called loss functions L(y, f(x, )) In the concept of statistical learning, the risk functional is considered It measures the overall loss, ie the sum of errors for all possible observations One of the methods of estimating the value of risk functional is the cross validation method (CV) [2] In this paper we use 10 fold cross-validation algorithm, ie: a) randomly divide the training set into k = 10 approximately equally sized parts (n the size of a training set, m l the size of the lth subset, l = 1,,10), b) build 10 times a model using 9 of 10 parts (n m l observations), treating excluded observations as validation set, (y ˆµl ) 2 c) calculate 10 times the value of the mean squared error MSE l = using the validation set, d) estimate the cross validation error: cv = 10 l=1 The model with the smallest cv value is selected m l n MSE l m l
4 3 Procedure of creating ranking of property insurance policies and classification of these policies The procedure of building a ranking of policies using linear models presented in the previous part of the paper may be formulated in a few steps: STEP 1: Estimating λ parameter for every policy in the portfolio using three different models: generalized linear model, hierarchical generalized linear model and zero inflated generalized linear model STEP 2: Applying cross validation procedure to every model from Step 1 STEP 3: Choosing the model with the smallest cv error STEP 4: Creating the ranking of insurance policies for every combination of predictor variables X i, using as a criterion min λ STEP 5: Discretizing the ranking due to the values of parameters λ and thus obtaining insurance risk classification which allow to classify a new policy to a group with an adequate premium level Based on the estimated parameter λ for a chosen model, we create ranking and conduct discretization in order to obtain different classes of insurance risk Discretization means dividing the ordered set of values of a given continuous variable onto finite number of disjoint intervals Labels can be assigned to these intervals, eg: high insurance risk level, neutral to risk etc The problem is how to determine the cut points These cut points should separate the object from different risk classes in a best possible way There are two main approaches in discretization: agglomerative and divisive The first one starts with every single empirical value of the continuous variable belonging to a different interval and then neighbouring intervals are merged iteratively until the maximum value of homogeneity of subsets measure is reached The second approach starts with one big interval covering all empirical values of the continuous variable that is iteratively divided, using previously determined cut points 4 Empirical example In order to illustrate the process of creating the ranking and discretizing it, the necessary procedures were implemented in R environment The automobile insurance data set including information about the number of claims has been used for computations The following variables form the data set and have been considered in the model: 1 Driverage age of the insured person (driver) 2 Region: classes from 1 to 7 3 MCclass: classes from 1 to 7 These classes were created based on the EV coefficient defined as: EV = where 75 kg is the average weight of a driver 4 Vehege age of the vehicle engine capacity in kw 100 vehicle weight in kg+75, 5 Numclaims number of claims (the sum within the class) Dependences between predictor variables influencing the number of claims are presented in Figure 1 in Appendix A
5 Procedure for creating the ranking STEP 1: We model the number of claims with the use of three types of models presented above Model 1 GLM for the variable Numclaims assuming Poisson distribution R Code data(dataset) glmformula=numclaims Driverage+Region+MCclass+Vehage glmmodel1=glm(glmformula, family=poisson(link="log"), data=dataset) summary(glmmodel1) β i Standard error e β i Intercept DriverageA 0 1 DriverageB DriverageC DriverageD DriverageE DriverageF DriverageG RegionA 0 1 RegionB RegionC RegionD RegionE RegionF RegionG MCclassA 0 1 MCclassB MCclassC MCclassD MCclassE MCclassF MCclassG VehageA 0 1 VehageB VehageC VehageD Table 1: Parameters for Model 1 The following combination was chosen as a reference categories: DriverageA, RegionA, MCclassA, VehageA Model 2 HGLM of a type POISSON GAMMA for the variable Numclaims assuming Poisson distribution and treating variable Region as a random effect with gamma distribution R Code data(dataset) modelpoissongamma=function(x=x, Z=Z, Y=Y, datasetletters= datasetletters, glmformula=numclaims Driverage+Region+MCclass+Vehage)
6 β i Standard error e β i Intercept DriverageA 0 1 DriverageB DriverageC DriverageD DriverageE DriverageF DriverageG MCclassA 0 1 MCclassB MCclassC MCclassD MCclassE MCclassF MCclassG VehageA 0 1 VehageB VehageC VehageD Table 2: Parameters for Model 2 fixed effects β i Standard error e β i RegionA RegionB RegionC RegionD RegionE RegionF RegionG Table 3: Parameters for Model 2 random effect Region
7 Model 3 Model ZIP taking into account a large number of zero values for variable Numclaims R Code data(dataset) ZIPmodel3=zeroinfl(formula=Numclaims Driverage+Region+ MCclass+Vehage 1, data=dataset) summary(zipmodel3) Function zeroinfl is from the library {pscl} β i Standard error e β i Intercept DriverageA 0 1 DriverageB DriverageC DriverageD DriverageE DriverageF DriverageG RegionA 0 1 RegionB RegionC RegionD RegionE RegionF RegionG MCclassA 0 1 MCclassB MCclassC MCclassD MCclassE MCclassF MCclassG VehageA 0 1 VehageB VehageC VehageD Table 4: Parameters for Model 3 The probability that variable Numclaims takes zero value equals 82% STEP 2: Ten fold cross validation procedure was applied to every model from Step 1, obtaining corresponding cv errors Model 1 MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method:
8 MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSECV for the model of the form: Numclaims Driverage + Region + MCclass + Vehage MSECV for the model equals: ************************************************************************* Model 2 MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: 1977 MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: 1679 MSECV for the model equals: ************************************************************************* Model 3 MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSE on one of 10 validation parts in CV method: MSECV for the model of the form: Numclaims Driverage + Region + MCclass + Vehage 1 MSECV for the model equals: ************************************************************************* STEP 3: The smallest value of MSE cv was obtained for the Model 3, ie for the zero-inflated generalized linear model Thus this model was used further in the ranking creation steps STEP 4/STEP 5: The number of combinations of different empirical values of predictor variables X i equals 1372 After discretization every combination was assigned a label representing a risk class: from 10 the lowest risk of claim to occur, to 1 the highest risk of claim to occur The first five combinations of categories in the ranking were presented in Table 5 for illustration
9 Driverage Region MCclass Vehage λ Risk class DriverageG RegionG MCclassG VehageD DriverageG RegionE MCclassG VehageD DriverageG RegionG MCclassG VehageC DriverageG RegionG MCclassD VehageD DriverageG RegionG MCclassA VehageD Table 5: Part of the ranking and classification based on Model 3 5 Summary The procedure for recognizing risk classes in the insurance policies portfolios proposed in the paper allows to differentiate policies with no claims observed in the insurance history The minimum value of λ criterion used in classification causes that the risk classes and associated premiums are fairer for individuals applying for an insurance policy Essentially the main disadvantage of ZIP model, that turned out to be the best in terms of cv error criterion, is that within every risk class the policies have equal expected number of claims, which is an unrealistic assumption The solution to this issue may be using the mixed Poisson model and introducing a random effect that would differentiate policies (ZIP regression with random effect) However estimating that type of model is computationally very demanding what discourages from using in real world applications References [1] Denuit, M, and Marechal, X, and Pitrebois, S, and Walhin, J: Actuarial Modelling of Claims Counts John Wiley & Sons Ltd, 2007 [2] Gatnar, E: Ensemble Approach in Classification and Regression (in Polish) Wydawnictwo Naukowe PWN, Warszawa, 2008 [3] Gatnar, E, and Walesiak, M: Qualitative and Symbolic Data Analysis with R (in Polish) Wydawnictwo C H Beck, Warszawa, 2011 [4] Kopczewska, K, and Kopczewski, T, and Wjcik, P: Quantitative Methods in R Economic and Financial Applications (in Polish) Cedetupl Wydawnictwa Fachowe, Warszawa, 2009 [5] Lambert, D: Zero-Inflated Poisson Regression, with an Application to Defects in Manufacturing, Technometrics, 18, (1992) [6] Ohlsson, E, and Johansson, B: Non Life Insurance Pricing with Generalized Linear Models Springer Verlag, Berlin, 2010 Appendix A R Code library(vcd) dataset=readcsv2(file="e:/datasetcsv") attach(dataset) tabkontyng=table(driverage,region,mcclass,vehage) diag=list(varoffset=11, rot=0, fill=greycolors, gpvartext=gpar(fontsize=10), gpleveltext=gpar(fontsize=10)) pairs(tabkontyng,diagpanelargs=diag, highlighting=2) detach(dataset)
10 Driver_age ABCDEFG Region ABCDEFG MC_class ABCDEFG Veh_age A B C D Figure 1: Dependences between predictor variables influencing the number of claims
arxiv: v1 [q-fin.rm] 13 Dec 2016
arxiv:1612.04126v1 [q-fin.rm] 13 Dec 2016 The hierarchical generalized linear model and the bootstrap estimator of the error of prediction of loss reserves in a non-life insurance company Alicja Wolny-Dominiak
More informationKeywords Akiake Information criterion, Automobile, Bonus-Malus, Exponential family, Linear regression, Residuals, Scaled deviance. I.
Application of the Generalized Linear Models in Actuarial Framework BY MURWAN H. M. A. SIDDIG School of Mathematics, Faculty of Engineering Physical Science, The University of Manchester, Oxford Road,
More informationEstimation Parameters and Modelling Zero Inflated Negative Binomial
CAUCHY JURNAL MATEMATIKA MURNI DAN APLIKASI Volume 4(3) (2016), Pages 115-119 Estimation Parameters and Modelling Zero Inflated Negative Binomial Cindy Cahyaning Astuti 1, Angga Dwi Mulyanto 2 1 Muhammadiyah
More information1. You are given the following information about a stationary AR(2) model:
Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4
More informationRisk Classification In Non-Life Insurance
Risk Classification In Non-Life Insurance Katrien Antonio Jan Beirlant November 28, 2006 Abstract Within the actuarial profession a major challenge can be found in the construction of a fair tariff structure.
More informationA RIDGE REGRESSION ESTIMATION APPROACH WHEN MULTICOLLINEARITY IS PRESENT
Fundamental Journal of Applied Sciences Vol. 1, Issue 1, 016, Pages 19-3 This paper is available online at http://www.frdint.com/ Published online February 18, 016 A RIDGE REGRESSION ESTIMATION APPROACH
More informationINSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION
INSTITUTE AND FACULTY OF ACTUARIES Curriculum 2019 SPECIMEN EXAMINATION Subject CS1A Actuarial Statistics Time allowed: Three hours and fifteen minutes INSTRUCTIONS TO THE CANDIDATE 1. Enter all the candidate
More informationIntro to GLM Day 2: GLM and Maximum Likelihood
Intro to GLM Day 2: GLM and Maximum Likelihood Federico Vegetti Central European University ECPR Summer School in Methods and Techniques 1 / 32 Generalized Linear Modeling 3 steps of GLM 1. Specify the
More informationDual response surface methodology: Applicable always?
ProbStat Forum, Volume 04, October 2011, Pages 98 103 ISSN 0974-3235 ProbStat Forum is an e-journal. For details please visit www.probstat.org.in Dual response surface methodology: Applicable always? Rabindra
More informationWindow Width Selection for L 2 Adjusted Quantile Regression
Window Width Selection for L 2 Adjusted Quantile Regression Yoonsuh Jung, The Ohio State University Steven N. MacEachern, The Ohio State University Yoonkyung Lee, The Ohio State University Technical Report
More informationRecent analysis of the leverage effect for the main index on the Warsaw Stock Exchange
Recent analysis of the leverage effect for the main index on the Warsaw Stock Exchange Krzysztof Drachal Abstract In this paper we examine four asymmetric GARCH type models and one (basic) symmetric GARCH
More informationOn modelling of electricity spot price
, Rüdiger Kiesel and Fred Espen Benth Institute of Energy Trading and Financial Services University of Duisburg-Essen Centre of Mathematics for Applications, University of Oslo 25. August 2010 Introduction
More informationModeling. joint work with Jed Frees, U of Wisconsin - Madison. Travelers PASG (Predictive Analytics Study Group) Seminar Tuesday, 12 April 2016
joint work with Jed Frees, U of Wisconsin - Madison Travelers PASG (Predictive Analytics Study Group) Seminar Tuesday, 12 April 2016 claim Department of Mathematics University of Connecticut Storrs, Connecticut
More informationOPERATIONAL EXPANDITURE BENCHMARKING OF REGIONAL DISTRIBUTION UNITS AS A TOOL FOR EFFICIENCY EVALUATION AND DESIRED COST LEVEL ESTIMATION
OPERATIONAL EXPANDITURE BENCHMARKING OF REGIONAL DISTRIBUTION UNITS AS A TOOL FOR EFFICIENCY EVALUATION AND DESIRED COST LEVEL ESTIMATION Jerzy ANDRUSZKIEWICZ Wojciech ANDRUSZKIEWICZ Roman SŁOWIŃSKI Enea
More informationInstitute of Actuaries of India Subject CT6 Statistical Methods
Institute of Actuaries of India Subject CT6 Statistical Methods For 2014 Examinations Aim The aim of the Statistical Methods subject is to provide a further grounding in mathematical and statistical techniques
More informationModelling catastrophic risk in international equity markets: An extreme value approach. JOHN COTTER University College Dublin
Modelling catastrophic risk in international equity markets: An extreme value approach JOHN COTTER University College Dublin Abstract: This letter uses the Block Maxima Extreme Value approach to quantify
More informationObjective Bayesian Analysis for Heteroscedastic Regression
Analysis for Heteroscedastic Regression & Esther Salazar Universidade Federal do Rio de Janeiro Colóquio Inter-institucional: Modelos Estocásticos e Aplicações 2009 Collaborators: Marco Ferreira and Thais
More informationSOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS
SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS Questions 1-307 have been taken from the previous set of Exam C sample questions. Questions no longer relevant
More informationNon replication of options
Non replication of options Christos Kountzakis, Ioannis A Polyrakis and Foivos Xanthos June 30, 2008 Abstract In this paper we study the scarcity of replication of options in the two period model of financial
More informationBEST LINEAR UNBIASED ESTIMATORS FOR THE MULTIPLE LINEAR REGRESSION MODEL USING RANKED SET SAMPLING WITH A CONCOMITANT VARIABLE
Hacettepe Journal of Mathematics and Statistics Volume 36 (1) (007), 65 73 BEST LINEAR UNBIASED ESTIMATORS FOR THE MULTIPLE LINEAR REGRESSION MODEL USING RANKED SET SAMPLING WITH A CONCOMITANT VARIABLE
More informationSubject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018
` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.
More informationMarket Risk Analysis Volume I
Market Risk Analysis Volume I Quantitative Methods in Finance Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume I xiii xvi xvii xix xxiii
More informationRandom Variables Handout. Xavier Vilà
Random Variables Handout Xavier Vilà Course 2004-2005 1 Discrete Random Variables. 1.1 Introduction 1.1.1 Definition of Random Variable A random variable X is a function that maps each possible outcome
More informationWC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology
Antitrust Notice The Casualty Actuarial Society is committed to adhering strictly to the letter and spirit of the antitrust laws. Seminars conducted under the auspices of the CAS are designed solely to
More informationScienceDirect. A Comparison of Several Bonus Malus Systems
Available online at www.sciencedirect.com ScienceDirect Procedia Economics and Finance 26 ( 2015 ) 188 193 4th World Conference on Business, Economics and Management, WCBEM A Comparison of Several Bonus
More informationDistribution analysis of the losses due to credit risk
Distribution analysis of the losses due to credit risk Kamil Łyko 1 Abstract The main purpose of this article is credit risk analysis by analyzing the distribution of losses on retail loans portfolio.
More informationTwo hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER
Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.
More informationChapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi
Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized
More informationME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.
ME3620 Theory of Engineering Experimentation Chapter III. Random Variables and Probability Distributions Chapter III 1 3.2 Random Variables In an experiment, a measurement is usually denoted by a variable
More information**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:
**BEGINNING OF EXAMINATION** 1. You are given: (i) A random sample of five observations from a population is: 0.2 0.7 0.9 1.1 1.3 (ii) You use the Kolmogorov-Smirnov test for testing the null hypothesis,
More informationWeek 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals
Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg :
More informationSession 5. Predictive Modeling in Life Insurance
SOA Predictive Analytics Seminar Hong Kong 29 Aug. 2018 Hong Kong Session 5 Predictive Modeling in Life Insurance Jingyi Zhang, Ph.D Predictive Modeling in Life Insurance JINGYI ZHANG PhD Scientist Global
More informationLecture Notes 6. Assume F belongs to a family of distributions, (e.g. F is Normal), indexed by some parameter θ.
Sufficient Statistics Lecture Notes 6 Sufficiency Data reduction in terms of a particular statistic can be thought of as a partition of the sample space X. Definition T is sufficient for θ if the conditional
More informationAgenda. Current method disadvantages GLM background and advantages Study case analysis Applications. Actuaries Club of the Southwest
watsonwyatt.com Actuaries Club of the Southwest Generalized Linear Modeling for Life Insurers Jean-Felix Huet, FSA November 2, 29 Agenda Current method disadvantages GLM background and advantages Study
More informationOmitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations
Journal of Statistical and Econometric Methods, vol. 2, no.3, 2013, 49-55 ISSN: 2051-5057 (print version), 2051-5065(online) Scienpress Ltd, 2013 Omitted Variables Bias in Regime-Switching Models with
More informationUniversität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen. Evaluation of Models. Niels Landwehr
Universität Potsdam Institut für Informatik ehrstuhl Maschinelles ernen Evaluation of Models Niels andwehr earning and Prediction Classification, Regression: earning problem Input: training data Output:
More informationAnalysis of truncated data with application to the operational risk estimation
Analysis of truncated data with application to the operational risk estimation Petr Volf 1 Abstract. Researchers interested in the estimation of operational risk often face problems arising from the structure
More informationMATH 3200 Exam 3 Dr. Syring
. Suppose n eligible voters are polled (randomly sampled) from a population of size N. The poll asks voters whether they support or do not support increasing local taxes to fund public parks. Let M be
More informationHierarchical Generalized Linear Models. Measurement Incorporated Hierarchical Linear Models Workshop
Hierarchical Generalized Linear Models Measurement Incorporated Hierarchical Linear Models Workshop Hierarchical Generalized Linear Models So now we are moving on to the more advanced type topics. To begin
More informationMaximum Likelihood Estimates for Alpha and Beta With Zero SAIDI Days
Maximum Likelihood Estimates for Alpha and Beta With Zero SAIDI Days 1. Introduction Richard D. Christie Department of Electrical Engineering Box 35500 University of Washington Seattle, WA 98195-500 christie@ee.washington.edu
More informationModelling Returns: the CER and the CAPM
Modelling Returns: the CER and the CAPM Carlo Favero Favero () Modelling Returns: the CER and the CAPM 1 / 20 Econometric Modelling of Financial Returns Financial data are mostly observational data: they
More informationAssembly systems with non-exponential machines: Throughput and bottlenecks
Nonlinear Analysis 69 (2008) 911 917 www.elsevier.com/locate/na Assembly systems with non-exponential machines: Throughput and bottlenecks ShiNung Ching, Semyon M. Meerkov, Liang Zhang Department of Electrical
More informationFitting financial time series returns distributions: a mixture normality approach
Fitting financial time series returns distributions: a mixture normality approach Riccardo Bramante and Diego Zappa * Abstract Value at Risk has emerged as a useful tool to risk management. A relevant
More informationDiscussion of The Term Structure of Growth-at-Risk
Discussion of The Term Structure of Growth-at-Risk Frank Schorfheide University of Pennsylvania, CEPR, NBER, PIER March 2018 Pushing the Frontier of Central Bank s Macro Modeling Preliminaries This paper
More informationAdaptive Control Applied to Financial Market Data
Adaptive Control Applied to Financial Market Data J.Sindelar Charles University, Faculty of Mathematics and Physics and Institute of Information Theory and Automation, Academy of Sciences of the Czech
More informationThe Determinants of Bank Mergers: A Revealed Preference Analysis
The Determinants of Bank Mergers: A Revealed Preference Analysis Oktay Akkus Department of Economics University of Chicago Ali Hortacsu Department of Economics University of Chicago VERY Preliminary Draft:
More informationA MODIFIED MULTINOMIAL LOGIT MODEL OF ROUTE CHOICE FOR DRIVERS USING THE TRANSPORTATION INFORMATION SYSTEM
A MODIFIED MULTINOMIAL LOGIT MODEL OF ROUTE CHOICE FOR DRIVERS USING THE TRANSPORTATION INFORMATION SYSTEM Hing-Po Lo and Wendy S P Lam Department of Management Sciences City University of Hong ong EXTENDED
More informationChapter 3 Discrete Random Variables and Probability Distributions
Chapter 3 Discrete Random Variables and Probability Distributions Part 4: Special Discrete Random Variable Distributions Sections 3.7 & 3.8 Geometric, Negative Binomial, Hypergeometric NOTE: The discrete
More informationAssessment on Credit Risk of Real Estate Based on Logistic Regression Model
Assessment on Credit Risk of Real Estate Based on Logistic Regression Model Li Hongli 1, a, Song Liwei 2,b 1 Chongqing Engineering Polytechnic College, Chongqing400037, China 2 Division of Planning and
More informationStochastic Claims Reserving _ Methods in Insurance
Stochastic Claims Reserving _ Methods in Insurance and John Wiley & Sons, Ltd ! Contents Preface Acknowledgement, xiii r xi» J.. '..- 1 Introduction and Notation : :.... 1 1.1 Claims process.:.-.. : 1
More informationCalibration of Interest Rates
WDS'12 Proceedings of Contributed Papers, Part I, 25 30, 2012. ISBN 978-80-7378-224-5 MATFYZPRESS Calibration of Interest Rates J. Černý Charles University, Faculty of Mathematics and Physics, Prague,
More informationRandom Variables and Probability Distributions
Chapter 3 Random Variables and Probability Distributions Chapter Three Random Variables and Probability Distributions 3. Introduction An event is defined as the possible outcome of an experiment. In engineering
More informationIIntroduction the framework
Author: Frédéric Planchet / Marc Juillard/ Pierre-E. Thérond Extreme disturbances on the drift of anticipated mortality Application to annuity plans 2 IIntroduction the framework We consider now the global
More informationApplication of MCMC Algorithm in Interest Rate Modeling
Application of MCMC Algorithm in Interest Rate Modeling Xiaoxia Feng and Dejun Xie Abstract Interest rate modeling is a challenging but important problem in financial econometrics. This work is concerned
More informationStatistical Analysis of Life Insurance Policy Termination and Survivorship
Statistical Analysis of Life Insurance Policy Termination and Survivorship Emiliano A. Valdez, PhD, FSA Michigan State University joint work with J. Vadiveloo and U. Dias Sunway University, Malaysia Kuala
More informationBoosting Actuarial Regression Models in R
Carryl Oberson Faculty of Business and Economics University of Basel R in Insurance 2015 Build regression models (GLMs) for car insurance data. 3 types of response variables: claim incidence: y i = 0,
More informationWeek 1 Quantitative Analysis of Financial Markets Distributions B
Week 1 Quantitative Analysis of Financial Markets Distributions B Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October
More informationMultidimensional credibility: a Bayesian analysis. of policyholders holding multiple contracts
Multidimensional credibility: a Bayesian analysis of policyholders holding multiple contracts Katrien Antonio Montserrat Guillén Ana Maria Pérez Marín May 19, 211 Abstract Property and casualty actuaries
More informationAustralian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model
AENSI Journals Australian Journal of Basic and Applied Sciences Journal home page: wwwajbaswebcom Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model Khawla Mustafa Sadiq University
More informationEstimation of a parametric function associated with the lognormal distribution 1
Communications in Statistics Theory and Methods Estimation of a parametric function associated with the lognormal distribution Jiangtao Gou a,b and Ajit C. Tamhane c, a Department of Mathematics and Statistics,
More informationStatistics for Managers Using Microsoft Excel 7 th Edition
Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 5 Discrete Probability Distributions Statistics for Managers Using Microsoft Excel 7e Copyright 014 Pearson Education, Inc. Chap 5-1 Learning
More informationMachine Learning for Quantitative Finance
Machine Learning for Quantitative Finance Fast derivative pricing Sofie Reyners Joint work with Jan De Spiegeleer, Dilip Madan and Wim Schoutens Derivative pricing is time-consuming... Vanilla option pricing
More informationSubject CS2A Risk Modelling and Survival Analysis Core Principles
` Subject CS2A Risk Modelling and Survival Analysis Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who
More information1. For two independent lives now age 30 and 34, you are given:
Society of Actuaries Course 3 Exam Fall 2003 **BEGINNING OF EXAMINATION** 1. For two independent lives now age 30 and 34, you are given: x q x 30 0.1 31 0.2 32 0.3 33 0.4 34 0.5 35 0.6 36 0.7 37 0.8 Calculate
More informationWeek 1 Quantitative Analysis of Financial Markets Basic Statistics A
Week 1 Quantitative Analysis of Financial Markets Basic Statistics A Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October
More informationExtend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty
Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty George Photiou Lincoln College University of Oxford A dissertation submitted in partial fulfilment for
More informationOptimal Portfolio Inputs: Various Methods
Optimal Portfolio Inputs: Various Methods Prepared by Kevin Pei for The Fund @ Sprott Abstract: In this document, I will model and back test our portfolio with various proposed models. It goes without
More information2 Control variates. λe λti λe e λt i where R(t) = t Y 1 Y N(t) is the time from the last event to t. L t = e λr(t) e e λt(t) Exercises
96 ChapterVI. Variance Reduction Methods stochastic volatility ISExSoren5.9 Example.5 (compound poisson processes) Let X(t) = Y + + Y N(t) where {N(t)},Y, Y,... are independent, {N(t)} is Poisson(λ) with
More informationMultistage risk-averse asset allocation with transaction costs
Multistage risk-averse asset allocation with transaction costs 1 Introduction Václav Kozmík 1 Abstract. This paper deals with asset allocation problems formulated as multistage stochastic programming models.
More informationApproximating a life table by linear combinations of exponential distributions and valuing life-contingent options
Approximating a life table by linear combinations of exponential distributions and valuing life-contingent options Zhenhao Zhou Department of Statistics and Actuarial Science The University of Iowa Iowa
More informationChapter 3 Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2013 John Wiley & Sons, Inc.
1 3.1 Describing Variation Stem-and-Leaf Display Easy to find percentiles of the data; see page 69 2 Plot of Data in Time Order Marginal plot produced by MINITAB Also called a run chart 3 Histograms Useful
More informationComparative Study between Linear and Graphical Methods in Solving Optimization Problems
Comparative Study between Linear and Graphical Methods in Solving Optimization Problems Mona M Abd El-Kareem Abstract The main target of this paper is to establish a comparative study between the performance
More informationPackage MixedPoisson
Type Package Title Mixed Poisson Models Version 2.0 Date 2016-11-24 Package MixedPoisson December 9, 2016 Author Alicja Wolny-Dominiak and Maintainer Alicja Wolny-Dominiak
More informationEstimating Car Insurance Premia: a Case Study in High-Dimensional Data Inference
Estimating Car Insurance Premia: a Case Study in High-Dimensional Data Inference Nicolas Chapados, Yoshua Bengio, Pascal Vincent, Joumana Ghosn, Charles Dugas, Ichiro Takeuchi, Linyan Meng University of
More informationChapter 2 Managing a Portfolio of Risks
Chapter 2 Managing a Portfolio of Risks 2.1 Introduction Basic ideas concerning risk pooling and risk transfer, presented in Chap. 1, are progressed further in the present chapter, mainly with the following
More informationEcon 300: Quantitative Methods in Economics. 11th Class 10/19/09
Econ 300: Quantitative Methods in Economics 11th Class 10/19/09 Statistical thinking will one day be as necessary for efficient citizenship as the ability to read and write. --H.G. Wells discuss test [do
More informationM.Sc. ACTUARIAL SCIENCE. Term-End Examination
No. of Printed Pages : 15 LMJA-010 (F2F) M.Sc. ACTUARIAL SCIENCE Term-End Examination O CD December, 2011 MIA-010 (F2F) : STATISTICAL METHOD Time : 3 hours Maximum Marks : 100 SECTION - A Attempt any five
More informationQuantile Regression due to Skewness. and Outliers
Applied Mathematical Sciences, Vol. 5, 2011, no. 39, 1947-1951 Quantile Regression due to Skewness and Outliers Neda Jalali and Manoochehr Babanezhad Department of Statistics Faculty of Sciences Golestan
More informationSELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION
Vol. 6, No. 1, Summer 2017 2012 Published by JSES. SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN Fadel Hamid Hadi ALHUSSEINI a Abstract The main focus of the paper is modelling
More informationis the bandwidth and controls the level of smoothing of the estimator, n is the sample size and
Paper PH100 Relationship between Total charges and Reimbursements in Outpatient Visits Using SAS GLIMMIX Chakib Battioui, University of Louisville, Louisville, KY ABSTRACT The purpose of this paper is
More informationAn Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1
An Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1 Guillermo Magnou 23 January 2016 Abstract Traditional methods for financial risk measures adopts normal
More informationUPDATED IAA EDUCATION SYLLABUS
II. UPDATED IAA EDUCATION SYLLABUS A. Supporting Learning Areas 1. STATISTICS Aim: To enable students to apply core statistical techniques to actuarial applications in insurance, pensions and emerging
More informationINSTITUTE OF ACTUARIES OF INDIA
INSTITUTE OF ACTUARIES OF INDIA EXAMINATIONS 27 th October 2015 Subject CT3 Probability & Mathematical Statistics Time allowed: Three Hours (10.30 13.30 Hrs.) Total Marks: 100 INSTRUCTIONS TO THE CANDIDATES
More informationA Logistic Regression Based Auto Insurance Rate-Making Model Designed for the Insurance Rate Reform
International Journal of Financial Studies Article A Logistic Regression Based Auto Insurance Rate-Making Model Designed for Insurance Rate Reform Zhengmin Duan 1, *, Yonglian Chang 1, Qi Wang 1, Tianyao
More informationLasso and Ridge Quantile Regression using Cross Validation to Estimate Extreme Rainfall
Global Journal of Pure and Applied Mathematics. ISSN 0973-1768 Volume 12, Number 3 (2016), pp. 3305 3314 Research India Publications http://www.ripublication.com/gjpam.htm Lasso and Ridge Quantile Regression
More informationState Switching in US Equity Index Returns based on SETAR Model with Kalman Filter Tracking
State Switching in US Equity Index Returns based on SETAR Model with Kalman Filter Tracking Timothy Little, Xiao-Ping Zhang Dept. of Electrical and Computer Engineering Ryerson University 350 Victoria
More informationCommonly Used Distributions
Chapter 4: Commonly Used Distributions 1 Introduction Statistical inference involves drawing a sample from a population and analyzing the sample data to learn about the population. We often have some knowledge
More information4-1. Chapter 4. Commonly Used Distributions by The McGraw-Hill Companies, Inc. All rights reserved.
4-1 Chapter 4 Commonly Used Distributions 2014 by The Companies, Inc. All rights reserved. Section 4.1: The Bernoulli Distribution 4-2 We use the Bernoulli distribution when we have an experiment which
More informationIntroduction Recently the importance of modelling dependent insurance and reinsurance risks has attracted the attention of actuarial practitioners and
Asymptotic dependence of reinsurance aggregate claim amounts Mata, Ana J. KPMG One Canada Square London E4 5AG Tel: +44-207-694 2933 e-mail: ana.mata@kpmg.co.uk January 26, 200 Abstract In this paper we
More information[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright
Faculty and Institute of Actuaries Claims Reserving Manual v.2 (09/1997) Section D7 [D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright 1. Introduction
More informationLog-linear Modeling Under Generalized Inverse Sampling Scheme
Log-linear Modeling Under Generalized Inverse Sampling Scheme Soumi Lahiri (1) and Sunil Dhar (2) (1) Department of Mathematical Sciences New Jersey Institute of Technology University Heights, Newark,
More information2011 Pearson Education, Inc
Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial
More informationOn Effects of Asymmetric Information on Non-Life Insurance Prices under Competition
On Effects of Asymmetric Information on Non-Life Insurance Prices under Competition Albrecher Hansjörg Department of Actuarial Science, Faculty of Business and Economics, University of Lausanne, UNIL-Dorigny,
More informationThe Stochastic Approach for Estimating Technical Efficiency: The Case of the Greek Public Power Corporation ( )
The Stochastic Approach for Estimating Technical Efficiency: The Case of the Greek Public Power Corporation (1970-97) ATHENA BELEGRI-ROBOLI School of Applied Mathematics and Physics National Technical
More informationA Survey of the Effects of Liberalization of Iran Non-Life Insurance Market by Using the Experiences of WTO Member Countries
A Survey of the Effects of Liberalization of Iran Non-Life Insurance Market by Using the Experiences of WTO Member Countries Marufi Aghdam Jalal 1, Eshgarf Reza 2 Abstract Today, globalization is prevalent
More informationADVANCED OPERATIONAL RISK MODELLING IN BANKS AND INSURANCE COMPANIES
Small business banking and financing: a global perspective Cagliari, 25-26 May 2007 ADVANCED OPERATIONAL RISK MODELLING IN BANKS AND INSURANCE COMPANIES C. Angela, R. Bisignani, G. Masala, M. Micocci 1
More informationECS171: Machine Learning
ECS171: Machine Learning Lecture 15: Tree-based Algorithms Cho-Jui Hsieh UC Davis March 7, 2018 Outline Decision Tree Random Forest Gradient Boosted Decision Tree (GBDT) Decision Tree Each node checks
More informationModel 0: We start with a linear regression model: log Y t = β 0 + β 1 (t 1980) + ε, with ε N(0,
Stat 534: Fall 2017. Introduction to the BUGS language and rjags Installation: download and install JAGS. You will find the executables on Sourceforge. You must have JAGS installed prior to installing
More informationFuzzy Volatility Forecasts and Fuzzy Option Values
Class of Volatility Models Fuzzy Volatility Forecasts and Fuzzy Option Values K. Thiagarajah Illinois State University, Normal, Illinois. 41st Actuarial Research Conference Montreal, Canada August 10-12,
More informationDividend Strategies for Insurance risk models
1 Introduction Based on different objectives, various insurance risk models with adaptive polices have been proposed, such as dividend model, tax model, model with credibility premium, and so on. In this
More information