GLM III - The Matrix Reloaded

Size: px
Start display at page:

Download "GLM III - The Matrix Reloaded"

Transcription

1 GLM III - The Matrix Reloaded Duncan Anderson, Serhat Guven 12 March Towers Watson. All rights reserved.

2 Agenda "Quadrant Saddles" The Tweedie Distribution "Emergent Interactions" Dispersion Modeling Modelling sparse claim types Driver Averaging Model Validation Man (with GLM) vs machine 2

3 Interactions Policyholder Age

4 Interactions Policyholder Age

5 Why are interactions present? Because that's how the factors behave Because the multiplicative model can go wrong at the edges 1.5 * 1.4 * 1.7 * 1.5 * 1.8 * 1.5 * 1.8 = 26! 5

6 Interactions

7 Interactions

8 Interactions Vehicle group b b - b b b b b b b Vehicle group b b - b b b b b b b Vehicle group b b - b b b b b b b b b b Age b b b b b b b b b - b b b b b b b b b b - b b b b b b b b b b - b b b b b b b Age b b b - b b b b b b b b b b - b b b b b b b b b b - b b b b b b b b b b - b b b b b b b b b b - b b b b b b b b b b - b b b b b b b b b b Age b b b b b b b

9 Example

10 Example

11 Example

12 Saddles

13 Saddles

14 Saddles

15 Saddles

16 Saddles

17 Saddles

18 Saddles

19 Saddles

20 Saddles

21 Saddles

22 Saddles

23 23

24 Saddles - model comparison Motor frequency - out of sample 20% 400,000 18% 350,000 16% 300,000 14% 250,000 12% 200,000 10% 150,000 8% 100,000 6% 50,000 4% 50% 70% 90% 94% 98% 102% 106% 110% 130% 150% 0 Exposure Saddle Original

25 Saddles - model comparison Motor frequency - out of sample 20% 400,000 18% 350,000 16% 300,000 14% 250,000 12% 200,000 10% 150,000 8% 100,000 6% 50,000 4% 50% 70% 90% 94% 98% 102% 106% 110% 130% 150% 0 Exposure Observed Saddle Original

26 Saddles - model comparison Motor frequency - out of sample 2.6% % % % % % % % 1.0% % % < 80% 80% - 86% 83% - 89% 86% - 92% 89% - 95% 92% - 98% 95% - 101% 98% % 10 1% % 104 % % 10 7% - 113% 110 % - 116% 113% % 116% % 119 % % 12 2% % 125 % - 131% 128 % % > 130% 0 Exposure Observed Saddle Original

27 Saddles - model comparison Motor frequency - out of time < 80% 80% - 84% 82% - 86% 84% - 88% 86% - 90% 88% - 92% 90% - 94% 92% - 96% 94% - 98% 96% % 98% % 10 0% % 102 % % 10 4% % 106 % % 108 % % 110 % - 114% 112% % 114 % - 118% 116% % 118 % % > 120% 0 Exposure Observed Saddles Original

28 Agenda "Quadrant Saddles" The Tweedie Distribution "Emergent Interactions" Dispersion Modeling Modelling sparse claim types Driver Averaging Model Validation Man (with GLM) vs machine 28

29 Tweedie GLMs Consider the following empirical probability distribution function 29

30 Tweedie GLMs Raw pure premiums Incurred losses have a point mass at zero and then a continuous distribution Poisson and gamma not suited to this Tweedie distribution has point mass at zero a parameter which changes shape above zero f Y ( y; q, l, a) = å n= 1 1-a n {( lw) ka (-1/ y) } exp{ lwq [ y-k ( q )]} G( -na) n! y { ( q )} p( Y = 0) = exp - lwk a 0 0 a 0 fory > 0 30

31 Formulization of GLMs Generally accepted standards for link functions and error distribution Observed Response Most Appropriate Link Function Most Appropriate Error Structure Variance Function Normal µ 0 Claim Frequency Log Poisson µ 1 Claim Severity Log Gamma µ 2 Claim Severity Log Inverse Gaussian µ 3 Raw Pure Premium Log Tweedie µ T Retention Rate Logit Binomial µ (1-µ) Conversion Rate Logit Binomial µ(1- µ) 31

32 Formulization of GLMs More formally: Var(Y) = φv(μˆ) ω Variance Function Scale Parameter Prior Weights Tweedie s Variance function: V(m) = m p p=1 Poisson p=2 Gamma 1<p<2 Poisson/Gamma process Other concerns Need to estimate both j and p when fitting models Typically p»1.5 for incurred claims 32

33 Example 1 Vehicle age - frequency Exposure Model Prediction at Base levels Model Prediction +/- 2 Standard Errors 33

34 Example 1 Vehicle age - amounts Exposure Model Prediction at Base levels Model Prediction + 2 Standard Errors Model Prediction - 2 Standard Errors 34

35 Example 1 Vehicle age - pure premium Exposure Model Prediction +/- 2 Standard Errors Traditional 35

36 Example 1 Vehicle age - pure premium Exposure Tweedie Model Prediction +/- 2 Standard Errors Traditional 0 36

37 Example 2 Gender - frequency Male Female Exposure Model Prediction at Base levels Model Prediction +/- 2 Standard Errors 0 37

38 Example 2 Gender - frequency Male Female Exposure (1998) Exposure (1999) Policyholder Exposure (2000) Sex Exposure (2001) Exposure (2002) Loss Year (1998) Loss Year (1999) Loss Year (2000) Loss Year (2001) Loss Year (2002) 0 38

39 Example 2 Gender - amounts Male Female Exposure Model Prediction at Base levels Model Prediction +/- 2 Standard Errors 0 39

40 Example 2 Gender - amounts Male Female Exposure (1998) Exposure (1999) Policyholder Exposure Sex (2000) Exposure (2001) Exposure (2002) Loss Year (1998) Loss Year (1999) Loss Year (2000) Loss Year (2001) Loss Year (2002) 0 40

41 Example 2 Gender pure premium Male Female Model Prediction at Base levels Model Prediction + 2 Standard Errors Model Prediction - 2 Standard Errors Combined Tweedie 0 41

42 Tweedie GLMs Helpful when it's important to fit to incurred costs directly Similar results to frequency/severity traditional approach if frequency and amounts effects are clearly weak or clearly strong Distorted by large insignificant effects Removes understanding of what is driving results Smoothing harder 42

43 Agenda "Quadrant Saddles" The Tweedie Distribution "Emergent Interactions" Dispersion Modeling Modelling sparse claim types Driver Averaging Model Validation Man (with GLM) vs machine 43

44 Combining models Collision Frequency x Collision Severity Overall rates

45 Combining models BI Frequency x BI Severity + PD Frequency x PD Severity Overall rates + Collision Frequency x Collision Severity

46 Combining models PI Severity TP Frequency x PI Propensity x + PD Severity Overall rates Collision Frequency x Collision Severity Apply e.g. trends, case reserves adjustments, target loss ratio etc.

47 Combining models BI Frequency x BI Severity + PD Frequency x PD Severity Overall rates + Collision Frequency x Collision Severity Apply e.g. trends, case reserves adjustments, target loss ratio etc.

48 Combining models Take models Take relevant mix of business eg current in force policies For each record calculate expected frequencies and severities according to the models For each record, calculate expected total cost of claims "C" Fit a GLM to "C" using all available factors 48

49 Combining models PD Numbers PD Amounts BI Numbers BI Amounts Intercept 32% $ % $4860 Sex Male Female Area Town Country Policy Sex Area NUM1 AMT1 NUM2 AMT2 CC1 CC2 RISKPREM M T 32% % F T 24% % M C 40% 700 9% F C 30% 840 6%

50 Except Policy Sex Area NUM1 AMT1 NUM2 AMT2 CC1 CC2 RISKPREM M T 32% % F T 24% % M C 40% 700 9% F C 30% 840 6% The global risk premium is not multiplicative In the town, women have a modelled claim cost 29% lower than men /903.20=0.706 In the country, women have a modelled claim cost 27% lower than men /644.50=0.730

51 To solve Policy Sex Area NUM1 AMT1 NUM2 AMT2 CC1 CC2 RISKPREM M T 32% % F T 24% % M C 40% 700 9% F C 30% 840 6% We can capture this result exactly with an interaction Total risk premium Intercept $ Sex Male Female Area Town Country

52 Example "emergent" interaction

53 "Emergent" interactions In the above examples the interaction "emerged" from the risk premium step Emergent interactions are not risk insights, there is no subtle risk effect we have just discovered The different behaviour is by peril, and the rating factors are just bad proxies for the peril effects Emergent interactions are corrections to fix problems we have introduced Best solution is by peril pricing Reflects true behaviour Underlying models simple to understand and implement If not, check for emergent interactions in the risk premium 53

54 54

55 Agenda "Quadrant Saddles" The Tweedie Distribution "Emergent Interactions" Dispersion Modeling Modelling sparse claim types Driver Averaging Model Validation Man (with GLM) vs machine 55

56 >.2 >.5 >.7 > 1.4 > 2.2 > 2.9 > 3.6 > 4.3 > 5.0 > 5.7 > 6.5 > 7.2 > 7.9 > 8.6 > 9.3 > 10.1 > 10.8 > 11.5 > 12.2 > 12.9 > 13.6 > 14.4 > 15.1 > 15.8 > 16.5 > 17.2 > 17.9 > 18.7 > 19.4 > 20.1 > 20.8 > 21.5 > 22.3 > 23.0 > 23.7 > 24.4 > 25.1 > 25.8 > 26.6 > 27.3 Modeling the Insurance Risk ISSUE: Heterogeneous exposure bases Different policies within the same line can cover entirely different structures (i.e. commercial property) Goal of a predictive model Ideally would like to separate the heterogenous exposure bases Joint-modeling techniques and quasi-likelihood functions allow for analysis of heterogeneous environment without separation Studentized Standardized Deviance Residuals , ,000 2,00 0 3,000 4,00 0 5,000 6,0 00 7,0 00 8,000 9,000 Fitte d Value Two concentrations suggests two perils: split or use joint modeling 56

57 Heterogeneous Exposure Bases If possible should be modeled separately If model together, exposures with high variability may mask patterns of less random risks If loss trends vary by exposure class, the proportion each represents of the total will change and may mask important trends Independent predictors can have different effects on different perils If cannot, use joint modeling techniques to improve overall fit 57

58 Generalized Linear Models Formulation of deviance logarithm of a ratio of likelihoods Where: D a Act = f ( j) Y æ Act ö = ln ç Exp è ø ~ 2 ~ ( y; θ, j) ' E( Y) = y = b' ( q) Exp= f Y ( y;θˆ, j) ' E( Y) = m ˆ = b' ( qˆ ) Then: D a ( j) æ ç f = ln è f Y Y ~ (y; θ, j) ö (y;θˆ, j) ø 2 é ~ yθ - yθˆ -b = 2 ê ë a ( j) ( θ ~ ) + b( θˆ ) ù ú û 58

59 Generalized Linear Models Analyzing the scale parameter When modeling homogeneous data j a ( j) = Þ j = ω D dof Heterogeneous data requires a more rigorous definition of the scale function Scale parameter could vary in a systematic way with other predictors Construct and fit formal models for the dependence of both the mean and the scale 59

60 Dispersion Model Form Double generalized linear models Response model Y~f Y E(Y) Var(Y) ( y;θ; j) = b' ( θ) jb'' ( θ) = ω Dispersion model D~f D E(D) ( d; x, t) = b' ( x) Var(D) = tb'' ω ( x) Where d = ( Y-m) V( m) 2 60

61 Dispersion Model Form Dispersion adjustments Pearson residual has excess variability (deviance residual has bias) Distribution Adjustment Normal 0 Poisson f/(2m) Gamma 3f Parameter in the adjustment term is the scale parameter from the original response model 61

62 Dispersion Model Results Dispersion model is integrated with original response model Initial Response Model Dispersion Model Final Response Model Response Loss / Exposure Squared Pearson Residual Loss / Exposure Weight Exposure Exposure/ (Exposure + Adjustment) Exposure/ Squared Pearson Residual 62

63 Agenda "Quadrant Saddles" The Tweedie Distribution "Emergent Interactions" Dispersion Modeling Modelling sparse claim types Driver Averaging Model Validation Man (with GLM) vs machine 63

64 Amplification of the BI signal using PD experience Fit straight to BI Use PD model as a guide in free fitting BI More Data Use PD model structure Offset PD relativities onto BI data as starting point BI/PD proportion model: BI frequency = BI/PD proportion * PD frequency Less Data 64

65 Reference models BI Frequency x BI Severity PD Frequency x PD Severity BI Severity TP Frequency x BI Propensity x PD Severity

66 Reference models A B C D E

67 Reference models A B C D E

68 Reference models A B C D E

69 Reference models A B C D E

70 Reference models A B C D E

71 Reference models A B C D E

72 Reference models A B C D E

73 Reference models E[Y i ] = m i = g -1 (SX ij.b j +x i ) Offset term When modeling BI set PD fitted values to be offset term GLM will seek effects over and above assumed PD effect

74 Experiment (1) GLM on BI claims on all the data - the "correct" answer Real large company 10% random sample Pretend small company (2) Traditional GLM on BI claims on the "small company" (3) Propensity reference model on BI claims cf PD claims

75 Example result , , , , , , , , , , Vehicle Group Exposure "Correct" Traditional Reference 0

76 Example result 200, , , , , , ,000 60, ,000 20, Provisional Licence Age Exposure "Correct" Traditional Reference 0

77 Agenda "Quadrant Saddles" The Tweedie Distribution "Emergent Interactions" Dispersion Modeling Modelling sparse claim types Driver Averaging Model Validation Man (with GLM) vs machine 77

78 Household Averaging Historically companies assigned operators to vehicles for the purpose of rating More recently driver averaging strategies are deployed to capture household Straight vs. geometric average Weighted average Modified Average/assignment hybrid Modeling data needs to mimic the transaction 78

79 Model Design In all modeling projects, it is imperative that the data set up mimic the rating Consider the following example Vehicle Operator Vehicle Rate V1 Dad $500 V2 Mom $450 Operator Class Factor Dad 0.80 Mom 0.85 Junior 2.80 Assume Mom had a $1000 claim in Dad s car 79

80 Assignment Actual assignment methodology each record represents a single vehicle with one assigned operator Veh Op Sym MYR Age Sex Type Yths Drvrs Vehs Exp Clm Losses Prem V1 Junior M OO ,000 1, V2 Mom F PO Operator characteristics based on assigned operator Vehicle characteristics based on vehicle Policy characteristics catch other drivers Losses assigned to vehicle 80

81 Straight Average Straight average methodology: h Which can be deconstructed: h h h

82 Straight Average Straight average methodology each record represents a single vehicle and operator combination Veh Op Sym MYR Age Sex Yths Drvrs Vehs Exp Clm Losses Prem V1 Dad M / V1 Mom F /3 1 1, V1 Junior M / V2 Dad M / V2 Mom F / V2 Junior M / Policy characteristics are same, but less predictive Exposure split amongst the vehicle Losses assigned to vehicle/operator combination iid is a major concern What about Comprehensive? 82

83 Geometric Average Geometric average methodology: h / No direct decomposition 83

84 Geometric Average Geometric methodology each record represents a single vehicle Veh Sym MYR # of Dads # of Moms # of Juniors Exp Clm Losses Prem V /3 1/3 1/ , V /3 1/3 1/ Policy characteristics are same, but less predictive Predictors are translated to counts Losses assigned to vehicle More challenging to add operator interactions or variates 84

85 Weighted Average Weighted average methodology for a straight average approach Veh Op Sym MYR Age Sex Type Yths Drvrs Vehs Exp Clm Losses Prem V1 Dad M PO / V1 Mom F OC /3 1 1, V1 Junior M OC / V2 Dad M OC / V2 Mom F PO / V2 Junior M OC / Creates a relationship between the vehicle and the operator Uses the model to determine the weights More accurate as it requires more information h

86 Agenda "Quadrant Saddles" The Tweedie Distribution "Emergent Interactions" Dispersion Modeling Modelling sparse claim types Driver Averaging Model Validation Man (with GLM) vs machine 86

87 Validate Models Holdout samples Holdout samples are effective at validating model Determine estimates based on part of data set Uses estimates to predict other part of data set Full Test/Training for Large Data Sets Partial Test/Training for Smaller Data Sets Train Data Build Models All Data Build Models Data Split Data Data Train Data Refit Parameters Test Data Compare Predictions to Actuals Split Data Test Data Compare Predictions to Actuals Predictions should be close to actuals for heavily populated cells 87

88 Validate Models Gains curves Compare predictiveness of models 88

89 Validate Models Lift curves Compare predictiveness of models Model Validation Model Validation , , , , ,000 Data ,000 Data 20,000 20,000 We i g h te d a vera g e ,000 16,000 14,000 12,000 10,000 Weights Current model Weights We i g h te d a vera g e ,000 16,000 14,000 12,000 10,000 Weights Current model Weights 8,000 8, ,000 6, , , , , > E-02, <= E-02 > E-02, <= E-02 Absolute value: Current model > E-02, <= E-02 > E-02, <= E-02 Absolute value: Current model 0 More intuitive but difficult to assess performance 89

90 Validate Models X-Graphs 90

91 Validate Models Residual analysis Recheck residuals to ensure appropriate shape 10 Studentized Standardized Deviance Residuals by Policyholder Age Is the contour plot symmetric? lt Does the Box-Whisker show symmetry across levels?

92 Agenda "Quadrant Saddles" The Tweedie Distribution "Emergent Interactions" Dispersion Modeling Modelling sparse claim types Driver Averaging Model Validation Man (with GLM) vs machine 92

93 Machine vs man vs

94 Machine vs man vs

95 Machine vs man vs

96 Machine vs man What is the underlying process?

97 Machine vs man Underwriting Claims 1.25 Cover Level Fraud Likelihood <= 4.5 > 4.5, <= 5 > 5, <= 5.5 > 5.5, <= 6 > 6, <= 6.5 > 6.5, <= 7 > 7, <= 7.5 > 7. 5, <= 8 > 8, <= 8.5 > 8. 5, <= 9 > 9, <= 9.5 > New Business Historic New Business Recent Relativity New Business Historic New Business Recent Relativity

98 Machine vs man What are the underlying drivers?

99 Drivers of elasticity Affordability Need Elasticity Alternatives Brand Affinity Shopping Preferences

100 Machine vs man I reckon that lots of recent bodily injury changes are down to new types of claims

101 Claims management companies 101

102 BI models - "insurance" and "compensation" risk 1.20 Modelled Relativities - Accident Year Exposure Compensation Risk Insurance Risk 0

103 1.20 Modelled Relativities - ABI20 Vehicle Group Proportion TPD with BI - Rated Driver Age Exposure Compensation Risk Insurance Risk Proportion TPD with BI - Car Age Exposure Compensation Risk Insurance Risk Exposure Compensation Risk Insurance Risk Unknown 0

104 GLM III - The Matrix Reloaded Duncan Anderson, Serhat Guven 2012 Towers Watson. All rights reserved.

HOUSEHOLD AVERAGING. CAS Annual Meeting Alice Gannon November 2007

HOUSEHOLD AVERAGING. CAS Annual Meeting Alice Gannon November 2007 HOUSEHOLD AVERAGING CAS Annual Meeting 2007 Alice Gannon November 2007 Household Averaging PURPOSE: To discuss household averaging and how it is used to reflect the different relationships between operators

More information

Generalized Linear Models II: Applying GLMs in Practice Duncan Anderson MA FIA Watson Wyatt LLP W W W. W A T S O N W Y A T T.

Generalized Linear Models II: Applying GLMs in Practice Duncan Anderson MA FIA Watson Wyatt LLP W W W. W A T S O N W Y A T T. Generalized Linear Models II: Applying GLMs in Practice Duncan Anderson MA FIA Watson Wyatt LLP W W W. W A T S O N W Y A T T. C O M Agenda Introduction / recap Model forms and model validation Aliasing

More information

Predictive Modeling GLM and Price Elasticity Model. David Dou October 8 th, 2014

Predictive Modeling GLM and Price Elasticity Model. David Dou October 8 th, 2014 Predictive Modeling GLM and Price Elasticity Model David Dou October 8 th, 2014 History of Predictive Modeling Pre-Computer Era: Triangles on a giant spreadsheet PC Era: Microsoft Excel oneway relativities

More information

And The Winner Is? How to Pick a Better Model

And The Winner Is? How to Pick a Better Model And The Winner Is? How to Pick a Better Model Part 2 Goodness-of-Fit and Internal Stability Dan Tevet, FCAS, MAAA Goodness-of-Fit Trying to answer question: How well does our model fit the data? Can be

More information

Loss Cost Modeling vs. Frequency and Severity Modeling

Loss Cost Modeling vs. Frequency and Severity Modeling Loss Cost Modeling vs. Frequency and Severity Modeling 2013 CAS Ratemaking and Product Management Seminar March 13, 2013 Huntington Beach, CA Jun Yan Deloitte Consulting LLP Antitrust Notice The Casualty

More information

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is: **BEGINNING OF EXAMINATION** 1. You are given: (i) A random sample of five observations from a population is: 0.2 0.7 0.9 1.1 1.3 (ii) You use the Kolmogorov-Smirnov test for testing the null hypothesis,

More information

Session 5. Predictive Modeling in Life Insurance

Session 5. Predictive Modeling in Life Insurance SOA Predictive Analytics Seminar Hong Kong 29 Aug. 2018 Hong Kong Session 5 Predictive Modeling in Life Insurance Jingyi Zhang, Ph.D Predictive Modeling in Life Insurance JINGYI ZHANG PhD Scientist Global

More information

Negative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction

Negative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction Negative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction Statistics 149 Spring 2006 Copyright 2006 by Mark E. Irwin Negative Binomial Family Example: Absenteeism from

More information

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting Quantile Regression By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting Agenda Overview of Predictive Modeling for P&C Applications Quantile

More information

Session 5. A brief introduction to Predictive Modeling

Session 5. A brief introduction to Predictive Modeling SOA Predictive Analytics Seminar Malaysia 27 Aug. 2018 Kuala Lumpur, Malaysia Session 5 A brief introduction to Predictive Modeling Lichen Bao, Ph.D A Brief Introduction to Predictive Modeling LICHEN BAO

More information

Modeling. joint work with Jed Frees, U of Wisconsin - Madison. Travelers PASG (Predictive Analytics Study Group) Seminar Tuesday, 12 April 2016

Modeling. joint work with Jed Frees, U of Wisconsin - Madison. Travelers PASG (Predictive Analytics Study Group) Seminar Tuesday, 12 April 2016 joint work with Jed Frees, U of Wisconsin - Madison Travelers PASG (Predictive Analytics Study Group) Seminar Tuesday, 12 April 2016 claim Department of Mathematics University of Connecticut Storrs, Connecticut

More information

Session 178 TS, Stats for Health Actuaries. Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA. Presenter: Joan C. Barrett, FSA, MAAA

Session 178 TS, Stats for Health Actuaries. Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA. Presenter: Joan C. Barrett, FSA, MAAA Session 178 TS, Stats for Health Actuaries Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA Presenter: Joan C. Barrett, FSA, MAAA Session 178 Statistics for Health Actuaries October 14, 2015 Presented

More information

Agenda. Current method disadvantages GLM background and advantages Study case analysis Applications. Actuaries Club of the Southwest

Agenda. Current method disadvantages GLM background and advantages Study case analysis Applications. Actuaries Club of the Southwest watsonwyatt.com Actuaries Club of the Southwest Generalized Linear Modeling for Life Insurers Jean-Felix Huet, FSA November 2, 29 Agenda Current method disadvantages GLM background and advantages Study

More information

Contents Utility theory and insurance The individual risk model Collective risk models

Contents Utility theory and insurance The individual risk model Collective risk models Contents There are 10 11 stars in the galaxy. That used to be a huge number. But it s only a hundred billion. It s less than the national deficit! We used to call them astronomical numbers. Now we should

More information

Actuarial Research on the Effectiveness of Collision Avoidance Systems FCW & LDW. A translation from Hebrew to English of a research paper prepared by

Actuarial Research on the Effectiveness of Collision Avoidance Systems FCW & LDW. A translation from Hebrew to English of a research paper prepared by Actuarial Research on the Effectiveness of Collision Avoidance Systems FCW & LDW A translation from Hebrew to English of a research paper prepared by Ron Actuarial Intelligence LTD Contact Details: Shachar

More information

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018 ` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.

More information

Keywords Akiake Information criterion, Automobile, Bonus-Malus, Exponential family, Linear regression, Residuals, Scaled deviance. I.

Keywords Akiake Information criterion, Automobile, Bonus-Malus, Exponential family, Linear regression, Residuals, Scaled deviance. I. Application of the Generalized Linear Models in Actuarial Framework BY MURWAN H. M. A. SIDDIG School of Mathematics, Faculty of Engineering Physical Science, The University of Manchester, Oxford Road,

More information

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION INSTITUTE AND FACULTY OF ACTUARIES Curriculum 2019 SPECIMEN EXAMINATION Subject CS1A Actuarial Statistics Time allowed: Three hours and fifteen minutes INSTRUCTIONS TO THE CANDIDATE 1. Enter all the candidate

More information

Predictive modelling around the world Peter Banthorpe, RGA Kevin Manning, Milliman

Predictive modelling around the world Peter Banthorpe, RGA Kevin Manning, Milliman Predictive modelling around the world Peter Banthorpe, RGA Kevin Manning, Milliman 11 November 2013 Agenda Introduction to predictive analytics Applications overview Case studies Conclusions and Q&A Introduction

More information

Homework Problems Stat 479

Homework Problems Stat 479 Chapter 2 1. Model 1 is a uniform distribution from 0 to 100. Determine the table entries for a generalized uniform distribution covering the range from a to b where a < b. 2. Let X be a discrete random

More information

A Comprehensive, Non-Aggregated, Stochastic Approach to. Loss Development

A Comprehensive, Non-Aggregated, Stochastic Approach to. Loss Development A Comprehensive, Non-Aggregated, Stochastic Approach to Loss Development By Uri Korn Abstract In this paper, we present a stochastic loss development approach that models all the core components of the

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models Generalized Linear Models - IIIb Henrik Madsen March 18, 2012 Henrik Madsen () Chapman & Hall March 18, 2012 1 / 32 Examples Overdispersion and Offset!

More information

Application of statistical methods in the determination of health loss distribution and health claims behaviour

Application of statistical methods in the determination of health loss distribution and health claims behaviour Mathematical Statistics Stockholm University Application of statistical methods in the determination of health loss distribution and health claims behaviour Vasileios Keisoglou Examensarbete 2005:8 Postal

More information

Loss Simulation Model Testing and Enhancement

Loss Simulation Model Testing and Enhancement Loss Simulation Model Testing and Enhancement Casualty Loss Reserve Seminar By Kailan Shang Sept. 2011 Agenda Research Overview Model Testing Real Data Model Enhancement Further Development Enterprise

More information

Actuarial Society of India EXAMINATIONS

Actuarial Society of India EXAMINATIONS Actuarial Society of India EXAMINATIONS 7 th June 005 Subject CT6 Statistical Models Time allowed: Three Hours (0.30 am 3.30 pm) INSTRUCTIONS TO THE CANDIDATES. Do not write your name anywhere on the answer

More information

From Double Chain Ladder To Double GLM

From Double Chain Ladder To Double GLM University of Amsterdam MSc Stochastics and Financial Mathematics Master Thesis From Double Chain Ladder To Double GLM Author: Robert T. Steur Examiner: dr. A.J. Bert van Es Supervisors: drs. N.R. Valkenburg

More information

Institute of Actuaries of India. March 2018 Examination

Institute of Actuaries of India. March 2018 Examination Institute of Actuaries of India Subject ST8 General Insurance: Pricing March 2018 Examination INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the aim of

More information

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage 6 Point Estimation Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage Point Estimation Statistical inference: directed toward conclusions about one or more parameters. We will use the generic

More information

STA 4504/5503 Sample questions for exam True-False questions.

STA 4504/5503 Sample questions for exam True-False questions. STA 4504/5503 Sample questions for exam 2 1. True-False questions. (a) For General Social Survey data on Y = political ideology (categories liberal, moderate, conservative), X 1 = gender (1 = female, 0

More information

1. You are given the following information about a stationary AR(2) model:

1. You are given the following information about a stationary AR(2) model: Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4

More information

Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State University Tempe, AZ

Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State University Tempe, AZ Process Monitoring for Correlated Gamma Distributed Data Using Generalized Linear Model Based Control Charts Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State

More information

Estimating Market Power in Differentiated Product Markets

Estimating Market Power in Differentiated Product Markets Estimating Market Power in Differentiated Product Markets Metin Cakir Purdue University December 6, 2010 Metin Cakir (Purdue) Market Equilibrium Models December 6, 2010 1 / 28 Outline Outline Estimating

More information

Logit Models for Binary Data

Logit Models for Binary Data Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis These models are appropriate when the response

More information

A Comprehensive, Non-Aggregated, Stochastic Approach to Loss Development

A Comprehensive, Non-Aggregated, Stochastic Approach to Loss Development A Comprehensive, Non-Aggregated, Stochastic Approach to Loss Development by Uri Korn ABSTRACT In this paper, we present a stochastic loss development approach that models all the core components of the

More information

Cambridge University Press Risk Modelling in General Insurance: From Principles to Practice Roger J. Gray and Susan M.

Cambridge University Press Risk Modelling in General Insurance: From Principles to Practice Roger J. Gray and Susan M. adjustment coefficient, 272 and Cramér Lundberg approximation, 302 existence, 279 and Lundberg s inequality, 272 numerical methods for, 303 properties, 272 and reinsurance (case study), 348 statistical

More information

Continuous Probability Distributions & Normal Distribution

Continuous Probability Distributions & Normal Distribution Mathematical Methods Units 3/4 Student Learning Plan Continuous Probability Distributions & Normal Distribution 7 lessons Notes: Students need practice in recognising whether a problem involves a discrete

More information

The Matrix Inverted A Primer in GLM Theory and Practical Issues. March 11-12, 2004 CAS Ratemaking Seminar Roosevelt Mosley, FCAS, MAAA

The Matrix Inverted A Primer in GLM Theory and Practical Issues. March 11-12, 2004 CAS Ratemaking Seminar Roosevelt Mosley, FCAS, MAAA The Matrix Inverted A Primer in GLM Theory and Practical Issues March 1112, 2004 CAS Ratemaking Seminar Roosevelt Mosley, FCAS, MAAA Practical Issues Data Analysis Implementation Data Data Topics How much?

More information

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5] 1 High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5] High-frequency data have some unique characteristics that do not appear in lower frequencies. At this class we have: Nonsynchronous

More information

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions ELE 525: Random Processes in Information Systems Hisashi Kobayashi Department of Electrical Engineering

More information

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley Outline: 1) Review of Variation & Error 2) Binomial Distributions 3) The Normal Distribution 4) Defining the Mean of a population Goals:

More information

the display, exploration and transformation of the data are demonstrated and biases typically encountered are highlighted.

the display, exploration and transformation of the data are demonstrated and biases typically encountered are highlighted. 1 Insurance data Generalized linear modeling is a methodology for modeling relationships between variables. It generalizes the classical normal linear model, by relaxing some of its restrictive assumptions,

More information

Lecture 2. Probability Distributions Theophanis Tsandilas

Lecture 2. Probability Distributions Theophanis Tsandilas Lecture 2 Probability Distributions Theophanis Tsandilas Comment on measures of dispersion Why do common measures of dispersion (variance and standard deviation) use sums of squares: nx (x i ˆµ) 2 i=1

More information

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology Antitrust Notice The Casualty Actuarial Society is committed to adhering strictly to the letter and spirit of the antitrust laws. Seminars conducted under the auspices of the CAS are designed solely to

More information

Assessing volatility and credibility of experience a comparison of approaches

Assessing volatility and credibility of experience a comparison of approaches Assessing volatility and credibility of experience a comparison of approaches, FSA, MAAA Swiss Re Life & Health America Inc. Agenda Volatility Its definition Its importance How to measure it Credibility

More information

Use of GLMs in a competitive market. Ji Yao and Simon Yeung, Advanced Pricing Techniques (APT) GIRO Working Party

Use of GLMs in a competitive market. Ji Yao and Simon Yeung, Advanced Pricing Techniques (APT) GIRO Working Party Use of GLMs in a competitive market Ji Yao and Simon Yeung, Advanced Pricing Techniques (APT) GIRO Working Party 12 June 2013 About the presenters Dr. Ji Yao is a manager with Ernst & Young s EMEIA insurance

More information

SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS

SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS SOCIETY OF ACTUARIES EXAM STAM SHORT-TERM ACTUARIAL MATHEMATICS EXAM STAM SAMPLE QUESTIONS Questions 1-307 have been taken from the previous set of Exam C sample questions. Questions no longer relevant

More information

2009 Actuarial Research Conference The Role of Research at ISO. Glenn Meyers ISO Innovative Analytics

2009 Actuarial Research Conference The Role of Research at ISO. Glenn Meyers ISO Innovative Analytics 2009 Actuarial Research Conference The Role of Research at ISO Glenn Meyers ISO Innovative Analytics 14 Ph.D. s in the fields of Statistics, Mathematics, Economics, Finance, Machine Learning, Engineering

More information

Intro to GLM Day 2: GLM and Maximum Likelihood

Intro to GLM Day 2: GLM and Maximum Likelihood Intro to GLM Day 2: GLM and Maximum Likelihood Federico Vegetti Central European University ECPR Summer School in Methods and Techniques 1 / 32 Generalized Linear Modeling 3 steps of GLM 1. Specify the

More information

Lecture 6: Non Normal Distributions

Lecture 6: Non Normal Distributions Lecture 6: Non Normal Distributions and their Uses in GARCH Modelling Prof. Massimo Guidolin 20192 Financial Econometrics Spring 2015 Overview Non-normalities in (standardized) residuals from asset return

More information

Therefore, statistical modelling tools are required which make thorough space-time analyses of insurance regression data possible and allow to explore

Therefore, statistical modelling tools are required which make thorough space-time analyses of insurance regression data possible and allow to explore Bayesian space time analysis of health insurance data Stefan Lang, Petra Kragler, Gerhard Haybach and Ludwig Fahrmeir University of Munich, Ludwigstr. 33, 80539 Munich email: lang@stat.uni-muenchen.de

More information

Practice Exam 1. Loss Amount Number of Losses

Practice Exam 1. Loss Amount Number of Losses Practice Exam 1 1. You are given the following data on loss sizes: An ogive is used as a model for loss sizes. Determine the fitted median. Loss Amount Number of Losses 0 1000 5 1000 5000 4 5000 10000

More information

A MODIFIED MULTINOMIAL LOGIT MODEL OF ROUTE CHOICE FOR DRIVERS USING THE TRANSPORTATION INFORMATION SYSTEM

A MODIFIED MULTINOMIAL LOGIT MODEL OF ROUTE CHOICE FOR DRIVERS USING THE TRANSPORTATION INFORMATION SYSTEM A MODIFIED MULTINOMIAL LOGIT MODEL OF ROUTE CHOICE FOR DRIVERS USING THE TRANSPORTATION INFORMATION SYSTEM Hing-Po Lo and Wendy S P Lam Department of Management Sciences City University of Hong ong EXTENDED

More information

Chapter 6 Simple Correlation and

Chapter 6 Simple Correlation and Contents Chapter 1 Introduction to Statistics Meaning of Statistics... 1 Definition of Statistics... 2 Importance and Scope of Statistics... 2 Application of Statistics... 3 Characteristics of Statistics...

More information

Modelling Bank Loan LGD of Corporate and SME Segment

Modelling Bank Loan LGD of Corporate and SME Segment 15 th Computing in Economics and Finance, Sydney, Australia Modelling Bank Loan LGD of Corporate and SME Segment Radovan Chalupka, Juraj Kopecsni Charles University, Prague 1. introduction 2. key issues

More information

CS340 Machine learning Bayesian model selection

CS340 Machine learning Bayesian model selection CS340 Machine learning Bayesian model selection Bayesian model selection Suppose we have several models, each with potentially different numbers of parameters. Example: M0 = constant, M1 = straight line,

More information

arxiv: v1 [q-fin.rm] 13 Dec 2016

arxiv: v1 [q-fin.rm] 13 Dec 2016 arxiv:1612.04126v1 [q-fin.rm] 13 Dec 2016 The hierarchical generalized linear model and the bootstrap estimator of the error of prediction of loss reserves in a non-life insurance company Alicja Wolny-Dominiak

More information

REPORT ON ANALYSIS OF MEDICAL SCHEMES CLAIMS DATA- INITIAL COST ATTRIBUTION ANALYSIS VERSION 2: 8 DECEMBER 2017

REPORT ON ANALYSIS OF MEDICAL SCHEMES CLAIMS DATA- INITIAL COST ATTRIBUTION ANALYSIS VERSION 2: 8 DECEMBER 2017 REPORT ON ANALYSIS OF MEDICAL SCHEMES CLAIMS DATA- INITIAL COST ATTRIBUTION ANALYSIS VERSION 2: 8 DECEMBER 2017 DISCLAIMER The Competition Commission Health Market Inquiry (HMI), through an open tender,

More information

Bivariate Birnbaum-Saunders Distribution

Bivariate Birnbaum-Saunders Distribution Department of Mathematics & Statistics Indian Institute of Technology Kanpur January 2nd. 2013 Outline 1 Collaborators 2 3 Birnbaum-Saunders Distribution: Introduction & Properties 4 5 Outline 1 Collaborators

More information

Institute of Actuaries of India Subject CT6 Statistical Methods

Institute of Actuaries of India Subject CT6 Statistical Methods Institute of Actuaries of India Subject CT6 Statistical Methods For 2014 Examinations Aim The aim of the Statistical Methods subject is to provide a further grounding in mathematical and statistical techniques

More information

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized

More information

Homework Problems Stat 479

Homework Problems Stat 479 Chapter 10 91. * A random sample, X1, X2,, Xn, is drawn from a distribution with a mean of 2/3 and a variance of 1/18. ˆ = (X1 + X2 + + Xn)/(n-1) is the estimator of the distribution mean θ. Find MSE(

More information

Clark. Outside of a few technical sections, this is a very process-oriented paper. Practice problems are key!

Clark. Outside of a few technical sections, this is a very process-oriented paper. Practice problems are key! Opening Thoughts Outside of a few technical sections, this is a very process-oriented paper. Practice problems are key! Outline I. Introduction Objectives in creating a formal model of loss reserving:

More information

M249 Diagnostic Quiz

M249 Diagnostic Quiz THE OPEN UNIVERSITY Faculty of Mathematics and Computing M249 Diagnostic Quiz Prepared by the Course Team [Press to begin] c 2005, 2006 The Open University Last Revision Date: May 19, 2006 Version 4.2

More information

CHAPTER 2 Describing Data: Numerical

CHAPTER 2 Describing Data: Numerical CHAPTER Multiple-Choice Questions 1. A scatter plot can illustrate all of the following except: A) the median of each of the two variables B) the range of each of the two variables C) an indication of

More information

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc.

Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Chapter 8 Measures of Center Data that can be any numerical value are called continuous. These are usually things that are measured, such as height, length, time, speed, etc. Data that can only be integer

More information

Measuring Policyholder Behavior in Variable Annuity Contracts

Measuring Policyholder Behavior in Variable Annuity Contracts Insights September 2010 Measuring Policyholder Behavior in Variable Annuity Contracts Is Predictive Modeling the Answer? by David J. Weinsier and Guillaume Briere-Giroux Life insurers that write variable

More information

Chapter 5. Statistical inference for Parametric Models

Chapter 5. Statistical inference for Parametric Models Chapter 5. Statistical inference for Parametric Models Outline Overview Parameter estimation Method of moments How good are method of moments estimates? Interval estimation Statistical Inference for Parametric

More information

Advanced Risk Management Use of Predictive Modeling in Underwriting and Pricing

Advanced Risk Management Use of Predictive Modeling in Underwriting and Pricing Advanced Risk Management Use of Predictive Modeling in Underwriting and Pricing By Saikat Maitra & Debashish Banerjee Abstract In this paper, the authors describe data mining and predictive modeling techniques

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

Telematics and the natural evolution of pricing in motor insurance

Telematics and the natural evolution of pricing in motor insurance Telematics and the natural evolution of pricing in motor insurance Montserrat Guillén University of Barcelona www.ub.edu/riskcenter Workshop on data sciences applied to insurance and finance Louvain-la-Neuve,

More information

Study Guide on Risk Margins for Unpaid Claims for SOA Exam GIADV G. Stolyarov II

Study Guide on Risk Margins for Unpaid Claims for SOA Exam GIADV G. Stolyarov II Study Guide on Risk Margins for Unpaid Claims for the Society of Actuaries (SOA) Exam GIADV: Advanced Topics in General Insurance (Based on the Paper "A Framework for Assessing Risk Margins" by Karl Marshall,

More information

Continuous Probability Distributions

Continuous Probability Distributions 8.1 Continuous Probability Distributions Distributions like the binomial probability distribution and the hypergeometric distribution deal with discrete data. The possible values of the random variable

More information

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali Part I Descriptive Statistics 1 Introduction and Framework... 3 1.1 Population, Sample, and Observations... 3 1.2 Variables.... 4 1.2.1 Qualitative and Quantitative Variables.... 5 1.2.2 Discrete and Continuous

More information

UPDATED IAA EDUCATION SYLLABUS

UPDATED IAA EDUCATION SYLLABUS II. UPDATED IAA EDUCATION SYLLABUS A. Supporting Learning Areas 1. STATISTICS Aim: To enable students to apply core statistical techniques to actuarial applications in insurance, pensions and emerging

More information

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative

STAT:2010 Statistical Methods and Computing. Using density curves to describe the distribution of values of a quantitative STAT:10 Statistical Methods and Computing Normal Distributions Lecture 4 Feb. 6, 17 Kate Cowles 374 SH, 335-0727 kate-cowles@uiowa.edu 1 2 Using density curves to describe the distribution of values of

More information

This homework assignment uses the material on pages ( A moving average ).

This homework assignment uses the material on pages ( A moving average ). Module 2: Time series concepts HW Homework assignment: equally weighted moving average This homework assignment uses the material on pages 14-15 ( A moving average ). 2 Let Y t = 1/5 ( t + t-1 + t-2 +

More information

Financial Market Analysis (FMAx) Module 4

Financial Market Analysis (FMAx) Module 4 Financial Market Analsis (FMAx) Module 4 erm Structure of Interest Rates his training material is the propert of the International Monetar Fund (IMF) and is intended for use in IMF Institute for Capacit

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #6 EPSY 905: Maximum Likelihood In This Lecture The basics of maximum likelihood estimation Ø The engine that

More information

How Advanced Pricing Analysis Can Support Underwriting by Claudine Modlin, FCAS, MAAA

How Advanced Pricing Analysis Can Support Underwriting by Claudine Modlin, FCAS, MAAA How Advanced Pricing Analysis Can Support Underwriting by Claudine Modlin, FCAS, MAAA September 21, 2014 2014 Towers Watson. All rights reserved. 3 What Is Predictive Modeling Predictive modeling uses

More information

Stat 401XV Exam 3 Spring 2017

Stat 401XV Exam 3 Spring 2017 Stat 40XV Exam Spring 07 I have neither given nor received unauthorized assistance on this exam. Name Signed Date Name Printed ATTENTION! Incorrect numerical answers unaccompanied by supporting reasoning

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation The likelihood and log-likelihood functions are the basis for deriving estimators for parameters, given data. While the shapes of these two functions are different, they have

More information

REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING

REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING International Civil Aviation Organization 27/8/10 WORKING PAPER REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING Cairo 2 to 4 November 2010 Agenda Item 3 a): Forecasting Methodology (Presented

More information

Describing Uncertain Variables

Describing Uncertain Variables Describing Uncertain Variables L7 Uncertainty in Variables Uncertainty in concepts and models Uncertainty in variables Lack of precision Lack of knowledge Variability in space/time Describing Uncertainty

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 4 Random Variables & Probability Distributions Content 1. Two Types of Random Variables 2. Probability Distributions for Discrete Random Variables 3. The Binomial

More information

STAT Chapter 7: Confidence Intervals

STAT Chapter 7: Confidence Intervals STAT 515 -- Chapter 7: Confidence Intervals With a point estimate, we used a single number to estimate a parameter. We can also use a set of numbers to serve as reasonable estimates for the parameter.

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions. ME3620 Theory of Engineering Experimentation Chapter III. Random Variables and Probability Distributions Chapter III 1 3.2 Random Variables In an experiment, a measurement is usually denoted by a variable

More information

Improving Returns-Based Style Analysis

Improving Returns-Based Style Analysis Improving Returns-Based Style Analysis Autumn, 2007 Daniel Mostovoy Northfield Information Services Daniel@northinfo.com Main Points For Today Over the past 15 years, Returns-Based Style Analysis become

More information

NORMAL RANDOM VARIABLES (Normal or gaussian distribution)

NORMAL RANDOM VARIABLES (Normal or gaussian distribution) NORMAL RANDOM VARIABLES (Normal or gaussian distribution) Many variables, as pregnancy lengths, foot sizes etc.. exhibit a normal distribution. The shape of the distribution is a symmetric bell shape.

More information

Analyzing the Determinants of Project Success: A Probit Regression Approach

Analyzing the Determinants of Project Success: A Probit Regression Approach 2016 Annual Evaluation Review, Linked Document D 1 Analyzing the Determinants of Project Success: A Probit Regression Approach 1. This regression analysis aims to ascertain the factors that determine development

More information

Risk Classification In Non-Life Insurance

Risk Classification In Non-Life Insurance Risk Classification In Non-Life Insurance Katrien Antonio Jan Beirlant November 28, 2006 Abstract Within the actuarial profession a major challenge can be found in the construction of a fair tariff structure.

More information

Multiple Regression and Logistic Regression II. Dajiang 525 Apr

Multiple Regression and Logistic Regression II. Dajiang 525 Apr Multiple Regression and Logistic Regression II Dajiang Liu @PHS 525 Apr-19-2016 Materials from Last Time Multiple regression model: Include multiple predictors in the model = + + + + How to interpret the

More information

4. GLIM for data with constant coefficient of variation

4. GLIM for data with constant coefficient of variation 4. GLIM for data with constant coefficient of variation 4.1. Data with constant coefficient of variation and some simple models Coefficient of variation The coefficient of variation of a random variable

More information

We can solve quadratic equations by transforming the. left side of the equation into a perfect square trinomial

We can solve quadratic equations by transforming the. left side of the equation into a perfect square trinomial Introduction We can solve quadratic equations by transforming the left side of the equation into a perfect square trinomial and using square roots to solve. Previously, you may have explored perfect square

More information

Recreational marijuana and collision claim frequencies

Recreational marijuana and collision claim frequencies Highway Loss Data Institute Bulletin Vol. 34, No. 14 : April 2017 Recreational marijuana and collision claim frequencies Summary Colorado was the first state to legalize recreational marijuana for adults

More information

Modeling of Claim Counts with k fold Cross-validation

Modeling of Claim Counts with k fold Cross-validation Modeling of Claim Counts with k fold Cross-validation Alicja Wolny Dominiak 1 Abstract In the ratemaking process the ranking, which takes into account the number of claims generated by a policy in a given

More information

TABLE OF CONTENTS - VOLUME 2

TABLE OF CONTENTS - VOLUME 2 TABLE OF CONTENTS - VOLUME 2 CREDIBILITY SECTION 1 - LIMITED FLUCTUATION CREDIBILITY PROBLEM SET 1 SECTION 2 - BAYESIAN ESTIMATION, DISCRETE PRIOR PROBLEM SET 2 SECTION 3 - BAYESIAN CREDIBILITY, DISCRETE

More information

Fundamentals of Statistics

Fundamentals of Statistics CHAPTER 4 Fundamentals of Statistics Expected Outcomes Know the difference between a variable and an attribute. Perform mathematical calculations to the correct number of significant figures. Construct

More information

Choice Probabilities. Logit Choice Probabilities Derivation. Choice Probabilities. Basic Econometrics in Transportation.

Choice Probabilities. Logit Choice Probabilities Derivation. Choice Probabilities. Basic Econometrics in Transportation. 1/31 Choice Probabilities Basic Econometrics in Transportation Logit Models Amir Samimi Civil Engineering Department Sharif University of Technology Primary Source: Discrete Choice Methods with Simulation

More information

CHAPTER 4 DATA ANALYSIS Data Hypothesis

CHAPTER 4 DATA ANALYSIS Data Hypothesis CHAPTER 4 DATA ANALYSIS 4.1. Data Hypothesis The hypothesis for each independent variable to express our expectations about the characteristic of each independent variable and the pay back performance

More information

Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance

Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance Douglas Bates Department of Statistics University of Wisconsin - Madison Madison January 11, 2011

More information