Session 5. Predictive Modeling in Life Insurance

Size: px
Start display at page:

Download "Session 5. Predictive Modeling in Life Insurance"

Transcription

1 SOA Predictive Analytics Seminar Hong Kong 29 Aug Hong Kong Session 5 Predictive Modeling in Life Insurance Jingyi Zhang, Ph.D

2 Predictive Modeling in Life Insurance JINGYI ZHANG PhD Scientist Global Research & Analytics Agenda Overview of Predictive Modelling Techniques o What actuary already know - OLS o Generalized Linear Model o Decision Tree Model o Clustering Sharing - Predictive Modelling Projects Introduction - RGA Science Team

3 What Is Predictive Modeling & Analytics? 1 High-quality data 2 Modeling Statistical model 3 Prediction Business decisions Predictive modeling & analytics is about driving business outcomes What actuary already know Are you familiar with the following terms? Linear Regression Ordinary Least Square (OLS) Linear regression model Y target variable, X i predictor variable, error term/noise i parameters to be estimated Underlying assumptions for a valid Linear regression model Normality, ε ~ N(0, 2 ) Homogeneity, Y representative of population, Independence between observations Linearity

4 Ordinary Least Squares Ordinary Least Squares(OLS) β min min For a simple regression β, β β Identical to Maximum likelihood estimator More robust and consistent approach β m,, min ln,, min Use adj R 2 to compare fitness of models 1 Define 1, but it is biased Adjusted 1 1 min if normal distribution portion that has been explained by OLS model portion of TSS for the error 5 Why actuary did not use OLS Processes are inherently linear, or can be well-approximated by LM Effectiveness & Completeness OLS makes very efficient use of the data; good results with relatively small datasets Identical to maximum likelihood estimation Easy to understand and communicate theory is well-understood; Results are easy to communicate Great! But wait There are several issues with OLS Validation of assumptions - Normal w/ constant 2, independent, homogeneous Unbounded data, non-negative value How about insurance application? Distribution of data, variance structure Binomial for rate (mortality/lapse/uw, etc.), 2 ~ r(1-r) Poisson for claim count, ~ mean OLS is not applicable in insurance, but you already know lots about modeling 6

5 What actuary may not know Machine Learning & Statistical Techniques Random Forest XGBoost machine Gradient Boosting Ada Boosting Support vector machine Ensemble method Survey Analysis Feature engineering Non-Parametric Analysis Decision Trees (CART) Neural Networks / Deep Learning Bayesian Analysis Classification/Association Analysis of Variance Mixed Models Survival Analysis Cluster Analysis (e.g. K-Means) PM terminology Supervised vs. Unsupervised Learning Supervised: estimate expected value of Y given values of X. GLM, Cox, CART, MARS, Random Forests, SVM, NN, etc. Unsupervised: find interesting patterns amongst X; no target variable Y Clustering, Correlation / Principal Components / Factor Analysis Classification vs. Regression Classification: to segment observations into 2 or more categories fraud vs. legitimate, lapsed vs. retained, UW class Regression: to predict a continuous amount. Dollars of loss for a policy, ultimate size of claim Parametric vs. Non-Parametric Parametric Statistics: probabilistic model of data Poisson Regression(claims count), Gamma (claim amount) Non-Parametric Statistics: no probability model specified classification trees, NN 8

6 Generalized Linear Model Generalized Linear Model(GLM) Major focus of PM in insurance industry Include most distributions related to insurance Great flexibility in variance structure OLS model is a special case of GLM (Relatively) Easy to understand and communicate Multiplicative model intuitive & consistent with insurance practice 3 components Random component Systematic component link function 9 Generalized Linear Model Random component Observations Y 1,..., Y n are independent w/ density from the exponential family ;,, From maximum likelihood theory,, Each distribution is specified in terms of mean & variance Variance is a function of mean Normal Poisson Binomial Gamma InverseGaussian Name,,,, Range (-,+) (0,+) (0,1) (0,+) (0,+) b e ln(1+e ) ln 2 / e e /(1+e ) 1/ 2 /

7 Why distribution will affect results GLM GLM Different Different Distributions C C A A B B Normal Poisson Gamma Poisson Normal Gamma Variance of different distributions Gaussian, constant Poisson, ~ mean Gamma, ~ mean^2 11 Generalized Linear Model Systematic component A linear predictor for observation i link function, random & systematic are connected by a smooth & invertible function Identity Log Logit Reciprocal ln ln 1 1/ 1/ Log is unique in insurance application - all parameters are multiplicative exp exp exp Consistent with most insurance practices Intuitively easy to understand and communicate 12

8 Generalized Linear Model Solve for parameters () by maximum likelihood Closed form for small data and simple model Iterative numerical techniques for large data set & complex model Use statistical analysis application, such as R Compare OSL and GLM Random Systematic Link OLS Normal only GLM Various distribution Great flexibility Various distribution, variance structure Prior weight and the credibility of data 13 Decision Tree Model Decision Tree - Classification And Regression Tree (CART) Both classification and regression Non-parametric approach (no insight in data structure) CART tree is generated by repeated partitioning of data set is split into two partitions (binary partition) Consider all possible values of all variables. Select the variable/value (X=t 1 ) that produces the greatest separation in the target. Partitions can also be split into sub-partitions (recursive) Until data in end node(leaf) is homogeneous (more or less) Results are very intuitive Identify specific groups that deviate in target variable Yet, algorithm is very sophisticated 14

9 Decision Tree Model Splitting Point Separation defined in many ways; different for regression & classification Regression Trees: use sum of squared errors Select X=t 1 such that max, Classification Trees: use measures of purity/impurity Intuition: an ideal tree model would produce nodes with only either class A or class B - completely pure nodes Gini Index - purity of a node 1 1 1, = probability of class i Entropy information index 1 1 Clustering Clustering algorithm Find similarities in data according to features in data & group similar objects into clusters Unsurprised (no pre-defined), classification, non-parametric How to measure similarities/dissimilarities, e.g. distance Numeric, categorical, and ordinal variables Partitioning (k-means), Hierarchical, etc. 16

10 Clustering Algorithm Partitioning algorithms - K-measn/k-medoids Maintain k clusters with k known; place points into their nearest cluster Hierarchical Objects are more related to nearby objects than to objects farther away; objects are connected by distance; how to define nearby object K-Means Algorithm 1. Select K points as initial centroids, with a given k 2. Repeat 3. Form K clusters by assign each points to its nearest centroid 4. Re-compute the centroids of each cluster 5. Until centroids do not change 17 Clustering Standardization / Normalization Values of variables may have different units Variable with high variability/range will dominate metric & lead to bias How to determine K Business reasons could dictate k Try different k, looking at the change in the average distance to centroid, as k increases; error falls rapidly until right k, then changes little 18

11 Clustering Comments on K-Means Strength: simple, very efficient & fast Weakness Applicable only when mean is defined, (categorical?) Need to know k in advance Unable to handle noisy data & outliers; sensitive to outliers Maybe sensitive to initialization Hierarchical clustering Bottom up or top down produce a dendrogram Important questions - how to represent a cluster of more than one point, & how to determine the nearness of clusters? Single Link: smallest distance between points Complete Link: largest distance between points Average Link: average distance between points Centroid: distance between centroids 19 Conclusion Advantage of actuary Industry knowledge - domain knowledge is a key in modeling process Expertise in data process - data is always #1 issue in data-driven application Unique position in data analytics Opportunity Solid foundation in statistics Education experience in modeling (OLS) Need to pick up new skills & thinking by education, training, and experience Actuaries can not miss it analytics is here to stay; it is changing insurance industry, and will fundamentally change how we run insurance business Actuaries could and should be on top of it and lead the change

12 Sharing - Predictive Modelling Projects Considerations Business Goals Objective is to support profitable growth of business Resources available & strong support from executives Sufficient quantity & high quality to support analytics Satisfactory data depth & width Able to obtain & capable to understand / clean data Environment Regulatory & privacy laws allow such data analytics Distribution channel can support data-driven solutions

13 Across the Value Chain As long as there is data, there is potential to capitalize on it Pre-sale Underwriting In-force management Claims High Predictive underwriting Preferred risk selection Crosssell/upsell Fraud/nondisclosure Multivariate analysis New rating factors Medium Propensity to apply & triggers Distributor quality control Propensity to complete purchase Underwriting triage Determine underwriting ratings Proactive lapse management Low Competitive pricing strategy Customer lifetime value Claims triage Level of client demand Customer Risk Scoring China Client would like to build customer risk score for their cancer product, which can predict the claim risk of the customer. Objectives To predict claim risk of customer To Improve customer experience for best risks with reduced UW & sales process To improve claim experience of existing customers, by identifying high risks Modeling & Lift Plot 6 statistically significant variables in model Claim risk of best group is less than half of their pricing assumption; the risk of worst group is about the double of their pricing assumption Two data source combined o Policy data o Claim data Modelled claim risk using wide range of rating factors & compared to pricing assumption

14 Bancassurance Predictive Underwriting - SEA A bank with a large customer base expressed a strong desire to increase sales penetration of their life product, while streamlining the underwriting process Objectives Simplified underwriting and sales process with high take-up for the best risks Reduce acquisition costs Increase protection sales and product penetration Two data sources combined: o Bank customer information at time of issue o Underwriting decision About 80 variables available for modeling: o Demographic data, bank and insurance product data, banking transaction data etc. Business Application and Lift Plot 11 statistically significant variables in model: o Branch, AUM, customer segment, credit card GIO for the best 20% risks; SIO for next best 20% Introduction - RGA Science Team

15 RGA Science Team Global Presence, Local Focus Global Research & Analytics The DS team collaborates with regional/local offices to focus on regional initiatives and local market projects We leverage local market knowledge to maximize data value & drive business outcomes Research Experience Analytics Strategy Science Regional Strategy Local Office Science team includes data scientists, actuaries and IT experts More than 50% of the team have a Ph.D. and the rest have master s degrees Work closely with UW, actuarial, admin and IT Global (15) Asia (6) RGA Science Team Who are We?

16 Thank You!

Session 5. A brief introduction to Predictive Modeling

Session 5. A brief introduction to Predictive Modeling SOA Predictive Analytics Seminar Malaysia 27 Aug. 2018 Kuala Lumpur, Malaysia Session 5 A brief introduction to Predictive Modeling Lichen Bao, Ph.D A Brief Introduction to Predictive Modeling LICHEN BAO

More information

Predictive modelling around the world Peter Banthorpe, RGA Kevin Manning, Milliman

Predictive modelling around the world Peter Banthorpe, RGA Kevin Manning, Milliman Predictive modelling around the world Peter Banthorpe, RGA Kevin Manning, Milliman 11 November 2013 Agenda Introduction to predictive analytics Applications overview Case studies Conclusions and Q&A Introduction

More information

Session 113 PD, Data and Model Actuaries Should be an Expert of Both. Moderator: David L. Snell, ASA, MAAA

Session 113 PD, Data and Model Actuaries Should be an Expert of Both. Moderator: David L. Snell, ASA, MAAA Session 113 PD, Data and Model Actuaries Should be an Expert of Both Moderator: David L. Snell, ASA, MAAA Presenters: Matthias Kullowatz Kenneth Warren Pagington, FSA, CERA, MAAA Qichun (Richard) Xu, FSA

More information

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting Quantile Regression By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting Agenda Overview of Predictive Modeling for P&C Applications Quantile

More information

ECS171: Machine Learning

ECS171: Machine Learning ECS171: Machine Learning Lecture 15: Tree-based Algorithms Cho-Jui Hsieh UC Davis March 7, 2018 Outline Decision Tree Random Forest Gradient Boosted Decision Tree (GBDT) Decision Tree Each node checks

More information

Forecasting & Futurism

Forecasting & Futurism Article from: Forecasting & Futurism December 2013 Issue 8 PREDICTIVE MODELING IN INSURANCE Modeling Process By Richard Xu In the July 2013 issue of the Forecasting & Futurism Newsletter, we introduced

More information

Subject CS2A Risk Modelling and Survival Analysis Core Principles

Subject CS2A Risk Modelling and Survival Analysis Core Principles ` Subject CS2A Risk Modelling and Survival Analysis Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who

More information

Credit Card Default Predictive Modeling

Credit Card Default Predictive Modeling Credit Card Default Predictive Modeling Background: Predicting credit card payment default is critical for the successful business model of a credit card company. An accurate predictive model can help

More information

UPDATED IAA EDUCATION SYLLABUS

UPDATED IAA EDUCATION SYLLABUS II. UPDATED IAA EDUCATION SYLLABUS A. Supporting Learning Areas 1. STATISTICS Aim: To enable students to apply core statistical techniques to actuarial applications in insurance, pensions and emerging

More information

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology Antitrust Notice The Casualty Actuarial Society is committed to adhering strictly to the letter and spirit of the antitrust laws. Seminars conducted under the auspices of the CAS are designed solely to

More information

Making the Link between Actuaries and Data Science

Making the Link between Actuaries and Data Science Making the Link between Actuaries and Data Science Simon Lee, Cecilia Chow, Thibault Imbert AXA Asia 2 nd ASHK General Insurance & Data Analytics Seminar Friday 7 October 2016 1 Agenda Data Driving Insurers

More information

2018 Predictive Analytics Symposium Session 10: Cracking the Black Box with Awareness & Validation

2018 Predictive Analytics Symposium Session 10: Cracking the Black Box with Awareness & Validation 2018 Predictive Analytics Symposium Session 10: Cracking the Black Box with Awareness & Validation SOA Antitrust Compliance Guidelines SOA Presentation Disclaimer Cracking the Black Box with Awareness

More information

February 25, Susan Watson, FSA Paul Anderson, FCAS Rahul Parsa, PhD

February 25, Susan Watson, FSA Paul Anderson, FCAS Rahul Parsa, PhD February 25, 2014 Susan Watson, FSA Paul Anderson, FCAS Rahul Parsa, PhD You don t want your privacy: Disney and the meat space data race Volume Velocity Variety Data Mining Process Collect Data Clean

More information

How Can YOU Use it? Artificial Intelligence for Actuaries. SOA Annual Meeting, Gaurav Gupta. Session 058PD

How Can YOU Use it? Artificial Intelligence for Actuaries. SOA Annual Meeting, Gaurav Gupta. Session 058PD Artificial Intelligence for Actuaries How Can YOU Use it? SOA Annual Meeting, 2018 Session 058PD Gaurav Gupta Founder & CEO ggupta@quaerainsights.com Audience Poll What is my level of AI understanding?

More information

Article from. Predictive Analytics and Futurism. June 2017 Issue 15

Article from. Predictive Analytics and Futurism. June 2017 Issue 15 Article from Predictive Analytics and Futurism June 2017 Issue 15 Using Predictive Modeling to Risk- Adjust Primary Care Panel Sizes By Anders Larson Most health actuaries are familiar with the concept

More information

Expanding Predictive Analytics Through the Use of Machine Learning

Expanding Predictive Analytics Through the Use of Machine Learning Expanding Predictive Analytics Through the Use of Machine Learning Thursday, February 28, 2013, 11:10 a.m. Chris Cooksey, FCAS, MAAA Chief Actuary EagleEye Analytics Columbia, S.C. Christopher Cooksey,

More information

Predictive Analytics in Life Insurance. Advances in Predictive Analytics Conference, University of Waterloo December 1, 2017

Predictive Analytics in Life Insurance. Advances in Predictive Analytics Conference, University of Waterloo December 1, 2017 Predictive Analytics in Life Insurance Advances in Predictive Analytics Conference, University of Waterloo December 1, 2017 Format of this session Speakers: Jean-Yves Rioux - Deloitte Kevin Pledge Claim

More information

PREDICTIVE ANALYTICS EVI TEDJASUKMANA 26 OCTOBER 2017 PERSATUAN AKTUARIS INDONESIA (THE SOCIETY OF ACTUARIES OF INDONESIA)

PREDICTIVE ANALYTICS EVI TEDJASUKMANA 26 OCTOBER 2017 PERSATUAN AKTUARIS INDONESIA (THE SOCIETY OF ACTUARIES OF INDONESIA) PREDICTIVE ANALYTICS EVI TEDJASUKMANA 26 OCTOBER 2017 Agenda 1. Predictive analytics why we need it? 2. Sample analytics 1 propensity to buy 3. Sample analytics 2 predictive underwriting 2 BACKGROUND Why

More information

Lectures and Seminars in Insurance Mathematics and Related Fields at ETH Zurich. Spring Semester 2019

Lectures and Seminars in Insurance Mathematics and Related Fields at ETH Zurich. Spring Semester 2019 December 2018 Lectures and Seminars in Insurance Mathematics and Related Fields at ETH Zurich Spring Semester 2019 Quantitative Risk Management, by Prof. Dr. Patrick Cheridito, #401-3629-00L This course

More information

Investing through Economic Cycles with Ensemble Machine Learning Algorithms

Investing through Economic Cycles with Ensemble Machine Learning Algorithms Investing through Economic Cycles with Ensemble Machine Learning Algorithms Thomas Raffinot Silex Investment Partners Big Data in Finance Conference Thomas Raffinot (Silex-IP) Economic Cycles-Machine Learning

More information

And The Winner Is? How to Pick a Better Model

And The Winner Is? How to Pick a Better Model And The Winner Is? How to Pick a Better Model Part 2 Goodness-of-Fit and Internal Stability Dan Tevet, FCAS, MAAA Goodness-of-Fit Trying to answer question: How well does our model fit the data? Can be

More information

Predicting and Preventing Credit Card Default

Predicting and Preventing Credit Card Default Predicting and Preventing Credit Card Default Project Plan MS-E2177: Seminar on Case Studies in Operations Research Client: McKinsey Finland Ari Viitala Max Merikoski (Project Manager) Nourhan Shafik 21.2.2018

More information

2017 Predictive Analytics Symposium

2017 Predictive Analytics Symposium 2017 Predictive Analytics Symposium Session 24, General Insurance Applications of PA Moderator: Stuart Klugman, FSA, CERA, Ph.D. Presenter: Peter Wu, ASA, FCAS, MAA SOA Antitrust Compliance Guidelines

More information

Actuarial. Predictive Modeling. March 23, Dan McCoach, Pricewaterhouse Coopers Ben Williams, Towers Watson

Actuarial. Predictive Modeling. March 23, Dan McCoach, Pricewaterhouse Coopers Ben Williams, Towers Watson Actuarial Data Analytics / Predictive Modeling March 23, 215 Matthew Morton, LTCG Dan McCoach, Pricewaterhouse Coopers Ben Williams, Towers Watson Agenda Introductions LTC Dashboard: Data Analytics Predictive

More information

Lecture 9: Classification and Regression Trees

Lecture 9: Classification and Regression Trees Lecture 9: Classification and Regression Trees Advanced Applied Multivariate Analysis STAT 2221, Spring 2015 Sungkyu Jung Department of Statistics, University of Pittsburgh Xingye Qiao Department of Mathematical

More information

Computational Statistics Handbook with MATLAB

Computational Statistics Handbook with MATLAB «H Computer Science and Data Analysis Series Computational Statistics Handbook with MATLAB Second Edition Wendy L. Martinez The Office of Naval Research Arlington, Virginia, U.S.A. Angel R. Martinez Naval

More information

How Advanced Pricing Analysis Can Support Underwriting by Claudine Modlin, FCAS, MAAA

How Advanced Pricing Analysis Can Support Underwriting by Claudine Modlin, FCAS, MAAA How Advanced Pricing Analysis Can Support Underwriting by Claudine Modlin, FCAS, MAAA September 21, 2014 2014 Towers Watson. All rights reserved. 3 What Is Predictive Modeling Predictive modeling uses

More information

Are New Modeling Techniques Worth It?

Are New Modeling Techniques Worth It? Are New Modeling Techniques Worth It? Tom Zougas PhD PEng, Manager Data Science, TransUnion TORONTO SAS USER GROUP MAY 2, 2018 Are New Modeling Techniques Worth It? Presenter Tom Zougas PhD PEng, Manager

More information

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018 ` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.

More information

Session 40 PD, How Would I Get Started With Predictive Modeling? Moderator: Douglas T. Norris, FSA, MAAA

Session 40 PD, How Would I Get Started With Predictive Modeling? Moderator: Douglas T. Norris, FSA, MAAA Session 40 PD, How Would I Get Started With Predictive Modeling? Moderator: Douglas T. Norris, FSA, MAAA Presenters: Timothy S. Paris, FSA, MAAA Sandra Tsui Shan To, FSA, MAAA Qinqing (Annie) Xue, FSA,

More information

A new look at tree based approaches

A new look at tree based approaches A new look at tree based approaches Xifeng Wang University of North Carolina Chapel Hill xifeng@live.unc.edu April 18, 2018 Xifeng Wang (UNC-Chapel Hill) Short title April 18, 2018 1 / 27 Outline of this

More information

Institute of Actuaries of India Subject CT6 Statistical Methods

Institute of Actuaries of India Subject CT6 Statistical Methods Institute of Actuaries of India Subject CT6 Statistical Methods For 2014 Examinations Aim The aim of the Statistical Methods subject is to provide a further grounding in mathematical and statistical techniques

More information

2017 IAA EDUCATION SYLLABUS

2017 IAA EDUCATION SYLLABUS 2017 IAA EDUCATION SYLLABUS 1. STATISTICS Aim: To enable students to apply core statistical techniques to actuarial applications in insurance, pensions and emerging areas of actuarial practice. 1.1 RANDOM

More information

Study Guide on Risk Margins for Unpaid Claims for SOA Exam GIADV G. Stolyarov II

Study Guide on Risk Margins for Unpaid Claims for SOA Exam GIADV G. Stolyarov II Study Guide on Risk Margins for Unpaid Claims for the Society of Actuaries (SOA) Exam GIADV: Advanced Topics in General Insurance (Based on the Paper "A Framework for Assessing Risk Margins" by Karl Marshall,

More information

Econometric Models of Expenditure

Econometric Models of Expenditure Econometric Models of Expenditure Benjamin M. Craig University of Arizona ISPOR Educational Teleconference October 28, 2005 1 Outline Overview of Expenditure Estimator Selection Two problems Two mistakes

More information

Gradient Boosting Trees: theory and applications

Gradient Boosting Trees: theory and applications Gradient Boosting Trees: theory and applications Dmitry Efimov November 05, 2016 Outline Decision trees Boosting Boosting trees Metaparameters and tuning strategies How-to-use remarks Regression tree True

More information

Support Vector Machines: Training with Stochastic Gradient Descent

Support Vector Machines: Training with Stochastic Gradient Descent Support Vector Machines: Training with Stochastic Gradient Descent Machine Learning Spring 2018 The slides are mainly from Vivek Srikumar 1 Support vector machines Training by maximizing margin The SVM

More information

MWSUG Paper AA 04. Claims Analytics. Mei Najim, Gallagher Bassett Services, Rolling Meadows, IL

MWSUG Paper AA 04. Claims Analytics. Mei Najim, Gallagher Bassett Services, Rolling Meadows, IL MWSUG 2017 - Paper AA 04 Claims Analytics Mei Najim, Gallagher Bassett Services, Rolling Meadows, IL ABSTRACT In the Property & Casualty Insurance industry, advanced analytics has increasingly penetrated

More information

Examining the Morningstar Quantitative Rating for Funds A new investment research tool.

Examining the Morningstar Quantitative Rating for Funds A new investment research tool. ? Examining the Morningstar Quantitative Rating for Funds A new investment research tool. Morningstar Quantitative Research 27 August 2018 Contents 1 Executive Summary 1 Introduction 2 Abbreviated Methodology

More information

Intro to GLM Day 2: GLM and Maximum Likelihood

Intro to GLM Day 2: GLM and Maximum Likelihood Intro to GLM Day 2: GLM and Maximum Likelihood Federico Vegetti Central European University ECPR Summer School in Methods and Techniques 1 / 32 Generalized Linear Modeling 3 steps of GLM 1. Specify the

More information

GLM III - The Matrix Reloaded

GLM III - The Matrix Reloaded GLM III - The Matrix Reloaded Duncan Anderson, Serhat Guven 12 March 2013 2012 Towers Watson. All rights reserved. Agenda "Quadrant Saddles" The Tweedie Distribution "Emergent Interactions" Dispersion

More information

Predictive Modeling GLM and Price Elasticity Model. David Dou October 8 th, 2014

Predictive Modeling GLM and Price Elasticity Model. David Dou October 8 th, 2014 Predictive Modeling GLM and Price Elasticity Model David Dou October 8 th, 2014 History of Predictive Modeling Pre-Computer Era: Triangles on a giant spreadsheet PC Era: Microsoft Excel oneway relativities

More information

Statistical Case Estimation Modelling

Statistical Case Estimation Modelling Statistical Case Estimation Modelling - An Overview of the NSW WorkCover Model Presented by Richard Brookes and Mitchell Prevett Presented to the Institute of Actuaries of Australia Accident Compensation

More information

An Online Algorithm for Multi-Strategy Trading Utilizing Market Regimes

An Online Algorithm for Multi-Strategy Trading Utilizing Market Regimes An Online Algorithm for Multi-Strategy Trading Utilizing Market Regimes Hynek Mlnařík 1 Subramanian Ramamoorthy 2 Rahul Savani 1 1 Warwick Institute for Financial Computing Department of Computer Science

More information

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright Faculty and Institute of Actuaries Claims Reserving Manual v.2 (09/1997) Section D7 [D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright 1. Introduction

More information

Decision Trees An Early Classifier

Decision Trees An Early Classifier An Early Classifier Jason Corso SUNY at Buffalo January 19, 2012 J. Corso (SUNY at Buffalo) Trees January 19, 2012 1 / 33 Introduction to Non-Metric Methods Introduction to Non-Metric Methods We cover

More information

2017 Predictive Analytics Symposium

2017 Predictive Analytics Symposium 2017 Predictive Analytics Symposium Session 7, Risk Assessment Applications of Predictive Analytics Moderator: Priyanka Srivastava Presenters: Dihui Lai, Ph.D. Nitin Nayak, Ph.D., MBA Jason L. VonBergen,

More information

Macroeconomic conditions and equity market volatility. Benn Eifert, PhD February 28, 2016

Macroeconomic conditions and equity market volatility. Benn Eifert, PhD February 28, 2016 Macroeconomic conditions and equity market volatility Benn Eifert, PhD February 28, 2016 beifert@berkeley.edu Overview Much of the volatility of the last six months has been driven by concerns about the

More information

DFAST Modeling and Solution

DFAST Modeling and Solution Regulatory Environment Summary Fallout from the 2008-2009 financial crisis included the emergence of a new regulatory landscape intended to safeguard the U.S. banking system from a systemic collapse. In

More information

Predictive Modelling. Document Turning Big Data into Big Opportunities

Predictive Modelling. Document Turning Big Data into Big Opportunities Predictive Modelling Document 218081 Turning Big Data into Big Opportunities Essays on Predictive Modelling: Turning Big Data into Big Opportunities In recent years, data has become a key driver of economic

More information

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18,   ISSN International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, www.ijcea.com ISSN 31-3469 AN INVESTIGATION OF FINANCIAL TIME SERIES PREDICTION USING BACK PROPAGATION NEURAL

More information

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18,   ISSN Volume XII, Issue II, Feb. 18, www.ijcea.com ISSN 31-3469 AN INVESTIGATION OF FINANCIAL TIME SERIES PREDICTION USING BACK PROPAGATION NEURAL NETWORKS K. Jayanthi, Dr. K. Suresh 1 Department of Computer

More information

Lending Club Loan Portfolio Optimization Fred Robson (frobson), Chris Lucas (cflucas)

Lending Club Loan Portfolio Optimization Fred Robson (frobson), Chris Lucas (cflucas) CS22 Artificial Intelligence Stanford University Autumn 26-27 Lending Club Loan Portfolio Optimization Fred Robson (frobson), Chris Lucas (cflucas) Overview Lending Club is an online peer-to-peer lending

More information

Syllabus 2019 Contents

Syllabus 2019 Contents Page 2 of 201 (26/06/2017) Syllabus 2019 Contents CS1 Actuarial Statistics 1 3 CS2 Actuarial Statistics 2 12 CM1 Actuarial Mathematics 1 22 CM2 Actuarial Mathematics 2 32 CB1 Business Finance 41 CB2 Business

More information

From Double Chain Ladder To Double GLM

From Double Chain Ladder To Double GLM University of Amsterdam MSc Stochastics and Financial Mathematics Master Thesis From Double Chain Ladder To Double GLM Author: Robert T. Steur Examiner: dr. A.J. Bert van Es Supervisors: drs. N.R. Valkenburg

More information

Panel Data with Binary Dependent Variables

Panel Data with Binary Dependent Variables Essex Summer School in Social Science Data Analysis Panel Data Analysis for Comparative Research Panel Data with Binary Dependent Variables Christopher Adolph Department of Political Science and Center

More information

1. You are given the following information about a stationary AR(2) model:

1. You are given the following information about a stationary AR(2) model: Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4

More information

ALGORITHMIC TRADING STRATEGIES IN PYTHON

ALGORITHMIC TRADING STRATEGIES IN PYTHON 7-Course Bundle In ALGORITHMIC TRADING STRATEGIES IN PYTHON Learn to use 15+ trading strategies including Statistical Arbitrage, Machine Learning, Quantitative techniques, Forex valuation methods, Options

More information

Machine Learning Applications in Insurance

Machine Learning Applications in Insurance General Public Release Machine Learning Applications in Insurance Nitin Nayak, Ph.D. Digital & Smart Analytics Swiss Re General Public Release Machine learning is.. Giving computers the ability to learn

More information

Advanced Risk Management Use of Predictive Modeling in Underwriting and Pricing

Advanced Risk Management Use of Predictive Modeling in Underwriting and Pricing Advanced Risk Management Use of Predictive Modeling in Underwriting and Pricing By Saikat Maitra & Debashish Banerjee Abstract In this paper, the authors describe data mining and predictive modeling techniques

More information

Analyzing the Determinants of Project Success: A Probit Regression Approach

Analyzing the Determinants of Project Success: A Probit Regression Approach 2016 Annual Evaluation Review, Linked Document D 1 Analyzing the Determinants of Project Success: A Probit Regression Approach 1. This regression analysis aims to ascertain the factors that determine development

More information

MAKING CLAIMS APPLICATIONS OF PREDICTIVE ANALYTICS IN LONG-TERM CARE BY ROBERT EATON AND MISSY GORDON

MAKING CLAIMS APPLICATIONS OF PREDICTIVE ANALYTICS IN LONG-TERM CARE BY ROBERT EATON AND MISSY GORDON MAKING CLAIMS APPLICATIONS OF PREDICTIVE ANALYTICS IN LONG-TERM CARE BY ROBERT EATON AND MISSY GORDON Predictive analytics has taken far too long in getting its foothold in the long-term care (LTC) insurance

More information

Beyond GLMs. Xavier Conort & Colin Priest

Beyond GLMs. Xavier Conort & Colin Priest Beyond GLMs Xavier Conort & Colin Priest 1 Agenda 1. GLMs and Actuaries 2. Extensions to GLMs 3. Automating GLM model building 4. Best practice predictive modelling 5. Conclusion 2 1) GLMs Linear models

More information

Effects of missing data in credit risk scoring. A comparative analysis of methods to gain robustness in presence of sparce data

Effects of missing data in credit risk scoring. A comparative analysis of methods to gain robustness in presence of sparce data Credit Research Centre Credit Scoring and Credit Control X 29-31 August 2007 The University of Edinburgh - Management School Effects of missing data in credit risk scoring. A comparative analysis of methods

More information

Hierarchical Generalized Linear Models. Measurement Incorporated Hierarchical Linear Models Workshop

Hierarchical Generalized Linear Models. Measurement Incorporated Hierarchical Linear Models Workshop Hierarchical Generalized Linear Models Measurement Incorporated Hierarchical Linear Models Workshop Hierarchical Generalized Linear Models So now we are moving on to the more advanced type topics. To begin

More information

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5] 1 High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5] High-frequency data have some unique characteristics that do not appear in lower frequencies. At this class we have: Nonsynchronous

More information

Predicting First Day Returns for Japanese IPOs

Predicting First Day Returns for Japanese IPOs Predicting First Day Returns for Japanese IPOs Executive Summary Goal: To predict the First Day returns on Japanese IPOs (based on first day closing price), using public information available prior to

More information

2017 Predictive Analytics Symposium

2017 Predictive Analytics Symposium 2017 Predictive Analytics Symposium Session 29, Predictive Analytics for Inforce Management Moderator: Rohan Noel Alahakone, ASA, MAAA Presenters: Jenny Jin, FSA, MAAA Assaf Mizan Martin Snow, FSA, MAAA

More information

Multiple Regression and Logistic Regression II. Dajiang 525 Apr

Multiple Regression and Logistic Regression II. Dajiang 525 Apr Multiple Regression and Logistic Regression II Dajiang Liu @PHS 525 Apr-19-2016 Materials from Last Time Multiple regression model: Include multiple predictors in the model = + + + + How to interpret the

More information

Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State University Tempe, AZ

Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State University Tempe, AZ Process Monitoring for Correlated Gamma Distributed Data Using Generalized Linear Model Based Control Charts Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State

More information

Academic Research Review. Classifying Market Conditions Using Hidden Markov Model

Academic Research Review. Classifying Market Conditions Using Hidden Markov Model Academic Research Review Classifying Market Conditions Using Hidden Markov Model INTRODUCTION Best known for their applications in speech recognition, Hidden Markov Models (HMMs) are able to discern and

More information

Examining Long-Term Trends in Company Fundamentals Data

Examining Long-Term Trends in Company Fundamentals Data Examining Long-Term Trends in Company Fundamentals Data Michael Dickens 2015-11-12 Introduction The equities market is generally considered to be efficient, but there are a few indicators that are known

More information

6. Genetics examples: Hardy-Weinberg Equilibrium

6. Genetics examples: Hardy-Weinberg Equilibrium PBCB 206 (Fall 2006) Instructor: Fei Zou email: fzou@bios.unc.edu office: 3107D McGavran-Greenberg Hall Lecture 4 Topics for Lecture 4 1. Parametric models and estimating parameters from data 2. Method

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #6 EPSY 905: Maximum Likelihood In This Lecture The basics of maximum likelihood estimation Ø The engine that

More information

An Application of Data Mining Algorithms For Shipbuilding Cost Estimation

An Application of Data Mining Algorithms For Shipbuilding Cost Estimation An Application of Data Mining Algorithms For Shipbuilding Cost Estimation Bohdan L. Kaluzny Centre for Operational Research & Analysis Defence Research & Development Canada April 2011 Acknowledgements:

More information

Financial Risk Management

Financial Risk Management Financial Risk Management Professor: Thierry Roncalli Evry University Assistant: Enareta Kurtbegu Evry University Tutorial exercices #4 1 Correlation and copulas 1. The bivariate Gaussian copula is given

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions. ME3620 Theory of Engineering Experimentation Chapter III. Random Variables and Probability Distributions Chapter III 1 3.2 Random Variables In an experiment, a measurement is usually denoted by a variable

More information

Stochastic Claims Reserving _ Methods in Insurance

Stochastic Claims Reserving _ Methods in Insurance Stochastic Claims Reserving _ Methods in Insurance and John Wiley & Sons, Ltd ! Contents Preface Acknowledgement, xiii r xi» J.. '..- 1 Introduction and Notation : :.... 1 1.1 Claims process.:.-.. : 1

More information

Session 79PD, Using Predictive Analytics to Develop Assumptions. Moderator/Presenter: Jonathan D. White, FSA, MAAA, CERA

Session 79PD, Using Predictive Analytics to Develop Assumptions. Moderator/Presenter: Jonathan D. White, FSA, MAAA, CERA Session 79PD, Using Predictive Analytics to Develop Assumptions Moderator/Presenter: Jonathan D. White, FSA, MAAA, CERA Presenters: Missy A. Gordon, FSA, MAAA Brian M. Hartman, ASA SOA Antitrust Disclaimer

More information

Model fit assessment via marginal model plots

Model fit assessment via marginal model plots The Stata Journal (2010) 10, Number 2, pp. 215 225 Model fit assessment via marginal model plots Charles Lindsey Texas A & M University Department of Statistics College Station, TX lindseyc@stat.tamu.edu

More information

Competition price analysis in non-life insurance

Competition price analysis in non-life insurance White Paper on Non-Life Insurance: Competition A Reacfin price White analysis Paper in on non-life Non-Life insurance Insurance: - How machine learning and statistical predictive models can help Competition

More information

Dynamic Copula Methods in Finance

Dynamic Copula Methods in Finance Dynamic Copula Methods in Finance Umberto Cherubini Fabio Gofobi Sabriea Mulinacci Silvia Romageoli A John Wiley & Sons, Ltd., Publication Contents Preface ix 1 Correlation Risk in Finance 1 1.1 Correlation

More information

An introduction to Machine learning methods and forecasting of time series in financial markets

An introduction to Machine learning methods and forecasting of time series in financial markets An introduction to Machine learning methods and forecasting of time series in financial markets Mark Wong markwong@kth.se December 10, 2016 Abstract The goal of this paper is to give the reader an introduction

More information

Session 2. Predictive Analytics in Policyholder Behavior

Session 2. Predictive Analytics in Policyholder Behavior SOA Predictive Analytics Seminar Malaysia 27 Aug. 2018 Kuala Lumpur, Malaysia Session 2 Predictive Analytics in Policyholder Behavior Eileen Burns, FSA, MAAA David Wang, FSA, FIA, MAAA Predictive Analytics

More information

MONTE-CARLO SIMULATION CALCULATION OF VAR (VALUE-AT-RISK) & CVAR (CONDITIONAL VALUE-AT-RISK)

MONTE-CARLO SIMULATION CALCULATION OF VAR (VALUE-AT-RISK) & CVAR (CONDITIONAL VALUE-AT-RISK) MONTE-CARLO SIMULATION CALCULATION OF VAR (VALUE-AT-RISK) & CVAR (CONDITIONAL VALUE-AT-RISK) PRESENTER: SANJOY ROY 15-APR-2018 TERMINOLOGY V-a-R (Value-At-Risk) How much can one expect to lose Parameters

More information

Machine Learning for Physicists Lecture 10. Summer 2017 University of Erlangen-Nuremberg Florian Marquardt

Machine Learning for Physicists Lecture 10. Summer 2017 University of Erlangen-Nuremberg Florian Marquardt Machine Learning for Physicists Lecture 10 Summer 2017 University of Erlangen-Nuremberg Florian Marquardt Function/Image representation Image classification [Handwriting recognition] Convolutional nets

More information

Minimizing Basis Risk for Cat-In- Catastrophe Bonds Editor s note: AIR Worldwide has long dominanted the market for. By Dr.

Minimizing Basis Risk for Cat-In- Catastrophe Bonds Editor s note: AIR Worldwide has long dominanted the market for. By Dr. Minimizing Basis Risk for Cat-In- A-Box Parametric Earthquake Catastrophe Bonds Editor s note: AIR Worldwide has long dominanted the market for 06.2010 AIRCurrents catastrophe risk modeling and analytical

More information

The Next Innovations in Bancassurance

The Next Innovations in Bancassurance The Next Innovations in Bancassurance Ola Oyekan, PhD Research & Development Specialist, RGA 12 th October, 2015 Agenda RGA and bancassurance Bancassurance trends & case studies Business models Bancassurance

More information

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS Melfi Alrasheedi School of Business, King Faisal University, Saudi

More information

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology FE670 Algorithmic Trading Strategies Lecture 4. Cross-Sectional Models and Trading Strategies Steve Yang Stevens Institute of Technology 09/26/2013 Outline 1 Cross-Sectional Methods for Evaluation of Factor

More information

TABLE OF CONTENTS - VOLUME 2

TABLE OF CONTENTS - VOLUME 2 TABLE OF CONTENTS - VOLUME 2 CREDIBILITY SECTION 1 - LIMITED FLUCTUATION CREDIBILITY PROBLEM SET 1 SECTION 2 - BAYESIAN ESTIMATION, DISCRETE PRIOR PROBLEM SET 2 SECTION 3 - BAYESIAN CREDIBILITY, DISCRETE

More information

(iii) Under equal cluster sampling, show that ( ) notations. (d) Attempt any four of the following:

(iii) Under equal cluster sampling, show that ( ) notations. (d) Attempt any four of the following: Central University of Rajasthan Department of Statistics M.Sc./M.A. Statistics (Actuarial)-IV Semester End of Semester Examination, May-2012 MSTA 401: Sampling Techniques and Econometric Methods Max. Marks:

More information

THE F FILES. Group benefits fraud what you need to know to fight fraud GET #FRAUDSMART

THE F FILES. Group benefits fraud what you need to know to fight fraud GET #FRAUDSMART THE F FILES Group benefits fraud what you need to know to fight fraud GET #FRAUDSMART SPRING 2018 LOOKING INTO THE FUTURE OF FRAUD WITH PREDICTIVE ANALYTICS Big data it is fundamental in the fight against

More information

Master s in Financial Engineering Foundations of Buy-Side Finance: Quantitative Risk and Portfolio Management. > Teaching > Courses

Master s in Financial Engineering Foundations of Buy-Side Finance: Quantitative Risk and Portfolio Management.  > Teaching > Courses Master s in Financial Engineering Foundations of Buy-Side Finance: Quantitative Risk and Portfolio Management www.symmys.com > Teaching > Courses Spring 2008, Monday 7:10 pm 9:30 pm, Room 303 Attilio Meucci

More information

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized

More information

Data Mining Applications in Health Insurance

Data Mining Applications in Health Insurance Data Mining Applications in Health Insurance Salford Systems Data Mining Conference New York, NY March 28-30, 2005 Lijia Guo,, PhD, ASA, MAAA University of Central Florida 1 Agenda Introductions to Data

More information

Estimating Mixed Logit Models with Large Choice Sets. Roger H. von Haefen, NC State & NBER Adam Domanski, NOAA July 2013

Estimating Mixed Logit Models with Large Choice Sets. Roger H. von Haefen, NC State & NBER Adam Domanski, NOAA July 2013 Estimating Mixed Logit Models with Large Choice Sets Roger H. von Haefen, NC State & NBER Adam Domanski, NOAA July 2013 Motivation Bayer et al. (JPE, 2007) Sorting modeling / housing choice 250,000 individuals

More information

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998 Economics 312 Sample Project Report Jeffrey Parker Introduction This project is based on Exercise 2.12 on page 81 of the Hill, Griffiths, and Lim text. It examines how the sale price of houses in Stockton,

More information

Model 0: We start with a linear regression model: log Y t = β 0 + β 1 (t 1980) + ε, with ε N(0,

Model 0: We start with a linear regression model: log Y t = β 0 + β 1 (t 1980) + ε, with ε N(0, Stat 534: Fall 2017. Introduction to the BUGS language and rjags Installation: download and install JAGS. You will find the executables on Sourceforge. You must have JAGS installed prior to installing

More information

Statistical Data Mining for Computational Financial Modeling

Statistical Data Mining for Computational Financial Modeling Statistical Data Mining for Computational Financial Modeling Ali Serhan KOYUNCUGIL, Ph.D. Capital Markets Board of Turkey - Research Department Ankara, Turkey askoyuncugil@gmail.com www.koyuncugil.org

More information