Beyond GLMs. Xavier Conort & Colin Priest

Size: px
Start display at page:

Download "Beyond GLMs. Xavier Conort & Colin Priest"

Transcription

1 Beyond GLMs Xavier Conort & Colin Priest 1

2 Agenda 1. GLMs and Actuaries 2. Extensions to GLMs 3. Automating GLM model building 4. Best practice predictive modelling 5. Conclusion 2

3 1) GLMs Linear models for statistical distributions that aren t Normal Taught in the actuarial education process Widely used by actuaries for Pricing Understanding lapse rates Marketing Claims reserving 3

4 GLMs are old Developed in 1972 in an era of small data and no PCs Used by actuaries for decades There are newer techniques to choose from 4

5 2) Extensions to GLMs GAMs GAMLSS GLMMs Regularized GLMs 5

6 GAMs Generalized additive models Automatically fits nonlinear relationships Works a bit like rolling averages 6

7 GAMs Good For Designing data transformations for GLMs Unsuitable For Pricing, or whenever a formula is required Data containing only categorical variables 7

8 GAMLSS Generalized additive models with location, scale and shape Allows you to model how the variance and skewness varies 8

9 GAMLSS Good For understanding variability simulating risk e.g. internal models data exhibiting heteroskedasticity Unsuitable For when you only want the prediction data exhibiting homoskedasticity 9

10 GLMMs Generalized linear mixed models Allows for automatic credibility weighting 10

11 GLMMs Good For Categorical variables and interactions between categorical and numeric Sparse data Hierarchical relationships e.g. vehicle make and model Unsuitable For Large amounts of highly credible data Data without categorical variables 11

12 Regularized GLMs Regularisation is any method that penalises overfitting or complexity in models Automatically chooses predictors Automatically allows for credibility 12

13 Regularized GLMs Good For Collinearity in data Making GLMs more reliable Lots of input variables Sparse data Unsuitable For Complex interactions between variables 13

14 3) Automating GLM Building Why automate? Variable selection (feature selection) Linearising (feature engineering) Dimensionality reduction 14

15 Why Automate? Building GLMs requires time and people and both of these are expensive! Most of the resource intensive work is ruledriven without much complex judgement required It s just like when books were copied by hand! 15

16 Variable Selection Variable Importance using machine learning 16

17 Variable Selection Using lasso regularized GLMs 17

18 Variable Selection Using genetic algorithms 18

19 Linearising Using GAMs 19

20 Linearising Using GBMs 20

21 Dimensionality Reduction variable importance for reducing number of categories 21

22 Dimensionality Reduction Text mining for grouping categories together and reducing the number of categories 22

23 4) Best practices observed in 23

24 My own learning curve Previously... now! 24

25 What is Kaggle? A social fight club for data geeks In 2010, Anthony Goldbloom took the SIGKDD and Netflix s model And attracted 371,397 data geeks as of Sept 17, 2015! Kaggle worked with more than 20 Fortune 500 companies including 3 leading insurance companies + 1 Australian insurer represented by Deloitte 25

26 Machine Learning works for insurance too! Won by my colleague Owen Zhang and me! 26

27 Why Geeks like to fight? My motivation has been to learn new things. 27

28 What did I learn? The Machine works much faster and harder than me Feature engineering is key to success. And actuaries are good in this. They can however learn more by being exposed to problems and datasets outside the insurance industry Top Kagglers use actuarial tricks such as credibility estimates Most popular and powerful Machine Learning algorithms used by the data science community are open source algorithms 28

29 Competitions that boosted my learning curve The Machine seems much smarter than I am at capturing complexity in the data even for simple datasets! Humans can help the Machine too! But don t oversimplify and discard any data. Don t be impatient. My best GBM had 24,500 trees with learning rate = 0.01! SVM and feature selection matter too! 29

30 Competitions that boosted my learning curve Word n-grams and character n-grams can make a big difference Parallel processing and big servers can help with complex feature engineering! Glmnet can do a great job! Sklearn in Python is cool too! 30

31 & Machine Learning algos to know to automatically capture complexity in the data Gradient Boosting Machine packages 1. R gbm 2. R xgboost 3. Sklearn GradientBoostingClassifier and GradientBoostingRegressor Forest packages 1. R randomforest 2. Sklearn RandomForestClassifier and RandomForestRegressor 3. R extratrees 4. Sklearn ExtraTreesClassifier and ExtraTreesRegressor Support Vector Machine packages 1. R e Sklearn svc and svr 3. Sklearn Nystroem 31

32 & Machine Learning algos to know. to take advantage of high cardinality categorical features or text data Regularized generalized linear models 1. R glmnet 2. Sklearn Ridge 3. Sklearn LogisticRegression Feature Extraction for categorical features or text data 1. R Matrix 2. Sklearn OneHotEncoder and DictVectorizer 3. R tau 4. Sklearn TfidfVectorizer 32

33 & tools to know to make your code efficient Data manipulation at faster speed 1. R data.table 2. Python pandas Parallel computing 1. R foreach / domc 2. Python joblib 33

34 Adapt feature engineering to ML algo Machine Learning (ML) algo Categorical variables support Features subsampling Sparse support Insensitive to scale & uniform transf Automated nonlinear and interactions modelling Handle missing value R Random Forest Yes. up to 32 levels Yes No Yes Yes No Sklearn Random Forest No Yes Yes but slow Yes Yes No R Gradient Boosting Machine Yes. up to 1024 levels No No Yes Yes Yes Sklearn Gradient Boosted Regression Trees No Yes Yes but slow Yes Yes No extreme Gradient Boosting No Yes Yes Yes Yes Yes Regularized GLMs No No Yes No No No Support Vector Machine No No Yes No Yes No 34

35 Other popular algorithms popular in 35

36 Most important. Don t forget to use your actuarial intuition to help the machine! Always consider simple feature engineering that makes sense for your business such differences / ratios of features Be creative, feature engineering is often key to success. Don t trust features that are too good They can make the Machine lazy! An example: GE Flight Quest or they are likely to be caused by a bug or a leak! 36

37 Hans Bu hlmann ) Conclusion It s time to become actuaries of the 5 th kind Paul Embrechts 2005 Big Data Working Party Actuaries of the First Kind Actuaries of the Second Kind Actuaries of the Third Kind Actuaries of the Fourth Kind Actuaries of the Fifth Kind 17 th century: Life insurance, Deterministic methods Early 20 th century: General insurance, Probabilistic methods 1980s: Assets/derivatives, Contingencies Stochastic processes Early 21 st century: ERM Second decade of 21 st century: Big Data 37

38 Conclusion So that we aren t replaced by robots (or data scientists) 38

ECS171: Machine Learning

ECS171: Machine Learning ECS171: Machine Learning Lecture 15: Tree-based Algorithms Cho-Jui Hsieh UC Davis March 7, 2018 Outline Decision Tree Random Forest Gradient Boosted Decision Tree (GBDT) Decision Tree Each node checks

More information

B.A.U. for actuaries:

B.A.U. for actuaries: B.A.U. for actuaries: Big data, Analytics & Unstructured data Big Data Working Party Mudit Gupta (Chair), Colin Priest, David Menezes, Frank Devlin, Frankie Chan, Xavier Conort SAS GI Conference 2015 1

More information

How Can YOU Use it? Artificial Intelligence for Actuaries. SOA Annual Meeting, Gaurav Gupta. Session 058PD

How Can YOU Use it? Artificial Intelligence for Actuaries. SOA Annual Meeting, Gaurav Gupta. Session 058PD Artificial Intelligence for Actuaries How Can YOU Use it? SOA Annual Meeting, 2018 Session 058PD Gaurav Gupta Founder & CEO ggupta@quaerainsights.com Audience Poll What is my level of AI understanding?

More information

Machine Learning: from flight data to claims management Xavier Conort Principal Research Engineer Institute for Infocomm Research Agency for Science,

Machine Learning: from flight data to claims management Xavier Conort Principal Research Engineer Institute for Infocomm Research Agency for Science, Machine Learning: from flight data to claims management Xavier Conort Principal Research Engineer Institute for Infocomm Research Agency for Science, Technology and Research (A*STAR) Previously, Now, Evolu&on(of(Data(and(Analy&cs(

More information

Making the Link between Actuaries and Data Science

Making the Link between Actuaries and Data Science Making the Link between Actuaries and Data Science Simon Lee, Cecilia Chow, Thibault Imbert AXA Asia 2 nd ASHK General Insurance & Data Analytics Seminar Friday 7 October 2016 1 Agenda Data Driving Insurers

More information

Session 79PD, Using Predictive Analytics to Develop Assumptions. Moderator/Presenter: Jonathan D. White, FSA, MAAA, CERA

Session 79PD, Using Predictive Analytics to Develop Assumptions. Moderator/Presenter: Jonathan D. White, FSA, MAAA, CERA Session 79PD, Using Predictive Analytics to Develop Assumptions Moderator/Presenter: Jonathan D. White, FSA, MAAA, CERA Presenters: Missy A. Gordon, FSA, MAAA Brian M. Hartman, ASA SOA Antitrust Disclaimer

More information

Session 5. Predictive Modeling in Life Insurance

Session 5. Predictive Modeling in Life Insurance SOA Predictive Analytics Seminar Hong Kong 29 Aug. 2018 Hong Kong Session 5 Predictive Modeling in Life Insurance Jingyi Zhang, Ph.D Predictive Modeling in Life Insurance JINGYI ZHANG PhD Scientist Global

More information

ig data, nalytics & nstructured data for actuaries Frankie Chan and David Menezes Singapore Actuarial Society s Big Data Working Party

ig data, nalytics & nstructured data for actuaries Frankie Chan and David Menezes Singapore Actuarial Society s Big Data Working Party ig data, nalytics & nstructured data for actuaries Frankie Chan and David Menezes Singapore Actuarial Society s Big Data Working Party nd ASHK General Insurance & Data Analytics Seminar Friday 7 October

More information

February 25, Susan Watson, FSA Paul Anderson, FCAS Rahul Parsa, PhD

February 25, Susan Watson, FSA Paul Anderson, FCAS Rahul Parsa, PhD February 25, 2014 Susan Watson, FSA Paul Anderson, FCAS Rahul Parsa, PhD You don t want your privacy: Disney and the meat space data race Volume Velocity Variety Data Mining Process Collect Data Clean

More information

Predictive modelling around the world Peter Banthorpe, RGA Kevin Manning, Milliman

Predictive modelling around the world Peter Banthorpe, RGA Kevin Manning, Milliman Predictive modelling around the world Peter Banthorpe, RGA Kevin Manning, Milliman 11 November 2013 Agenda Introduction to predictive analytics Applications overview Case studies Conclusions and Q&A Introduction

More information

ALGORITHMIC TRADING STRATEGIES IN PYTHON

ALGORITHMIC TRADING STRATEGIES IN PYTHON 7-Course Bundle In ALGORITHMIC TRADING STRATEGIES IN PYTHON Learn to use 15+ trading strategies including Statistical Arbitrage, Machine Learning, Quantitative techniques, Forex valuation methods, Options

More information

UPDATED IAA EDUCATION SYLLABUS

UPDATED IAA EDUCATION SYLLABUS II. UPDATED IAA EDUCATION SYLLABUS A. Supporting Learning Areas 1. STATISTICS Aim: To enable students to apply core statistical techniques to actuarial applications in insurance, pensions and emerging

More information

Article from. Predictive Analytics and Futurism. June 2017 Issue 15

Article from. Predictive Analytics and Futurism. June 2017 Issue 15 Article from Predictive Analytics and Futurism June 2017 Issue 15 Using Predictive Modeling to Risk- Adjust Primary Care Panel Sizes By Anders Larson Most health actuaries are familiar with the concept

More information

Session 5. A brief introduction to Predictive Modeling

Session 5. A brief introduction to Predictive Modeling SOA Predictive Analytics Seminar Malaysia 27 Aug. 2018 Kuala Lumpur, Malaysia Session 5 A brief introduction to Predictive Modeling Lichen Bao, Ph.D A Brief Introduction to Predictive Modeling LICHEN BAO

More information

Lectures and Seminars in Insurance Mathematics and Related Fields at ETH Zurich. Spring Semester 2019

Lectures and Seminars in Insurance Mathematics and Related Fields at ETH Zurich. Spring Semester 2019 December 2018 Lectures and Seminars in Insurance Mathematics and Related Fields at ETH Zurich Spring Semester 2019 Quantitative Risk Management, by Prof. Dr. Patrick Cheridito, #401-3629-00L This course

More information

Better decision making under uncertain conditions using Monte Carlo Simulation

Better decision making under uncertain conditions using Monte Carlo Simulation IBM Software Business Analytics IBM SPSS Statistics Better decision making under uncertain conditions using Monte Carlo Simulation Monte Carlo simulation and risk analysis techniques in IBM SPSS Statistics

More information

Machine Learning in Risk Forecasting and its Application in Low Volatility Strategies

Machine Learning in Risk Forecasting and its Application in Low Volatility Strategies NEW THINKING Machine Learning in Risk Forecasting and its Application in Strategies By Yuriy Bodjov Artificial intelligence and machine learning are two terms that have gained increased popularity within

More information

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology Antitrust Notice The Casualty Actuarial Society is committed to adhering strictly to the letter and spirit of the antitrust laws. Seminars conducted under the auspices of the CAS are designed solely to

More information

Session 113 PD, Data and Model Actuaries Should be an Expert of Both. Moderator: David L. Snell, ASA, MAAA

Session 113 PD, Data and Model Actuaries Should be an Expert of Both. Moderator: David L. Snell, ASA, MAAA Session 113 PD, Data and Model Actuaries Should be an Expert of Both Moderator: David L. Snell, ASA, MAAA Presenters: Matthias Kullowatz Kenneth Warren Pagington, FSA, CERA, MAAA Qichun (Richard) Xu, FSA

More information

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting Quantile Regression By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting Agenda Overview of Predictive Modeling for P&C Applications Quantile

More information

Topic-based vector space modeling of Twitter data with application in predictive analytics

Topic-based vector space modeling of Twitter data with application in predictive analytics Topic-based vector space modeling of Twitter data with application in predictive analytics Guangnan Zhu (U6023358) Australian National University COMP4560 Individual Project Presentation Supervisor: Dr.

More information

Credit Card Default Predictive Modeling

Credit Card Default Predictive Modeling Credit Card Default Predictive Modeling Background: Predicting credit card payment default is critical for the successful business model of a credit card company. An accurate predictive model can help

More information

Predicting Foreign Exchange Arbitrage

Predicting Foreign Exchange Arbitrage Predicting Foreign Exchange Arbitrage Stefan Huber & Amy Wang 1 Introduction and Related Work The Covered Interest Parity condition ( CIP ) should dictate prices on the trillion-dollar foreign exchange

More information

Are New Modeling Techniques Worth It?

Are New Modeling Techniques Worth It? Are New Modeling Techniques Worth It? Tom Zougas PhD PEng, Manager Data Science, TransUnion TORONTO SAS USER GROUP MAY 2, 2018 Are New Modeling Techniques Worth It? Presenter Tom Zougas PhD PEng, Manager

More information

2017 IAA EDUCATION SYLLABUS

2017 IAA EDUCATION SYLLABUS 2017 IAA EDUCATION SYLLABUS 1. STATISTICS Aim: To enable students to apply core statistical techniques to actuarial applications in insurance, pensions and emerging areas of actuarial practice. 1.1 RANDOM

More information

Moderator: Missy A Gordon FSA,MAAA. Presenters: Missy A Gordon FSA,MAAA Roger Loomis FSA,MAAA

Moderator: Missy A Gordon FSA,MAAA. Presenters: Missy A Gordon FSA,MAAA Roger Loomis FSA,MAAA Session 52PD: Financial Analysis: Impairment, Stress Testing and Predictive Modeling for Health Companies Moderator: Missy A Gordon FSA,MAAA Presenters: Missy A Gordon FSA,MAAA Roger Loomis FSA,MAAA SOA

More information

Loan Approval and Quality Prediction in the Lending Club Marketplace

Loan Approval and Quality Prediction in the Lending Club Marketplace Loan Approval and Quality Prediction in the Lending Club Marketplace Milestone Write-up Yondon Fu, Shuo Zheng and Matt Marcus Recap Lending Club is a peer-to-peer lending marketplace where individual investors

More information

Loan Approval and Quality Prediction in the Lending Club Marketplace

Loan Approval and Quality Prediction in the Lending Club Marketplace Loan Approval and Quality Prediction in the Lending Club Marketplace Final Write-up Yondon Fu, Matt Marcus and Shuo Zheng Introduction Lending Club is a peer-to-peer lending marketplace where individual

More information

Least Squares Monte Carlo (LSMC) life and annuity application Prepared for Institute of Actuaries of Japan

Least Squares Monte Carlo (LSMC) life and annuity application Prepared for Institute of Actuaries of Japan Least Squares Monte Carlo (LSMC) life and annuity application Prepared for Institute of Actuaries of Japan February 3, 2015 Agenda A bit of theory Overview of application Case studies Final remarks 2 Least

More information

MWSUG Paper AA 04. Claims Analytics. Mei Najim, Gallagher Bassett Services, Rolling Meadows, IL

MWSUG Paper AA 04. Claims Analytics. Mei Najim, Gallagher Bassett Services, Rolling Meadows, IL MWSUG 2017 - Paper AA 04 Claims Analytics Mei Najim, Gallagher Bassett Services, Rolling Meadows, IL ABSTRACT In the Property & Casualty Insurance industry, advanced analytics has increasingly penetrated

More information

PREDICTIVE ANALYTICS EVI TEDJASUKMANA 26 OCTOBER 2017 PERSATUAN AKTUARIS INDONESIA (THE SOCIETY OF ACTUARIES OF INDONESIA)

PREDICTIVE ANALYTICS EVI TEDJASUKMANA 26 OCTOBER 2017 PERSATUAN AKTUARIS INDONESIA (THE SOCIETY OF ACTUARIES OF INDONESIA) PREDICTIVE ANALYTICS EVI TEDJASUKMANA 26 OCTOBER 2017 Agenda 1. Predictive analytics why we need it? 2. Sample analytics 1 propensity to buy 3. Sample analytics 2 predictive underwriting 2 BACKGROUND Why

More information

Internet Appendix. Additional Results. Figure A1: Stock of retail credit cards over time

Internet Appendix. Additional Results. Figure A1: Stock of retail credit cards over time Internet Appendix A Additional Results Figure A1: Stock of retail credit cards over time Stock of retail credit cards by month. Time of deletion policy noted with vertical line. Figure A2: Retail credit

More information

Wide and Deep Learning for Peer-to-Peer Lending

Wide and Deep Learning for Peer-to-Peer Lending Wide and Deep Learning for Peer-to-Peer Lending Kaveh Bastani 1 *, Elham Asgari 2, Hamed Namavari 3 1 Unifund CCR, LLC, Cincinnati, OH 2 Pamplin College of Business, Virginia Polytechnic Institute, Blacksburg,

More information

Behavioral patterns of long term saving : Predictive analysis of adverse behaviors on a savings portfolio

Behavioral patterns of long term saving : Predictive analysis of adverse behaviors on a savings portfolio Behavioral patterns of long term saving : Predictive analysis of adverse behaviors on a savings portfolio Introduction What is the context of this case study and what about the underlying challenges? Introduction

More information

Practical example of an Economic Scenario Generator

Practical example of an Economic Scenario Generator Practical example of an Economic Scenario Generator Martin Schenk Actuarial & Insurance Solutions SAV 7 March 2014 Agenda Introduction Deterministic vs. stochastic approach Mathematical model Application

More information

Xiaoli Jin and Edward W. (Jed) Frees. August 6, 2013

Xiaoli Jin and Edward W. (Jed) Frees. August 6, 2013 Xiaoli and Edward W. (Jed) Frees Department of Actuarial Science, Risk Management, and Insurance University of Wisconsin Madison August 6, 2013 1 / 20 Outline 1 2 3 4 5 6 2 / 20 for P&C Insurance Occurrence

More information

Expanding Predictive Analytics Through the Use of Machine Learning

Expanding Predictive Analytics Through the Use of Machine Learning Expanding Predictive Analytics Through the Use of Machine Learning Thursday, February 28, 2013, 11:10 a.m. Chris Cooksey, FCAS, MAAA Chief Actuary EagleEye Analytics Columbia, S.C. Christopher Cooksey,

More information

An introduction to Machine learning methods and forecasting of time series in financial markets

An introduction to Machine learning methods and forecasting of time series in financial markets An introduction to Machine learning methods and forecasting of time series in financial markets Mark Wong markwong@kth.se December 10, 2016 Abstract The goal of this paper is to give the reader an introduction

More information

Fundamentals of Catastrophe Modelling. Ben Miliauskas Aon Benfield

Fundamentals of Catastrophe Modelling. Ben Miliauskas Aon Benfield Fundamentals of Catastrophe Modelling Ben Miliauskas Aon Benfield Commonly used in Insurance Experience GLM Exposure Sales and Distribution Claims Reserving Economic Scenario Generators Insurance companies

More information

Content Added to the Updated IAA Education Syllabus

Content Added to the Updated IAA Education Syllabus IAA EDUCATION COMMITTEE Content Added to the Updated IAA Education Syllabus Prepared by the Syllabus Review Taskforce Paul King 8 July 2015 This proposed updated Education Syllabus has been drafted by

More information

HEALTH ACTUARIES AND BIG DATA

HEALTH ACTUARIES AND BIG DATA HEALTH ACTUARIES AND BIG DATA What is Big Data? The term Big Data does not only refer to very large datasets. It is typically understood to refer to high volumes of data, requiring high velocity of ingestion

More information

Using Least Squares Monte Carlo techniques in insurance with R

Using Least Squares Monte Carlo techniques in insurance with R Using Least Squares Monte Carlo techniques in insurance with R Sébastien de Valeriola sebastiendevaleriola@reacfincom Amsterdam, June 29 th 2015 Solvency II The major difference between Solvency I and

More information

Subject CS2A Risk Modelling and Survival Analysis Core Principles

Subject CS2A Risk Modelling and Survival Analysis Core Principles ` Subject CS2A Risk Modelling and Survival Analysis Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who

More information

Prediction for Stock Marketing Using Machine Learning

Prediction for Stock Marketing Using Machine Learning Prediction for Stock Marketing Using Machine Learning Shubham Jain Student, Department of Information Technology Maharaja Agrasen Institute of Technology, Delhi, India shjain6670@gmail.com Mark Kain Student,

More information

How the Casualty Actuarial Society Is Breaking the Mould

How the Casualty Actuarial Society Is Breaking the Mould How the Casualty Actuarial Society Is Breaking the Mould Presented by: Jim Christie, President Elect, CAS Caribbean Actuarial Association Conference Nassau, Bahamas November 30, 2017 Agenda CAS Highlights

More information

Fast Convergence of Regress-later Series Estimators

Fast Convergence of Regress-later Series Estimators Fast Convergence of Regress-later Series Estimators New Thinking in Finance, London Eric Beutner, Antoon Pelsser, Janina Schweizer Maastricht University & Kleynen Consultants 12 February 2014 Beutner Pelsser

More information

Forecasting & Futurism

Forecasting & Futurism Article from: Forecasting & Futurism December 2013 Issue 8 PREDICTIVE MODELING IN INSURANCE Modeling Process By Richard Xu In the July 2013 issue of the Forecasting & Futurism Newsletter, we introduced

More information

Stock Prediction Using Twitter Sentiment Analysis

Stock Prediction Using Twitter Sentiment Analysis Problem Statement Stock Prediction Using Twitter Sentiment Analysis Stock exchange is a subject that is highly affected by economic, social, and political factors. There are several factors e.g. external

More information

International Journal of Advance Engineering and Research Development REVIEW ON PREDICTION SYSTEM FOR BANK LOAN CREDIBILITY

International Journal of Advance Engineering and Research Development REVIEW ON PREDICTION SYSTEM FOR BANK LOAN CREDIBILITY Scientific Journal of Impact Factor (SJIF): 4.72 International Journal of Advance Engineering and Research Development Volume 4, Issue 12, December -2017 e-issn (O): 2348-4470 p-issn (P): 2348-6406 REVIEW

More information

Individual Claims Reserving with Stan

Individual Claims Reserving with Stan Individual Claims Reserving with Stan August 29, 216 The problem The problem Desire for individual claim analysis - don t throw away data. We re all pretty comfortable with GLMs now. Let s go crazy with

More information

Machine Learning in High Frequency Algorithmic Trading

Machine Learning in High Frequency Algorithmic Trading Machine Learning in High Frequency Algorithmic Trading I am very excited to be talking to you today about not just my experience with High Frequency Trading, but also about opportunities of technology

More information

Financial Computing with Python

Financial Computing with Python Introduction to Financial Computing with Python Matthieu Mariapragassam Why coding seems so easy? But is actually not Sprezzatura : «It s an art that doesn t seem to be an art» - The Book of the Courtier

More information

Economic Capital Modeling

Economic Capital Modeling Economic Capital Modeling Proxy Model Implementation Experience Clint Thompson Chief Risk Officer, Hannover Life Reassurance Co. of America ERM Symposium June 2015 Agenda 1. Risk appetite and linkage to

More information

Analyzing Representational Schemes of Financial News Articles

Analyzing Representational Schemes of Financial News Articles Analyzing Representational Schemes of Financial News Articles Robert P. Schumaker Information Systems Dept. Iona College, New Rochelle, New York 10801, USA rschumaker@iona.edu Word Count: 2460 Abstract

More information

Milestone2. Zillow House Price Prediciton. Group: Lingzi Hong and Pranali Shetty

Milestone2. Zillow House Price Prediciton. Group: Lingzi Hong and Pranali Shetty Milestone2 Zillow House Price Prediciton Group Lingzi Hong and Pranali Shetty MILESTONE 2 REPORT Data Collection The following additional features were added 1. Population, Number of College Graduates

More information

Stay or Go? The science of departures from superannuation funds

Stay or Go? The science of departures from superannuation funds Stay or Go? The science of departures from superannuation funds Actuaries Summit 2017 22 May 2017 SYDNEY MELBOURNE ABN 35 003 186 883 Level 1 Level 20 AFSL 239 191 2 Martin Place Sydney NSW 2000 303 Collins

More information

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18,   ISSN Volume XII, Issue II, Feb. 18, www.ijcea.com ISSN 31-3469 AN INVESTIGATION OF FINANCIAL TIME SERIES PREDICTION USING BACK PROPAGATION NEURAL NETWORKS K. Jayanthi, Dr. K. Suresh 1 Department of Computer

More information

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley. Appendix: Statistics in Action Part I Financial Time Series 1. These data show the effects of stock splits. If you investigate further, you ll find that most of these splits (such as in May 1970) are 3-for-1

More information

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, ISSN

International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18,   ISSN International Journal of Computer Engineering and Applications, Volume XII, Issue II, Feb. 18, www.ijcea.com ISSN 31-3469 AN INVESTIGATION OF FINANCIAL TIME SERIES PREDICTION USING BACK PROPAGATION NEURAL

More information

2018 Predictive Analytics Symposium Session 10: Cracking the Black Box with Awareness & Validation

2018 Predictive Analytics Symposium Session 10: Cracking the Black Box with Awareness & Validation 2018 Predictive Analytics Symposium Session 10: Cracking the Black Box with Awareness & Validation SOA Antitrust Compliance Guidelines SOA Presentation Disclaimer Cracking the Black Box with Awareness

More information

INNOVATING FOR SUCCESS:

INNOVATING FOR SUCCESS: 2nd ASHK General Insurance & Data Analytics Seminar INNOVATING FOR SUCCESS: Insurance Technology and Analytics to Profit and Thrive Friday 7 October 2016, Hotel ICON, Tsim Sha Tsui East, Kowloon, Hong

More information

Improving Stock Price Prediction with SVM by Simple Transformation: The Sample of Stock Exchange of Thailand (SET)

Improving Stock Price Prediction with SVM by Simple Transformation: The Sample of Stock Exchange of Thailand (SET) Thai Journal of Mathematics Volume 14 (2016) Number 3 : 553 563 http://thaijmath.in.cmu.ac.th ISSN 1686-0209 Improving Stock Price Prediction with SVM by Simple Transformation: The Sample of Stock Exchange

More information

ECONOMIC CAPITAL MODELING CARe Seminar JUNE 2016

ECONOMIC CAPITAL MODELING CARe Seminar JUNE 2016 ECONOMIC CAPITAL MODELING CARe Seminar JUNE 2016 Boston Catherine Eska The Hanover Insurance Group Paul Silberbush Guy Carpenter & Co. Ronald Wilkins - PartnerRe Economic Capital Modeling Safe Harbor Notice

More information

Application of Soft-Computing Techniques in Accident Compensation

Application of Soft-Computing Techniques in Accident Compensation Application of Soft-Computing Techniques in Accident Compensation Prepared by Peter Mulquiney Taylor Fry Consulting Actuaries Presented to the Institute of Actuaries of Australia Accident Compensation

More information

Foreign Exchange Forecasting via Machine Learning

Foreign Exchange Forecasting via Machine Learning Foreign Exchange Forecasting via Machine Learning Christian González Rojas cgrojas@stanford.edu Molly Herman mrherman@stanford.edu I. INTRODUCTION The finance industry has been revolutionized by the increased

More information

Naïve Bayesian Classifier and Classification Trees for the Predictive Accuracy of Probability of Default Credit Card Clients

Naïve Bayesian Classifier and Classification Trees for the Predictive Accuracy of Probability of Default Credit Card Clients American Journal of Data Mining and Knowledge Discovery 2018; 3(1): 1-12 http://www.sciencepublishinggroup.com/j/ajdmkd doi: 10.11648/j.ajdmkd.20180301.11 Naïve Bayesian Classifier and Classification Trees

More information

Abstract. Estimating accurate settlement amounts early in a. claim lifecycle provides important benefits to the

Abstract. Estimating accurate settlement amounts early in a. claim lifecycle provides important benefits to the Abstract Estimating accurate settlement amounts early in a claim lifecycle provides important benefits to the claims department of a Property Casualty insurance company. Advanced statistical modeling along

More information

A new look at tree based approaches

A new look at tree based approaches A new look at tree based approaches Xifeng Wang University of North Carolina Chapel Hill xifeng@live.unc.edu April 18, 2018 Xifeng Wang (UNC-Chapel Hill) Short title April 18, 2018 1 / 27 Outline of this

More information

The Pennsylvania State University. The Graduate School. Department of Industrial Engineering AMERICAN-ASIAN OPTION PRICING BASED ON MONTE CARLO

The Pennsylvania State University. The Graduate School. Department of Industrial Engineering AMERICAN-ASIAN OPTION PRICING BASED ON MONTE CARLO The Pennsylvania State University The Graduate School Department of Industrial Engineering AMERICAN-ASIAN OPTION PRICING BASED ON MONTE CARLO SIMULATION METHOD A Thesis in Industrial Engineering and Operations

More information

Predictive Risk Categorization of Retail Bank Loans Using Data Mining Techniques

Predictive Risk Categorization of Retail Bank Loans Using Data Mining Techniques National Conference on Recent Advances in Computer Science and IT (NCRACIT) International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume

More information

Use of GLMs in a competitive market. Ji Yao and Simon Yeung, Advanced Pricing Techniques (APT) GIRO Working Party

Use of GLMs in a competitive market. Ji Yao and Simon Yeung, Advanced Pricing Techniques (APT) GIRO Working Party Use of GLMs in a competitive market Ji Yao and Simon Yeung, Advanced Pricing Techniques (APT) GIRO Working Party 12 June 2013 About the presenters Dr. Ji Yao is a manager with Ernst & Young s EMEIA insurance

More information

Lending Club Loan Portfolio Optimization Fred Robson (frobson), Chris Lucas (cflucas)

Lending Club Loan Portfolio Optimization Fred Robson (frobson), Chris Lucas (cflucas) CS22 Artificial Intelligence Stanford University Autumn 26-27 Lending Club Loan Portfolio Optimization Fred Robson (frobson), Chris Lucas (cflucas) Overview Lending Club is an online peer-to-peer lending

More information

Application of Big Data Analytics via Soft Computing. Yunus Yetis

Application of Big Data Analytics via Soft Computing. Yunus Yetis Application of Big Data Analytics via Soft Computing Yunus Yetis INTRODUCTION Ø System of Systems (SoS) and cyberphysic are integrated, independently operating systems working in a cooperative mode to

More information

Intro to Quant Investing

Intro to Quant Investing Intro to Quant Investing Brainteaser Problem: A drawer contains 2 red and 8 black pens. Alice and Bob randomly take pens from the drawer until a red pen is selected. Alice selects the first pen, then Bob

More information

Stock Market Predictor and Analyser using Sentimental Analysis and Machine Learning Algorithms

Stock Market Predictor and Analyser using Sentimental Analysis and Machine Learning Algorithms Volume 119 No. 12 2018, 15395-15405 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Stock Market Predictor and Analyser using Sentimental Analysis and Machine Learning Algorithms 1

More information

Compliance and Regulatory Reports with kdb+ May 24, 2018

Compliance and Regulatory Reports with kdb+ May 24, 2018 Compliance and Regulatory Reports with kdb+ May 24, 2018 Compliance and regulatory professionals are tasked with developing effective and efficient surveillance and supervision programs for increasingly

More information

arxiv: v1 [q-fin.rm] 13 Dec 2016

arxiv: v1 [q-fin.rm] 13 Dec 2016 arxiv:1612.04126v1 [q-fin.rm] 13 Dec 2016 The hierarchical generalized linear model and the bootstrap estimator of the error of prediction of loss reserves in a non-life insurance company Alicja Wolny-Dominiak

More information

2017 IAA EDUCATION GUIDELINES

2017 IAA EDUCATION GUIDELINES 2017 IAA EDUCATION GUIDELINES 1. An IAA Education Syllabus and Guidelines were approved by the International Forum of Actuarial Associations (IFAA) in June 1998, prior to the creation of the IAA. This

More information

Investing through Economic Cycles with Ensemble Machine Learning Algorithms

Investing through Economic Cycles with Ensemble Machine Learning Algorithms Investing through Economic Cycles with Ensemble Machine Learning Algorithms Thomas Raffinot Silex Investment Partners Big Data in Finance Conference Thomas Raffinot (Silex-IP) Economic Cycles-Machine Learning

More information

Using Advanced Analytics to Identify Fraud in Property and Casualty Insurance

Using Advanced Analytics to Identify Fraud in Property and Casualty Insurance AN EXL WHITE PAPER Using Advanced Analytics to Identify Fraud in Property and Casualty Insurance Neeraj Sibal Senior Manager, Analytics lookdeeper@exlservice.com The Coalition Against Insurance Fraud (CAIF)

More information

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018 ` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.

More information

Homeowners Ratemaking Revisited

Homeowners Ratemaking Revisited Why Modeling? For lines of business with catastrophe potential, we don t know how much past insurance experience is needed to represent possible future outcomes and how much weight should be assigned to

More information

Predictive Modelling. Document Turning Big Data into Big Opportunities

Predictive Modelling. Document Turning Big Data into Big Opportunities Predictive Modelling Document 218081 Turning Big Data into Big Opportunities Essays on Predictive Modelling: Turning Big Data into Big Opportunities In recent years, data has become a key driver of economic

More information

Intro to GLM Day 2: GLM and Maximum Likelihood

Intro to GLM Day 2: GLM and Maximum Likelihood Intro to GLM Day 2: GLM and Maximum Likelihood Federico Vegetti Central European University ECPR Summer School in Methods and Techniques 1 / 32 Generalized Linear Modeling 3 steps of GLM 1. Specify the

More information

of Complex Systems to ERM and Actuarial Work

of Complex Systems to ERM and Actuarial Work Developments in the Application of Complex Systems to ERM and Actuarial Work Joshua Corrigan, Milliman Milliman Agenda Overview of Complex Systems Sciences Strategic Risk Application and Example Operational

More information

A Multi-topic Approach to Building Quant Models. Bringing Semantic Intelligence to Financial Markets

A Multi-topic Approach to Building Quant Models. Bringing Semantic Intelligence to Financial Markets A Multi-topic Approach to Building Quant Models Bringing Semantic Intelligence to Financial Markets Data is growing at an incredible speed Source: IDC - 2014, Structured Data vs. Unstructured Data: The

More information

1 Introduction. Term Paper: The Hall and Taylor Model in Duali 1. Yumin Li 5/8/2012

1 Introduction. Term Paper: The Hall and Taylor Model in Duali 1. Yumin Li 5/8/2012 Term Paper: The Hall and Taylor Model in Duali 1 Yumin Li 5/8/2012 1 Introduction In macroeconomics and policy making arena, it is extremely important to have the ability to manipulate a set of control

More information

A positive outlook on auto-enrolment contributions phasing. High

A positive outlook on auto-enrolment contributions phasing. High A positive outlook on auto-enrolment contributions phasing High Summary UK businesses are focusing on securing the organisation s future by strengthening their competitive position, increasing revenue

More information

Reserving in the Pressure Cooker (General Insurance TORP Working Party) 18 May William Diffey Laura Hobern Asif John

Reserving in the Pressure Cooker (General Insurance TORP Working Party) 18 May William Diffey Laura Hobern Asif John Reserving in the Pressure Cooker (General Insurance TORP Working Party) 18 May 2018 William Diffey Laura Hobern Asif John Disclaimer The views expressed in this presentation are those of the presenter(s)

More information

Web Science & Technologies University of Koblenz Landau, Germany. Lecture Data Science. Statistics and Probabilities JProf. Dr.

Web Science & Technologies University of Koblenz Landau, Germany. Lecture Data Science. Statistics and Probabilities JProf. Dr. Web Science & Technologies University of Koblenz Landau, Germany Lecture Data Science Statistics and Probabilities JProf. Dr. Claudia Wagner Data Science Open Position @GESIS Student Assistant Job in Data

More information

F U T U R E O F W O R K A N D I M P L I C A T I O N S F O R S O U T H A U S T R A L I A

F U T U R E O F W O R K A N D I M P L I C A T I O N S F O R S O U T H A U S T R A L I A F U T U R E O F W O R K A N D I M P L I C A T I O N S F O R S O U T H A U S T R A L I A A I I A N a v i g a t i n g Te c h n o l o g y a n d t h e J o b s o f t h e F u t u r e T h u r s d a y 1 2 t h

More information

A Cash Flow-Based Approach to Estimate Default Probabilities

A Cash Flow-Based Approach to Estimate Default Probabilities A Cash Flow-Based Approach to Estimate Default Probabilities Francisco Hawas Faculty of Physical Sciences and Mathematics Mathematical Modeling Center University of Chile Santiago, CHILE fhawas@dim.uchile.cl

More information

Stock Price Prediction using Recurrent Neural Network (RNN) Algorithm on Time-Series Data

Stock Price Prediction using Recurrent Neural Network (RNN) Algorithm on Time-Series Data Stock Price Prediction using Recurrent Neural Network (RNN) Algorithm on Time-Series Data Israt Jahan Department of Computer Science and Operations Research North Dakota State University Fargo, ND 58105

More information

Deep learning analysis of limit order book

Deep learning analysis of limit order book Washington University in St. Louis Washington University Open Scholarship Arts & Sciences Electronic Theses and Dissertations Arts & Sciences Spring 5-18-2018 Deep learning analysis of limit order book

More information

Making sense of Schedule Risk Analysis

Making sense of Schedule Risk Analysis Making sense of Schedule Risk Analysis John Owen Barbecana Inc. Version 2 December 19, 2014 John Owen - jowen@barbecana.com 2 5 Years managing project controls software in the Oil and Gas industry 28 years

More information

And The Winner Is? How to Pick a Better Model

And The Winner Is? How to Pick a Better Model And The Winner Is? How to Pick a Better Model Part 2 Goodness-of-Fit and Internal Stability Dan Tevet, FCAS, MAAA Goodness-of-Fit Trying to answer question: How well does our model fit the data? Can be

More information

Gradient Boosting Trees: theory and applications

Gradient Boosting Trees: theory and applications Gradient Boosting Trees: theory and applications Dmitry Efimov November 05, 2016 Outline Decision trees Boosting Boosting trees Metaparameters and tuning strategies How-to-use remarks Regression tree True

More information

Article from. The Actuary. October/November 2015 Issue 5

Article from. The Actuary. October/November 2015 Issue 5 Article from The Actuary October/November 2015 Issue 5 FEATURE PREDICTIVE ANALYTICS THE USE OF PREDICTIVE ANALYTICS IN THE DEVELOPMENT OF EXPERIENCE STUDIES Recently, predictive analytics has drawn a lot

More information

A case study on using generalized additive models to fit credit rating scores

A case study on using generalized additive models to fit credit rating scores Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS071) p.5683 A case study on using generalized additive models to fit credit rating scores Müller, Marlene Beuth University

More information

Machine Learning Performance over Long Time Frame

Machine Learning Performance over Long Time Frame Machine Learning Performance over Long Time Frame Yazhe Li, Tony Bellotti, Niall Adams Imperial College London yli16@imperialacuk Credit Scoring and Credit Control Conference, Aug 2017 Yazhe Li (Imperial

More information