FAV i R This paper is produced mechanically as part of FAViR. See for more information.

Similar documents
FAV i R This paper is produced mechanically as part of FAViR. See for more information.

REINSURANCE RATE-MAKING WITH PARAMETRIC AND NON-PARAMETRIC MODELS

Modelling Premium Risk for Solvency II: from Empirical Data to Risk Capital Evaluation

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay. Solutions to Final Exam

Introduction to Algorithmic Trading Strategies Lecture 8

Measuring Financial Risk using Extreme Value Theory: evidence from Pakistan

An Introduction to Statistical Extreme Value Theory

An Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1

A New Hybrid Estimation Method for the Generalized Pareto Distribution

A Comparison Between Skew-logistic and Skew-normal Distributions

Tail fitting probability distributions for risk management purposes

MEASURING EXTREME RISKS IN THE RWANDA STOCK MARKET

Estimating Parameters for Incomplete Data. William White

Mongolia s TOP-20 Index Risk Analysis, Pt. 3

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright

Fitting the generalized Pareto distribution to commercial fire loss severity: evidence from Taiwan

Lecture 6: Non Normal Distributions

Introduction to Computational Finance and Financial Econometrics Descriptive Statistics

David R. Clark. Presented at the: 2013 Enterprise Risk Management Symposium April 22-24, 2013

A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology

ARCH Proceedings

The extreme downside risk of the S P 500 stock index

Fitting financial time series returns distributions: a mixture normality approach

Analysis of the Oil Spills from Tanker Ships. Ringo Ching and T. L. Yip

Risk Management and Time Series

Discussion of Elicitability and backtesting: Perspectives for banking regulation

Modelling and Management of Cyber Risk

Introduction to R (2)

AN EXTREME VALUE APPROACH TO PRICING CREDIT RISK

The Application of the Theory of Power Law Distributions to U.S. Wealth Accumulation INTRODUCTION DATA

Financial Econometrics (FinMetrics04) Time-series Statistics Concepts Exploratory Data Analysis Testing for Normality Empirical VaR

Extreme Values Modelling of Nairobi Securities Exchange Index

MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL

Financial Risk Forecasting Chapter 9 Extreme Value Theory

Advanced Extremal Models for Operational Risk

Technical Analysis of Capital Market Data in R - First Steps

Lecture 1: Empirical Properties of Returns

Modeling. joint work with Jed Frees, U of Wisconsin - Madison. Travelers PASG (Predictive Analytics Study Group) Seminar Tuesday, 12 April 2016

The Use of Penultimate Approximations in Risk Management

PROBLEMS OF WORLD AGRICULTURE

Modelling Joint Distribution of Returns. Dr. Sawsan Hilal space

ANALYSIS OF EXTREME DEPENDENCE BETWEEN ISTANBUL STOCK EXCHANGE AND OIL RETURNS Gözde Ünal, Bogazici University Derya Korman, Bogazici University

John Cotter and Kevin Dowd

QUANTIFICATION OF OPERATIONAL RISKS IN BANKS: A THEORETICAL ANALYSIS WITH EMPRICAL TESTING

ADVANCED OPERATIONAL RISK MODELLING IN BANKS AND INSURANCE COMPANIES

Window Width Selection for L 2 Adjusted Quantile Regression

International Business & Economics Research Journal January/February 2015 Volume 14, Number 1

Master s in Financial Engineering Foundations of Buy-Side Finance: Quantitative Risk and Portfolio Management. > Teaching > Courses

The actuar Package. March 24, bstraub... 1 hachemeister... 3 panjer... 4 rearrangepf... 5 simpf Index 8. Buhlmann-Straub Credibility Model

Scaling conditional tail probability and quantile estimators

Rating Exotic Price Coverage in Crop Revenue Insurance

Analysis of extreme values with random location Abstract Keywords: 1. Introduction and Model

LAST SECTION!!! 1 / 36

Risk Analysis for Three Precious Metals: An Application of Extreme Value Theory

Modelling catastrophic risk in international equity markets: An extreme value approach. JOHN COTTER University College Dublin

2. Copula Methods Background

Subject CS2A Risk Modelling and Survival Analysis Core Principles

Modeling the extremes of temperature time series. Debbie J. Dupuis Department of Decision Sciences HEC Montréal

Modelling Environmental Extremes

Financial Risk 2-nd quarter 2012/2013 Tuesdays Thursdays in MVF31 and Pascal

Operational Risk: Evidence, Estimates and Extreme Values from Austria

ก ก ก ก ก ก ก. ก (Food Safety Risk Assessment Workshop) 1 : Fundamental ( ก ( NAC 2010)) 2 3 : Excel and Statistics Simulation Software\

THE DYNAMICS OF PRECIOUS METAL MARKETS VAR: A GARCH-TYPE APPROACH. Yue Liang Master of Science in Finance, Simon Fraser University, 2018.

Modelling Environmental Extremes

Paper Series of Risk Management in Financial Institutions

Relative Error of the Generalized Pareto Approximation. to Value-at-Risk

Quantile Regression due to Skewness. and Outliers

Lecture 2 Describing Data

TEACHERS RETIREMENT BOARD INVESTMENT COMMITTEE

Bivariate Extreme Value Analysis of Commodity Prices. Matthew Joyce BSc. Economics, University of Victoria, 2011

An Application of Data Fusion Techniques in Quantitative Operational Risk Management

Economic factors and solvency

Modeling Credit Risk of Loan Portfolios in the Presence of Autocorrelation (Part 2)

STAT758. Final Project. Time series analysis of daily exchange rate between the British Pound and the. US dollar (GBP/USD)

Introduction Recently the importance of modelling dependent insurance and reinsurance risks has attracted the attention of actuarial practitioners and

Portfolio modelling of operational losses John Gavin 1, QRMS, Risk Control, UBS, London. April 2004.

Describing Data: One Quantitative Variable

The Leveled Chain Ladder Model. for Stochastic Loss Reserving

Data Analysis and Statistical Methods Statistics 651

Modelling insured catastrophe losses

Exploring Data and Graphics

Market Risk Analysis Volume I

Honor Code: By signing my name below, I pledge my honor that I have not violated the Booth Honor Code during this examination.

Extreme Value Analysis for Partitioned Insurance Losses

Longitudinal Modeling of Insurance Company Expenses

Parametric Statistics: Exploring Assumptions.

Case Study: Applying Generalized Linear Models

Portfolio Optimization. Prof. Daniel P. Palomar

Forecasting Value-at-Risk using GARCH and Extreme-Value-Theory Approaches for Daily Returns

Modeling Clusters of Extreme Losses

Multivariate longitudinal data analysis for actuarial applications

Probability Weighted Moments. Andrew Smith

Regression and Simulation

Folded- and Log-Folded-t Distributions as Models for Insurance Loss Data

yuimagui: A graphical user interface for the yuima package. User Guide yuimagui v1.0

Occasional Paper. Risk Measurement Illiquidity Distortions. Jiaqi Chen and Michael L. Tindall

STA 248 H1S Winter 2008 Assignment 1 Solutions

Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Paul J. Hilliard, Educational Testing Service (ETS)

Transcription:

The POT package By Avraham Adler FAV i R This paper is produced mechanically as part of FAViR. See http://www.favir.net for more information. Abstract This paper is intended to briefly demonstrate the POT package for use in Extreme Value Theory. 1 Introduction Extreme Value Theory (EVT) is one of the methods used by actuaries to estimate the tails of loss severity distributions. McNeil [3] discusses how the Generalized Pareto distribution (GPD) can be used to to model the tails of extreme events. 1 There exist a number of packages for the R statistical platform which may be used to investigate data in this framework. One of them is called POT, or the Peaks Over Threshold package [4]. The package does more than mere Generalized Pareto fitting, but lends itself nicely to such. This brief paper assumes a basic knowledge of EVT, and is focused on demonstrating the use of the POT package. 2 Example Using actuar [1] we can create a dataset for investigation. We will set a specific seed so that the results are reproducible. > set.seed(254) > test.data <- rpareto(n = 1000, shape = 1.5, scale = 100000) Let s take a look at this data. Percentiles are below and histograms are in figures 1 and 2. 1 A distinction must be made between the Generalized Pareto of Extreme Value Theory and the Generalized Pareto of actuarial literature as defined in Klugman, Panjer, and Wilmot (KPW) [2]. The GPD of EVT is a two-parameter pareto distribution where the shape and scale factors are less correlated than the classic two-parameter pareto. The GPD of KPW is a three-parameter distribution. 1

800 600 Count 400 200 0 0 1000000 2000000 3000000 4000000 5000000 6000000 Value Figure 1: Basic Histogram > summary(test.data) Min. 1st Qu. Median Mean 3rd Qu. Max. 40.2 18700.0 59400.0 164000.0 155000.0 6110000.0 In EVT analysis, one often wants to identify the threshold over which the tail exhibits Pareto behavior. One of the primary tools used is the Sample Mean Excess or Mean Residual Life plot. Where this plot begins to appears linear is often a decent estimate of an appropriate threshold. The POT package has a function to display such a plot: mrlplot. > par(mfrow = c(1, 2)) > mrlplot(test.data) > mrlplot(test.data, xlim = c(0, 1000000)) 2 FAV i R

100 80 Count 60 40 20 0 10 2 10 3 10 4 10 5 10 6 10 7 Value Figure 2: Log-scale Histogram Mean Residual Life Plot Mean Residual Life Plot Mean Excess 500000 1500000 2500000 Mean Excess 500000 1500000 2500000 0 1000000 2500000 Threshold 0 400000 1000000 3 Threshold FAV i R

Looking at the plot, a reasonable selection for the threshold would be 300,000. Once the threashold is selected, POT uses the fitgpd command to fit a GPD with the selected threshold. > GPD1 <- fitgpd(test.data, threshold = 300000) > GPD1 Estimator: MLE Deviance: 3212.4 AIC: 3216.4 Varying Threshold: FALSE Threshold Call: 300000 Number Above: 113 Proportion Above: 0.113 Estimates 586836.724 0.247 Standard Error Type: expected Standard Errors 87174.571 0.117 Asymptotic Variance Covariance scale 7.60e+09-6.47e+03 shape -6.47e+03 1.38e-02 Optimization Information Convergence: successful Function Evaluations: 14 Gradient Evaluations: 6 The default parameters that fitgpd passes to optim often prevent good convergence, so it pays to re-run the optimization passing a vector of parameter scales. > GPD2 <- fitgpd(test.data, threshold = 300000, control = list(parscale = c(100000, + 0.1))) > GPD2 4 FAV i R

Estimator: MLE Deviance: 3192 AIC: 3196 Varying Threshold: FALSE Threshold Call: 300000 Number Above: 113 Proportion Above: 0.113 Estimates 279868.290 0.582 Standard Error Type: observed Standard Errors 48091.411 0.154 Asymptotic Variance Covariance scale 2.31e+09-4.34e+03 shape -4.34e+03 2.37e-02 Optimization Information Convergence: successful Function Evaluations: 18 Gradient Evaluations: 11 Note how the fit is now significantly better. Lastly, POT comes with built-in plotting methods, so fits can be analyzed and compared. Below, the two GPD fits will be plotted using default methods. > par(mfrow = c(2, 2)) > plot(gpd1) 5 FAV i R

Probability plot QQplot Model 0.0 0.4 0.8 Empirical 1000000 4000000 0.0 0.2 0.4 0.6 0.8 1.0 1000000 4000000 7000000 Empirical Model Density Plot Return Level Plot Density 0.0000000 0.0000010 2000000 6000000 Return Level 0 4000000 8000000 1 2 5 20 50 200 Quantile Return Period (Years) 6 FAV i R

3 BIBLIOGRAPHY > par(mfrow = c(2, 2)) > plot(gpd2) Probability plot QQplot Model 0.0 0.4 0.8 Empirical 1000000 4000000 0.0 0.2 0.4 0.6 0.8 1.0 0 4000000 8000000 Empirical Model Density 0.0000000 0.0000025 Density Plot 2000000 6000000 Return Level 0 6000000 14000000 Return Level Plot 1 2 5 20 50 200 Quantile Return Period (Years) The POT package contains much more functionality than Generalized Pareto fitting and there are other EVT packages which can be found on CRAN such as evir, evd, etc. 3 Bibliography 1. Christophe Dutang, Vincent Goulet, and Mathieu Pigeon. actuar: An r package for actuarial science. Journal of Statistical Software 2. Stuart A. Klugman, Harry H. Panjer, and Gordon E. Willmot. Loss models: from data to decisions Wiley series in probability and statistics, New York, NY, 1998. 3. Alexander J. McNeil. Estimating the tails of loss severity distributions using extreme value theory. ASTIN Bulletin, 27(1):117 138, May 1997. 7 FAV i R

4 LEGAL 4. Mathieu Ribatet. POT: Generalized Pareto Distribution and Peaks Over Threshold, 2009. R package version 1.1-0. 4 Legal Copyright 2010 Avraham Adler This paper is part of the FAViR project. All the R source code used to produce it is freely distributable under the GNU General Public License. See http://www.favir.net for more information on FAViR or to download the source code for this paper. Copying and distribution of this paper itself, with or without modification, are permitted in any medium without royalty provided the copyright notice and this notice are preserved. This paper is offered as-is, without any warranty. This paper is intended for educational purposes only and should not be used to violate anti-trust law. The authors and FAViR editors do not necessarily endorse the information or techniques in this paper and assume no responsibility for their accuracy. 8 FAV i R