Generalized Additive Modelling for Sample Extremes: An Environmental Example

Similar documents
Advanced Extremal Models for Operational Risk

Modeling the extremes of temperature time series. Debbie J. Dupuis Department of Decision Sciences HEC Montréal

An Introduction to Statistical Extreme Value Theory

NEWCASTLE UNIVERSITY SCHOOL OF MATHEMATICS & STATISTICS SEMESTER /2013 MAS8304. Environmental Extremes: Mid semester test

Measuring Financial Risk using Extreme Value Theory: evidence from Pakistan

Modelling Environmental Extremes

Generalized MLE per Martins and Stedinger

Modelling Environmental Extremes

GPD-POT and GEV block maxima

Financial Risk Forecasting Chapter 9 Extreme Value Theory

An Application of Extreme Value Theory for Measuring Financial Risk in the Uruguayan Pension Fund 1

IMPLEMENTING THE SPECTRAL CALIBRATION OF EXPONENTIAL LÉVY MODELS

Modelling catastrophic risk in international equity markets: An extreme value approach. JOHN COTTER University College Dublin

1. You are given the following information about a stationary AR(2) model:

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

On modelling of electricity spot price

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

Risk Management and Time Series

STOCHASTIC MODELING OF HURRICANE DAMAGE UNDER CLIMATE CHANGE

Analysis of truncated data with application to the operational risk estimation

QUANTIFYING THE RISK OF EXTREME EVENTS IN A CHANGING CLIMATE. Rick Katz. Joint Work with Holger Rootzén Chalmers and Gothenburg University, Sweden

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -5 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Estimating Term Structure of U.S. Treasury Securities: An Interpolation Approach

Analysis of extreme values with random location Abstract Keywords: 1. Introduction and Model

Statistical Models and Methods for Financial Markets

Online Appendix to Grouped Coefficients to Reduce Bias in Heterogeneous Dynamic Panel Models with Small T

Linda Allen, Jacob Boudoukh and Anthony Saunders, Understanding Market, Credit and Operational Risk: The Value at Risk Approach

Analyzing Oil Futures with a Dynamic Nelson-Siegel Model

AN EXTREME VALUE APPROACH TO PRICING CREDIT RISK

Financial Econometrics

Quantitative Models for Operational Risk

9.1 Principal Component Analysis for Portfolios

Limit Theorems for the Empirical Distribution Function of Scaled Increments of Itô Semimartingales at high frequencies

Portfolio Optimization. Prof. Daniel P. Palomar

M.I.T Fall Practice Problems

Extended Libor Models and Their Calibration

EVA Tutorial #1 BLOCK MAXIMA APPROACH IN HYDROLOGIC/CLIMATE APPLICATIONS. Rick Katz

Duangporn Jearkpaporn, Connie M. Borror Douglas C. Montgomery and George C. Runger Arizona State University Tempe, AZ

Multivariate Cox PH model with log-skew-normal frailties

Window Width Selection for L 2 Adjusted Quantile Regression

Modelling Joint Distribution of Returns. Dr. Sawsan Hilal space

IMPA Commodities Course : Forward Price Models

Lecture 3: Probability Distributions (cont d)

Estimate of Maximum Insurance Loss due to Bushfires

Subject CS2A Risk Modelling and Survival Analysis Core Principles

Chapter 3 Common Families of Distributions. Definition 3.4.1: A family of pmfs or pdfs is called exponential family if it can be expressed as

A New Hybrid Estimation Method for the Generalized Pareto Distribution

Dynamic Replication of Non-Maturing Assets and Liabilities

Value at Risk Estimation Using Extreme Value Theory

Vladimir Spokoiny (joint with J.Polzehl) Varying coefficient GARCH versus local constant volatility modeling.

Simultaneous optimization for wind derivatives based on prediction errors

STA 4504/5503 Sample questions for exam True-False questions.

Lecture Note 9 of Bus 41914, Spring Multivariate Volatility Models ChicagoBooth

Machine Learning for Quantitative Finance

Consistent estimators for multilevel generalised linear models using an iterated bootstrap

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Final Exam

Fitting the generalized Pareto distribution to commercial fire loss severity: evidence from Taiwan

Extended Libor Models and Their Calibration

Two-step conditional α-quantile estimation via additive models of location and scale 1

Modelling and Management of Cyber Risk

Semiparametric Modeling, Penalized Splines, and Mixed Models

Numerical Methods for Pricing Energy Derivatives, including Swing Options, in the Presence of Jumps

Extreme Value Theory with an Application to Bank Failures through Contagion

value BE.104 Spring Biostatistics: Distribution and the Mean J. L. Sherley

Evaluating Electricity Generation, Energy Options, and Complex Networks

Point Process Models for Extreme Returns: Harnessing implied volatility

Simulation of Extreme Events in the Presence of Spatial Dependence

Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University

Survival Analysis APTS 2016/17 Preliminary material

Statistics 6 th Edition

Quantile Regression. By Luyang Fu, Ph. D., FCAS, State Auto Insurance Company Cheng-sheng Peter Wu, FCAS, ASA, MAAA, Deloitte Consulting

Self-Exciting Corporate Defaults: Contagion or Frailty?

Forecasting with Inadequate Data. The Piggyback Model. The Problem. The Solution. Iain Currie Heriot-Watt University. Universidad Carlos III de Madrid

Two-dimensional COS method

Portfolio Management and Optimal Execution via Convex Optimization

Modeling the dependence between a Poisson process and a continuous semimartingale

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS

ERASMUS UNIVERSITY ROTTERDAM. Erasmus School of Economics. Extreme quantile estimation under serial dependence

Estimation Procedure for Parametric Survival Distribution Without Covariates

Valuing Water Resources in Developing Countries: A Semiparametric Approach to Valuation Models

Pricing in markets modeled by general processes with independent increments

Stochastic Models. Statistics. Walt Pohl. February 28, Department of Business Administration

Financial Risk Management

A RISK RETURN RELATION IN STOCK MARKETS: EVIDENCE FROM A NEW SEMI-PARAMETRIC GARCH-IN-MEAN MODEL. A Dissertation NAPON HONGSAKULVASU

Modelling insured catastrophe losses

Spline Methods for Extracting Interest Rate Curves from Coupon Bond Prices

Bivariate Extreme Value Analysis of Commodity Prices. Matthew Joyce BSc. Economics, University of Victoria, 2011

Universität Regensburg Mathematik

Modelling financial data with stochastic processes

Statistics for Managers Using Microsoft Excel 7 th Edition

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Midterm

Is the Potential for International Diversification Disappearing? A Dynamic Copula Approach

Extreme Values Modelling of Nairobi Securities Exchange Index

A Multivariate Analysis of Intercompany Loss Triangles

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology

Risk Classification In Non-Life Insurance

Chapter 2 ( ) Fall 2012

Energy Systems under Uncertainty: Modeling and Computations

Modeling the Spot Price of Electricity in Deregulated Energy Markets

Practical methods of modelling operational risk

Transcription:

Generalized Additive Modelling for Sample Extremes: An Environmental Example V. Chavez-Demoulin Department of Mathematics Swiss Federal Institute of Technology Tokyo, March 2007

Changes in extremes? Likely to be slow in environmental applications May be difficult to detect because of noise Aim to combine the point process approach to exceedances with smoothing methods to give a flexible exploratory approach to modelling changes in extremes

Stations in Swiss Alps

Winter temperatures at 21 Swiss stations height 197019751980198519901995 height height 0 5 10 15 20 0 height height height height height height 5 10 Excesses 15 20 height height height height height height 0 5 10 15 20 0 height height height height height height 5 10 15 20 197019751980198519901995 197019751980198519901995 Year 197019751980 198519901995

Swiss winter temperatures by year 0 5 10 15 0 20 40 60 80 1995 1996 1997 20 1989 1990 1991 1992 1993 1994 0 5 10 15 Excesses 0 5 10 15 20 1983 1984 1985 1986 1987 1988 1977 1978 1979 1980 1981 1982 20 0 5 10 15 20 0 1971 1972 1973 1974 1975 1976 5 10 15 20 0 20 40 60 80 0 20 40 60 80 0 20 40 60 80 Day

Summary Climate change and extremes? Need flexible models Mix threshold approach to extremal modelling, semiparametric smoothing, and bootstrap Brief description of the threshold method Implementation of spline smoothers Application to the Swiss Alps data Discussion

Traditional Method The mathematical foundation of EVT is the class of extreme value limit laws X 1, X 2,..., are independent random variables with common distribution function F and M n = max {X 1,..., X n } for suitable normalising constants a n > 0 and b n, we{ seek a limit } law satisfying P Mn b n a n x = F n (a n x + b n ) G(x)

There are only 3 fundamental types of extreme value limit laws that can be combined into a simple GEV distribution H(x) = exp { ( 1 + κ x µ ψ ) 1/κ + } The parameters < µ <, ψ > 0 and < κ < are resp. the location, scale and shape parameters

r-largest Extremes M n,..., M n r : the r-largest observations among X 1,..., X n to get more information about the extremes than the max alone The asymptotic joint distribution of M1 n,..., Mr n at m { n 1,..., mn r ( is ) } 1/κ exp 1 + κ mr n µ ψ ( ) 1/κ 1 r j=1 1 ψ 1 + κ mi n µ ψ + which forms a likelihood for the parameters

In case m years of data are available, the likelihood is constructed from the r-largest values in each year, considering data for different years as independent, an overall likelihood is simply the product of such terms, for all years Choice of r; bias if r is too large

Threshold method Treat occurrences of events over (or under) threshold u as Poisson process Number of exceedances N over u follows homogeneous Poisson process, rate λ Exceedance sizes W j = Y j u are random sample from GPD 1 (1 + κw/σ) 1/κ + if κ 0 G(w) = 1 exp( w/σ) if κ = 0 where σ and κ are scale and shape parameters

Use orthogonal parametrization κ, ν = σ(1 + κ) below Log likelihood for data splits into two parts l(λ, κ, σ) = l N (λ) + l W (κ, ν)

Semiparametric model Generalize previous approach Take λ to be time-varying, where λ = exp { x T α + f(t) } Take exceedances to be GPD with κ = x T β + g(t), ν = exp { x T η + s(t) }

f, g and s are smooth functions of time t, and parameters can also depend on ordinary covariates Penalize roughness of f, g and s through second derivatives Other link functions possible

Penalized log likelihoods For rate of exceedances λ, maximize l N (λ) 1 2 γ λ f (t) 2 dt, equivalent to fitting standard generalized additive model For sizes of exceedances, maximize l W {κ(β, g), ν(η, s)} 1 2 γ κ g (t) 2 dt 1 2 γ ν s (t) 2 dt

If g, s are cubic splines, equivalent to maximizing l W {κ(β, g), ν(η, s)} 1 2 γ κg T Kg 1 2 γ νs T Ks over β, η, g, s and leads to generalized ridge regression Parameters γ λ, γ κ and γ ν control smoothness of f, g and s

Methodology Choose forms for λ, κ and ν and fit Choose smoothing parameters γ λ etc using AIC Use likelihood ratio statistics/aic for model comparisons

When model correct, residuals R j = ˆκ 1 j log {1 ˆκ j W j (1 ˆκ j )ˆν j } are approximately independent unit exponential variables

Bootstrap uncertainty assessment Need model-robust assessment of uncertainty Clustering across stations must be taken into account Use bootstrap, either resampling the R j computed from undersmoothed curves added to oversmoothed curves or resample seasons within blocks Either yields percentile confidence intervals/pointwise bands

Alpine winter temperatures Fitted intensity log ˆλ = ˆα 0 + ˆf(d, 4) + ˆq(t, 2) at Vattis for 1984 5 (left) and for January 1 from 1971 95 (right) lambda 0.0 0.05 0.10 0.15 0.20 lambda 0.05 0.10 0.15 0.20 0 20 40 60 80 Day 1970 1975 1980 1985 1990 1995 Year

Fitted model and 20-year return level log ˆλ = ˆα 0 + ˆf(d, 4) + ˆq(t, 2), ˆκ = ˆβ 0 +10 2 (h 1000) ˆβ 1, log ˆν = ˆη 0 + ˆη 2 t+ŝ(d, 4) Rheinfelden Vattis Arosa Temperatures & 20-year return level -30-25 -20-15 -10-5 1970 1975 1980 1985 1990 1995 Year Temperatures & 20-year return level -30-25 -20-15 -10-5 1970 1975 1980 1985 1990 1995 Year Temperatures & 20-year return level -30-25 -20-15 -10-5 1970 1975 1980 1985 1990 1995 Year

Discussion Inhomogeneous Poisson process λ depends on time but not location Shape parameter κ varies with altitude exceedances at higher stations have shorter tails Scale parameter ν depends on time but not on altitude Increase since 1985 is consistent with the supposed effect of climate change but also with short-term fluctuations (decrease from 1970 85!)

Conclusion Exceedances over/under thresholds widely-used approach with natural interpretation exceedance times modelled using existing code (GAM) Smoothing extremes by penalized log likelihood convenient and rapid exploration technique highlights features of underlying distribution

References Chavez-Demoulin, V. and Davison, A. C. (2005) Applied Statistics Chavez-Demoulin, V., Embrechts, P. (2004) Smooth Extremal Models in Finance and Insurance. Journal of Risk and Insurance. Davison, A. C. and Smith, R. L. (1990) JRSS,B Embrechts, P., Klüppelberg, C. and Mikosch, T. (1997) Modelling Extremal Events for Insurance and Finance Springer Green, P. J., Silverman, B. W. (1994)