Simulation of Extreme Events in the Presence of Spatial Dependence

Similar documents
Modelling Environmental Extremes

Modelling Environmental Extremes

Dependence Structure and Extreme Comovements in International Equity and Bond Markets

Modeling Co-movements and Tail Dependency in the International Stock Market via Copulae

EVA Tutorial #1 BLOCK MAXIMA APPROACH IN HYDROLOGIC/CLIMATE APPLICATIONS. Rick Katz

Introduction to Algorithmic Trading Strategies Lecture 8

Financial Risk Forecasting Chapter 9 Extreme Value Theory

GPD-POT and GEV block maxima

An Introduction to Statistical Extreme Value Theory

ADVANCED OPERATIONAL RISK MODELLING IN BANKS AND INSURANCE COMPANIES

Chapter 8: Sampling distributions of estimators Sections

Limit Theorems for the Empirical Distribution Function of Scaled Increments of Itô Semimartingales at high frequencies

Exam 2 Spring 2015 Statistics for Applications 4/9/2015

Chapter 7: Estimation Sections

Modeling dynamic diurnal patterns in high frequency financial data

Generalized MLE per Martins and Stedinger

Financial Risk Management

Objective Bayesian Analysis for Heteroscedastic Regression

Analysis of extreme values with random location Abstract Keywords: 1. Introduction and Model

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior

AN EXTREME VALUE APPROACH TO PRICING CREDIT RISK

Web-based Supplementary Materials for. A space-time conditional intensity model. for invasive meningococcal disease occurence

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay. Solutions to Midterm

(5) Multi-parameter models - Summarizing the posterior

Chapter 7: Estimation Sections

Measuring Financial Risk using Extreme Value Theory: evidence from Pakistan

Business Statistics 41000: Probability 3

Risk Measurement in Credit Portfolio Models

1 Bayesian Bias Correction Model

Window Width Selection for L 2 Adjusted Quantile Regression

Lecture 1: Empirical Properties of Returns

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29

QQ PLOT Yunsi Wang, Tyler Steele, Eva Zhang Spring 2016

Amath 546/Econ 589 Univariate GARCH Models

Statistical Analysis of Data from the Stock Markets. UiO-STK4510 Autumn 2015

Copulas? What copulas? R. Chicheportiche & J.P. Bouchaud, CFM

Is the Potential for International Diversification Disappearing? A Dynamic Copula Approach

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Examples

Chapter 7: Estimation Sections

Master s in Financial Engineering Foundations of Buy-Side Finance: Quantitative Risk and Portfolio Management. > Teaching > Courses

IEOR E4703: Monte-Carlo Simulation

Value at Risk with Stable Distributions

Geostatistical Inference under Preferential Sampling

Analyzing Oil Futures with a Dynamic Nelson-Siegel Model

Bayesian Hierarchical/ Multilevel and Latent-Variable (Random-Effects) Modeling

Financial Risk Management

An Introduction to Bayesian Inference and MCMC Methods for Capture-Recapture

ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices

An Information Based Methodology for the Change Point Problem Under the Non-central Skew t Distribution with Applications.

The Use of Penultimate Approximations in Risk Management

Market Risk Analysis Volume IV. Value-at-Risk Models

Statistical Inference and Methods

Introduction to Computational Finance and Financial Econometrics Descriptive Statistics

Midterm Exam. b. What are the continuously compounded returns for the two stocks?

NEWCASTLE UNIVERSITY SCHOOL OF MATHEMATICS & STATISTICS SEMESTER /2013 MAS8304. Environmental Extremes: Mid semester test

A Multivariate Analysis of Intercompany Loss Triangles

ERASMUS UNIVERSITY ROTTERDAM. Erasmus School of Economics. Extreme quantile estimation under serial dependence

Likelihood Methods of Inference. Toss coin 6 times and get Heads twice.

Loss Simulation Model Testing and Enhancement

QUANTIFYING THE RISK OF EXTREME EVENTS IN A CHANGING CLIMATE. Rick Katz. Joint Work with Holger Rootzén Chalmers and Gothenburg University, Sweden

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

Optimally Thresholded Realized Power Variations for Lévy Jump Diffusion Models

Machine Learning for Quantitative Finance

IEOR E4602: Quantitative Risk Management

Moment-based approximation with finite mixed Erlang distributions

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Log-Robust Portfolio Management

An Empirical Analysis of the Dependence Structure of International Equity and Bond Markets Using Regime-switching Copula Model

Strategies for High Frequency FX Trading

CS340 Machine learning Bayesian model selection

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Solutions to Final Exam

Generating Random Numbers

Example 5 European call option (ECO) Consider an ECO over an asset S with execution date T, price S T at time T and strike price K.

The stochastic calculus

Estimating a Dynamic Oligopolistic Game with Serially Correlated Unobserved Production Costs. SS223B-Empirical IO

Extended Libor Models and Their Calibration

Self-Exciting Corporate Defaults: Contagion or Frailty?

BROWNIAN MOTION Antonella Basso, Martina Nardon

Advanced Tools for Risk Management and Asset Pricing

Solar Energy Portfolio Analysis

Statistical analysis and bootstrapping

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

P VaR0.01 (X) > 2 VaR 0.01 (X). (10 p) Problem 4

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Lecture 6: Non Normal Distributions

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

1. You are given the following information about a stationary AR(2) model:

Improved Inference for Signal Discovery Under Exceptionally Low False Positive Error Rates

1. Covariance between two variables X and Y is denoted by Cov(X, Y) and defined by. Cov(X, Y ) = E(X E(X))(Y E(Y ))

Mongolia s TOP-20 Index Risk Analysis, Pt. 3

Lecture 17: More on Markov Decision Processes. Reinforcement learning

The Inverse Gaussian Process as a Degradation Model

MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL

Estimating Value at Risk of Portfolio: Skewed-EWMA Forecasting via Copula

Value at Risk and Self Similarity

Lecture Note 9 of Bus 41914, Spring Multivariate Volatility Models ChicagoBooth

On modelling of electricity spot price

Conditional Heteroscedasticity

Some Simple Stochastic Models for Analyzing Investment Guarantees p. 1/36

Heterogeneous Firm, Financial Market Integration and International Risk Sharing

Transcription:

Simulation of Extreme Events in the Presence of Spatial Dependence Nicholas Beck Bouchra Nasri Fateh Chebana Marie-Pier Côté Juliana Schulz Jean-François Plante Martin Durocher Marie-Hélène Toupin Jean-François Quessy Jonathan Jalbert Véronique Tremblay Nouredine Daili Desjardins General Insurance Group

Problem Evaluate the risk of coastal floods (from salt water) Deterministic model for the propagation of water in land Create a model to simulate water levels on the coast Capture spatial dependence to evaluate risk properly

Data and related choices Measure from tide gauges 21 locations, east of Québec City (at most) 50 years of data (much fewer in some locations) Interest in Z, the annual maximum water level L, the normal (theoretical) level of a high tide is known Longitude and latitude used for location

Working hypothesis #1 No temporal trends in the annual maxima X1700 3.0 3.2 3.4 3.6 3.8 4.0 4.2 X1805 2.2 2.4 2.6 2.8 3.0 0 10 20 30 40 50 0 10 20 30 40 50 X2145 2.9 3.0 3.1 3.2 X2780 3.4 3.6 3.8 4.0 0 10 20 30 40 50 0 10 20 30 40 50 X3057 6.8 7.0 7.2 7.4 7.6 X835 2.8 2.9 3.0 3.1 3.2 0 10 20 30 40 50 0 10 20 30 40 50 Shown above : Annual maximum vs year at 6 different sites. Climate change ignored : insurance contract are for one year.

Working hypothesis #2 Spatial trend in mean Tide 2 4 6 8 Tide (corrected) 0.5 1.0 1.5 70 65 60 55 longitude 70 65 60 55 longitude Annual maxima vs longitude. Left : raw data (Z ) ; Right : corrected for tide level (Z L)

Notation Notation Let a random fields Z (s), where Z (s) are annual maxima at location s R 2 Joint distribution H(z 1,..., z n ) = Pr {Z (s 1 ) z 1,..., Z (s n ) z n } Looking for a model for H

Plan Marginal model Generalized Extreme Value Spatial model t Copula Solution #1 : Frequentist approach Solution #2 : Bayesian approach Conclusion

Extreme value theory Theorem (Fisher-Tippett) Let Y 1,..., Y m be an iid sequence of variables and Z m = max{y 1,..., Y m }. If there exists some b m > 0 and a m R such that Z m a m b m H, for some non-degenerate distribution H, then H has a generalized extreme value (GEV) distribution, that is exp { (1 + ξ y µ σ F(y; µ, σ, ξ) = ) 1/ξ}, ξ 0, 1 + ξ y µ σ > 0 exp { exp ( y µ )}, ξ = 0, y R. σ

Extreme value theory In particular, when m is large enough, Z m GEV (µ, σ, ξ) and this approximation holds true even if Y 1,..., Y m is not independent. - short/medium range autocorrelation "disappears" when taking maximum (yay!)

Copula approach Decomposition H(z 1,..., z 21 ) = C Σθ,γ {F β1 (z 1 ),..., F β21 (z 21 )} C Σθ,γ is a copula and F β are marginal distributions ( ) δij (Σ θ,γ ) ij = g γ θ Isotropic random field

Copula approach Student copula Non zero upper tail dependence { } λ U = lim Pr X 1 > F 1 u 1 1 (u) X 2 > F 1 2 (u) Close under margins Power exponential model g ν (δ) = exp [ ( ) δ γ ] 3 θ where θ is a range parameter and γ is a smoothness parameter.

Copula approach Effect of the tail dependence Gaussian t dof=8 latitude 0.0 0.2 0.4 0.6 0.8 1.0 1.0 0.8 0.6 0.4 0.2 0.0 latitude 0.0 0.2 0.4 0.6 0.8 1.0 1.0 0.8 0.6 0.4 0.2 0.0 0.0 0.2 0.4 0.6 0.8 1.0 longitude 0.0 0.2 0.4 0.6 0.8 1.0 longitude

Copula approach Effect of the range parameter θ t dof=8, theta=1000 t dof=8, theta=3000 latitude 0.0 0.2 0.4 0.6 0.8 1.0 1.0 0.8 0.6 0.4 0.2 latitude 0.0 0.2 0.4 0.6 0.8 1.0 1.0 0.8 0.6 0.4 0.2 0.0 0.0 0.2 0.4 0.6 0.8 1.0 longitude 0.0 0.2 0.4 0.6 0.8 1.0 longitude

Copula approach Effect of the smoothness parameter γ t dof=8, nu=1 t dof=8, nu=0.2 latitude 0.0 0.2 0.4 0.6 0.8 1.0 0.8 0.6 0.4 0.2 latitude 0.0 0.2 0.4 0.6 0.8 1.0 1.0 0.8 0.6 0.4 0.2 0.0 0.0 0.2 0.4 0.6 0.8 1.0 longitude 0.0 0.2 0.4 0.6 0.8 1.0 longitude

Frequentist model Marginal GEV model Marginal distribution with covariates (Z L) = Z GEV (µ (s lon ), σ, ξ) Location : µ (s lon ) = a 0 + a 1 s lon + a 2 s 2 lon Constant scale σ and shape ξ Likelihood for the marginal L(a 0, a 1, a 2, σ, ξ) = n k=1 where n = number of year-site f (z k ; a 0, a 1, a 2, σ, ξ)

Frequentist model Marginal GEV model µ = 4.74 + 0.16s long + 0.002slong 2 σ = 0.20 ξ = 0.07

Frequentist model Estimation of the location parameters µ

Frequentist model Copula estimation Marginal parameters are known : β i = ( µ i, σ, ξ) Second step : Pseudo-likelihood L(θ, γ) = n i=1 } c (d i ) Σ θ,γ (Z {F β1 i,1 ),..., (Z F βdi i,di ), where d i is the number of non missing sites at time i = 1,..., n.

Frequentist model Evolution of the dependence (ˆθ = 2850, ˆγ =.37 df = 21)

Frequentist model Boxplot of simulated (green) vs true (blue) values

Hierarchical Bayesian model 1) Data level Z(s) θ has copula C θ and Z (s i ) µ(s i ), σ, ξ GEV (µ(s i ), σ, ξ) 2) Process level Non-informative priors are used for σ, ξ and θ = (θ 1, θ 2 ) and (µ(s) β, τ, ϱ) N d. 3) Hyper priors β = (β 0, β 1 ) and ϱ have non-informative priors and τ has a non-informative prior.

Hierarchical Bayesian model Given the parameters µ, σ, ξ and θ, the joint distribution of the maxima is F Z µ,σ,ξ,θ (z) = C θ [F{z(s 1 ) µ 1, σ, ξ},..., F {z(s d ) µ d, σ, ξ}], where C θ is a t copula F is the cdf of the GEV distribution. Fitting is performed by MCMC.

Hierarchical Bayesian model copula parameters The degrees of freedom of the t copula are fixed. The correlation depends on the distance between the stations : ρ z (s i, s j ) = θ dist(s i,s j ) θ 2 1, with f θ1 is U(0, 1) and f θ2 is U(0, 2).

Hierarchical Bayesian model marginal parameters At the process level, we use non-informative priors for σ and ξ, while, for s = (s 1,..., s d ), µ(s) N d (X(s)β, Σ(s)), where X(s i )β = β 0 + β 1 x i and x i is the deterministic predicted maximum tide level at station s i, Σ(s) = (ρ µ (s i, s j )) d d, with ρ µ (s i, s j ) = τ exp( dist(s i, s j )/ϱ). A non-informative prior was used for β, f ϱ is U(0, max i,j {dist(s i, s j )}) and f τ (τ) 1/τ.

Goodness of fit testing 3.5 7.5 Estimated Quantiles 3 2.5 2 1.5 1.8 2 2.2 2.4 2.6 2.8 Empirical Quantiles (a) Station 2 Estimated Quantiles 7 6.5 6 5.5 6.2 6.4 6.6 6.8 Empirical Quantiles (b) Station 11 FIGURE QQ Plots for GEV margins

Goodness of fit testing (cont.) 9 8 7 GEV Location 6 5 4 3 2 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 Station FIGURE Boxplots for stations location parameter

Simulation Simulation from this model can be done with the following (simple!) algorithm : (1) Generate the parameters β, τ, ϱ, µ(s), σ, ξ and θ (2) Generate realization from the copula C θ, i.e. u = (u 1,..., u d ) (3) Using the GEV margins, invert the copula realizations using x i = F 1 (u i µ(s i ), σ, ξ), i = 1,..., d. Can also be used for interpolation between sites as the hierarchical structure accounts for spatiality.

Conclusion Applicable model with reasonnable results Scalable Annual maxima vs multiple claims the same year Room to improve the modelling of µ and σ Need to check the fit on simultaneous data Great team!