Backtesting value-at-risk: a comparison between filtered bootstrap and historical simulation

Similar documents
A Quick Tool to forecast VaR using Implied and Realized Volatilities

Risk Measuring of Chosen Stocks of the Prague Stock Exchange

GARCH vs. Traditional Methods of Estimating Value-at-Risk (VaR) of the Philippine Bond Market

Backtesting value-at-risk: Case study on the Romanian capital market

How Accurate are Value-at-Risk Models at Commercial Banks?

Value-at-Risk forecasting ability of filtered historical simulation for non-normal. GARCH returns. First Draft: February 2010 This Draft: January 2011

Bayesian Estimation of the Markov-Switching GARCH(1,1) Model with Student-t Innovations

Financial Econometrics Notes. Kevin Sheppard University of Oxford

Financial Risk Management and Governance Beyond VaR. Prof. Hugues Pirotte

Value at risk might underestimate risk when risk bites. Just bootstrap it!

CAN LOGNORMAL, WEIBULL OR GAMMA DISTRIBUTIONS IMPROVE THE EWS-GARCH VALUE-AT-RISK FORECASTS?

Modeling the Market Risk in the Context of the Basel III Acord

Scaling conditional tail probability and quantile estimators

Properties of the estimated five-factor model

Section 3 describes the data for portfolio construction and alternative PD and correlation inputs.

Violation duration as a better way of VaR model evaluation : evidence from Turkish market portfolio

European Journal of Economic Studies, 2016, Vol.(17), Is. 3

FORECASTING OF VALUE AT RISK BY USING PERCENTILE OF CLUSTER METHOD

The Fundamental Review of the Trading Book: from VaR to ES

Evaluating the Accuracy of Value at Risk Approaches

An empirical evaluation of risk management

Assessing Value-at-Risk

APPROACHES TO VALIDATING METHODOLOGIES AND MODELS WITH INSURANCE APPLICATIONS

Back-testing Value at Risk models on Indian stock markets using Covariance and Historical Simulation approach

ABILITY OF VALUE AT RISK TO ESTIMATE THE RISK: HISTORICAL SIMULATION APPROACH

Model Construction & Forecast Based Portfolio Allocation:

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay. Solutions to Final Exam

THE INFORMATION CONTENT OF IMPLIED VOLATILITY IN AGRICULTURAL COMMODITY MARKETS. Pierre Giot 1

A Regime Switching model

Market Risk Prediction under Long Memory: When VaR is Higher than Expected

Margin Backtesting. August 31st, Abstract

ARCH and GARCH models

Experience with the Weighted Bootstrap in Testing for Unobserved Heterogeneity in Exponential and Weibull Duration Models

Volatility Spillovers and Causality of Carbon Emissions, Oil and Coal Spot and Futures for the EU and USA

Downside Risk: Implications for Financial Management Robert Engle NYU Stern School of Business Carlos III, May 24,2004

Risk Management and Time Series

Cross-Sectional Distribution of GARCH Coefficients across S&P 500 Constituents : Time-Variation over the Period

Measuring Financial Risk using Extreme Value Theory: evidence from Pakistan

The new Basel III accord appeared amid

Backtesting Lambda Value at Risk

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2009, Mr. Ruey S. Tsay. Solutions to Final Exam

Backtesting Trading Book Models

A Simplified Approach to the Conditional Estimation of Value at Risk (VAR)

NBER WORKING PAPER SERIES A REHABILITATION OF STOCHASTIC DISCOUNT FACTOR METHODOLOGY. John H. Cochrane

A STUDY ON ROBUST ESTIMATORS FOR GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSCEDASTIC MODELS

Backtesting Value-at-Risk: A Duration-Based Approach 1

Market Risk Analysis Volume II. Practical Financial Econometrics

Intraday Volatility Forecast in Australian Equity Market

Week 7 Quantitative Analysis of Financial Markets Simulation Methods

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Research Article The Volatility of the Index of Shanghai Stock Market Research Based on ARCH and Its Extended Forms

Application of Conditional Autoregressive Value at Risk Model to Kenyan Stocks: A Comparative Study

12. Conditional heteroscedastic models (ARCH) MA6622, Ernesto Mordecki, CityU, HK, 2006.

Forecasting Volatility movements using Markov Switching Regimes. This paper uses Markov switching models to capture volatility dynamics in exchange

Lecture 5: Univariate Volatility

Backtesting Trading Book Models

Linda Allen, Jacob Boudoukh and Anthony Saunders, Understanding Market, Credit and Operational Risk: The Value at Risk Approach

Market Risk Analysis Volume IV. Value-at-Risk Models

FINANCIAL ECONOMETRICS AND EMPIRICAL FINANCE MODULE 2

EWS-GARCH: NEW REGIME SWITCHING APPROACH TO FORECAST VALUE-AT-RISK

Does the Assumption on Innovation Process Play an Important Role for Filtered Historical Simulation Model?

MEASURING TRADED MARKET RISK: VALUE-AT-RISK AND BACKTESTING TECHNIQUES

Testing for the martingale hypothesis in Asian stock prices: a wild bootstrap approach

Chapter 4 Level of Volatility in the Indian Stock Market

The Great Moderation Flattens Fat Tails: Disappearing Leptokurtosis

Amath 546/Econ 589 Univariate GARCH Models: Advanced Topics

Introductory Econometrics for Finance

Amath 546/Econ 589 Univariate GARCH Models

Backtesting Value-at-Risk: A Duration-Based Approach 1

List of tables List of boxes List of screenshots Preface to the third edition Acknowledgements

starting on 5/1/1953 up until 2/1/2017.

PERFORMANCE AND CONSERVATISM OF MONTHLY FHS VAR: AN INTERNATIONAL INVESTIGATION

yuimagui: A graphical user interface for the yuima package. User Guide yuimagui v1.0

A Quantile Regression Approach to the Multiple Period Value at Risk Estimation

Modeling Credit Risk of Loan Portfolios in the Presence of Autocorrelation (Part 2)

The GARCH-GPD in market risks modeling: An empirical exposition on KOSPI

Measurement of Market Risk

Internet Appendix for Asymmetry in Stock Comovements: An Entropy Approach

Financial Econometrics

Non-parametric VaR Techniques. Myths and Realities

FORECASTING PERFORMANCE OF MARKOV-SWITCHING GARCH MODELS: A LARGE-SCALE EMPIRICAL STUDY

P2.T5. Market Risk Measurement & Management. Jorion, Value-at Risk: The New Benchmark for Managing Financial Risk, 3 rd Edition

Tail Risk Literature Review

Evaluating Combined Forecasts for Realized Volatility Using Asymmetric Loss Functions

Modelling financial data with stochastic processes

Study on Financial Market Risk Measurement Based on GJR-GARCH and FHS

Experience with the Weighted Bootstrap in Testing for Unobserved Heterogeneity in Exponential and Weibull Duration Models

An implicit backtest for ES via a simple multinomial approach

An Empirical Research on Chinese Stock Market Volatility Based. on Garch

Lecture 8: Markov and Regime

Robust Critical Values for the Jarque-bera Test for Normality

Course information FN3142 Quantitative finance

Statistical Models and Methods for Financial Markets

Lecture 9: Markov and Regime

FOREX Risk: Measurement and Evaluation using Value-at-Risk. Don Bredin University College Dublin and. Stuart Hyde University of Manchester

A new approach to backtesting and risk model selection

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay Solutions to Final Exam

FINITE SAMPLE DISTRIBUTIONS OF RISK-RETURN RATIOS

Sample Size for Assessing Agreement between Two Methods of Measurement by Bland Altman Method

Introduction Dickey-Fuller Test Option Pricing Bootstrapping. Simulation Methods. Chapter 13 of Chris Brook s Book.

Transcription:

Journal of Risk Model Validation Volume /Number, Winter 1/13 (3 1) Backtesting value-at-risk: a comparison between filtered bootstrap and historical simulation Dario Brandolini Symphonia SGR, Via Gramsci 7, 111 Torino, Italy; email: dario.brandolini@symphonia.it Stefano Colucci Symphonia SGR, Via Gramsci 7, 111 Torino Italy; email: stefano.colucci@symphonia.it The purpose of this paper is to compare ex ante value-at-risk (VaR) estimation produced by two risk models: historical simulation and Monte Carlo filtered bootstrap. We perform three tests: unconditional coverage, independence and conditional coverage. We present results on both VaR 1% and VaR 5% on a one-day horizon for the following indexes: S&P 5, Topix, Dax, MSCI United Kingdom, MSCI France, Italy Comit Globale, MSCI Canada, MSCI Emerging Markets and RJ/CRB. Our results show that the Monte Carlo filtered bootstrap approach satisfies conditional coverage for all tested indexes, while historical simulation has many rejection cases. We also test the two models in a regulatory framework (rolling window of 5 daily observations) and discuss the advantages of using a conditional coverage methodology to validate risk models. 1 INTRODUCTION In the last ten years financial markets have suffered many periods of turbulence, such as the dot-com bubble (1 ), emerging markets fall (), the subprime financial crisis with the defaults of large investment banks () and the sovereign debt crisis (1 11), when losses reached values well above the Gaussian hypotheses in terms of frequencies and amounts. Many models can be used to estimate market risk but they must satisfy two conditions: unconditional coverage and independence. For backtesting purposes, the regulator (the Committee of European Securities Regulators (CESR), now the European Securities and Markets Authority) will accept no The authors thank Loriana Pelizzon of Venice University for her useful comments during the XIII Workshop on Quantitative Finance. The views expressed in this paper represent those of the authors and not necessarily those of Symphonia SGR. 3

D. Brandolini and S. Colucci more than seven failures in a 5-day rolling window when VaR 1% is estimated on a one-day horizon. This is a coverage condition, while no tests are required regarding independence. The aim of this paper is to discuss and compare two different models (filtered bootstrap (FB) and historical simulation (HS)) with no Gaussian hypothesis over a relatively long period of time (from January 3, to August, 11), and to then analyze the test results. We perform both coverage (unconditional and conditional) and independence tests to validate the risk models and check their compliance with regulatory rules. The outline of the paper is as follows. Data is presented in Section. In Section 3 we briefly describe how value-at-risk (VaR) is estimated in the two models. In Sections and 5, unconditional coverage independence and conditional coverage tests are discussed, together with the regulatory hit function for both models. DATA PRESENTATION We perform both VaR 1% and VaR 5% estimations on a one-day horizon for the FB approach (Barone-Adesi et al (1999)) and the HS approach for the following indexes: S&P 5, Topix (TPX), Dax (DAX), MSCI United Kingdom (MXGB), MSCI France (MXFR), Italy Comit Globale (COMIT), MSCI Canada (MXCA), MSCI Emerging Markets (MXEF), RJ/CRB (CRYTR) in US dollars and also in local currency (TPX LC, DAX LC, MXGB LC, MXFR LC, COMIT LC, MXCA LC) if the US dollar is not the original currency. Value-at-risk estimation will cover the period from January 3, to August, 11, but data sets start 5 days earlier in order to initialize the estimation. All indexes follow the US financial calendar so there are no missing values. If one market is closed, the previous value is replicated and the return for that day is zero. In that way we have the same data set for all markets and it is possible to evaluate the whole sample in joint VaR estimation. 3 VAR ESTIMATION: FILTERED BOOTSTRAP AND HISTORICAL SIMULATION A simple HS model specification is applied. For each index and each working day, we calculate the 1% and 5% quantiles of the distribution of equally weighted realized returns of the previous 5 working days. Because we analyze each index independently, this one is the risk factor that we want to estimate. Thus, it is sufficient to calculate the historical percentile (1 and 5 in our case) of the index realized distribution on 5 working days to obtain the VaR estimate for next day. There are many methods of computing quantile Q p. We use the default quantile Matlab estimator Journal of Risk Model Validation Volume /Number, Winter 1/13

Backtesting value-at-risk 5 Q p D 1.x dh 1=e C x bhc1=c /, where h D Np C 1, p is the probability and N is the sample size. This estimator is slightly more conservative than the Excel function. The FB model 1 step by step procedure is as follows. (1) Fit the best autoregressive moving average generalized autoregressive conditional heteroskedasticity (ARMA-GARCH) model. () Standardize residuals: divide residuals by estimated sigmas. (3) Bootstrap standardized residuals. () Pass the bootstrapped residuals in a forward simulation using the ARMA- GARCH estimated model. (5) Collect estimated returns. () Calculate VaR on the distribution of returns. Each day, the best ARMA.p; q/-exponential general autoregressive conditional heteroskedastic (EGARCH).p; q/ model is fitted to the data according to the following hierarchy. ARMA(1,)-EGARCH(1,1): if the AR coefficient is statistically significant. ARMA(,1)-EGARCH(1,1): if the MA coefficient is statistically significant. ARMA(,)-EGARCH(1,1): if the leverage coefficient is statistically significant. ARMA(1,)-GARCH(1,1): if the AR coefficient is statistically significant. ARMA(,1)-GARCH(1,1): if the MA coefficient is statistically significant. ARMA(,)-GARCH(1,1). Using the best model estimate, the simulation process then generates 1 scenarios and calculates 1% and 5% quantiles of the distribution of returns; again, we use the same estimator as in HS. 1 For details on the filtered bootstrap approach, refer to Barone Adesi et al (1999), Brandolini et al () and Pallotta and Zenti (). Research Paper www.risk.net/journal

D. Brandolini and S. Colucci TEST AND RESULTS In this section we apply the benchmark test proposed by Christoffersen and Pelletier (). Consider a time series of daily ex post portfolio returns, R t, t 1;:::;T, and a corresponding time series of ex ante value-at-risk forecasts, VaR t.p/, t 1;:::;T, with expected coverage rate p, such that ideally: Pr.R t < VaR t.p// D p t 1 Define the hit sequence of VaR t violations as: ( 1 if Rt < VaR t.p/ I t D otherwise Note that the hit sequence discards the information regarding the size of violations. Christoffersen (199) tests the null hypothesis that: against the alternative that: I t iid Bernoulli.p/ I t iid Bernoulli./ and refers to this as the test of correct unconditional coverage (UC): H ;uc W D p thus, testing that on average the coverage is correct. In fact, the above test implicitly assumes that the hits are independent: an assumption that has to be tested explicitly. In order to test this hypothesis, an alternative test is proposed where the hit sequence follows a first-order Markov sequence with switching probability matrix: " # 1 1 1 D 1 11 11 where ij is the probability of an event i on day t 1 being followed by an event j on day t. The test of independence (ind) is then: H ;ind W 1 D 11 Finally, the two tests can be combined in a test of conditional coverage (CC): H ;CC W 1 D 11 D p Journal of Risk Model Validation Volume /Number, Winter 1/13

Backtesting value-at-risk 7 The idea behind the Markov alternative is that clustered violations might be a signal of risk model misspecification. Violation clustering is important as it suggests repeated severe losses, which together could result in bankruptcy. The likelihood function for a sample I 1 ;:::;I T of iid observations from a Bernoulli variable with known probability p is written as: L.I; p/ D p T 1.1 p/ T T 1 where T 1 D P T td1 I t is the number of 1s in the sample. The likelihood function for an iid Bernoulli variable with unknown probability parameter, 1, to be estimated is: L.I; 1 / D T 1 1.1 1/ T T 1 The maximum-likelihood (ML) estimate of 1 is: O 1 D T 1 =T and we can thus write a likelihood ratio test of unconditional coverage as: LR UC D Œln L.I; O 1 / ln L.I; p/ For the independence test, the likelihood under the alternative hypothesis is: L.I; 1 ; 11 / D.1 1 / T T 1 T 1 1.1 11/ T 1 T 11 T 11 11 where T ij denotes the number of observations when a j follows an i.the ML estimates are: O 1 D T 1 and O 11 D T 11 T T 1 and the independence test statistic is: LR ind D Œln L.I; O 1 ; O 11 / ln L.I; O 1 / Finally, the test of conditional coverage is written as: LR CC D Œln L.I; O 1 ; O 11 / ln L.I; p/ We note that all the tests are carried out conditionally on the first observation. The tests are asymptotically chi-square distributed with one degree of freedom for the UC and ind tests and two for the CC test. Research Paper www.risk.net/journal

D. Brandolini and S. Colucci TABLE 1 Unconditional coverage test on filtered bootstrap VaR estimation (in percent). when when Index p D 1% p-value p D 5% p-value S&P 5 1.7 1.7 5. 11.9 TPX 1.3. 5. 51.71 TPX LC 1.37 5.7 5.5 1.95 DAX 1..3.1.73 DAX LC 1.7 1.7.5 1.1 MXGB 1.7. 5. 51.71 MXGB LC 1. 3.97 5. 3.37 MXFR 1.57.1.5. MXFR LC 1.33.51 5. 1.5 COMIT 1.7 1.7.5.7 COMIT LC 1.7 1.7 5.7 5.99 MXCA 1.7 1.7 5.91.75 MXCA LC 1..1.3. MXEF 1.33.51 5.9 1. CRYTR 1..3 5.3 57. Finally, as a practical matter, if the sample has T 11 D, as can easily happen in small samples and with small coverage rates, then the first-order Markov likelihood is computed as: L.I; 1 ; 11 / D.1 1 / T T 1 T 1 1 and the tests are carried out as above. Following this procedure, as in Christoffersen and Pelletier (), we perform all three tests (UC, ind and CC) for the two methods. A confidence interval of 99% is chosen to make sure that we reject the null hypothesis only when the estimate is far from the expected value. Table 1 and Table on the facing page display the results of the UC tests for VaR 1% and VaR 5%, as estimated using FB and HS. Hypotheses rejected al 99% confidence are given in bold. For FB, there are 3 rejections in the VaR 1% case, and in the VaR 5% case. For HS, the numbers of rejections are and 3, so FB seems to be superior in terms of unconditional coverage. Table 3 on the facing page and Table on page 1 display the corresponding results of independence tests. In these, FB performs much better (1 and rejections at 99% confidence) than HS ( and 11 rejections), the latter being vulnerable to highly dependent VaR estimations, indicated by clustering of ones in the hit function. Table 5 on page 1 and Table on page 11 display the corresponding results of the conditional coverage test. Again, FB performs much better (5 and 3 rejections at 99% confidence) than HS (13 and 11 rejections). Journal of Risk Model Validation Volume /Number, Winter 1/13

Backtesting value-at-risk 9 TABLE Unconditional coverage test on historical simulation VaR estimation (in percent). when when Index p D 1% p-value p D 5% p-value S&P 5 1.. 5. 11.9 TPX 1.1 39. 5.5 1.95 TPX LC 1.5 1. 5.13 75. DAX 1.7. 5.5 5.13 DAX LC 1.1...9 MXGB 1.7.1. 1.5 MXGB LC 1.7. 5.7 1.1 MXFR 1.71.5.39.5 MXFR LC 1.5 1..3.1 COMIT 1. 3.97 5..11 COMIT LC 1.5 1. 5.1.97 MXCA 1.7 1.7 5.7 7.19 MXCA LC 1.57.1 5.3.7 MXEF 1.3 1. 5.57 1.39 CRYTR 1.7. 5.3.7 TABLE 3 Independence test on filtered bootstrap VaR estimation. p D 1% p D 5% ƒ ƒ Index 1 11 LR ind p-value 1 11 LR ind p-value S&P 5 1.39%.9%..7% 5.9%.5%.1.3% TPX 1.1%.33%.7.9% 5.3% 5.%.11 7.% TPX LC 1.35%.5%.31 57.% 5.%.1%.1 9.3% DAX 1.%.% 1..7%.%.7%.11 73.9% DAX LC 1.%.33%.19.3%.9%.%.9 1.3% MXGB 1.%.1% 3. 5.7% 5.% 5.1%. 97.% MXGB LC 1.39%.%.7.57% 5.1%.9%.3 53.% MXFR 1.9%.5%..%.51% 7.9% NaN.% MXFR LC 1.3%.5%.3 5.3% 5.1%.71%.3 53.7% COMIT 1.9%.% 1. 5.7%.3%.5%.3.1% COMIT LC 1.9%.% 1. 5.7% 5.%.73%.3 53.3% MXCA 1.39%.9%..7% 5.7%.9% 1.3 3.% MXCA LC 1.%.17% 1.37.%.5%.5% NaN.% MXEF 1.35%.% 1.5 3.7% 5.9%.9%.3.1% CRYTR 1.%.3%.3 3.% 5.3% 3.9%. 3.9% NaN ( not a number ) denotes test failed. Research Paper www.risk.net/journal

1 D. Brandolini and S. Colucci TABLE Independence test on historical simulation VaR estimation. p D 1% p D 5% ƒ ƒ Index 1 11 LR ind p-value 1 11 LR ind p-value S&P 5 1.1% 5.5%.1 1.5% 5.% 9.7%.5 3.% TPX 1.% 11.7% 1.1.5% 5.1% 1.5%.79.9% TPX LC 1.%.%.5 3.1% 5.31% 7.55% 1.3 5.7% DAX 1.9% 1.% 1..1%.3%.%..% DAX LC 1.% 1.% 11.3.% 5.53% 1.1% 1.13.% MXGB 1.% 11.5% 13.35.3% 5.7% 17.1% 31.1.% MXGB LC 1.9% 1.% 1..1% 5.11% 15.%.95.% MXFR 1.53% 1.% 1..% 5.91% 13.37% 1.7.3% MXFR LC 1.39% 9.9%.5.1%.% 1.75% 5. 1.93% COMIT 1.% 9.7% 9..3% 5.% 11.7% 9.5.% COMIT LC 1.%.%.5 3.1% 5.3% 1.1% 17..% MXCA 1.%.5% 1.9 1.13% 5.% 1.7% 7.53.% MXCA LC 1.9%.5%..%.95% 13.% 1.9.% MXEF 1.%.3%.1 5.3%.3% 1.7% 5.13.% CRYTR 1.3%.% 1..1% 5.31% 7.55% 1.3 5.7% TABLE 5 Conditional coverage test on filtered bootstrap VaR estimation. p D 1% p D 5% ƒ ƒ Index 1 11 LR CC p-value 1 11 LR CC p-value S&P 5 1.39%.9% 1.57.51% 5.9%.5%..7% TPX 1.1%.33%. 1.5% 5.3% 5.%.53 7.% TPX LC 1.35%.5% 3. 1.3% 5.%.1% 1.7 3.% DAX 1.%.%.1.59%.%.7% 7.3.5% DAX LC 1.%.33% 5. 5.%.9%.% 1..% MXGB 1.%.1% 1..% 5.% 5.1%.51 1.% MXGB LC 1.39%.%.5 1.5% 5.1%.9%.9.% MXFR 1.9%.5% 1..%.51% 7.9% NaN.% MXFR LC 1.3%.5% 3.33 1.9% 5.1%.71%.55 7.93% COMIT 1.9%.%.9 3.%.3%.5% 9. 1.9% COMIT LC 1.9%.%.9 3.% 5.%.73% 3.9 1.1% MXCA 1.39%.9% 1.57.51% 5.7%.9%.9.3% MXCA LC 1.35%.17%. 13.1%.5%.5% 5..1% MXEF 1.%.% 11.53.31% 5.9%.9% NaN.% CRYTR 1.%.3% 5.1 7.5% 5.3% 3.9%.9 3.% NaN ( not a number ) denotes test failed. Journal of Risk Model Validation Volume /Number, Winter 1/13

Backtesting value-at-risk 11 TABLE Conditional coverage test on historical simulation VaR estimation. p D 1% p D 5% ƒ ƒ Index 1 11 LR CC p-value 1 11 LR CC p-value S&P 5 1.1% 5.5%.7.% 5.% 9.7%.9 3.5% TPX 1.% 11.7% 1.5.1% 5.1% 1.5%.3 1.5% TPX LC 1.%.% 11.15.3% 5.31% 7.55%.5 9.3% DAX 1.9% 1.% 5..%.3%.%.7.% DAX LC 1.% 1.%..% 5.53% 1.1%.9.% MXGB 1.% 11.5% 7.5.% 5.7% 17.1% 37.1.% MXGB LC 1.9% 1.% 5..% 5.11% 15.% 3.3.% MXFR 1.53% 1.%.7.% 5.91% 13.37% 3..% MXFR LC 1.39% 9.9% 1.75.%.% 1.75% 15.9.3% COMIT 1.% 9.7% 13.51.1% 5.% 11.7% 13..13% COMIT LC 1.%.% 11.15.3% 5.3% 1.1%.7.% MXCA 1.%.5% 7..1% 5.% 1.7% 3.77.% MXCA LC 1.9%.5% 1..%.95% 13.% 1.5.1% MXEF 1.%.3%..37%.3% 1.7% 5.7.% CRYTR 1.3%.% 1.5.% 5.31% 7.55%.5 9.3% We conclude that FB seems to be superior to HS in terms of UC, independence and CC. In particular, FB is less vulnerable to periods of clustered large losses. 5 REGULATORS BACKTESTING PROCEDURE The CESR s guidelines state that backtesting results must be in line with the selected VaR 1% confidence interval, ie, in the last 5 rolling days the hit function must present at most seven failures (hits or overshootings) at 99% confidence or six failures at 95% confidence. We test VaR 1% at the 95% confidence level to have an early warning on model performances, which can be translated in terms of frequency as Œ%; :% (see Table 7 on the next page). If the hit ratio is higher than.%, then some kind of measure has to be taken in order to reduce the risk model misspecification, while, if there are no failures, nothing Where the backtesting results give rise to consistently inaccurate estimates and an unacceptable number of overshootings (that is to say, that the number of overshootings is not in line with the confidence interval selected for the calculation of the VaR), competent authorities reserve the right to take measures and eg, apply stricter criteria to the use of VaR or, if need be, to disallow the use of the model for the purpose of measuring global exposure. The competent authorities may, for example, also require that results of the calculation of the UCITS VaR to be scaled up by a multiplication factor. (See CESR (1).) Research Paper www.risk.net/journal

1 D. Brandolini and S. Colucci TABLE 7 Number of hits, corresponding frequency and associated p-value. Number of hits Frequency (%) p-value (%) 1. 7.. 7. 3 1. 75. 1. 3. 5. 1.. 5.9 7. 1.9 has to be done, even if this is statistically incorrect. In the following paragraphs we will examine the empirical results applying this backtesting procedure to VaR 1% forecasts on a one-day horizon with both FB and HS. Figure 1 on the facing page shows, for nine indexes, the time series for the two models (FB and HS) together with the frequency level consistent with VaR 1% forecasts (.%). Typically, the FB model is compliant more time than the HS model, the latter suffering from overshooting dependence. In Table on page 1 and Table 9 on page 15 we summarize the results of the extensive backtesting procedure performed for filtered bootstrap and historical simulation models on a sample of equity indexes (in local currency and in US$) according to regulatory rules. When FB overshoots the backtesting hit ratio, the number of failures is always lower than in the HS case, meaning that the FB model is wrong for a shorter period of time (see the first two columns of Table on page 1). This is particularly clear during and, when market volatility was very high, and FB was better able than HS to catch risk jumps in equity markets. Thanks to the independence property, the FB model is compliant with the maximum failures allowed by regulators in every market for a longer period of time than HS (see the last two columns of Table 9 on page 15). On average, FB has 3. violations but with a standard deviation of, and HS has 3. violations with a standard deviation of. The ability of the FB model to quickly adjust the risk forecast to market conditional volatility is also evident in the opposite way, ie, when volatility is going down. Therefore, the FB model, having a greater ability to catch the ups and downs of portfolio risk estimates, is more suitable not only to comply with regulatory rules, but also to be used for a portfolio risk policy that dynamically controls the portfolio risk budget utilization. Journal of Risk Model Validation Volume /Number, Winter 1/13

Backtesting value-at-risk 13 FIGURE 1 time series in FB and HS (in percent). (a) 1 (b) 1 (c) 1 (d) 1 (e) 1 (f) 1 (g) 1 (h) 1 (i) 1 (a) S&P 5 VaR backtest regulator framework. (b) Topix VaR backtest regulator framework. (c) DAX VaR backtest regulator framework. (d) MSCI GB VaR backtest regulator framework. (e) MSCI FRANCE VaR backtest regulator framework. (f) COMIT VaR backtest regulator framework. (g) MSCI CANADA VaR backtest regulator framework. (h) MSCI EMERGING VaR backtest regulator framework. (i) CRYTR VaR backtest regulator framework. Thick black line: FB. Gray line: HS. Dashed black line: upper bound VaR 1%. All values are in local currency. Research Paper www.risk.net/journal

Journal of Risk Model Validation Volume /Number, Winter 1/13 TABLE Summary statistics of two models with respect to regulatory rules. FB HS FB HS Number Number Number Number of days of days of days of days not compliant not compliant Max Max Index not compliant not compliant in and in and FB HS S&P 5 9 (1.9%) 55 (.%) 135 (7.%) 53 (5.%).%.% TOPIX 7 (7.7%) (3.%) (.%) 5 (11.%) 3.%.% TOPIX LC 111 (.1%) 5 (17.%) (.%) 1 (.%).% 5.% DAX 9 (.%) (3.9%) 13 (.%) 9 (53.%) 3.%.% DAX LC 13 (.%) 9 (3.5%) 1 (.%) 19 (39.%) 3.% 5.% MSCI UK 311 (11.%) 53 (31.9%) 33 (.%) 31 (3.%) 3.%.% MSCI UK LC (3.1%) 9 (33.%) (.%) 31 (3.%).% 5.% MSCI France (15.9%) 95 (33.%) 3 (.%) 31 (.%) 3.%.% MSCI France LC 119 (.%) (.%) 3 (.%) 35 (1.%) 3.% 5.% COMIT 11 (.%) 7 (17.7%) (.%) 7 (13.%).%.% COMIT LC 37 (13.7%) 37 (1.5%) (1.%) 75 (15.%) 3.%.% MSCI Canada 111 (.1%) 13 (15.%) 111 (.%) 1 (.%).% 7.% MSCI Canada LC 57 (1.%) 5 (19.5%) 35 (7.%) 1 (.%).%.% MSCI Emerging 5 (.%) (1.9%) (1.%) 5 (17.%) 3.%.% Markets RJ/CRB 7 (.%) 33 (13.%) (.%) 139 (7.%).% 7.% 1 D. Brandolini and S. Colucci

Research Paper www.risk.net/journal TABLE 9 Statistics of violation of two models and time span on zero and over six hits. FB HS FB HS FB time HS time FB time HS time violations violations violations violations span on span on span in span in Index mean mean std std hit hit 1 hits 1 hits S&P 5 3... 5..1% 5.9% 7.% 53.7% TOPIX.. 1.9.3 13.3% 3.9% 79.% 73.1% TOPIX LC 3.3 3.5 1.7.9.% 13.% 95.9% 7.1% DAX 3.7.3 1.9.7 1.%.% 9.%.7% DAX LC 3.7 3.9 1. 3.9.% 3.5% 91.9%.% MSCI UK.. 1.9..% 3.%.% 37.9% MSCI UK LC 3..1 1.5.1.% 35.% 9.9% 3.% MSCI France 3..1.. 11.3% 3.% 7.% 3.% MSCI France LC 3.3 3.7 1. 3..7% 3.% 9.% 3.% COMIT 3. 3...1.% 3.% 91.1% 9.9% COMIT LC 3. 3.7 1. 3.9.%.9%.3% 5.7% MSCI Canada 3. 3. 1.7.5 1.% 1.% 9.1%.7% MSCI Canada LC.1 3...9.9%.% 73.9% 5.% MSCI Emerging 3. 3. 1.9 3. 7.%.9% 3.% 5.1% Markets RJ/CRB 3.. 1.7 3.9 3.% 9.9% 9.% 7.% Backtesting value-at-risk 15

1 D. Brandolini and S. Colucci The backtesting analysis also suggests some remarks on regulatory rules. First of all, overly conservative models always stay at the boundary (or even never producing any VaR violations at all) will be judged compliant, although they are highly questionable statistically (Kupiec (1995)). In particular, a model producing no VaR violations ( D %) cannot pass coverage and independence tests (see columns 5 and of Table 9 on the preceding page). In addition, regulatory rules are not concerned with the distribution of violations: it does not matter much whether a model fails with seven overshootings in a row, or with seven violations distributed over 5 days (although the model will have been compliant for most of the time). The latter feature seems much less worrying than the former, but according to the backtesting hit ratio this does not make any difference in evaluating risk models. REFERENCES Artzner, P., Delbaen, F., Eber, J.-M., and Heath, D. (1999). Coherent measures of risk. Mathematical Finance 9(3), 3. Barone-Adesi, G. K., Giannopoulos, K., and Vosper, L. (1999). VaR without correlations for portfolios of derivative securities. Journal of Futures Markets 19(5), 53. Barone-Adesi, G. K., Giannopoulos, K., and Vosper, L. (). Backtesting derivative portfolios with filtered historical simulation (FHS). European Financial Management (1), 31 5. Basel Committee on Banking Supervision (199). Amendment to the capital accord to incorporate market risks. Bank for International Settlements (January). URL: www.bis.org/publ/bcbs.pdf. Brandolini, D., Pallotta, M., and Zenti, R.(). Risk management in an asset management company: a practical case. Working Paper. CESR (1). CESR s guidelines on risk measurement and the calculation of global exposure and counterparty risk for UCITS. Report, CESR/1-7. Christoffersen, P. (199). Evaluating interval forecasts. International Economic Review 39, 1. Christoffersen, P. (3). Elements of Financial Risk Management. Academic Press, San Diego, CA. Christoffersen, P., and Pelletier, D. (). Backtesting value-at-risk: a duration-based approach. Journal of Financial Econometrics (1), 1. Christoffersen, P., Hahn, J., and Inoue, A. (1). Testing and comparing value-at-risk measures. Journal of Empirical Finance, 35 3. Kupiec, P. (1995). Techniques for verifying the accuracy of risk measurement models. Journal of Derivatives 3, 73. Pallotta, M., and Zenti, R. (). Risk analysis for asset managers: historical simulation, the bootstrap approach and value at risk calculation. Working Paper. Journal of Risk Model Validation Volume /Number, Winter 1/13