Effects of skewness and kurtosis on model selection criteria

Similar documents
Do core inflation measures help forecast inflation? Out-of-sample evidence from French data

MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL

Structural Cointegration Analysis of Private and Public Investment

Window Width Selection for L 2 Adjusted Quantile Regression

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29

Exchange Rate Market Efficiency: Across and Within Countries

Prerequisites for modeling price and return data series for the Bucharest Stock Exchange

MODELLING AND PREDICTING THE REAL MONEY DEMAND IN ROMANIA. Literature review

Forecasting Exchange Rate between Thai Baht and the US Dollar Using Time Series Analysis

COMPARISON OF RATIO ESTIMATORS WITH TWO AUXILIARY VARIABLES K. RANGA RAO. College of Dairy Technology, SPVNR TSU VAFS, Kamareddy, Telangana, India

Non-linear logit models for high frequency currency exchange data

AN EMPIRICAL ANALYSIS OF THE PUBLIC DEBT RELEVANCE TO THE ECONOMIC GROWTH OF THE USA

Trade Openness, Economic Growth and Unemployment Reduction in Arab Region

Are Greek budget deficits 'too large'? National University of Ireland, Galway

Two-term Edgeworth expansions of the distributions of fit indexes under fixed alternatives in covariance structure models

A New Test for Correlation on Bivariate Nonnormal Distributions

MODELLING OF INCOME AND WAGE DISTRIBUTION USING THE METHOD OF L-MOMENTS OF PARAMETER ESTIMATION

A New Multivariate Kurtosis and Its Asymptotic Distribution

Determinants of Stock Prices in Ghana

An Empirical Study on the Determinants of Dollarization in Cambodia *

Brief Sketch of Solutions: Tutorial 1. 2) descriptive statistics and correlogram. Series: LGCSI Sample 12/31/ /11/2009 Observations 2596

Chapter 4 Level of Volatility in the Indian Stock Market

Institute of Actuaries of India Subject CT6 Statistical Methods

Threshold cointegration and nonlinear adjustment between stock prices and dividends

SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION

Financial Econometrics Notes. Kevin Sheppard University of Oxford

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2016, Mr. Ruey S. Tsay. Solutions to Midterm

Forecasting Volatility in the Chinese Stock Market under Model Uncertainty 1

Factor Affecting Yields for Treasury Bills In Pakistan?

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay. Solutions to Midterm

Uncertainty and the Transmission of Fiscal Policy

Stochastic Valuation of Segregated Fund Contracts in an Emerging Market

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Final Exam

Forecasting Volatility movements using Markov Switching Regimes. This paper uses Markov switching models to capture volatility dynamics in exchange

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Midterm

Investigation of Relationship between Stock Prices, Interest Rate and Exchange Rate Fluctuations

ESTIMATION OF MODIFIED MEASURE OF SKEWNESS. Elsayed Ali Habib *

Does Exchange Rate Volatility Influence the Balancing Item in Japan? An Empirical Note. Tuck Cheong Tang

Robust Critical Values for the Jarque-bera Test for Normality

Modeling Exchange Rate Volatility using APARCH Models

Volatility Spillovers and Causality of Carbon Emissions, Oil and Coal Spot and Futures for the EU and USA

INTERNATIONAL JOURNAL FOR INNOVATIVE RESEARCH IN MULTIDISCIPLINARY FIELD ISSN Volume - 3, Issue - 2, Feb

A RIDGE REGRESSION ESTIMATION APPROACH WHEN MULTICOLLINEARITY IS PRESENT

An Empirical Research on Chinese Stock Market Volatility Based. on Garch

Analysis of the Influence of the Annualized Rate of Rentability on the Unit Value of the Net Assets of the Private Administered Pension Fund NN

Recent analysis of the leverage effect for the main index on the Warsaw Stock Exchange

THE IMPACT OF BANKING RISKS ON THE CAPITAL OF COMMERCIAL BANKS IN LIBYA

Models of the Minimum Wage Impact upon Employment, Wages and Prices: The Romanian Case

Monetary Policy Shock Analysis Using Structural Vector Autoregression

Inflation and inflation uncertainty in Argentina,

REAL EXCHANGE RATES AND REAL INTEREST DIFFERENTIALS: THE CASE OF A TRANSITIONAL ECONOMY - CAMBODIA

Testing for the martingale hypothesis in Asian stock prices: a wild bootstrap approach

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay. Solutions to Midterm

A Skewed Truncated Cauchy Logistic. Distribution and its Moments

Asian Economic and Financial Review SOURCES OF EXCHANGE RATE FLUCTUATION IN VIETNAM: AN APPLICATION OF THE SVAR MODEL

Okun s Law - an empirical test using Brazilian data

Application of Conditional Autoregressive Value at Risk Model to Kenyan Stocks: A Comparative Study

Fitting financial time series returns distributions: a mixture normality approach

Power of t-test for Simple Linear Regression Model with Non-normal Error Distribution: A Quantile Function Distribution Approach

Equity, Vacancy, and Time to Sale in Real Estate.

A market risk model for asymmetric distributed series of return

Research Article The Volatility of the Index of Shanghai Stock Market Research Based on ARCH and Its Extended Forms

Université de Montréal. Rapport de recherche. Empirical Analysis of Jumps Contribution to Volatility Forecasting Using High Frequency Data

Nonparametric Estimation of a Hedonic Price Function

The source of real and nominal exchange rate fluctuations in Thailand: Real shock or nominal shock

ESTIMATING SAVING FUNCTIONS WITH A ZERO-INFLATED BIVARIATE TOBIT MODEL * Alessandra Guariglia University of Kent at Canterbury.

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS

Empirical Study on Short-Term Prediction of Shanghai Composite Index Based on ARMA Model

Yafu Zhao Department of Economics East Carolina University M.S. Research Paper. Abstract

THE USE OF THE LOGNORMAL DISTRIBUTION IN ANALYZING INCOMES

LAMPIRAN. Null Hypothesis: LO has a unit root Exogenous: Constant Lag Length: 1 (Automatic based on SIC, MAXLAG=13)

Financial Risk, Liquidity Risk and their Effect on the Listed Jordanian Islamic Bank's Performance

A Demonstration of the Central Limit Theorem Using Java Program

The Economic and Social BOOTSTRAPPING Review, Vol. 31, No. THE 4, R/S October, STATISTIC 2000, pp

A COMPARATIVE ANALYSIS OF REAL AND PREDICTED INFLATION CONVERGENCE IN CEE COUNTRIES DURING THE ECONOMIC CRISIS

Backtesting value-at-risk: Case study on the Romanian capital market

A Skewed Truncated Cauchy Uniform Distribution and Its Moments

Hedging effectiveness of European wheat futures markets

How do stock prices respond to fundamental shocks?

On the Distribution and Its Properties of the Sum of a Normal and a Doubly Truncated Normal

Optimal Window Selection for Forecasting in The Presence of Recent Structural Breaks

Dependence Structure between TOURISM and TRANS Sector Indices of the Stock Exchange of Thailand

Econometric Models for the Analysis of Financial Portfolios

Bayesian Estimation of the Markov-Switching GARCH(1,1) Model with Student-t Innovations

ANALYSIS OF CORRELATION BETWEEN THE EXPENSES OF SOCIAL PROTECTION AND THE ANTICIPATED OLD AGE PENSION

An Improved Version of Kurtosis Measure and Their Application in ICA

Solving real-life portfolio problem using stochastic programming and Monte-Carlo techniques

Market Risk Analysis Volume I

2. Copula Methods Background

THE IMPACT OF FINANCIAL CRISIS IN 2008 TO GLOBAL FINANCIAL MARKET: EMPIRICAL RESULT FROM ASIAN

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Midterm

ECONOMIC GROWTH AND UNEMPLOYMENT RATE OF THE TRANSITION COUNTRY THE CASE OF THE CZECH REPUBLIC

Panel Regression of Out-of-the-Money S&P 500 Index Put Options Prices

Forecasting Stock Index Futures Price Volatility: Linear vs. Nonlinear Models

Empirical Analysis of Private Investments: The Case of Pakistan

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

PROBLEMS OF WORLD AGRICULTURE

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 59

Statistical Evidence and Inference

Effects of skewness and kurtosis on normal-theory based maximum likelihood test statistic in multilevel structural equation modeling

Transcription:

Economics Letters 59 (1998) 17 Effects of skewness and kurtosis on model selection criteria * Sıdıka Başçı, Asad Zaman Department of Economics, Bilkent University, 06533, Bilkent, Ankara, Turkey Received 9 July 1997; accepted 4 February 1998 Abstract We consider the behavior of model selection criteria in AR models where the error terms are not normal by varying skewness and kurtosis. The probability of estimating the true lag order for varying degrees of freedom (k) is the interest. For both small and large samples skewness does not effect the performance of criteria under consideration. On the other hand, kurtosis does effect some of the criteria considerably. In large samples and for large values of k the usual asymptotic theory results for normal models are confirmed. Moreover, we showed that for small sample sizes performance of some newly introduced criteria which were not considered in Monte Carlo studies before are better. 1998 Elsevier Science S.A. Keywords: Model Selection Criteria; AR lag order determination; Robustness JEL classification: C; C51 1. Introduction In a Monte-Carlo study Lutkepohl (1985) compares the finite-sample performance of 1 different identification approaches for AR models. In his study, Schwarz Criterion (SC) (Schwarz, 1978; Rissanen, 1978) and Hannan-Quinn Criterion (HQC) (Hannan and Quinn, 1979; Quinn, 1980) emerge as being the best among the compared criteria. Akaike Information Criterion (AIC) (Akaike, 1973, 1974) also performs well. Koreisha and Pukkila (1993) augment Lutkepohl (1985) and show that the performance of the above mentioned criteria depends on the number of nonzero elements of the matrices of the AR parameters and the maximum possible lag order that is used. These two studies and some other prior theoretical and empirical studies assume that the distribution of the error terms are normal. Our object in this paper is to relax this assumption and compare the performance of above mentioned criteria, as well as a few others, when the AR process may have nonnormal errors. We do this by varying skewness and kurtosis. Since it is known that some of the financial data posses skewness and kurtosis it is important for applied econometricians to know the behaviour of various criteria under such circumstances. We find that although skewness does not have much effect on the performance of criteria while estimating the lag order of the AR model, kurtosis effects the performance of some of the criteria. We reached same conclusion as Lutkepohl (1985) that SC and HQC are best criteria under normal error *Corresponding author. Tel.: 190 31 664769; fax: 190 31 664960/ 665140; e-mail: sbasci@bilkent.edu.tr 0165-1765/ 98/ $19.00 1998 Elsevier Science S.A. All rights reserved. PII S0165-1765(98)00036-6

18 S. Başçı, A. Zaman / Economics Letters 59 (1998) 17 distributions and large samples. More than that we showed that they are also best for nonnormal error distributions if the sample size is large. For small sample sizes performance of other criteria which are not included in Lutkepohl (1985) are better than SC and HQC under the assumptions of both normal and nonnormal error distributions.. Criteria for AR lag order selection Assume that a given set of data X 5 (X, X,...,X )9 is generated by a stationary AR( p) process 0 1 T (allowing for nonzero mean a ): 0 X 5 a 1 ax 1... 1 ax 1 u t 0 1 t1 p tp t where the a i, i50,... p, are unknown parameters and ut is an i.i.d error term with common distribution F. Our object is to find p, the true unknown lag order. It is assumed that there is an a prior maximum order M, so that our estimate for p can be any integer between 0,1,...,M. All prior studies of the issue, both theoretical and empirical, assume the errors to be normally distributed. In this paper, our goal is to assess the performance of these criteria when errors may be nonnormal. From the theory of Edgeworth expansions, we know that the first two terms in asymptotic approximations arising from lack of normality correspond to skewness and kurtosis respectively. Thus a good approach to assess robustness is to vary skewness and kurtosis and study the behavior of the model selection criteria. It appears that skewness does not have much effect on the relative performance of the criteria we studied, while kurtosis does have a great effect. We first briefly review the model selection criteria included in this study. To describe the criteria, let a ˆ,...,aˆ be the OLS estimates of the parameters of the AR( p) model. Let Xˆ 5aˆ 1aˆ 0 p t 0 1Xt1 1 1 T...1aˆpX tp be the OLS estimate of X t. Define the usual variance estimate ˆ s p5(t( p11)) ot5p ˆ (XtX t). The first criteria AIC was introduced by Akaike (1973), and has been a very popular p criterion for model selection. Define AIC( p)5ln(s ˆ p)1 ] T, and estimate p to be that integer between 1,...,M which minimizes the criterion AIC. Shibata (1976) shows that AIC is not consistent but overestimate p asymptotically with positive probability if M $p. Zaman (1984) shows how to calculate the probability distribution of the estimate pˆ produced by AIC. Shibata (1980) discusses an asymptotic efficiency property of AIC. Hurvich and Tsai (1989) makes a bias correction to the AIC, defined by AIC C( p)5t ln(s ˆ p)1 T[11( p/t )]/[1( p1)t]. They show that it is asymptotically efficient if the true model is infinite dimensional. When the true model is finite dimensional AICC chooses the true lag order most often compared to other asymptotically efficient criteria. The Schwarz Criterion, SC( p)5ln (s ˆ p)1p ln(t )/T was introduced by Schwarz (1978) based on Bayesian reasoning. It has the advantage of being consistent over the AIC. This and the Hannan- Quinn criterion HQC( p)5ln (s ˆ p)1p ln(ln(t )) /T both perform well in the Lutkepohl (1985) study, and hence are included in our study. In addition to the AICC which was not part of the Lutkepohl (1985) study, we introduce two other criterion which have not been studied in this context. The criterion PRESS, closely related to Quenouille (1949) jacknife, was first introduced by Allen (1974). This is defined as PRESS( p)5

S. Başçı, A. Zaman / Economics Letters 59 (1998) 17 19 T o t51 (xtx t) where x t, t51,...t, is the predicted value of xt from an AR( p) model after omitting 1 the t-th observation in the process of estimation. The value of p minimizing PRESS is selected. The last criterion included in our study is the often-used sequential F test. We start with the largest model AR(M), and examine the t statistic of a ˆ M. If this is insignificant, we drop it and re-estimate an AR(M1) model. We keep dropping the last coefficient until we get a significant one. In the simulation study we took the significance level as 5%. 3. Simulation results To compare the small sample performance of the criteria discussed in the previous section we have two types of processes where one assumes three lags and the other assumes two lags as the true lag order. a0 is one. In order to have a stationary process we generate a i, i51,,3, from a uniform distribution in the region (1/,1/). We set M56. For our robustness studies, we use two classes of iid error distributions. Let X &(k,);x k. To study the effect of skewness, we considered u t F, where Œ] Œ] 3 Œ] F is the distribution of (X/( k)) k. This has mean 0, variance 1, and Eut 5 k so that skewness increases with k. Fig. 1a,b below give the probability of correctly estimating p using the various criteria for varying k. Fig. 1a has sample size 30, representing the small sample situation, and Fig. 1b has sample size 00, representing the large sample situation. From the figures we can see that skewness does not have much effect on the performance of criteria while estimating the true lag order. Although AICC is the best criterion for sample size 30, it becomes one of the worst criteria for sample size 00. Performance of PRESS is similar to AIC C. Asymptotically consistent criteria, SC and HQC, are not doing that well for sample size 30 but they estimate the true lag most often when we increase sample size to 00. Performance of AIC and sequential F test are close to each other. iid To study the effect of kurtosis, let u t Fk where F is a Student s t distribution scaled to have variance 1. As k increases, the kurtosis decrease, converging to that of the normal asymptotically. Note that skewness is 0 for this class of distributions. When k is one we have the Slash distribution defined in Morgenthaler and Tukey (1991) as a normal divided by a Uniform (0,1). Fig. a,b below give the probabilities for correctly estimating p. We see from the figures that kurtosis has a considerable effect on the criteria under consideration, especially on AICC and PRESS. When k is small, that is for heavy-tailed error distributions, performance of PRESS and sequential F test are good but as k increase, that is when we get closer to normal distribution, we see a decline in their performance. Converse is true for AIC C. The result for AICC is what is to be expected because Hurvich and Tsai (1989) makes the bias correction under the assumption of normality. Results can be seen more clearly for the case of sample size 00. Performance of other criteria slightly increase as we move towards normal distribution when sample size is 30. For sample size 00, it seems that kurtosis 1 There are potential problems which arise from omitting middle observations in time series. We made adjustments to account for such problems but found that such adjustments made no difference in the outcome, and hence have chosen to report results without any adjustments. Since the results for three lags are similar to the ones for two lags, we only present in this paper the results for two lags.

0 S. Bas c ı, A. Zaman / Economics Letters 59 (1998) 17 Fig. 1. (a) Probability of estimating the true lag order (true lag order5, sample size530, a skewed distribution). (b) Probability of estimating the true lag order (true lag order5, sample size500, a skewed distribution).

S. Başçı, A. Zaman / Economics Letters 59 (1998) 17 1 Fig.. (a) Probability of estimating the true lag order (true lag order5, sample size530, t distribution). (b) Probability of estimating the true lag order (true lag order5, sample size500, t distribution). does not effect these criteria. Once again SC and HQC are performing badly when sample size is 30 but they are the best ones when we increase sample size to 00. PRESS and AICC are the worst criteria for sample size 00.

S. Başçı, A. Zaman / Economics Letters 59 (1998) 17 4. Conclusions To conclude we can say that skewness does not effect the probability of estimating the true lag order but kurtosis effects it especially for the criteria PRESS and AIC C. When the sample size is small and error distributions are not heavy-tailed (corresponding to large values of degrees of freedom, k) we suggest the use of AICC for lag order selection. On the other hand, for heavy-tailed error distributions PRESS and sequential F test perform very well. For large samples SC and HQC are definitely the best criteria to be used. We also studied forecasting performance based on models selected by the various criteria. This does not correlate very well with the ability of the model to pick out the true p. In addition, we also studied the probability distribution of the estimate of p. This reveals that PRESS has higher probabilities for models size m satisfying m$p relative to the other criteria. If the maximum size M is reduced the performance of PRESS improves substantially. Results for the normal distribution are very similar to those for t 100. References Akaike, H., 1973. Information theory and an extension of the maximum likelihood principle. In: Petrov, B.N., Csaki, ` F., nd International Symposium on Information Theory. Akademiai ` Kiado, ` Budapest, pp. 67 81. Akaike, H., 1974. A new look at the statistical model identification. IEEE Transactions on Automatic Control AC 19, 716 73. Allen, D.M., 1974. The relationship between variable selection and data augmentation and a method for prediction. Technometrics 16, 15 17. Hannan, E.J., Quinn, B.G., 1979. The determination of the order of an autoregression. Journal of the Royal Statistical Society B41, 190 195. Hurvich, C.M., Tsai, C.L., 1989. Regression and time series model selection in small samples. Biometrika 76 (), 97 307. Koreisha, S.G., Pukkila, T., 1993. Determining the order of a vector autoregression when the number of component series is large. Journal of Time Series Analysis 14 (1), 47 69. Lutkepohl, H., 1985. Comparison of criteria for estimating the order of a vector autoregressive process. Journal of Time Series Analysis 6 (1), 35 5. Quenouille, M., 1949. Approximate tests of correlation in time series. J. Royal Statistical Society Series B11, 18 84. Morgenthaler, S., Tukey, J.W. (Eds.), 1991. Configural Polysampling: A Route to Practicle Robustness. Wiley, New York. Quinn, B.G., 1980. Order determination for a multivariate autoregression. Journal of the Royal Statistical Society B4, 18 185. Rissanen, J., 1978. Modeling by shortest data description. Automatica 14, 465 471. Schwarz, G., 1978. Estimating the dimension of a model. The Annals of Statistics 6, 461 464. Shibata, R., 1976. Selection of the order of an autoregressive model by Akaike s information criterion. Biometrika 63, 117 16. Shibata, R., 1980. Asymptotically efficient selection of the order of the model for estimating parameters of a linear process. The Annals of Statistics 8, 147 164. Zaman, A., 1984. Avoiding model selection by the use of shrinkage techniques. Journal of Econometrics 5, 39 346.