SELFIS: A Short Tutorial

Similar documents
Long memory in volatilities of German stock returns 1

GARCH Models. Instructor: G. William Schwert

LONG MEMORY IN VOLATILITY

Financial Econometrics

High-Dimensional Time Series Modeling for Factors Driving Volatility Strings

Chapter Introduction

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay. Solutions to Midterm

Basic Procedure for Histograms

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2011, Mr. Ruey S. Tsay. Solutions to Final Exam.

This homework assignment uses the material on pages ( A moving average ).

Trends in currency s return

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2009, Mr. Ruey S. Tsay. Solutions to Final Exam

WEB APPENDIX 8A 7.1 ( 8.9)

STAT758. Final Project. Time series analysis of daily exchange rate between the British Pound and the. US dollar (GBP/USD)

2.1 Properties of PDFs

The Great Moderation Flattens Fat Tails: Disappearing Leptokurtosis

A gentle introduction to the RM 2006 methodology

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Midterm

σ e, which will be large when prediction errors are Linear regression model

Statistical Analysis of Data from the Stock Markets. UiO-STK4510 Autumn 2015

Linda Allen, Jacob Boudoukh and Anthony Saunders, Understanding Market, Credit and Operational Risk: The Value at Risk Approach

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay. Solutions to Midterm

Financial Time Series Analysis (FTSA)

THE UNIVERSITY OF CHICAGO Graduate School of Business Business 41202, Spring Quarter 2003, Mr. Ruey S. Tsay

ME3620. Theory of Engineering Experimentation. Spring Chapter III. Random Variables and Probability Distributions.

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

1 Volatility Definition and Estimation

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay Solutions to Final Exam

Market Risk Prediction under Long Memory: When VaR is Higher than Expected

INFORMATION EFFICIENCY HYPOTHESIS THE FINANCIAL VOLATILITY IN THE CZECH REPUBLIC CASE

Empirical Analysis of the US Swap Curve Gough, O., Juneja, J.A., Nowman, K.B. and Van Dellen, S.

TESTING STATISTICAL HYPOTHESES

Tests for One Variance

Lecture 5a: ARCH Models

Scaling power laws in the Sao Paulo Stock Exchange. Abstract

CHAPTER-3 DETRENDED FLUCTUATION ANALYSIS OF FINANCIAL TIME SERIES

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Midterm

Occasional Paper. Risk Measurement Illiquidity Distortions. Jiaqi Chen and Michael L. Tindall

Models of Patterns. Lecture 3, SMMD 2005 Bob Stine

Chapter 6 Forecasting Volatility using Stochastic Volatility Model

Introduction to Time Series Analysis. Madrid, Spain September Case study: exploring a time series and achieving stationarity

INSTITUTE OF ACTUARIES OF INDIA EXAMINATIONS. 20 th May Subject CT3 Probability & Mathematical Statistics

Two Hours. Mathematical formula books and statistical tables are to be provided THE UNIVERSITY OF MANCHESTER. 22 January :00 16:00

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Final Exam

Value at Risk and Self Similarity

ARIMA ANALYSIS WITH INTERVENTIONS / OUTLIERS

Strategies for High Frequency FX Trading

Volatility Analysis of Nepalese Stock Market

Software Tutorial ormal Statistics

Model Construction & Forecast Based Portfolio Allocation:

Introduction to Computational Finance and Financial Econometrics Descriptive Statistics

John Hull, Risk Management and Financial Institutions, 4th Edition

Web Appendix. Are the effects of monetary policy shocks big or small? Olivier Coibion

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2016, Mr. Ruey S. Tsay. Solutions to Midterm

Quantitative relations between risk, return and firm size

Modeling the volatility of FTSE All Share Index Returns

ARCH and GARCH models

Yafu Zhao Department of Economics East Carolina University M.S. Research Paper. Abstract

Quantile Regression due to Skewness. and Outliers

PRICE DISTRIBUTION CASE STUDY

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Simulation Lecture Notes and the Gentle Lentil Case

Financial Econometrics: Problem Set # 3 Solutions

Properties of the estimated five-factor model

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Estimation of Long Memory in Volatility

M249 Diagnostic Quiz

Amath 546/Econ 589 Univariate GARCH Models: Advanced Topics

8.1 Estimation of the Mean and Proportion

REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING

2.4 STATISTICAL FOUNDATIONS

Universal Properties of Financial Markets as a Consequence of Traders Behavior: an Analytical Solution

Lecture 6: Non Normal Distributions

Lecture 8: Markov and Regime

WC-5 Just How Credible Is That Employer? Exploring GLMs and Multilevel Modeling for NCCI s Excess Loss Factor Methodology

INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN ENGINEERING AND TECHNOLOGY (IJARET)

Compartmentalising Gold Prices

Non-stationary volatility with highly anti-persistent increments: An alternative paradigm in volatility modeling?

Financial Econometrics Jeffrey R. Russell. Midterm 2014 Suggested Solutions. TA: B. B. Deng

Indian Institute of Management Calcutta. Working Paper Series. WPS No. 797 March Implied Volatility and Predictability of GARCH Models

Financial Economics (I) Instructor: Shu-Heng Chen Department of Economics National Chengchi University

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Final Exam

STATISTICAL ANALYSIS OF HIGH FREQUENCY FINANCIAL TIME SERIES: INDIVIDUAL AND COLLECTIVE STOCK DYNAMICS

Mixed models in R using the lme4 package Part 3: Inference based on profiled deviance

Lecture 9: Markov and Regime

Financial Returns: Stylized Features and Statistical Models

Shifting our focus. We were studying statistics (data, displays, sampling...) The next few lectures focus on probability (randomness) Why?

University of New South Wales Semester 1, Economics 4201 and Homework #2 Due on Tuesday 3/29 (20% penalty per day late)

Absolute Return Volatility. JOHN COTTER* University College Dublin

I. Time Series and Stochastic Processes

Economics 483. Midterm Exam. 1. Consider the following monthly data for Microsoft stock over the period December 1995 through December 1996:

Financial Econometrics

Example 1 of econometric analysis: the Market Model

Dependence Structure and Extreme Comovements in International Equity and Bond Markets

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Midterm

Chapter IV. Forecasting Daily and Weekly Stock Returns

Value at Risk with Stable Distributions

Annual VaR from High Frequency Data. Abstract

Prerequisites for modeling price and return data series for the Bucharest Stock Exchange

STOCK MARKET EFFICIENCY, NON-LINEARITY AND THIN TRADING EFFECTS IN SOME SELECTED COMPANIES IN GHANA

Transcription:

SELFIS: A Short Tutorial Thomas Karagiannis (tkarag@csucredu) November 8, 2002 This document is a short tutorial of the SELF-similarity analysis software tool Section 1 presents briefly useful definitions Section 2 describes the bucket shuffling methodology Finally, section 3 is a quick manual of how to use SELFIS 1 Definitions Autocorrelation Function (ACF): Autocorrelation is a statistical measure of the relationship, if any, between a random variable and itself, at different time lags The autocorrelation coefficient can range between +1 (very high positive correlation) and -1 (very high negative correlation) The Autocorrelation function (ACF) shows the value of the autocorrelation coefficient for different time lags k: ρ(k) = E[(X t µ)(x t+k µ)] σ 2 Long-Range Dependence: Long-range dependence measures the memory of a process Intuitively, distant events in time are correlated This correlation is captured by the autocorrelation function (ACF), which measures how similar is a series with shifted version of itself For LRD data the ACF decays very slowly to zero On the contrary, short-range dependence is characterized by quickly decaying correlations (eg ARMA processes) The strength of the long-range dependence is quantified by the Hurst exponent (H) A series exhibits LRD when 1 2 < H < 1 Furthermore, the closer H is to 1, the stronger the dependence of the process is

More rigorously, a stationary process X t has long-memory or is longrange dependent, if there exists a real number α (0, 1) and a constant c p > 0 such that lim k ρ(k)/[c pk α ] = 1 where ρ(k) the sample Autocorrelation function (ACF) The definition states that the autocorrelation function of long-memory processes, decays to zero with rate approximately k α, where H = 1 α 2 is the Hurst exponent Hurst Estimators: There are many estimators that are used to estimate the value of the Hurst parameter Below is a list of the estimators implemented in SELFIS 1 Absolute Value method, where an aggregated series X (m) is defined, using different block sizes m The log-log plot of the aggregation level versus the absolute first moment of the aggregated series X (m) should be a straight line with slope of H-1, if the data are long-range dependent 2 Variance method, where the log-log plot of the sample variance versus the aggregation level must be a straight line with slope β greater than -1 In this case H = 1 + β 2 3 R/S methoda log-log plot of the R/S statistic versus the number of points of the aggregated series should be a straight line with the slope being an estimation of the Hurst exponent 4 Periodogram method This method plots the logarithm of the spectral density of a time series versus the logarithm of the frequencies The slope provides an estimate of H The Periodogram is given by I(ν) = 1 N 2πN X(j)e ijν 2 where ν is the frequency, N is the length of the time series and X is the actual time series 5 Whittle estimator The method is based on the minimization of a likelihood function, which is applied to the Periodogram of the time series It gives an estimation of H and produces the confidence interval It does not produce a graphical output j=1

6 Variance of Residuals A log-log plot of the aggregation level versus the average of the variance of the residuals of the series should be a straight line with slope of H/2 7 Abry-Veitch Wavelets are used in order to estimate the Hurst exponent The energy of the series in various scales is studied to provide an estimate 2 Bucket Shuffling Bucket shuffling decouples the short-range from long-range correlations in a series to study the effects of long-range dependence This is achieved through partitioning the time series into a set of buckets of length b Thus, we define the contents of the uth bucket to be items X u b,, X (u+1) b 1 from the series, and the home of item X i to be bucket H(i) i/b Also, we say that two items (X i, X j ) form an inbucket pair if H(i) = H(j); otherwise, they form an outbucket pair with an offset of H(i) H(j) buckets Note that this classification depends on the (fixed) locations of the bucket boundaries, and not just the separation between two items in the time series Once the series has been partitioned in this way, we can then apply one of the following bucket shuffling algorithms to reorder its items: External Shuffling (EX): The order of buckets is shuffled, whereas the content of each bucket remains intact This can be achieved by labelling each bucket with a bucket-id between 0 and Time-SeriesLength/b, and shuffling the bucket-ids External shuffling preserves all correlations among the inbucket pairs, while equalizing all correlations among the outbucket pairs with different offsets Thus, if the series is sufficiently long, the ACF should not exhibit significant correlations beyond the bucket size Internal Shuffling (IN): The order of the buckets remains unchanged while the contents of each bucket are shuffled As a result, correlations among the inbucket pairs are equalized, while correlations among the outbucket pairs are preserved, but rounded to a common value for each offset Thus, if the original signal has long-memory, then the ACF of the internally-shuffled series will still show power-law behavior Two-Level Shuffling (2L): Each bucket is further subdivided into a series of atoms of size a Thereafter, we apply external shuffling to the block of b/a atoms within each bucket As a result,

both short-range correlations (within each atom) and long-range correlations (across multiple buckets) are preserved, while medium-range correlations (across multiple atoms within the same bucket) are equalized 3 SELFIS Manual Input: The input file must be a timeseries in the following two formats: x1 y1 x2 y2 x3 y3 OR y1 y2 y3 y4 The input file must have at least 64 values When the file is plotted the X axis always denotes the number of observations in the input file and not the true value of the x1,x2,x3 if any A sample input file is fgn08, which was generated with a fractional Gaussian noise generator Hurst Exponent Estimation: In the Hurst Exponent Estimation menu item the preferred estimator can be selected after a file was opened Note that depending on the size of the input file, the Whittle and Abry-Veitch estimators may take up to a few minutes to compute the Hurst exponent The Run All Estimators option opens a new form that displays the results for each of the seven estimators Bucket Shuffling: In the Bucket Shuffling menu item, three types of bucket shuffling can be performed The Restore Original Data option cancels any shuffling that was performed and re-plots the original

input file After performing any shuffling all the estimations (Hurst estimation, Autocorrelation function, etc) refer to the new shuffled time series You can also save the shuffled time series with the save option in the file menu item Tools: The Tools menu item includes the calculation of basic statistics (mean, variance, standard deviation, skewness, kurtosis), the estimation of the autocorrelation function and the power spectrum and the zoom in and out in the plot You can also zoom in by defining a rectangle by clicking in the plot and dragging the mouse The rectangle defines the area that will be shown Zooming out can be performed by right mouse click inside the plot ACKNOWLEDGEMENTS I am especially grateful to Dr M Faloutsos, Dr M Molle and Dr R H Riedi for their help and contributions throughout this project Thanks are also due to Theodoros Folias for the splash screen of SELFIS