High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

Similar documents
The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2009, Mr. Ruey S. Tsay. Solutions to Final Exam

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Final Exam

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2011, Mr. Ruey S. Tsay. Solutions to Final Exam.

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Solutions to Final Exam

Market MicroStructure Models. Research Papers

Lecture Note 9 of Bus 41914, Spring Multivariate Volatility Models ChicagoBooth

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay. Solutions to Midterm

Amath 546/Econ 589 Univariate GARCH Models

12. Conditional heteroscedastic models (ARCH) MA6622, Ernesto Mordecki, CityU, HK, 2006.

FE570 Financial Markets and Trading. Stevens Institute of Technology

Financial Econometrics

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay. Solutions to Final Exam

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay Solutions to Final Exam

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay. Solutions to Midterm

Choice Probabilities. Logit Choice Probabilities Derivation. Choice Probabilities. Basic Econometrics in Transportation.

Business Statistics 41000: Probability 3

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29

Mathematics of Finance Final Preparation December 19. To be thoroughly prepared for the final exam, you should

Bivariate Birnbaum-Saunders Distribution

On modelling of electricity spot price

Financial Econometrics Jeffrey R. Russell. Midterm 2014 Suggested Solutions. TA: B. B. Deng

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Midterm

Modeling dynamic diurnal patterns in high frequency financial data

John Hull, Risk Management and Financial Institutions, 4th Edition

Financial Time Series and Their Characterictics

Lecture Note of Bus 41202, Spring 2014: Simple Nonlinear Models & Market Microstructure

ARCH and GARCH models

Estimation of High-Frequency Volatility: An Autoregressive Conditional Duration Approach

1. You are given the following information about a stationary AR(2) model:

Intraday Value-at-Risk: An Asymmetric Autoregressive Conditional Duration Approach

Actuarial Society of India EXAMINATIONS

I. Return Calculations (20 pts, 4 points each)

Financial Econometrics Notes. Kevin Sheppard University of Oxford

Final Exam Suggested Solutions

Financial Time Series Analysis (FTSA)

Intraday Dynamics of Volatility and Duration: Evidence from Chinese Stocks

This homework assignment uses the material on pages ( A moving average ).

Indian Institute of Management Calcutta. Working Paper Series. WPS No. 797 March Implied Volatility and Predictability of GARCH Models

INSTITUTE OF ACTUARIES OF INDIA EXAMINATIONS. 20 th May Subject CT3 Probability & Mathematical Statistics

Financial Data Analysis, WS08/09. Roman Liesenfeld, University of Kiel 1

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2011, Mr. Ruey S. Tsay. Final Exam

Forecasting the Volatility in Financial Assets using Conditional Variance Models

Lecture Note: Analysis of Financial Time Series Spring 2017, Ruey S. Tsay

Financial Returns: Stylized Features and Statistical Models

Statistical Inference and Methods

Testing for efficient markets

Volatility Analysis of Nepalese Stock Market

Estimating Bivariate GARCH-Jump Model Based on High Frequency Data : the case of revaluation of Chinese Yuan in July 2005

Variance clustering. Two motivations, volatility clustering, and implied volatility

Volatility Models and Their Applications

Empirical Analysis of the US Swap Curve Gough, O., Juneja, J.A., Nowman, K.B. and Van Dellen, S.

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology

List of tables List of boxes List of screenshots Preface to the third edition Acknowledgements

Strategies for High Frequency FX Trading

Conditional Heteroscedasticity

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -26 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

Introductory Econometrics for Finance

The test has 13 questions. Answer any four. All questions carry equal (25) marks.

Optimal Hedge Ratio and Hedging Effectiveness of Stock Index Futures Evidence from India

Course information FN3142 Quantitative finance

Continuous Distributions

Algorithmic and High-Frequency Trading

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Volatility Clustering of Fine Wine Prices assuming Different Distributions

Financial Engineering. Craig Pirrong Spring, 2006

Mixed Logit or Random Parameter Logit Model

Modelling Returns: the CER and the CAPM

Stochastic Volatility (SV) Models

3.4 Copula approach for modeling default dependency. Two aspects of modeling the default times of several obligors

ARCH Models and Financial Applications

Random Variables and Probability Distributions

Forecasting Stock Index Futures Price Volatility: Linear vs. Nonlinear Models

A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples

The Constant Expected Return Model

Economics 883: The Basic Diffusive Model, Jumps, Variance Measures. George Tauchen. Economics 883FS Spring 2015

Loss Simulation Model Testing and Enhancement

Lecture 8: Markov and Regime

1.1 Interest rates Time value of money

Financial Risk Management

Is the Potential for International Diversification Disappearing? A Dynamic Copula Approach

The Great Moderation Flattens Fat Tails: Disappearing Leptokurtosis

Lecture 9: Markov and Regime

IEOR E4703: Monte-Carlo Simulation

LONG MEMORY IN VOLATILITY

Risk Management and Time Series

Portfolio construction by volatility forecasts: Does the covariance structure matter?

Idiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective

2.4 STATISTICAL FOUNDATIONS

INTERTEMPORAL ASSET ALLOCATION: THEORY

1 Volatility Definition and Estimation

Example 1 of econometric analysis: the Market Model

Using Halton Sequences. in Random Parameters Logit Models

APPLYING MULTIVARIATE

Lecture 5: Fundamentals of Statistical Analysis and Distributions Derived from Normal Distributions

Time series: Variance modelling

Pricing Default Events: Surprise, Exogeneity and Contagion

Multi-Path General-to-Specific Modelling with OxMetrics

Dynamic Replication of Non-Maturing Assets and Liabilities

GARCH Models for Inflation Volatility in Oman

Transcription:

1 High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5] High-frequency data have some unique characteristics that do not appear in lower frequencies. At this class we have: Nonsynchronous trading Bid-ask spread Duration models Price movements that are in mjultiples of tick size Bivariate models for price changes Time durations between transctions associated with price changes

2 Nonsynchronous Trading Stock tradings do not occur in a synchronous manner. Different stocks have different trading frequencies, and even for a single stock the trading intensity varies from hour to hour and from day to day. We use for daily series the closing price, the last transaction price of the stock in a trading day. So, we incorrectly assume daily returns as an equally spaced time series with a 24-hour interval and it can lead to erroneous conclusions about the predictability of stock returns even if the true return series are serially independent. For daily stock return, nonsynchronous trading can introduce: Lag-1 cross-correlation between stock returns; Lag-1 serial correlation in a portfolio return; In some situations, negative serial correlations of the return series of a single stock.

3 Example: Two independent stocks A and B. A is traded more frequently than B. Suppose a special new that arrives near the closing hour on one day. A is more likely than B to show the effect of the news on the same day. The effect on B will eventually appear, but it may be delayed until the following trading day. So, stock A appears to lead that of stock B. The return series may show a significant lag-1 cross-correlation from A to B even though the two stocks are independent. For a portfolio that holds stocks A and B, the prior cross-correlation would become a significant lag-1 serial correlation.

4 Lo and Mackinlay (1990) model (simplified): Let r t be the continuously compounded return of a security at the time index t. {r t } is a sequence of iid random variables with mean E(r t ) = µ and variance Var(r t ) = σ 2. Let π be the probability that the security is not traded. Let be the observed return. When there is no trade at time index t, so we have = 0. When there is a trade at time index t, we define as the cumulative return from the previous trade,, k t is the largest non-negative integer such that no trade occurred in the periods.

5 there are trades at both t and t-1. Probability is (1-π) 2. there are trades at both t and t-2, but no trade at t-1. Probability is (1- π) 2 π. there are trades at both t and t-3, but no trade at t-1 and t-2, and so on. Probability is (1- π) 2 π 2.

6 As expected, What is the expectation of? We use the result

7 What is the variance of? We know that, so first we need. To see the last equality, let It s easy to see that (1 - π)h = G and (1 - π)g = 1 + 2π + 2π 2 + 2π 3 + 2π 4 + So (1 π) 2 H = 2(1 + π + π 2 + π 3 + ) 1 = term in brackets.

Then, 8

9 What is the lag-1 autocovariance of? We know that, so first we need. We have, is zero if there is no trade at t, no trade at t-1, or no trade at both t and t-1. So, the probability is the sum π(1- π) + π(1- π) + π 2 = 2π - π 2. Notice that there are three consecutive trades at t-2, t-1 and t with probability (1- π) 3. And so on,

It s easy to see that: 2π - π 2 + (1 π) 3 [1 + π + π 2 + π 3 + ] = 1 10

11 Because, we have And, Conclusion: the nonsynchronous trading induces a negative lag-1 autocorrelation in daily return,.

12 In general, When µ 0, the nonsynchronous trading induces negative autocorrelations in an observed security return series. Generalization to the return series of a portfolio see Campbell, Lo an Mackinlay (1997, chapter 3).

13 Bid-Ask Spread Market makers have monopoly rights by the exchange to post different prices for purchases and sales of a security. They buy at the bid price P b and sell at a higher ask price P a. P a P b = bid-ask spread (source of compensation for market makers) Typically, it s small (one or two ticks) Bid-ask bounce = bid-ask spread introduces negative lag-1 serial correlation in an asset return.

14 Roll (1984) model (simplified): The change price process if the fundamental value of the asset does not change is,.

15 Suppose, Then, Conclusion: The bid-ask spread introduces a negative lag-1 serial correlation in the series of observed price changes. But, it does not introduce any serial correlation beyond lad 1. Example: Assuming that the fundamental price is equal to (P a + P b )/2, P t assumes the value P a or P b. Thus, is either 0 or S, if the previous observed price is P a, and either 0 or S, if the previous observed price is P b.

16 Now, assume a more realistic formulation that follows a random walk, independent of {I t }., iid sequence with mean zero and variance σ 2 and It s easy to see that, P t = ε t + (I t I t-1 )S/2 with the same mean and covariance, but Furthermore, is reduced, but negative.

17 Empirical characteristics 1. Unequally spaced time intervals 2. Discrete-value prices 3. Existence of a daily periodic or diurnal pattern 4. Multiple transactions at the same time

18 Example: IBM transactions data from November 1, 1990 to January 31, 1991. (63 trading days and 60.328 transactions) Ignore the price changes between trading days and focusing on the transactions that occurred in the normal trading hours from 9:30 am to 4:00 pm Eastern time. The distribution of positive and negative price changes was approximately symmetric, with high frequency in zero and one ticks. Consider the number of transactions in a 5-minute time interval, x t

19

20 cycle pattern of the ACF with a periodicity of 78, which is the number of 5- minute intervals in a trading day. In other words, the number of transactions exhibits a daily pattern. Now, consider the average number of transactions within 5-minute time intervals over the 63 days. We have 78 such averages. the plot exhibits a smiling or U shape, indicating heavier trading at the opening and closing of the market and thinner trading during the lunch hours.

21

22 Now, consider the classification of price movements: up (+), unchanged (0) and down (-). And the price movements between two consecutive trades: (i-1)th to ith transaction. price reversals in intraday transactions data.

23 Models for price changes 1) Ordered Probit Model [Hauseman, Lo and MacKinlay (1992)] 2) A decomposition Model [McCulloch and Tsay (2000), Rydberg and Shephard (2003)]

24 Ordered Probit Model,. Assumes that is a continuous random variable and follows a model: where x i is a p-dimensional row vector of explanatory variables at time t i-1. The conditional variance is assumed to be a positive function of the explanatory variable w i. In w i, we have the time interval t i = t i t i-1 (time duration) and some conditional heteroscedastic variables. Furthermore, one also assumes that the conditional distribution of ε i given x i and w i is Gaussian.

25 Assume that the observed price change y i may assume k possible values as the number of ticks, in practice. Let {s 1,, s k } the set of k possible values. We observe, Under conditional Gaussian distribution, we have

26 Example: Hauseman, Lo and MacKinlay (1992) IBM, 1998, 206.794 trades. Sample means (standard deviation): y i = -0.001 (0.753) t i = 27.21 (34.13) bid-ask spread = 1.947 (1.4625)

27 Model: where,

28

29

30 boundary partitions are not equally spaced, but almost symmetric with respect to zero. the transaction duration,, affects both the conditional mean and conditional variance of y i. the coefficients of lagged price changes are negative and highly significant, indicating price reversals. the bid-ask spread at time t i-1 significantly affects the conditional variance.

31 A Decomposition Model Decomposition in 3 components: an indicator for price change, the direction of price movement if there is a change, and the size of price change if a change occurs.

32 Let F i be the information set available at the ith transaction. We have an ordered decomposition of the price change probability. Since A i is a binary variables, Let,

Because the price change, if it occurs, is at least 1 tick, we added 1 to the geometric distribution. We take the logistic transformation to ensure that 33

34 Define 3 categories and I i (j), j =1,2,3 its indicator variables. So, Thus, it is possible to use the log likelihood function to estimate the parameters β, γ, θ u, θ d

Example: intraday transactions of IBM stock from November 1, 1990 to January 31, 1991, with 63 trading days and 59.838 intraday transactions in the normal trading hours. 35

36 Some features: the price change may occur in clusters and, as expected, most transactions are without price change: [P(A i = 0 A i-1 = 0) = 0.742] if A i-1 = 0, then the chances for a price increase or decrease at the ith trade are about even the probability of consecutive price increase or decrease are very low. Consequently, the effect of bid-ask bounce and supports price reversals in highfrequency trading.

37 So, the probability of a price increase by one tick is 0.827 at the ith trade if the transaction results in a price increase and S i-1 = 1. The probability reduces to 0.709 if S i-1 = 2 and to 0.556 if S i-1 = 3. Consequently, the probability of a large S i is proportional to S i-1 given that there is a price increase at the ith trade. Note: A difference between the ADS and ordered probit models is that the former does not require any truncation or grouping in the size of a price change.

38 Duration Models Engle and Russel (1998): ACD model - autoregressive conditional duration Concepts similar to the ARCH models for volatility For heavily traded stocks Duration is a positive number, so we need no-gaussian errors Used distributions: exponential (EACD), Weibull (WACD), generalized Gamma (GACD) First, it s necessary to adjust the time duration: where, f(t i ) is a deterministic function consisting of the cyclical component of.

For example: 39

40

41 For IBM data, the coefficients the linear regression, are obtained by the least squares method of The fitted model is:

42

43 The ACD Model Let We have a ACD(r,s) model

44 So, ε i exponential EACD(r,s) model ε i Weibull WACD(r,s) model ε i Gamma GACD(r,s) model Assuming weak stationarity,

45 EACD(1,1) Model Where ε i follows the standard exponential distribution. Under weak stationarity of x i, And

46 Simulation 500 observations from the ACD(1,1) model Using: Standardized Weibull distribution with parameter α = 1.5 Standardized generalized gamma distribution with parameters k = 1.5 and α = 1.5.

47

48

49

50

51

52 Estimation ACD(r,s) model The likelihood function of the durations Use a conditional likelihood method, ignoring For WACD: When α = 1, the (conditional) log likelihood function reduces to that of an EACD(r,s) model.

53 For GACD(r,s) model: See Example 5.3 and 5.4 (pp. 233-236) Note: Estimation of EACD models can be carried out by using programs for ARCH models with some minor modification.

54 Bivariate models for price change and duration Here, we consider jointly the process of price change and the associated duration. Problem: many intraday transactions of a stock result in no price change and they are highly relevant to trading intensity, but don t contain direct information on price movement. focus on transactions that result in a price change and consider a price change and duration (PCD) model to describe the multivariate dynamics of price change and the associated time duration Note: This choice can reduce the sample size dramatically. Example: IBM stock on November 21, 1990 (726 transactions intraday trades, but only 195 with a price change).

55

56 Let t i :the calendar time of the ith change of an asset, measured in seconds. t i = t i - t i-1 : time duration between price changes : transaction price when the ith price change occurred N i : number of trades in the time interval (t i - t i-1 ) that result in no price change D i : the direction of the ith price change [D i = 1, price goes up, and D i = -1, price comes down] S i : size of the ith price change measured in ticks Each transaction consist of for the ith price change. PCD Model: decomposes the joint distribution of given F i-1

57 McCulloch and Tsay (2000): For time duration between price changes: a multiple linear regression model with lagged variables, where σ is a positive number and {ε i } is a sequence of iidn(0,1) random variables. For N i, it s necessary to partitioned into two parts, because empirical data suggest a concentration of N i at 0. exp( α 0 + α1 ln( ti ) A logit model: p( Ni = 0 t, Fi 1) = 1+ exp( α + α ln( t ) 0 1 i g(λ) denotes a geometric distribution with parameter λ.

58 For D i, direction: D i is governed by the sign of a normal random variable with mean µ i and variance. Price reversal feature ϖ 1 negative But the variance equation allows for a local trend by increasing the uncertainty in the direction of price movement when the past data showed evidence of a local trend.

59 For size of a price change: The model can be estimated jointly by the maximum likelihood method See example 5.5 (pp. 240)