Modeling Trade Direction

Similar documents
The Reporting of Island Trades on the Cincinnati Stock Exchange

Market Microstructure Invariants

Classification of trade direction for an equity market with price limit and order match: evidence from the Taiwan stock market

Lecture 4. Market Microstructure

The Accuracy of Trade Classification Rules: Evidence from Nasdaq

Adaptive Monitoring of Intraday Market Data

Internet Appendix: High Frequency Trading and Extreme Price Movements

Hierarchical Generalized Linear Models. Measurement Incorporated Hierarchical Linear Models Workshop

Random Walks, liquidity molasses and critical response in financial markets

Potential Pilot Problems. Charles M. Jones Columbia Business School December 2014

The Variability of IPO Initial Returns

Is Information Risk Priced for NASDAQ-listed Stocks?

Decimalization and Illiquidity Premiums: An Extended Analysis

Execution Quality in Open Outcry Futures Markets

Do Retail Trades Move Markets? Brad Barber Terrance Odean Ning Zhu

Market MicroStructure Models. Research Papers

High Frequency Autocorrelation in the Returns of the SPY and the QQQ. Scott Davis* January 21, Abstract

Financial Econometrics Jeffrey R. Russell. Midterm 2014 Suggested Solutions. TA: B. B. Deng

Evaluation of the biases in execution cost estimation using trade and quote data $

On the Performance of the Tick Test

Return Volatility, Market Microstructure Noise, and Institutional Investors: Evidence from High Frequency Market

Inflation Dynamics During the Financial Crisis

Updating traditional trade direction algorithms with liquidity motivation

On the importance of timing specifications in market microstructure research

Subsidizing Liquidity: The Impact of Make/Take Fees on Market Quality

Price Impact and Optimal Execution Strategy

ONLINE APPENDIX Inverted Fee Structures, Tick Size, and Market Quality

A Comparison of the Results in Barber, Odean, and Zhu (2006) and Hvidkjaer (2006)

Washington University Fall Economics 487

Modeling dynamic diurnal patterns in high frequency financial data

Real-time Volatility Estimation Under Zero Intelligence

ALL THINGS CONSIDERED, TAXES DRIVE THE JANUARY EFFECT. Abstract

ISSN BWPEF Probability of Informed Trading and Volatility for an ETF. Dimitrios Karyampas Birkbeck, University of London

Order flow and prices

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

Business School Discipline of Finance. Discussion Paper

Dividend drop ratios and tax theory: An intraday analysis under different tax and price quoting regimes

Change in systematic trading behavior and the cross-section of stock returns during the global financial crisis: Fear or Greed?

Washington University Fall Economics 487. Project Proposal due Monday 10/22 Final Project due Monday 12/3

Latency and liquidity provision in a limit order book Julius Bonart and Martin D. Gould

Term Premium Dynamics and the Taylor Rule. Bank of Canada Conference on Fixed Income Markets

A Synthesis of Accrual Quality and Abnormal Accrual Models: An Empirical Implementation

Tilburg University. An Empirical Analysis of the Role of the Trading Intensity in Information Dissemination on the NYSE Spierdijk, L.

Small Sample Performance of Instrumental Variables Probit Estimators: A Monte Carlo Investigation

Market Microstructure Invariants

Order toxicity and liquidity crisis: An academic point of view on Flash Crash

Financial Econometrics

Forecasting prices from level-i quotes in the presence of hidden liquidity

Bid-Ask Spreads: Measuring Trade Execution Costs in Financial Markets

Data Sources. Olsen FX Data

Short Sales, Long Sales, and the Lee-Ready Trade Classification Algorithm Revisited

Three essays on corporate acquisitions, bidders' liquidity, and monitoring

A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples

Supplementary online material to Information tradeoffs in dynamic financial markets

A Liquidity Motivated Algorithm for Discerning Trade Direction

Empirical Asset Pricing for Tactical Asset Allocation

Liquidity surrounding Sell-Side Equity Analyst Recommendation Revisions on the Australian Securities Exchange

Penny Quoting Pilot Program Report

SOLUTION Fama Bliss and Risk Premiums in the Term Structure

GARCH Models. Instructor: G. William Schwert

Agenda 1. May 6th General Market Context 2. Preliminary Findings 3. Initial Q&A 4. Next Steps and Analysis 5. Closing Q&A

Is the Potential for International Diversification Disappearing? A Dynamic Copula Approach

Technical Document Market Microstructure Database Xetra

Volatility Risk Pass-Through

Ultra High Frequency Volatility Estimation with Market Microstructure Noise. Yacine Aït-Sahalia. Per A. Mykland. Lan Zhang

Participation Strategy of the NYSE Specialists to the Trades

Discussion of The Term Structure of Growth-at-Risk

Large tick assets: implicit spread and optimal tick value

Market Transparency Jens Dick-Nielsen

Retrospective. Christopher G. Lamoureux. November 7, Experimental Microstructure: A. Retrospective. Introduction. Experimental.

NYSE Execution Costs

Sensitivity Analysis for Unmeasured Confounding: Formulation, Implementation, Interpretation

Predicting RMB exchange rate out-ofsample: Can offshore markets beat random walk?

Earnings Announcements and Intraday Volatility

Caught on Tape: Institutional Trading, Stock Returns, and Earnings Announcements

List of tables List of boxes List of screenshots Preface to the third edition Acknowledgements

U.S. Quantitative Easing Policy Effect on TAIEX Futures Market Efficiency

Caught On Tape: Predicting Institutional Ownership With Order Flow

Context Power analyses for logistic regression models fit to clustered data

Competing Business Models

Actuarial Mathematics and Statistics Statistics 5 Part 2: Statistical Inference Tutorial Problems

Hedging Under Jump Diffusions with Transaction Costs. Peter Forsyth, Shannon Kennedy, Ken Vetzal University of Waterloo

Robert Engle and Robert Ferstenberg Microstructure in Paris December 8, 2014

Estimating Order Imbalance Using Low Frequency. Data

LECTURE 5 The Effects of Fiscal Changes: Aggregate Evidence. September 19, 2018

Global Trading Advantages of Flexible Equity Portfolios

Predictive Regressions: A Present-Value Approach (van Binsbe. (van Binsbergen and Koijen, 2009)

Risks for the Long Run: A Potential Resolution of Asset Pricing Puzzles

Carnets d ordres pilotés par des processus de Hawkes

Market Microstructure. Hans R. Stoll. Owen Graduate School of Management Vanderbilt University Nashville, TN

PRE-CLOSE TRANSPARENCY AND PRICE EFFICIENCY AT MARKET CLOSING: EVIDENCE FROM THE TAIWAN STOCK EXCHANGE Cheng-Yi Chien, Feng Chia University

High Frequency Trading & Microstructural Cost Effects For Institutional Algorithms

The Role of Exponential-Growth Bias and Present Bias in Retirment Saving Decisions

Shades of Darkness: A Pecking Order of Trading Venues

Measurement Effects and the Variance of Returns After Stock Splits and Stock Dividends

MA Advanced Macroeconomics: 11. The Smets-Wouters Model

Lecture 9: Markov and Regime

The relation between bank losses & loan supply an analysis using panel data

On modelling of electricity spot price

The Changing Relation Between Stock Market Turnover and Volatility

Transcription:

UIC Finance Liautaud Graduate School of Business 7 March 2009

Motivation Financial markets trades result from two or more orders. Later arriving order: the initiator (aggressor). Was the initiator a buy or a sell? aka What was the initiating trade s direction? sign? side? Needed for some microstructure research, e.g. price impact. Trades impart a small bias (impact) to the price process. Price impact modeled as function of trade size and direction. Initiating side (buy/sell) is not available in real-time. Fitting impact models is hard, can save $ billions/year. Want to guess initiator as accurately as possible.

Current Thinking Lee and Ready (1991) first considered delays. Compare trade to midpoint with earlier timestamp. 1988 NYSE and AMEX data: 5 seconds; 1987 data: 2 seconds. Resolve ties with tick test (+ and 0+ ticks: buys) The current debate: What method and delay to use? Midpoint test: Vergote (2005) 2s; Henker and Wang (2006) 1s. Bid/ask test, 0s: Ellis, Michaely, and O Hara (2000); Peterson and Sirri (2003). Tick test: Finucane (2000).

Problems with Previous Studies Previous work on trade signing has some problems: Old Data Pre-electronic, pre-decimalization trades. 1987 (Lee and Ready) to 1999 (Henker and Wang). Narrow Data Trades for only a few stocks. 144 (TORQdb) to 401 (Henker and Wang). Biased Data Only large-cap stocks (all preceding studies). Time Skew No simultaneous analyses of NYSE, Nasdaq trades. Polluted? Some now-common problems affect many studies. Why care? This delay is decreasing to nearly 0 seconds. Still a problem: delay decreased, but quote volume increased.

Better Quotes and a Modeled Approach Picking the correct prevailing quotes may be noisy. Instead, try to get close to the prevailing quote. Average quotes across time via approximate delay distribution. Also use an approach that allows for richer models: Include other information (e.g. tick test, bid/ask test); Account for information strengths; Allow for auto-correlated and cross-correlated buys/sells. Acknowledge differences in markets (e.g. NYSE vs. Nasdaq). Accommodate effects of market capitalization, liquidity, etc. Model can even estimate probability of correct prediction.

Model Notation b t,a t,m t = bid, ask, midpoint initiator saw at time t. p t = price of trade at time t; p t = price of trade preceding time t; p t = differing trade price preceding time t; B t = side of trade at time t (1=buy, 0=sell); g = normalized difference function, e.g. log(p t ) log( ˆm t ); J = signed indicator-like function (-1,+1 if p t ˆb t, â t ; 0 else). J needed: estimated quotes may not be decimalized. Bid/Ask Metric 1.0 0.5 0.0 0.5 1.0 Bid Spread Ask 0.02 0.01 0.00 0.01 0.02 Log Price Difference from Midpoint J for 1% spread; τ: 0.1% - - 1% 5%

Trade Direction Model P(B jt = Buy F t ; θ o, c k, d kl ) = π jt = logit(η jt ) η jt = β 0 }{{} bias 0? φ o η jt }{{} AR effect j indexes stocks; l indexes sectors; + β o1 g(p jt, ˆm jt ) + β o2 g(p jt, p }{{} jt ) + β o3 J(p t, ˆb t, â t ) + }{{}}{{} midpoint test tick test bid/ask test + c }{{} k + d kl overall effect }{{} withinsector effect k indexes ten-minute time bins ; o indexes markets. Random effects: handle (+) correlations, pseudoreplication. Instead of φ o η jt AR term, used lagged metrics. (1)

Dataset and Estimation Use ArcaTrade dataset from NYSE Archipelago ECN. Includes initiating side for NYSE, Nasdaq, and AMEX stocks 1. Universe: 2,836 different stocks (2004 Russell 3000 ). Dec 2004: 1, 2 for estimation; 3 31 for out-of-sample testing. In-sample estimation uses almost 2.2 MM observations. Out-of-sample testing uses 16.5 MM observations. Nonlinear parameters found by conjugate direction (CD). CD uses loop: try parameters, estimate quotes, fit GLMM. Penalized quasi-likelihood used to fit GLMM. 1 Volume share: 2.3%, 22.5%, 23.3% of NYSE, Nasdaq, AMEX.

Estimated Model Fixed Effect AMEX Nasdaq NYSE J width τ Overall: 2.1 10 4 (0.3) Delay scale ν 1.66 (0.58) 1.65 (0.65) 0.62 (0.47) Delay rate λ 0.35 (3.7) 0.33 (0.40) 0.78 (0.35) Intercept Overall: 0.06 (0.02) Midpoint 209 (11) 122 (13) Tick 29.4 (8.4) -20.5 (8.5) Bid/Ask 1.20 (0.25) 1.41 (0.02) 2.04 (0.20) Prev. Bid/Ask 0.33 (0.31) -0.14 (0.01) -0.17 (0.05) Random Effect Std. Dev. Time Bin 0.08 (0.01) Sector Time Bin 0.27 (0.03) Overdispersion Parameter: 1.0086

Estimation Summary Negative prior bid/ask coefficient: agrees with bid-ask bounce. Opposite tick coefficient signs: differing short-sale price tests? Random effects non-zero, imply buying/selling correlation of: 0.2% across all stocks in 10-minute period. 2% across same-sector stocks in 10-minute period. Delay parameter fitting preferred old quotes (30s 120s) Indicates ultra-short-term persistence of quote changes. Overdispersion parameter not of practical concern.

Out of Sample: Across Markets Percent of Trades Correctly Classified Market N Modeled EMO LR Tick AMEX 19,435 69.8% 70.3% 59.2% 52.5% Nasdaq 15,220,579 74.3% 72.3% 71.8% 66.7% NYSE 1,264,866 80.7% 79.6% 76.1% 60.7% Overall 16,504,880 74.7% 72.8% 72.1% 66.2% EMO = Ellis, Michaely, and O Hara bid/ask test. LR = Lee and Ready midpoint test. Tick = tick test. Shocker: LR is the current gold standard.

Out of Sample: Across Sectors, Spread, Time Sectors: Best method across all sectors except one (small). Spread: Best method across spread with two exceptions: 0.1% less accurate for 4.4MM trades at ask; and, More abysmal than winner 2 for 30,000 trades at midpoint. Dates: Best method for each out-of-sample date. 2 45.5% vs. 48.8%.

Performance Attribution: Results Change in Percent of Trades Correctly Classified Baseline Convert Tests Add Lag-1 Ad-hoc Full Market N (All Tests) to Metrics Metrics Delay Model AMEX 19,435 67.7% +2.5% +0.4% -0.8% +0.0% Nasdaq 15,220,579 70.3% +3.0% -0.1% +0.9% +0.2% NYSE 1,264,866 79.8% +1.1% -0.6% +0.7% -0.3% Overall 16,504,880 71.1% +2.7% -0.1% +0.9% -0.1% To attribute performance, I fit a series of nested models. Information strength (±1 tests to metrics) gains 1% 3%. Adding lagged bid/ask metric gains 0.4% for AMEX trades. Basic delay model gains 0.8% for NYSE, Nasdaq trades.

Contributions and Further Work Beat next-best method by 1 2% 3 across almost all groupings. Introduced delay theory to estimation of prevailing quotes. Opened doors to richer trade signing models: Use multiple sources of information. Consider strength of information. Correct for microstructure peculiarities. Allow for autocorrelations and cross-correlations. Interaction between volume/volatility/spread and metrics? Shown short- and ultra-short-term buying/selling persistence. Developed Edgeworth expansions for average delays. Conduct experiments to infer BLUPs and make money? 3 cf. Most published results beat EMO or LR by 0.5% in one group.