Risk Control of Mean-Reversion Time in Statistical Arbitrage,

Similar documents
Risk control of mean-reversion time in. statistical arbitrage

9.1 Principal Component Analysis for Portfolios

A brief historical perspective on financial mathematics and some recent developments

MS&E 448 Cluster-based Strategy

Financial instabilities with a brief historical perspective on financial mathematics

Some remarks on VIX futures and ETNs

Market Risk Analysis Volume I

Some remarks on VIX futures and ETNs

Modelling Credit Spreads for Counterparty Risk: Mean-Reversion is not Needed

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2009, Mr. Ruey S. Tsay. Solutions to Final Exam

Utility Indifference Pricing and Dynamic Programming Algorithm

Cluster-Based Statistical Arbitrage Strategy

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2011, Mr. Ruey S. Tsay. Solutions to Final Exam.

Modeling Capital Market with Financial Signal Processing

Does Portfolio Theory Work During Financial Crises?

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology

On modelling of electricity spot price

Portfolio Optimization

u (x) < 0. and if you believe in diminishing return of the wealth, then you would require

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay Solutions to Final Exam

Rough volatility models: When population processes become a new tool for trading and risk management

Linearity-Generating Processes, Unspanned Stochastic Volatility, and Interest-Rate Option Pricing

Cross-Section Performance Reversion

Mathematics of Finance Final Preparation December 19. To be thoroughly prepared for the final exam, you should

Volatility Futures and ETNs: Statistics and Trading

Interest Rate Curves Calibration with Monte-Carlo Simulatio

Introduction Credit risk

Using discounted flexibility values to solve for decision costs in sequential investment policies.

Applied Macro Finance

MASM006 UNIVERSITY OF EXETER SCHOOL OF ENGINEERING, COMPUTER SCIENCE AND MATHEMATICS MATHEMATICAL SCIENCES FINANCIAL MATHEMATICS.

Hedging Under Jump Diffusions with Transaction Costs. Peter Forsyth, Shannon Kennedy, Ken Vetzal University of Waterloo

Theoretical Problems in Credit Portfolio Modeling 2

OPTIMAL PORTFOLIO CONTROL WITH TRADING STRATEGIES OF FINITE

Beyond the Black-Scholes-Merton model

Mathematics in Finance

High Frequency Price Movement Strategy. Adam, Hujia, Samuel, Jorge

ON SOME ASPECTS OF PORTFOLIO MANAGEMENT. Mengrong Kang A THESIS

Internet Appendix to Idiosyncratic Cash Flows and Systematic Risk

Control Improvement for Jump-Diffusion Processes with Applications to Finance

STOCHASTIC VOLATILITY AND OPTION PRICING

LECTURE NOTES 3 ARIEL M. VIALE

Replication and Absence of Arbitrage in Non-Semimartingale Models

BUSM 411: Derivatives and Fixed Income

OPTIMAL RISKY PORTFOLIOS- ASSET ALLOCATIONS. BKM Ch 7

(A note) on co-integration in commodity markets

Statistical Arbitrage in the U.S. Equities Market

Hedging Credit Derivatives in Intensity Based Models

Jaime Frade Dr. Niu Interest rate modeling

Portfolio Optimization. Prof. Daniel P. Palomar

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Final Exam

Markowitz portfolio theory

Covariance Matrix Estimation using an Errors-in-Variables Factor Model with Applications to Portfolio Selection and a Deregulated Electricity Market

How quantitative methods influence and shape finance industry

arxiv:cond-mat/ v2 [cond-mat.str-el] 5 Nov 2002

King s College London

Statistical Arbitrage in the U.S. Equity Market

Dynamic Relative Valuation

Modeling Volatility Risk in Equity Options: a Cross-sectional approach

King s College London

Mean Reverting Asset Trading. Research Topic Presentation CSCI-5551 Grant Meyers

Portfolio Risk Management and Linear Factor Models

B35150 Winter 2014 Quiz Solutions

Market Volatility and Risk Proxies

Applied Macro Finance

Trading Volatility Using Options: a French Case

Illiquidity, Credit risk and Merton s model

TEST OF BOUNDED LOG-NORMAL PROCESS FOR OPTIONS PRICING

Price manipulation in models of the order book

Hedging with Life and General Insurance Products

Statistical Models and Methods for Financial Markets

Stochastic Portfolio Theory Optimization and the Origin of Rule-Based Investing.

APPENDIX TO LECTURE NOTES ON ASSET PRICING AND PORTFOLIO MANAGEMENT. Professor B. Espen Eckbo

The Dispersion Bias. Correcting a large source of error in minimum variance portfolios. Lisa Goldberg Alex Papanicolaou Alex Shkolnik 15 November 2017

Modelling the Term Structure of Hong Kong Inter-Bank Offered Rates (HIBOR)

Lattice (Binomial Trees) Version 1.2

CFA Level II - LOS Changes

Diversifying Risk Parity

The Implied Volatility Index

Optimal Portfolio Liquidation and Macro Hedging

2.1 Mathematical Basis: Risk-Neutral Pricing

CFA Level II - LOS Changes

Discrete Random Variables

Discrete Random Variables

PREPRINT 2007:3. Robust Portfolio Optimization CARL LINDBERG

Pairs Trading. Prof. Daniel P. Palomar. The Hong Kong University of Science and Technology (HKUST)

Log-Robust Portfolio Management

THE MARTINGALE METHOD DEMYSTIFIED

The Black-Scholes Model

Lecture notes on risk management, public policy, and the financial system Credit risk models

Robust portfolio optimization

Short-Time Asymptotic Methods in Financial Mathematics

The Impact of Volatility Estimates in Hedging Effectiveness

Example 1 of econometric analysis: the Market Model

Equilibrium Asset Returns

Improving Returns-Based Style Analysis

Optimal Hedging of Variance Derivatives. John Crosby. Centre for Economic and Financial Studies, Department of Economics, Glasgow University

Optimal trading strategies under arbitrage

Slides for Risk Management Credit Risk

FACTOR MISALIGNMENT AND PORTFOLIO CONSTRUCTION. Jose Menchero

Parametric Inference and Dynamic State Recovery from Option Panels. Nicola Fusari

Transcription:

Risk Control of Mean-Reversion Time in Statistical Arbitrage George Papanicolaou Stanford University CDAR Seminar, UC Berkeley April 6, 8 with Joongyeub Yeo Risk Control of Mean-Reversion Time in Statistical Arbitrage, J. Yeo and G. Papanicolaou, Risk and Decision Analysis, vol 6, 7, p. 63-9. G. Papanicolaou, CDAR-UCB Risk Control /4

Outline: Statistical Arbitrage: It is a widely practiced general form of pairs trading. The Merton problem and pairs trading. Factor model for US equity returns and its implementation 3. Controlling the mean reversion of residuals 4. Trading: optimal allocation and PnL 5. Results with daily S&P5 data (4-4) exactly as they would be obtained in real time 6. Academic studies of statistical arbitrage: Andrew Lo (MIT), and Marco Avellaneda (NYU-Courant) Hedge Funds: An Analytic Perspective, Princeton University Press, Statistical arbitrage in the US equities market, Quantitative Finance, G. Papanicolaou, CDAR-UCB Risk Control /4

Mean-variance optimal allocation portfolio in continuous time This is the now classic portfolio allocation problem formulated by Markowitz (95), and by Merton (97) in continuous time. S t is the price of a risky asset, ds t S t is the (infinitesimal) return assumed to be Brownian motion with drift ds t S t = µdt + σdb t, µ = mean return, σ = volatility Portfolio value is W t and a fraction u t goes into the risky asset while u t into a bank account with interest rate r. This leads to the portfolio returns dw t W t = u t ds t S t + ( u t )rdt = (( u t )r + µu t )dt + u t σdb t The problem is to find the optimal allocation ratio u t that maximizes the expected return E{U(W T )}, where U is a given utility function G. Papanicolaou, CDAR-UCB Risk Control 3/4

Merton portfolio, log utility, zero mean return Optimal allocation ratio u is a constant (fundamental portfolio theorem) and wealth log growth rate are u = µ r (µ r) σ, r + σ (= r + Sharpe ratio squared/) Since d log S t = (µ σ /)dt + σdb t, when µ = σ mean log return of stock is zero. Then u = r ( σ ) and portfolio log growth rate is If r σ r + σ r ( 8 σ ) is small and σ >.5, say, then the constant portfolio allocation strategy will realize considerable growth above the basic interest rate even when the stock or index is not growing at all. The portfolio allocation ratio in stock will never be more than 5%. The best environment for this kind of portfolio strategy is: low real interest rates and highly fluctuating stocks that have no (log) growth. G. Papanicolaou, CDAR-UCB Risk Control 4/4

Pairs trading: Realizing pure fluctuations Suppose P t and Q t are the prices of two stocks (or a stock and an index or ETF) that are strongly correlated, two energy stocks, two bank stocks, a technology stock and a technology sector ETF,... Regress their returns over some time window (sixty days, for example) so that dp t P t = β dq t Q t + X t (Rough) Trading signal: The residual X t. If it is positive then Q is momentarily undervalued relative to P (β > ). Action: Long (buy) Q and short (sell) P. When later X changes sign, switch position... (Invented in the early nineties at Morgan-Stanley (M. Avellaneda)). Compare with Merton portfolio when there is no underlying growth. Pairs trading creates such a context by focusing on the residual. Role of mean reversion of the residual, an issue not visibly present in the Merton problem. Andrew Lo: Betting on mean reversion. G. Papanicolaou, CDAR-UCB Risk Control 5/4

Generalized pairs trading: Factor decomposition We start with the N T matrix of stock returns, R and decompose it into factors: R = LF + U () where F is a p T matrix of factors L is an N p matrix of factor loadings, U is a N T matrix of residuals. Only R is observable, so L, F, and U must be estimated. We obtain factors using principal components analysis (PCA) so only their number p is to be determined. Next, the factor loadings (L) are estimated by multivariate regressions of R on the subspace spanned by F. Finally, the residual (U) is expressed as the remaining parts, after subtracting the estimated common factors from the original returns: Û = R ˆLˆF. () where the hat ( ˆ ) notation indicates estimators, since only R is observable. All this in a time window that is rolling forward in time. G. Papanicolaou, CDAR-UCB Risk Control 6/4

What do we know about the factors? In Economics: The three-factor models for stock returns of French and Fama (99). In Econophysics: How to extract factors using PCA so that the residuals have a correlation matrix that has Marchnko-Pastur (purely random) statistics. Many papers in late nineties. Survey of Bouchaud and Potters (9) presents the ideas and methods in perspective. Some startling empirical facts: The principal (left) singular vector of the return matrix R when ordered by size lines up with the market capitalization of the stocks. The correlations of returns uncover market capitalization and market sectors without knowing anything other than the (time series of the) returns themselves. G. Papanicolaou, CDAR-UCB Risk Control 7/4

The trading strategy and mean reversion of residuals Since the Market has been taken out by going to residuals, it is essential that these residuals have statistical regularity. Market risk has been transformed into risk of residual irregularity over time, that is, risk from anomalies in the assumed mean reverting behavior of the residuals. Investment portfolios are not bets on returns normalized by volatility (Sharpe ratios). They are now bets on mean reversion of residuals (A. Lo). Integrated (summed) residuals are modeled by OU processes (discrete time): dx = κxdt + σdw. Estimate the rate of mean reversion κ over a past (rolling) window using MLE or least squares for each residual. Use residuals for trading only when the estimated mean reversion time is relatively short and the error in the estimation is not too large. This reduces risk substantially! G. Papanicolaou, CDAR-UCB Risk Control 8/4

The long/short trading strategy with normalized residuals as signals: Buy low - sell high When the trading signal hits.5, it is presumed to be over-valued, so we sell the stock to open the short position (green dots), expecting it to go down soon. If it hits.5, we buy the stock to close the short position, which leads us to obtain profits. Similar strategy can be applied to the open/close of long position (red dots). G. Papanicolaou, CDAR-UCB Risk Control 9/4

The trading algorithm: Yes, buy low and sell high. But with what allocation? The dollar amount q i, i =,,..., N (signed for long/short) to invest is determined at each time t by the following: Market-neutrality : Dollar-neutrality : Total Leverage : L ik q i =, for each k=,...,p (3) i q i = (4) i q i = I (5) i Long/Short : sgn(q i ) is given by trading signals. (6) Here L is the factor loading matrix, p is the number of factors, and I is the total leverage level. G. Papanicolaou, CDAR-UCB Risk Control /4

Profit and Loss The performance from trading can be measured by the cumulative profit-and-loss (PnL): N N N E t+ t = E t + E t r t + q it R it t q it r t q i(t+ t) q it ɛ (7) i= E t is the value of the portfolio at time t, r is the reference interest rate, ɛ is for frictional cost (measured in basis points). Note that the fourth term on the right hand side must be zero if the dollar neutral condition is completely satisfied. However, there is numerical tolerance imposed for the feasibility, so we keep that term here. i= i= G. Papanicolaou, CDAR-UCB Risk Control /4

The US equities data used in backtesting SP5 daily data from 5 to 4, split into 5 time periods: 5-7 (before crisis), 7-8 (in crisis), 9- (crisis recovery), - (post crisis I), 3-4 (post crisis II). Use only 378 stocks from the SP5 to have uninterrupted records and avoid necessary preprocessing of data. Use 3- day rolling windows in the analysis Use - PCA factors in generating residulas Avellaneda ( and later) uses a much larger set of equities, beyond the SP5 (the over one billion capitalization group). He also uses ETFs for factors, both real and synthetic ones. G. Papanicolaou, CDAR-UCB Risk Control /4

Risk control with estimated mean reversion time of residuals Since risk has migrated into the mean reverting behavior of the residuals, we need to control it through the estimates of mean reversion times. The error (in MLE) is proportional to the reciprocal square root of the time window. And we cannot use large time windows because information in the residuals becomes stale (non-stationary). Typical tradeoff in applied science (channel estimation, geosciences, biology, genomics, etc): We want to use a large enough data window to have statistical stability but also one that is small enough to have approximately stationary statistical properties. Bottom line: Use residuals that have (a) relatively short mean reversion times and (b) relatively high statistical confidence score: Use diversity by proper selection in a high-dimensional setting. G. Papanicolaou, CDAR-UCB Risk Control 3/4

The effect of mean-reversion control on Sharpe ratios Sharpe ratio.5.5 N=75, 5bp cost, no R screen Controlled Random Low Cap High Cap.5 5 6 7 8 9 3 4 Period The size of portfolio is 75, the transaction cost is 5bp per volume, and R -screening is not applied here. It is clear that the portfolio selection using the estimated mean-reversion speed (red line) performs better than the other portfolios. G. Papanicolaou, CDAR-UCB Risk Control 4/4

Sample results of cumulative PnL for each period; Controlled Cumulative PnL.4.3.. Controlled Portfolio. Cost=5bp 5 6 (SR:.96) 7 8 (SR:.57) 9 (SR:.4) (SR:.59) 3 4 (SR:.73).9 5 6 7 8 9 3 4 Time (year) Cumulative PnL.4.3.. Controlled Portfolio. Cost=bp 5 6 (SR:.9) 7 8 (SR:.4) 9 (SR:.9) (SR:.59) 3 4 (SR:.35).9 5 6 7 8 9 3 4 Time (year) Two transaction costs (of 5bp and bp). Portfolio size is N = 75; Number of factors is 5; Window length is 3 days. R -screening is applied. G. Papanicolaou, CDAR-UCB Risk Control 5/4

The out-of-sample performance from four different portfolios.5 N=75, 5bp cost, no R screen Controlled Random Low Cap High Cap.5 N=75, 5bp cost, R screen Controlled Random Low Cap High Cap Sharpe ratio.5 Sharpe ratio.5.5.5 5 6 7 8 9 3 4 Period 5 6 7 8 9 3 4 Period Number of selected stocks is 75; Transaction costs are 5bp; Left: Without R -screening. Right: With R -screening. The overall level of performance is enhanced by selective trading based on R -screening. G. Papanicolaou, CDAR-UCB Risk Control 6/4

Estimated and realized mean-reversion times (in days) Estimated mean reversion time (days) 3 9 8 7 Estimated mean reversion time Controlled Random Low Cap High Cap Realized mean reversion time (days) 3 9 8 7 Realized mean reversion time Controlled Random Low Cap High Cap 5 6 7 8 9 3 4 Period 5 6 7 8 9 3 4 Period Estimated (Left) and realized (Right). The mean-reversion controlled portfolio has significantly shorter trading intervals compared to other three portfolios. (The values are averaged). G. Papanicolaou, CDAR-UCB Risk Control 7/4

The performance surface as a function of number of factors (p) and window length (T) Low transaction cost (ε=5bp) High transaction cost (ε=5bp) Sharpe ratio 4 Sharpe ratio 4 5 5 Number of factors 8 6 4 Window length 5 5 Number of factors 8 6 4 Window length The performance surface as a function of number of factors (p) and window length (T), for two different transaction costs. G. Papanicolaou, CDAR-UCB Risk Control 8/4

Sharpe ratios with respect to portfolio sizes Performance with 5bp Cost Sharpe Ratios.5.5 Controlled Random Low Cap High Cap 5 5 75 Portfolio size The horizontal axis is portfolio sizes (5, 5, 75, and stocks). The vertical axis is Sharpe ratios. The results are presented with transaction costs of 5bp. Averaged results over all number of factors, estimation windows, and periods. In general, the performance is better with larger portfolio sizes. However, for controlled portfolio, using N = 75 is better than using N =. G. Papanicolaou, CDAR-UCB Risk Control 9/4

Performance with and without optimization for asset allocation.5 Effect of Optimization for Allocations With optimization Without optimization Sharpe ratio.5.5 5 6 7 8 9 3 4 Period Without optimized asset allocations, the trading strategy is unstable and is not market-neutral. G. Papanicolaou, CDAR-UCB Risk Control /4

Dynamics of number of factors that explain different levels of variance Number of factors 3 5 5 5 Window: 6 days % 3% 4% 5% 6% SP5 5 6 7 8 9 3 4 Year 3 Window: days 5 Number of factors 5 5 5 6 7 8 9 3 4 Year Window: 6 days (top) and days (bottom). N = 378 stocks. G. Papanicolaou, CDAR-UCB Risk Control /4

The largest eigenvalue of correlation matrix of normalized original returns with moving window Level of variance explained by by the largest eigenvalu.8.6.4. Dynamics of largest eigenvalue Window: 6 days Window days 5 6 7 8 9 3 4 Year The shorter windows response is more sensitively to the market. G. Papanicolaou, CDAR-UCB Risk Control /4

Statistics of mean reversion time and R values ( factors) 4 4 Distribution of mean reversion time (window=6 days) x 8 7 7 6 6 5 5 freq freq 8 4 3 4 9 4 3 Distribution of mean reversion time (window= days) x x 4 6 8 estimated mean reversion time (days) 4 6 Distribution of R value (window=6 days) 4 x 4 6 8 estimated mean reversion time (days) 4 6 Distribution of R value (window= days) 9 8 8 7 7 6 5 freq freq 6 5 4 4 3 3.4.5.6.7.8.9.4.5.6.7.8 R value R value estimated mean reversion time vs. R estimated mean reversion time vs. R.8.8.7.7 R.9 R.9.9.6.6.5.5.4.4 4 6 8 estimated mean reversion time (days) 4 6 8 estimated mean reversion time (days) Top: Mean-reversion times. Middle: R values. Bottom: Scatter plots for the two. Window is 6 (Left) and days (Right). G. Papanicolaou, CDAR-UCB Risk Control 3/4

Concluding remarks A carefully implemented statistical arbitrage portfolio has remarkably stable returns even in very hostile financial environments. This comes out using real data exactly as they would be used in practice (out of sample). You do not need to know anything about markets or finance. It is a purely quantitative approach. Just signal processing. An innovation in this study: Mean reversion based control and optimal allocation of trading amounts. Otherwise we follow the Avellaneda-Lee analysis. A very large number of deep and largely untouched mathematical problems come up, in the factor analysis, in the mean reversion analysis, and in the optimal allocation algorithm... The basic idea of the trading strategy is simple and intuitive but the details in the implementation are all important and make a huge difference in performance. Statarb is not meant to be used exclusively. It should be used together with traditional (long only) value portfolios and other diversified instruments in a carefully integrated strategy. G. Papanicolaou, CDAR-UCB Risk Control 4/4