Department of Economics Working Paper

Similar documents
COINTEGRATION AND MARKET EFFICIENCY: AN APPLICATION TO THE CANADIAN TREASURY BILL MARKET. Soo-Bin Park* Carleton University, Ottawa, Canada K1S 5B6

Discussion Paper Series No.196. An Empirical Test of the Efficiency Hypothesis on the Renminbi NDF in Hong Kong Market.

A Note on the Oil Price Trend and GARCH Shocks

ESTIMATING MONEY DEMAND FUNCTION OF BANGLADESH

Applied Econometrics and International Development. AEID.Vol. 5-3 (2005)

Yafu Zhao Department of Economics East Carolina University M.S. Research Paper. Abstract

Are Bitcoin Prices Rational Bubbles *

AN EMPIRICAL ANALYSIS OF THE PUBLIC DEBT RELEVANCE TO THE ECONOMIC GROWTH OF THE USA

Cointegration and Price Discovery between Equity and Mortgage REITs

Threshold cointegration and nonlinear adjustment between stock prices and dividends

Corresponding author: Gregory C Chow,

Personal income, stock market, and investor psychology

GDP, Share Prices, and Share Returns: Australian and New Zealand Evidence

Exchange Rate Market Efficiency: Across and Within Countries

Volume 29, Issue 2. Measuring the external risk in the United Kingdom. Estela Sáenz University of Zaragoza

Intraday arbitrage opportunities of basis trading in current futures markets: an application of. the threshold autoregressive model.

A Note on the Oil Price Trend and GARCH Shocks

Why the saving rate has been falling in Japan

A1. Relating Level and Slope to Expected Inflation and Output Dynamics

An Empirical Analysis of the Relationship between Macroeconomic Variables and Stock Prices in Bangladesh

Financial Econometrics Notes. Kevin Sheppard University of Oxford

Volume 35, Issue 1. Thai-Ha Le RMIT University (Vietnam Campus)

Performance of Statistical Arbitrage in Future Markets

TESTING THE EXPECTATIONS HYPOTHESIS ON CORPORATE BOND YIELDS. Samih Antoine Azar *

The Dynamics between Government Debt and Economic Growth in South Asia: A Time Series Approach

A study on the long-run benefits of diversification in the stock markets of Greece, the UK and the US

Thi-Thanh Phan, Int. Eco. Res, 2016, v7i6, 39 48

The Effects of Oil Shocks on Turkish Macroeconomic Aggregates

Foreign direct investment and profit outflows: a causality analysis for the Brazilian economy. Abstract

Sectoral Analysis of the Demand for Real Money Balances in Pakistan

Lecture 5. Predictability. Traditional Views of Market Efficiency ( )

The expectations hypothesis (EH) of the term structure implies that the yield. The Expectations Hypothesis of the Term Structure: The Case of Ireland*

Government Tax Revenue, Expenditure, and Debt in Sri Lanka : A Vector Autoregressive Model Analysis

An Empirical Study on the Determinants of Dollarization in Cambodia *

Determinants of Stock Prices in Ghana

Research Article The Volatility of the Index of Shanghai Stock Market Research Based on ARCH and Its Extended Forms

Market Integration, Price Discovery, and Volatility in Agricultural Commodity Futures P.Ramasundaram* and Sendhil R**

Do core inflation measures help forecast inflation? Out-of-sample evidence from French data

Structural Cointegration Analysis of Private and Public Investment

The Demand for Money in China: Evidence from Half a Century

The Relationship between Inflation, Inflation Uncertainty and Output Growth in India

Surasak Choedpasuporn College of Management, Mahidol University. 20 February Abstract

Institute of Economic Research Working Papers. No. 63/2017. Short-Run Elasticity of Substitution Error Correction Model

The source of real and nominal exchange rate fluctuations in Thailand: Real shock or nominal shock

Equity Price Dynamics Before and After the Introduction of the Euro: A Note*

Inflation and inflation uncertainty in Argentina,

RISK SPILLOVER EFFECTS IN THE CZECH FINANCIAL MARKET

Current Account Balances and Output Volatility

Analysis of the Relation between Treasury Stock and Common Shares Outstanding

How do stock prices respond to fundamental shocks?

Asian Economic and Financial Review SOURCES OF EXCHANGE RATE FLUCTUATION IN VIETNAM: AN APPLICATION OF THE SVAR MODEL

Indian Institute of Management Calcutta. Working Paper Series. WPS No. 797 March Implied Volatility and Predictability of GARCH Models

Keywords: China; Globalization; Rate of Return; Stock Markets; Time-varying parameter regression.

IS INFLATION VOLATILITY CORRELATED FOR THE US AND CANADA?

Government expenditure and Economic Growth in MENA Region

Amath 546/Econ 589 Univariate GARCH Models

Blame the Discount Factor No Matter What the Fundamentals Are

Does Commodity Price Index predict Canadian Inflation?

Is the real effective exchange rate biased against the PPP hypothesis?

The relationship between output and unemployment in France and United Kingdom

CAN MONEY SUPPLY PREDICT STOCK PRICES?

ARE EXPORTS AND IMPORTS COINTEGRATED? EVIDENCE FROM NINE MENA COUNTRIES* HUSEIN, Jamal ** Abstract

Multivariate Causal Estimates of Dividend Yields, Price Earning Ratio and Expected Stock Returns: Experience from Malaysia

Trends in currency s return

A SEARCH FOR A STABLE LONG RUN MONEY DEMAND FUNCTION FOR THE US

Optimal Hedge Ratio and Hedging Effectiveness of Stock Index Futures Evidence from India

Jet Fuel-Heating Oil Futures Cross Hedging -Classroom Applications Using Bloomberg Terminal

The Comovements Along the Term Structure of Oil Forwards in Periods of High and Low Volatility: How Tight Are They?

The Balassa-Samuelson Effect and The MEVA G10 FX Model

Modeling Exchange Rate Volatility using APARCH Models

Cointegration Tests and the Long-Run Purchasing Power Parity: Examination of Six Currencies in Asia

International journal of Science Commerce and Humanities Volume No 2 No 1 January 2014

Long Run Money Neutrality: The Case of Guatemala

Investigating Causal Relationship between Indian and American Stock Markets , Tamilnadu, India

Savings Investment Correlation in Developing Countries: A Challenge to the Coakley-Rocha Findings

Most recent studies of long-term interest rates have emphasized term

DATABASE AND RESEARCH METHODOLOGY

Tax or Spend, What Causes What? Reconsidering Taiwan s Experience

University of Pretoria Department of Economics Working Paper Series

Long-run Consumption Risks in Assets Returns: Evidence from Economic Divisions

1 Volatility Definition and Estimation

Unemployment and Labor Force Participation in Turkey

The Relationship between Foreign Direct Investment and Economic Development An Empirical Analysis of Shanghai 's Data Based on

What the hell statistical arbitrage is?

CFA Level II - LOS Changes

Risk-Adjusted Futures and Intermeeting Moves

How can saving deposit rate and Hang Seng Index affect housing prices : an empirical study in Hong Kong market

Empirical Study on Short-Term Prediction of Shanghai Composite Index Based on ARMA Model

Forecasting Stock Index Futures Price Volatility: Linear vs. Nonlinear Models

Travel Hysteresis in the Brazilian Current Account

Dividend, investment and the direction of causality

The Random Walk Hypothesis in Emerging Stock Market-Evidence from Nonlinear Fourier Unit Root Test

MONEY, PRICES AND THE EXCHANGE RATE: EVIDENCE FROM FOUR OECD COUNTRIES

Examination on the Relationship between OVX and Crude Oil Price with Kalman Filter

THE PREDICTABILITY OF THE SOCIALLY RESPONSIBLE INVESTMENT INDEX: A NEW TMDCC APPROACH

Recent analysis of the leverage effect for the main index on the Warsaw Stock Exchange

INFORMATION EFFICIENCY HYPOTHESIS THE FINANCIAL VOLATILITY IN THE CZECH REPUBLIC CASE

A joint Initiative of Ludwig-Maximilians-Universität and Ifo Institute for Economic Research

Prerequisites for modeling price and return data series for the Bucharest Stock Exchange

Relationship between Inflation and Unemployment in India: Vector Error Correction Model Approach

Transcription:

Department of Economics Working Paper Rethinking Cointegration and the Expectation Hypothesis of the Term Structure Jing Li Miami University George Davis Miami University August 2014 Working Paper # - 2014-05

Rethinking Cointegration and the Expectation Hypothesis of the Term Structure Jing Li and George Davis Miami University Abstract We show that the expectations hypothesis implies that as the time spread between long and short term yields increases, it becomes increasingly less likely to find that the spread is stationary; even if the two yields are cointegrated. In the data, we find that as the time spread increases, rejection rates for cointegration also increase in line with this theoretical prediction. Our results suggests that cointegration tests may not be appropriate tests of the expectations hypothesis. Keywords: Cointegration, Term Structure, Expectation Hypothesis JEL Classification: E32 Department of Economics, Miami University, 800 E. High Street, Oxford, OH 45056. Phone: 513-529- 4393. Fax: 513-529-6992. Email: lij14@miamioh.edu. Corresponding Author. Department of Economics, Miami University, 800 E. High Street, Oxford, OH 45056. Phone: 513-529-2836. Fax: 513-529-6992. Email: davisgk@miamioh.edu. 1

Introduction Expectations of future short term interest rates have long played an important role in explanations of the determination of long term interest rates. In its purest form, the long term interest rate is just the average of current and expected future short term rates. The expectations hypothesis (EH) generalizes this idea to include a risk or liquidity premium in the long term rate. Empirically, both long term and short term rates are not stationary, and are typically found to be integrated of order one. If the risk or liquidity premiums are stationary, then the EH suggests that short term and long term interest rates may be cointegrated. In a seminal paper, Hall et al. (1992) use the EH to motivate the modeling of the term structure of Treasury rates as a cointegrated system. However, evidence on this hypothesis is mixed at best. 1 In this paper, we show that the EH implies that the spread between long and short term rates will converge to a random walk as the term to maturity on the long rate approaches infinity. A somewhat surprising implication of this result is that the EH predicts it will be more difficult to find cointegration as the time spread between rates increases. More specifically, if the spread is used to test for cointegration, then as the time spread between rates increases, it will become increasingly more difficult to reject the null of no cointegration, and so make it more difficult to find the traditional cointegration evidence used to support the EH. Our analysis begins by formally proving that as the time spread between rates goes to infinity the term spread converges to an infinite order moving average (MA) process identical to the MA representation of a random walk. So, in the limit, the spread will not be stationary, and the null of no cointegration will not be rejected. The practical importance of this limiting proposition (LP) is an empirical question. To 1 The literature on term structure is vast. The works of Campbell and Shiller (1991), Anderson (1997), Tzavalis and Wickens (1997), and Sarno and Thornton (2003) are closely related to ours. 2

examine this question, we investigate whether cointegration defined by the spread is more likely to be found in spreads where the time span between the long and the short rates is relatively brief. In particular, we test for bivariate cointegration between combinations of the federal funds rate, the 1-year, 3-year, 5-year, and the 10-year treasury rates. In order to allow for the instability emphasized by Park and Hahn (1999), we carry out a rolling cointegration analysis for each pair of interest rates. We also allow for asymmetric adjustment considered by Enders and Siklos (2001), Clarida et al. (2006) and Sarno et al. (2007). We find that as the time span between rates increase, we are less likely to reject the null of no cointegration. We next examine an error correction model of interest rates. If the interest rates are cointegrated, then the coefficient on the error correction term should be significantly different than zero. The LP predicts that it will become increasingly less likely to find such a significant coefficient as the time spread increases, and we find support for this implication of the proposition. The EH predicts that the cointegrating vector will be (1,-1). We utilize the dynamic OLS estimator of Stock and Watson (1993) to estimate the cointegrating vectors, and to test the null that the vector is consistent with the prediction of the EH. In general, it is not. Given this result, we relax the assumption that the cointegrating vector is (1,-1) and reexamine the implications of our theoretical result. In this more general setting, we continue to find support for the the LP, but the support is weaker. 3

Expectation Hypothesis Let i m,t be the yield to maturity of a m-period pure discount bond that is bought at time t and matures m periods ahead. The weakest form of EH can be written as 2 i m,t i 1,t = 1 m [ m 1 i=1 ] i E t i 1,t+j + r m, (1) where E t denotes the expectation conditional on the information at time t, and r m represents the (assumed) constant risk premia. The proposition below concerns the limiting behavior j=1 of the spread i m,t i 1,t when the maturity m rises. Limiting Proposition (LP): Assuming j=1 E t i 1,t+j <. As m, lim i m,t i 1,t = m E t i 1,t+j + r m (2) j=1 The assumption j=1 E t i 1,t+j < is innocuous. It allows the spread i m,t i 1,t to increase with m, but requires it to remain finite. If even very long term interest rates are finite, this condition will hold. The proof of this proposition is in the appendix. If we assume, consistent with most of the literature and our empirical results below, that E t i 1,t+j are white noise, LP implies that in the limit the spread i m,t i 1,t has an infinite-order moving average MA( ) representation. 3 Moreover, this moving average has a constant coefficient on each term equal to one. A random walk process has the same MA representation, and therefore in the limit the spread behaves like a random walk process, a nonstationary process. 4 Intuitively, the nonstationarity is introduced by the double summations in (1). In a single summation of stationary terms, the law of large number implies that multiplying the 2 See equation (4) of Hall et al., (1992) or equation (3) of Clarida et al. (2006) for a detailed derivation. 3 In general, E t i 1,t+j can possess a high degree of temporal dependence and moderate heteroskedasticity, see Phillips and Durlauf (1986) for instance. 4 The easiest way to see nonstationary is noticing that the variance of j=1 E t i 1,t+j in (2) is infinite. 4

sum by 1 yields stationarity. However, in the double summations 1 does not decay to zero m m sufficiently, and the necessary scalar for stationarity becomes 1 m 3/2. When m is large, i m,t i 1,t may be represented as the sum of many summed stationary components, and, as a result, the spread behaves like an integrated, that is, a summed process. LP implies that if we use the spread as the error correction term, then as m rises, the probability of rejecting the null hypothesis of no cointegration will decrease. In the limit, this probability becomes zero. LP may seriously weaken the link between cointegration and EH, and, at least for long time spreads, render cointegration tests of EH inappropriate. Indeed, the failure to find cointegration is predicted by EH in this case. Data In this section, we investigate the link between the time spread and the results from cointegration tests. We use monthly observations of the Treasury constant maturity 10-year (l 10 ), 5-year (l 5 ), 3-year (l 3 ), and 1-year (l 1 ) yields to maturity, and the effective federal funds rate (s f ). The data were downloaded from Federal Reserve Economic Data. 5 The full sample consists of 371 observations from October 1982 to August 2013. 6 Table 1 contains the summary statistics of our data. As we would expect, the yield to maturity increases with the term to maturity. Table 1 also reports results from augmented Dickey-Fuller tests. We find that all series are nonstationary and integrated of order one. Panel A of Figure 1 plots three of the series: the federal funds rate s f, 3-year rate l 3, and the 10-year rate l 10. Overall, the three series move together. The strength of co-movement of s f and l 3, however, seems stronger than that for s f and l 10. In certain periods, such as after year 2009, the link between s f and l 10 becomes particularly weak. 5 http://research.stlouisfed.org/fred2/ 6 Our sample starts in October 1982 since it was then the Federal Reserve ended the policy of targeting monetary aggregates, and resumed targeting interest rates. 5

Rolling Regression Analysis Throughout this paper, y t stands for the long term rate, and x t the short term rate. Note that the classification of long and short term rates are relative. l 3 is the long term rate relative to s f, but the short term rate relative to l 10. The spread, denoted by sp, is given by sp t = y t x t (3) The spread between the ten year Treasury rate and the federal funds rate is plotted in Panel B of Figure 1. Tables 2 and 3 present the result of a rolling regression analysis. We adopt the rolling approach for two reasons. First, we allow for instability, perhaps caused by structural changes, in the long-run equilibrium. Second, this method generates enough regression results to obtain meaningful rejection frequency of the cointegration test. Each window contains 96 consecutive observations, 8 years of data. 7 This window size is long enough for the cointegration test to have adequate power, but not too long to be contaminated by structural changes. The windows are partially overlapping: the first window uses the first 96 observations; the second window moves one period ahead, and uses the next 96 observations, and so on. Cointegration Tests We start with the traditional (linear) cointegration test, which in effect is the unit root test applied to the interest spread sp t. The testing regression is sp t = γ 0 + γ 1 sp t 1 + p c i sp t i + v t. (4) i=1 7 We increase the window size to 120 (10 years of data), and the results are qualitatively unchanged. 6

The null hypothesis is that the bivariate system of (x t, y t ) is not cointegrated, or equivalently, sp t is nonstationary. The null hypothesis is rejected at the 5% level if the t statistic of γ 1 is less than -2.86. 8 The number of lags p is chosen by Ng and Perron (1995) method. 9 The linear cointegration test imposes the restrictive assumption that the adjustment speed, measured by γ 1, is constant. The threshold unit root test of Enders and Granger (1998) and the threshold cointegration test of Enders and Siklos (2001) relax this restriction, and are based on the indicator-augmented regression sp t = γ 0 + γ 1 sp t 1 I(sp t 1 < τ) + γ 2 sp t 1 I(sp t 1 τ) + p c i sp t i + ṽ t. (5) i=1 where the indicator function I(.) = 1 if the event in the parenthesis is true and 0 otherwise. Now the adjustment speeds measured by γ 1 and γ 2 are possibly asymmetric (i.e., γ 1 γ 2 ). The null hypothesis of no cointegration becomes H 0 : γ 1 = 0, γ 2 = 0. We try two threshold values: τ = 0, and τ = sample mean of sp t 1 (unreported). The results are largely the same. The percentages of times that the hypothesis of no cointegration cannot be rejected are reported in Table 2. For the spread between the one year Treasury rate and the federal funds rate, the linear cointegration test fails to reject the null of no cointegration 80% of the time. For the spread between the three year Treasury and the federal funds rate, the rejection rate increases to 90%. LP predicts that this rejection rate increases as the time spread increases and Table 2 reveals this pattern. The threshold test also produces this pattern. 8 Because the cointegrating vector (1,-1) is predetermined, the cointegration test follows the Dickey-Fuller distribution asymptotically. 9 A well known fact is that the outcome of the unit root test and cointegration test can be sensitive to the lag number p. Accordingly, we redo the tests use the popular method of minimizing the Akaike information criterion (AIC) to select p. We find the same pattern. 7

Error Correction Models An alternative method to investigate the cointegration of short term and long term rates is to estimate the error correction model. We estimate y t = α 01 + α 1sp t 1 + p c 1i y t i + i=1 p d 1i x t i + v 1t, (6) i=1 According to LP the spread will behave increasingly like a nonstationary process as the time spread rises. This property, in turn, implies that it will become increasingly less likely to find a significant coefficient on α1 in (6) as the time spread increases. Table 3 reports the rejection rates for the significance of the error correction coefficient. 10 For spreads with the federal funds rate, the rejection rate increases from 56% for the one year spread, to 85% for the three year spread, and then settles in the low to mid 90% range for the longer time spreads. For the Treasury spreads, with one exception, the rejection rates are all in the low to mid 90% range. Cointegrating Vectors The cointegrating vector specified by the spread sp t is restricted to be (1, 1). We can test this restriction by considering the cointegrating regression y t = β 0 + β 1 x t + e t. (7) We estimate the cointegrating regression (7) using rolling windows. Panel A of Figure 2 plots the series of ˆβ 1 for differing long term rates when the short term rate is the federal funds rate. There are two main findings from Panel A. First, as the time spread increases, ˆβ 1 gets smaller, and farther away from one. This increasing deviation from one is consistent 10 The t statistic of α1 follows a nonstandard distribution under the null hypothesis, so we ran simulations to obtain the critical values. 8

with LP. When ˆβ 1 is unity, ê t, which is the OLS residual, reflects fluctuations in the term spread. As the spread behaves more and more like nonstationary series, then ˆβ 1 will move away from unity since OLS estimation will, by its nature, attempt to maximize the chance of stationarity in the residual. Second, there are clearly two periods that show evidence of instability, 1982-1984 and 1990-1994. It is beyond the scope of this paper to investigate the source of this instability, but we do note that both periods occur near the trough of a business cycle. Also, the earlier period follows shortly after the Volcker disinflation, while the later period follows the creation of the Resolution Trust Corporation. These conclusions carry over to Panel B where term spreads between Treasury securities are plotted. Panels C and D of Figure 2 plot the serial-correlation adjusted t statistics for the null hypothesis H 0 : β 1 = 1 using the dynamic OLS estimator of Stock and Watson (1993). Stock and Watson propose the following regression p y t = β 0 + β 1 x t + k i x t i + u t. (8) i= p This regression includes the leads and lags of x t as additional regressors so that a modified t statistic asymptotically follows the normal distribution. Here, we let p = 4, and estimate the long run variance (spectral density at frequency zero) of u t by running an auxiliary autoregression for the residual û t. 11 For most spreads and for most periods, the t statistics are outside the (-1.96, 1.96) bound. Nearly all t statistics are less than -1.96 after 1994, while between 1982 and 1985, ˆβ 1 is insignificantly different from unity. 11 See equation (19.3.27) of Hamilton (1994) for details. We also try various lag p for the dynamic OLS regression (8), and we re-estimate the long run variance of u t using the method of Newey and West (1987) with truncation at 4 or 8. There is no qualitative change. 9

More General Cointegration Given the results of the previous section, we relax the restriction that the cointegrating vector is (1,-1). We can rationalize this generalization by dropping the assumption that the risk premium is constant. Instead, we assume that this premium may be non-stationary and correlated with the interest rate. If this is the case, then the spread is no longer the correct variable of interest and we must allow for a more general form of cointegration. From (7) we now estimate the error correction term as ê t = y t ˆβ 0 ˆβ 1 x t (9) We plot the estimated error correction term ê t and the spread between the ten year Treasury and the federal funds rate in Panel B of Figure 1. There are a number of periods, for example after 2009 and in the early part of the sample, where the spread and ê t clearly differ. Table 4 reports the percentage of times that we fail to reject the null of no cointegration in the more general setting applying the Engle-Granger and Enders-Siklos tests to ê t. Comparing Table 4 to Table 2, it is interesting to note that the rejection rates are smaller for nearly every spread, and more importantly, the pattern of increasing percentages as the time spread increases becomes less clear. Table 5 reports the results for estimating the more general error correction model 12 y t = α 02 + α 2ê t 1 + p c2 i y t i + i=1 p d2 i x t i + v2 t. (10) i=1 Again, the story is much the same for ê t as it is for the spread. Once we allow for a potentially nonstationary risk premium, we find less evidence against EH. Both (6) and (10) assume that the error correction term is stable. More specifically, that 12 The critical values are simulated under the null of no cointegration. 10

the error correction term is always the spread or always ê t. The varying values of ˆβ 1 shown in Figure 2 suggest that sometimes the true error correction term is the spread and sometimes it is ê t. This is possible if the degree to which the markets of federal funds and Treasury bonds are segmented varies over time, or, if the risk premium is sometimes stationary and sometimes nonstationary. 13 We propose an augmented error correction model (AECM) that allows this possibility. It is given by y t = α 03 + α 1 sp t 1 + α 2 ê t 1 + p c3 i y t i + i=1 p d3 i x t i + v3 t (11) i=1 This new model has two merits. First, it compares the significance of sp and ê directly in a single model. Second, it allows for a time-varying error correction term. That is, it can accommodate the possibility that in one period sp is significant, and so is the appropriate error correction term, and in another period ê is significant, and so it is the appropriate correction term. 14 Table 6 reports the results for the augmented model. When the federal funds rate is the short rate, 66% of the time the augmented model failed to reject the null that the coefficient on the error correction term equaled zero. For a three year time spread and above, the model failed to reject the same null more than 80% of the time. For Treasury spreads, the rejection rates are high for all time spreads and relatively stable. Notice that all the percentages in Table 6 are less than those in Table 5, and with only two exceptions, are less than those in Table 3. The augmented model provides the strongest evidence of cointegration. Put differently, the traditional error correction model may fail to find cointegration due to the restriction that the error correction term is time-invariant. It is interesting to compare α 1 to α 2 as it tells us which error correction term is driving the adjustment. Figure 3 plots the series of t values of α 1 (Panel A) and α 2 (Panel B) in the rolling augmented error correction model (11), where x t = s f. It is clear that α 1 tends to be 13 For instance, Panel A of Figure 2 suggests that the risk premium may be nonstationary in early 1990s. 14 If ê and sp are similar, then both can be significant. 11

significant when y t = l 1 ; whereas α 2 be significant when y t = l 10. This finding is consistent with Figure 2. Overall, Figures 2 and 3 indicate that, when the difference between maturity is large, the adjustment can be driven mostly by the estimated error correction, which differs significantly from the spread. Conclusion Cointegration tests have often been used as tests of the EH. We show that as the time spread between long term and short term rates increases, the yield spread behaves more and more like a random walk. As a result, it becomes less likely to reject the null hypothesis of no cointegration as the time spread increases. This implies that even if the EH holds, we may still fail to find cointegration, especially as the time spread between rates increases. In short, declining rejection rates of cointegration tests is consistent with the expectation hypothesis of the term structure of interest rates. We show, using a variety of approaches, that it is more likely to fail to reject the null of no cointegration as the time spread increases when the error correction term is restricted to equal the spread. If we relax the assumption that the risk premium is always stationary, the EH allows for a more general error correction term. We propose a new augmented error correction model, in which the spread and estimated error correction term are both included. We can interpret this model as allowing for the stationarity of the risk premium to vary over time. We find some support for this model in the data, and it suggests the implications of relaxing the assumption that the stationarity of the risk premium is time-invariant be investigated in the future. 12

Appendix: Proof of Limiting Proposition Rearranging (1) leads to i m,t i 1,t = 1 m [ m 1 i=1 ] i E t i 1,t+j + r m (12) j=1 = 1 m [E t i 1,t+1 + (E t i 1,t+1 + E t i 1,t+2 ) +...] + r m (13) = 1 m [(m 1)E t i 1,t+1 + (m 2)E t i 1,t+2 +... + E t i 1,t+m 1 ] + r m (14) m 1 ( = 1 j ) E t i 1,t+j + r m (15) m = j=1 m 1 j=1 E t i 1,t+j 1 m m 1 j=1 je t i 1,t+j + r m (16) By assumption m 1 j=1 E t i 1,t+j is finite, and so is a Cauchy sequence. It follows that (e.g., page 438 of Hayashi (2000)) lim m m 1 1 m j=1 je t i 1,t+j = 0. (17) The proof is done after taking limits on both sides of (16). 13

References Anderson, H. M. (1997), Transaction costs and non-linear adjustment towards equilibrium in the US Treasury bill market, Oxford Bulletin of Economics and Statistics, 59, 465 484. Campbell, J. Y., and Shiller, R. J. (1991), Yield spreads and interest rate movements: A bird s eye view, Review of Economic Studies, 58, 495 514. Clarida, R. H., Sarno, L., Taylor, M. P., and Valente, G. (2006), The role of asymmetries and regime shifts in the term structure of interest rates, Journal of Business, 79, 1193 1224. Enders, W., and Granger, C.W.J. (1998), Unit root tests and asymmetric adjustment with an example using the term structure of interest rates, Journal of Business & Economic Statistics, 16, 304 311. Enders, W., and Siklos, P. L. (2001), Cointegration and threshold adjustment, Journal of Business & Economic Statistics, 19, 166 176. Hall, A. D., Anderson, H. M., and Granger, C. W. J. (1992), A cointegration analysis of treasury bill yields, Review of Economics and Statistics, 74, 116 126. Hamilton, J. D. (1994), Time Series Analysis, Princeton: Princeton University Press. Hayashi, F. (2000), Econometrics, Princeton: Princeton University Press. Newey, W. K., and West, K. D. (1987), A simple, positive semi-definite heteoskedasticity and auto-correlation consistent covariance matrix, Econometrica, 55, 703 708. Ng, S., and Perron, P. (1995), Unit root tests in ARMA models with data-dependent methods for the selection of the truncation lag, Journal of the American Statistical Association, 90, 268 281. 14

Park, J. Y., and Hahn, S. B. (1999), Cointegrating regressions with time varying coefficients, Econometric Theory, 15, 664703. Phillips, P. C. B., and Durlauf, S. (1986), Multiple time series regression with integrated processes, Review of Economic Studies, 53, 473 496. Sarno, L., and Thornton, DL. (2003), The dynamic relationship between the federal funds rate and the Treasury bill rate: An empirical investigation, Journal of Banking and Finance, 27, 1079 1110. Sarno, L., Thornton, D. L., and Valente, G. (2007), The empirical failure of the expectations hypothesis of the term structure of bond yields, Journal of Financial and Quantitative Analysis, 42, 81 100. Stock, J. H., and Watson, M. W. (1993), A simple estimator of cointegrating vectors in higher order integrated systems, Econometrica, 61, 783 820. Tzavalis, E., and Wickens, M.R. (1997), Explaining the failures of the term spread model of the rational expectation hypothesis of the term structure, Journal of Money, Credit and Banking, 29, 364 380. 15

Table 1: Summary of Full Sample (1982:10-2013:8) Series n Mean SE Min Max ADF ADF s f 371 4.58 2.98 0.07 11.64-1.69 5.12 l 1 371 4.69 2.95 0.10 12.08-1.84 4.73 l 3 371 5.29 2.99 0.33 13.18-1.25 6.32 l 5 371 5.67 2.86 0.62 13.48-1.31 6.69 l 10 371 6.17 2.62 1.53 13.56-1.87 5.79 Note: a. n is the sample size; Mean is the sample mean; SE is the standard error; Min is the minimum; Max is the maximum. b. ADF is the augmented Dickey-Fuller t test applied to the level. c. ADF is the augmented Dickey-Fuller t test applied to the first difference. d. For the ADF test, the number of lag p is chosen by the method of Ng and Perron (1995). All testing regressions includes a constant, and no trend. e. denotes rejecting the null hypothesis of nonstationarity at the 5% level. Table 2: % Fail to Reject No Cointegration Hypothesis Using sp t FF rate is short rate a Treasury spreads b Spread T1-FF T3-FF T5-FF T10-FF T3-T1 T5-T3 T5-T1 T10-T5 T10-T3 T10-T1 Linear 80 90 95 95 90 93 99 100 95 95 Threshold 68 88 96 95 90 94 95 94 94 94 Note: a. Each cell below the label reports the test outcome when the spread is between a Treasury yield and the federal funds rate. T1 is the one year, T3 is the three year rate, and so on. b. Each cell below the label reports the test outcome for differing Treasury spreads. The spread T3-T1 is the spread between the three year and one year Treasury rates, T5-T3 is the spread between the five year and three year rate, and so on. 16

Table 3: % Fail to Reject Zero Error Correction Term Hypothesis Using sp t FF rate is short rate Treasury spreads T1-FF T3-FF T5-FF T10-FF T3-T1 T5-T3 T5-T1 T10-T5 T10-T3 T10-T1 56 85 94 93 93 92 94 81 90 93 Note: a.the number in each cell is the rate that the hypothesis H 0 : α 1 = 0 fails to be rejected at 5% level in regression (6). Table 4: % Fail to Reject No Cointegration Hypothesis Using ê t FF rate is short rate Treasury spreads Spread T1-FF T3-FF T5-FF T10-FF T3-T1 T5-T3 T5-T1 T10-T5 T10-T3 T10-T1 Linear a 67 53 59 76 73 87 75 98 91 89 Threshold b 74 62 71 80 82 88 83 99 92 85 Note: a. Each cell is the percent that the null hypothesis of no cointegration cannot be rejected after applying the Engle-Granger Test to the estimated error correction term ê t. b. Each cell is the percent that the null hypothesis of no cointegration cannot be rejected after applying the Enders-Siklos Test to the estimated error correction term ê t. Table 5: % Fail to Reject Zero Error Correction Term Hypothesis Using ê t FF rate is short rate Treasury spreads T1-FF T3-FF T5-FF T10-FF T3-T1 T5-T3 T5-T1 T10-T5 T10-T3 T10-T1 70 93 94 93 93 79 92 86 90 95 Note: a. The number in each cell is the rate that the hypothesis H 0 : α 2 = 0 fails to be rejected at 5% level in regression (10). Table 6: % Fail to Reject Zero Error Correction Term Hypothesis Augmented Error Correction Model FF rate is short rate Treasury spreads T1-FF T3-FF T5-FF T10-FF T3-T1 T5-T3 T5-T1 T10-T5 T10-T3 T10-T1 66 85 86 84 84 55 81 71 76 86 Note: a. Each cell is the rate that neither the hypothesis H 0 : α 1 = 0 nor the hypothesis H 0 : α 2 = 0 is rejected at 5% level in regression (11). 17

Figure 1: Time Series Plot 14 Panel A: SF, L3, and L10 3 Panel B: Two Error Correction Terms 12 2 10 1 8 0 6-1 4 2-2 0 1982 1985 1988 1991 1994 1997 2000 2003 2006 2009 2012 Time -3 1982 1985 1988 1991 1994 1997 2000 2003 2006 2009 2012 Time SF L3 L10 SP EHAT Note: a. Panel A plots the federal funds rate s f (solid line), the 3-year treasury bond rate l 10 (dotted line) and the 10-year treasury bond rate l 10 (dash line). b. Panel B plots the centered interest rate spread sp = l 10 s f (solid line), and the OLS residual of regressing l 10 on a constant and s f (dotted line). 18

Figure 2: Estimated Beta1 1.2 Panel A: X=SF 1.1 Panel B: X=L1 1.0 1.0 0.9 0.8 0.8 0.6 0.7 Beta1 0.4 Beta1 0.6 0.2 0.5 0.4 0.0 0.3-0.2 1982 1984 1986 1988 1990 1992 1994 1996 1998 2000 2002 2004 Time 0.2 1982 1984 1986 1988 1990 1992 1994 1996 1998 2000 2002 2004 Time Y=L1 Y=L3 Y=L5 Y=L10 Y=L3 Y=L5 Y=L10 2.5 Panel C: X=SF 2.5 Panel D: X=L1 0.0 0.0-2.5-2.5 T test Beta1=1-5.0-7.5-10.0 T test Beta1=1-5.0-7.5-10.0-12.5-12.5-15.0-15.0-17.5 1982 1984 1986 1988 1990 1992 1994 1996 1998 2000 2002 2004 Time -17.5 1982 1984 1986 1988 1990 1992 1994 1996 1998 2000 2002 2004 Time Y=L1 Y=L3 Y=L5 Y=L10 Y=L3 Y=L5 Y=L10 Note: a. Panel A plots the series of estimated β 1 in (7) using x t = s f, y t = l 1, l 3, l 5, l 10. The regression is fitted for the rolling window that starts at t 0 and ends at t 0 + w 1, where the window size is w = 96, and t 0 = (1,..., 371 + 1 w). b. Panel B plots the series of estimated β 1 in (7) using x t = l 1, y t = l 3, l 5, l 10. c. Panel C plots the serial-correlation adjusted t statistic for H 0 : β 1 = 1 in (8) using x t = s f. d. Panel D plots the serial-correlation adjusted t statistic for H 0 : β 1 = 1 in (8) using x t = l 1. 19

Figure 3: T Values of Alphas in Rolling AECMs 6 Panel A: Alpha1 2 Panel B: Alpha2 5 1 4 0 3 T Value of Alpha1 2 1 T Value of Alpha2-1 -2-3 0-1 -4-2 1983 1986 1989 1992 1995 1998 2001 2004-5 1983 1986 1989 1992 1995 1998 2001 2004 Time Time Y=L1 Y=L3 Y=L10 Y=L1 Y=L3 Y=L10 Note: a. Panel A plots the series of t value of α 1 in (11) using x t = s f, y t = l 1, l 3, l 10. The regression is fitted for the rolling window that starts at t 0 and ends at t 0 + w 1, where the window size is w = 96, and t 0 = (1,..., 371 + 1 w). b. Panel B plots the series of t value of α 2 in (11). 20