US HFCS Price Forecasting Using Seasonal ARIMA Model

Similar documents
A Predictive Model for Monthly Currency in Circulation in Ghana

Forecasting Exchange Rate between Thai Baht and the US Dollar Using Time Series Analysis

Chapter 4 Level of Volatility in the Indian Stock Market

THE IMPACT OF FINANCIAL CRISIS IN 2008 TO GLOBAL FINANCIAL MARKET: EMPIRICAL RESULT FROM ASIAN

Modeling Volatility of Price of Some Selected Agricultural Products in Ethiopia: ARIMA-GARCH Applications

International Journal of Business and Administration Research Review. Vol.3, Issue.22, April-June Page 1

Determinants of Stock Prices in Ghana

Modeling Volatility Clustering of Bank Index: An Empirical Study of BankNifty

Indian Institute of Management Calcutta. Working Paper Series. WPS No. 797 March Implied Volatility and Predictability of GARCH Models

ANALYSIS OF THE RELATIONSHIP OF STOCK MARKET WITH EXCHANGE RATE AND SPOT GOLD PRICE OF SRI LANKA

Modeling Exchange Rate Volatility using APARCH Models

ARIMA ANALYSIS WITH INTERVENTIONS / OUTLIERS

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2010, Mr. Ruey S. Tsay. Solutions to Midterm

Computer Lab Session 2 ARIMA, ARCH and GARCH Models

Empirical Analysis of the US Swap Curve Gough, O., Juneja, J.A., Nowman, K.B. and Van Dellen, S.

Uncertainty and the Transmission of Fiscal Policy

Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland

The Economic Consequences of Dollar Appreciation for US Manufacturing Investment: A Time-Series Analysis

MODELING NIGERIA S CONSUMER PRICE INDEX USING ARIMA MODEL

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2013, Mr. Ruey S. Tsay. Midterm

INFORMATION EFFICIENCY HYPOTHESIS THE FINANCIAL VOLATILITY IN THE CZECH REPUBLIC CASE

Forecasting Financial Markets. Time Series Analysis

MODELING VOLATILITY OF BSE SECTORAL INDICES

Inflation and inflation uncertainty in Argentina,

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2016, Mr. Ruey S. Tsay. Solutions to Midterm

Oil Price Effects on Exchange Rate and Price Level: The Case of South Korea

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2014, Mr. Ruey S. Tsay. Solutions to Midterm

A Study of Stock Return Distributions of Leading Indian Bank s

Yafu Zhao Department of Economics East Carolina University M.S. Research Paper. Abstract

Exchange Rate Market Efficiency: Across and Within Countries

A Comparative Study of Various Forecasting Techniques in Predicting. BSE S&P Sensex

STAT758. Final Project. Time series analysis of daily exchange rate between the British Pound and the. US dollar (GBP/USD)

Economics 413: Economic Forecast and Analysis Department of Economics, Finance and Legal Studies University of Alabama

Empirical Study on Short-Term Prediction of Shanghai Composite Index Based on ARMA Model

AN EMPIRICAL ANALYSIS OF THE PUBLIC DEBT RELEVANCE TO THE ECONOMIC GROWTH OF THE USA

Forecasting the Philippine Stock Exchange Index using Time Series Analysis Box-Jenkins

Efficiency in the Australian Stock Market, : A Note on Extreme Long-Run Random Walk Behaviour

Modelling Rates of Inflation in Ghana: An Application of Arch Models

Research Article The Volatility of the Index of Shanghai Stock Market Research Based on ARCH and Its Extended Forms

Are Bitcoin Prices Rational Bubbles *

STOCK MARKET EFFICIENCY, NON-LINEARITY AND THIN TRADING EFFECTS IN SOME SELECTED COMPANIES IN GHANA

Some Comments On Fractionally Integration Processes Involving Two Agricultural Commodities

DATABASE AND RESEARCH METHODOLOGY

The Impact of Falling Crude Oil Price on Financial Markets of Advanced East Asian Countries

STRESS TEST MODELLING OF PD RISK PARAMETER UNDER ADVANCED IRB

A Study on Impact of WPI, IIP and M3 on the Performance of Selected Sectoral Indices of BSE

FORECASTING TIME SERIES STOCK RETURNS USING ARIMA: EVIDENCE FROM S&P BSE SENSEX

Modeling and Forecasting Consumer Price Index (Case of Rwanda)

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2009, Mr. Ruey S. Tsay. Solutions to Final Exam

INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN ENGINEERING AND TECHNOLOGY (IJARET)

Analysis of the Relation between Treasury Stock and Common Shares Outstanding

An Empirical Study on Forecasting Potato Prices in Tamil Nadu. National Academy of Agricultural Science (NAAS) Rating : 3. 03

ANALYSIS OF STOCHASTIC PROCESSES: CASE OF AUTOCORRELATION OF EXCHANGE RATES

RE-EXAMINE THE INTER-LINKAGE BETWEEN ECONOMIC GROWTH AND INFLATION:EVIDENCE FROM INDIA

Unemployment and Labor Force Participation in Turkey

Modelling Stock Market Return Volatility: Evidence from India

Univariate Time Series Analysis of Forecasting Asset Prices

Discussion Paper Series No.196. An Empirical Test of the Efficiency Hypothesis on the Renminbi NDF in Hong Kong Market.

Modeling Philippine Stock Exchange Composite Index Using Time Series Analysis

Volume 29, Issue 2. Measuring the external risk in the United Kingdom. Estela Sáenz University of Zaragoza

An Analysis of Spain s Sovereign Debt Risk Premium

Empirical Analysis of Private Investments: The Case of Pakistan

THE UNIVERSITY OF CHICAGO Graduate School of Business Business 41202, Spring Quarter 2003, Mr. Ruey S. Tsay

GARCH Models for Inflation Volatility in Oman

Stock Price Volatility in European & Indian Capital Market: Post-Finance Crisis

Variance clustering. Two motivations, volatility clustering, and implied volatility

Forecasting Short term USD/INR Exchange Rate -ARMA Approach

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Midterm

Unemployment and Labour Force Participation in Italy

Market Risk Management for Financial Institutions Based on GARCH Family Models

A Long Memory Model with Mixed Normal GARCH for US Inflation Data 1

Linkage between Gold and Crude Oil Spot Markets in India-A Cointegration and Causality Analysis

Forecasting Inflation in Kenya Using Arima - Garch Models

THE CORRELATION BETWEEN VALUE ADDED TAX AND ECONOMIC GROWTH IN ROMANIA

Foreign direct investment and profit outflows: a causality analysis for the Brazilian economy. Abstract

Weak Form Efficiency of Gold Prices in the Indian Market

ARCH modeling of the returns of first bank of Nigeria

UNIT ROOT TEST OF SELECTED NON-AGRICULTURAL COMMODITIES AND MACRO ECONOMIC FACTORS IN MULTI COMMODITY EXCHANGE OF INDIA LIMITED

Effects of International Trade On Economic Growth: The Case Study of Pakistan

Conditional Heteroscedasticity

The Demand for Money in Mexico i

Demand For Life Insurance Products In The Upper East Region Of Ghana

Thi-Thanh Phan, Int. Eco. Res, 2016, v7i6, 39 48

Market Integration, Price Discovery, and Volatility in Agricultural Commodity Futures P.Ramasundaram* and Sendhil R**

Lloyds TSB. Derek Hull, John Adam & Alastair Jones

Information Flows Between Eurodollar Spot and Futures Markets *

Volatility in the Indian Financial Market Before, During and After the Global Financial Crisis

Diesen Termin sollten Sie sich nicht entgehen lassen! Sponsor. Inhalt. NEWSLETTER Juli 2014

The impact of the financial crisis on the interbank money markets behavior. Evidence from several CEE transition economies 1

Gloria Gonzalez-Rivera Forecasting For Economics and Business Solutions Manual

VOLATILITY OF SELECT SECTORAL INDICES OF INDIAN STOCK MARKET: A STUDY

MODELING EXCHANGE RATE VOLATILITY OF UZBEK SUM BY USING ARCH FAMILY MODELS

A SEARCH FOR A STABLE LONG RUN MONEY DEMAND FUNCTION FOR THE US

Blame the Discount Factor No Matter What the Fundamentals Are

COINTEGRATION AND MARKET EFFICIENCY: AN APPLICATION TO THE CANADIAN TREASURY BILL MARKET. Soo-Bin Park* Carleton University, Ottawa, Canada K1S 5B6

Forecasting Exchange Rate Between the Ghana Cedi and the US Dollar using Time Series Analysis

Dynamic Linkages between Newly Developed Islamic Equity Style Indices

Hedging effectiveness of European wheat futures markets

Non-stationary volatility with highly anti-persistent increments: An alternative paradigm in volatility modeling?

ECONOMETRIC ANALYSIS OF VALUE ADDED TAX WITH COLOMBO CONSUMER PRICE INDEX IN SRI LANKA. ^UVERSITY OF MORATUWA. SRI IAAIK CflQRATUWA. P.T.

Transcription:

US HFCS Price Forecasting Using Seasonal ARIMA Model Prithviraj Lakkakula Research Assistant Professor Department of Agribusiness and Applied Economics North Dakota State University Email: prithviraj.lakkakula@ndsu.edu Selected Paper prepared for presentation at the Southern Agricultural Economics Association s 2016 Annual Meeting, San Antonio, Texas, February 6-9, 2016 Copyright 2016 by Prithviraj Lakkakula. All rights reserved. Readers may make verbatim copies of this document for non-commercial purposes by any means, provided that this copyright notice appears on all such copies. 1

Abstract This paper focuses on forecasting US high fructose corn syrup (HFCS) prices using a seasonal autoregressive integrated moving average model. We use both monthly and quarterly data to forecast HFCS prices for the 1994 2015 period. We perform the Augmented Dickey Fuller test for ensuring that the HFCS prices are stationary. We use mean absolute error, in sample root mean square error, and out of sample root mean square error for evaluating the predictive accuracy of the models. Based on the out of sample performance, we found that the quarterly model performed well in predicting HFCS prices compared to monthly model. The results will help make better decision concerning the operation of corn-wet milling plant and HFCS production. 2

Introduction: Sugar and HFCS have been major source of caloric sweetener use in the United States. However, since 2002, the total caloric sweetener consumption has consistently decreased, primarily due to a decrease in HFCS per-capita consumption. For example, the per-capita HFCS consumption declined from 37.7 pounds in 2002 to 26.8 pounds in 2014 (USDA, 2016a). 1 On the other hand, HFCS prices have increased from 13.05 cents per pound in 2002 to 22.89 cents per pound in 2014 (USDA, 2016b). In 2015, the HFCS price was at 26.58 cents per pound (USDA, 2016b). Due to a decrease in HFCS utilization, the corn-wet milling plants are at risk of closure. For instance, on September 16, 2014, Cargill announced its decision to shut its corn-wet milling facility in Memphis (US) due to its underutilization and increased operational expenses (Memphis Business Journal, 2014). The motivation for forecasting HFCS prices comes from the current HFCS market situation with the HFCS utilization versus its price dynamics. The results of this study will help make better decision in the operation of corn-wet mill. For example, HFCS price forecast results in conjunction with HFCS export demand could be used to evaluate the amount of HFCS to be produced and the profitability of its production in a corn-wet mill facility. Autoregressive integrated moving average (ARIMA) model is widely used in time series analysis for forecasting univariate time series. Box and Jenkins (1979) proposed ARIMA model and hence these models were popularly known as Box-Jenkins models. 1 It is important to note that HFCS consumption (at consumer level) is different from the HFCS deliveries (at market level). USDA calculates the HFCS consumption at consumer level after accounting for losses at wholesale level, and retail level etc. The HFCS deliveries have decreased from 64.1 pounds per capita in 2002 to 45.6 pounds per capita in 2014. 3

The ARIMA model was later transformed to include seasonal autoregressive and moving average terms to become the seasonal ARIMA model. This study uses seasonal autoregressive integrated moving average (SARIMA) method to forecast HFCS prices. Specifically, we use quarterly and monthly HFCS prices to design both quarterly and monthly models and evaluate the accuracy of the forecasts. Three forecast accuracy measures in sample mean absolute error (MAE), in sample root mean squared error (IRMSE), and out of sample root mean squared error (ORMSE) are used to compare the performance of various SARIMA models. Many argue that the reliability of any estimated model would be analyzed based on the out of sample prediction accuracy (Stock and Watson, 2003). Therefore, in our study, the SARIMA model that has the lowest ORMSE was preferred to forecast HFCS prices. Model: In this section, we represent the specifications of both non-seasonal and seasonal autoregressive integrated moving average models in addition to several test procedures before forecasting HFCS prices. The test procedures include Ljung-Box test for autocorrelation in the residuals and Jarque-Bera test for normality of the residuals of the fitted model. We follow the backshift notation used in Box and Jenkins (1976) to describe the ARIMA model. Non-seasonal autoregressive integrated moving average (ARIMA) model: ARIMA(p,d,q) is represented as follows: (1 ϕ1b ΦpB p ) (1 B) d yt = c + (1 + θ1b + + ϴqB q ) et 4

where p, d, q are number of non-seasonal autoregressive lags, number of non-seasonal differences required to attain stationary, number of non-seasonal moving average lags, respectively. For seasonal ARIMA model we have SARIMA(p,d,q)(P,D,Q)m as shown below (Athanasopoulos, and Hyndman, 2012): where P, D, Q are number of seasonal autoregressive lags, number of seasonal differences required to attain stationary, number of seasonal moving average lags, respectively. For example, a quarterly SARIMA (1,1,1)(1,1,1)4 model is represented as follows (Athanasopoulos, and Hyndman, 2012): where B is backshift operator representing the lag, and ϕ1, Φ, θ, ϴ are the parameters of the model to be estimated. Similarly, the monthly model is represented as follows: 5

Forecast Accuracy Measures: We use three accuracy measures for selecting the best SARIMA model to forecast the HFCS prices. They are Mean Absolute Error: MAE = 1 n n i=1 y i y i = 1 n e n i=1 i In sample Root Mean Square Error: IRMSE = 1 n n i=1 (y i y i ) 2 Out of sample Root Mean Square Error: ORMSE = 1 k k (y j y j ) 2 j=1 where n, k, y i, y i, y j, y j represent number of observation in the training set, number of observation in the test set, forecasts (predicted values) of HFCS prices in the training set used for estimaion, actual HFCS prices of training set, forecasts of HFCS prices in test set, and actual HFCS prices in the test set, respectively. 2 Diagnostic Tests: 1). Unit root test 2 Training set is the set of observations used in the model estimation; while the test set (holdout set) is the set of out of sample observations used for testing the performance of the estimated model. 6

In time series analysis, a stationary (constant mean and constant auto-covariance) data series is a pre-condition in order maintain the stability of the model. There are many tests to perform unit root tests in the time series including Kwiatkowski Phillips Schmidt Shin (KPSS) (1992), and Augmented Dickey-Fuller (ADF) test etc. We used ADF test (Dickey and Fuller, 1979) for testing unit root in the HFCS prices for both quarterly and monthly models. The ADF test for a differenced time series is shown below: 3 y t = α + ρy t 1 + γ 1 y t 1 + + γ L y t L + e t, where y t is the first non-seasonal difference, y t 1 is one period lag of prices, and y t L is the L period lag of differenced prices. If the test statistic of the ρ value is less than 2.9, then we reject the null of nonstationarity. It is important to note that, when a trend variable (for a trend non-differenced data) is included in the ADF test, the critical value of the test statistic would be 3.5. 2). Choice of Lag in Model Selection: We used autocorrelation functions (ACF) and partial autocorrelation functions (PACF) of the stationary HFCS prices for a first guess model. However, various models are selected randomly and evaluated by comparing the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC). The model that gives the lowest AIC and/or BIC value is selected for the estimation. 3). Autocorrelation Test: 3 In quarterly model, HFCS prices are stationary only at the difference of differenced series: 2 y t = α + ρy t 1 + γ 1 2 y t 1 + + γ L 2 y t L + e t. 7

To detect any autocorrelations in the residuals of the fitted model, we carry out Ljung- Box test. The Ljung-Box test for autocorrelation in the residuals is defined as follows (Ljung and Box, 1978): X = n(n + 2) ρ l 2 l l=1 n l where n is the sample size, ρ k is the sample autocorrelation at lag l, and h is the number of lags being tested. The null hypothesis is that the residuals are independently distributed or there is no autocorrelation in the residuals, while the alternate hypothesis is that there is autocorrelation in the residuals. The test statistic is a chi-square distribution with h degrees of freedom. The degrees of freedom (h) is computed based on autoregressive and moving average lags. For example, if we are fitting an SARIMA(p,d,q)(P,D,Q) model, then the degrees of freedom equals [h (p+q+p+q)]. 4). Normality Test: We perform the Jarque-Bera test for normality of the residuals as shown below (Jarque, and Bera, 1980): JB = ( n 2 6 S) + ( n 2 24 (K 3)) where n is the sample size, S is the sample skewness coefficient, K is the sample kurtosis coefficient, and JB is the Jarque-Bera statistic. The JB statistic has a chi-square (χ 2 ) distribution with two degrees of freedom. The null hypothesis of the normality of the residuals is rejected if the JB test statistic is larger than the critical value of the (χ 2 (2) ) distribution. At 5% level of significance, the critical value of the (χ 2 ) distribution is 5.99 (Newbold, Carlson, and Thorne, 2013). Therefore, at p < 0.05, if the JB statistic is less than 5.99, we fail to reject the null hypothesis of normality and hence conclude that the 8

residuals of the fitted model exhibit normal distribution (Newbold, Carlson, and Thorne, 2013). Data: We collected the US HFCS 42 spot price data for the 1994 2015 period from the United States Department of Agriculture s sugar and sweetener yearbook tables (USDA, 2016b). The HFCS 42 spot price data were measured in cents per pound. For both quarterly and monthly models, we used data from 1994 2013 period for the model estimation and the last two year data were used as holdout sample for evaluating out of sample performance of the fitted model. Therefore, in quarterly model, we have 88 observations in total, out of which 80 observations were used in model estimation and remaining 8 observations were used as a holdout sample. Similarly, in monthly model, we have 264 observations in total, out of which 240 observations were used in model estimation, and the holdout set consists of remaining 24 observations. Empirical Results and Discussion: Quarterly Model: In this section, we present and discuss the results comparing both quarterly and monthly models. We also discuss the results of different diagnostic tests before forecasting HFCS 9

prices. As mentioned earlier, the foremost step for forecasting HFCS prices is to test the data for stationarity using ADF test. For quarterly model, we found that the non seasonal first differenced HFCS prices are not stationary in addition to the raw data. Therefore, we transformed the data by taking the difference of differenced HFCS prices to ensure it is stationary. The double differenced HFCS prices are shown in Figure 1 (Panel, A). We performed ADF test to verify that the HFCS prices (after difference of the differenced series) are stationary. The Augmented Dickey Fuller test statistic (ρ ) was found to be 5.56, which is less than the critical value ( 2.9) at 5% level of significance. Figure 1 (Panel A) shows the difference of differenced HFCS prices along with its ACF and PACF. Based on the ACF and PACF, our first guess model was SARIMA(3,2,1)(0,0,0)4. We estimated the first guess model along with the other five models created randomly by changing both the seasonal and non seasonal autoregressive and moving average lags. The AIC and BIC results of the models are shown in Table 1. Based on the results in Table 1, we chose the model SARIMA (0,2,1)(1,0,1)4 as it has the lowest AIC and BIC values. After estimation of the model SARIMA (0,2,1)(1,0,1)4, we performed residual analysis as shown in Figure 2. Based on the residual analysis (figure 2), we conclude that there is no significant autocorrelation in the model residuals as we fail to reject the null hypothesis of the Ljung Box test statistic (p values are high). In order to compare our chosen model with other competing models, we selected a total of 12 models (including the models shown in Table 1) to assess the performance of the out of sample prediction accuracy. Although our chosen model performed well 10

compared to most other models, we found SARIMA(0,2,0)(3,0,2)4 with the least out of sample RMSE as shown in Table 2. We performed the diagnostic tests such as Jarque Bera test for normality of the residuals for the SARIMA(0,2,0)(3,0,2)4. We found the Jarque Bera test statistic to be 4.07, which is less than the critical value of chi square distribution at 5% level of significance (5.99). Hence, we fail to reject the null hypothesis and conclude that the residuals of the SARIMA(0,2,0)(3,0,2)4 exhibit a normal distribution. Finally, we performed Ljung Box test for autocorrelation in the residuals of the SARIMA(0,2,0)(3,0,2)4. We found that the Ljung Box test statistic is 33.06 and we conclude that there was no evidence of any autocorrelation in the residuals of the fitted model After performing residual analysis and the diagnostic tests, we believe that the SARIMA(0,2,0)(3,0,2)4 was correctly specified. Subsequently, it was used for forecasting HFCS prices. The forecasts generated from the SARIMA(0,2,0)(3,0,2)4 are shown in Figure 3 (Panel, A). Monthly Model: Alternatively, the monthly HFCS prices are found to be stationary after taking a non seasonal first difference Figure 1 (Panel, B). The ADF test statistic (ρ ) was found to be 5.79 (less than the critical value, 2.9, at p < 0.05). Figure 1 (Panel B) shows the differenced HFCS prices along with its ACF and PACF. Our first guess model was SARIMA (1,1,1)(1,0,1)12 based on ACF and PACF. Similar to the quarterly model, we estimated the first guess model along with the other 11

five models. Based on the lowest values of AIC and BIC, we chose the model SARIMA (0,1,1)(2,0,1)12 for further estimation. After estimation of the model SARIMA (0,1,1)(2,0,1)12, we performed residual analysis as shown in Figure 2 (Panel B). The residual analysis do not show significant autocorrelation in the model residuals as we fail to reject the null hypothesis of the Ljung Box test statistic (p values are high). Similar to the quarterly model, we selected a total of 12 models to evaluate the performance of the out of sample prediction accuracy. We found SARIMA (3,2,1)(1,0,1)12 with the least out of sample RMSE as shown in Table 2. We performed the diagnostic tests such as Jarque Bera test for normality of the residuals for the SARIMA (3,2,1)(1,0,1)12. We found that the Jarque Bera test statistic to be 1211 (larger than the critical value of chi square distribution at 5% level of significance [5.99]). Hence, we reject the null hypothesis and conclude that the residuals of the SARIMA (3,2,1)(1,0,1)12 model do not exhibit a normal distribution. Therefore, a further investigation is required. Finally, we also performed Ljung Box test for autocorrelation in the residuals of the SARIMA (3,2,1)(1,0,1)12. The Ljung Box test statistic was 5.73 and hence we found no evidence of any autocorrelation in the residuals of the fitted model. As a whole, the quarterly model performed well compared to monthly model. Additionally, the quarterly model also satisfied the normality of the residuals of the fitted model. The forecast results of both quarterly and monthly models show an increasing trend. 12

Historically, increase in HFCS prices during 2002 2014 may be due to an increase in export demand of HFCS especially from Mexico. For instance, US HFCS exports to Mexico have increased from 9,773 MT in 2002 to 1,147,289 MT in 2013 (USDA, 2016c). It will be interesting to see how the HFCS exports respond to a Mexican excise tax policy on beverages introduced in 2014. Recent studies have indicated that on an average the Mexican excise tax policy on beverages has decreased its purchases by 6% (Colchero, Popkin, Rivera & Ng, 2016). HFCS price would depend on the impact of the Mexican tax policy on its beverage sales in the near future and thereby affecting the US HFCS exports to Mexico. Conclusions: We used quarterly and monthly data to forecast HFCS prices for the 1994 2015 period. We used two year (2014 2015) data as a holdout set for evaluating the performance of the models and the remaining data for estimation of the model. Based on the lowest out of sample root mean square error criterion, the quarterly model performed well compared to the monthly model. Quarterly model resulted in the SARIMA (0,2,0)(3,0,2)4 model, while the monthly model resulted in the SARIMA (3,2,1)(1,0,1)12. The forecasts of HFCS prices of both quarterly and monthly models show an increasing trend for two years ahead. Our study did not test for the presence of heteroscedasticity in the residuals of the fitted models. 13

References: Athanasopoulos, G., Hyndman, RJ. 2012. Forecasting Principles and Practice. https://www.otexts.org/fpp Box, GEP, & Jenkins, GM.1970. Time Series Analysis: Forecast and Control. San Francisco: Holden-Day Colchero, MA., Popkin, BM., Rivera, JA., & Ng, SW. 2016. Beverage purchases from stores in Mexico under the excise tax on sugar sweetened beverages: observational study. bmj, 352, h6704 (last accessed on 22 January 2016) http://press.psprings.co.uk/bmj/january/sugartax.pdf Dickey, DA, & Fuller, WA. 1979. Distribution of the estimators for autoregressive time series with a unit root. Journal of the American Statistical Association 74(336a): 427 431. Jarque, CM., & Bera AK. 1980. Efficient test for normality, homoscedasticity and serial independence of regression residuals. Economic Letters, 6: 255 259. Kwiatkowski, D., Phillips, PC, Schmidt, B., & Shin, Y. 1992. Testing the Null Hypothesis of Stationarity against the Alternative of a Unit Root. Journal of Econometrics 54, 159 178. Ljung, GM, & Box, GEP. 1978. On a measure of lack of fit in time series models. Biometrika 65, 297 303. Memphis Business Journal. 2014. http://www.bizjournals.com/memphis/news/2014/09/16/cargill-shutting-downmemphis-corn-mill.html Newbold, P., Carlson, W., & Thorne, B. 2013. Statistics for Business and Economics. 8 th Edition. Published by Pearson. Stock, J.H. and M.W. Watson. 2003. Introduction to Econometrics. Addison Wesley. USDA. 2016a. Sugar and Sweetener Yearbook Table-52, Economic Research Service. http://www.ers.usda.gov/data-products/sugar-and-sweeteners-yearbooktables.aspx 2016b. Sugar and Sweetener Yearbook Table-09, Economic Research Service. http://www.ers.usda.gov/data-products/sugar-and-sweeteners-yearbooktables.aspx 14

2016c. Sugar and Sweetener Yearbook Table-32, Economic Research Service. http://www.ers.usda.gov/data-products/sugar-and-sweeteners-yearbooktables.aspx 15

Panel A. Quarterly Model Panel (B). Monthly Model Figure 1. Panel (A): Double-Differenced HFCS prices of Quarterly Model. Panel (B): Differenced HFCS prices of Monthly Model. 16

Panel (A). Quarterly Model: ARIMA (0,2,1)(1,0,1)4 Panel (B). Monthly Model: ARIMA (0,1,1)(2,0,1)12 Figure 2. Residual Analysis of the respective models 17

Panel (A): Quarterly Forecasts of ARIMA(0,2,0)(3,0,2)4 Panel (B). Monthly forecasts of ARIMA(3,2,1)(1,0,1)12 Figure 3. HFCS Price Forecasts of Two Models 18

Table 1. Identification of ARIMA Models S.No. ARIMA order (p,d,q)(p,d,q)m AIC BIC Quarterly Models 1. (3,2,1)(0,0,0)4 198.12 186.33 2. (3,2,1)(1,0,1)4 198.62 182.12 3. (2,2,2)(1,0,1)4 199.63 183.14 4. (1,2,0)(1,0,1)4 172.25 162.82 5. (0,2,1)(1,0,1)4 202.50 193.07 6. (3,0,0)(2,1,0)4 181.29 167.31 Monthly Models 1. (1,1,1)(1,0,1)12 986.68 969.29 2. (3,1,1)(1,0,1)12 983.36 959.02 3. (2,1,2)(2,0,1)12 984.98 957.17 4. (1,1,0)(2,0,1)12 986.72 969.34 5. (0,1,1)(2,0,1)12 990.21 972.83 6. (3,0,0)(2,1,0)12 891.32 870.74 Notes: ARIMA models shown in italics are our first guess models based on ACFs and PACFs. ARIMA models shown in bold are the models chosen based on lower AIC and BIC values 19

Table 2. Evaluation of Out-of-Sample Performance Across Different Models S.No. ARIMA order (p,d,q)(p,d,q)m Out of Sample RMSE Quarterly Models 1. (3,2,1)(1,0,1)4 5.06 2. (2,2,2)(1,0,1)4 3.93 3. (0,2,0)(3,0,2)4 1.29 4. (0,2,1)(1,0,1)4 4.03 5. (3,0,0)(2,1,0)4 5.81 6. (3,0,0)(2,1,0)4 5.81 7. (3,0,1)(2,1,0)4 5.69 8. (3,0,2)(2,1,0)4 5.69 9. (3,0,1)(0,1,1)4 5.84 10. (3,0,3)(0,1,1)4 5.85 11. (3,0,2)(0,1,1)4 5.49 12. (2,1,5)(0,1,1)4 4.66 Monthly Model 1. (3,2,1)(1,0,1)12 3.98 2. (2,1,2)(2,0,1)12 4.80 3. (0,1,0)(1,0,1)12 4.68 4. (0,1,1)(2,0,1)12 4.81 5. (3,0,0)(2,1,0)12 6.04 6. (3,0,1)(0,1,2)12 5.95 7. (3,0,1)(2,1,0)12 6.02 8. (3,0,2)(2,1,0)12 6.11 9. (3,0,1)(1,1,0)12 6.48 10. (4,0,3)(0,1,1)12 7.55 11. (3,0,2)(0,1,1)12 6.02 12. (2,1,4)(0,1,1)12 5.78 Notes: Models are randomly chosen. 20

Table 3. In sample and out of sample accuracy measures of top two models S.No. ARIMA order MAE IRMSE ORMSE (p,d,q)(p,d,q)m Quarterly Models 1. (0,2,0)(3,0,2)4 0.85 1.23 1.29 2. (0,2,1)(1,0,1)4 0.63 1.07 4.03 Monthly Models 1. (3,2,1)(1,0,1)12 0.256 0.539 3.980 2. (0,1,1)(2,0,1)12 0.238 0.522 4.810 Notes: MAE is Mean Absolute Error, IRMSE is in sample Root Mean Square Error, and ORMSE is out of sample Root Mean Square Error. 21