REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING

Similar documents
WEB APPENDIX 8A 7.1 ( 8.9)

Jacob: The illustrative worksheet shows the values of the simulation parameters in the upper left section (Cells D5:F10). Is this for documentation?

University of Texas at Dallas School of Management. Investment Management Spring Estimation of Systematic and Factor Risks (Due April 1)

SFSU FIN822 Project 1

Gamma Distribution Fitting

Hertha Longo, CSA Matt Wade

Group Assignment I. database, available from the library s website) or national statistics offices. (Extra points if you do.)

DECISION SUPPORT Risk handout. Simulating Spreadsheet models

Jacob: What data do we use? Do we compile paid loss triangles for a line of business?

starting on 5/1/1953 up until 2/1/2017.

ExcelSim 2003 Documentation

Chapter 17. Investment Reports

Name Name. To enter the data manually, go to the StatCrunch website ( and log in (new users must register).

Creating a Rolling Income Statement

ANALYSIS OF THE GDP IN THE REPUBLIC OF MOLDOVA BASED ON MAJOR MACROECONOMIC INDICATORS. Ştefan Cristian CIUCU

PASS Sample Size Software

Further Mathematics 2016 Core: RECURSION AND FINANCIAL MODELLING Chapter 6 Interest and depreciation

Non-Inferiority Tests for the Ratio of Two Means in a 2x2 Cross-Over Design

Homework Assignment Section 3

Point-Biserial and Biserial Correlations

DATABASE AND RESEARCH METHODOLOGY

Tests for the Difference Between Two Linear Regression Intercepts

Prepared By. Handaru Jati, Ph.D. Universitas Negeri Yogyakarta.

Introduction to Time Series Analysis. Madrid, Spain September Case study: exploring a time series and achieving stationarity

Spreadsheet Directions

Web Extension: Continuous Distributions and Estimating Beta with a Calculator

Written by N.Nilgün Çokça. Advance Excel. Part One. Using Excel for Data Analysis

ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA

Graphing Calculator Appendix

XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING

Workshop 1. Descriptive Statistics, Distributions, Sampling and Monte Carlo Simulation. Part I: The Firestone Case 1

Quantitative Techniques Term 2

AP Stats: 3B ~ Least Squares Regression and Residuals. Objectives:

Risk Analysis. å To change Benchmark tickers:

Logistic Regression Analysis

Summary of Statistical Analysis Tools EDAD 5630

<Partner Name> <Partner Product> RSA ARCHER GRC Platform Implementation Guide. 6.3

Revenue Projection Tool Users' Guide. (for version 5)

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

Tests for Two Variances

HandDA program instructions

Tests for One Variance

Descriptive Statistics

MBEJ 1023 Dr. Mehdi Moeinaddini Dept. of Urban & Regional Planning Faculty of Built Environment

Pertmaster - Risk Register Module

Confidence Intervals for an Exponential Lifetime Percentile

DazStat. Introduction. Installation. DazStat is an Excel add-in for Excel 2003 and Excel 2007.

R & R Study. Chapter 254. Introduction. Data Structure

Equivalence Tests for the Ratio of Two Means in a Higher- Order Cross-Over Design

Math 1526 Summer 2000 Session 1

Introduction to Population Modeling

PCGENESIS FINANCIAL ACCOUNTING AND REPORTING (FAR) SYSTEM OPERATIONS GUIDE

Master User Manual. Last Updated: August, Released concurrently with CDM v.1.0

Ford Financial Statement 2016

Data screening, transformations: MRC05

Decision Trees Using TreePlan

Analyzing the Elements of Real GDP in FRED Using Stacking

PCGENESIS FINANCIAL ACCOUNTING AND REPORTING (FAR) SYSTEM OPERATIONS GUIDE

Presented at the 2003 SCEA-ISPA Joint Annual Conference and Training Workshop -

Probability distributions

The Simple Regression Model

Estimation, Analysis and Projection of India s GDP

Upload Budget Item Rates

chapter 2-3 Normal Positive Skewness Negative Skewness

Problem Set 1: Review of Mathematics; Aspects of the Business Cycle

User guide for employers not using our system for assessment

The histogram should resemble the uniform density, the mean should be close to 0.5, and the standard deviation should be close to 1/ 12 =

THE ECONOMICS OF BANK ROBBERIES IN NEW ENGLAND 1. Kimberly A. Leonard, Diane L. Marley & Charlotte A. Senno

Software Tutorial ormal Statistics

Solution to Exercise E5.

The Simple Regression Model

Two-Sample T-Test for Non-Inferiority

Computing interest and composition of functions:

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

Solvency Assessment and Management. Non-Life Underwriting Risk Data Request 2012 User Manual

Chapter 18. Indebtedness

Tests for Two ROC Curves

Non-Inferiority Tests for the Ratio of Two Means

INSURANCE Training Guide

When determining but for sales in a commercial damages case,

Final Exam - section 1. Thursday, December hours, 30 minutes

P&C Rate Data Collection and Management System. PCRDCMS Rate Collection User Manual

BEx Analyzer (Business Explorer Analyzer)

This homework assignment uses the material on pages ( A moving average ).

Non-linearities in Simple Regression

Tests for Two Exponential Means

Preferred Customer Service at U.S. Airways ASSIGNMENT QUESTIONS Exhibit 5 From Frequency From Frequency

M249 Diagnostic Quiz

TAA Scheduling. User s Guide

ESG Yield Curve Calibration. User Guide

You should already have a worksheet with the Basic Plus Plan details in it as well as another plan you have chosen from ehealthinsurance.com.

YEAR 12 Trial Exam Paper FURTHER MATHEMATICS. Written examination 1. Worked solutions

You created this PDF from an application that is not licensed to print to novapdf printer (

One Proportion Superiority by a Margin Tests

Confidence Intervals for Paired Means with Tolerance Probability

Two-Sample T-Test for Superiority by a Margin

Financial Econometrics

A SEARCH FOR A STABLE LONG RUN MONEY DEMAND FUNCTION FOR THE US

Bidding Decision Example

Use the data you collected and plot the points to create scattergrams or scatter plots.

Transcription:

International Civil Aviation Organization 27/8/10 WORKING PAPER REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING Cairo 2 to 4 November 2010 Agenda Item 3 a): Forecasting Methodology (Presented by the Secretariat) Introduction 1. Under this agenda item, among other topics, model development using regression analyses will be discussed in detail. Hands-on exercises will be conducted with the participation of attendees with the objective of achieving a greater understanding of this technique and its application in the development of forecasts. Attached is a step-by-step approach to regression analysis using Microsoft Excel in the form of a user manual. (27 pages)

- 2 - Intentionally blank

- 3 - A STEP-BY-STEP APPROACH TO REGRESSION ANALYSIS USING MS EXCEL

- 4 - Outline 1. Defining the model 2. Preparing the data 3. Using the MS Excel regression analysis tool 4. Regression output and model 5. Testing the model 6. How to calculate the RPKs forecast to the year 2015 by using the equation 7. Sensitivity analysis 1. Defining the model The first step in the development of forecasts using regression analyses is to define the variable to be forecast (explained variable) as well as the explanatory variables. The latter variables are those thought to be driving variations in the forecast variable. In the case of air traffic, the explained variable can be the number of passengers, passenger-kilometres flown, freight tonnes or freight tonne-kilometres flown. The explanatory variables can be Income as measured by gross domestic product (GDP), average prices as measured by yields (or unit revenues), exports, imports, etc. For specific routes, other variables may include frequency of flights, distance, etc. Once the explained and the explanatory variables are defined, the model functional form needs to be chosen. Functional forms can be linear, logarithmic, exponential, power, etc. The choice of the functional form is determined by the nature of the relationship between the explained and the explanatory variables. For this exercise, the forecast (explained) variable is world revenue passenger-kilometres (RPK) carried out by scheduled airlines. The explanatory variables are GDP and yield (YLD). The functional form is a constant elasticity model: RPK = k * GDP α * yield β Where k, α and β are constants. α is a measure of the elasticity of demand to GDP and β is a measure of the elasticity of demand to price. In order to estimate the parameters of this model (k, α and β) using linear regression, the following transformation needs to be performed: Ln (RPK) = Ln(k * GDP α * yield β ) = Ln(k) + αln(gdp) + βln(yield) 2. Preparing the data The historic data for the variables concerned can be yearly, quarterly or monthly depending upon the type of forecast required. Generally, the longer the historic data the better the result. Data should be treated for missing or invalid values.

- 5 - Basic data Table 1 shows the basic data for RPK, GDP and YLD for the 1975 to 2005 period. Table 1 also shows the year-to-year growth in per cent for these variables. The formula to calculate the per cent growth from one year to another in a variable is given below: Absolute value for year 2 Growth rate (in per cent) = ( --------------------------------- - 1 ) * 100 Absolute value for year 1 In general, the effect of GDP on the RPK is direct, meaning that if GDP goes up then the traffic will follow the same trend; however, the effect of YLD on RPK is inverse, meaning that an increase (or positive) in YLD will affect the RPK growth adversely.

- 6 - Basic data World scheduled revenue parssenger-kilometres World real GDP growth (%) and Index World current yield in US Cents and growth (%) TABLE 1 World Real GDP Real GDP Current YLD YLD Year RPK RPK Index Growth US Cents Change (%) (%) (%) 75 697 100.0 1.2 5.2 76 764 9.5 105.6 5.6 5.3 1.92 77 818 7.1 110.1 4.3 5.7 7.55 78 936 14.4 114.5 4.0 5.8 1.75 79 1060 13.2 119.4 4.2 6.2 6.90 1980 1089 2.7 122.0 2.2 7.5 20.97 1981 1119 2.7 124.7 2.2 7.8 4.00 1982 1142 2.1 126.2 1.2 7.6-2.56 1983 1190 4.2 130.1 3.1 7.6 0.00 1984 1278 7.4 136.3 4.8 7.4-2.63 1985 1367 7.0 141.5 3.8 7.3-1.35 1986 1452 6.2 146.7 3.7 7.5 2.74 1987 1589 9.5 152.5 3.9 8 6.67 1988 1705 7.3 159.8 4.8 8.4 5.00 1989 1774 4.0 165.8 3.8 8.7 3.57 1990 1894 6.8 170.8 3 9.2 5.75 1991 1845-2.6 173.7 1.7 9.6 4.35 1992 1929 4.5 177.9 2.4 9.2-4.17 1993 1949 1.1 182.3 2.5 9.2-0.27 1994 2100 7.7 189.5 3.9 9.2 0.00 1995 2248 7.1 196.5 3.7 9.4 1.97 1996 2432 8.2 204.5 4.1 9.1-2.81 1997 2573 5.8 213.1 4.2 8.8-3.75 1998 2628 2.1 219.1 2.8 8.6-1.66 1999 2798 6.5 227.2 3.7 8.3-3.90 2000 3038 8.6 238.3 4.9 8.3-0.25 2001 2950-2.9 244.5 2.6 7.9-3.85 2002 2965 0.5 252.1 3.1 8.0 0.85 2003 3019 1.8 262.4 4.1 8.5 6.25 2004 3445 14.1 276.3 5.3 8.8 3.53 2005 3720 8.0 289.9 4.9 9.1 3.41

- 7 - Treating data for input Data in terms of absolute values or growth rates often comes in current monetary values which includes the effect of inflation. Most econometric modeling, however, is performed on variables expressed in real (constant) terms i.e. excluding the effect of inflation 1. Table 2 shows GDP data in real terms and YLD data in both current and real terms. As previously explained, GDP index is drawn from GDP growth rates which are already available in real terms, whereas the YLD variable, available in current terms, has to be converted into real terms. The conversion procedure requires an additional series to be used as a deflator 2 which could be the consumer price index (CPI) or the producer price index (PPI). A deflator must correspond the same origin as the variable being deflated, e.g., a variable from a country should be deflated by a deflator from the same country. Also, a variable in terms of US dollars should be deflated by a deflator based upon US dollars. Most deflators are available in UN economic publications as well as International Monetary Fund (IMF) publications. There are two ways to convert a variable into real terms: 1) The variable is available in current terms in the form of year-to-year growth rates: The growth rate of the deflator is subtracted from the growth rate of the variable for each year as described b the formula below: Growth (%) of series in real-term for year 1 = (Growth (%) of series in current-term for year 1) (Growth (%) of deflator for the same year).a new year-to-year growth rate variable expressed in real terms is obtained. This variable can be converted into an index by using the procedure described below. This method is applied in Table 2 to calculate the real YLD index. 2) The variable is available in current terms in the form of absolute values: Each value of the variable expressed in current terms, is divided by the value of the deflator for the same year. A new variable expressed in real terms is obtained. Since deflators are often available in per cent growth and to be able to make the division it has to be transformed into an index first. How to make an index from a growth rate series Given the functional form used in this exercise, the use of an index for one or more explanatory variables does not affect the values of the elasticities α and β. The use of indices is sometimes necessary because the data is only available in the form of growth rates and not absolute values. Notes: 1 The overall general upward price movement of goods and services in an economy, usually as measured by the consumer price index (CPI) or the producer price index (PPI). 2 A tool used to convert money into inflation-adjusted money, in order to compare prices over time after factoring out the overall effects of inflation.

- 8 - Note that the GDP variable in Table 1 is provided as an index. This is a series built from year-to-year growth rates of GDP. The world GDP data is available in the form of annual per cent growth rates of GDP volume in constant terms. An index must be built from these growth rates in order to achieve consistency with the other variables expressed in volume. The volume of GDP (in constant dollars for example) can also be used if available. To build an index from per cent growth rates, the value of one of the years (the base year) is chosen to be 100 and the values for the rest of the years can be calculated by the formula below: growth rate for year 2 Index value for year 2 = Index value for year 1 * (1+-----------------------------) 100 In Table 2, the YLD is derived from the aggregate of total world passenger revenues in US $ and divided by the corresponding RPKs. The value obtained is in current US cents and to deflate this value, the industrial countries CPI is considered to be the closest deflator.

- 9 - TABLE 2 Transform yield in current values into real-term using Industrial Countries' CPI Build a Series or an Index from growth rates (%) REAL World Real GDP Real GDP Current YLD YLD CPI YLD GROWTH (%) REAL YLD Year RPK RPK Index Growth US Cents Change IND.CPI (YLD%-CPI%) Index (%) (%) (%) (%) 75 697 100.0 1.2 5.2 11.5 100.00 76 764 9.5 105.6 5.6 5.3 1.92 8.7-6.8 93.22 77 818 7.1 110.1 4.3 5.7 7.55 9-1.5 91.87 78 936 14.4 114.5 4.0 5.8 1.75 7.6-5.8 86.50 79 1060 13.2 119.4 4.2 6.2 6.90 9.6-2.7 84.16 1980 1089 2.7 122.0 2.2 7.5 20.97 12.3 8.7 91.45 1981 1119 2.7 124.7 2.2 7.8 4.00 10.4-6.4 85.60 1982 1142 2.1 126.2 1.2 7.6-2.56 7.7-10.3 76.82 1983 1190 4.2 130.1 3.1 7.6 0.00 5.3-5.3 72.74 1984 1278 7.4 136.3 4.8 7.4-2.63 5-7.6 67.19 1985 1367 7.0 141.5 3.8 7.3-1.35 4.4-5.8 63.33 1986 1452 6.2 146.7 3.7 7.5 2.74 2.6 0.1 63.42 1987 1589 9.5 152.5 3.9 8 6.67 3.1 3.6 65.68 1988 1705 7.3 159.8 4.8 8.4 5.00 3.4 1.6 66.73 1989 1774 4.0 165.8 3.8 8.7 3.57 4.6-1.0 66.04 1990 1894 6.8 170.8 3 9.2 5.75 5.1 0.6 66.47 1991 1845-2.6 173.7 1.7 9.6 4.35 4.2 0.1 66.57 1992 1929 4.5 177.9 2.4 9.2-4.17 3.2-7.4 61.66 1993 1949 1.1 182.3 2.5 9.2-0.27 2.8-3.1 59.77 1994 2100 7.7 189.5 3.9 9.2 0.00 2.3-2.3 58.40 1995 2248 7.1 196.5 3.7 9.4 1.97 2.4-0.4 58.15 1996 2432 8.2 204.5 4.1 9.1-2.81 2.2-5.0 55.24 1997 2573 5.8 213.1 4.2 8.8-3.75 2-5.8 52.06 1998 2628 2.1 219.1 2.8 8.6-1.66 1.5-3.2 50.41 1999 2798 6.5 227.2 3.7 8.3-3.90 1.4-5.3 47.74 2000 3038 8.6 238.3 4.9 8.3-0.25 2.2-2.5 46.57 2001 2950-2.9 244.5 2.6 7.9-3.85 2.1-6.0 43.80 2002 2965 0.5 252.1 3.1 8.0 0.85 1.5-0.6 43.51 2003 3019 1.8 262.4 4.1 8.5 6.25 1.8 4.5 45.45 2004 3445 14.1 276.3 5.3 8.8 3.53 2 1.5 46.15 2005 3720 8.0 289.9 4.9 9.1 3.41 2.3 1.1 46.66

- 10 - Input data Table 3 shows a complete set of data ready for input into the regression. The model equation that will be used is of the form: y = k x 1 α * x 2 β where y = traffic (RPKs) x 1 = Income (GDP) x 2 = Price (YLD) k, α and β are constants. The equation above assumes a non-linear relationship between air traffic demand (RPK), income (GDP) and price (YLD). Since we will run a linear regression using Excel, we need to turn the equation above into a linear form by taking the natural logarithm (Ln) of its both sides, resulting in the following form: Ln (RPK) = k + α * Ln(GDP) + β * Ln(YLD) Table 3 shows the Ln values of RPK, GDP and YLD by simply invoking the LN function using the Excel spreadsheet. Note that in addition, average annual growth rates of each variable for the last 20 years have also been calculated using the formula given below: Formula for calculating average annual growth: Value for last year 1 Average annual growth (%) = ((----------------------------------)^(--------------------------) 1)*100 Value for first year Number of years A DUMMY variable is also given in Table 3, the purpose of which is to offset the effects of abnormal events leading to drastic changes in the variables (on a yearly basis). This variable only assumes the values of 0 and 1. It should be noted that the ICAO Manual on Air Traffic Forecasting (Doc 8991) explains in detail the techniques and uses of econometric regression analyses. This manual can be purchased through the ICAO website or by writing to ICAO Document Sales Unit.

- 11 - TABLE 3 TAKE NATURAL LOGARITHM OF RPK, REAL GDP AND REAL YLD ADD DUMMY VARIABLE AND RUN THE MODEL CALCULATE AVERAGE ANNUAL GROWTH OF HISTORICAL DATA World Real GDP Real GDP Current YLD YLD CPI REAL YLD Year RPK RPK Index Growth US Cents Change IND.CPI YLD%-CPI% Index =LN(RPK) =LN(GDP) =LN(YLD) (%) (%) (%) (%) RPK GDP YLD DUMMY 75 697 100.0 1.2 5.2 11.5 100.00 6.547194 4.60517 4.60517 0 76 764 9.5 105.6 5.6 5.3 1.92 8.7-6.8 93.22 6.638256 4.659658 4.534995 0 77 818 7.1 110.1 4.3 5.7 7.55 9-1.5 91.87 6.707229 4.70176 4.52036 0 78 936 14.4 114.5 4.0 5.8 1.75 7.6-5.8 86.50 6.841991 4.74098 4.460126 0 79 1060 13.2 119.4 4.2 6.2 6.90 9.6-2.7 84.16 6.966247 4.782122 4.432719 1 1980 1089 2.7 122.0 2.2 7.5 20.97 12.3 8.7 91.45 6.993133 4.803884 4.515844 1 1981 1119 2.7 124.7 2.2 7.8 4.00 10.4-6.4 85.60 7.02025 4.825645 4.449704 0 1982 1142 2.1 126.2 1.2 7.6-2.56 7.7-10.3 76.82 7.040705 4.837574 4.341405 0 1983 1190 4.2 130.1 3.1 7.6 0.00 5.3-5.3 72.74 7.081513 4.868103 4.286949 0 1984 1278 7.4 136.3 4.8 7.4-2.63 5-7.6 67.19 7.153189 4.914987 4.207564 0 1985 1367 7.0 141.5 3.8 7.3-1.35 4.4-5.8 63.33 7.220628 4.952282 4.14833 0 1986 1452 6.2 146.7 3.7 7.5 2.74 2.6 0.1 63.42 7.280735 4.988614 4.149727 0 1987 1589 9.5 152.5 3.9 8 6.67 3.1 3.6 65.68 7.371154 5.026873 4.184772 1 1988 1705 7.3 159.8 4.8 8.4 5.00 3.4 1.6 66.73 7.441574 5.073757 4.200645 1 1989 1774 4.0 165.8 3.8 8.7 3.57 4.6-1.0 66.04 7.480825 5.111052 4.190306 0 1990 1894 6.8 170.8 3 9.2 5.75 5.1 0.6 66.47 7.546576 5.140611 4.196757 1 1991 1845-2.6 173.7 1.7 9.6 4.35 4.2 0.1 66.57 7.520461 5.157468 4.198234 1 1992 1929 4.5 177.9 2.4 9.2-4.17 3.2-7.4 61.66 7.564717 5.181185 4.121713 0 1993 1949 1.1 182.3 2.5 9.2-0.27 2.8-3.1 59.77 7.575288 5.205877 4.090555 0 1994 2100 7.7 189.5 3.9 9.2 0.00 2.3-2.3 58.40 7.649662 5.244136 4.067288 0 1995 2248 7.1 196.5 3.7 9.4 1.97 2.4-0.4 58.15 7.717796 5.280468 4.063004 0 1996 2432 8.2 204.5 4.1 9.1-2.81 2.2-5.0 55.24 7.796469 5.32065 4.011598 0 1997 2573 5.8 213.1 4.2 8.8-3.75 2-5.8 52.06 7.852828 5.361792 3.952368 0 1998 2628 2.1 219.1 2.8 8.6-1.66 1.5-3.2 50.41 7.873978 5.389407 3.920207 0 1999 2798 6.5 227.2 3.7 8.3-3.90 1.4-5.3 47.74 7.93666 5.425739 3.865778 0 2000 3038 8.6 238.3 4.9 8.3-0.25 2.2-2.5 46.57 8.018955 5.473576 3.840951 0 2001 2950-2.9 244.5 2.6 7.9-3.85 2.1-6.0 43.80 7.98956 5.499244 3.779586 0 2002 2965 0.5 252.1 3.1 8.0 0.85 1.5-0.6 43.51 7.994632 5.529773 3.773076 0 2003 3019 1.8 262.4 4.1 8.5 6.25 1.8 4.5 45.45 8.012681 5.569955 3.816614 0 2004 3445 14.1 276.3 5.3 8.8 3.53 2 1.5 46.15 8.144679 5.621598 3.831793 0 2005 3720 8.0 289.9 4.9 9.1 3.41 2.3 1.1 46.66 8.221479 5.669436 3.842823 0 1975-2005 AVERAGE ANNUAL 5.7 3.6 1.9-2.5 GROWTH (%)

- 12-3. Using the MS Excel regression analysis tool This section provides a step-by-step approach on how to perform linear regression using MS Excel. Step 1: Check if the Analysis Toolpak is installed The Analysis Toolpak is an Excel add-in (add-in: A supplemental program that adds custom commands or custom features to Microsoft Office) program that is available when you install Microsoft Office or Excel. To use it in Excel, however, you need to load it first. 1. On the Tools menu, click Add-Ins. 2. In the Add-Ins available box, select the check box next to Analysis Toolpak, and then click OK. Tip If Analysis Toolpak is not listed, click Browse to locate it. 3. If you see a message that tells you the Analysis Toolpak is not currently installed on your computer, click Yes to install it. 4. Click Tools on the menu bar. When you load the Analysis Toolpak, the Data Analysis command is added to the Tools menu. Step 2: Entering the data The first step is to have the data presented in Table 3 entered into an Excel spreadsheet as shown in Figure 1 below. Figure 1 Step 3: Selecting the Data Analysis option Then, select Tools on the main menu and Data Analysis from the drop-down list as illustrated in Figure 2.

- 13 - Figure 2 Step 4: Select Regression From the list of data analysis options, select Regression and then click OK as shown in Figure 3. Figure 3 Step 5: Select the input data and parameters After selecting Regression, a new window appears on the screen prompting the user to input the data and parameters required for the regression as shown in Figures 4 and 5.

- 14 - Figure 4 The regression window in Figure 5 has four sections: Input, Output options, Residuals and Normal Probability. The first section is described in this step and the second will be described in the next step. The Input section has the following items: Input Y Range: explained variable. Input X Range: explanatory variables. Labels: Allows variable names (labels) to be included in the selection. Constant is Zero : Allows the user to impose the constant (equation parameter k in our case) to be equal to zero. Confidence Level: Allows the selection of the confidence level to apply to the equation parameters in the regression output. By default, the confidence level is 95%.

- 15 - Figure 5 To select the explained variable (RPK), click on the button to the right in front of Input Y Range textbox. The regression window will be minimized allowing you to make the selection of explained variable. In Figure 6, the explained variable RPK is under column N. When the selection is complete, click on the button on the right side of the minimized regression window. Figure 6

- 16 - Figure 7 To select the explanatory variables (GDP and YLD), click on the button to the right in front of Input X Range textbox. The regression window will be minimized allowing you to make the selection of explanatory variables. In Figure 8, the explanatory variables RPK is under columns O and P. When the selection is complete, click on the button on the right side of the minimized regression window. Figure 8 As can be seen on Figure 9, both the Y and X input ranges are selected. If the selection includes the names of the variables, then the check-box in front of the item Labels should be checked. There is no need to impose Constant is Zero and no need to change the default Confidence Level.

- 17 - Figure 9 Step 6: Select Output option The Output parameters section on the regression window has the following items: Output Range: Allows the selection of regression output location on the same spreadsheet as the input data. New Worksheet Ply: Directs the regression output to a new worksheet ply. New Workbook: Directs the regression output to a new workbook. Click on the radio-button to the left of Output Range to select the range and then click on the button to the right in front of Output Range textbox. The regression window will be minimized allowing you to select the output range. All that is required is to put the cursor in the cell where you want the output of the regression to start. In Figure 10, this cell is S7. When the selection is complete, click on the button on the right side of the minimized regression window.

- 18 - Figure 10 Then, the Regression window will re-appear as shown in Figure 11. Figure 11

- 19 - To perform the regression click on OK. The regression output will be produced as shown on Figure 12. Figure 12

- 20-4. Regression output and model Figure 12 shows the Summary Output generated by the regression. The definition of each term in this summary is as follow: Residuals: Actual (or observed) values minus the predicted (estimated using the regression equation) values of the explained variable. Multiple R: The multiple correlation coefficient which measures the correlation between the actual (observed) and the predicted (estimated using the regression equation) values of the explained variable. It lies between -1 and +1. R-Square: The Square of the multiple correlation coefficient. It is also called coefficient of determination. It measures the goodness of fit of the regression (i.e. the proportion of the variability in the explained variable that is explained by the regression). R-Square lies between zero and 1 and the closer it is to 1, the higher the goodness of fit. Adjusted R-Square: The R-Square adjusted to take into account the number of explanatory variables in the regression (or the degrees of freedom). The adjusted R-Square can be negative, and is always less than or equal to R-Square. Standard Error: The standard error of the regression (it measures the average error/residual due to each observation, corrected for the number of variables included in the regression). df: Degrees of freedom of the regression (equal to the number of explanatory variables) of the residuals as well as the total (the number of observations minus one). SS: Sum of squares (squared deviations from the mean) of the regression of the residuals as well as the total. MS: Mean square (mean of squared deviations from the mean) of the regression of the residuals as well as the total. It is equal to the corresponding SS divided by the corresponding df. F: The F statistic. A measure to assess the overall significance of the regression. It is equal to the ratio of two mean squares: the mean square of the regression divided by the mean square of the residuals. In order for the regression to be significant, the F-value must be greater than a critical value, determined by the regression degrees of freedom and the F statistical distribution. Significance F: The probability that the regression is not significant which is equivalent to the probability that all coefficients are equal to zero. Intercept: The estimated constant term in the regression equation. It can be interpreted as the mean effect on the explained variable of all the variables excluded from the regression. Coefficients: The estimated coefficients of the explanatory variables in the regression equation. t-stat: A value of the Student t statistic to assess the significance of the coefficient. In order for the coefficients to be significant (not equal to zero) in general, the absolute value of t-stat must be 2 or more.

- 21 - P-value: Probability that the corresponding intercept or coefficient is equal to zero. Upper and Lower Confidence Levels: Lower 95% and Upper 95% means that there is a 95% chance that the coefficient computed by regression falls between these limits. The upper and lower confidence levels for each coefficient must not cross 0, which would imply that there is a possibility that a coefficient could assume the value 0. If you change the confidence level in the Regression window, these values will change. From this summary, the coefficients of the regression are: Constant = 2.314 GDP = 1.275 YLD = -0.34 Dummy = 0.077 Hence the equation will be: Ln(RPK) = 2.314 + 1.275*Ln(GDP) - 0.34*Ln(YLD) + 0.77 * (Dummy) The R-Square and the Adjusted R-Square are both close to one signalling a high goodness of fit. The Significance F is very close to zero, implying that the regression is highly significant overall. The t-stat of all the coefficients are much higher than 2, except for the intercept, implying that the coefficients of GDP, YLD and the DUMMY are highly significant.

- 22-5. Testing the model Table 4 below shows how to test the model for its accuracy and if it can be used for producing forecasts. In order to manually assess the goodness to fit, residual values can be analyzed. Using the regression equation, calculate the predicted values for each year between 1975 and 2005 and compare them with the actual values. How to calculate predicted values for the sample data (between 1975 and 2005) The equation obtained from the regression is: Ln(RPK) = 2.314 + 1.275*Ln(GDP) 0.34*Ln(YLD) + 0.077 * (Dummy) By simply plugging in the Ln(GDP) and Ln(YLD) along with Dummy values in the above equation between the years 1975 to 2005, the predicted value for each year can be calculated as shown in Table 4. The actual minus the predicted would be the errors (or the residuals). These errors can then be put in terms of per cent errors. A graph of the residuals will show if they are following any particular pattern. Any cyclical or symmetric patterns would indicate the interference of errors in the computation of Model or what is called Autocorrelation.

- 23 - TABLE 4 USE THE EQUATION FROM MODEL TO CALCULATE RESIDUAL VALUES EXAMINE THE ERROR PERCENT PLOT A GRAPH OF RESIDUALS TO CHECK FOR AUTO-CORRELATION EQUATION: ln (RPK) = 2.314 + 1.275*ln(GDP) - 0.34*ln(YLD) + 0.077*(DUMMY) YEAR ACTUAL PREDICTED ERROR ERR (%) 75 697 750-53 -7.5 76 764 823-59 -7.8 77 818 873-55 -6.7 78 936 937 0 0.0 79 1060 1076-16 -1.5 1980 1089 1076 14 1.2 GRAPH OF RESIDUALS (ERRORS) 1981 1119 1047 72 6.4 1982 1142 1103 39 3.4 1983 1190 1168 22 1.8 200 1984 1278 1274 4 0.3 1985 1367 1363 4 0.3 100 1986 1452 1427 25 1.7 0 1987 1589 1599-10 -0.6-100 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 1988 1705 1689 17 1.0 1989 1774 1646 128 7.2-200 1990 1894 1842 53 2.8-300 1991 1845 1881-35 -1.9 1992 1929 1842 87 4.5-400 1993 1949 1921 28 1.4 1994 2100 2033 67 3.2 1995 2248 2133 115 5.1 1996 2432 2284 148 6.1 1997 2573 2456 117 4.5 1998 2628 2572 56 2.1 1999 2798 2745 53 1.9 2000 3038 2942 96 3.2 2001 2950 3104-154 -5.2 2002 2965 3234-269 -9.1 2003 3019 3354-335 -11.1 2004 3445 3564-119 -3.5 2005 3720 3774-54 -1.5

- 24-6. How to calculate RPKs forecast for the next year using the regression equation The actual RPK data used in the model is until 2005. In order to calculate the RPK forecast value for the year 2006, we need to assume growths of GDP and YLD for the same year. Since the Model uses the GDP and YLD index values, the same index values of GPD and YLD will be projected for the year 2006 and will be used for 2006 RPK projection in the Equation. Table 5 on the next page shows the year-toyear projection of RPK based on selected assumptions. An extract of Table 5 is shown below as an example. World Real GDP Real GDP REAL YLD REAL YLD Year RPK RPK Index Growth Index GROWTH (%) DUMMY (%) (%) 2005 3720 8.0 289.9 4.9 46.7 1.1 0 2006 4037 8.5 304.4 5 46.0-1.5 0 The assumption for GDP growth for 2006 is 5%, hence the GDP 2005 index 289.9 grown by 5% (289.9*1.05) is 304.4 as the 2006 GDP index value. Similarly, the 2006 YLD index value after -1.5 decline is 46.0. We will use these two values in our equation to calculate the 2006 RPK. Please note that the value for DUMMY is zero in all the forecast years. The regression equation is: ln(rpk) = 2.314 + 1.275*ln(GDP) - 0.34*ln(YLD) + 0.077*(DUMMY) This is a logarithmic equation, therefore, the antilog function (or exponential function) will be used to get 2006 RPK, as shown below: EXP(2.314 + 1.275*Log(304.4) - 0.34*Log(46.0) + 0.077*(0)) As a result of this calculation, the RPK forecast for 2006 is 4037 with a growth of 8.5% over 2005. The RPK forecasts for all the years from 2007 to 2015 can be calculated by doing the same operation to GDP and YLD index values and plugging them into the equation, as shown in the next page.

- 25 - TABLE 5 A SAMPLE FORECAST USING FORECAST EQUATION AND ASSUMPTIONS FOR GDP AND YLD 10-YEAR RPK FORECAST ASSUMPTIONS FOR THE NEXT 10 YEARS: GDP = 5 YLD = -1.5 EQUATION: ln (RPK) = 2.314 + 1.275*ln(GDP) - 0.34*ln(YLD) + 0.077*(DUMMY) World Real GDP Real GDP REAL YLD REAL YLD Year RPK RPK Index Growth Index GROWTH (%) DUMMY (%) (%) 2005 3720 8.0 289.9 4.9 46.7 1.1 0 2006 4037 8.5 304.4 5 46.0-1.5 0 2007 4318 7.0 319.6 5 45.3-1.5 0 2008 4619 7.0 335.6 5 44.6-1.5 0 2009 4941 7.0 352.3 5 43.9-1.5 0 2010 5285 7.0 370.0 5 43.3-1.5 0 2011 5654 7.0 388.5 5 42.6-1.5 0 2012 6047 7.0 407.9 5 42.0-1.5 0 2013 6469 7.0 428.3 5 41.3-1.5 0 2014 6919 7.0 449.7 5 40.7-1.5 0 2015 7401 7.0 472.2 5 40.1-1.5 0 2005-2015 Average Annual 7.1 5.0-1.5 Growth (%) PLOT A GRAPH SHOWING HISTORICAL AND FORECAST TRENDS YEAR RPK 1995 2248 1996 2432 1997 2573 1998 2628 1999 2798 2000 3038 2001 2950 2002 2965 2003 3019 2004 3445 2005 3720 2006 4037 2007 4318 2008 4619 2009 4941 2010 5285 2011 5654 2012 6047 2013 6469 2014 6919 2015 7401 8000 6000 4000 2000 0 1995 1997 1999 2001 2003 2005 2007 2009 2011 2013 2015

- 26-7. Sensitivity analysis To obtain the RPK forecast for 2015, an average annual growth rate (%) is assumed for GDP and YLD for the forecast years covering 2005 to 2015 period. Table 6 on the next page shows RPKs calculated for the year 2015 with various YLD and GDP assumptions. It also shows the annual average growth rates between 2005 and 2015 for each set of assumptions. An extract of this table is produced below as an example: GDP ======> AVERAGE ANNUAL GROWTHS, 2005-2015 ASSUMPTIONS 2.5% 3.0% 3.5% 4.0% 4.5% 5.0% YLD ASSUMPTIONS ------------------------ RPK's in 2015 ---------------------------------------- 1.5% 4915 5229 5562 5915 6288 6682 2005-2015 Growth-------> 2.8% 3.5% 4.1% 4.7% 5.4% 6.0% Several combinations of high and low growth rates for the GDP and YLD can be assumed for the forecast period. For example, in one of the options in the sensitivity analysis table above, the assumed growth for the GDP until 2015 is 2.5% and for the YLD it is 1.5%. The GDP and YLD indices for 2005 are then projected by applying these growth rates until 2015, which is a period of 10 years 1 : GDP index value for 2015 = 289.9*1.025^10 = 371.1 YLD index value for 2015 = 46.7*1.015^10 = 54.2 and plugged in the equation as shown below: EXP(2.314+1.275*Log(371.1) - 0.34*Log(54.2) + 0.077*(0)) = 4915 The RPK value for the year 2015 obtained from the equation is 4915, which represents an annual average growth of 2.5% between 2005 and 2015. Similarly, various low and high RPK forecasts for 2015 can be calculated depending upon different low and high assumptions of GDP and YLD as shown in the table on next page. 1 Projected annual growth = (present year value)*(1 + %growth/100)^(number of years)

- 27 - TABLE 6 DEVELOP A "SENSITIVITY ANALYSIS" MATRIX USING THE FORECAST EQUATION ASSUME HIGH AND LOW GROWTH RATES OF GDP AND YLD FOR THE FORECAST PERIOD EQUATION: ln (RPK) = 2.314 + 1.275*ln(GDP) - 0.34*ln(YLD) + 0.077*(DUMMY) ACTUAL RPK 2005= 3720 GDP ======> AVERAGE ANNUAL GROWTHS, 2005-2015 ASSUMPTIONS 2.5% 3.0% 3.5% 4.0% 4.5% 5.0% YLD ASSUMPTIONS ------------------------ RPK's in 2015 ---------------------------------------- 1.5% 4915 5229 5562 5915 6288 6682 2005-2015 Growth-------> 2.8% 3.5% 4.1% 4.7% 5.4% 6.0% 1.0% 4998 5318 5657 6015 6394 6796 AVERAGE 2005-2015 Growth-------> 3.0% 3.6% 4.3% 4.9% 5.6% 6.2% ANNUAL GROWTH, 0.0% 5170 5501 5851 6222 6614 7029 2005-2015 2005-2015 Growth-------> 3.3% 4.0% 4.6% 5.3% 5.9% 6.6% PERIOD -1.0% 5350 5692 6055 6438 6844 7274 2005-2015 Growth-------> 3.7% 4.3% 5.0% 5.6% 6.3% 6.9% -1.5% 5443 5791 6160 6550 6963 7400 2005-2015 Growth-------> 3.9% 4.5% 5.2% 5.8% 6.5% 7.1% -2.0% 5538 5892 6267 6664 7085 7529 2005-2015 Growth-------> 4.1% 4.7% 5.4% 6.0% 6.7% 7.3% END