Quantitative Techniques Term 2

Similar documents
The relationship between GDP, labor force and health expenditure in European countries

Advanced Econometrics

Labor Force Participation and the Wage Gap Detailed Notes and Code Econometrics 113 Spring 2014

u panel_lecture . sum

Your Name (Please print) Did you agree to take the optional portion of the final exam Yes No. Directions

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998

Final Exam - section 1. Thursday, December hours, 30 minutes

Econ 371 Problem Set #4 Answer Sheet. 6.2 This question asks you to use the results from column (1) in the table on page 213.

Trade Imbalance and Entrepreneurial Activity: A Quantitative Panel Data Analysis

Problem Set 6 ANSWERS

Effect of Health Expenditure on GDP, a Panel Study Based on Pakistan, China, India and Bangladesh

Econometrics is. The estimation of relationships suggested by economic theory

[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA]

Problem Set 9 Heteroskedasticty Answers

AN EMPIRICAL ANALYSIS OF THE RELATIONSHIP BETWEEN FOREIGN TRADE AND ECONOMIC GROWTH IN CENTRAL AFRICA

A COMPARATIVE ANALYSIS OF REAL AND PREDICTED INFLATION CONVERGENCE IN CEE COUNTRIES DURING THE ECONOMIC CRISIS

Example 2.3: CEO Salary and Return on Equity. Salary for ROE = 0. Salary for ROE = 30. Example 2.4: Wage and Education

Cross-country comparison using the ECHP Descriptive statistics and Simple Models. Cheti Nicoletti Institute for Social and Economic Research

tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6}

International Journal of Multidisciplinary Consortium

Professor Brad Jones University of Arizona POL 681, SPRING 2004 INTERACTIONS and STATA: Companion To Lecture Notes on Statistical Interactions

Time Invariant and Time Varying Inefficiency: Airlines Panel Data

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods

EQUITY FORMATION AND FINANCIAL PERFORMANCE OF LISTED DEPOSIT MONEY BANKS IN NIGERIA

İnsan TUNALI 8 November 2018 Econ 511: Econometrics I. ASSIGNMENT 7 STATA Supplement

Violent Conflict and Foreign Direct Investment in Developing Economies: A Panel Data Analysis

Testing the Solow Growth Theory

Time series data: Part 2

You created this PDF from an application that is not licensed to print to novapdf printer (

Model fit assessment via marginal model plots

Impact of Stock Market, Trade and Bank on Economic Growth for Latin American Countries: An Econometrics Approach

Modeling wages of females in the UK

Relation between Income Inequality and Economic Growth

Solutions for Session 5: Linear Models

Cameron ECON 132 (Health Economics): FIRST MIDTERM EXAM (A) Fall 17

ECON Introductory Econometrics. Seminar 4. Stock and Watson Chapter 8

F^3: F tests, Functional Forms and Favorite Coefficient Models

Housing Prices, Macroeconomic Variables and Corruption Index in ASEAN

Effect of Education on Wage Earning

Assignment #5 Solutions: Chapter 14 Q1.

Does a financial crisis affect operating risk? Evidence from Polish listed companies 1

M. Candasamy. KPMG Mauritius - Advisory, Mauritius. Bhavish Jugurnath. University of Mauritius, Moka, Mauritius

Handout seminar 6, ECON4150

Advanced Industrial Organization I Identi cation of Demand Functions

Empirical Methods for Corporate Finance. Panel Data, Fixed Effects, and Standard Errors

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt.

Chapter 11 Part 6. Correlation Continued. LOWESS Regression

CHAPTER 4 ESTIMATES OF RETIREMENT, SOCIAL SECURITY BENEFIT TAKE-UP, AND EARNINGS AFTER AGE 50

The Multivariate Regression Model

Examination of State Lotteries

CHAPTER 5 RESULTS AND ANALYSIS

Chapter 4 Level of Volatility in the Indian Stock Market

THE IMPACT OF BANKING RISKS ON THE CAPITAL OF COMMERCIAL BANKS IN LIBYA

Heteroskedasticity. . reg wage black exper educ married tenure

The Effect of Exchange Rate Risk on Stock Returns in Kenya s Listed Financial Institutions

Syntax Menu Description Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see

Financial Risk, Liquidity Risk and their Effect on the Listed Jordanian Islamic Bank's Performance

1) The Effect of Recent Tax Changes on Taxable Income

Econometric Methods for Valuation Analysis

Jet Fuel-Heating Oil Futures Cross Hedging -Classroom Applications Using Bloomberg Terminal

Does Corporate Finance and Macroeconomics Fundamental Affect Stock Returns? Evidence from Malaysian Services Sector

REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING

Dummy variables 9/22/2015. Are wages different across union/nonunion jobs. Treatment Control Y X X i identifies treatment

Econometric Methods for Valuation Analysis

Modeling Panel Data: Choosing the Correct Strategy. Roberto G. Gutierrez

Labor Market Returns to Two- and Four- Year Colleges. Paper by Kane and Rouse Replicated by Andreas Kraft

Exchange Rate Exposure and Firm-Specific Factors: Evidence from Turkey

THE DETERMINANTS OF FINANCIAL INDUSTRY PROFITABILITY IN MALAYSIA

FOREIGN CURRENCY DERIVATIES AND CORPORATE VALUE: EVIDENCE FROM CHINA

Impact of Household Income on Poverty Levels

Statistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron

ECON Introductory Econometrics Seminar 2, 2015

Technical Documentation for Household Demographics Projection

Module 4 Bivariate Regressions

An Examination of the Impact of the Texas Methodist Foundation Clergy Development Program. on the United Methodist Church in Texas

General Business 706 Midterm #3 November 25, 1997

Exploring the Determinants of Financial Development (Using Panel Data on Developed and Developing Countries)

The Impact of Aid on the Economic Growth of Developing Countries (LDCs) in Sub-Saharan Africa

Table 4. Probit model of union membership. Probit coefficients are presented below. Data from March 2008 Current Population Survey.

Catherine De Vries, Spyros Kosmidis & Andreas Murr

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, Last revised January 13, 2018

Postestimation commands predict Remarks and examples References Also see

Cross- Country Effects of Inflation on National Savings

Example 8.1: Log Wage Equation with Heteroscedasticity-Robust Standard Errors

Two-stage least squares examples. Angrist: Vietnam Draft Lottery Men, Cohorts. Vietnam era service

Analysis on Factors that Affect Stock Prices: A Study on Listed Cement Companies at Dhaka Stock Exchange

STATISTICS 110/201, FALL 2017 Homework #5 Solutions Assigned Mon, November 6, Due Wed, November 15

The relationship between external debt and foreign direct investment in D8 member countries ( )

Final Exam, section 1. Thursday, May hour, 30 minutes

FS January, A CROSS-COUNTRY COMPARISON OF EFFICIENCY OF FIRMS IN THE FOOD INDUSTRY. Yvonne J. Acheampong Michael E.

STATA log file for Time-Varying Covariates (TVC) Duration Model Estimations.

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, Last revised January 10, 2017

EC327: Limited Dependent Variables and Sample Selection Binomial probit: probit

Logistic Regression Analysis

The impact of cigarette excise taxes on beer consumption

Determinants of FII Inflows:India

The Effect of Financial Leverageas a Financial Distress Factor on Financial Performance on Commercial Banks in Kenya

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)

Example 7.1: Hourly Wage Equation Average wage for women

Transcription:

Quantitative Techniques Term 2 Laboratory 7 2 March 2006 Overview The objective of this lab is to: Estimate a cost function for a panel of firms; Calculate returns to scale; Introduce the command cluster to calculate robust standard errors in a panel. Start Stata 9 from the Windows Start menu (Programs, F. Departmental Software, E. Social Science, Stata). Do not update the version that is installed on your machine. Before starting your tasks, make sure that the following windows are visible on your screen and choose their size so that they do not overlap: Review, Variables, Results, Command. Go to www.staff.city.ac.uk/a.banal-estanol/teaching.htm, download Lab7.dta to your directory. The source of the dataset is Greene (2003), Example 13.1. 1 The file contains data on 6 airlines over 15 years on the following variables: c: total cost of production (in $ 000s); q: output, measured as revenue passenger miles it is an index; pf: price of input (i.e. the price of fuel); 2 lf: the load factor is average capacity utilization of the fleet. In addition, you have an indicator for the number of airlines (idairline) and the number of time periods (time). These are needed to identify the structure of the dataset. Our sample is a balanced panel, which means that we have the same number of observations for each firm. The techniques for balanced panels can easily accommodate panels in which the group sizes differ (unbalanced panels). Task 1 Estimation of a cost function for a panel of airlines In this lab, we estimate a Cobb-Douglas cost function which we linearize by taking logarithms: lnc it = β*lnq it +γ*lnpf it + δ*lf it + u i + ε it 1 Greene, W. (2003), Econometric Analysis, 5 th edition, Prentice Hall. 2 Note that the price of fuel differs across airlines due to different mixes of planes and regional variation in supply characteristics. See Greene (2003). 1

where i indicates the firm and t refers to the time period. Note that the intercept u i is firmspecific in the above equation. Finally, the error terms ε it are assumed to be homoscedastic and uncorrelated. Before proceeding to the analysis you need to: Generate the logarithms of the variables, e.g. gen lnc=ln(c) and similarly for output and the price of fuel; Declare the dataset to be a panel. From the Menu bar, you can choose Statistics Longitudinal / Panel Data Analysis Declare dataset to be cross-sectional timeseries. Alternatively, you can type in the Command window: tsset idairline time Note that this is the same command we used for time-series, but in this case we also have the cross-section dimension. 1. Pooled OLS The first estimator we use on the cost function is OLS, which is equivalent to assume that all the firms have the same intercept, i.e. u i = u for all firms. Note that this is a way of exploring the characteristics of the data, but does not exploit the panel structure of our sample.. reg lnc lnq lnpf lf Source SS df MS Number of obs = 90 -------------+------------------------------ F( 3, 86) = 2419.34 Model 112.705452 3 37.5684839 Prob > F = 0.0000 Residual 1.33544153 86.01552839 R-squared = 0.9883 -------------+------------------------------ Adj R-squared = 0.9879 Total 114.040893 89 1.28135835 Root MSE =.12461 lnc Coef. Std. Err. t P> t [95% Conf. Interval] lnq.8827385.0132545 66.60 0.000.8563895.9090876 lnpf.453977.0203042 22.36 0.000.4136136.4943404 lf -1.62751.345302-4.71 0.000-2.313948 -.9410727 _cons 9.516923.2292445 41.51 0.000 9.0612 9.972645 As expected, the coefficients on output and on the price of input are positive. Moreover, capacity utilization is negatively related to costs. 2. Fixed effects (or least squares dummy variable) 3 For this estimator, we go back to the initial formulation, where u i varies for each 3 Explanation of intercept in the fixed effect model + comparison with estimating OLS with dummies: http://www.stata.com/support/faqs/stat/xtreg2.html 2

individual in the sample. Intuitively, this formulation captures the fact that two observations for the same individual are likely to be more similar than observations from different individuals. In addition, the fixed effects estimator takes account of the fact that u i may be correlated with the regressors, i.e. lnq it, lnpf it and LF it. Stata has a command to perform panel-data estimation: xtreg. 4 The syntax is similar to the reg command we have been using for OLS. In order to choose the fixed effects option, we need to specify, fe after the list of variables: 5. xtreg lnc lnq lnpf lf, fe Fixed-effects (within) regression Number of obs = 90 Group variable (i): idairline Number of groups = 6 R-sq: within = 0.9926 Obs per group: min = 15 between = 0.9856 avg = 15.0 overall = 0.9873 max = 15 F(3,81) = 3604.80 corr(u_i, Xb) = -0.3475 Prob > F = 0.0000 lnc Coef. Std. Err. t P> t [95% Conf. Interval] lnq.9192846.0298901 30.76 0.000.8598126.9787565 lnpf.4174918.0151991 27.47 0.000.3872503.4477333 lf -1.070396.20169-5.31 0.000-1.471696 -.6690963 _cons 9.713528.229641 42.30 0.000 9.256614 10.17044 sigma_u.1320775 sigma_e.06010514 rho.82843653 (fraction of variance due to u_i) F test that all u_i=0: F(5, 81) = 57.73 Prob > F = 0.0000 The overall significance of the regression is good, as shown by the F-test on the right-hand side of the output above. This F test does not include the firm-specific effects, but only lnq, lnpf and lf. Individual coefficients are also significant and of the expected sign, broadly in line with the OLS estimates. You can see the estimated fixed effects by typing: predict fixed, u This is the same command we have used for fitted values and residuals, the only difference is the u option instead of residuals. You can choose any name other than fixed. Note that Stata automatically tests the null hypothesis that all the firm-specific intercepts are jointly zero (see last line in output above). In this case, we can reject the null hypothesis that the fixed effects are zero. 4 http://www.stata.com/help.cgi?xtreg 5 The estimation can also be carried out from the Menu bar, Statistics Longitudinal/Panel data Linear regression. 3

3. Random effects The random effects estimator assumes that α i is uncorrelated with the regressors, i.e. lnq it, lnpf it and LF it. We use the same Stata command as before. If no option is specified, then the default estimator for xtreg is random effects. 6. xtreg lnc lnq lnpf lf Random-effects GLS regression Number of obs = 90 Group variable (i): idairline Number of groups = 6 R-sq: within = 0.9925 Obs per group: min = 15 between = 0.9856 avg = 15.0 overall = 0.9876 max = 15 Random effects u_i ~ Gaussian Wald chi2(3) = 11091.33 corr(u_i, X) = 0 (assumed) Prob > chi2 = 0.0000 lnc Coef. Std. Err. z P> z [95% Conf. Interval] lnq.9066805.025625 35.38 0.000.8564565.9569045 lnpf.4227784.0140248 30.15 0.000.3952904.4502665 lf -1.064499.2000703-5.32 0.000-1.456629 -.672368 _cons 9.627909.210164 45.81 0.000 9.215995 10.03982 sigma_u.12488859 sigma_e.06010514 rho.81193816 (fraction of variance due to u_i) The results from the random effects model are similar to the estimates from the fixed effects model. The main difference is in the underlying assumptions. Note that zero correlation between the firm-specific intercepts and the regressors is assumed to be zero. The option predict name_of_variable, u can also be used after the random effects estimator. Summary of Task 1: Estimates from pooled OLS, fixed effects and random effects models have produced similar results for the Cobb-Douglas cost function; The assumption of uncorrelation between the firm-specific effect and the other independent variables is often too strong and fixed effects estimators are commonly used in studies on cost functions; The fixed effects estimator is more appropriate when the model applies only to the firms in the sample, but not for out-of-sample prediction. However, it is costly in terms of degrees of freedom; There are tests to check which assumptions are more appropriate, e.g. Hausman test see Greene (2003), Section 13.4.4. 6 Alternative syntax: xtreg lnc lnq lnpf lnlf, re 4

Further exercise: Try and create a do file with the commands we have used for this task. Create also a log file in text format (reminder: the file extension is.log, otherwise it can only be read in Stata); For those interested in cost functions, the translog cost function is often employed in academic papers focusing on regulated sectors. See Berndt (1996), 7 Chapter 9, or Greene (2003), section 14.3; Similarly to the fixed effects model, you can test whether the classical regression model is appropriate or the random effects estimator is preferable. This is the socalled Breusch and Pagan test for the random effects model. In this example, we can reject the null hypothesis of the classical regression model in favour of the random effects model. See Greene (2003), Section 13.4.3. 8 Task 2 Returns to scale A direct estimate of returns to scale can be obtained from the cost function as: Returns to scale = (1/β) Note that, in a Cobb-Douglas cost function, β is constant and returns to scale do not vary with the level of output. If (1/β) = 1, there are constant returns to scale. Use the estimates obtained above to establish whether the airlines in our sample exhibit constant returns to scale. 1. Pooled OLS From the pooled OLS estimates, the coefficient on output is 0.8827, which indicates increasing returns to scale (1/0.8827) > 1 2. Fixed effects The coefficient on output is 0.9192846, and this indicates increasing returns to scale. 3. Random effects The coefficient on output is 0.9066805, which confirms the finding of increasing returns to scale. Further exercise: If you want to test the null hypothesis of constant returns to scale, this is equivalent to test that β = 1. Type in the Command window: test lnq=1 You can practice with two outputs. In this case, you need to replace β with the sum of the coefficients on both outputs; In a multi-product cost function, you can test the null hypothesis of constant returns to scale by an F-test. In Stata, type in the Command window: test output1 + 7 Berndt, E. (1996), The Practice of Econometrics, 2 nd edition, Addison-Wesley. 8 After estimating a random effects model, type the command xttest0. 5

output2 = 1, where output1 and output2 are the names of the two output variables in your dataset. Task 3 Cluster option We now revisit the panel data estimators with a further complication added. In a sample with a panel structure, residuals often are not only heteroscedastic, but also correlated within each individual/firm. 9 In Stata, standard errors can be adjusted to take this into account using the cluster option. These are the standard errors usually reported in academic papers. The syntax requires the specification of the group variable, e.g. the airline in our example.. xtreg lnc lnq lnpf lf, fe cluster(idairline) Fixed-effects (within) regression Number of obs = 90 Group variable (i): idairline Number of groups = 6 R-sq: within = 0.9926 Obs per group: min = 15 between = 0.9856 avg = 15.0 overall = 0.9873 max = 15 F(3,87) = 647.73 corr(u_i, Xb) = -0.3475 Prob > F = 0.0000 (Std. Err. adjusted for 6 clusters in idairline) Robust lnc Coef. Std. Err. t P> t [95% Conf. Interval] lnq.9192846.0328726 27.97 0.000.8347828 1.003786 lnpf.4174918.0193486 21.58 0.000.3677546.4672289 lf -1.070396.4286714-2.50 0.055-2.172331.0315387 _cons 9.713528.3188995 30.46 0.000 8.893771 10.53329 sigma_u.1320775 sigma_e.06010514 rho.82843653 (fraction of variance due to u_i) Note that the coefficients are the same as the estimates we obtained before, while standard errors are larger. Remember from Term 1 that we do not know in advance whether standard errors in the presence of heteroscedasticity and/or serial correlation are larger or smaller. In the output above, the significance of the coefficient on load factor is reduced once we use robust standard errors. This command is useful not only with panel data, but also if you have groups and subgroups, e.g. the observations for firms within the same area may not be independent. 9 Note that the option robust (i.e. without cluster) takes into account heteroscedasticity but not correlation of residuals across observations. 6