Problem Set 6 ANSWERS

Similar documents
Time series data: Part 2

Final Exam - section 1. Thursday, December hours, 30 minutes

Econ 371 Problem Set #4 Answer Sheet. 6.2 This question asks you to use the results from column (1) in the table on page 213.

Your Name (Please print) Did you agree to take the optional portion of the final exam Yes No. Directions

Dummy variables 9/22/2015. Are wages different across union/nonunion jobs. Treatment Control Y X X i identifies treatment

Quantitative Techniques Term 2

u panel_lecture . sum

You created this PDF from an application that is not licensed to print to novapdf printer (

Advanced Econometrics

Cameron ECON 132 (Health Economics): FIRST MIDTERM EXAM (A) Fall 17

The relationship between GDP, labor force and health expenditure in European countries

. tsset year, yearly time variable: year, 1959 to 1994 delta: 1 year. . reg lhous ldpi lrealp

Two-stage least squares examples. Angrist: Vietnam Draft Lottery Men, Cohorts. Vietnam era service

The Multivariate Regression Model

Example 2.3: CEO Salary and Return on Equity. Salary for ROE = 0. Salary for ROE = 30. Example 2.4: Wage and Education

Heteroskedasticity. . reg wage black exper educ married tenure

Labor Force Participation and the Wage Gap Detailed Notes and Code Econometrics 113 Spring 2014

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt.

Problem Set 9 Heteroskedasticty Answers

İnsan TUNALI 8 November 2018 Econ 511: Econometrics I. ASSIGNMENT 7 STATA Supplement

tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6}

Econometrics is. The estimation of relationships suggested by economic theory

Solutions for Session 5: Linear Models

Example 7.1: Hourly Wage Equation Average wage for women

Handout seminar 6, ECON4150

Labor Market Returns to Two- and Four- Year Colleges. Paper by Kane and Rouse Replicated by Andreas Kraft

[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA]

Professor Brad Jones University of Arizona POL 681, SPRING 2004 INTERACTIONS and STATA: Companion To Lecture Notes on Statistical Interactions

ECON Introductory Econometrics. Seminar 4. Stock and Watson Chapter 8

Advanced Industrial Organization I Identi cation of Demand Functions

Example 8.1: Log Wage Equation with Heteroscedasticity-Robust Standard Errors

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998

1) The Effect of Recent Tax Changes on Taxable Income

Assignment #5 Solutions: Chapter 14 Q1.

Determinants of FII Inflows:India

F^3: F tests, Functional Forms and Favorite Coefficient Models

Chapter 6 Part 3 October 21, Bootstrapping

ECON Introductory Econometrics Seminar 2, 2015

The impact of cigarette excise taxes on beer consumption

*1A. Basic Descriptive Statistics sum housereg drive elecbill affidavit witness adddoc income male age literacy educ occup cityyears if control==1

STATA Program for OLS cps87_or.do

Appendixes Appendix 1 Data of Dependent Variables and Independent Variables Period

Modeling wages of females in the UK

Sean Howard Econometrics Final Project Paper. An Analysis of the Determinants and Factors of Physical Education Attendance in the Fourth Quarter

Effect of Education on Wage Earning

Chapter 11 Part 6. Correlation Continued. LOWESS Regression

Does Globalization Improve Quality of Life?

CHAPTER 7 MULTIPLE REGRESSION

An analysis of the relationship between economic development and demographic characteristics in the United States

. ********** OUTPUT FILE: CARD & KRUEGER (1994)***********.. * STATA 10.0 CODE. * copyright C 2008 by Tito Boeri & Jan van Ours. * "THE ECONOMICS OF

Impact of Household Income on Poverty Levels

Model fit assessment via marginal model plots

STATA log file for Time-Varying Covariates (TVC) Duration Model Estimations.

Relation between Income Inequality and Economic Growth

Technical Documentation for Household Demographics Projection

Openness and Inflation

Impact of Minimum Wage and Government Ideology on Unemployment Rates: The Case of Post-Communist Romania

EC327: Limited Dependent Variables and Sample Selection Binomial probit: probit

Logistic Regression Analysis

Testing the Solow Growth Theory

An Examination of the Impact of the Texas Methodist Foundation Clergy Development Program. on the United Methodist Church in Texas

LAMPIRAN PERHITUNGAN EVIEWS

A COMPARATIVE ANALYSIS OF REAL AND PREDICTED INFLATION CONVERGENCE IN CEE COUNTRIES DURING THE ECONOMIC CRISIS

Impact of Stock Market, Trade and Bank on Economic Growth for Latin American Countries: An Econometrics Approach

The SAS System 11:03 Monday, November 11,

Export and Import Regressions on 2009Q1 preliminary release data Menzie Chinn, 23 June 2009 ( )

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods

6 Multiple Regression

Question 1a 1b 1c 1d 1e 1f 2a 2b 2c 2d 3a 3b 3c 3d M ult:choice Points

Don t worry one bit about multicollinearity, because at the end of the day, you're going to be working with a favorite coefficient model.

Visualisierung von Nicht-Linearität bzw. Heteroskedastizität

Jet Fuel-Heating Oil Futures Cross Hedging -Classroom Applications Using Bloomberg Terminal

International Journal of Multidisciplinary Consortium

Hasil Common Effect Model

Trade Imbalance and Entrepreneurial Activity: A Quantitative Panel Data Analysis

The Predictive Power of Financial Blogs

Final Exam, section 1. Thursday, May hour, 30 minutes

Cameron ECON 132 (Health Economics): SECOND MIDTERM EXAM (A) Fall 17

Syntax Menu Description Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see

(ii) Give the name of the California website used to find the various insurance plans offered under the Affordable care Act (Obamacare).

Prof. Dr. Ben Jann. University of Bern, Institute of Sociology, Fabrikstrasse 8, CH-3012 Bern

Effect of Health Expenditure on GDP, a Panel Study Based on Pakistan, China, India and Bangladesh

Consumption Volatility, Liquidity Constraints and Household Welfare

Gloria Gonzalez-Rivera Forecasting For Economics and Business Solutions Manual

SAS Simple Linear Regression Example

Description Remarks and examples References Also see

Review questions for Multinomial Logit/Probit, Tobit, Heckit, Quantile Regressions

Econometric Methods for Valuation Analysis

Methods for A Time Series Approach to Estimating Excess Mortality Rates in Puerto Rico, Post Maria 1 Menzie Chinn 2 August 10, 2018 Procedure:

The Trend of the Gender Wage Gap Over the Business Cycle

Limited Dependent Variables

Final Exam, section 2. Tuesday, December hour, 30 minutes

Postestimation commands predict Remarks and examples References Also see

Regression Discontinuity Design

Module 9: Single-level and Multilevel Models for Ordinal Responses. Stata Practical 1

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)

REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING

Brief Sketch of Solutions: Tutorial 2. 2) graphs. 3) unit root tests

Per Capita Housing Starts: Forecasting and the Effects of Interest Rate

ECON 5010 Solutions to Problem Set #3

Transcription:

Economics 20 Part I. Problem Set 6 ANSWERS Prof. Patricia M. Anderson The first 5 questions are based on the following information: Suppose a researcher is interested in the effect of class attendance on college performance, and plans to estimate the following model: colgpa = β 0 + β 1 hsgpa + β 2 ACT + β 3 skipped + u, where colgpa is current GPA, hsgpa is high school GPA, ACT is score on a college entrance exam and skipped is the average number of classes skipped per week. The researcher believes that a component of u is the student s inherent laziness. 1. OLS estimates of this model will most likely a) be biased and inconsistent, because skipped is endogenous The researcher believes that inherent laziness is a component of u. Assuming that lazier students skip more classes, skipped would be correlated with u, and thus OLS will be biased and inconsistent. 2. The researcher has information on the distance in miles students live from class (dist) and whether they have any classes at 8am (early), and regresses skipped on dist, early, hsgpa, and ACT. He then saves the residuals, uhat, from this regression. If he is planning on doing IV, he should b) test for the joint significance of dist and early The researcher has just run the first stage regression. For early and dist to be valid instruments, they must be correlated with skipped. So, they need to be jointly significant in this first stage regression. 3. The researcher next obtains the following estimates: Source SS df MS Number of obs = 141 ---------+------------------------------ F( 4, 136) = 10.38 Model 4.53802712 4 1.13450678 Residual 14.8680723 136.109324061 R-squared = 0.2338 ---------+------------------------------ Adj R-squared = 0.2113 Total 19.4060994 140.138614996 Root MSE =.33064 colgpa Coef. Std. Err. t P> t [95% Conf. Interval] ---------+-------------------------------------------------------------------- hsgpa.4135611.0943636 4.383 0.000.2269514.6001708 ACT.0144984.0106536 1.361 0.176 -.0065698.0355666 skipped -.0796302.0308488-2.581 0.011 -.1406356 -.0186249 uhat -.0122316.0578108-0.212 0.833 -.1265559.1020928 _cons 1.385228.3333431 4.156 0.000.7260219 2.044434 We can conclude that: d) all of the above This is a Hausman test for the endogeneity of skipped. Since uhat is not significant, we conclude that IV and OLS are not significantly different. This implies that we reject the null that skipped is endogenous, so the OLS estimates are consistent. Additionally, we can interpret these IV estimates, which imply that skipping reduces GPA by about.08 points. 4. The researcher estimates the model using IV, saves the residuals (uhativ) and then obtains: Source SS df MS Number of obs = 141 ---------+------------------------------ F( 4, 136) = 0.21 Model.09339178 4.023347945 Prob > F = 0.9298 Residual 14.7815227 136.108687667 R-squared = 0.0063

---------+------------------------------ Adj R-squared = -0.0229 Total 14.8749145 140.106249389 Root MSE =.32968 uhativ Coef. Std. Err. t P> t [95% Conf. Interval] ---------+-------------------------------------------------------------------- hsgpa.003987.0935295 0.043 0.966 -.1809732.1889472 ACT.0003169.0104884 0.030 0.976 -.0204246.0210584 dist.0163524.0185409 0.882 0.379 -.0203132.0530181 early -.111073.1233414-0.901 0.369 -.3549881.1328422 _cons -.0592899.3391357-0.175 0.861 -.7299514.6113716 The above estimates would imply that c) students probably can t completely choose where to live and whether to have 8am classes or not This is an overid test of whether our instruments are truly exogenous. To carry out the test we form the nr 2 = 141*.0063=.8883 which is very small. The critical value for significance at the 10% level is 2.71 for a chi-square distribution with 1 degree of freedom. Thus, we can t reject the null that dist and early are exogenous (and hence unrelated to inherent laziness). If students could completely choose where to live and whether they have 8am classes, we might expect them to be related to laziness. 5. Turning to the IV estimates the researcher must have obtained in question 4, we can predict that d) the coefficient on skipped will definitely be exactly -.0796302 We know that 2SLS is the same as IV, except for the standard errors. We also know that the Hausman test in 3 is a form of 2SLS. So, when the researcher did IV, he would get exactly the same estimates. 6. Which of the following are true about time-series estimation? c) Seasonality is not an issue when using annual time series observations With time series data, we do not have a random sample and can t just assume that observations are independent. In fact, most time series processes are correlated over time. There is no problem using a trending variable as a dependent variable we may need to be careful with interpretation, and often will want to include a trend as an independent variable. Since seasonality refers to differences across months or quarters or such, it is impossible to have seasonality in data collected at the year level. Part II. Stata Problems. 1. Before starting Stata, I opened smoke.xls and chose Save As from the file menu. I then chose Text (tab delimited) and saved the file as smoke.txt.. insheet using smoke.txt (7 vars, 807 obs). desc Contains data obs: 807 vars: 7 size: 14,526 (100.0% of memory free) - storage display value variable name type format label variable label - education float %9.0g Education cigprice float %9.0g Cig Price

whitedummy byte %8.0g White Dummy age byte %8.0g Age income int %8.0g Income cigsperday byte %8.0g Cigs per Day restaurantres~s byte %8.0g Restaurant Restrictions - Sorted by: Note: dataset has changed since last saved. sum Variable Obs Mean Std. Dev. Min Max -------------+----------------------------------------------------- education 807 12.47088 3.057161 6 18 cigprice 807 60.30041 4.738469 44.004 70.129 whitedummy 807.8785626.3268375 0 1 age 807 41.23792 17.02729 17 88 income 807 19304.83 9142.958 500 30000 cigsperday 807 8.686493 13.72152 0 80 restaurant~s 807.2465923.4312946 0 1 a) In order to estimate the model, I need to create a couple variables.. gen lincome=ln(income). gen agesq=age^2. reg lincome cigsperday education whitedummy age agesq, robust Regression with robust standard errors Number of obs = 807 F( 5, 801) = 31.54 R-squared = 0.1671 Root MSE =.65246 Robust lincome Coef. Std. Err. t P> t [95% Conf. Interval] cigsperday.0017166.0014324 1.20 0.231 -.0010951.0045284 education.0601146.0074493 8.07 0.000.0454922.074737 whitedummy -.1004146.0615368-1.63 0.103 -.221207.0203777 age.0582598.0091838 6.34 0.000.0402326.0762871 agesq -.0006375.0000981-6.50 0.000 -.0008301 -.000445 _cons 7.877208.2118737 37.18 0.000 7.461314 8.293101 b) The reduced form models regress the endogenous variables (cigsperday and lincome) on all of the exogenous variables in the system. Since these are the first stage regressions for an IV estimate of the original model, I also test for the significance of the instruments (cigprice and restaurantrestrictions). reg cigsperday cigprice restaurantrestrictions education whitedummy age ages > q, robust Regression with robust standard errors Number of obs = 807 F( 6, 800) = 10.69 R-squared = 0.0512 Root MSE = 13.416 Robust

cigsperday Coef. Std. Err. t P> t [95% Conf. Interval] cigprice.0008924.1062094 0.01 0.993 -.2075896.2093744 restaurant~s -2.797495 1.009079-2.77 0.006-4.778251 -.8167386 education -.4515367.1558069-2.90 0.004 -.7573752 -.1456981 whitedummy -.6231239 1.372816-0.45 0.650-3.31787 2.071622 age.8257312.1352017 6.11 0.000.5603392 1.091123 agesq -.0096307.0014337-6.72 0.000 -.012445 -.0068165 _cons.6160884 7.620352 0.08 0.936-14.34216 15.57433. test cigprice restaurantrestrictions ( 1) cigprice = 0.0 ( 2) restaurantrestrictions = 0.0 F( 2, 800) = 3.89 Prob > F = 0.0209. reg lincome cigprice restaurantrestrictions education whitedummy age agesq, > robust Regression with robust standard errors Number of obs = 807 F( 6, 800) = 28.92 R-squared = 0.1727 Root MSE =.65067 Robust lincome Coef. Std. Err. t P> t [95% Conf. Interval] cigprice.0076459.0052416 1.46 0.145 -.0026431.0179349 restaurant~s.0955453.0529191 1.81 0.071 -.0083314.199422 education.0582089.00731 7.96 0.000.04386.0725579 whitedummy -.0807037.0614338-1.31 0.189 -.2012942.0398868 age.0593675.009088 6.53 0.000.0415284.0772066 agesq -.0006505.0000966-6.74 0.000 -.0008401 -.000461 _cons 7.394159.403195 18.34 0.000 6.602714 8.185604 c) Only the first equation is identified. This is because cigprice and restaurantrestrictions are excluded from the income equation, and thus can be used as instruments for cigsperday. There is nothing excluded from the cigsperday equation that can be used as an instrument for lincome.. reg lincome cigsperday education whitedummy age agesq (cigprice restaurantre > strictions education whitedummy age agesq), robust IV (2SLS) regression with robust standard errors Number of obs = 807 F( 5, 801) = 19.72 R-squared =. Root MSE =.84426 Robust lincome Coef. Std. Err. t P> t [95% Conf. Interval] cigsperday -.0380873.0232067-1.64 0.101 -.0836403.0074658 education.0413177.0148594 2.78 0.006.0121497.0704858 whitedummy -.1097679.0798215-1.38 0.169 -.2664519.0469161 age.0911009.0216528 4.21 0.000.048598.1336038

agesq -.0010196.0002475-4.12 0.000 -.0015055 -.0005337 _cons 7.871618.259547 30.33 0.000 7.362146 8.381091 2. Look at the data.. use consump, clear. desc Contains data from consump.dta obs: 37 vars: 24 20 May 2002 22:37 size: 3,626 (100.0% of memory free) - storage display value variable name type format label variable label - year int %9.0g 1959-1995 i3 float %9.0g 3 mo. T-bill rate inf float %9.0g inflation rate; CPI rdisp float %9.0g disp. inc., 1992 $, bils. rnondc float %9.0g nondur. cons., 1992 $, bils. rserv float %9.0g services, 1992 $, bils. pop float %9.0g population, 1000s y float %9.0g per capita real disp. inc. rcons float %9.0g rnondc + rserv c float %9.0g per capita real cons. r3 float %9.0g i3 - inf; real ex post int. lc float %9.0g log(c) ly float %9.0g log(y) gc float %9.0g lc - lc[_n-1] gy float %9.0g ly - ly[_n-1] gc_1 float %9.0g gc[_n-1] gy_1 float %9.0g gy[_n-1] r3_1 float %9.0g r3[_n-1] lc_ly float %9.0g lc - ly lc_ly_1 float %9.0g lc_ly[_n-1] gc_2 float %9.0g gc[_n-2] gy_2 float %9.0g gy[_n-2] r3_2 float %9.0g r3[_n-2] lc_ly_2 float %9.0g lc_ly[_n-2] - Sorted by:. sum Variable Obs Mean Std. Dev. Min Max -------------+----------------------------------------------------- year 37 1977 10.82436 1959 1995 i3 37 6.061622 2.678825 2.38 14.03 inf 37 4.637838 3.124042.7 13.5 rdisp 37 3154.111 1046.487 1530.1 4945.8 rnondc 37 1003.87 245.8753 606.3 1421.9 rserv 37 1556.989 586.6619 687.4 2577 pop 37 220748.9 24755.96 177830 263034 y 37 13940.48 3209.831 8604.285 18802.89 rcons 37 2560.859 831.7756 1293.7 3998.9 c 37 11328.65 2505.241 7274.925 15202.98 r3 37 1.423784 2.064335-3.26 5.43 lc 37 9.309927.2309508 8.892189 9.629247 ly 37 9.514696.2443278 9.060016 9.841766 gc 36.0204738.0126375 -.0091066.0402088 gy 36.0217153.0182399 -.0169735.061985

gc_1 35.0206894.0127547 -.0091066.0402088 gy_1 35.0216098.0184951 -.0169735.061985 r3_1 36 1.388056 2.081984-3.26 5.43 lc_ly 37 -.2047689.0185397 -.2377205 -.1608601 lc_ly_1 36 -.2045536.0187557 -.2377205 -.1608601 gc_2 34.0208501.0129105 -.0091066.0402088 gy_2 34.0218382.018723 -.0169735.061985 r3_2 35 1.379429 2.111726-3.26 5.43 lc_ly_2 35 -.2046825.0190133 -.2377205 -.1608601 2.a) and b) While I could just use year to reflect the trend, I created a trend variable that goes from 1 to 37. The series do appear to be related, even more once they have been detrended. (See commands below that obtained these graphs).. gen t=year-1958 9.84177 log(y) log(c).087457 Residuals Residuals 8.89219 -.077823 1 37 time trend 1 37 time trend. graph ly lc t. reg lc ly Source SS df MS Number of obs = 37 -------------+------------------------------ F( 1, 35) =11959.60 Model 1.91457552 1 1.91457552 Residual.005603043 35.000160087 R-squared = 0.9971 -------------+------------------------------ Adj R-squared = 0.9970 Total 1.92017857 36.053338294 Root MSE =.01265 lc Coef. Std. Err. t P> t [95% Conf. Interval] ly.9438697.0086309 109.36 0.000.9263481.9613912 _cons.3292942.0821463 4.01 0.000.1625284.49606 The elasticity is.94 b) From these regressions we can see that income and consumption are both growing by about 2% per year (2.22 and 2.11 respectively).. reg ly t Source SS df MS Number of obs = 37 -------------+------------------------------ F( 1, 35) = 966.83 Model 2.07397957 1 2.07397957 Residual.075079958 35.002145142 R-squared = 0.9651 -------------+------------------------------ Adj R-squared = 0.9641 Total 2.14905953 36.059696098 Root MSE =.04632

ly Coef. Std. Err. t P> t [95% Conf. Interval] t.0221743.0007131 31.09 0.000.0207265.023622 _cons 9.093386.0155425 585.06 0.000 9.061833 9.124939. predict lydetrend, resid. reg lc t Source SS df MS Number of obs = 37 -------------+------------------------------ F( 1, 35) = 1626.49 Model 1.87972918 1 1.87972918 Residual.040449391 35.001155697 R-squared = 0.9789 -------------+------------------------------ Adj R-squared = 0.9783 Total 1.92017857 36.053338294 Root MSE =.034 lc Coef. Std. Err. t P> t [95% Conf. Interval] t.0211103.0005234 40.33 0.000.0200477.0221729 _cons 8.908832.0114082 780.92 0.000 8.885672 8.931992. predict lcdetrend, resid. graph lydetrend lcdetrend t c) We get an elasticity of.72 with the detrended data (or just including a trend), which is lower than before. Some of the relationship estimated before was due to both variables trending up.. reg lcdetrend lydetrend Source SS df MS Number of obs = 37 -------------+------------------------------ F( 1, 35) = 817.35 Model.038788413 1.038788413 Residual.001660978 35.000047457 R-squared = 0.9589 -------------+------------------------------ Adj R-squared = 0.9578 Total.040449392 36.001123594 Root MSE =.00689 lcdetrend Coef. Std. Err. t P> t [95% Conf. Interval] lydetrend.7187684.0251412 28.59 0.000.6677291.7698078 _cons -1.81e-10.0011325-0.00 1.000 -.0022991.0022991. reg lc ly t Source SS df MS Number of obs = 37 -------------+------------------------------ F( 2, 34) =19635.89 Model 1.91851759 2.959258795 Residual.001660979 34.000048852 R-squared = 0.9991 -------------+------------------------------ Adj R-squared = 0.9991 Total 1.92017857 36.053338294 Root MSE =.00699 lc Coef. Std. Err. t P> t [95% Conf. Interval] ly.7187684.0255082 28.18 0.000.6669294.7706074 t.0051721.0005758 8.98 0.000.004002.0063423 _cons 2.372793.2319681 10.23 0.000 1.901377 2.844209

d) In order to estimate Newey-West standard errors or use Cochrane-Orcutt estimation we need to tell stata what the time variable is.. tsset t time variable: t, 1 to 37. newey lc ly t, lag(4) Regression with Newey-West standard errors Number of obs = 37 maximum lag : 4 F( 2, 34) = 19535.97 Newey-West lc Coef. Std. Err. t P> t [95% Conf. Interval] ly.7187684.0277335 25.92 0.000.6624072.7751297 t.0051721.0006073 8.52 0.000.003938.0064063 _cons 2.372793.2518517 9.42 0.000 1.860969 2.884618 These are the same coefficients as OLS, but different standard errors. That s what we expected.. prais lc ly t, corc Iteration 0: rho = 0.0000 Iteration 1: rho = 0.3742 Iteration 2: rho = 0.3941 Iteration 3: rho = 0.3976 Iteration 4: rho = 0.3983 Iteration 5: rho = 0.3984 Iteration 6: rho = 0.3984 Iteration 7: rho = 0.3984 Iteration 8: rho = 0.3984 Cochrane-Orcutt AR(1) regression -- iterated estimates Source SS df MS Number of obs = 36 -------------+------------------------------ F( 2, 33) = 7084.09 Model.61289322 2.30644661 Residual.001427529 33.000043258 R-squared = 0.9977 -------------+------------------------------ Adj R-squared = 0.9975 Total.614320749 35.017552021 Root MSE =.00658 lc Coef. Std. Err. t P> t [95% Conf. Interval] ly.7035581.0376261 18.70 0.000.6270072.780109 t.0054885.000828 6.63 0.000.0038039.0071731 _cons 2.511284.3427244 7.33 0.000 1.814006 3.208562 rho.3984385 Durbin-Watson statistic (original) 1.236912 Durbin-Watson statistic (transformed) 1.900697 This is a different estimator it s based on assuming exactly AR(1) serial correlation, but the implications are similar to the previous estimates, which is also as expected. 3. Look at the data.. use intdef, clear

. desc Contains data from intdef.dta obs: 49 vars: 13 24 Jan 2000 22:55 size: 2,499 (100.0% of memory free) - storage display value variable name type format label variable label - year int %9.0g 1948-1996 i3 float %9.0g 3 mo. T bill rate inf float %9.0g CPI inf. rate rec float %9.0g fed. receipts, % GDP out float %9.0g fed. outlays, % GDP def float %9.0g out - rec i3_1 float %9.0g i3[_n-1] inf_1 float %9.0g inf[_n-1] def_1 float %9.0g def[_n-1] ci3 float %9.0g i3 - i3_1 cinf float %9.0g inf - inf_1 cdef float %9.0g def - def_1 y77 byte %9.0g =1 year >= 1977; change in FY - Sorted by:. sum Variable Obs Mean Std. Dev. Min Max -------------+----------------------------------------------------- year 49 1972 14.28869 1948 1996 i3 49 5.06898 2.965661.95 14.03 inf 49 4.108163 3.182821-1.2 13.5 rec 49 17.83878 1.058855 14.5 19.7 out 49 19.69184 2.514528 11.7 23.6 def 49 1.853061 2.04268-4.7 6 i3_1 48 5.07 2.997035.95 14.03 inf_1 48 4.13125 3.212354-1.2 13.5 def_1 48 1.8625 2.063216-4.7 6 ci3 48.0829167 1.381285-3.34 2.97 cinf 48 -.10625 2.566926-9.3 6.6 cdef 48.1270833 1.496057-3.200001 4.499999 y77 49.4081633.496587 0 1 a) The finite distributed lag model has one lag of each independent variable:. reg i3 inf inf_1 def def_1 Source SS df MS Number of obs = 48 -------------+------------------------------ F( 4, 43) = 28.23 Model 293.744216 4 73.436054 Residual 111.851766 43 2.60120386 R-squared = 0.7242 -------------+------------------------------ Adj R-squared = 0.6986 Total 405.595982 47 8.62970175 Root MSE = 1.6128 i3 Coef. Std. Err. t P> t [95% Conf. Interval] inf.4251947.1288993 3.30 0.002.1652445.6851449 inf_1.2732321.1412654 1.93 0.060 -.0116568.558121

def.1630251.2569521 0.63 0.529 -.3551682.6812185 def_1.4047176.217547 1.86 0.070 -.0340078.8434431 _cons 1.234579.4410125 2.80 0.008.3451927 2.123966 a) The impact propensity is the coefficient on the current time period. Thus the impact propensity for inflation is.425 and for the deficit it is.163. b) The long-run propensity is the sum of the coefficients on the current and lagged variables. For inflation it is.698 for the deficit it is.568. We need to test whether this sum is significant not whether the two coefficients are jointly significant.. test inf + inf_1=0 ( 1) inf + inf_1 = 0.0 F( 1, 43) = 66.70. test def+def_1=0 ( 1) def + def_1 = 0.0 F( 1, 43) = 16.25 Prob > F = 0.0002 Both long run propensities are statistically significant.