Handout seminar 6, ECON4150

Similar documents
Heteroskedasticity. . reg wage black exper educ married tenure

You created this PDF from an application that is not licensed to print to novapdf printer (

Solutions for Session 5: Linear Models

Time series data: Part 2

*1A. Basic Descriptive Statistics sum housereg drive elecbill affidavit witness adddoc income male age literacy educ occup cityyears if control==1

İnsan TUNALI 8 November 2018 Econ 511: Econometrics I. ASSIGNMENT 7 STATA Supplement

Your Name (Please print) Did you agree to take the optional portion of the final exam Yes No. Directions

ECON Introductory Econometrics Seminar 2, 2015

Cameron ECON 132 (Health Economics): FIRST MIDTERM EXAM (A) Fall 17

Chapter 6 Part 3 October 21, Bootstrapping

Final Exam - section 1. Thursday, December hours, 30 minutes

u panel_lecture . sum

ECON Introductory Econometrics. Seminar 4. Stock and Watson Chapter 8

Professor Brad Jones University of Arizona POL 681, SPRING 2004 INTERACTIONS and STATA: Companion To Lecture Notes on Statistical Interactions

[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA]

Advanced Econometrics

Econ 371 Problem Set #4 Answer Sheet. 6.2 This question asks you to use the results from column (1) in the table on page 213.

Example 2.3: CEO Salary and Return on Equity. Salary for ROE = 0. Salary for ROE = 30. Example 2.4: Wage and Education

F^3: F tests, Functional Forms and Favorite Coefficient Models

Labor Force Participation and the Wage Gap Detailed Notes and Code Econometrics 113 Spring 2014

tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6}

Chapter 11 Part 6. Correlation Continued. LOWESS Regression

Problem Set 9 Heteroskedasticty Answers

The Multivariate Regression Model

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998

Problem Set 4 Answer Key

The relationship between GDP, labor force and health expenditure in European countries

Dummy variables 9/22/2015. Are wages different across union/nonunion jobs. Treatment Control Y X X i identifies treatment

Rationale. Learning about return and risk from the historical record and beta estimation. T Bills and Inflation

Example 1 of econometric analysis: the Market Model

Problem Set 6 ANSWERS

Quantitative Techniques Term 2

Assignment #5 Solutions: Chapter 14 Q1.

Labor Market Returns to Two- and Four- Year Colleges. Paper by Kane and Rouse Replicated by Andreas Kraft

Example 8.1: Log Wage Equation with Heteroscedasticity-Robust Standard Errors

Lecture 6: Non Normal Distributions

Econometrics is. The estimation of relationships suggested by economic theory

. ********** OUTPUT FILE: CARD & KRUEGER (1994)***********.. * STATA 10.0 CODE. * copyright C 2008 by Tito Boeri & Jan van Ours. * "THE ECONOMICS OF

(ii) Give the name of the California website used to find the various insurance plans offered under the Affordable care Act (Obamacare).

Appendixes Appendix 1 Data of Dependent Variables and Independent Variables Period

Lecture 1: Empirical Properties of Returns

Stat 328, Summer 2005

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt.

Financial Econometrics: Problem Set # 3 Solutions

Financial Econometrics Jeffrey R. Russell Midterm 2014

Question 1a 1b 1c 1d 1e 1f 2a 2b 2c 2d 3a 3b 3c 3d M ult:choice Points

GGraph. Males Only. Premium. Experience. GGraph. Gender. 1 0: R 2 Linear = : R 2 Linear = Page 1

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 7.4-1

Chapter 4 Level of Volatility in the Indian Stock Market

Statistics & Flood Frequency Chapter 3. Dr. Philip B. Bedient

Question scores. Question 1a 1b 1c 1d 1e 2a 2b 2c 2d 2e 2f 3a 3b 3c 3d M ult:choice Points

Example 7.1: Hourly Wage Equation Average wage for women

Economics 483. Midterm Exam. 1. Consider the following monthly data for Microsoft stock over the period December 1995 through December 1996:

Visualisierung von Nicht-Linearität bzw. Heteroskedastizität

EC327: Limited Dependent Variables and Sample Selection Binomial probit: probit

SAS Simple Linear Regression Example

The impact of cigarette excise taxes on beer consumption

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, Last revised January 10, 2017

Brief Sketch of Solutions: Tutorial 1. 2) descriptive statistics and correlogram. Series: LGCSI Sample 12/31/ /11/2009 Observations 2596

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, Last revised January 13, 2018

Lampiran 1 : Grafik Data HIV Asli

Impact of Household Income on Poverty Levels

Analysis of the Influence of the Annualized Rate of Rentability on the Unit Value of the Net Assets of the Private Administered Pension Fund NN

Gloria Gonzalez-Rivera Forecasting For Economics and Business Solutions Manual

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods

The SAS System 11:03 Monday, November 11,

Does Globalization Improve Quality of Life?

Key Objectives. Module 2: The Logic of Statistical Inference. Z-scores. SGSB Workshop: Using Statistical Data to Make Decisions

Notice that X2 and Y2 are skewed. Taking the SQRT of Y2 reduces the skewness greatly.

Written by N.Nilgün Çokça. Advance Excel. Part One. Using Excel for Data Analysis

David Tenenbaum GEOG 090 UNC-CH Spring 2005

Tests for One Variance

An Empirical Research on Chinese Stock Market Volatility Based. on Garch

Washington University Fall Economics 487

Business Statistics 41000: Probability 3

INFLUENCE OF CONTRIBUTION RATE DYNAMICS ON THE PENSION PILLAR II ON THE

Modeling Volatility of Price of Some Selected Agricultural Products in Ethiopia: ARIMA-GARCH Applications

Kabupaten Langkat Suku Bunga Kredit. PDRB harga berlaku

Catherine De Vries, Spyros Kosmidis & Andreas Murr

1) The Effect of Recent Tax Changes on Taxable Income

Financial Risk, Liquidity Risk and their Effect on the Listed Jordanian Islamic Bank's Performance

Hypothesis Tests: One Sample Mean Cal State Northridge Ψ320 Andrew Ainsworth PhD

Statistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron

Washington University Fall Economics 487. Project Proposal due Monday 10/22 Final Project due Monday 12/3

Kerkar Puja Paresh Dr. P. Sriram

Monetary Economics Measuring Asset Returns. Gerald P. Dwyer Fall 2015

Robust Critical Values for the Jarque-bera Test for Normality

Relation between Income Inequality and Economic Growth

Statistical Analysis of Data from the Stock Markets. UiO-STK4510 Autumn 2015

1 Exercise One. 1.1 Calculate the mean ROI. Note that the data is not grouped! Below you find the raw data in tabular form:

Lecture 07: Measures of central tendency

Model fit assessment via marginal model plots

Unit 2 Statistics of One Variable

Simple Descriptive Statistics

Two-Sample T-Test for Superiority by a Margin

Two-Sample T-Test for Non-Inferiority

11/28/2018. Overview. Multiple Linear Regression Analysis. Multiple regression. Multiple regression. Multiple regression. Multiple regression

Research Article The Volatility of the Index of Shanghai Stock Market Research Based on ARCH and Its Extended Forms

Chapter 6 Part 6. Confidence Intervals chi square distribution binomial distribution

Transcription:

Handout seminar 6, ECON4150 Herman Kruse March 17, 2013 Introduction - list of commands This week, we need a couple of new commands in order to solve all the problems. hist var1 if var2, options - creates a histogram of var1 with var2 as reference. Options are many and can be found using findit su(mmarize) - summarizes some key properties of the specified variable. Option detail is added to get the more detailed properties (such as kurtosis and skewness) scalar - creates a scalar product gen lprice = ln(price) reg lprice sqft -------------+----- F( 1, 878) = 2143.38 Model 88.3556977 1 88.3556977 Prob > F = 0.0000 Residual 36.1934444 878.041222602 R-squared = 0.7094 -------------+----- Adj R-squared = 0.7091 Total 124.549142 879.141694132 Root MSE =.20303 lprice Coef. Std. Err. t P> t [95% Conf. Interval] -------------+-------------- sqft.000596.0000129 46.30 0.000.0005707.0006212 _cons 10.59379.02185 484.84 0.000 10.5509 10.63667 predict ehat1, residuals mean sqft mean price Mean estimation Number of obs = 880 ------------ Mean Std. Err. [95% Conf. Interval] -------------+----------------------- sqft 1611.968 17.93339 1576.771 1647.165 ------------ Thanks to Erling Skancke for excellent suggestions to this document 1

Handout seminar 6 2 Mean estimation Number of obs = 880 ------------ Mean Std. Err. [95% Conf. Interval] -------------+----------------------- price 112810.8 1780.356 109316.6 116305.1 ------------ gen lsqft = ln(sqft) reg lprice lsqft -------------+----- F( 1, 878) = 1993.88 Model 86.4716562 1 86.4716562 Prob > F = 0.0000 Residual 38.0774859 878.043368435 R-squared = 0.6943 -------------+----- Adj R-squared = 0.6939 Total 124.549142 879.141694132 Root MSE =.20825 lprice Coef. Std. Err. t P> t [95% Conf. Interval] -------------+-------------- lsqft 1.006582.0225423 44.65 0.000.9623386 1.050825 _cons 4.170677.1655084 25.20 0.000 3.845839 4.495515 predict ehat2, residuals reg price sqft -------------+----- F( 1, 878) = 1799.75 Model 1.6479e+12 1 1.6479e+12 Prob > F = 0.0000 Residual 8.0391e+11 878 915618929 R-squared = 0.6721 -------------+----- Adj R-squared = 0.6717 Total 2.4518e+12 879 2.7893e+09 Root MSE = 30259 price Coef. Std. Err. t P> t [95% Conf. Interval] -------------+-------------- sqft 81.38899 1.918489 42.42 0.000 77.62363 85.15435 _cons -18385.65 3256.424-5.65 0.000-24776.94-11994.37 predict ehat3, residuals hist ehat1 if lprice, bin(35) start (-1) hist ehat2 if lprice, bin(35) start (-1) hist ehat3 if price, bin(35) start (-110000)

Handout seminar 6 3 Figure 1: Histograms of residuals For Jarque-Bera-testing, we need to know how to construct the test-statistic. It has the following formula: JB = n 6 (kurtosis [(skewness2 3)2 ) + ] 4 And what we really test is the hypothesis about normality in the residuals. If the observed value is above some critical value, we reject the hypothesis and conclude that the residuals are not compatible with an assumption about normality. The skewness refers to how symmetric the residuals are around zero, while kurtosis refers to the peakedness of the distribution. For a normal distribution, the skewness is equal to zero, while the kurtosis is equal to three. So we need to check whether the skewness is sufficiently different from zero and kurtosis sufficiently different from three in order to conclude that the residuals are not normally distributed. When the residuals are normally distributed, the Jarque-Bera statistic has a chi-square distribution with two degrees of freedom. So we reject the null-hypothesis when we have a test statistic exceeding χ 2 2,0.95 = 5.99 with a 5% significance-level. Note that if we do not reject the null-hypothesis, this does not directly imply normality in the residuals. There are more distributions with skewness 0 and kurtosis 3 (or so-called symmetric and mesokurtosic distributions). So the Jarque-Bera test will, if we reject, say we have strong evidence about a skewed distribution, or a sharply peaked distribution. su ehat1, detail su ehat2, detail su ehat3, detail ----------- 1% -.4598814 -.710299 5% -.3142879 -.6619065 10% -.2410653 -.6477303 Obs 880 25% -.1200507 -.6345798 Sum of Wgt. 880 50% -.0139173 Mean -1.73e-10 Largest Std. Dev..202918 75%.1158161.7670023

Handout seminar 6 4 90%.2606355.7681834 Variance.0411757 95%.3558994.8957195 Skewness.3239307 99%.5422422.9086631 Kurtosis 4.315611 ----------- 1% -.49264 -.7518981 5% -.3091487 -.6739541 10% -.2441997 -.6184166 Obs 880 25% -.1303633 -.5487044 Sum of Wgt. 880 50% -.018237 Mean -2.38e-10 Largest Std. Dev..2081324 75%.1182303.7180918 90%.2746109.7190305 Variance.0433191 95%.3657138.8612714 Skewness.3488042 99%.5302507.8624387 Kurtosis 3.975605 ----------- 1% -68089.01-101224.1 5% -41894.24-91337.09 10% -30454.09-84395.79 Obs 880 25% -16140.87-76857.88 Sum of Wgt. 880 50% -2667.711 Mean -9.86e-06 Largest Std. Dev. 30241.98 75% 12093.73 166920.7 90% 28794.58 168413 Variance 9.15e+08 95% 50104.84 186850.3 Skewness 1.59206 99% 112023.8 204279.8 Kurtosis 10.53922 scalar jb = (880/6)*(0.3239307)^2 + (880/6)*((4.315611-3)^2)/4 scalar jb = (880/6)*(0.3488042)^2 + (880/6)*((3.975605-3)^2)/4

Handout seminar 6 5 scalar jb = (880/6)*(1.59206)^2 + (880/6)*((10.53922-3)^2)/4 Jarque-Bera Statistic = 78.853746 Jarque-Bera p-value = 7.536e-18 Jarque-Bera Statistic = 52.743629 Jarque-Bera p-value = 3.523e-12 Jarque-Bera Statistic = 2455.8768 Jarque-Bera p-value = 0 scatter ehat1 sqft scatter ehat2 sqft scatter ehat3 sqft Figure 2: Scatter plot residuals on sqft Mis-calculation: di exp(10.59379+0.000596*2700) di exp(4.170677)*2700^1.006582 di (81.38899*2700-18385.65) 199384.42 184183.62 201364.62 Correct calculation: di exp(10.59379+0.000596*2700+0.041222602/2) di exp(4.170677+1.006582*ln(2700)+0.043368435/2) di (81.38899*2700-18385.65) 203536.64 188221.12 201364.62