You created this PDF from an application that is not licensed to print to novapdf printer (

Size: px
Start display at page:

Download "You created this PDF from an application that is not licensed to print to novapdf printer (http://www.novapdf.com)"

Transcription

1 Monday October 3 10:11: Page 1 (R) / / / / / / / / / / / / Statistics/Data Analysis Education Box and save these files in a local folder. name: <unnamed log: M:\pc\Dokumenter\ECON4135\Extra_sem\Extraseminar.smcl log type: smcl opened on: 3 Oct 2011, 10:06:48 1. /* ECON4135 Applied Statistics and Econometrics Autumn 2011 STATA course UiO Problem set for the extra seminar on week 39 Exercise 1 Consider the Education Box, on page 202 in the textbook (Chapter 5). It is claimed that earnings of workers with high education are more spread than those w ith lower education. And this will result in heteroskedastic error terms when we regress hourly earnings on education. In this exercise, we will provide more explicit justification for some of the claims in the Education Box. First of all, save the excel dataset and the file that describes the data by following these instructions: 1. Go to the course homepage: Econ4135 Autumn Click on the Lecture and seminar plan link. 3. When the pdf file opens, click on the Student resource link and allow the connection. 4. On the textbook website that opens up, click on Datasets for Replicating Empirical on the left column. 5. Choose Data Description and Excel Data Set, under Economic value of a Year of Education Box and save these files in a local folder. NOTE: The above dataset comes also in Stata format, which is simpler to save and load on Stat a directly. For the sake of practicing how to upload a dataset in excel format onto Stata, let us u se the excel version /* a) Open the dataset in an Excel sheet. Save the dataset as a tab delimited text file: name _of_file.txt. The txt ending is important here. Upload this txt file on Stata by using the command insheet. Remember that you need to include the directory path with the name of the txt file when you use the insheet command, so that Stata will be able to find the file. 4. insheet using "M:\pc\Dokumenter\ECON4135\Extra_sem\ch5_cps_box.txt" (4 vars, 2989 obs) /* b) Use the information on the file that describes the dataset to label the variables. (HINT: Stata command label var ) 8. label var a_age "age"

2 Monday October 3 10:11: Page 2 9. label var a_sex "1 if male; 2 if female" 10. label var ahe "Average Hourly Earnings in 2004" 11. label var yrseduc "Years of Education" /* c) What are the types (i.e. string or numeric) of variables in this dataset? Transform all the variables in numeric format. (HINT: Stata command destring; pay particular attention to the option dpcomma; you may also want to use the replace option) 14. describe Contains data obs: 2,989 vars: 4 size: 65,758 (99.9% of memory free) storage display value variable name type format label variable label a_age byte %8.0g age a_sex byte %8.0g 1 if male; 2 if female ahe str11 %11s Average Hourly Earnings in 2004 yrseduc byte %8.0g Years of Education Sorted by: Note: dataset has changed since last saved 15. destring ahe, replace dpcomma ahe has all characters numeric; replaced as double /* d) Are there any missing values in this dataset? If yes, from which variable(s)? 18. gen x=1 if missing( a_age) missing( a_sex) missing(ahe) missing( yrseduc) (2989 missing values generated) 19. tab x, missing 20. drop x x Freq. Percent Cum.. 2, Total 2, /* e) Consider the variable a_sex. Create a dummy variable called male that takes the value 1 if a_sex refers to a male an d 0 otherwise. Label the values of the variable male by using the Stata command label.

3 Monday October 3 10:11: Page gen male=1 if a_sex==1 (1331 missing values generated) 24. replace male=0 if missing(male) (1331 real changes made) 25. label define male_label 1 "male" 0 "female" 26. label values male male_label 27. tab male, missing male Freq. Percent Cum. female 1, male 1, Total 2, /* f) Replicate the regression results (5.23) and figure (5.3) in the Education Box on page regress ahe yrseduc Source SS df MS Number of obs = 2989 F( 1, 2987) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = ahe Coef. Std. Err. t P t [95% Conf. Interval] yrseduc _cons twoway (scatter ahe yrseduc) (lfit ahe yrseduc) /* g) Calculate the predicted values from the above regression by using the Stata command pre dict with the option xb. (e.g. predict ahe_hat, xb). Notice that this command will work only if you have already run the regression. Now generate the residuals of the regression (e.g. gen resid= ahe ahe_hat). 34. predict ahe_hat, xb 35. gen resid=ahe-ahe_hat /* h) To verify that the claims on the last paragraph in the Education Box, find the standard deviation of the residuals for the following subgroups: -those with 10 years of education -those with a high school diploma (i.e. 12 years of education) -those with a college degree (i.e. 16 years of education)

4 Monday October 3 10:11: Page sum resid if yrseduc==10 resid sum resid if yrseduc==12 resid sum resid if yrseduc==16 resid /* i) Run a regression of earnings on years of education (just like (5.23) in the Education B ox) only for men. Does the distribution of earnings for men spread out as education inc reases? 44. regress ahe yrseduc if male==1 Source SS df MS Number of obs = 1658 F( 1, 1656) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = ahe Coef. Std. Err. t P t [95% Conf. Interval] yrseduc _cons *if condition is important 46. predict ahe_hat_men if male==1, xb (1331 missing values generated) 47. *if condition is VERY important 48. gen resid_men=ahe-ahe_hat_men if male==1 (1331 missing values generated) sum resid_men if yrseduc==10 & male==1 resid_men sum resid_men if yrseduc==12 & male==1 resid_men

5 Monday October 3 10:11: Page sum resid_men if yrseduc==16 & male==1 resid_men /* Exercise 2 There is a typing mistake in the book. Two exercises are numbered E5.2 on page 215. Do both! In order to have access to the data that these exercises refer, follow these instructio ns: 1. Go to the course homepage: Econ4135 Autumn Click on the Lecture and seminar plan link. 3. When the pdf file opens, click on the Student resource link and allow the connection. 4. On the textbook website that opens up, click on Data for Empirical Exercises a nd Test on the left column. 5. Choose College Distance and Teacher Ratings Data from the list of data. Save th e Stata version of the datasets on a local folder so that you can open them with Stata. Use the PDF files that provide a description of the datasets in order to become acquainted with the datasets. Here is the command that loads a datasets (of Stata type) on Stata: use filename.dta, clear The dta ending of the filename is important. filename should include also the directory path that tells Stata where to find the saved filename. 55. *E5.2 (the first) 56. use "M:\pc\Dokumenter\ECON4135\Extra_sem\TeachingRatings.dta", clear 57. reg course_eval beauty Source SS df MS Number of obs = 463 F( 1, 461) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = course_eval Coef. Std. Err. t P t [95% Conf. Interval] beauty _cons *E5.2 (the second) 61. use "M:\pc\Dokumenter\ECON4135\Extra_sem\CollegeDistance.dta", clear 62. *doing the calculations 63. regress ed dist Source SS df MS Number of obs = 3796 F( 1, 3794) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = ed Coef. Std. Err. t P t [95% Conf. Interval] dist _cons

6 Monday October 3 10:11: Page tab female, missing female Freq. Percent Cum. 0 1, , Total 3, bysort female: regress ed dist - female = 0 Source SS df MS Number of obs = 1726 F( 1, 1724) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = ed Coef. Std. Err. t P t [95% Conf. Interval] dist _cons female = 1 Source SS df MS Number of obs = 2070 F( 1, 2068) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = ed Coef. Std. Err. t P t [95% Conf. Interval] dist _cons *Coefficient of interest 67. display (-0.064) *Variance of the coefficient 69. display * * *St. error 71. display sqrt( ) *t statisitc 73. display -0.02/

7 Monday October 3 10:11: Page *Letting Stata do the calculations 76. *females 77. regress ed dist if female==1 Source SS df MS Number of obs = 2070 F( 1, 2068) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = ed Coef. Std. Err. t P t [95% Conf. Interval] dist _cons matrix b=e(b) 79. matrix V=e(V) matrix list b b[1,2] dist _cons y matrix list V symmetric V[2,2] dist _cons dist _cons scalar coef_female=b[1, 1] 85. scalar var_female=v[1,1] display coef_female display var_female *males 91. regress ed dist if female==0 Source SS df MS Number of obs = 1726 F( 1, 1724) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = ed Coef. Std. Err. t P t [95% Conf. Interval] dist _cons

8 Monday October 3 10:11: Page matrix b=e(b) 93. matrix V=e(V) 94. scalar coef_male=b[1, 1] 95. scalar var_male=v[1,1] 96. display coef_male display var_male scalar t_stat=(coef_male-coef_female)/sqrt(var_male+var_female) 100. display t_stat /* Exercise 3 Repeat the exercise from the last seminar. Use the nlsw88.dta data. (Remember that to upload this dataset you need to type: sysuse nslw88.dta) 105. sysuse nlsw88.dta (NLSW, 1988 extract) 106. *1. What is mean of schooling grade in the sample? 107. sum grade, detail current grade completed Percentiles Smallest 1% 7 0 5% % 11 4 Obs % 12 4 Sum of Wgt % 12 Mean Largest Std. Dev % % Variance % Skewness % Kurtosis *2. Are there any missing values? 110. tab grade, missing current grade completed Freq. Percent Cum

9 Monday October 3 10:11: Page Total 2, *3. What is the age of those with missing values in education? 113. tab age if missing(grade) age in current year Freq. Percent Cum Total *4. Are they union workers? 116. tab union if missing(grade) union worker Freq. Percent Cum. nonunion union Total *5. Do they live in the south? 119. sum south if missing(grade) south *6. Do they live in smsa? 122. sum smsa if missing(grade) smsa *7. What is the relationship between wages and grades (education)? 125. *Create a scatter and a best linear fit with confidence intervals twoway (scatter wage grade) (lfit wage grade) *8. Regress wages on grades reg wage grade Source SS df MS Number of obs = 2244 F( 1, 2242) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = wage Coef. Std. Err. t P t [95% Conf. Interval] grade _cons

10 Monday October 3 10:11: Page *9. Test whether the slope is equal to test grade=1 ( 1) grade = 1 F( 1, 2242) = Prob F = *10. Regress wages on grades separately for union and non-union members bysort union: reg wage grade - union = nonunion Source SS df MS Number of obs = 1416 F( 1, 1414) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = wage Coef. Std. Err. t P t [95% Conf. Interval] grade _cons union = union Source SS df MS Number of obs = 460 F( 1, 458) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = wage Coef. Std. Err. t P t [95% Conf. Interval] grade _cons union =. Source SS df MS Number of obs = 368 F( 1, 366) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = wage Coef. Std. Err. t P t [95% Conf. Interval] grade _cons

11 Monday October 3 10:11: Page *11. Predict wages from the regression of wages on grades reg wage grade Source SS df MS Number of obs = 2244 F( 1, 2242) = Model Prob F = Residual R-squared = Adj R-squared = Total Root MSE = wage Coef. Std. Err. t P t [95% Conf. Interval] grade _cons predict wage_hat, xb (2 missing values generated) *12. Add the predicted wages to the graph above twoway (scatter wage grade) (lfit wage grade) (scatter wage_hat grade) log close name: <unnamed log: M:\pc\Dokumenter\ECON4135\Extra_sem\Extraseminar.smcl log type: smcl closed on: 3 Oct 2011, 10:06:57

Econ 371 Problem Set #4 Answer Sheet. 6.2 This question asks you to use the results from column (1) in the table on page 213.

Econ 371 Problem Set #4 Answer Sheet. 6.2 This question asks you to use the results from column (1) in the table on page 213. Econ 371 Problem Set #4 Answer Sheet 6.2 This question asks you to use the results from column (1) in the table on page 213. a. The first part of this question asks whether workers with college degrees

More information

ECON Introductory Econometrics. Seminar 4. Stock and Watson Chapter 8

ECON Introductory Econometrics. Seminar 4. Stock and Watson Chapter 8 ECON4150 - Introductory Econometrics Seminar 4 Stock and Watson Chapter 8 empirical exercise E8.2: Data 2 In this exercise we use the data set CPS12.dta Each month the Bureau of Labor Statistics in the

More information

tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6}

tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6} PS 4 Monday August 16 01:00:42 2010 Page 1 tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6} log: C:\web\PS4log.smcl log type: smcl opened on:

More information

İnsan TUNALI 8 November 2018 Econ 511: Econometrics I. ASSIGNMENT 7 STATA Supplement

İnsan TUNALI 8 November 2018 Econ 511: Econometrics I. ASSIGNMENT 7 STATA Supplement İnsan TUNALI 8 November 2018 Econ 511: Econometrics I ASSIGNMENT 7 STATA Supplement. use "F:\COURSES\GRADS\ECON511\SHARE\wages1.dta", clear. generate =ln(wage). scatter sch Q. Do you see a relationship

More information

Professor Brad Jones University of Arizona POL 681, SPRING 2004 INTERACTIONS and STATA: Companion To Lecture Notes on Statistical Interactions

Professor Brad Jones University of Arizona POL 681, SPRING 2004 INTERACTIONS and STATA: Companion To Lecture Notes on Statistical Interactions Professor Brad Jones University of Arizona POL 681, SPRING 2004 INTERACTIONS and STATA: Companion To Lecture Notes on Statistical Interactions Preliminaries 1. Basic Regression. reg y x1 Source SS df MS

More information

Chapter 6 Part 3 October 21, Bootstrapping

Chapter 6 Part 3 October 21, Bootstrapping Chapter 6 Part 3 October 21, 2008 Bootstrapping From the internet: The bootstrap involves repeated re-estimation of a parameter using random samples with replacement from the original data. Because the

More information

Example 2.3: CEO Salary and Return on Equity. Salary for ROE = 0. Salary for ROE = 30. Example 2.4: Wage and Education

Example 2.3: CEO Salary and Return on Equity. Salary for ROE = 0. Salary for ROE = 30. Example 2.4: Wage and Education 1 Stata Textbook Examples Introductory Econometrics: A Modern Approach by Jeffrey M. Wooldridge (1st & 2d eds.) Chapter 2 - The Simple Regression Model Example 2.3: CEO Salary and Return on Equity summ

More information

Handout seminar 6, ECON4150

Handout seminar 6, ECON4150 Handout seminar 6, ECON4150 Herman Kruse March 17, 2013 Introduction - list of commands This week, we need a couple of new commands in order to solve all the problems. hist var1 if var2, options - creates

More information

Heteroskedasticity. . reg wage black exper educ married tenure

Heteroskedasticity. . reg wage black exper educ married tenure Heteroskedasticity. reg Source SS df MS Number of obs = 2,380 -------------+---------------------------------- F(2, 2377) = 72.38 Model 14.4018246 2 7.20091231 Prob > F = 0.0000 Residual 236.470024 2,377.099482551

More information

Problem Set 6 ANSWERS

Problem Set 6 ANSWERS Economics 20 Part I. Problem Set 6 ANSWERS Prof. Patricia M. Anderson The first 5 questions are based on the following information: Suppose a researcher is interested in the effect of class attendance

More information

Solutions for Session 5: Linear Models

Solutions for Session 5: Linear Models Solutions for Session 5: Linear Models 30/10/2018. do solution.do. global basedir http://personalpages.manchester.ac.uk/staff/mark.lunt. global datadir $basedir/stats/5_linearmodels1/data. use $datadir/anscombe.

More information

[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA]

[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA] Tutorial #3 This example uses data in the file 16.09.2011.dta under Tutorial folder. It contains 753 observations from a sample PSID data on the labor force status of married women in the U.S in 1975.

More information

Final Exam - section 1. Thursday, December hours, 30 minutes

Final Exam - section 1. Thursday, December hours, 30 minutes Econometrics, ECON312 San Francisco State University Michael Bar Fall 2013 Final Exam - section 1 Thursday, December 19 1 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.

More information

Your Name (Please print) Did you agree to take the optional portion of the final exam Yes No. Directions

Your Name (Please print) Did you agree to take the optional portion of the final exam Yes No. Directions Your Name (Please print) Did you agree to take the optional portion of the final exam Yes No (Your online answer will be used to verify your response.) Directions There are two parts to the final exam.

More information

u panel_lecture . sum

u panel_lecture . sum u panel_lecture sum Variable Obs Mean Std Dev Min Max datastre 639 9039644 6369418 900228 926665 year 639 1980 2584012 1976 1984 total_sa 639 9377839 3212313 682 441e+07 tot_fixe 639 5214385 1988422 642

More information

Dummy variables 9/22/2015. Are wages different across union/nonunion jobs. Treatment Control Y X X i identifies treatment

Dummy variables 9/22/2015. Are wages different across union/nonunion jobs. Treatment Control Y X X i identifies treatment Dummy variables Treatment 22 1 1 Control 3 2 Y Y1 0 1 2 Y X X i identifies treatment 1 1 1 1 1 1 0 0 0 X i =1 if in treatment group X i =0 if in control H o : u n =u u Are wages different across union/nonunion

More information

Labor Force Participation and the Wage Gap Detailed Notes and Code Econometrics 113 Spring 2014

Labor Force Participation and the Wage Gap Detailed Notes and Code Econometrics 113 Spring 2014 Labor Force Participation and the Wage Gap Detailed Notes and Code Econometrics 113 Spring 2014 In class, Lecture 11, we used a new dataset to examine labor force participation and wages across groups.

More information

The Multivariate Regression Model

The Multivariate Regression Model The Multivariate Regression Model Example Determinants of College GPA Sample of 4 Freshman Collect data on College GPA (4.0 scale) Look at importance of ACT Consider the following model CGPA ACT i 0 i

More information

Advanced Econometrics

Advanced Econometrics Advanced Econometrics Instructor: Takashi Yamano 11/14/2003 Due: 11/21/2003 Homework 5 (30 points) Sample Answers 1. (16 points) Read Example 13.4 and an AER paper by Meyer, Viscusi, and Durbin (1995).

More information

*1A. Basic Descriptive Statistics sum housereg drive elecbill affidavit witness adddoc income male age literacy educ occup cityyears if control==1

*1A. Basic Descriptive Statistics sum housereg drive elecbill affidavit witness adddoc income male age literacy educ occup cityyears if control==1 *1A Basic Descriptive Statistics sum housereg drive elecbill affidavit witness adddoc income male age literacy educ occup cityyears if control==1 Variable Obs Mean Std Dev Min Max --- housereg 21 2380952

More information

F^3: F tests, Functional Forms and Favorite Coefficient Models

F^3: F tests, Functional Forms and Favorite Coefficient Models F^3: F tests, Functional Forms and Favorite Coefficient Models Favorite coefficient model: otherteams use "nflpricedata Bdta", clear *Favorite coefficient model: otherteams reg rprice pop pop2 rpci wprcnt1

More information

Labor Market Returns to Two- and Four- Year Colleges. Paper by Kane and Rouse Replicated by Andreas Kraft

Labor Market Returns to Two- and Four- Year Colleges. Paper by Kane and Rouse Replicated by Andreas Kraft Labor Market Returns to Two- and Four- Year Colleges Paper by Kane and Rouse Replicated by Andreas Kraft Theory Estimating the return to two-year colleges Economic Return to credit hours or sheepskin effects

More information

ECON Introductory Econometrics Seminar 2, 2015

ECON Introductory Econometrics Seminar 2, 2015 ECON4150 - Introductory Econometrics Seminar 2, 2015 Stock and Watson EE4.1, EE5.2 Stock and Watson EE4.1, EE5.2 ECON4150 - Introductory Econometrics Seminar 2, 2015 1 / 14 Seminar 2 Author: Andrea University

More information

Cameron ECON 132 (Health Economics): FIRST MIDTERM EXAM (A) Fall 17

Cameron ECON 132 (Health Economics): FIRST MIDTERM EXAM (A) Fall 17 Cameron ECON 132 (Health Economics): FIRST MIDTERM EXAM (A) Fall 17 Answer all questions in the space provided on the exam. Total of 36 points (and worth 22.5% of final grade). Read each question carefully,

More information

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998 Economics 312 Sample Project Report Jeffrey Parker Introduction This project is based on Exercise 2.12 on page 81 of the Hill, Griffiths, and Lim text. It examines how the sale price of houses in Stockton,

More information

Quantitative Techniques Term 2

Quantitative Techniques Term 2 Quantitative Techniques Term 2 Laboratory 7 2 March 2006 Overview The objective of this lab is to: Estimate a cost function for a panel of firms; Calculate returns to scale; Introduce the command cluster

More information

Time series data: Part 2

Time series data: Part 2 Plot of Epsilon over Time -- Case 1 1 Time series data: Part Epsilon - 1 - - - -1 1 51 7 11 1 151 17 Time period Plot of Epsilon over Time -- Case Plot of Epsilon over Time -- Case 3 1 3 1 Epsilon - Epsilon

More information

Assignment #5 Solutions: Chapter 14 Q1.

Assignment #5 Solutions: Chapter 14 Q1. Assignment #5 Solutions: Chapter 14 Q1. a. R 2 is.037 and the adjusted R 2 is.033. The adjusted R 2 value becomes particularly important when there are many independent variables in a multiple regression

More information

Problem Set 9 Heteroskedasticty Answers

Problem Set 9 Heteroskedasticty Answers Problem Set 9 Heteroskedasticty Answers /* INVESTIGATION OF HETEROSKEDASTICITY */ First graph data. u hetdat2. gra manuf gdp, s([country].) xlab ylab 300000 manufacturing output (US$ miilio 200000 100000

More information

Chapter 11 Part 6. Correlation Continued. LOWESS Regression

Chapter 11 Part 6. Correlation Continued. LOWESS Regression Chapter 11 Part 6 Correlation Continued LOWESS Regression February 17, 2009 Goal: To review the properties of the correlation coefficient. To introduce you to the various tools that can be used to decide

More information

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt.

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt. Categorical Outcomes Statistical Modelling in Stata: Categorical Outcomes Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Nominal Ordinal 28/11/2017 R by C Table: Example Categorical,

More information

Example 7.1: Hourly Wage Equation Average wage for women

Example 7.1: Hourly Wage Equation Average wage for women 1 Stata Textbook Examples Introductory Econometrics: A Modern Approach by Jeffrey M. Wooldridge (1st & 2nd eds.) Chapter 7 - Multiple Regression Analysis with Qualitative Information: Binary (or Dummy)

More information

STATA Program for OLS cps87_or.do

STATA Program for OLS cps87_or.do STATA Program for OLS cps87_or.do * the data for this project is a small subsample; * of full time (30 or more hours) male workers; * aged 21-64 from the out going rotation; * samples of the 1987 current

More information

Visualisierung von Nicht-Linearität bzw. Heteroskedastizität

Visualisierung von Nicht-Linearität bzw. Heteroskedastizität Visualisierung von Nicht-Linearität bzw. Heteroskedastizität. use..\wooldridge\stata\wage2, clear. scatter wage IQ Kommentar: Folie 38. graph copy a3, replace. summ IQ Variable Obs Mean Std. Dev. Min Max

More information

Econometrics is. The estimation of relationships suggested by economic theory

Econometrics is. The estimation of relationships suggested by economic theory Econometrics is Econometrics is The estimation of relationships suggested by economic theory Econometrics is The estimation of relationships suggested by economic theory The application of mathematical

More information

Two-stage least squares examples. Angrist: Vietnam Draft Lottery Men, Cohorts. Vietnam era service

Two-stage least squares examples. Angrist: Vietnam Draft Lottery Men, Cohorts. Vietnam era service Two-stage least squares examples Angrist: Vietnam Draft Lottery 1 2 Vietnam era service 1980 Men, 1940-1952 Cohorts Defined as 1964-1975 Estimated 8.7 million served during era 3.4 million were in SE Asia

More information

Example 8.1: Log Wage Equation with Heteroscedasticity-Robust Standard Errors

Example 8.1: Log Wage Equation with Heteroscedasticity-Robust Standard Errors 1 Stata Textbook Examples Introductory Econometrics: A Modern Approach by Jeffrey M. Wooldridge (1st & 2nd eds.) Chapter 8 - Heteroskedasticity Example 8.1: Log Wage Equation with Heteroscedasticity-Robust

More information

Logistic Regression Analysis

Logistic Regression Analysis Revised July 2018 Logistic Regression Analysis This set of notes shows how to use Stata to estimate a logistic regression equation. It assumes that you have set Stata up on your computer (see the Getting

More information

The SAS System 11:03 Monday, November 11,

The SAS System 11:03 Monday, November 11, The SAS System 11:3 Monday, November 11, 213 1 The CONTENTS Procedure Data Set Name BIO.AUTO_PREMIUMS Observations 5 Member Type DATA Variables 3 Engine V9 Indexes Created Monday, November 11, 213 11:4:19

More information

Module 9: Single-level and Multilevel Models for Ordinal Responses. Stata Practical 1

Module 9: Single-level and Multilevel Models for Ordinal Responses. Stata Practical 1 Module 9: Single-level and Multilevel Models for Ordinal Responses Pre-requisites Modules 5, 6 and 7 Stata Practical 1 George Leckie, Tim Morris & Fiona Steele Centre for Multilevel Modelling If you find

More information

Longitudinal Logistic Regression: Breastfeeding of Nepalese Children

Longitudinal Logistic Regression: Breastfeeding of Nepalese Children Longitudinal Logistic Regression: Breastfeeding of Nepalese Children Scientific Question Determine whether the breastfeeding of Nepalese children varies with child age and/or sex of child. Data: Nepal

More information

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations

Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Percentiles, STATA, Box Plots, Standardizing, and Other Transformations Lecture 3 Reading: Sections 5.7 54 Remember, when you finish a chapter make sure not to miss the last couple of boxes: What Can Go

More information

GGraph. Males Only. Premium. Experience. GGraph. Gender. 1 0: R 2 Linear = : R 2 Linear = Page 1

GGraph. Males Only. Premium. Experience. GGraph. Gender. 1 0: R 2 Linear = : R 2 Linear = Page 1 GGraph 9 Gender : R Linear =.43 : R Linear =.769 8 7 6 5 4 3 5 5 Males Only GGraph Page R Linear =.43 R Loess 9 8 7 6 5 4 5 5 Explore Case Processing Summary Cases Valid Missing Total N Percent N Percent

More information

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods 1 SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 Lecture 10: Multinomial regression baseline category extension of binary What if we have multiple possible

More information

1) The Effect of Recent Tax Changes on Taxable Income

1) The Effect of Recent Tax Changes on Taxable Income 1) The Effect of Recent Tax Changes on Taxable Income In the most recent issue of the Journal of Policy Analysis and Management, Bradley Heim published a paper called The Effect of Recent Tax Changes on

More information

Question 1a 1b 1c 1d 1e 1f 2a 2b 2c 2d 3a 3b 3c 3d M ult:choice Points

Question 1a 1b 1c 1d 1e 1f 2a 2b 2c 2d 3a 3b 3c 3d M ult:choice Points Economics 102: Analysis of Economic Data Cameron Spring 2015 April 23 Department of Economics, U.C.-Davis First Midterm Exam (Version A) Compulsory. Closed book. Total of 30 points and worth 22.5% of course

More information

Stat 328, Summer 2005

Stat 328, Summer 2005 Stat 328, Summer 2005 Exam #2, 6/18/05 Name (print) UnivID I have neither given nor received any unauthorized aid in completing this exam. Signed Answer each question completely showing your work where

More information

The relationship between GDP, labor force and health expenditure in European countries

The relationship between GDP, labor force and health expenditure in European countries Econometrics-Term paper The relationship between GDP, labor force and health expenditure in European countries Student: Nguyen Thu Ha Contents 1. Background:... 2 2. Discussion:... 2 3. Regression equation

More information

Lecture 13: Identifying unusual observations In lecture 12, we learned how to investigate variables. Now we learn how to investigate cases.

Lecture 13: Identifying unusual observations In lecture 12, we learned how to investigate variables. Now we learn how to investigate cases. Lecture 13: Identifying unusual observations In lecture 12, we learned how to investigate variables. Now we learn how to investigate cases. Goal: Find unusual cases that might be mistakes, or that might

More information

Description Remarks and examples References Also see

Description Remarks and examples References Also see Title stata.com example 41g Two-level multinomial logistic regression (multilevel) Description Remarks and examples References Also see Description We demonstrate two-level multinomial logistic regression

More information

Effect of Education on Wage Earning

Effect of Education on Wage Earning Effect of Education on Wage Earning Group Members: Quentin Talley, Thomas Wang, Geoff Zaski Abstract The scope of this project includes individuals aged 18-65 who finished their education and do not have

More information

Impact of Household Income on Poverty Levels

Impact of Household Income on Poverty Levels Impact of Household Income on Poverty Levels ECON 3161 Econometrics, Fall 2015 Prof. Shatakshee Dhongde Group 8 Annie Strothmann Anne Marsh Samuel Brown Abstract: The relationship between poverty and household

More information

Don t worry one bit about multicollinearity, because at the end of the day, you're going to be working with a favorite coefficient model.

Don t worry one bit about multicollinearity, because at the end of the day, you're going to be working with a favorite coefficient model. In theory, you might think that dummy variables would facilitate a simple and compelling test for bias or discrimination. For example, suppose you wanted to test for gender bias in pay. It's really very

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

. ********** OUTPUT FILE: CARD & KRUEGER (1994)***********.. * STATA 10.0 CODE. * copyright C 2008 by Tito Boeri & Jan van Ours. * "THE ECONOMICS OF

. ********** OUTPUT FILE: CARD & KRUEGER (1994)***********.. * STATA 10.0 CODE. * copyright C 2008 by Tito Boeri & Jan van Ours. * THE ECONOMICS OF ********** OUTPUT FILE: CARD & KRUEGER (1994)*********** * STATA 100 CODE * copyright C 2008 by Tito Boeri & Jan van Ours * "THE ECONOMICS OF IMPERFECT LABOR MARKETS" * by Tito Boeri & Jan van Ours (2008)

More information

The impact of cigarette excise taxes on beer consumption

The impact of cigarette excise taxes on beer consumption The impact of cigarette excise taxes on beer consumption Jeremy Cluchey Frank DiSilvestro PPS 313 18 April 2008 ABSTRACT This study attempts to determine what if any impact a state s decision to increase

More information

Technical Documentation for Household Demographics Projection

Technical Documentation for Household Demographics Projection Technical Documentation for Household Demographics Projection REMI Household Forecast is a tool to complement the PI+ demographic model by providing comprehensive forecasts of a variety of household characteristics.

More information

REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING

REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING International Civil Aviation Organization 27/8/10 WORKING PAPER REGIONAL WORKSHOP ON TRAFFIC FORECASTING AND ECONOMIC PLANNING Cairo 2 to 4 November 2010 Agenda Item 3 a): Forecasting Methodology (Presented

More information

Advanced Industrial Organization I Identi cation of Demand Functions

Advanced Industrial Organization I Identi cation of Demand Functions Advanced Industrial Organization I Identi cation of Demand Functions Måns Söderbom, University of Gothenburg January 25, 2011 1 1 Introduction This is primarily an empirical lecture in which I will discuss

More information

Notice that X2 and Y2 are skewed. Taking the SQRT of Y2 reduces the skewness greatly.

Notice that X2 and Y2 are skewed. Taking the SQRT of Y2 reduces the skewness greatly. Notice that X2 and Y2 are skewed. Taking the SQRT of Y2 reduces the skewness greatly. The MEANS Procedure Variable Mean Std Dev Minimum Maximum Skewness ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ

More information

Modeling wages of females in the UK

Modeling wages of females in the UK International Journal of Business and Social Science Vol. 2 No. 11 [Special Issue - June 2011] Modeling wages of females in the UK Saadia Irfan NUST Business School National University of Sciences and

More information

Summary of Statistical Analysis Tools EDAD 5630

Summary of Statistical Analysis Tools EDAD 5630 Summary of Statistical Analysis Tools EDAD 5630 Test Name Program Used Purpose Steps Main Uses/Applications in Schools Principal Component Analysis SPSS Measure Underlying Constructs Reliability SPSS Measure

More information

Two-Sample T-Test for Non-Inferiority

Two-Sample T-Test for Non-Inferiority Chapter 198 Two-Sample T-Test for Non-Inferiority Introduction This procedure provides reports for making inference about the non-inferiority of a treatment mean compared to a control mean from data taken

More information

Chapter 6 Part 6. Confidence Intervals chi square distribution binomial distribution

Chapter 6 Part 6. Confidence Intervals chi square distribution binomial distribution Chapter 6 Part 6 Confidence Intervals chi square distribution binomial distribution October 8, 008 Brief review of what we covered last time. In order to get a confidence interval for the population mean

More information

Testing the Solow Growth Theory

Testing the Solow Growth Theory Testing the Solow Growth Theory Dilip Mookherjee Ec320 Lecture 5, Boston University Sept 16, 2014 DM (BU) 320 Lect 5 Sept 16, 2014 1 / 1 EMPIRICAL PREDICTIONS OF SOLOW MODEL WITH TECHNICAL PROGRESS 1.

More information

Question scores. Question 1a 1b 1c 1d 1e 2a 2b 2c 2d 2e 2f 3a 3b 3c 3d M ult:choice Points

Question scores. Question 1a 1b 1c 1d 1e 2a 2b 2c 2d 2e 2f 3a 3b 3c 3d M ult:choice Points Economics 02: Analysis of Economic Data Cameron Winter 204 January 30 Department of Economics, U.C.-Davis First Midterm Exam (Version A) Compulsory. Closed book. Total of 30 points and worth 22.5% of course

More information

(ii) Give the name of the California website used to find the various insurance plans offered under the Affordable care Act (Obamacare).

(ii) Give the name of the California website used to find the various insurance plans offered under the Affordable care Act (Obamacare). Cameron ECON 132 (Health Economics): FIRST MIDTERM EXAM (A) Winter 18 Answer all questions in the space provided on the exam. Total of 36 points (and worth 22.5% of final grade). Read each question carefully,

More information

NCSS Statistical Software. Reference Intervals

NCSS Statistical Software. Reference Intervals Chapter 586 Introduction A reference interval contains the middle 95% of measurements of a substance from a healthy population. It is a type of prediction interval. This procedure calculates one-, and

More information

Two-Sample T-Test for Superiority by a Margin

Two-Sample T-Test for Superiority by a Margin Chapter 219 Two-Sample T-Test for Superiority by a Margin Introduction This procedure provides reports for making inference about the superiority of a treatment mean compared to a control mean from data

More information

. tsset year, yearly time variable: year, 1959 to 1994 delta: 1 year. . reg lhous ldpi lrealp

. tsset year, yearly time variable: year, 1959 to 1994 delta: 1 year. . reg lhous ldpi lrealp - opened on: 24 Mar 2012, 21:29:52. use "G:\stata\( )\demand_2011.dta", clear. de - variable name type format label variable label - year float %ty year time float %9.0g trend pop float %9.0g population,

More information

SAS Simple Linear Regression Example

SAS Simple Linear Regression Example SAS Simple Linear Regression Example This handout gives examples of how to use SAS to generate a simple linear regression plot, check the correlation between two variables, fit a simple linear regression

More information

Does Globalization Improve Quality of Life?

Does Globalization Improve Quality of Life? University of Tennessee, Knoxville Trace: Tennessee Research and Creative Exchange University of Tennessee Honors Thesis Projects University of Tennessee Honors Program 5-2017 Does Globalization Improve

More information

Determinants of FII Inflows:India

Determinants of FII Inflows:India MPRA Munich Personal RePEc Archive Determinants of FII Inflows:India Ravi Saraogi February 2008 Online at https://mpra.ub.uni-muenchen.de/22850/ MPRA Paper No. 22850, posted 22. May 2010 23:04 UTC Determinants

More information

EC327: Limited Dependent Variables and Sample Selection Binomial probit: probit

EC327: Limited Dependent Variables and Sample Selection Binomial probit: probit EC327: Limited Dependent Variables and Sample Selection Binomial probit: probit. summarize work age married children education Variable Obs Mean Std. Dev. Min Max work 2000.6715.4697852 0 1 age 2000 36.208

More information

Model fit assessment via marginal model plots

Model fit assessment via marginal model plots The Stata Journal (2010) 10, Number 2, pp. 215 225 Model fit assessment via marginal model plots Charles Lindsey Texas A & M University Department of Statistics College Station, TX lindseyc@stat.tamu.edu

More information

Point-Biserial and Biserial Correlations

Point-Biserial and Biserial Correlations Chapter 302 Point-Biserial and Biserial Correlations Introduction This procedure calculates estimates, confidence intervals, and hypothesis tests for both the point-biserial and the biserial correlations.

More information

Descriptive Analysis

Descriptive Analysis Descriptive Analysis HERTANTO WAHYU SUBAGIO Univariate Analysis Univariate analysis involves the examination across cases of one variable at a time. There are three major characteristics of a single variable

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

Appendixes Appendix 1 Data of Dependent Variables and Independent Variables Period

Appendixes Appendix 1 Data of Dependent Variables and Independent Variables Period Appendixes Appendix 1 Data of Dependent Variables and Independent Variables Period 1-15 1 ROA INF KURS FG January 1,3,7 9 -,19 February 1,79,5 95 3,1 March 1,3,7 91,95 April 1,79,1 919,71 May 1,99,7 955

More information

Relation between Income Inequality and Economic Growth

Relation between Income Inequality and Economic Growth Relation between Income Inequality and Economic Growth Ibrahim Alsaffar, Robert Eisenhardt, Hanjin Kim Georgia Institute of Technology ECON 3161: Econometric Analysis Dr. Shatakshee Dhongde Fall 2018 Abstract

More information

R & R Study. Chapter 254. Introduction. Data Structure

R & R Study. Chapter 254. Introduction. Data Structure Chapter 54 Introduction A repeatability and reproducibility (R & R) study (sometimes called a gauge study) is conducted to determine if a particular measurement procedure is adequate. If the measurement

More information

Basic Procedure for Histograms

Basic Procedure for Histograms Basic Procedure for Histograms 1. Compute the range of observations (min. & max. value) 2. Choose an initial # of classes (most likely based on the range of values, try and find a number of classes that

More information

Sociology Exam 3 Answer Key - DRAFT May 8, 2007

Sociology Exam 3 Answer Key - DRAFT May 8, 2007 Sociology 63993 Exam 3 Answer Key - DRAFT May 8, 2007 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. The odds of an event occurring

More information

ECO220Y, Term Test #2

ECO220Y, Term Test #2 ECO220Y, Term Test #2 December 4, 2015, 9:10 11:00 am U of T e-mail: @mail.utoronto.ca Surname (last name): Given name (first name): UTORID: (e.g. lihao8) Instructions: You have 110 minutes. Keep these

More information

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 10, 2017

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 10, 2017 Maximum Likelihood Estimation Richard Williams, University of otre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 0, 207 [This handout draws very heavily from Regression Models for Categorical

More information

Data screening, transformations: MRC05

Data screening, transformations: MRC05 Dale Berger Data screening, transformations: MRC05 This is a demonstration of data screening and transformations for a regression analysis. Our interest is in predicting current salary from education level

More information

Lecture 07: Measures of central tendency

Lecture 07: Measures of central tendency Lecture 07: Measures of central tendency Ernesto F. L. Amaral September 21, 2017 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015. Statistics: A Tool for Social Research. Stamford:

More information

West Coast Stata Users Group Meeting, October 25, 2007

West Coast Stata Users Group Meeting, October 25, 2007 Estimating Heterogeneous Choice Models with Stata Richard Williams, Notre Dame Sociology, rwilliam@nd.edu oglm support page: http://www.nd.edu/~rwilliam/oglm/index.html West Coast Stata Users Group Meeting,

More information

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 13, 2018

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 13, 2018 Maximum Likelihood Estimation Richard Williams, University of otre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 3, 208 [This handout draws very heavily from Regression Models for Categorical

More information

ECON Introductory Econometrics. Lecture 1: Introduction and Review of Statistics

ECON Introductory Econometrics. Lecture 1: Introduction and Review of Statistics ECON4150 - Introductory Econometrics Lecture 1: Introduction and Review of Statistics Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 1-2 Lecture outline 2 What is econometrics? Course

More information

1 Inferential Statistic

1 Inferential Statistic 1 Inferential Statistic Population versus Sample, parameter versus statistic A population is the set of all individuals the researcher intends to learn about. A sample is a subset of the population and

More information

SFSU FIN822 Project 1

SFSU FIN822 Project 1 SFSU FIN822 Project 1 This project can be done in a team of up to 3 people. Your project report must be accompanied by printouts of programming outputs. You could use any software to solve the problems.

More information

Catherine De Vries, Spyros Kosmidis & Andreas Murr

Catherine De Vries, Spyros Kosmidis & Andreas Murr APPLIED STATISTICS FOR POLITICAL SCIENTISTS WEEK 8: DEPENDENT CATEGORICAL VARIABLES II Catherine De Vries, Spyros Kosmidis & Andreas Murr Topic: Logistic regression. Predicted probabilities. STATA commands

More information

1.2 Describing Distributions with Numbers, Continued

1.2 Describing Distributions with Numbers, Continued 1.2 Describing Distributions with Numbers, Continued Ulrich Hoensch Thursday, September 6, 2012 Interquartile Range and 1.5 IQR Rule for Outliers The interquartile range IQR is the distance between the

More information

IPUMS Int.l Extraction and Analysis

IPUMS Int.l Extraction and Analysis Minnesota Population Center Training and Development IPUMS Int.l Extraction and Analysis Exercise 1 OBJECTIVE: Gain an understanding of how the IPUMS dataset is structured and how it can be leveraged to

More information

LAMPIRAN PERHITUNGAN EVIEWS

LAMPIRAN PERHITUNGAN EVIEWS LAMPIRAN PERHITUNGAN EVIEWS DESCRIPTIVE PK PDRB TP TKM Mean 12.22450 10.16048 14.02443 12.63677 Median 12.41945 10.09179 14.22736 12.61400 Maximum 13.53955 12.73508 15.62581 13.16721 Minimum 10.34509 8.579417

More information

Loss Simulation Model Testing and Enhancement

Loss Simulation Model Testing and Enhancement Loss Simulation Model Testing and Enhancement Casualty Loss Reserve Seminar By Kailan Shang Sept. 2011 Agenda Research Overview Model Testing Real Data Model Enhancement Further Development Enterprise

More information

The FREQ Procedure. Table of Sex by Gym Sex(Sex) Gym(Gym) No Yes Total Male Female Total

The FREQ Procedure. Table of Sex by Gym Sex(Sex) Gym(Gym) No Yes Total Male Female Total Jenn Selensky gathered data from students in an introduction to psychology course. The data are weights, sex/gender, and whether or not the student worked-out in the gym. Here is the output from a 2 x

More information

Limited Dependent Variables

Limited Dependent Variables Limited Dependent Variables Christopher F Baum Boston College and DIW Berlin Birmingham Business School, March 2013 Christopher F Baum (BC / DIW) Limited Dependent Variables BBS 2013 1 / 47 Limited dependent

More information

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta) Getting Started in Logit and Ordered Logit Regression (ver. 3. beta Oscar Torres-Reyna Data Consultant otorres@princeton.edu http://dss.princeton.edu/training/ Logit model Use logit models whenever your

More information