Catherine De Vries, Spyros Kosmidis & Andreas Murr

Size: px
Start display at page:

Download "Catherine De Vries, Spyros Kosmidis & Andreas Murr"

Transcription

1 APPLIED STATISTICS FOR POLITICAL SCIENTISTS WEEK 8: DEPENDENT CATEGORICAL VARIABLES II Catherine De Vries, Spyros Kosmidis & Andreas Murr Topic: Logistic regression. Predicted probabilities. STATA commands and features: logit, logistic, char, test, margins Data set: bes00.dta, taken from the British Election Study 00. Readings: Alan Agresti and Barbara Finlay (997). Statistical Methods for the Social Sciences, 3 rd ed. Upper Saddle River, NJ: Prentice Hall. [CHAPTER 5] N.B.: Please note that Assignment 3 is due on the nd of December at 6 pm.. INTRODUCTION This week, we will continue using the British Election Survey from 00 to investigate social structural and attitudinal causes of vote choice. As we will be using a binary measure of vote choice (Conservative vs. non-conservative), we use logistic regression, which is a form of regression used for dealing with 0/ variables. The variable vote is a categorical variable with several categories. Quite often it is possible to focus attention on just one binary distinction within the categorical variable. Like last week we will choose the decision to vote Conservative or not. Compute a dummy variable for Conservative voters with:. recode bq_ (- -=.) (= "Conservative voter") ( 3/9=0 "Non Conservative voter"), gen(con). label variable con "Conservative vote dummy variable". tabulate bq_ con Note that anyone who did not give a party choice is treated as missing. For clarity it will help us to also recode the variable aq66_ (whether the respondent is a trade union member) as a binary variable, indicating trade membership.. recode aq66_ (-=.) ( =) (3=0), gen(union). label variable union "Union member". tabulate aq66_ union Finally, recode the variable aq67 into the variable educ the following way:. recode aq67 (- -=.) (6 7=5), gen(educ). lab var educ "Education". tab aq67 educ Week 8 Page of 7

2 . CATEGORICAL INDEPENDENT VARIABLES As with linear regression, categorical variables can be added to the model with the xi command.. xi: logit con union i.educ STATA creates a reference category with the first value of educ (left school at the age of 5 years or younger). Although changing the reference category for a dependent variable does not change the model, the choice of the reference category can make a big difference to how clearly you can see what is going on. We can control which category STATA uses as the reference with the command,. char educ[omit] 5. xi: logit con union i.educ Now those who left education at 9 years or older are treated as the reference category. Compare the two models; the statistical significance and size of the education coefficients change from one model to another. This is because the coefficients represent a comparison between categories, so that changing the reference category changes the point of comparison. Given that we are comparing categories, what we are actually interested in is whether any one category is different from any other category. For example, are those with the lowest education different from those in the middle category of education in terms of their voting behaviour? One way of checking this is to set the lowest education class again as the reference category (i.e. char educ[omit] ), but then it is of course difficult to tell whether, say, those in the middle differ from those with the highest education. The size of the coefficients for the lowest and middle categories, but are they statistically significantly different? We can use the test command to check whether two or more coefficients are equal to each other. To test whether the lowest and middle category of education differ, simply type:. test _Ieduc_ = _Ieduc_3 chi( ) = 6.0 Prob > chi = 0.04 We are testing here whether we can distinguish statistically between the size of the two coefficients. Our null hypothesis here is that the coefficients are the same. Given that the p-value is 0.0, we can reject this hypothesis, and therefore conclude that the coefficients are different. EXERCISE Now run a model with union membership, education and interest in the election (aq) predicting Conservative vote. Remember to account for missing values. Oddly, election interest is coded as = very interested, = somewhat interested, 3 = not very interested, and 4 = not at all interested. Generate a new variable interest, where increasing values signify more interest. Week 8 Page of 7

3 Test whether all coefficients of election interest, except that for category (i.e. those which are not at all interested), could be equal to zero. How could me make the model more parsimonious by recoding election interest? 3. MORE STRAIGHTFORWARD INTERPRETATIONS Producing tables of logistic regression coefficients is standard practice but they basically mean nothing to the general reader. Instead of talking in terms of statistically significant coefficients we should be talking about real quantities that people understand. So instead of saying that the odds of voting Conservative are x times higher for trade unionists than for non-trade unionists after controlling for such and such, we would rather say the probability of voting Conservative is y amount greater for trade unionists than other voters after controlling for such and such. Unfortunately this isn't easy, because the difference in predicted probabilities for union members and others depend on the values of all the other variables in the model. There are two packages that have been written for STATA to work out predicted probabilities, which we describe later on. In STATA and more recent versions, this can easily be done with margins. One can also produce these probabilities manually which, at least at first, helps one to understand what s going on. So first run the logistic regression that you are interested in (remembering to account for missing data). Let us take a hypothetical example of a model that predicts Conservative vote choice (relative to all other parties) using union membership and age.. logit con union age Second, examine the table of coefficients. So for this example the coefficients are 0.44 for union membership and 0.04 for age. Both are statistically significant. The constant is.3. As we don t know what these coefficients mean, we want to calculate them in the terms of probabilities. To do this we can calculate the predicted probability for any hypothetical observation using the formula below: π = e + e a+ b X + b X a+ b X + b X where in our case X is union membership and X is age, and b is the coefficient for union membership, b is the coefficient for age and a is the constant. An easy way of calculating these probabilities is to open a new STATA window, and then generate a new variable using this formula to represent predicted probabilities for a range of the independent variable that you are interested in. To do this, enter some Week 8 Page 3 of 7

4 values for the variable, in this case let s say age, over a plausible range in the empty Data Editor, again in this case let s say 8 up to 90:. set obs 90. gen age=_n. drop if age<8 Then generate a new variable that corresponds to the formula above; i.e. for our example the formula for non-trade unionists would be: e age π = + e age In the newly opened version of STATA, we will call this new variable prob and generate it as follows:. generate prob = exp( *age)/( + (exp( *age))) We now have predicted probabilities for each value of age and can plot prob against prob to look at how the predicted probability of voting Conservative changes by age. The same logic applies for a bigger model, again we need to pick values for the other independent variables (just as we picked non-trade unionists for the predicted probabilities above). Generally for interval level variables use the mean, for categorical use the mode. For example if our model included income and education (5 categories), then we would include the mean of income multiplied by its coefficient, and the mode category for education (i.e. the fifth category in this case). For interaction terms, we need to include the variable we are interested in, both the main effect and the interaction term. If we interacted age and union membership then we would need to generate two separate sets of probabilities one for trade unionists and one for non-trade unionists: For non trade unionists. generate nonunionprob = exp(constant + b union *0 + b age *age + b interaction *0*age)/( + (exp(constant + b union *0 + b age *age + b interaction *0*age))) = exp(constant + b age *age)/( + (exp(constant + b age *age))) For trade unionists. generate unionprob = exp(constant + b union * + b age *age + b interaction **age)/( + (exp(constant + b union * + b age *age + b interaction **age))) = exp(constant + b union + b age *age + b interaction *age)/( + (exp(constant + b union + b age *age + b interaction *age))) Obviously we can also calculate predicted probabilities for categorical variables, just pick the categories that you wish to compare and enter the numbers into the formula (sometimes this may be easiest with a calculator and pencil and paper if the model is small). Week 8 Page 4 of 7

5 EXERCISE Run a model with education, trade union membership, and self-placements on two eleven-point scales about attitudes towards taxation (aq35_) and crime (aq38_). Recode the self-placement variables such that higher values refer to cutting taxes and reducing crime, respectively. Open a new STATA window and plot the effect of attitudes towards cutting taxes on vote choice. Don t forget that as measured here you can t have a mean level of trade union membership, education, and attitudes towards crime, so you need to pick a particular type of respondent and plot the impact of libertarian-authoritarian values for them. 4. MARGINS Although it is important to know how to calculate predicted probabilities manually, margins in STATA will do this for you.. logit con i.union age Holding age at its mean, margins union will calculate the predicted probabilities of the Conservative vote separately for union members and non-members, and the associated confidence intervals, holding age at its mean:. margins union, atmeans Delta-method Margin Std. Err. z P> z [95% Conf. Interval] union To obtain the predicted probabilities for each category of age for union members:. margins, at(age=(8()97) union=). marginsplot Week 8 Page 5 of 7

6 To obtain the predicted probabilities for each category of age for non-members:. margins, at(age=(8()97) union=0) Unlike in OLS regression, the marginal effect of each predictor depends on the values of all other predictors. To obtain the marginal effect of union membership on the predicted probability of voting Conservative at the mean of age:. margins, dydx(union) atmeans Delta-method dy/dx Std. Err. z P> z [95% Conf. Interval] union Note: dy/dx for factor levels is the discrete change from the base level. Since union is a binary variable, the marginal effect is the effect of a discrete change from 0 to. It is simply the difference in the predicted probabilities of the Conservative vote between trade unionists and non-trade unionists. To obtain the marginal effect of age at the two values of union:. margins, dydx(age) at(union=(0 )) DELTA-METHOD DY/DX STD. ERR. Z P> Z [95% CONF. INTERVAL] AGE _AT The margins command is particularly useful when interactions or non-linear terms are present. Week 8 Page 6 of 7

7 EXERCISE 3 Please estimate a binary logit model where voting for the Conservatives is a function of income (aq70), attitudes towards cutting taxes, and their interaction. Create a plot to show how the marginal effect of attitudes towards cutting taxes depends on income. Interpret the result. Week 8 Page 7 of 7

Logistic Regression Analysis

Logistic Regression Analysis Revised July 2018 Logistic Regression Analysis This set of notes shows how to use Stata to estimate a logistic regression equation. It assumes that you have set Stata up on your computer (see the Getting

More information

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods 1 SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 Lecture 10: Multinomial regression baseline category extension of binary What if we have multiple possible

More information

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta) Getting Started in Logit and Ordered Logit Regression (ver. 3. beta Oscar Torres-Reyna Data Consultant otorres@princeton.edu http://dss.princeton.edu/training/ Logit model Use logit models whenever your

More information

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 13, 2018

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 13, 2018 Maximum Likelihood Estimation Richard Williams, University of otre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 3, 208 [This handout draws very heavily from Regression Models for Categorical

More information

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta) Getting Started in Logit and Ordered Logit Regression (ver. 3. beta Oscar Torres-Reyna Data Consultant otorres@princeton.edu http://dss.princeton.edu/training/ Logit model Use logit models whenever your

More information

Final Exam - section 1. Thursday, December hours, 30 minutes

Final Exam - section 1. Thursday, December hours, 30 minutes Econometrics, ECON312 San Francisco State University Michael Bar Fall 2013 Final Exam - section 1 Thursday, December 19 1 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.

More information

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 10, 2017

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 10, 2017 Maximum Likelihood Estimation Richard Williams, University of otre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 0, 207 [This handout draws very heavily from Regression Models for Categorical

More information

Multinomial Logit Models - Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 13, 2017

Multinomial Logit Models - Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 13, 2017 Multinomial Logit Models - Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 13, 2017 This is adapted heavily from Menard s Applied Logistic Regression

More information

[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA]

[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA] Tutorial #3 This example uses data in the file 16.09.2011.dta under Tutorial folder. It contains 753 observations from a sample PSID data on the labor force status of married women in the U.S in 1975.

More information

Module 4 Bivariate Regressions

Module 4 Bivariate Regressions AGRODEP Stata Training April 2013 Module 4 Bivariate Regressions Manuel Barron 1 and Pia Basurto 2 1 University of California, Berkeley, Department of Agricultural and Resource Economics 2 University of

More information

Sociology Exam 3 Answer Key - DRAFT May 8, 2007

Sociology Exam 3 Answer Key - DRAFT May 8, 2007 Sociology 63993 Exam 3 Answer Key - DRAFT May 8, 2007 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. The odds of an event occurring

More information

tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6}

tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6} PS 4 Monday August 16 01:00:42 2010 Page 1 tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6} log: C:\web\PS4log.smcl log type: smcl opened on:

More information

Quantitative Techniques Term 2

Quantitative Techniques Term 2 Quantitative Techniques Term 2 Laboratory 7 2 March 2006 Overview The objective of this lab is to: Estimate a cost function for a panel of firms; Calculate returns to scale; Introduce the command cluster

More information

Allison notes there are two conditions for using fixed effects methods.

Allison notes there are two conditions for using fixed effects methods. Panel Data 3: Conditional Logit/ Fixed Effects Logit Models Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised April 2, 2017 These notes borrow very heavily, sometimes

More information

Module 9: Single-level and Multilevel Models for Ordinal Responses. Stata Practical 1

Module 9: Single-level and Multilevel Models for Ordinal Responses. Stata Practical 1 Module 9: Single-level and Multilevel Models for Ordinal Responses Pre-requisites Modules 5, 6 and 7 Stata Practical 1 George Leckie, Tim Morris & Fiona Steele Centre for Multilevel Modelling If you find

More information

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt.

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt. Categorical Outcomes Statistical Modelling in Stata: Categorical Outcomes Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Nominal Ordinal 28/11/2017 R by C Table: Example Categorical,

More information

Lecture 10: Alternatives to OLS with limited dependent variables, part 1. PEA vs APE Logit/Probit

Lecture 10: Alternatives to OLS with limited dependent variables, part 1. PEA vs APE Logit/Probit Lecture 10: Alternatives to OLS with limited dependent variables, part 1 PEA vs APE Logit/Probit PEA vs APE PEA: partial effect at the average The effect of some x on y for a hypothetical case with sample

More information

Your Name (Please print) Did you agree to take the optional portion of the final exam Yes No. Directions

Your Name (Please print) Did you agree to take the optional portion of the final exam Yes No. Directions Your Name (Please print) Did you agree to take the optional portion of the final exam Yes No (Your online answer will be used to verify your response.) Directions There are two parts to the final exam.

More information

Logit Models for Binary Data

Logit Models for Binary Data Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis These models are appropriate when the response

More information

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998

The data definition file provided by the authors is reproduced below: Obs: 1500 home sales in Stockton, CA from Oct 1, 1996 to Nov 30, 1998 Economics 312 Sample Project Report Jeffrey Parker Introduction This project is based on Exercise 2.12 on page 81 of the Hill, Griffiths, and Lim text. It examines how the sale price of houses in Stockton,

More information

Handout seminar 6, ECON4150

Handout seminar 6, ECON4150 Handout seminar 6, ECON4150 Herman Kruse March 17, 2013 Introduction - list of commands This week, we need a couple of new commands in order to solve all the problems. hist var1 if var2, options - creates

More information

Analyzing the Determinants of Project Success: A Probit Regression Approach

Analyzing the Determinants of Project Success: A Probit Regression Approach 2016 Annual Evaluation Review, Linked Document D 1 Analyzing the Determinants of Project Success: A Probit Regression Approach 1. This regression analysis aims to ascertain the factors that determine development

More information

Time series data: Part 2

Time series data: Part 2 Plot of Epsilon over Time -- Case 1 1 Time series data: Part Epsilon - 1 - - - -1 1 51 7 11 1 151 17 Time period Plot of Epsilon over Time -- Case Plot of Epsilon over Time -- Case 3 1 3 1 Epsilon - Epsilon

More information

Longitudinal Logistic Regression: Breastfeeding of Nepalese Children

Longitudinal Logistic Regression: Breastfeeding of Nepalese Children Longitudinal Logistic Regression: Breastfeeding of Nepalese Children Scientific Question Determine whether the breastfeeding of Nepalese children varies with child age and/or sex of child. Data: Nepal

More information

List of figures. I General information 1

List of figures. I General information 1 List of figures Preface xix xxi I General information 1 1 Introduction 7 1.1 What is this book about?........................ 7 1.2 Which models are considered?...................... 8 1.3 Whom is this

More information

Ordinal Multinomial Logistic Regression. Thom M. Suhy Southern Methodist University May14th, 2013

Ordinal Multinomial Logistic Regression. Thom M. Suhy Southern Methodist University May14th, 2013 Ordinal Multinomial Logistic Thom M. Suhy Southern Methodist University May14th, 2013 GLM Generalized Linear Model (GLM) Framework for statistical analysis (Gelman and Hill, 2007, p. 135) Linear Continuous

More information

Supporting Information: Preferences for International Redistribution: The Divide over the Eurozone Bailouts

Supporting Information: Preferences for International Redistribution: The Divide over the Eurozone Bailouts Supporting Information: Preferences for International Redistribution: The Divide over the Eurozone Bailouts Michael M. Bechtel University of St.Gallen Jens Hainmueller Massachusetts Institute of Technology

More information

Morten Frydenberg Wednesday, 12 May 2004

Morten Frydenberg Wednesday, 12 May 2004 " $% " * +, " --. / ",, 2 ", $, % $ 4 %78 % / "92:8/- 788;?5"= "8= < < @ "A57 57 "χ 2 = -value=. 5 OR =, OR = = = + OR B " B Linear ang Logistic Regression: Note. = + OR 2 women - % β β = + woman

More information

West Coast Stata Users Group Meeting, October 25, 2007

West Coast Stata Users Group Meeting, October 25, 2007 Estimating Heterogeneous Choice Models with Stata Richard Williams, Notre Dame Sociology, rwilliam@nd.edu oglm support page: http://www.nd.edu/~rwilliam/oglm/index.html West Coast Stata Users Group Meeting,

More information

F^3: F tests, Functional Forms and Favorite Coefficient Models

F^3: F tests, Functional Forms and Favorite Coefficient Models F^3: F tests, Functional Forms and Favorite Coefficient Models Favorite coefficient model: otherteams use "nflpricedata Bdta", clear *Favorite coefficient model: otherteams reg rprice pop pop2 rpci wprcnt1

More information

Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Paul J. Hilliard, Educational Testing Service (ETS)

Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Paul J. Hilliard, Educational Testing Service (ETS) Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds INTRODUCTION Multicategory Logit

More information

WWS 508b Precept 10. John Palmer. April 27, 2010

WWS 508b Precept 10. John Palmer. April 27, 2010 WWS 508b Precept 10 John Palmer April 27, 2010 Example: married women s labor force participation The MROZ.dta data set has information on labor force participation and other characteristics of married

More information

Appendix B: Methodology and Finding of Statistical and Econometric Analysis of Enterprise Survey and Portfolio Data

Appendix B: Methodology and Finding of Statistical and Econometric Analysis of Enterprise Survey and Portfolio Data Appendix B: Methodology and Finding of Statistical and Econometric Analysis of Enterprise Survey and Portfolio Data Part 1: SME Constraints, Financial Access, and Employment Growth Evidence from World

More information

You created this PDF from an application that is not licensed to print to novapdf printer (http://www.novapdf.com)

You created this PDF from an application that is not licensed to print to novapdf printer (http://www.novapdf.com) Monday October 3 10:11:57 2011 Page 1 (R) / / / / / / / / / / / / Statistics/Data Analysis Education Box and save these files in a local folder. name:

More information

Introduction to POL 217

Introduction to POL 217 Introduction to POL 217 Brad Jones 1 1 Department of Political Science University of California, Davis January 9, 2007 Topics of Course Outline Models for Categorical Data. Topics of Course Models for

More information

Model fit assessment via marginal model plots

Model fit assessment via marginal model plots The Stata Journal (2010) 10, Number 2, pp. 215 225 Model fit assessment via marginal model plots Charles Lindsey Texas A & M University Department of Statistics College Station, TX lindseyc@stat.tamu.edu

More information

Assignment #5 Solutions: Chapter 14 Q1.

Assignment #5 Solutions: Chapter 14 Q1. Assignment #5 Solutions: Chapter 14 Q1. a. R 2 is.037 and the adjusted R 2 is.033. The adjusted R 2 value becomes particularly important when there are many independent variables in a multiple regression

More information

Econometrics is. The estimation of relationships suggested by economic theory

Econometrics is. The estimation of relationships suggested by economic theory Econometrics is Econometrics is The estimation of relationships suggested by economic theory Econometrics is The estimation of relationships suggested by economic theory The application of mathematical

More information

3. Multinomial response models

3. Multinomial response models 3. Multinomial response models 3.1 General model approaches Multinomial dependent variables in a microeconometric analysis: These qualitative variables have more than two possible mutually exclusive categories

More information

NPTEL Project. Econometric Modelling. Module 16: Qualitative Response Regression Modelling. Lecture 20: Qualitative Response Regression Modelling

NPTEL Project. Econometric Modelling. Module 16: Qualitative Response Regression Modelling. Lecture 20: Qualitative Response Regression Modelling 1 P age NPTEL Project Econometric Modelling Vinod Gupta School of Management Module 16: Qualitative Response Regression Modelling Lecture 20: Qualitative Response Regression Modelling Rudra P. Pradhan

More information

Description Remarks and examples References Also see

Description Remarks and examples References Also see Title stata.com example 41g Two-level multinomial logistic regression (multilevel) Description Remarks and examples References Also see Description We demonstrate two-level multinomial logistic regression

More information

Limited Dependent Variables

Limited Dependent Variables Limited Dependent Variables Christopher F Baum Boston College and DIW Berlin Birmingham Business School, March 2013 Christopher F Baum (BC / DIW) Limited Dependent Variables BBS 2013 1 / 47 Limited dependent

More information

Nonlinear Econometric Analysis (ECO 722) Answers to Homework 4

Nonlinear Econometric Analysis (ECO 722) Answers to Homework 4 Nonlinear Econometric Analysis (ECO 722) Answers to Homework 4 1 Greene and Hensher (1997) report estimates of a model of travel mode choice for travel between Sydney and Melbourne, Australia The dataset

More information

Why do the youth in Jamaica neither study nor work? Evidence from JSLC 2001

Why do the youth in Jamaica neither study nor work? Evidence from JSLC 2001 VERY PRELIMINARY, PLEASE DO NOT QUOTE Why do the youth in Jamaica neither study nor work? Evidence from JSLC 2001 Abstract Abbi Kedir 1 University of Leicester, UK E-mail: ak138@le.ac.uk and Michael Henry

More information

Lecture 21: Logit Models for Multinomial Responses Continued

Lecture 21: Logit Models for Multinomial Responses Continued Lecture 21: Logit Models for Multinomial Responses Continued Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University

More information

Sociology 704: Topics in Multivariate Statistics Instructor: Natasha Sarkisian. Binary Logit

Sociology 704: Topics in Multivariate Statistics Instructor: Natasha Sarkisian. Binary Logit Sociology 704: Topics in Multivariate Statistics Instructor: Natasha Sarkisian Binary Logit Binary models deal with binary (0/1, yes/no) dependent variables. OLS is inappropriate for this kind of dependent

More information

Econometric Methods for Valuation Analysis

Econometric Methods for Valuation Analysis Econometric Methods for Valuation Analysis Margarita Genius Dept of Economics M. Genius (Univ. of Crete) Econometric Methods for Valuation Analysis Cagliari, 2017 1 / 25 Outline We will consider econometric

More information

u panel_lecture . sum

u panel_lecture . sum u panel_lecture sum Variable Obs Mean Std Dev Min Max datastre 639 9039644 6369418 900228 926665 year 639 1980 2584012 1976 1984 total_sa 639 9377839 3212313 682 441e+07 tot_fixe 639 5214385 1988422 642

More information

Advanced Econometrics

Advanced Econometrics Advanced Econometrics Instructor: Takashi Yamano 11/14/2003 Due: 11/21/2003 Homework 5 (30 points) Sample Answers 1. (16 points) Read Example 13.4 and an AER paper by Meyer, Viscusi, and Durbin (1995).

More information

2SLS HATCO SPSS, STATA and SHAZAM. Example by Eddie Oczkowski. August 2001

2SLS HATCO SPSS, STATA and SHAZAM. Example by Eddie Oczkowski. August 2001 2SLS HATCO SPSS, STATA and SHAZAM Example by Eddie Oczkowski August 2001 This example illustrates how to use SPSS to estimate and evaluate a 2SLS latent variable model. The bulk of the example relates

More information

Problem Set 2. PPPA 6022 Due in class, on paper, March 5. Some overall instructions:

Problem Set 2. PPPA 6022 Due in class, on paper, March 5. Some overall instructions: Problem Set 2 PPPA 6022 Due in class, on paper, March 5 Some overall instructions: Please use a do-file (or its SAS or SPSS equivalent) for this work do not program interactively! I have provided Stata

More information

An Examination of the Impact of the Texas Methodist Foundation Clergy Development Program. on the United Methodist Church in Texas

An Examination of the Impact of the Texas Methodist Foundation Clergy Development Program. on the United Methodist Church in Texas An Examination of the Impact of the Texas Methodist Foundation Clergy Development Program on the United Methodist Church in Texas The Texas Methodist Foundation completed its first, two-year Clergy Development

More information

A generalized Hosmer Lemeshow goodness-of-fit test for multinomial logistic regression models

A generalized Hosmer Lemeshow goodness-of-fit test for multinomial logistic regression models The Stata Journal (2012) 12, Number 3, pp. 447 453 A generalized Hosmer Lemeshow goodness-of-fit test for multinomial logistic regression models Morten W. Fagerland Unit of Biostatistics and Epidemiology

More information

PASS Sample Size Software

PASS Sample Size Software Chapter 850 Introduction Cox proportional hazards regression models the relationship between the hazard function λ( t X ) time and k covariates using the following formula λ log λ ( t X ) ( t) 0 = β1 X1

More information

Table 4. Probit model of union membership. Probit coefficients are presented below. Data from March 2008 Current Population Survey.

Table 4. Probit model of union membership. Probit coefficients are presented below. Data from March 2008 Current Population Survey. 1. Using a probit model and data from the 2008 March Current Population Survey, I estimated a probit model of the determinants of pension coverage. Three specifications were estimated. The first included

More information

Point-Biserial and Biserial Correlations

Point-Biserial and Biserial Correlations Chapter 302 Point-Biserial and Biserial Correlations Introduction This procedure calculates estimates, confidence intervals, and hypothesis tests for both the point-biserial and the biserial correlations.

More information

Effect of Education on Wage Earning

Effect of Education on Wage Earning Effect of Education on Wage Earning Group Members: Quentin Talley, Thomas Wang, Geoff Zaski Abstract The scope of this project includes individuals aged 18-65 who finished their education and do not have

More information

Multiple regression - a brief introduction

Multiple regression - a brief introduction Multiple regression - a brief introduction Multiple regression is an extension to regular (simple) regression. Instead of one X, we now have several. Suppose, for example, that you are trying to predict

More information

Chapter 6: Supply and Demand with Income in the Form of Endowments

Chapter 6: Supply and Demand with Income in the Form of Endowments Chapter 6: Supply and Demand with Income in the Form of Endowments 6.1: Introduction This chapter and the next contain almost identical analyses concerning the supply and demand implied by different kinds

More information

Statistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron

Statistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron Statistical Models of Stocks and Bonds Zachary D Easterling: Department of Economics The University of Akron Abstract One of the key ideas in monetary economics is that the prices of investments tend to

More information

Logistics Regression & Industry Modeling

Logistics Regression & Industry Modeling Logistics Regression & Industry Modeling Framing Financial Problems as Probabilities Russ Koesterich, CFA Chief North American Strategist Logistics Regression & Probability So far as the laws of mathematics

More information

Problem Set 9 Heteroskedasticty Answers

Problem Set 9 Heteroskedasticty Answers Problem Set 9 Heteroskedasticty Answers /* INVESTIGATION OF HETEROSKEDASTICITY */ First graph data. u hetdat2. gra manuf gdp, s([country].) xlab ylab 300000 manufacturing output (US$ miilio 200000 100000

More information

Dummy variables 9/22/2015. Are wages different across union/nonunion jobs. Treatment Control Y X X i identifies treatment

Dummy variables 9/22/2015. Are wages different across union/nonunion jobs. Treatment Control Y X X i identifies treatment Dummy variables Treatment 22 1 1 Control 3 2 Y Y1 0 1 2 Y X X i identifies treatment 1 1 1 1 1 1 0 0 0 X i =1 if in treatment group X i =0 if in control H o : u n =u u Are wages different across union/nonunion

More information

ECON Introductory Econometrics. Seminar 4. Stock and Watson Chapter 8

ECON Introductory Econometrics. Seminar 4. Stock and Watson Chapter 8 ECON4150 - Introductory Econometrics Seminar 4 Stock and Watson Chapter 8 empirical exercise E8.2: Data 2 In this exercise we use the data set CPS12.dta Each month the Bureau of Labor Statistics in the

More information

Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X

Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X Chapter 156 Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X Introduction This procedure calculates the power and sample size necessary in a matched case-control study designed

More information

Introduction to fractional outcome regression models using the fracreg and betareg commands

Introduction to fractional outcome regression models using the fracreg and betareg commands Introduction to fractional outcome regression models using the fracreg and betareg commands Miguel Dorta Staff Statistician StataCorp LP Aguascalientes, Mexico (StataCorp LP) fracreg - betareg May 18,

More information

Econ 371 Problem Set #4 Answer Sheet. 6.2 This question asks you to use the results from column (1) in the table on page 213.

Econ 371 Problem Set #4 Answer Sheet. 6.2 This question asks you to use the results from column (1) in the table on page 213. Econ 371 Problem Set #4 Answer Sheet 6.2 This question asks you to use the results from column (1) in the table on page 213. a. The first part of this question asks whether workers with college degrees

More information

Copyright 2005 Pearson Education, Inc. Slide 6-1

Copyright 2005 Pearson Education, Inc. Slide 6-1 Copyright 2005 Pearson Education, Inc. Slide 6-1 Chapter 6 Copyright 2005 Pearson Education, Inc. Measures of Center in a Distribution 6-A The mean is what we most commonly call the average value. It is

More information

Intro to GLM Day 2: GLM and Maximum Likelihood

Intro to GLM Day 2: GLM and Maximum Likelihood Intro to GLM Day 2: GLM and Maximum Likelihood Federico Vegetti Central European University ECPR Summer School in Methods and Techniques 1 / 32 Generalized Linear Modeling 3 steps of GLM 1. Specify the

More information

Comparing Odds Ratios and Marginal Effects from Logistic Regression and Linear Probability Models

Comparing Odds Ratios and Marginal Effects from Logistic Regression and Linear Probability Models Western Kentucky University From the SelectedWorks of Matt Bogard Spring March 11, 2016 Comparing Odds Ratios and Marginal Effects from Logistic Regression and Linear Probability Models Matt Bogard Available

More information

STATA Program for OLS cps87_or.do

STATA Program for OLS cps87_or.do STATA Program for OLS cps87_or.do * the data for this project is a small subsample; * of full time (30 or more hours) male workers; * aged 21-64 from the out going rotation; * samples of the 1987 current

More information

proc genmod; model malform/total = alcohol / dist=bin link=identity obstats; title 'Table 2.7'; title2 'Identity Link';

proc genmod; model malform/total = alcohol / dist=bin link=identity obstats; title 'Table 2.7'; title2 'Identity Link'; BIOS 6244 Analysis of Categorical Data Assignment 5 s 1. Consider Exercise 4.4, p. 98. (i) Write the SAS code, including the DATA step, to fit the linear probability model and the logit model to the data

More information

Descriptive Statistics

Descriptive Statistics Chapter 3 Descriptive Statistics Chapter 2 presented graphical techniques for organizing and displaying data. Even though such graphical techniques allow the researcher to make some general observations

More information

The mathematical definitions are given on screen.

The mathematical definitions are given on screen. Text Lecture 3.3 Coherent measures of risk and back- testing Dear all, welcome back. In this class we will discuss one of the main drawbacks of Value- at- Risk, that is to say the fact that the VaR, as

More information

Multiple Regression. Review of Regression with One Predictor

Multiple Regression. Review of Regression with One Predictor Fall Semester, 2001 Statistics 621 Lecture 4 Robert Stine 1 Preliminaries Multiple Regression Grading on this and other assignments Assignment will get placed in folder of first member of Learning Team.

More information

Professor Brad Jones University of Arizona POL 681, SPRING 2004 INTERACTIONS and STATA: Companion To Lecture Notes on Statistical Interactions

Professor Brad Jones University of Arizona POL 681, SPRING 2004 INTERACTIONS and STATA: Companion To Lecture Notes on Statistical Interactions Professor Brad Jones University of Arizona POL 681, SPRING 2004 INTERACTIONS and STATA: Companion To Lecture Notes on Statistical Interactions Preliminaries 1. Basic Regression. reg y x1 Source SS df MS

More information

The relationship between GDP, labor force and health expenditure in European countries

The relationship between GDP, labor force and health expenditure in European countries Econometrics-Term paper The relationship between GDP, labor force and health expenditure in European countries Student: Nguyen Thu Ha Contents 1. Background:... 2 2. Discussion:... 2 3. Regression equation

More information

Rescaling results of nonlinear probability models to compare regression coefficients or variance components across hierarchically nested models

Rescaling results of nonlinear probability models to compare regression coefficients or variance components across hierarchically nested models Rescaling results of nonlinear probability models to compare regression coefficients or variance components across hierarchically nested models Dirk Enzmann & Ulrich Kohler University of Hamburg, dirk.enzmann@uni-hamburg.de

More information

Unit 5: Study Guide Multilevel models for macro and micro data MIMAS The University of Manchester

Unit 5: Study Guide Multilevel models for macro and micro data MIMAS The University of Manchester Unit 5: Study Guide Multilevel models for macro and micro data MIMAS The University of Manchester 5.1 Introduction 5.2 Learning objectives 5.3 Single level models 5.4 Multilevel models 5.5 Theoretical

More information

Problem Set 6 ANSWERS

Problem Set 6 ANSWERS Economics 20 Part I. Problem Set 6 ANSWERS Prof. Patricia M. Anderson The first 5 questions are based on the following information: Suppose a researcher is interested in the effect of class attendance

More information

Determining Probability Estimates From Logistic Regression Results Vartanian: SW 541

Determining Probability Estimates From Logistic Regression Results Vartanian: SW 541 Determining Probability Estimates From Logistic Regression Results Vartanian: SW 541 In determining logistic regression results, you will generally be given the odds ratio in the SPSS or SAS output. However,

More information

mfx: Marginal Effects, Odds Ratios and Incidence Rate Ratios for GLMs

mfx: Marginal Effects, Odds Ratios and Incidence Rate Ratios for GLMs mfx: Marginal Effects, Odds Ratios and Incidence Rate Ratios for GLMs Fernihough, A. mfx: Marginal Effects, Odds Ratios and Incidence Rate Ratios for GLMs Document Version: Publisher's PDF, also known

More information

ECON Introductory Econometrics Seminar 2, 2015

ECON Introductory Econometrics Seminar 2, 2015 ECON4150 - Introductory Econometrics Seminar 2, 2015 Stock and Watson EE4.1, EE5.2 Stock and Watson EE4.1, EE5.2 ECON4150 - Introductory Econometrics Seminar 2, 2015 1 / 14 Seminar 2 Author: Andrea University

More information

Cross-country comparison using the ECHP Descriptive statistics and Simple Models. Cheti Nicoletti Institute for Social and Economic Research

Cross-country comparison using the ECHP Descriptive statistics and Simple Models. Cheti Nicoletti Institute for Social and Economic Research Cross-country comparison using the ECHP Descriptive statistics and Simple Models Cheti Nicoletti Institute for Social and Economic Research Comparing income variables across countries Income variables

More information

SPSS I: Menu Basics Practice Exercises Target Software & Version: SPSS V Last Updated on January 17, 2007 Created by Jennifer Ortman

SPSS I: Menu Basics Practice Exercises Target Software & Version: SPSS V Last Updated on January 17, 2007 Created by Jennifer Ortman SPSS I: Menu Basics Practice Exercises Target Software & Version: SPSS V. 14.02 Last Updated on January 17, 2007 Created by Jennifer Ortman PRACTICE EXERCISES Exercise A Obtain descriptive statistics (mean,

More information

Ti 83/84. Descriptive Statistics for a List of Numbers

Ti 83/84. Descriptive Statistics for a List of Numbers Ti 83/84 Descriptive Statistics for a List of Numbers Quiz scores in a (fictitious) class were 10.5, 13.5, 8, 12, 11.3, 9, 9.5, 5, 15, 2.5, 10.5, 7, 11.5, 10, and 10.5. It s hard to get much of a sense

More information

Calculating the Probabilities of Member Engagement

Calculating the Probabilities of Member Engagement Calculating the Probabilities of Member Engagement by Larry J. Seibert, Ph.D. Binary logistic regression is a regression technique that is used to calculate the probability of an outcome when there are

More information

Chapter 18: The Correlational Procedures

Chapter 18: The Correlational Procedures Introduction: In this chapter we are going to tackle about two kinds of relationship, positive relationship and negative relationship. Positive Relationship Let's say we have two values, votes and campaign

More information

Module 4: Point Estimation Statistics (OA3102)

Module 4: Point Estimation Statistics (OA3102) Module 4: Point Estimation Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chapter 8.1-8.4 Revision: 1-12 1 Goals for this Module Define

More information

Data Analysis. BCF106 Fundamentals of Cost Analysis

Data Analysis. BCF106 Fundamentals of Cost Analysis Data Analysis BCF106 Fundamentals of Cost Analysis June 009 Chapter 5 Data Analysis 5.0 Introduction... 3 5.1 Terminology... 3 5. Measures of Central Tendency... 5 5.3 Measures of Dispersion... 7 5.4 Frequency

More information

Modelling Economic Variables

Modelling Economic Variables ucsc supplementary notes ams/econ 11a Modelling Economic Variables c 2010 Yonatan Katznelson 1. Mathematical models The two central topics of AMS/Econ 11A are differential calculus on the one hand, and

More information

Estimating Heterogeneous Choice Models with Stata

Estimating Heterogeneous Choice Models with Stata Estimating Heterogeneous Choice Models with Stata Richard Williams Notre Dame Sociology rwilliam@nd.edu West Coast Stata Users Group Meetings October 25, 2007 Overview When a binary or ordinal regression

More information

İnsan TUNALI 8 November 2018 Econ 511: Econometrics I. ASSIGNMENT 7 STATA Supplement

İnsan TUNALI 8 November 2018 Econ 511: Econometrics I. ASSIGNMENT 7 STATA Supplement İnsan TUNALI 8 November 2018 Econ 511: Econometrics I ASSIGNMENT 7 STATA Supplement. use "F:\COURSES\GRADS\ECON511\SHARE\wages1.dta", clear. generate =ln(wage). scatter sch Q. Do you see a relationship

More information

XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING

XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING INTRODUCTION XLSTAT makes accessible to anyone a powerful, complete and user-friendly data analysis and statistical solution. Accessibility to

More information

Regression with a binary dependent variable: Logistic regression diagnostic

Regression with a binary dependent variable: Logistic regression diagnostic ACADEMIC YEAR 2016/2017 Università degli Studi di Milano GRADUATE SCHOOL IN SOCIAL AND POLITICAL SCIENCES APPLIED MULTIVARIATE ANALYSIS Luigi Curini luigi.curini@unimi.it Do not quote without author s

More information

Estimating Ordered Categorical Variables Using Panel Data: A Generalised Ordered Probit Model with an Autofit Procedure

Estimating Ordered Categorical Variables Using Panel Data: A Generalised Ordered Probit Model with an Autofit Procedure Journal of Economics and Econometrics Vol. 54, No.1, 2011 pp. 7-23 ISSN 2032-9652 E-ISSN 2032-9660 Estimating Ordered Categorical Variables Using Panel Data: A Generalised Ordered Probit Model with an

More information

Duration Models: Parametric Models

Duration Models: Parametric Models Duration Models: Parametric Models Brad 1 1 Department of Political Science University of California, Davis January 28, 2011 Parametric Models Some Motivation for Parametrics Consider the hazard rate:

More information

Trading and Enforcing Patent Rights. Carlos J. Serrano University of Toronto and NBER

Trading and Enforcing Patent Rights. Carlos J. Serrano University of Toronto and NBER Trading and Enforcing Patent Rights Alberto Galasso University of Toronto Mark Schankerman London School of Economics and CEPR Carlos J. Serrano University of Toronto and NBER OECD-KNOWINNO Workshop @

More information

1. f(x) = x2 + x 12 x 2 4 Let s run through the steps.

1. f(x) = x2 + x 12 x 2 4 Let s run through the steps. Math 121 (Lesieutre); 4.3; September 6, 2017 The steps for graphing a rational function: 1. Factor the numerator and denominator, and write the function in lowest terms. 2. Set the numerator equal to zero

More information

STATA log file for Time-Varying Covariates (TVC) Duration Model Estimations.

STATA log file for Time-Varying Covariates (TVC) Duration Model Estimations. STATA log file for Time-Varying Covariates (TVC) Duration Model Estimations. This STATA 8.0 log file reports estimations in which CDER Staff Aggregates and PDUFA variable are assigned to drug-months of

More information