Determining Probability Estimates From Logistic Regression Results Vartanian: SW 541

Size: px
Start display at page:

Download "Determining Probability Estimates From Logistic Regression Results Vartanian: SW 541"

Transcription

1 Determining Probability Estimates From Logistic Regression Results Vartanian: SW 541 In determining logistic regression results, you will generally be given the odds ratio in the SPSS or SAS output. However, you will not be given the probability estimates for different values of your independent variables. For example, you may be examining the likelihood of being in poverty and would like to know what the probability is for single mothers, for those with 5 children, or for single mothers with 5 children. Through the method I will show you below, you will be able to determine the probability estimates for any group you=re interested in. I will show you how to determine these probability estimates in SAS. Let=s say we=re examining the likelihood of having income below the poverty line (inpov) and are using 4 independent variables: 1.Whether a person lives in public housing (pubhouse B dummy variable) 2. Whether a person lives in a big city (over 750,000 population) (bigcit -- dummy variable) 3. The wages of the head of household (wagehd B continuous variable) 4. The level of education of the head of household (edhd -- continuous variable) The first step will be to use SAS to determine the logistic regression results. SAS automatically determines the likelihood of the 0 condition instead of the likelihood of the 1 condition (for example, the likelihood of not being in poverty), unless you use a descending command, which we will do. Prob(pov = 1)= Remember that the logistic equation takes the following form: We will use this form to determine probability estimates for the different variables within the model. While some researchers use mean values of all other variables to determine the probability estimates of a given variable, you will use the actual observed values to determine these probabilities. e 1+e You will use SAS commands to determine the probabilities of having income below the poverty line for individuals with specific conditions. Below, I have presented the first set of SAS commands to determine the logistic regression results and then to determine the probability estimates from these results. a+xb a+ xb libname in 'P:\pubdata\gssw'; (or wherever the data is located) D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 1

2 data a;set in.psidphd; * You will use this a variable as a merge variable later. For some reason, SAS insists on having this variable to merge by. proc logist descending outest=dd maxiter=100; output out=cc xbeta=xb; model inpov=pubhouse bigcit wagehd edhd; *proc logist is the command for determining logistic regression results. data f;set dd; rename pubhouse=cpubhous bigcit=cbigcit wagehd=cwagehd edhd=cedhd; drop _type_; * You=re creating this merge variable in the second data set, so you can eventually merge the two data sets together. What the different commands mean: The outest=dd. This command tells SAS to create a new data set that only contains coefficient estimates from the logistic regression model. These newly created variables (the b coefficients) have the same names as the original variables (the X variables). We will need to change the names of these because we will eventually be multiply the Xs and the Bs together to determine the probability of an event occurring. Notice that we SET the variables from data set CC into data set f, and we use a rename statement to give these coefficient estimates new names. I=ve changed the names of the variables from their regular names to names that begin with c (for coefficient estimate). Maxiter=100. Logistic regression uses an interative process to determine b coefficients. The process keeps iterating until a stable b coefficient if found. The default number of iterations in SAS is 25. I have simply upped this number to 100 iterations. Out=CC. This is a means for creating a new data set that contains all the variables from the data that went into the logistic regression analysis. Thus, the data in the new data set CC, contains all the variables that went into the logistic run. You will need these variables to determine the probability estimates B these are your X variables. Xbeta=XB. This gives you the estimate for the variable values for each observation times the b coefficient. This XB estimate includes the estimate for the intercept (or a+xb). You=ll be able to use this value for determining probability estimates. However, you will be adding or subtracting from this value, depending on the probability you=re examining. D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 2

3 SECOND SET OF SAS COMMANDS: data g;merge f cc;by a; Now, we will merge together the two created data sets, which will put the b coefficient estimates and the variable values into one data set. Each observation will have the same values for the b coefficients and will have different values for the variables. For example, each observation in the new merged data set below will have the same value for cpubhous=1.4513, while the value for pubhouse will depend on whether or not they lived in public housing (1=yes, 0=no). xb_nopub=xb-cpubhous*pubhouse; * This is the derivation of your XB for the likelihood of being in poverty for those who do not live in public housing. What I=ve done is taken the overall XB (which includes the XB for public housing) and subtracted off the coefficient*variable estimate for living in public housing. This is the same as setting pubhouse=0. In other words, we=re asking for the likelihood of being in poverty, given that all individuals do not live in public housing. This is just as what we did with the estimates in class. That is, we wanted to determine the probability of being in poverty if someone had 5 kids. We substituted the number 5 for the X and multiplied it by the b coefficient. What we will do here is determine this probability for all individuals (holding all else equal), and then take a mean for the sample. This will give us the overall likelihood of being in the condition. xb_pub=xb_nopub+cpubhous; * In this second estimate for public housing, I=m determining the XB for those who live in public housing. We again need to subtract off the coefficient*variable estimates for public housing (as we did above), and then add back in cpubhous*1 (or simply cpubhous). We=re again forcing everyone into a particular state, they live in public housing. We=ll then see how this affects their likelihood of being in poverty. Xb_nobig=xb-cbigcit*bigcit; xb_big=xb_nobig+cbigcit; xb_wag5=xb-cwagehd*wagehd+cwagehd*5; xb_wag10=xb-cwagehd*wagehd+cwagehd*10; xb_wag1=xb-cwagehd*wagehd+cwagehd*1; xb_ed10=xb-cedhd*edhd+cedhd*10; xb_ed12=xb-cedhd*edhd+cedhd*12; xb_ed16=xb-cedhd*edhd+cwagehd*16; With interval/ratio scale variables, we again need to subtract off the coefficient estimates of the variable times the actual value of the variable since these values are already contained in xb. We want to determine estimates for this variable at specific levels B not at the particular level of any individual. So we will add back on the coefficient estimate and multiply it by 10 (to get an estimate of those who have a 10 th grade education), or 12 (high school graduate) or 16 (college D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 3

4 graduate). Below, we take the Xb estimates from above and put them into a logistic form (see the formula above). Each individual within the sample will a value for each of the variables below. We will then take the mean, which will give us an overall average probability of being in poverty. pr_nopub=(exp(xb_nopub))/(1+exp(xb_nopub)); pr_pub=(exp(xb_pub))/(1+exp(xb_pub)); pr_nobig=(exp(xb_nobig))/(1+exp(xb_nobig)); pr_big=(exp(xb_big))/(1+exp(xb_big)); pr_wag5=(exp(xb_wag5))/(1+exp(xb_wag5)); pr_wag10=(exp(xb_wag10))/(1+exp(xb_wag10)); pr_wag1=(exp(xb_wag1))/(1+exp(xb_wag1)); pr_ed10=(exp(xb_ed10))/(1+exp(xb_ed10)); pr_ed12=(exp(xb_ed12))/(1+exp(xb_ed12)); pr_ed16=(exp(xb_ed16))/(1+exp(xb_ed16)); proc means;var pr_nopub pr_pub pr_nobig pr_big pr_wag5 pr_wag10 pr_wag1 pr_ed10 pr_ed12 pr_ed16; weight weight; run; Results The LOGISTIC Procedure Data Set: WORK.A Response Variable: INPOV Response Levels: 2 Number of Observations: Link Function: Logit Response Profile Ordered Value INPOV Count (In this sample, 3,208 lived below the poverty line, 12,198 did not.) Model Fitting Information and Testing Global Null Hypothesis BETA=0 Intercept Intercept and Criterion Only Covariates Chi-Square for Covariates AIC SC LOG L with 4 DF (p=0.0001) Score with 4 DF (p=0.0001) Analysis of Maximum Likelihood Estimates D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 4

5 Parameter Standard Wald Pr > Standardized Odds Variable DF Estimate Error Chi-Square Chi-Square Estimate Ratio INTERCPT PUBHOUSE BIGCIT WAGEHD EDHD Notice that all coefficient estimates are significantly related to being in poverty. Those living in public housing and in big cities are positively related to having income below the poverty line. Wages and education level of the head have a negative relationship to having income below the poverty line. Also note that -2 Log L is significant (p=.0001). Thus, all of the variables together are related to the dependent variable. Below are the probability estimates, or the likelihood of being in poverty, given particular conditions. The likelihood of being in poverty given that you don=t live in public housing, controlling for all other variables in the model, is 17.12%. For the probability for those living in public housing is %. For those with wages of $1/hour, the likelihood of being in poverty is 43.04%, while for those earning $10/hour, this likelihood is 7.64%. All of the probability estimates are determined holding all other variables within the model constant. The SAS System 15:40 Thursday, January 28, 1999 Variable N Mean Std Dev Minimum Maximum PR_NOPUB E PR_PUB E PR_NOBIG E PR_BIG E PR_WAG PR_WAG PR_WAG PR_ED E PR_ED E PR_ED E D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 5

6 The SAS program: libname in 'P:\pubdata\gssw'; (or wherever the data is located) data a;set in.psidphd; proc logist descending outest=dd maxiter=100; output out=cc xbeta=xb; model inpov=pubhouse bigcit wagehd edhd; data f;set dd; rename pubhouse=cpubhous bigcit=cbigcit wagehd=cwagehd edhd=cedhd; drop _type_; data g;merge f cc;by a; xb_nopub=xb-cpubhous*pubhouse; xb_pub=xb_nopub+cpubhous; Xb_nobig=xb-cbigcit*bigcit; xb_big=xb_nobig+cbigcit; xb_wag5=xb-cwagehd*wagehd+cwagehd*5; xb_wag10=xb-cwagehd*wagehd+cwagehd*10; xb_wag1=xb-cwagehd*wagehd+cwagehd*1; xb_ed10=xb-cedhd*edhd+cedhd*10; xb_ed12=xb-cedhd*edhd+cedhd*12; xb_ed16=xb-cedhd*edhd+cwagehd*16; pr_nopub=(exp(xb_nopub))/(1+exp(xb_nopub)); pr_pub=(exp(xb_pub))/(1+exp(xb_pub)); pr_nobig=(exp(xb_nobig))/(1+exp(xb_nobig)); pr_big=(exp(xb_big))/(1+exp(xb_big)); pr_wag5=(exp(xb_wag5))/(1+exp(xb_wag5)); pr_wag10=(exp(xb_wag10))/(1+exp(xb_wag10)); pr_wag1=(exp(xb_wag1))/(1+exp(xb_wag1)); pr_ed10=(exp(xb_ed10))/(1+exp(xb_ed10)); pr_ed12=(exp(xb_ed12))/(1+exp(xb_ed12)); pr_ed16=(exp(xb_ed16))/(1+exp(xb_ed16)); proc means;var pr_nopub pr_pub pr_nobig pr_big pr_wag5 pr_wag10 pr_wag1 pr_ed10 pr_ed12 pr_ed16; weight weight; run; D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 6

proc genmod; model malform/total = alcohol / dist=bin link=identity obstats; title 'Table 2.7'; title2 'Identity Link';

proc genmod; model malform/total = alcohol / dist=bin link=identity obstats; title 'Table 2.7'; title2 'Identity Link'; BIOS 6244 Analysis of Categorical Data Assignment 5 s 1. Consider Exercise 4.4, p. 98. (i) Write the SAS code, including the DATA step, to fit the linear probability model and the logit model to the data

More information

[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA]

[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA] Tutorial #3 This example uses data in the file 16.09.2011.dta under Tutorial folder. It contains 753 observations from a sample PSID data on the labor force status of married women in the U.S in 1975.

More information

Final Exam - section 1. Thursday, December hours, 30 minutes

Final Exam - section 1. Thursday, December hours, 30 minutes Econometrics, ECON312 San Francisco State University Michael Bar Fall 2013 Final Exam - section 1 Thursday, December 19 1 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.

More information

Alastair Hall ECG 790F: Microeconometrics Spring Computer Handout # 2. Estimation of binary response models : part II

Alastair Hall ECG 790F: Microeconometrics Spring Computer Handout # 2. Estimation of binary response models : part II Alastair Hall ECG 790F: Microeconometrics Spring 2006 Computer Handout # 2 Estimation of binary response models : part II In this handout, we discuss the estimation of binary response models with and without

More information

tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6}

tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6} PS 4 Monday August 16 01:00:42 2010 Page 1 tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6} log: C:\web\PS4log.smcl log type: smcl opened on:

More information

Table 4. Probit model of union membership. Probit coefficients are presented below. Data from March 2008 Current Population Survey.

Table 4. Probit model of union membership. Probit coefficients are presented below. Data from March 2008 Current Population Survey. 1. Using a probit model and data from the 2008 March Current Population Survey, I estimated a probit model of the determinants of pension coverage. Three specifications were estimated. The first included

More information

Logistic Regression with R: Example One

Logistic Regression with R: Example One Logistic Regression with R: Example One math = read.table("http://www.utstat.toronto.edu/~brunner/appliedf12/data/mathcat.data") math[1:5,] hsgpa hsengl hscalc course passed outcome 1 78.0 80 Yes Mainstrm

More information

Lecture 21: Logit Models for Multinomial Responses Continued

Lecture 21: Logit Models for Multinomial Responses Continued Lecture 21: Logit Models for Multinomial Responses Continued Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University

More information

Multiple Regression and Logistic Regression II. Dajiang 525 Apr

Multiple Regression and Logistic Regression II. Dajiang 525 Apr Multiple Regression and Logistic Regression II Dajiang Liu @PHS 525 Apr-19-2016 Materials from Last Time Multiple regression model: Include multiple predictors in the model = + + + + How to interpret the

More information

Intro to GLM Day 2: GLM and Maximum Likelihood

Intro to GLM Day 2: GLM and Maximum Likelihood Intro to GLM Day 2: GLM and Maximum Likelihood Federico Vegetti Central European University ECPR Summer School in Methods and Techniques 1 / 32 Generalized Linear Modeling 3 steps of GLM 1. Specify the

More information

Calculating the Probabilities of Member Engagement

Calculating the Probabilities of Member Engagement Calculating the Probabilities of Member Engagement by Larry J. Seibert, Ph.D. Binary logistic regression is a regression technique that is used to calculate the probability of an outcome when there are

More information

Multinomial Logit Models - Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 13, 2017

Multinomial Logit Models - Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 13, 2017 Multinomial Logit Models - Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 13, 2017 This is adapted heavily from Menard s Applied Logistic Regression

More information

STA 4504/5503 Sample questions for exam True-False questions.

STA 4504/5503 Sample questions for exam True-False questions. STA 4504/5503 Sample questions for exam 2 1. True-False questions. (a) For General Social Survey data on Y = political ideology (categories liberal, moderate, conservative), X 1 = gender (1 = female, 0

More information

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods 1 SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 Lecture 10: Multinomial regression baseline category extension of binary What if we have multiple possible

More information

Logit Models for Binary Data

Logit Models for Binary Data Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis These models are appropriate when the response

More information

PASS Sample Size Software

PASS Sample Size Software Chapter 850 Introduction Cox proportional hazards regression models the relationship between the hazard function λ( t X ) time and k covariates using the following formula λ log λ ( t X ) ( t) 0 = β1 X1

More information

############################ ### toxo.r ### ############################

############################ ### toxo.r ### ############################ ############################ ### toxo.r ### ############################ toxo < read.table(file="n:\\courses\\stat8620\\fall 08\\toxo.dat",header=T) #toxo < read.table(file="c:\\documents and Settings\\dhall\\My

More information

Estimation Procedure for Parametric Survival Distribution Without Covariates

Estimation Procedure for Parametric Survival Distribution Without Covariates Estimation Procedure for Parametric Survival Distribution Without Covariates The maximum likelihood estimates of the parameters of commonly used survival distribution can be found by SAS. The following

More information

Review questions for Multinomial Logit/Probit, Tobit, Heckit, Quantile Regressions

Review questions for Multinomial Logit/Probit, Tobit, Heckit, Quantile Regressions 1. I estimated a multinomial logit model of employment behavior using data from the 2006 Current Population Survey. The three possible outcomes for a person are employed (outcome=1), unemployed (outcome=2)

More information

Ordinal Multinomial Logistic Regression. Thom M. Suhy Southern Methodist University May14th, 2013

Ordinal Multinomial Logistic Regression. Thom M. Suhy Southern Methodist University May14th, 2013 Ordinal Multinomial Logistic Thom M. Suhy Southern Methodist University May14th, 2013 GLM Generalized Linear Model (GLM) Framework for statistical analysis (Gelman and Hill, 2007, p. 135) Linear Continuous

More information

Market Variables and Financial Distress. Giovanni Fernandez Stetson University

Market Variables and Financial Distress. Giovanni Fernandez Stetson University Market Variables and Financial Distress Giovanni Fernandez Stetson University In this paper, I investigate the predictive ability of market variables in correctly predicting and distinguishing going concern

More information

BEcon Program, Faculty of Economics, Chulalongkorn University Page 1/7

BEcon Program, Faculty of Economics, Chulalongkorn University Page 1/7 Mid-term Exam (November 25, 2005, 0900-1200hr) Instructions: a) Textbooks, lecture notes and calculators are allowed. b) Each must work alone. Cheating will not be tolerated. c) Attempt all the tests.

More information

To be two or not be two, that is a LOGISTIC question

To be two or not be two, that is a LOGISTIC question MWSUG 2016 - Paper AA18 To be two or not be two, that is a LOGISTIC question Robert G. Downer, Grand Valley State University, Allendale, MI ABSTRACT A binary response is very common in logistic regression

More information

ARIMA ANALYSIS WITH INTERVENTIONS / OUTLIERS

ARIMA ANALYSIS WITH INTERVENTIONS / OUTLIERS TASK Run intervention analysis on the price of stock M: model a function of the price as ARIMA with outliers and interventions. SOLUTION The document below is an abridged version of the solution provided

More information

CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES

CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES Examples: Monte Carlo Simulation Studies CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES Monte Carlo simulation studies are often used for methodological investigations of the performance of statistical

More information

Assessment on Credit Risk of Real Estate Based on Logistic Regression Model

Assessment on Credit Risk of Real Estate Based on Logistic Regression Model Assessment on Credit Risk of Real Estate Based on Logistic Regression Model Li Hongli 1, a, Song Liwei 2,b 1 Chongqing Engineering Polytechnic College, Chongqing400037, China 2 Division of Planning and

More information

Comparing Odds Ratios and Marginal Effects from Logistic Regression and Linear Probability Models

Comparing Odds Ratios and Marginal Effects from Logistic Regression and Linear Probability Models Western Kentucky University From the SelectedWorks of Matt Bogard Spring March 11, 2016 Comparing Odds Ratios and Marginal Effects from Logistic Regression and Linear Probability Models Matt Bogard Available

More information

Actuarial Research on the Effectiveness of Collision Avoidance Systems FCW & LDW. A translation from Hebrew to English of a research paper prepared by

Actuarial Research on the Effectiveness of Collision Avoidance Systems FCW & LDW. A translation from Hebrew to English of a research paper prepared by Actuarial Research on the Effectiveness of Collision Avoidance Systems FCW & LDW A translation from Hebrew to English of a research paper prepared by Ron Actuarial Intelligence LTD Contact Details: Shachar

More information

Financial Econometrics: Problem Set # 3 Solutions

Financial Econometrics: Problem Set # 3 Solutions Financial Econometrics: Problem Set # 3 Solutions N Vera Chau The University of Chicago: Booth February 9, 219 1 a. You can generate the returns using the exact same strategy as given in problem 2 below.

More information

Module 9: Single-level and Multilevel Models for Ordinal Responses. Stata Practical 1

Module 9: Single-level and Multilevel Models for Ordinal Responses. Stata Practical 1 Module 9: Single-level and Multilevel Models for Ordinal Responses Pre-requisites Modules 5, 6 and 7 Stata Practical 1 George Leckie, Tim Morris & Fiona Steele Centre for Multilevel Modelling If you find

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Scott Creel Wednesday, September 10, 2014 This exercise extends the prior material on using the lm() function to fit an OLS regression and test hypotheses about effects on a parameter.

More information

Class Notes: Week 6. Multinomial Outcomes

Class Notes: Week 6. Multinomial Outcomes Ronald Hek Class Notes: Week 6 1 Class Notes: Week 6 Multinomial Outomes For the next ouple of weeks or so, we will look at models where there are more than two ategories of outomes. Multinomial logisti

More information

Logistic Regression. Logistic Regression Theory

Logistic Regression. Logistic Regression Theory Logistic Regression Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Logistic Regression The linear probability model.

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #6 EPSY 905: Maximum Likelihood In This Lecture The basics of maximum likelihood estimation Ø The engine that

More information

Sociology Exam 3 Answer Key - DRAFT May 8, 2007

Sociology Exam 3 Answer Key - DRAFT May 8, 2007 Sociology 63993 Exam 3 Answer Key - DRAFT May 8, 2007 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. The odds of an event occurring

More information

Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Paul J. Hilliard, Educational Testing Service (ETS)

Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Paul J. Hilliard, Educational Testing Service (ETS) Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds INTRODUCTION Multicategory Logit

More information

9. Logit and Probit Models For Dichotomous Data

9. Logit and Probit Models For Dichotomous Data Sociology 740 John Fox Lecture Notes 9. Logit and Probit Models For Dichotomous Data Copyright 2014 by John Fox Logit and Probit Models for Dichotomous Responses 1 1. Goals: I To show how models similar

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION In Inferential Statistic, ESTIMATION (i) (ii) is called the True Population Mean and is called the True Population Proportion. You must also remember that are not the only population parameters. There

More information

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 13, 2018

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 13, 2018 Maximum Likelihood Estimation Richard Williams, University of otre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 3, 208 [This handout draws very heavily from Regression Models for Categorical

More information

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta) Getting Started in Logit and Ordered Logit Regression (ver. 3. beta Oscar Torres-Reyna Data Consultant otorres@princeton.edu http://dss.princeton.edu/training/ Logit model Use logit models whenever your

More information

Dummy Variables. 1. Example: Factors Affecting Monthly Earnings

Dummy Variables. 1. Example: Factors Affecting Monthly Earnings Dummy Variables A dummy variable or binary variable is a variable that takes on a value of 0 or 1 as an indicator that the observation has some kind of characteristic. Common examples: Sex (female): FEMALE=1

More information

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta) Getting Started in Logit and Ordered Logit Regression (ver. 3. beta Oscar Torres-Reyna Data Consultant otorres@princeton.edu http://dss.princeton.edu/training/ Logit model Use logit models whenever your

More information

Chapter 4 Level of Volatility in the Indian Stock Market

Chapter 4 Level of Volatility in the Indian Stock Market Chapter 4 Level of Volatility in the Indian Stock Market Measurement of volatility is an important issue in financial econometrics. The main reason for the prominent role that volatility plays in financial

More information

Brief Sketch of Solutions: Tutorial 1. 2) descriptive statistics and correlogram. Series: LGCSI Sample 12/31/ /11/2009 Observations 2596

Brief Sketch of Solutions: Tutorial 1. 2) descriptive statistics and correlogram. Series: LGCSI Sample 12/31/ /11/2009 Observations 2596 Brief Sketch of Solutions: Tutorial 1 2) descriptive statistics and correlogram 240 200 160 120 80 40 0 4.8 5.0 5.2 5.4 5.6 5.8 6.0 6.2 Series: LGCSI Sample 12/31/1999 12/11/2009 Observations 2596 Mean

More information

Logistic Regression Analysis

Logistic Regression Analysis Revised July 2018 Logistic Regression Analysis This set of notes shows how to use Stata to estimate a logistic regression equation. It assumes that you have set Stata up on your computer (see the Getting

More information

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 10, 2017

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 10, 2017 Maximum Likelihood Estimation Richard Williams, University of otre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 0, 207 [This handout draws very heavily from Regression Models for Categorical

More information

Brief Sketch of Solutions: Tutorial 2. 2) graphs. 3) unit root tests

Brief Sketch of Solutions: Tutorial 2. 2) graphs. 3) unit root tests Brief Sketch of Solutions: Tutorial 2 2) graphs LJAPAN DJAPAN 5.2.12 5.0.08 4.8.04 4.6.00 4.4 -.04 4.2 -.08 4.0 01 02 03 04 05 06 07 08 09 -.12 01 02 03 04 05 06 07 08 09 LUSA DUSA 7.4.12 7.3 7.2.08 7.1.04

More information

Gov 2001: Section 5. I. A Normal Example II. Uncertainty. Gov Spring 2010

Gov 2001: Section 5. I. A Normal Example II. Uncertainty. Gov Spring 2010 Gov 2001: Section 5 I. A Normal Example II. Uncertainty Gov 2001 Spring 2010 A roadmap We started by introducing the concept of likelihood in the simplest univariate context one observation, one variable.

More information

a. Explain why the coefficients change in the observed direction when switching from OLS to Tobit estimation.

a. Explain why the coefficients change in the observed direction when switching from OLS to Tobit estimation. 1. Using data from IRS Form 5500 filings by U.S. pension plans, I estimated a model of contributions to pension plans as ln(1 + c i ) = α 0 + U i α 1 + PD i α 2 + e i Where the subscript i indicates the

More information

Phd Program in Transportation. Transport Demand Modeling. Session 11

Phd Program in Transportation. Transport Demand Modeling. Session 11 Phd Program in Transportation Transport Demand Modeling João de Abreu e Silva Session 11 Binary and Ordered Choice Models Phd in Transportation / Transport Demand Modelling 1/26 Heterocedasticity Homoscedasticity

More information

Allison notes there are two conditions for using fixed effects methods.

Allison notes there are two conditions for using fixed effects methods. Panel Data 3: Conditional Logit/ Fixed Effects Logit Models Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised April 2, 2017 These notes borrow very heavily, sometimes

More information

Catherine De Vries, Spyros Kosmidis & Andreas Murr

Catherine De Vries, Spyros Kosmidis & Andreas Murr APPLIED STATISTICS FOR POLITICAL SCIENTISTS WEEK 8: DEPENDENT CATEGORICAL VARIABLES II Catherine De Vries, Spyros Kosmidis & Andreas Murr Topic: Logistic regression. Predicted probabilities. STATA commands

More information

CREDIT RISK MODELING IN R. Logistic regression: introduction

CREDIT RISK MODELING IN R. Logistic regression: introduction CREDIT RISK MODELING IN R Logistic regression: introduction Final data structure > str(training_set) 'data.frame': 19394 obs. of 8 variables: $ loan_status : Factor w/ 2 levels "0","1": 1 1 1 1 1 1 1 1

More information

CREDIT SCORING & CREDIT CONTROL XIV August 2015 Edinburgh. Aneta Ptak-Chmielewska Warsaw School of Ecoomics

CREDIT SCORING & CREDIT CONTROL XIV August 2015 Edinburgh. Aneta Ptak-Chmielewska Warsaw School of Ecoomics CREDIT SCORING & CREDIT CONTROL XIV 26-28 August 2015 Edinburgh Aneta Ptak-Chmielewska Warsaw School of Ecoomics aptak@sgh.waw.pl 1 Background literature Hypothesis Data and methods Empirical example Conclusions

More information

u panel_lecture . sum

u panel_lecture . sum u panel_lecture sum Variable Obs Mean Std Dev Min Max datastre 639 9039644 6369418 900228 926665 year 639 1980 2584012 1976 1984 total_sa 639 9377839 3212313 682 441e+07 tot_fixe 639 5214385 1988422 642

More information

Ordinary Least Squares Regression Examples Vartanian: SW 504

Ordinary Least Squares Regression Examples Vartanian: SW 504 Ordinary Least Squares Regression Examples Vartanian: SW 504 1. In this first ordinary least squares regression model, the dependent variable is AFDC income. The independent variable is number of kids

More information

Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X

Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X Chapter 156 Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X Introduction This procedure calculates the power and sample size necessary in a matched case-control study designed

More information

Variance clustering. Two motivations, volatility clustering, and implied volatility

Variance clustering. Two motivations, volatility clustering, and implied volatility Variance modelling The simplest assumption for time series is that variance is constant. Unfortunately that assumption is often violated in actual data. In this lecture we look at the implications of time

More information

Negative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction

Negative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction Negative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction Statistics 149 Spring 2006 Copyright 2006 by Mark E. Irwin Negative Binomial Family Example: Absenteeism from

More information

Bayesian Multinomial Model for Ordinal Data

Bayesian Multinomial Model for Ordinal Data Bayesian Multinomial Model for Ordinal Data Overview This example illustrates how to fit a Bayesian multinomial model by using the built-in mutinomial density function (MULTINOM) in the MCMC procedure

More information

Didacticiel - Études de cas. In this tutorial, we show how to implement a multinomial logistic regression with TANAGRA.

Didacticiel - Études de cas. In this tutorial, we show how to implement a multinomial logistic regression with TANAGRA. Subject In this tutorial, we show how to implement a multinomial logistic regression with TANAGRA. Logistic regression is a technique for maing predictions when the dependent variable is a dichotomy, and

More information

Quantitative Techniques Term 2

Quantitative Techniques Term 2 Quantitative Techniques Term 2 Laboratory 7 2 March 2006 Overview The objective of this lab is to: Estimate a cost function for a panel of firms; Calculate returns to scale; Introduce the command cluster

More information

book 2014/5/6 15:21 page 261 #285

book 2014/5/6 15:21 page 261 #285 book 2014/5/6 15:21 page 261 #285 Chapter 10 Simulation Simulations provide a powerful way to answer questions and explore properties of statistical estimators and procedures. In this chapter, we will

More information

Final Exam, section 2. Tuesday, December hour, 30 minutes

Final Exam, section 2. Tuesday, December hour, 30 minutes San Francisco State University Michael Bar ECON 312 Fall 2018 Final Exam, section 2 Tuesday, December 18 1 hour, 30 minutes Name: Instructions 1. This is closed book, closed notes exam. 2. You can use

More information

Homework Solutions - Lecture 2 Part 2

Homework Solutions - Lecture 2 Part 2 Homework Solutions - Lecture 2 Part 2 1. In 1995, Time Warner Inc. had a Beta of 1.61. Part of the reason for this high Beta was the debt left over from the leveraged buyout of Time by Warner in 1989,

More information

Final Exam Suggested Solutions

Final Exam Suggested Solutions University of Washington Fall 003 Department of Economics Eric Zivot Economics 483 Final Exam Suggested Solutions This is a closed book and closed note exam. However, you are allowed one page of handwritten

More information

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt.

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt. Categorical Outcomes Statistical Modelling in Stata: Categorical Outcomes Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Nominal Ordinal 28/11/2017 R by C Table: Example Categorical,

More information

EXST7015: Multiple Regression from Snedecor & Cochran (1967) RAW DATA LISTING

EXST7015: Multiple Regression from Snedecor & Cochran (1967) RAW DATA LISTING Multiple (Linear) Regression Introductory example Page 1 1 options ps=256 ls=132 nocenter nodate nonumber; 3 DATA ONE; 4 TITLE1 ''; 5 INPUT X1 X2 X3 Y; 6 **** LABEL Y ='Plant available phosphorus' 7 X1='Inorganic

More information

Econometric Methods for Valuation Analysis

Econometric Methods for Valuation Analysis Econometric Methods for Valuation Analysis Margarita Genius Dept of Economics M. Genius (Univ. of Crete) Econometric Methods for Valuation Analysis Cagliari, 2017 1 / 25 Outline We will consider econometric

More information

Hierarchical Generalized Linear Models. Measurement Incorporated Hierarchical Linear Models Workshop

Hierarchical Generalized Linear Models. Measurement Incorporated Hierarchical Linear Models Workshop Hierarchical Generalized Linear Models Measurement Incorporated Hierarchical Linear Models Workshop Hierarchical Generalized Linear Models So now we are moving on to the more advanced type topics. To begin

More information

Module 4 Bivariate Regressions

Module 4 Bivariate Regressions AGRODEP Stata Training April 2013 Module 4 Bivariate Regressions Manuel Barron 1 and Pia Basurto 2 1 University of California, Berkeley, Department of Agricultural and Resource Economics 2 University of

More information

Unit 8 Notes: Solving Quadratics by Factoring Alg 1

Unit 8 Notes: Solving Quadratics by Factoring Alg 1 Unit 8 Notes: Solving Quadratics by Factoring Alg 1 Name Period Day Date Assignment (Due the next class meeting) Tuesday Wednesday Thursday Friday Monday Tuesday Wednesday Thursday Friday Monday Tuesday

More information

Chapter 18: The Correlational Procedures

Chapter 18: The Correlational Procedures Introduction: In this chapter we are going to tackle about two kinds of relationship, positive relationship and negative relationship. Positive Relationship Let's say we have two values, votes and campaign

More information

Sean Howard Econometrics Final Project Paper. An Analysis of the Determinants and Factors of Physical Education Attendance in the Fourth Quarter

Sean Howard Econometrics Final Project Paper. An Analysis of the Determinants and Factors of Physical Education Attendance in the Fourth Quarter Sean Howard Econometrics Final Project Paper An Analysis of the Determinants and Factors of Physical Education Attendance in the Fourth Quarter Introduction This project attempted to gain a more complete

More information

Advanced Econometrics

Advanced Econometrics Advanced Econometrics Instructor: Takashi Yamano 11/14/2003 Due: 11/21/2003 Homework 5 (30 points) Sample Answers 1. (16 points) Read Example 13.4 and an AER paper by Meyer, Viscusi, and Durbin (1995).

More information

Modelling the potential human capital on the labor market using logistic regression in R

Modelling the potential human capital on the labor market using logistic regression in R Modelling the potential human capital on the labor market using logistic regression in R Ana-Maria Ciuhu (dobre.anamaria@hotmail.com) Institute of National Economy, Romanian Academy; National Institute

More information

Financial Literacy in Urban India: A Case Study of Bohra Community in Mumbai

Financial Literacy in Urban India: A Case Study of Bohra Community in Mumbai MPRA Munich Personal RePEc Archive Financial Literacy in Urban India: A Case Study of Bohra Community in Mumbai Tirupati Basutkar Ramanand Arya D. A. V. College, Mumbai, India 8 January 2016 Online at

More information

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Midterm

Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Midterm Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay Midterm GSB Honor Code: I pledge my honor that I have not violated the Honor Code during this examination.

More information

Applying Logistics Regression to Forecast Annual Organizational Retirements

Applying Logistics Regression to Forecast Annual Organizational Retirements SESUG Paper SD-137-2017 Applying Logistics Regression to Forecast Annual Organizational Retirements Alan Dunham, Greybeard Solutions, LLC ABSTRACT This paper briefly discusses the labor economics research

More information

Problem Set 5 Answers. ( ) 2. Yes, like temperature. See the plot of utility in the notes. Marginal utility should be positive.

Problem Set 5 Answers. ( ) 2. Yes, like temperature. See the plot of utility in the notes. Marginal utility should be positive. Business John H. Cochrane Problem Set Answers Part I A simple very short readings questions. + = + + + = + + + + = ( ). Yes, like temperature. See the plot of utility in the notes. Marginal utility should

More information

LAMPIRAN. Null Hypothesis: LO has a unit root Exogenous: Constant Lag Length: 1 (Automatic based on SIC, MAXLAG=13)

LAMPIRAN. Null Hypothesis: LO has a unit root Exogenous: Constant Lag Length: 1 (Automatic based on SIC, MAXLAG=13) 74 LAMPIRAN Lampiran 1 Analisis ARIMA 1.1. Uji Stasioneritas Variabel 1. Data Harga Minyak Riil Level Null Hypothesis: LO has a unit root Lag Length: 1 (Automatic based on SIC, MAXLAG=13) Augmented Dickey-Fuller

More information

Fall 2004 Social Sciences 7418 University of Wisconsin-Madison Problem Set 5 Answers

Fall 2004 Social Sciences 7418 University of Wisconsin-Madison Problem Set 5 Answers Economics 310 Menzie D. Chinn Fall 2004 Social Sciences 7418 University of Wisconsin-Madison Problem Set 5 Answers This problem set is due in lecture on Wednesday, December 15th. No late problem sets will

More information

Estimating Support Labor for a Production Program

Estimating Support Labor for a Production Program Estimating Support Labor for a Production Program ISPA / SCEA Joint Conference June 24-27, 2008 Jeff Platten PMP, CCE/A Systems Project Engineer Northrop Grumman Corporation Biography Jeff Platten is a

More information

DATABASE AND RESEARCH METHODOLOGY

DATABASE AND RESEARCH METHODOLOGY CHAPTER III DATABASE AND RESEARCH METHODOLOGY The nature of the present study Direct Tax Reforms in India: A Comparative Study of Pre and Post-liberalization periods is such that it requires secondary

More information

Economics 413: Economic Forecast and Analysis Department of Economics, Finance and Legal Studies University of Alabama

Economics 413: Economic Forecast and Analysis Department of Economics, Finance and Legal Studies University of Alabama Problem Set #1 (Linear Regression) 1. The file entitled MONEYDEM.XLS contains quarterly values of seasonally adjusted U.S.3-month ( 3 ) and 1-year ( 1 ) treasury bill rates. Each series is measured over

More information

How can saving deposit rate and Hang Seng Index affect housing prices : an empirical study in Hong Kong market

How can saving deposit rate and Hang Seng Index affect housing prices : an empirical study in Hong Kong market Lingnan Journal of Banking, Finance and Economics Volume 2 2010/2011 Academic Year Issue Article 3 January 2010 How can saving deposit rate and Hang Seng Index affect housing prices : an empirical study

More information

Final Exam, section 1. Thursday, May hour, 30 minutes

Final Exam, section 1. Thursday, May hour, 30 minutes San Francisco State University Michael Bar ECON 312 Spring 2018 Final Exam, section 1 Thursday, May 17 1 hour, 30 minutes Name: Instructions 1. This is closed book, closed notes exam. 2. You can use one

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

Discrete Choice Modeling William Greene Stern School of Business, New York University. Lab Session 2 Binary Choice Modeling with Panel Data

Discrete Choice Modeling William Greene Stern School of Business, New York University. Lab Session 2 Binary Choice Modeling with Panel Data Discrete Choice Modeling William Greene Stern School of Business, New York University Lab Session 2 Binary Choice Modeling with Panel Data This assignment will extend the models of binary choice and ordered

More information

Chapter 8 Exercises 1. Data Analysis & Graphics Using R Solutions to Exercises (May 1, 2010)

Chapter 8 Exercises 1. Data Analysis & Graphics Using R Solutions to Exercises (May 1, 2010) Chapter 8 Exercises 1 Data Analysis & Graphics Using R Solutions to Exercises (May 1, 2010) Preliminaries > library(daag) Exercise 1 The following table shows numbers of occasions when inhibition (i.e.,

More information

logistic logistic Merton Black - Scholes Black&Cox Merton Longstaff&Schwarlz Jarrow&Turnbull

logistic logistic Merton Black - Scholes Black&Cox Merton Longstaff&Schwarlz Jarrow&Turnbull 29 6 Vol. 29 No. 6 2016 11 Research of Finance and Education Nov. 2016 logistic 271000 logistic 2011-2014 80 A 21 logistic F830. 33 A 2095-0098 2016 06-0027 - 08 1 20 70 Merton 1974 1 Black - Scholes Black&Cox

More information

Renters Report Future Home Buying Optimism, While Family Financial Assistance Is Most Available to Populations with Higher Homeownership Rates

Renters Report Future Home Buying Optimism, While Family Financial Assistance Is Most Available to Populations with Higher Homeownership Rates Renters Report Future Home Buying Optimism, While Family Financial Assistance Is Most Available to Populations with Higher Homeownership Rates National Housing Survey Topic Analysis Q3 2016 Published on

More information

1) The Effect of Recent Tax Changes on Taxable Income

1) The Effect of Recent Tax Changes on Taxable Income 1) The Effect of Recent Tax Changes on Taxable Income In the most recent issue of the Journal of Policy Analysis and Management, Bradley Heim published a paper called The Effect of Recent Tax Changes on

More information

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Midterm

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Midterm Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay Midterm ChicagoBooth Honor Code: I pledge my honor that I have not violated the Honor Code during this

More information

Appendix. Table A.1 (Part A) The Author(s) 2015 G. Chakrabarti and C. Sen, Green Investing, SpringerBriefs in Finance, DOI /

Appendix. Table A.1 (Part A) The Author(s) 2015 G. Chakrabarti and C. Sen, Green Investing, SpringerBriefs in Finance, DOI / Appendix Table A.1 (Part A) Dependent variable: probability of crisis (own) Method: ML binary probit (quadratic hill climbing) Included observations: 47 after adjustments Convergence achieved after 6 iterations

More information

Generalized Multilevel Regression Example for a Binary Outcome

Generalized Multilevel Regression Example for a Binary Outcome Psy 510/610 Multilevel Regression, Spring 2017 1 HLM Generalized Multilevel Regression Example for a Binary Outcome Specifications for this Bernoulli HLM2 run Problem Title: no title The data source for

More information

Stat 401XV Exam 3 Spring 2017

Stat 401XV Exam 3 Spring 2017 Stat 40XV Exam Spring 07 I have neither given nor received unauthorized assistance on this exam. Name Signed Date Name Printed ATTENTION! Incorrect numerical answers unaccompanied by supporting reasoning

More information

Multiple regression - a brief introduction

Multiple regression - a brief introduction Multiple regression - a brief introduction Multiple regression is an extension to regular (simple) regression. Instead of one X, we now have several. Suppose, for example, that you are trying to predict

More information

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay. Solutions to Final Exam

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay. Solutions to Final Exam The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay Solutions to Final Exam Problem A: (40 points) Answer briefly the following questions. 1. Describe

More information

The Economic Consequences of Dollar Appreciation for US Manufacturing Investment: A Time-Series Analysis

The Economic Consequences of Dollar Appreciation for US Manufacturing Investment: A Time-Series Analysis The Economic Consequences of Dollar Appreciation for US Manufacturing Investment: A Time-Series Analysis Robert A. Blecker Unpublished Appendix to Paper Forthcoming in the International Review of Applied

More information