Determining Probability Estimates From Logistic Regression Results Vartanian: SW 541
|
|
- Felicia Norton
- 6 years ago
- Views:
Transcription
1 Determining Probability Estimates From Logistic Regression Results Vartanian: SW 541 In determining logistic regression results, you will generally be given the odds ratio in the SPSS or SAS output. However, you will not be given the probability estimates for different values of your independent variables. For example, you may be examining the likelihood of being in poverty and would like to know what the probability is for single mothers, for those with 5 children, or for single mothers with 5 children. Through the method I will show you below, you will be able to determine the probability estimates for any group you=re interested in. I will show you how to determine these probability estimates in SAS. Let=s say we=re examining the likelihood of having income below the poverty line (inpov) and are using 4 independent variables: 1.Whether a person lives in public housing (pubhouse B dummy variable) 2. Whether a person lives in a big city (over 750,000 population) (bigcit -- dummy variable) 3. The wages of the head of household (wagehd B continuous variable) 4. The level of education of the head of household (edhd -- continuous variable) The first step will be to use SAS to determine the logistic regression results. SAS automatically determines the likelihood of the 0 condition instead of the likelihood of the 1 condition (for example, the likelihood of not being in poverty), unless you use a descending command, which we will do. Prob(pov = 1)= Remember that the logistic equation takes the following form: We will use this form to determine probability estimates for the different variables within the model. While some researchers use mean values of all other variables to determine the probability estimates of a given variable, you will use the actual observed values to determine these probabilities. e 1+e You will use SAS commands to determine the probabilities of having income below the poverty line for individuals with specific conditions. Below, I have presented the first set of SAS commands to determine the logistic regression results and then to determine the probability estimates from these results. a+xb a+ xb libname in 'P:\pubdata\gssw'; (or wherever the data is located) D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 1
2 data a;set in.psidphd; * You will use this a variable as a merge variable later. For some reason, SAS insists on having this variable to merge by. proc logist descending outest=dd maxiter=100; output out=cc xbeta=xb; model inpov=pubhouse bigcit wagehd edhd; *proc logist is the command for determining logistic regression results. data f;set dd; rename pubhouse=cpubhous bigcit=cbigcit wagehd=cwagehd edhd=cedhd; drop _type_; * You=re creating this merge variable in the second data set, so you can eventually merge the two data sets together. What the different commands mean: The outest=dd. This command tells SAS to create a new data set that only contains coefficient estimates from the logistic regression model. These newly created variables (the b coefficients) have the same names as the original variables (the X variables). We will need to change the names of these because we will eventually be multiply the Xs and the Bs together to determine the probability of an event occurring. Notice that we SET the variables from data set CC into data set f, and we use a rename statement to give these coefficient estimates new names. I=ve changed the names of the variables from their regular names to names that begin with c (for coefficient estimate). Maxiter=100. Logistic regression uses an interative process to determine b coefficients. The process keeps iterating until a stable b coefficient if found. The default number of iterations in SAS is 25. I have simply upped this number to 100 iterations. Out=CC. This is a means for creating a new data set that contains all the variables from the data that went into the logistic regression analysis. Thus, the data in the new data set CC, contains all the variables that went into the logistic run. You will need these variables to determine the probability estimates B these are your X variables. Xbeta=XB. This gives you the estimate for the variable values for each observation times the b coefficient. This XB estimate includes the estimate for the intercept (or a+xb). You=ll be able to use this value for determining probability estimates. However, you will be adding or subtracting from this value, depending on the probability you=re examining. D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 2
3 SECOND SET OF SAS COMMANDS: data g;merge f cc;by a; Now, we will merge together the two created data sets, which will put the b coefficient estimates and the variable values into one data set. Each observation will have the same values for the b coefficients and will have different values for the variables. For example, each observation in the new merged data set below will have the same value for cpubhous=1.4513, while the value for pubhouse will depend on whether or not they lived in public housing (1=yes, 0=no). xb_nopub=xb-cpubhous*pubhouse; * This is the derivation of your XB for the likelihood of being in poverty for those who do not live in public housing. What I=ve done is taken the overall XB (which includes the XB for public housing) and subtracted off the coefficient*variable estimate for living in public housing. This is the same as setting pubhouse=0. In other words, we=re asking for the likelihood of being in poverty, given that all individuals do not live in public housing. This is just as what we did with the estimates in class. That is, we wanted to determine the probability of being in poverty if someone had 5 kids. We substituted the number 5 for the X and multiplied it by the b coefficient. What we will do here is determine this probability for all individuals (holding all else equal), and then take a mean for the sample. This will give us the overall likelihood of being in the condition. xb_pub=xb_nopub+cpubhous; * In this second estimate for public housing, I=m determining the XB for those who live in public housing. We again need to subtract off the coefficient*variable estimates for public housing (as we did above), and then add back in cpubhous*1 (or simply cpubhous). We=re again forcing everyone into a particular state, they live in public housing. We=ll then see how this affects their likelihood of being in poverty. Xb_nobig=xb-cbigcit*bigcit; xb_big=xb_nobig+cbigcit; xb_wag5=xb-cwagehd*wagehd+cwagehd*5; xb_wag10=xb-cwagehd*wagehd+cwagehd*10; xb_wag1=xb-cwagehd*wagehd+cwagehd*1; xb_ed10=xb-cedhd*edhd+cedhd*10; xb_ed12=xb-cedhd*edhd+cedhd*12; xb_ed16=xb-cedhd*edhd+cwagehd*16; With interval/ratio scale variables, we again need to subtract off the coefficient estimates of the variable times the actual value of the variable since these values are already contained in xb. We want to determine estimates for this variable at specific levels B not at the particular level of any individual. So we will add back on the coefficient estimate and multiply it by 10 (to get an estimate of those who have a 10 th grade education), or 12 (high school graduate) or 16 (college D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 3
4 graduate). Below, we take the Xb estimates from above and put them into a logistic form (see the formula above). Each individual within the sample will a value for each of the variables below. We will then take the mean, which will give us an overall average probability of being in poverty. pr_nopub=(exp(xb_nopub))/(1+exp(xb_nopub)); pr_pub=(exp(xb_pub))/(1+exp(xb_pub)); pr_nobig=(exp(xb_nobig))/(1+exp(xb_nobig)); pr_big=(exp(xb_big))/(1+exp(xb_big)); pr_wag5=(exp(xb_wag5))/(1+exp(xb_wag5)); pr_wag10=(exp(xb_wag10))/(1+exp(xb_wag10)); pr_wag1=(exp(xb_wag1))/(1+exp(xb_wag1)); pr_ed10=(exp(xb_ed10))/(1+exp(xb_ed10)); pr_ed12=(exp(xb_ed12))/(1+exp(xb_ed12)); pr_ed16=(exp(xb_ed16))/(1+exp(xb_ed16)); proc means;var pr_nopub pr_pub pr_nobig pr_big pr_wag5 pr_wag10 pr_wag1 pr_ed10 pr_ed12 pr_ed16; weight weight; run; Results The LOGISTIC Procedure Data Set: WORK.A Response Variable: INPOV Response Levels: 2 Number of Observations: Link Function: Logit Response Profile Ordered Value INPOV Count (In this sample, 3,208 lived below the poverty line, 12,198 did not.) Model Fitting Information and Testing Global Null Hypothesis BETA=0 Intercept Intercept and Criterion Only Covariates Chi-Square for Covariates AIC SC LOG L with 4 DF (p=0.0001) Score with 4 DF (p=0.0001) Analysis of Maximum Likelihood Estimates D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 4
5 Parameter Standard Wald Pr > Standardized Odds Variable DF Estimate Error Chi-Square Chi-Square Estimate Ratio INTERCPT PUBHOUSE BIGCIT WAGEHD EDHD Notice that all coefficient estimates are significantly related to being in poverty. Those living in public housing and in big cities are positively related to having income below the poverty line. Wages and education level of the head have a negative relationship to having income below the poverty line. Also note that -2 Log L is significant (p=.0001). Thus, all of the variables together are related to the dependent variable. Below are the probability estimates, or the likelihood of being in poverty, given particular conditions. The likelihood of being in poverty given that you don=t live in public housing, controlling for all other variables in the model, is 17.12%. For the probability for those living in public housing is %. For those with wages of $1/hour, the likelihood of being in poverty is 43.04%, while for those earning $10/hour, this likelihood is 7.64%. All of the probability estimates are determined holding all other variables within the model constant. The SAS System 15:40 Thursday, January 28, 1999 Variable N Mean Std Dev Minimum Maximum PR_NOPUB E PR_PUB E PR_NOBIG E PR_BIG E PR_WAG PR_WAG PR_WAG PR_ED E PR_ED E PR_ED E D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 5
6 The SAS program: libname in 'P:\pubdata\gssw'; (or wherever the data is located) data a;set in.psidphd; proc logist descending outest=dd maxiter=100; output out=cc xbeta=xb; model inpov=pubhouse bigcit wagehd edhd; data f;set dd; rename pubhouse=cpubhous bigcit=cbigcit wagehd=cwagehd edhd=cedhd; drop _type_; data g;merge f cc;by a; xb_nopub=xb-cpubhous*pubhouse; xb_pub=xb_nopub+cpubhous; Xb_nobig=xb-cbigcit*bigcit; xb_big=xb_nobig+cbigcit; xb_wag5=xb-cwagehd*wagehd+cwagehd*5; xb_wag10=xb-cwagehd*wagehd+cwagehd*10; xb_wag1=xb-cwagehd*wagehd+cwagehd*1; xb_ed10=xb-cedhd*edhd+cedhd*10; xb_ed12=xb-cedhd*edhd+cedhd*12; xb_ed16=xb-cedhd*edhd+cwagehd*16; pr_nopub=(exp(xb_nopub))/(1+exp(xb_nopub)); pr_pub=(exp(xb_pub))/(1+exp(xb_pub)); pr_nobig=(exp(xb_nobig))/(1+exp(xb_nobig)); pr_big=(exp(xb_big))/(1+exp(xb_big)); pr_wag5=(exp(xb_wag5))/(1+exp(xb_wag5)); pr_wag10=(exp(xb_wag10))/(1+exp(xb_wag10)); pr_wag1=(exp(xb_wag1))/(1+exp(xb_wag1)); pr_ed10=(exp(xb_ed10))/(1+exp(xb_ed10)); pr_ed12=(exp(xb_ed12))/(1+exp(xb_ed12)); pr_ed16=(exp(xb_ed16))/(1+exp(xb_ed16)); proc means;var pr_nopub pr_pub pr_nobig pr_big pr_wag5 pr_wag10 pr_wag1 pr_ed10 pr_ed12 pr_ed16; weight weight; run; D:\WP60\LECT2.PHD\LOGIST\LOGIST.PROBAB.SAS.WPD Page 6
proc genmod; model malform/total = alcohol / dist=bin link=identity obstats; title 'Table 2.7'; title2 'Identity Link';
BIOS 6244 Analysis of Categorical Data Assignment 5 s 1. Consider Exercise 4.4, p. 98. (i) Write the SAS code, including the DATA step, to fit the linear probability model and the logit model to the data
More information[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA]
Tutorial #3 This example uses data in the file 16.09.2011.dta under Tutorial folder. It contains 753 observations from a sample PSID data on the labor force status of married women in the U.S in 1975.
More informationFinal Exam - section 1. Thursday, December hours, 30 minutes
Econometrics, ECON312 San Francisco State University Michael Bar Fall 2013 Final Exam - section 1 Thursday, December 19 1 hours, 30 minutes Name: Instructions 1. This is closed book, closed notes exam.
More informationAlastair Hall ECG 790F: Microeconometrics Spring Computer Handout # 2. Estimation of binary response models : part II
Alastair Hall ECG 790F: Microeconometrics Spring 2006 Computer Handout # 2 Estimation of binary response models : part II In this handout, we discuss the estimation of binary response models with and without
More informationtm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6}
PS 4 Monday August 16 01:00:42 2010 Page 1 tm / / / / / / / / / / / / Statistics/Data Analysis User: Klick Project: Limited Dependent Variables{space -6} log: C:\web\PS4log.smcl log type: smcl opened on:
More informationTable 4. Probit model of union membership. Probit coefficients are presented below. Data from March 2008 Current Population Survey.
1. Using a probit model and data from the 2008 March Current Population Survey, I estimated a probit model of the determinants of pension coverage. Three specifications were estimated. The first included
More informationLogistic Regression with R: Example One
Logistic Regression with R: Example One math = read.table("http://www.utstat.toronto.edu/~brunner/appliedf12/data/mathcat.data") math[1:5,] hsgpa hsengl hscalc course passed outcome 1 78.0 80 Yes Mainstrm
More informationLecture 21: Logit Models for Multinomial Responses Continued
Lecture 21: Logit Models for Multinomial Responses Continued Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University
More informationMultiple Regression and Logistic Regression II. Dajiang 525 Apr
Multiple Regression and Logistic Regression II Dajiang Liu @PHS 525 Apr-19-2016 Materials from Last Time Multiple regression model: Include multiple predictors in the model = + + + + How to interpret the
More informationIntro to GLM Day 2: GLM and Maximum Likelihood
Intro to GLM Day 2: GLM and Maximum Likelihood Federico Vegetti Central European University ECPR Summer School in Methods and Techniques 1 / 32 Generalized Linear Modeling 3 steps of GLM 1. Specify the
More informationCalculating the Probabilities of Member Engagement
Calculating the Probabilities of Member Engagement by Larry J. Seibert, Ph.D. Binary logistic regression is a regression technique that is used to calculate the probability of an outcome when there are
More informationMultinomial Logit Models - Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 13, 2017
Multinomial Logit Models - Overview Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 13, 2017 This is adapted heavily from Menard s Applied Logistic Regression
More informationSTA 4504/5503 Sample questions for exam True-False questions.
STA 4504/5503 Sample questions for exam 2 1. True-False questions. (a) For General Social Survey data on Y = political ideology (categories liberal, moderate, conservative), X 1 = gender (1 = female, 0
More informationsociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods
1 SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 Lecture 10: Multinomial regression baseline category extension of binary What if we have multiple possible
More informationLogit Models for Binary Data
Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis These models are appropriate when the response
More informationPASS Sample Size Software
Chapter 850 Introduction Cox proportional hazards regression models the relationship between the hazard function λ( t X ) time and k covariates using the following formula λ log λ ( t X ) ( t) 0 = β1 X1
More information############################ ### toxo.r ### ############################
############################ ### toxo.r ### ############################ toxo < read.table(file="n:\\courses\\stat8620\\fall 08\\toxo.dat",header=T) #toxo < read.table(file="c:\\documents and Settings\\dhall\\My
More informationEstimation Procedure for Parametric Survival Distribution Without Covariates
Estimation Procedure for Parametric Survival Distribution Without Covariates The maximum likelihood estimates of the parameters of commonly used survival distribution can be found by SAS. The following
More informationReview questions for Multinomial Logit/Probit, Tobit, Heckit, Quantile Regressions
1. I estimated a multinomial logit model of employment behavior using data from the 2006 Current Population Survey. The three possible outcomes for a person are employed (outcome=1), unemployed (outcome=2)
More informationOrdinal Multinomial Logistic Regression. Thom M. Suhy Southern Methodist University May14th, 2013
Ordinal Multinomial Logistic Thom M. Suhy Southern Methodist University May14th, 2013 GLM Generalized Linear Model (GLM) Framework for statistical analysis (Gelman and Hill, 2007, p. 135) Linear Continuous
More informationMarket Variables and Financial Distress. Giovanni Fernandez Stetson University
Market Variables and Financial Distress Giovanni Fernandez Stetson University In this paper, I investigate the predictive ability of market variables in correctly predicting and distinguishing going concern
More informationBEcon Program, Faculty of Economics, Chulalongkorn University Page 1/7
Mid-term Exam (November 25, 2005, 0900-1200hr) Instructions: a) Textbooks, lecture notes and calculators are allowed. b) Each must work alone. Cheating will not be tolerated. c) Attempt all the tests.
More informationTo be two or not be two, that is a LOGISTIC question
MWSUG 2016 - Paper AA18 To be two or not be two, that is a LOGISTIC question Robert G. Downer, Grand Valley State University, Allendale, MI ABSTRACT A binary response is very common in logistic regression
More informationARIMA ANALYSIS WITH INTERVENTIONS / OUTLIERS
TASK Run intervention analysis on the price of stock M: model a function of the price as ARIMA with outliers and interventions. SOLUTION The document below is an abridged version of the solution provided
More informationCHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES
Examples: Monte Carlo Simulation Studies CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES Monte Carlo simulation studies are often used for methodological investigations of the performance of statistical
More informationAssessment on Credit Risk of Real Estate Based on Logistic Regression Model
Assessment on Credit Risk of Real Estate Based on Logistic Regression Model Li Hongli 1, a, Song Liwei 2,b 1 Chongqing Engineering Polytechnic College, Chongqing400037, China 2 Division of Planning and
More informationComparing Odds Ratios and Marginal Effects from Logistic Regression and Linear Probability Models
Western Kentucky University From the SelectedWorks of Matt Bogard Spring March 11, 2016 Comparing Odds Ratios and Marginal Effects from Logistic Regression and Linear Probability Models Matt Bogard Available
More informationActuarial Research on the Effectiveness of Collision Avoidance Systems FCW & LDW. A translation from Hebrew to English of a research paper prepared by
Actuarial Research on the Effectiveness of Collision Avoidance Systems FCW & LDW A translation from Hebrew to English of a research paper prepared by Ron Actuarial Intelligence LTD Contact Details: Shachar
More informationFinancial Econometrics: Problem Set # 3 Solutions
Financial Econometrics: Problem Set # 3 Solutions N Vera Chau The University of Chicago: Booth February 9, 219 1 a. You can generate the returns using the exact same strategy as given in problem 2 below.
More informationModule 9: Single-level and Multilevel Models for Ordinal Responses. Stata Practical 1
Module 9: Single-level and Multilevel Models for Ordinal Responses Pre-requisites Modules 5, 6 and 7 Stata Practical 1 George Leckie, Tim Morris & Fiona Steele Centre for Multilevel Modelling If you find
More informationGeneralized Linear Models
Generalized Linear Models Scott Creel Wednesday, September 10, 2014 This exercise extends the prior material on using the lm() function to fit an OLS regression and test hypotheses about effects on a parameter.
More informationClass Notes: Week 6. Multinomial Outcomes
Ronald Hek Class Notes: Week 6 1 Class Notes: Week 6 Multinomial Outomes For the next ouple of weeks or so, we will look at models where there are more than two ategories of outomes. Multinomial logisti
More informationLogistic Regression. Logistic Regression Theory
Logistic Regression Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Logistic Regression The linear probability model.
More informationMaximum Likelihood Estimation
Maximum Likelihood Estimation EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #6 EPSY 905: Maximum Likelihood In This Lecture The basics of maximum likelihood estimation Ø The engine that
More informationSociology Exam 3 Answer Key - DRAFT May 8, 2007
Sociology 63993 Exam 3 Answer Key - DRAFT May 8, 2007 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. The odds of an event occurring
More informationUsing New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Paul J. Hilliard, Educational Testing Service (ETS)
Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds INTRODUCTION Multicategory Logit
More information9. Logit and Probit Models For Dichotomous Data
Sociology 740 John Fox Lecture Notes 9. Logit and Probit Models For Dichotomous Data Copyright 2014 by John Fox Logit and Probit Models for Dichotomous Responses 1 1. Goals: I To show how models similar
More informationT.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION
In Inferential Statistic, ESTIMATION (i) (ii) is called the True Population Mean and is called the True Population Proportion. You must also remember that are not the only population parameters. There
More informationMaximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 13, 2018
Maximum Likelihood Estimation Richard Williams, University of otre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 3, 208 [This handout draws very heavily from Regression Models for Categorical
More informationGetting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)
Getting Started in Logit and Ordered Logit Regression (ver. 3. beta Oscar Torres-Reyna Data Consultant otorres@princeton.edu http://dss.princeton.edu/training/ Logit model Use logit models whenever your
More informationDummy Variables. 1. Example: Factors Affecting Monthly Earnings
Dummy Variables A dummy variable or binary variable is a variable that takes on a value of 0 or 1 as an indicator that the observation has some kind of characteristic. Common examples: Sex (female): FEMALE=1
More informationGetting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)
Getting Started in Logit and Ordered Logit Regression (ver. 3. beta Oscar Torres-Reyna Data Consultant otorres@princeton.edu http://dss.princeton.edu/training/ Logit model Use logit models whenever your
More informationChapter 4 Level of Volatility in the Indian Stock Market
Chapter 4 Level of Volatility in the Indian Stock Market Measurement of volatility is an important issue in financial econometrics. The main reason for the prominent role that volatility plays in financial
More informationBrief Sketch of Solutions: Tutorial 1. 2) descriptive statistics and correlogram. Series: LGCSI Sample 12/31/ /11/2009 Observations 2596
Brief Sketch of Solutions: Tutorial 1 2) descriptive statistics and correlogram 240 200 160 120 80 40 0 4.8 5.0 5.2 5.4 5.6 5.8 6.0 6.2 Series: LGCSI Sample 12/31/1999 12/11/2009 Observations 2596 Mean
More informationLogistic Regression Analysis
Revised July 2018 Logistic Regression Analysis This set of notes shows how to use Stata to estimate a logistic regression equation. It assumes that you have set Stata up on your computer (see the Getting
More informationMaximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 10, 2017
Maximum Likelihood Estimation Richard Williams, University of otre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 0, 207 [This handout draws very heavily from Regression Models for Categorical
More informationBrief Sketch of Solutions: Tutorial 2. 2) graphs. 3) unit root tests
Brief Sketch of Solutions: Tutorial 2 2) graphs LJAPAN DJAPAN 5.2.12 5.0.08 4.8.04 4.6.00 4.4 -.04 4.2 -.08 4.0 01 02 03 04 05 06 07 08 09 -.12 01 02 03 04 05 06 07 08 09 LUSA DUSA 7.4.12 7.3 7.2.08 7.1.04
More informationGov 2001: Section 5. I. A Normal Example II. Uncertainty. Gov Spring 2010
Gov 2001: Section 5 I. A Normal Example II. Uncertainty Gov 2001 Spring 2010 A roadmap We started by introducing the concept of likelihood in the simplest univariate context one observation, one variable.
More informationa. Explain why the coefficients change in the observed direction when switching from OLS to Tobit estimation.
1. Using data from IRS Form 5500 filings by U.S. pension plans, I estimated a model of contributions to pension plans as ln(1 + c i ) = α 0 + U i α 1 + PD i α 2 + e i Where the subscript i indicates the
More informationPhd Program in Transportation. Transport Demand Modeling. Session 11
Phd Program in Transportation Transport Demand Modeling João de Abreu e Silva Session 11 Binary and Ordered Choice Models Phd in Transportation / Transport Demand Modelling 1/26 Heterocedasticity Homoscedasticity
More informationAllison notes there are two conditions for using fixed effects methods.
Panel Data 3: Conditional Logit/ Fixed Effects Logit Models Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised April 2, 2017 These notes borrow very heavily, sometimes
More informationCatherine De Vries, Spyros Kosmidis & Andreas Murr
APPLIED STATISTICS FOR POLITICAL SCIENTISTS WEEK 8: DEPENDENT CATEGORICAL VARIABLES II Catherine De Vries, Spyros Kosmidis & Andreas Murr Topic: Logistic regression. Predicted probabilities. STATA commands
More informationCREDIT RISK MODELING IN R. Logistic regression: introduction
CREDIT RISK MODELING IN R Logistic regression: introduction Final data structure > str(training_set) 'data.frame': 19394 obs. of 8 variables: $ loan_status : Factor w/ 2 levels "0","1": 1 1 1 1 1 1 1 1
More informationCREDIT SCORING & CREDIT CONTROL XIV August 2015 Edinburgh. Aneta Ptak-Chmielewska Warsaw School of Ecoomics
CREDIT SCORING & CREDIT CONTROL XIV 26-28 August 2015 Edinburgh Aneta Ptak-Chmielewska Warsaw School of Ecoomics aptak@sgh.waw.pl 1 Background literature Hypothesis Data and methods Empirical example Conclusions
More informationu panel_lecture . sum
u panel_lecture sum Variable Obs Mean Std Dev Min Max datastre 639 9039644 6369418 900228 926665 year 639 1980 2584012 1976 1984 total_sa 639 9377839 3212313 682 441e+07 tot_fixe 639 5214385 1988422 642
More informationOrdinary Least Squares Regression Examples Vartanian: SW 504
Ordinary Least Squares Regression Examples Vartanian: SW 504 1. In this first ordinary least squares regression model, the dependent variable is AFDC income. The independent variable is number of kids
More informationTests for the Odds Ratio in a Matched Case-Control Design with a Binary X
Chapter 156 Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X Introduction This procedure calculates the power and sample size necessary in a matched case-control study designed
More informationVariance clustering. Two motivations, volatility clustering, and implied volatility
Variance modelling The simplest assumption for time series is that variance is constant. Unfortunately that assumption is often violated in actual data. In this lecture we look at the implications of time
More informationNegative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction
Negative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction Statistics 149 Spring 2006 Copyright 2006 by Mark E. Irwin Negative Binomial Family Example: Absenteeism from
More informationBayesian Multinomial Model for Ordinal Data
Bayesian Multinomial Model for Ordinal Data Overview This example illustrates how to fit a Bayesian multinomial model by using the built-in mutinomial density function (MULTINOM) in the MCMC procedure
More informationDidacticiel - Études de cas. In this tutorial, we show how to implement a multinomial logistic regression with TANAGRA.
Subject In this tutorial, we show how to implement a multinomial logistic regression with TANAGRA. Logistic regression is a technique for maing predictions when the dependent variable is a dichotomy, and
More informationQuantitative Techniques Term 2
Quantitative Techniques Term 2 Laboratory 7 2 March 2006 Overview The objective of this lab is to: Estimate a cost function for a panel of firms; Calculate returns to scale; Introduce the command cluster
More informationbook 2014/5/6 15:21 page 261 #285
book 2014/5/6 15:21 page 261 #285 Chapter 10 Simulation Simulations provide a powerful way to answer questions and explore properties of statistical estimators and procedures. In this chapter, we will
More informationFinal Exam, section 2. Tuesday, December hour, 30 minutes
San Francisco State University Michael Bar ECON 312 Fall 2018 Final Exam, section 2 Tuesday, December 18 1 hour, 30 minutes Name: Instructions 1. This is closed book, closed notes exam. 2. You can use
More informationHomework Solutions - Lecture 2 Part 2
Homework Solutions - Lecture 2 Part 2 1. In 1995, Time Warner Inc. had a Beta of 1.61. Part of the reason for this high Beta was the debt left over from the leveraged buyout of Time by Warner in 1989,
More informationFinal Exam Suggested Solutions
University of Washington Fall 003 Department of Economics Eric Zivot Economics 483 Final Exam Suggested Solutions This is a closed book and closed note exam. However, you are allowed one page of handwritten
More informationCategorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt.
Categorical Outcomes Statistical Modelling in Stata: Categorical Outcomes Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Nominal Ordinal 28/11/2017 R by C Table: Example Categorical,
More informationEXST7015: Multiple Regression from Snedecor & Cochran (1967) RAW DATA LISTING
Multiple (Linear) Regression Introductory example Page 1 1 options ps=256 ls=132 nocenter nodate nonumber; 3 DATA ONE; 4 TITLE1 ''; 5 INPUT X1 X2 X3 Y; 6 **** LABEL Y ='Plant available phosphorus' 7 X1='Inorganic
More informationEconometric Methods for Valuation Analysis
Econometric Methods for Valuation Analysis Margarita Genius Dept of Economics M. Genius (Univ. of Crete) Econometric Methods for Valuation Analysis Cagliari, 2017 1 / 25 Outline We will consider econometric
More informationHierarchical Generalized Linear Models. Measurement Incorporated Hierarchical Linear Models Workshop
Hierarchical Generalized Linear Models Measurement Incorporated Hierarchical Linear Models Workshop Hierarchical Generalized Linear Models So now we are moving on to the more advanced type topics. To begin
More informationModule 4 Bivariate Regressions
AGRODEP Stata Training April 2013 Module 4 Bivariate Regressions Manuel Barron 1 and Pia Basurto 2 1 University of California, Berkeley, Department of Agricultural and Resource Economics 2 University of
More informationUnit 8 Notes: Solving Quadratics by Factoring Alg 1
Unit 8 Notes: Solving Quadratics by Factoring Alg 1 Name Period Day Date Assignment (Due the next class meeting) Tuesday Wednesday Thursday Friday Monday Tuesday Wednesday Thursday Friday Monday Tuesday
More informationChapter 18: The Correlational Procedures
Introduction: In this chapter we are going to tackle about two kinds of relationship, positive relationship and negative relationship. Positive Relationship Let's say we have two values, votes and campaign
More informationSean Howard Econometrics Final Project Paper. An Analysis of the Determinants and Factors of Physical Education Attendance in the Fourth Quarter
Sean Howard Econometrics Final Project Paper An Analysis of the Determinants and Factors of Physical Education Attendance in the Fourth Quarter Introduction This project attempted to gain a more complete
More informationAdvanced Econometrics
Advanced Econometrics Instructor: Takashi Yamano 11/14/2003 Due: 11/21/2003 Homework 5 (30 points) Sample Answers 1. (16 points) Read Example 13.4 and an AER paper by Meyer, Viscusi, and Durbin (1995).
More informationModelling the potential human capital on the labor market using logistic regression in R
Modelling the potential human capital on the labor market using logistic regression in R Ana-Maria Ciuhu (dobre.anamaria@hotmail.com) Institute of National Economy, Romanian Academy; National Institute
More informationFinancial Literacy in Urban India: A Case Study of Bohra Community in Mumbai
MPRA Munich Personal RePEc Archive Financial Literacy in Urban India: A Case Study of Bohra Community in Mumbai Tirupati Basutkar Ramanand Arya D. A. V. College, Mumbai, India 8 January 2016 Online at
More informationGraduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay. Midterm
Graduate School of Business, University of Chicago Business 41202, Spring Quarter 2007, Mr. Ruey S. Tsay Midterm GSB Honor Code: I pledge my honor that I have not violated the Honor Code during this examination.
More informationApplying Logistics Regression to Forecast Annual Organizational Retirements
SESUG Paper SD-137-2017 Applying Logistics Regression to Forecast Annual Organizational Retirements Alan Dunham, Greybeard Solutions, LLC ABSTRACT This paper briefly discusses the labor economics research
More informationProblem Set 5 Answers. ( ) 2. Yes, like temperature. See the plot of utility in the notes. Marginal utility should be positive.
Business John H. Cochrane Problem Set Answers Part I A simple very short readings questions. + = + + + = + + + + = ( ). Yes, like temperature. See the plot of utility in the notes. Marginal utility should
More informationLAMPIRAN. Null Hypothesis: LO has a unit root Exogenous: Constant Lag Length: 1 (Automatic based on SIC, MAXLAG=13)
74 LAMPIRAN Lampiran 1 Analisis ARIMA 1.1. Uji Stasioneritas Variabel 1. Data Harga Minyak Riil Level Null Hypothesis: LO has a unit root Lag Length: 1 (Automatic based on SIC, MAXLAG=13) Augmented Dickey-Fuller
More informationFall 2004 Social Sciences 7418 University of Wisconsin-Madison Problem Set 5 Answers
Economics 310 Menzie D. Chinn Fall 2004 Social Sciences 7418 University of Wisconsin-Madison Problem Set 5 Answers This problem set is due in lecture on Wednesday, December 15th. No late problem sets will
More informationEstimating Support Labor for a Production Program
Estimating Support Labor for a Production Program ISPA / SCEA Joint Conference June 24-27, 2008 Jeff Platten PMP, CCE/A Systems Project Engineer Northrop Grumman Corporation Biography Jeff Platten is a
More informationDATABASE AND RESEARCH METHODOLOGY
CHAPTER III DATABASE AND RESEARCH METHODOLOGY The nature of the present study Direct Tax Reforms in India: A Comparative Study of Pre and Post-liberalization periods is such that it requires secondary
More informationEconomics 413: Economic Forecast and Analysis Department of Economics, Finance and Legal Studies University of Alabama
Problem Set #1 (Linear Regression) 1. The file entitled MONEYDEM.XLS contains quarterly values of seasonally adjusted U.S.3-month ( 3 ) and 1-year ( 1 ) treasury bill rates. Each series is measured over
More informationHow can saving deposit rate and Hang Seng Index affect housing prices : an empirical study in Hong Kong market
Lingnan Journal of Banking, Finance and Economics Volume 2 2010/2011 Academic Year Issue Article 3 January 2010 How can saving deposit rate and Hang Seng Index affect housing prices : an empirical study
More informationFinal Exam, section 1. Thursday, May hour, 30 minutes
San Francisco State University Michael Bar ECON 312 Spring 2018 Final Exam, section 1 Thursday, May 17 1 hour, 30 minutes Name: Instructions 1. This is closed book, closed notes exam. 2. You can use one
More informationStatistical Intervals (One sample) (Chs )
7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and
More informationDiscrete Choice Modeling William Greene Stern School of Business, New York University. Lab Session 2 Binary Choice Modeling with Panel Data
Discrete Choice Modeling William Greene Stern School of Business, New York University Lab Session 2 Binary Choice Modeling with Panel Data This assignment will extend the models of binary choice and ordered
More informationChapter 8 Exercises 1. Data Analysis & Graphics Using R Solutions to Exercises (May 1, 2010)
Chapter 8 Exercises 1 Data Analysis & Graphics Using R Solutions to Exercises (May 1, 2010) Preliminaries > library(daag) Exercise 1 The following table shows numbers of occasions when inhibition (i.e.,
More informationlogistic logistic Merton Black - Scholes Black&Cox Merton Longstaff&Schwarlz Jarrow&Turnbull
29 6 Vol. 29 No. 6 2016 11 Research of Finance and Education Nov. 2016 logistic 271000 logistic 2011-2014 80 A 21 logistic F830. 33 A 2095-0098 2016 06-0027 - 08 1 20 70 Merton 1974 1 Black - Scholes Black&Cox
More informationRenters Report Future Home Buying Optimism, While Family Financial Assistance Is Most Available to Populations with Higher Homeownership Rates
Renters Report Future Home Buying Optimism, While Family Financial Assistance Is Most Available to Populations with Higher Homeownership Rates National Housing Survey Topic Analysis Q3 2016 Published on
More information1) The Effect of Recent Tax Changes on Taxable Income
1) The Effect of Recent Tax Changes on Taxable Income In the most recent issue of the Journal of Policy Analysis and Management, Bradley Heim published a paper called The Effect of Recent Tax Changes on
More informationBooth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Midterm
Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay Midterm ChicagoBooth Honor Code: I pledge my honor that I have not violated the Honor Code during this
More informationAppendix. Table A.1 (Part A) The Author(s) 2015 G. Chakrabarti and C. Sen, Green Investing, SpringerBriefs in Finance, DOI /
Appendix Table A.1 (Part A) Dependent variable: probability of crisis (own) Method: ML binary probit (quadratic hill climbing) Included observations: 47 after adjustments Convergence achieved after 6 iterations
More informationGeneralized Multilevel Regression Example for a Binary Outcome
Psy 510/610 Multilevel Regression, Spring 2017 1 HLM Generalized Multilevel Regression Example for a Binary Outcome Specifications for this Bernoulli HLM2 run Problem Title: no title The data source for
More informationStat 401XV Exam 3 Spring 2017
Stat 40XV Exam Spring 07 I have neither given nor received unauthorized assistance on this exam. Name Signed Date Name Printed ATTENTION! Incorrect numerical answers unaccompanied by supporting reasoning
More informationMultiple regression - a brief introduction
Multiple regression - a brief introduction Multiple regression is an extension to regular (simple) regression. Instead of one X, we now have several. Suppose, for example, that you are trying to predict
More informationThe University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay. Solutions to Final Exam
The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay Solutions to Final Exam Problem A: (40 points) Answer briefly the following questions. 1. Describe
More informationThe Economic Consequences of Dollar Appreciation for US Manufacturing Investment: A Time-Series Analysis
The Economic Consequences of Dollar Appreciation for US Manufacturing Investment: A Time-Series Analysis Robert A. Blecker Unpublished Appendix to Paper Forthcoming in the International Review of Applied
More information