Generalized Linear Models

Size: px
Start display at page:

Download "Generalized Linear Models"

Transcription

1 Generalized Linear Models Ordinal Logistic Regression Dr. Tackett / 26

2 Announcements HW 8 due Thursday, 11/29 Lab 10 due Sunday, 12/2 Exam II, Thursday 12/6 2 / 26

3 Packages library(knitr) library(broom) library(dplyr) library(tibble) library(ggplot2) library(readr) library(cowplot) library(arm) #binned residuals library(nnet) #multinomial regression models 3 / 26

4 Multinomial Logistic Regression 4 / 26

5 Multinomial Logistic Regression If we have an explanatory variable, then we want to be a function of Choose a baseline category. Let's choose. Then, 5 / 26

6 Multinomial Logistic Regression If we have an explanatory variable, then we want to be a function of Choose a baseline category. Let's choose. Then, We have a separate model for each category 5 / 26

7 Multinomial Logistic Regression Suppose we have a response variable that can take three possible outcomes that are coded as 1,2,3 Let 1 be the baseline category. Then 6 / 26

8 Calculating Probabilities For, we calculate the probabilities, as For the baseline category,, we calculate the probability as 7 / 26

9 Interpreting Coe cients For every one unit increase in, the log-odds of vs. increase by Con dence Interval: to For every one unit increase in, the odds of vs. multiply by a factor of Con dence Interval: to 8 / 26

10 Model Diagnostics For each category of the response, : Analyze a plot of the binned residuals vs. each continuous explanatory variable Look for any patterns in the residuals plots For each categorical explanatory variable: Examine the average residuals for each category of the response variable 9 / 26

11 Example: Sesame Street We will analyze data from an experiment to test the e ectiveness of the children's program Sesame Street. As part of the experiment, children were assigned to one of two groups: those who were encouraged to watch the program and those who were not We want to understand what e ect the encouragement had on the frequency of viewing after adjusting for other characteristics 10 / 26

12 Response Variable viewcat 1: rarely watched show 2: once or twice a week 3: three to ve times a week 4: watched show on average more than ve times a week 11 / 26

13 Explanatory Variables age: child's age in months prenumb: score on numbers pretest (0 to 54) prelet: score on letters pretest (0 to 58) viewenc: 1: encouraged to watch, 0: not encouraged site: 1: three to ve year old from disadvantaged inner city area 2: four year old from advantaged suburban area 3: from advantaged rural area 4: from disadvantaged rural area 5: from Spanish speaking home 12 / 26

14 Example: Sesame Street Full description of data Analysis 13 / 26

15 Example: Sesame Street We built a model that could be used to describe the relationship between the amount a child watched Sesame Street and whether or not they were encouraged to watch the show The response variable viewcat has a natural and meaningful ordering which gets lost in the multinomial logistic model We can build an ordinal logistic model that takes into account the natural ordering of the response variable 14 / 26

16 Models for Ordinal Responses 15 / 26

17 Ordinal Responses Suppose is a response variable that can take values is an ordinal response if there is a natural ordering for categories Examples: Movie Ratings: 1 to 5 stars Survey Questions: Strongly Disagree to Strongly Agree Levels of Viewership: Rarely, Weekly, Daily 16 / 26

18 Cumulative Probabilities Suppose is a variable that can take values Assume the ordering Let be the associated probabilities The cumulative probability of response level j is 17 / 26

19 Proportional Odds Model Let be an ordinal response variable that takes levels with associated probabilities The proportional odds model can be written as the following: 18 / 26

20 Understanding the Model Why "proportional"? The proportional odds model assumes that each explanatory variable has the same e ect on the log-odds of being at or below category, regardless of what is 19 / 26

21 Understanding the Model Why "proportional"? The proportional odds model assumes that each explanatory variable has the same e ect on the log-odds of being at or below category, regardless of what is Why? Form of the model calculated by the polr() function in R Leads to a more intuitive interpretation of the coe cients 19 / 26

22 Interpreting the Intercept, When all explanatory variables equal 0, the log-odds of falling at or below category are 20 / 26

23 Interpreting the Intercept, When all explanatory variables equal 0, the log-odds of falling at or below category are When all explanatory variables equal 0, the odds of falling at or below category are 20 / 26

24 Interpreting the Slope, When increases by one unit, the log-odds of falling at or below category decrease by 21 / 26

25 Interpreting the Slope, When increases by one unit, the log-odds of falling at or below category decrease by When increases by one unit, the odds of falling at or below category multiply by a factor of 21 / 26

26 Interpreting the Slope, When increases by one unit, the log-odds of falling at or below category decrease by When increases by one unit, the odds of falling at or below category multiply by a factor of Con dence Intervals: Log-Odds: Odds: to to 21 / 26

27 Sesame Street: Interpreting prenumbcent Analysis For every one additional point on the numbers pretest, the logodds of a child being in viewing category or below, are expected to decrease by about For every one additional point on the numbers pretest, the odds of a child being in viewing category or below are expected to multiply by about / 26

28 Predicted Probabilities Predicted Cumulative Probability: 23 / 26

29 Predicted Probabilities Predicted Cumulative Probability: Predicted Probability: 23 / 26

30 Model Diagnostics For each category of the response, : Analyze a plot of the binned residuals vs. each continuous explanatory variable Look for any patterns in the residuals plots For each categorical explanatory variable: Examine the average residuals for each category of the response variable 24 / 26

31 Change in Deviance Test Use the change in deviance test to compare models (similar to logistic regression) To test signi cance of predictor(s) Fit model with predictor(s) and without predictor(s) Use anova() function to perform change in deviance test (test="chisq") Null hypothesis: coe cient(s) of additional predictor(s) equal to 0 25 / 26

32 Proportional Odds Model in R Use the polr() function the MASS package #calculate the proportional odds model library(mass) my.model <- polr(y ~ X1 + X Xp, data=my.data) # calculate predicted probabilities of being in each cateogry predicted.probability <- predict(my.model,type="probs") # find the predicted category for each observation predicted.category <- predict(my.model,type="class") 26 / 26

Ordinal and categorical variables

Ordinal and categorical variables Ordinal and categorical variables Ben Bolker October 29, 2018 Licensed under the Creative Commons attribution-noncommercial license (http: //creativecommons.org/licenses/by-nc/3.0/). Please share & remix

More information

Final Exam, section 1

Final Exam, section 1 San Francisco State University Michael Bar ECON 312 Fall 2015 Final Exam, section 1 Monday, December 14, 2015 Time: 1 hour, 30 minutes Name: Instructions: 1. This is closed book, closed notes exam. 2.

More information

Stat3011: Solution of Midterm Exam One

Stat3011: Solution of Midterm Exam One 1 Stat3011: Solution of Midterm Exam One Fall/2003, Tiefeng Jiang Name: Problem 1 (30 points). Choose one appropriate answer in each of the following questions. 1. (B ) The mean age of five people in a

More information

Principles of Econometrics Mid-Term

Principles of Econometrics Mid-Term Principles of Econometrics Mid-Term João Valle e Azevedo Sérgio Gaspar October 6th, 2008 Time for completion: 70 min For each question, identify the correct answer. For each question, there is one and

More information

Lecture 21: Logit Models for Multinomial Responses Continued

Lecture 21: Logit Models for Multinomial Responses Continued Lecture 21: Logit Models for Multinomial Responses Continued Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University

More information

9. Logit and Probit Models For Dichotomous Data

9. Logit and Probit Models For Dichotomous Data Sociology 740 John Fox Lecture Notes 9. Logit and Probit Models For Dichotomous Data Copyright 2014 by John Fox Logit and Probit Models for Dichotomous Responses 1 1. Goals: I To show how models similar

More information

Introduction to POL 217

Introduction to POL 217 Introduction to POL 217 Brad Jones 1 1 Department of Political Science University of California, Davis January 9, 2007 Topics of Course Outline Models for Categorical Data. Topics of Course Models for

More information

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt.

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt. Categorical Outcomes Statistical Modelling in Stata: Categorical Outcomes Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester Nominal Ordinal 28/11/2017 R by C Table: Example Categorical,

More information

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods 1 SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 Lecture 10: Multinomial regression baseline category extension of binary What if we have multiple possible

More information

STA 4504/5503 Sample questions for exam True-False questions.

STA 4504/5503 Sample questions for exam True-False questions. STA 4504/5503 Sample questions for exam 2 1. True-False questions. (a) For General Social Survey data on Y = political ideology (categories liberal, moderate, conservative), X 1 = gender (1 = female, 0

More information

Statistics 101: Section L - Laboratory 6

Statistics 101: Section L - Laboratory 6 Statistics 101: Section L - Laboratory 6 In today s lab, we are going to look more at least squares regression, and interpretations of slopes and intercepts. Activity 1: From lab 1, we collected data on

More information

Ordinal Multinomial Logistic Regression. Thom M. Suhy Southern Methodist University May14th, 2013

Ordinal Multinomial Logistic Regression. Thom M. Suhy Southern Methodist University May14th, 2013 Ordinal Multinomial Logistic Thom M. Suhy Southern Methodist University May14th, 2013 GLM Generalized Linear Model (GLM) Framework for statistical analysis (Gelman and Hill, 2007, p. 135) Linear Continuous

More information

Logistic Regression Analysis

Logistic Regression Analysis Revised July 2018 Logistic Regression Analysis This set of notes shows how to use Stata to estimate a logistic regression equation. It assumes that you have set Stata up on your computer (see the Getting

More information

Logit Models for Binary Data

Logit Models for Binary Data Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis These models are appropriate when the response

More information

Case Study: Applying Generalized Linear Models

Case Study: Applying Generalized Linear Models Case Study: Applying Generalized Linear Models Dr. Kempthorne May 12, 2016 Contents 1 Generalized Linear Models of Semi-Quantal Biological Assay Data 2 1.1 Coal miners Pneumoconiosis Data.................

More information

Girma Tefera*, Legesse Negash and Solomon Buke. Department of Statistics, College of Natural Science, Jimma University. Ethiopia.

Girma Tefera*, Legesse Negash and Solomon Buke. Department of Statistics, College of Natural Science, Jimma University. Ethiopia. Vol. 5(2), pp. 15-21, July, 2014 DOI: 10.5897/IJSTER2013.0227 Article Number: C81977845738 ISSN 2141-6559 Copyright 2014 Author(s) retain the copyright of this article http://www.academicjournals.org/ijster

More information

Questions of Statistical Analysis and Discrete Choice Models

Questions of Statistical Analysis and Discrete Choice Models APPENDIX D Questions of Statistical Analysis and Discrete Choice Models In discrete choice models, the dependent variable assumes categorical values. The models are binary if the dependent variable assumes

More information

XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING

XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING INTRODUCTION XLSTAT makes accessible to anyone a powerful, complete and user-friendly data analysis and statistical solution. Accessibility to

More information

Introduction to Population Modeling

Introduction to Population Modeling Introduction to Population Modeling In addition to estimating the size of a population, it is often beneficial to estimate how the population size changes over time. Ecologists often uses models to create

More information

proc genmod; model malform/total = alcohol / dist=bin link=identity obstats; title 'Table 2.7'; title2 'Identity Link';

proc genmod; model malform/total = alcohol / dist=bin link=identity obstats; title 'Table 2.7'; title2 'Identity Link'; BIOS 6244 Analysis of Categorical Data Assignment 5 s 1. Consider Exercise 4.4, p. 98. (i) Write the SAS code, including the DATA step, to fit the linear probability model and the logit model to the data

More information

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018 ` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.

More information

ORDERED MULTINOMIAL LOGISTIC REGRESSION ANALYSIS. Pooja Shivraj Southern Methodist University

ORDERED MULTINOMIAL LOGISTIC REGRESSION ANALYSIS. Pooja Shivraj Southern Methodist University ORDERED MULTINOMIAL LOGISTIC REGRESSION ANALYSIS Pooja Shivraj Southern Methodist University KINDS OF REGRESSION ANALYSES Linear Regression Logistic Regression Dichotomous dependent variable (yes/no, died/

More information

Multiple Regression and Logistic Regression II. Dajiang 525 Apr

Multiple Regression and Logistic Regression II. Dajiang 525 Apr Multiple Regression and Logistic Regression II Dajiang Liu @PHS 525 Apr-19-2016 Materials from Last Time Multiple regression model: Include multiple predictors in the model = + + + + How to interpret the

More information

Negative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction

Negative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction Negative Binomial Model for Count Data Log-linear Models for Contingency Tables - Introduction Statistics 149 Spring 2006 Copyright 2006 by Mark E. Irwin Negative Binomial Family Example: Absenteeism from

More information

σ e, which will be large when prediction errors are Linear regression model

σ e, which will be large when prediction errors are Linear regression model Linear regression model we assume that two quantitative variables, x and y, are linearly related; that is, the population of (x, y) pairs are related by an ideal population regression line y = α + βx +

More information

CREDIT RISK MODELING IN R. Logistic regression: introduction

CREDIT RISK MODELING IN R. Logistic regression: introduction CREDIT RISK MODELING IN R Logistic regression: introduction Final data structure > str(training_set) 'data.frame': 19394 obs. of 8 variables: $ loan_status : Factor w/ 2 levels "0","1": 1 1 1 1 1 1 1 1

More information

Stat 328, Summer 2005

Stat 328, Summer 2005 Stat 328, Summer 2005 Exam #2, 6/18/05 Name (print) UnivID I have neither given nor received any unauthorized aid in completing this exam. Signed Answer each question completely showing your work where

More information

Strategies for Assessing Health Plan Performance on Chronic Diseases: Selecting Performance Indicators and Applying Health-Based Risk Adjustment

Strategies for Assessing Health Plan Performance on Chronic Diseases: Selecting Performance Indicators and Applying Health-Based Risk Adjustment Strategies for Assessing Health Plan Performance on Chronic Diseases: Selecting Performance Indicators and Applying Health-Based Risk Adjustment Appendix I Performance Results Overview In this section,

More information

Dummy Variables. 1. Example: Factors Affecting Monthly Earnings

Dummy Variables. 1. Example: Factors Affecting Monthly Earnings Dummy Variables A dummy variable or binary variable is a variable that takes on a value of 0 or 1 as an indicator that the observation has some kind of characteristic. Common examples: Sex (female): FEMALE=1

More information

Multinomial Logit Models for Variable Response Categories Ordered

Multinomial Logit Models for Variable Response Categories Ordered www.ijcsi.org 219 Multinomial Logit Models for Variable Response Categories Ordered Malika CHIKHI 1*, Thierry MOREAU 2 and Michel CHAVANCE 2 1 Mathematics Department, University of Constantine 1, Ain El

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Scott Creel Wednesday, September 10, 2014 This exercise extends the prior material on using the lm() function to fit an OLS regression and test hypotheses about effects on a parameter.

More information

Addiction - Multinomial Model

Addiction - Multinomial Model Addiction - Multinomial Model February 8, 2012 First the addiction data are loaded and attached. > library(catdata) > data(addiction) > attach(addiction) For the multinomial logit model the function multinom

More information

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN EXAMINATION INSTITUTE AND FACULTY OF ACTUARIES Curriculum 2019 SPECIMEN EXAMINATION Subject CS1A Actuarial Statistics Time allowed: Three hours and fifteen minutes INSTRUCTIONS TO THE CANDIDATE 1. Enter all the candidate

More information

Statistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron

Statistical Models of Stocks and Bonds. Zachary D Easterling: Department of Economics. The University of Akron Statistical Models of Stocks and Bonds Zachary D Easterling: Department of Economics The University of Akron Abstract One of the key ideas in monetary economics is that the prices of investments tend to

More information

Bayesian Multinomial Model for Ordinal Data

Bayesian Multinomial Model for Ordinal Data Bayesian Multinomial Model for Ordinal Data Overview This example illustrates how to fit a Bayesian multinomial model by using the built-in mutinomial density function (MULTINOM) in the MCMC procedure

More information

ST 350 Lecture Worksheet #33 Reiland

ST 350 Lecture Worksheet #33 Reiland ST 350 Lecture Worksheet #33 Reiland SOLUTIONS Name Lotteries: Good Idea or Scam? Lotteries have become important sources of revenue for many state governments. However, people have criticized lotteries

More information

Modelling the potential human capital on the labor market using logistic regression in R

Modelling the potential human capital on the labor market using logistic regression in R Modelling the potential human capital on the labor market using logistic regression in R Ana-Maria Ciuhu (dobre.anamaria@hotmail.com) Institute of National Economy, Romanian Academy; National Institute

More information

Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X

Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X Chapter 156 Tests for the Odds Ratio in a Matched Case-Control Design with a Binary X Introduction This procedure calculates the power and sample size necessary in a matched case-control study designed

More information

Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Paul J. Hilliard, Educational Testing Service (ETS)

Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Paul J. Hilliard, Educational Testing Service (ETS) Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds Using New SAS 9.4 Features for Cumulative Logit Models with Partial Proportional Odds INTRODUCTION Multicategory Logit

More information

Keywords Akiake Information criterion, Automobile, Bonus-Malus, Exponential family, Linear regression, Residuals, Scaled deviance. I.

Keywords Akiake Information criterion, Automobile, Bonus-Malus, Exponential family, Linear regression, Residuals, Scaled deviance. I. Application of the Generalized Linear Models in Actuarial Framework BY MURWAN H. M. A. SIDDIG School of Mathematics, Faculty of Engineering Physical Science, The University of Manchester, Oxford Road,

More information

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay. Solutions to Final Exam

The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay. Solutions to Final Exam The University of Chicago, Booth School of Business Business 41202, Spring Quarter 2017, Mr. Ruey S. Tsay Solutions to Final Exam Problem A: (40 points) Answer briefly the following questions. 1. Describe

More information

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment

Math 2311 Bekki George Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Math 2311 Bekki George bekki@math.uh.edu Office Hours: MW 11am to 12:45pm in 639 PGH Online Thursdays 4-5:30pm And by appointment Class webpage: http://www.math.uh.edu/~bekki/math2311.html Math 2311 Class

More information

Catherine De Vries, Spyros Kosmidis & Andreas Murr

Catherine De Vries, Spyros Kosmidis & Andreas Murr APPLIED STATISTICS FOR POLITICAL SCIENTISTS WEEK 8: DEPENDENT CATEGORICAL VARIABLES II Catherine De Vries, Spyros Kosmidis & Andreas Murr Topic: Logistic regression. Predicted probabilities. STATA commands

More information

Session 178 TS, Stats for Health Actuaries. Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA. Presenter: Joan C. Barrett, FSA, MAAA

Session 178 TS, Stats for Health Actuaries. Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA. Presenter: Joan C. Barrett, FSA, MAAA Session 178 TS, Stats for Health Actuaries Moderator: Ian G. Duncan, FSA, FCA, FCIA, FIA, MAAA Presenter: Joan C. Barrett, FSA, MAAA Session 178 Statistics for Health Actuaries October 14, 2015 Presented

More information

This homework assignment uses the material on pages ( A moving average ).

This homework assignment uses the material on pages ( A moving average ). Module 2: Time series concepts HW Homework assignment: equally weighted moving average This homework assignment uses the material on pages 14-15 ( A moving average ). 2 Let Y t = 1/5 ( t + t-1 + t-2 +

More information

Homework Assignment Section 3

Homework Assignment Section 3 Homework Assignment Section 3 Tengyuan Liang Business Statistics Booth School of Business Problem 1 A company sets different prices for a particular stereo system in eight different regions of the country.

More information

MBF1923 Econometrics Prepared by Dr Khairul Anuar

MBF1923 Econometrics Prepared by Dr Khairul Anuar MBF1923 Econometrics Prepared by Dr Khairul Anuar L1 Introduction to Econometrics www.notes638.wordpress.com What is Econometrics? Econometrics means economic measurement. The scope of econometrics is

More information

Logistic Regression. Logistic Regression Theory

Logistic Regression. Logistic Regression Theory Logistic Regression Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Logistic Regression The linear probability model.

More information

To be two or not be two, that is a LOGISTIC question

To be two or not be two, that is a LOGISTIC question MWSUG 2016 - Paper AA18 To be two or not be two, that is a LOGISTIC question Robert G. Downer, Grand Valley State University, Allendale, MI ABSTRACT A binary response is very common in logistic regression

More information

Module 10: Single-level and Multilevel Models for Nominal Responses Concepts

Module 10: Single-level and Multilevel Models for Nominal Responses Concepts Module 10: Single-level and Multilevel Models for Nominal Responses Concepts Fiona Steele Centre for Multilevel Modelling Pre-requisites Modules 5, 6 and 7 Contents Introduction... 1 Introduction to the

More information

SJAM MPM 1D Unit 5 Day 13

SJAM MPM 1D Unit 5 Day 13 Homework 1. Identify the dependent variable. a) The distance a person walks depends on the time they walk. b) The recipe for 1 muffins requires cups of flour. c) Houses need 1 fire alarm per floor.. Identify

More information

We take up chapter 7 beginning the week of October 16.

We take up chapter 7 beginning the week of October 16. STT 315 Week of October 9, 2006 We take up chapter 7 beginning the week of October 16. This week 10-9-06 expands on chapter 6, after which you will be equipped with yet another powerful statistical idea

More information

CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES

CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES Examples: Monte Carlo Simulation Studies CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES Monte Carlo simulation studies are often used for methodological investigations of the performance of statistical

More information

The Mode: An Example. The Mode: An Example. Measure of Central Tendency: The Mode. Measure of Central Tendency: The Median

The Mode: An Example. The Mode: An Example. Measure of Central Tendency: The Mode. Measure of Central Tendency: The Median Chapter 4: What is a measure of Central Tendency? Numbers that describe what is typical of the distribution You can think of this value as where the middle of a distribution lies (the median). or The value

More information

MATH 217 Test 2 Version A

MATH 217 Test 2 Version A MATH 217 Test 2 Version A Name: KEY Sec Number: Answer all questions to the best of your ability. Note you should show as much work as is possible. For questions answered using Excel be sure to include

More information

Lecture 10: Alternatives to OLS with limited dependent variables, part 1. PEA vs APE Logit/Probit

Lecture 10: Alternatives to OLS with limited dependent variables, part 1. PEA vs APE Logit/Probit Lecture 10: Alternatives to OLS with limited dependent variables, part 1 PEA vs APE Logit/Probit PEA vs APE PEA: partial effect at the average The effect of some x on y for a hypothetical case with sample

More information

An Empirical Study on Default Factors for US Sub-prime Residential Loans

An Empirical Study on Default Factors for US Sub-prime Residential Loans An Empirical Study on Default Factors for US Sub-prime Residential Loans Kai-Jiun Chang, Ph.D. Candidate, National Taiwan University, Taiwan ABSTRACT This research aims to identify the loan characteristics

More information

Module 9: Single-level and Multilevel Models for Ordinal Responses. Stata Practical 1

Module 9: Single-level and Multilevel Models for Ordinal Responses. Stata Practical 1 Module 9: Single-level and Multilevel Models for Ordinal Responses Pre-requisites Modules 5, 6 and 7 Stata Practical 1 George Leckie, Tim Morris & Fiona Steele Centre for Multilevel Modelling If you find

More information

Bin(20,.5) and N(10,5) distributions

Bin(20,.5) and N(10,5) distributions STAT 600 Design of Experiments for Research Workers Lab 5 { Due Thursday, November 18 Example Weight Loss In a dietary study, 14 of 0 subjects lost weight. If weight is assumed to uctuate up or down by

More information

Algebra 1 Unit 3: Writing Equations

Algebra 1 Unit 3: Writing Equations Lesson 8: Making Predictions and Creating Scatter Plots The table below represents the cost of a car over the recent years. Year Cost of a Car (in US dollars) 2000 22,500 2002 26,000 2004 32,000 2006 37,500

More information

Logit and Probit Models for Categorical Response Variables

Logit and Probit Models for Categorical Response Variables Applied Statistics With R Logit and Probit Models for Categorical Response Variables John Fox WU Wien May/June 2006 2006 by John Fox Logit and Probit Models 1 1. Goals: To show how models similar to linear

More information

Model fit assessment via marginal model plots

Model fit assessment via marginal model plots The Stata Journal (2010) 10, Number 2, pp. 215 225 Model fit assessment via marginal model plots Charles Lindsey Texas A & M University Department of Statistics College Station, TX lindseyc@stat.tamu.edu

More information

Models of Patterns. Lecture 3, SMMD 2005 Bob Stine

Models of Patterns. Lecture 3, SMMD 2005 Bob Stine Models of Patterns Lecture 3, SMMD 2005 Bob Stine Review Speculative investing and portfolios Risk and variance Volatility adjusted return Volatility drag Dependence Covariance Review Example Stock and

More information

Understanding Predictability (JPE, 2004)

Understanding Predictability (JPE, 2004) Understanding Predictability (JPE, 2004) Lior Menzly, Tano Santos, and Pietro Veronesi Presented by Peter Gross NYU October 19, 2009 Presented by Peter Gross (NYU) Understanding Predictability October

More information

Review questions for Multinomial Logit/Probit, Tobit, Heckit, Quantile Regressions

Review questions for Multinomial Logit/Probit, Tobit, Heckit, Quantile Regressions 1. I estimated a multinomial logit model of employment behavior using data from the 2006 Current Population Survey. The three possible outcomes for a person are employed (outcome=1), unemployed (outcome=2)

More information

Intro to GLM Day 2: GLM and Maximum Likelihood

Intro to GLM Day 2: GLM and Maximum Likelihood Intro to GLM Day 2: GLM and Maximum Likelihood Federico Vegetti Central European University ECPR Summer School in Methods and Techniques 1 / 32 Generalized Linear Modeling 3 steps of GLM 1. Specify the

More information

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1

Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 - Embers Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2.

More information

A Comparison of Univariate Probit and Logit. Models Using Simulation

A Comparison of Univariate Probit and Logit. Models Using Simulation Applied Mathematical Sciences, Vol. 12, 2018, no. 4, 185-204 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ams.2018.818 A Comparison of Univariate Probit and Logit Models Using Simulation Abeer

More information

Unit 3: Writing Equations Chapter Review

Unit 3: Writing Equations Chapter Review Unit 3: Writing Equations Chapter Review Part 1: Writing Equations in Slope Intercept Form. (Lesson 1) 1. Write an equation that represents the line on the graph. 2. Write an equation that has a slope

More information

AP Stats: 3B ~ Least Squares Regression and Residuals. Objectives:

AP Stats: 3B ~ Least Squares Regression and Residuals. Objectives: Objectives: INTERPRET the slope and y intercept of a least-squares regression line USE the least-squares regression line to predict y for a given x CALCULATE and INTERPRET residuals and their standard

More information

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR

STATISTICAL DISTRIBUTIONS AND THE CALCULATOR STATISTICAL DISTRIBUTIONS AND THE CALCULATOR 1. Basic data sets a. Measures of Center - Mean ( ): average of all values. Characteristic: non-resistant is affected by skew and outliers. - Median: Either

More information

Regression. Lecture Notes VII

Regression. Lecture Notes VII Regression Lecture Notes VII Statistics 112, Fall 2002 Outline Predicting based on Use of the conditional mean (the regression function) to make predictions. Prediction based on a sample. Regression line.

More information

Lecture 1: Happiness and Growth

Lecture 1: Happiness and Growth Lecture 1: Happiness and Growth Eugenio Proto March 13, 2009 Eugenio Proto () Lecture 1: Happiness and Growth March 13, 2009 1 / 18 Happiness and Utility Implies cardinal Utility and Interpersonal Comparison

More information

PASS Sample Size Software

PASS Sample Size Software Chapter 850 Introduction Cox proportional hazards regression models the relationship between the hazard function λ( t X ) time and k covariates using the following formula λ log λ ( t X ) ( t) 0 = β1 X1

More information

Non linearity issues in PD modelling. Amrita Juhi Lucas Klinkers

Non linearity issues in PD modelling. Amrita Juhi Lucas Klinkers Non linearity issues in PD modelling Amrita Juhi Lucas Klinkers May 2017 Content Introduction Identifying non-linearity Causes of non-linearity Performance 2 Content Introduction Identifying non-linearity

More information

Cross- Country Effects of Inflation on National Savings

Cross- Country Effects of Inflation on National Savings Cross- Country Effects of Inflation on National Savings Qun Cheng Xiaoyang Li Instructor: Professor Shatakshee Dhongde December 5, 2014 Abstract Inflation is considered to be one of the most crucial factors

More information

Homework Assignment Section 3

Homework Assignment Section 3 Homework Assignment Section 3 Tengyuan Liang Business Statistics Booth School of Business Problem 1 A company sets different prices for a particular stereo system in eight different regions of the country.

More information

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali

Contents Part I Descriptive Statistics 1 Introduction and Framework Population, Sample, and Observations Variables Quali Part I Descriptive Statistics 1 Introduction and Framework... 3 1.1 Population, Sample, and Observations... 3 1.2 Variables.... 4 1.2.1 Qualitative and Quantitative Variables.... 5 1.2.2 Discrete and Continuous

More information

Name Period. Linear Correlation

Name Period. Linear Correlation Linear Regression Models Directions: Use the information below to solve the problems in this packet. Packets are due at the end of the period and students who do not finish will be required to come in

More information

Lecture 8: Markov and Regime

Lecture 8: Markov and Regime Lecture 8: Markov and Regime Switching Models Prof. Massimo Guidolin 20192 Financial Econometrics Spring 2016 Overview Motivation Deterministic vs. Endogeneous, Stochastic Switching Dummy Regressiom Switching

More information

Technical Documentation for Household Demographics Projection

Technical Documentation for Household Demographics Projection Technical Documentation for Household Demographics Projection REMI Household Forecast is a tool to complement the PI+ demographic model by providing comprehensive forecasts of a variety of household characteristics.

More information

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 13, 2018

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 13, 2018 Maximum Likelihood Estimation Richard Williams, University of otre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 3, 208 [This handout draws very heavily from Regression Models for Categorical

More information

REJECT INFERENCE FOR CREDIT ADJUDICATION

REJECT INFERENCE FOR CREDIT ADJUDICATION REJECT INFERENCE FOR CREDIT ADJUDICATION May 2014 THE SITUATION SOMEONE APPLIES FOR A LOAN AND A DECISION HAS TO BE MADE TO ACCEPT OR REJECT. THIS IS CREDIT ADJUDICATION IF WE ACCEPT WE CAN OBSERVE PERFORMANCE

More information

MATH Intuitive Calculus Spring 2011 Circle one: 8:50 5:30 Ms. Kracht. Name: Score: /100. EXAM 2: Version A NO CALCULATORS.

MATH Intuitive Calculus Spring 2011 Circle one: 8:50 5:30 Ms. Kracht. Name: Score: /100. EXAM 2: Version A NO CALCULATORS. MATH 11012 Intuitive Calculus Spring 2011 Circle one: 8:50 5:30 Ms Kracht Name: Score: /100 110 pts available) EXAM 2: Version A NO CALCULATORS Multiple Choice: 10 questions at 3 points each Circle the

More information

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 10, 2017

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 10, 2017 Maximum Likelihood Estimation Richard Williams, University of otre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 0, 207 [This handout draws very heavily from Regression Models for Categorical

More information

Extension Analysis. Lauren Goodwin Advisor: Steve Cherry. Spring Introduction and Background Filing Basics... 2

Extension Analysis. Lauren Goodwin Advisor: Steve Cherry. Spring Introduction and Background Filing Basics... 2 Extension Analysis Lauren Goodwin Advisor: Steve Cherry Spring 2015 Contents 1 Introduction and Background 2 1.1 Filing Basics............................................. 2 2 Objectives and Questions

More information

Honor Code: By signing my name below, I pledge my honor that I have not violated the Booth Honor Code during this examination.

Honor Code: By signing my name below, I pledge my honor that I have not violated the Booth Honor Code during this examination. Name: OUTLINE SOLUTIONS University of Chicago Graduate School of Business Business 41000: Business Statistics Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper for the formulas.

More information

2009/2010 CAIA Prerequisite Diagnostic Review (PDR) And Answer Key

2009/2010 CAIA Prerequisite Diagnostic Review (PDR) And Answer Key 2009/2010 CAIA Prerequisite Diagnostic Review (PDR) And Answer Key Form B --------------------------------------------------------------------------------- Candidates registered for the program are assumed

More information

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley. Appendix: Statistics in Action Part I Financial Time Series 1. These data show the effects of stock splits. If you investigate further, you ll find that most of these splits (such as in May 1970) are 3-for-1

More information

f x f x f x f x x 5 3 y-intercept: y-intercept: y-intercept: y-intercept: y-intercept of a linear function written in function notation

f x f x f x f x x 5 3 y-intercept: y-intercept: y-intercept: y-intercept: y-intercept of a linear function written in function notation Questions/ Main Ideas: Algebra Notes TOPIC: Function Translations and y-intercepts Name: Period: Date: What is the y-intercept of a graph? The four s given below are written in notation. For each one,

More information

THE UNIVERSITY OF CHICAGO Graduate School of Business Business 41202, Spring Quarter 2003, Mr. Ruey S. Tsay

THE UNIVERSITY OF CHICAGO Graduate School of Business Business 41202, Spring Quarter 2003, Mr. Ruey S. Tsay THE UNIVERSITY OF CHICAGO Graduate School of Business Business 41202, Spring Quarter 2003, Mr. Ruey S. Tsay Homework Assignment #2 Solution April 25, 2003 Each HW problem is 10 points throughout this quarter.

More information

Table 4. Probit model of union membership. Probit coefficients are presented below. Data from March 2008 Current Population Survey.

Table 4. Probit model of union membership. Probit coefficients are presented below. Data from March 2008 Current Population Survey. 1. Using a probit model and data from the 2008 March Current Population Survey, I estimated a probit model of the determinants of pension coverage. Three specifications were estimated. The first included

More information

Yesterday s Heroes: Compensation and Creative Risk Taking

Yesterday s Heroes: Compensation and Creative Risk Taking Yesterday s Heroes: Compensation and Creative Risk Taking Ing-Haw Cheng Harrison Hong Jose Scheinkman University of Michigan Princeton University and NBER Chicago Fed Conference on Bank Structure May 4,

More information

Internet Appendix for Can Rare Events Explain the Equity Premium Puzzle?

Internet Appendix for Can Rare Events Explain the Equity Premium Puzzle? Internet Appendix for Can Rare Events Explain the Equity Premium Puzzle? Christian Julliard London School of Economics Anisha Ghosh y Carnegie Mellon University March 6, 2012 Department of Finance and

More information

Milestone2. Zillow House Price Prediciton. Group: Lingzi Hong and Pranali Shetty

Milestone2. Zillow House Price Prediciton. Group: Lingzi Hong and Pranali Shetty Milestone2 Zillow House Price Prediciton Group Lingzi Hong and Pranali Shetty MILESTONE 2 REPORT Data Collection The following additional features were added 1. Population, Number of College Graduates

More information

These notes essentially correspond to chapter 7 of the text.

These notes essentially correspond to chapter 7 of the text. These notes essentially correspond to chapter 7 of the text. 1 Costs When discussing rms our ultimate goal is to determine how much pro t the rm makes. In the chapter 6 notes we discussed production functions,

More information

7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4

7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4 7. For the table that follows, answer the following questions: x y 1-1/4 2-1/2 3-3/4 4 - Would the correlation between x and y in the table above be positive or negative? The correlation is negative. -

More information

University of New South Wales Semester 1, Economics 4201 and Homework #2 Due on Tuesday 3/29 (20% penalty per day late)

University of New South Wales Semester 1, Economics 4201 and Homework #2 Due on Tuesday 3/29 (20% penalty per day late) University of New South Wales Semester 1, 2011 School of Economics James Morley 1. Autoregressive Processes (15 points) Economics 4201 and 6203 Homework #2 Due on Tuesday 3/29 (20 penalty per day late)

More information

Statistical Evidence and Inference

Statistical Evidence and Inference Statistical Evidence and Inference Basic Methods of Analysis Understanding the methods used by economists requires some basic terminology regarding the distribution of random variables. The mean of a distribution

More information

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2016, Mr. Ruey S. Tsay. Solutions to Midterm

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2016, Mr. Ruey S. Tsay. Solutions to Midterm Booth School of Business, University of Chicago Business 41202, Spring Quarter 2016, Mr. Ruey S. Tsay Solutions to Midterm Problem A: (30 pts) Answer briefly the following questions. Each question has

More information