Models of Multinomial Qualitative Response

Similar documents
Economics Multinomial Choice Models

Introduction to the Maximum Likelihood Estimation Technique. September 24, 2015

Econometrics II Multinomial Choice Models

Multinomial Choice (Basic Models)

Econometric Methods for Valuation Analysis

Questions of Statistical Analysis and Discrete Choice Models

Analysis of Microdata

A note on the nested Logit model

Lecture 1: Logit. Quantitative Methods for Economic Analysis. Seyed Ali Madani Zadeh and Hosein Joshaghani. Sharif University of Technology

Estimating Market Power in Differentiated Product Markets

Quant Econ Pset 2: Logit

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods

Choice Probabilities. Logit Choice Probabilities Derivation. Choice Probabilities. Basic Econometrics in Transportation.

Review questions for Multinomial Logit/Probit, Tobit, Heckit, Quantile Regressions

3. Multinomial response models

Estimating Mixed Logit Models with Large Choice Sets. Roger H. von Haefen, NC State & NBER Adam Domanski, NOAA July 2013

Introduction to POL 217

What s New in Econometrics. Lecture 11

Valuing Environmental Impacts: Practical Guidelines for the Use of Value Transfer in Policy and Project Appraisal

The Multinomial Logit Model Revisited: A Semiparametric Approach in Discrete Choice Analysis

Topic 8 Lecture 1 Estimating Policy Effects in the Presence of. Endogeneity via the Linear Instrumental Variables (IV) Method

Industrial Organization

STATISTICAL METHODS FOR CATEGORICAL DATA ANALYSIS

PhD Qualifier Examination

Mixed Logit or Random Parameter Logit Model

Final Exam - section 1. Thursday, December hours, 30 minutes

Modeling Obesity and S&P500 Using Normal Inverse Gaussian

A Test of the Normality Assumption in the Ordered Probit Model *

Financial Liberalization and Neighbor Coordination

Table 4. Probit model of union membership. Probit coefficients are presented below. Data from March 2008 Current Population Survey.

Discrete Choice Modeling

Choice Models. Session 1. K. Sudhir Yale School of Management. Spring

Drawbacks of MNL. MNL may not work well in either of the following cases due to its IIA property:

COMPLEMENTARITY ANALYSIS IN MULTINOMIAL

Omitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations

1 Excess burden of taxation

Econometric Models of Expenditure

A generalized Hosmer Lemeshow goodness-of-fit test for multinomial logistic regression models

A MODIFIED MULTINOMIAL LOGIT MODEL OF ROUTE CHOICE FOR DRIVERS USING THE TRANSPORTATION INFORMATION SYSTEM

Simplest Description of Binary Logit Model

Chapter 3. Dynamic discrete games and auctions: an introduction

Multivariate probit models for conditional claim-types

Applied Econometrics for Health Economists

Hierarchical Generalized Linear Models. Measurement Incorporated Hierarchical Linear Models Workshop

PhD Qualifier Examination

Categorical Outcomes. Statistical Modelling in Stata: Categorical Outcomes. R by C Table: Example. Nominal Outcomes. Mark Lunt.

Volume 37, Issue 2. Handling Endogeneity in Stochastic Frontier Analysis

Bivariate Birnbaum-Saunders Distribution

Applied Econometrics. Lectures 13 & 14: Nonlinear Models Beyond Binary Choice: Multinomial Response Models, Corner Solution Models &

Multinomial Logit Models - Overview Richard Williams, University of Notre Dame, Last revised February 13, 2017

Intro to GLM Day 2: GLM and Maximum Likelihood

Mondays from 6p to 8p in Nitze Building N417. Wednesdays from 8a to 9a in BOB 718

Automobile Prices in Equilibrium Berry, Levinsohn and Pakes. Empirical analysis of demand and supply in a differentiated product market.

List of tables List of boxes List of screenshots Preface to the third edition Acknowledgements

A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples

NPTEL Project. Econometric Modelling. Module 16: Qualitative Response Regression Modelling. Lecture 20: Qualitative Response Regression Modelling

Duration Models: Parametric Models

Probits. Catalina Stefanescu, Vance W. Berger Scott Hershberger. Abstract

PhD Qualifier Examination

Point Estimation. Copyright Cengage Learning. All rights reserved.

Lecture 10: Alternatives to OLS with limited dependent variables, part 1. PEA vs APE Logit/Probit

May 9, Please put ONLY your ID number on the blue books. Three (3) points will be deducted for each time your name appears in a blue book.

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)

[BINARY DEPENDENT VARIABLE ESTIMATION WITH STATA]

Labor Economics Field Exam Spring 2014

Sean Howard Econometrics Final Project Paper. An Analysis of the Determinants and Factors of Physical Education Attendance in the Fourth Quarter

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

Multinomial Logit Models for Variable Response Categories Ordered

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, Last revised January 10, 2017

Pakes (1986): Patents as Options: Some Estimates of the Value of Holding European Patent Stocks

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, Last revised January 13, 2018

List of figures. I General information 1

Comparison of Complete Combinatorial and Likelihood Ratio Tests: Empirical Findings from Residential Choice Experiments

Economic stability through narrow measures of inflation

Log-linear Modeling Under Generalized Inverse Sampling Scheme

Getting Started in Logit and Ordered Logit Regression (ver. 3.1 beta)

Predicting the Probability of Being a Smoker: A Probit Analysis

Rational Inattention to Discrete Choices: A New Foundation for. the Multinomial Logit Model

Point Estimators. STATISTICS Lecture no. 10. Department of Econometrics FEM UO Brno office 69a, tel

Introductory Econometrics for Finance

Section Sampling Distributions for Counts and Proportions

Obtaining Analytic Derivatives for a Class of Discrete-Choice Dynamic Programming Models

INTERNATIONAL REAL ESTATE REVIEW 2002 Vol. 5 No. 1: pp Housing Demand with Random Group Effects

The following content is provided under a Creative Commons license. Your support

Logit Models for Binary Data

Regression with a binary dependent variable: Logistic regression diagnostic

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Midterm

C.10 Exercises. Y* =!1 + Yz

Models and optimal designs for conjoint choice experiments including a no-choice option

3 Logit. 3.1 Choice Probabilities

3.4 Copula approach for modeling default dependency. Two aspects of modeling the default times of several obligors

CHAPTER 11 Regression with a Binary Dependent Variable. Kazu Matsuda IBEC PHBU 430 Econometrics

Why do the youth in Jamaica neither study nor work? Evidence from JSLC 2001

FIT OR HIT IN CHOICE MODELS

Economics 742 Brief Answers, Homework #2

Nested logit. Michel Bierlaire

Nested logit. Michel Bierlaire

15. Multinomial Outcomes A. Colin Cameron Pravin K. Trivedi Copyright 2006

Econ Spring 2016 Section 12

Labor Mobility of Artists and Creative Individuals Does Distance Matter?

Transcription:

Models of Multinomial Qualitative Response Multinomial Logit Models October 22, 2015

Dependent Variable as a Multinomial Outcome Suppose we observe an economic choice that is a binary signal from amongst M discrete alternatives. The focus on the course is to estimate parameters to test economic theories and/or predict the impact of exogenous change due to policy. Numerous examples: Regulator s choice of where to site a hazardous waste incinerator from among Virginia s 134 counties and independent cities. Student s choice of which of M colleges to attend (where M>2). Individual s choice from among M different private health care plans (where M>2). REI s choice of where to site their next retail store from among 366 Metropolitan statistical areas in the US.

Terminology and Definitions Terminology: The choice set, C, is the set comprised of the M feasible alternatives that could be chosen. For binary discrete outcomes, the choice set is C = Yes, No = 1, 0, while for multinomial outcomes, it is C = Red, Blue,..., Orange = 1, 2,..., M. For each choice alternative m, define a 1 K M vector of alternative specific independent variables, x m = [ x m1 x m2... x mk ]1 K m (1) For each individual i, define a 1 K I vector of individual specific independent variables, z i = [ x i1 x i2... x ik ]1 K I (2) Additionally, we could also define y im as a vector of individual and alternative specific data, but for brevity we omit this type of information without ruling it out.

The Random Utility Model (RUM) for Multinomial Outcomes In the Multinomial RUM, an individual i looks at the indirect utility (an index of well-being) of each of the M alternatives and chooses the best one: d im =1 if =0 otherwise V (x m, z i, ɛ m β) > V (x j, z i, ɛ j β) j C For individual i, M m=1 d im = 1, since only one alternative can be chosen.

The Random Utility Model (RUM): An example Suppose we observe student i, having characteristics z i considering M potential universities, each having characteristics x m.simplify by assuming M=3 (3 choice alternatives) and a linear in parameters form for V. So, we observe the individual choosing alternative 3 (d i3 = 1) iff x 3 β x + z i β z + ɛ i3 > x 2 β x + z i β z + ɛ i2 and x 3 β x + z i β z + ɛ i3 > x 1 β x + z i β z + ɛ i1 Note: variables that do not vary over the choice alternatives drop out of the difference, given our linear function. So, we can t identify β z.

Econometrics Step 1 As researchers, we can observe z i and/or x m (for all choice alternatives), but we can t observe the ɛ s. Nor is there a straightforward way to construct estimated errors as in OLS, since the multinomial qualitative choice signals if the indirect utility is higher or not, not the degree to which it is higher. But we can tackle this problem in a maximum likelihood framework. In a RUM context, write the probability that individual i choose 3 as Prob(3) = Prob ((x 3 x 2 )β z > ɛ i2 ɛ i3, (x 3 x 1 )β z > ɛ i1 ɛ i3 )

Econometrics Step 2 In an analogous manner to what we did before with binary probit and logit models and letting f (ɛ m ) be the pdf for unobservables, Prob(3 β, ɛ, x, z i ) = (x3 x 2 )β x +ɛ i3 (x3 x 1 )β x +ɛ i3 f (ɛ i3 )f (ɛ i2 )f (ɛ i1 )... dɛ i1 dɛ i2 dɛ i3 In words: Given the alternative and individual characteristics and a guess for β, find the likelihood that a draw of ɛ i3, ɛ i2 and ɛ i1 are consistent with the observed choice.

Econometrics Step 3: Assume a distribution for the errors Continuing to assume that 3 was chosen: Probit : (x3 x 2 )β x +ɛ i3 ) Prob(3) = φ(ɛ i3 )φ(ɛ i2 )φ(ɛ i1 )dɛ i1 dɛ i2 dɛ i3 (x3 x 1 )β x +ɛ i3 ) Logit : Prob(3) = e x 3 β e x 1 β +e x 2 β +e x 3 β With that, we can construct the log-likelihood over the entire sample: ln(l(β d, x i β)) = ( N M ) ln Prob(m) d im i=1 m=1 (3)

Interpreting Parameters Here, the marginal effects can be a bit complicated (all of these are for the multinomial logit model). For example, how changing an attribute at one alternative, changes the probability of another alternative: ln[p(m)] x jk = P(j) P(m) β k (4) Or, how changing an attribute at one alternative, changes the probability of choosing that alternative: ln[p(m)] x mk = P(m) β k (5) And finally, ln[p(m)/p(j)] [x mk x jk ] = β k (6)

There is no such thing as a free lunch. The logit probability is quite easy to work with compared to the multiple integrals required for the probit model. But, there is some baggage that comes with it. Inherent in the multinomial logit model is the IIA property- Independence of Irrelevant Alternatives. Consider the ratio of any two probabilities, such as P(3) P(1) : e x 3 β e x 1 β +e x 2 β +e x 3 β = e x 1 β e x 1 β +e x 2 β +e x 3 β e x 3β e x 1β (7) So what?? The β s estimated in the model must adhere to this potentially restrictive condition and by extension, preferences and economic inference may be biased.

IIA Example Suppose there are 7 health plans an individual might buy. Plans 1-5 are HMO plans while Plans 6 and 7 are PPO. Many people prefer PPO plans because it offers flexibility in choosing out of network physicians. Suppose we estimate a multinomial logit model and obtain the following predicted probabilities for individual i: Plan P(m) P(m)/P(PPO 2 ) HMO 1.13.65 HMO 2.09.45 HMO 3.14.7 HMO 4.09.45 HMO 5.15.75 PPO 1.2 1 PPO 2.2 1 Total 1

IIA Example, continued Now suppose that PPO 1 is no longer available. How does the Multinomial Logit and the IIA property reapportion the 20% likelihood of choosing PPO 1 amongst the remaining 6 alternatives? 7 Options Avail. 6 Options Avail. Plan P(m) P(m)/P(PPO 2 ) P(m) P(m)/ P(PPO 2 ) HMO 1.13.65.16.65 HMO 2.09.45.11.45 HMO 3.14.70.18.70 HMO 4.09.45.11.45 HMO 5.15.75.19.75 PPO 1.20 1.00 N/A N/A PPO 2.20 1.00.25 1.00 Total 1 1

Mechanics of IIA Assume 3 alternatives, with initial probabilities P(1), P(2), and P(3). Suppose that alternative 2 is eliminated, denote the new probabilities as P(1) and P(3). Given the multinomial logit model, it must be the case that following the elimination of alternative 2, the following 2 conditions must hold: Adding Up: P(1) + P(3) = 1 P(1) IIA: P(3) = P(1) P(3), which implies P(1) = P(3)P(1) P(3) Using these, it can be shown that P(3) = P(1) = P(3) P(1)+P(3) P(1) P(1)+P(3)

The Nested Logit Model Medical Plan Choice HMO PPO V (HMO1) V (HMO2) V (HMO3) V (HMO4) V (HMO5) V (PPO1) V (PPO2) V (PPO m)=x m + z PPO + PPO m V (HMO m)=x m + z HMO + HMO m Basic Idea: Relax IIA for alternatives in different nests, while keeping IIA within nests.

Choice Probabilities The nested logit model writes the probability of choosing the second PPO alternative, for example, as where P(PPO) = P(PPO 2 ) = P(PPO) P(PPO 2 PPO) (8) e τ PPO(z PPO γ+iv (PPO)) e τ PPO(z PPO γ+iv (PPO)) + e τ HMO(z HMO γ+iv (HMO)) (9) e x PPO,2β P(PPO 2 PPO) = e xppo,1β + e x (10) PPO,2β and IV (B) = ln [ ] m B ex B,m

What is the role of τ τ PPO and τ HMO dictate the degree to which it is easy to substitute from alternatives within one branch to alternatives in another branch. It can be shown that the IIA property 1 Is imposed for alternatives within a branch 2 Is relaxed (does not hold) for alternatives across branches If τ PPO = τ HMO = 1, then the nested logit collapses to the multinomial logit model. Therefore, the suitability of the IIA property can be tested in a maximum likelihood framework: H 0 : τ PPO = τ HMO = 1 H 1 : τ PPO τ HMO 1 Using a likelihood ratio test χ 2, in this case having 2 degrees of freedom since we have two restrictions.

Practical Issues and Extensions For large M, the clogit model is almost universally used (more than 10 choice alternatives) Stata has two multinomial logit commands: mlogit and clogit. Mlogit is consistent with the varying parameters model and clogit with the RUM model. The R command mlogit handles both types of models Some other extensions (also relevant for the binary Probit and Logit models): 1 Random Parameters (or mixed models) that relax IIA 2 Heteroskedastic error models