Revealing Additional Dimensions of Preference Heterogeneity in a Latent Class Mixed Multinomial Logit Model

Similar documents
WORKING PAPER ITLS-WP Does the choice model method and/or the data matter? INSTITUTE of TRANSPORT and LOGISTICS STUDIES

Contents. Part I Getting started 1. xxii xxix. List of tables Preface

Evaluation of influential factors in the choice of micro-generation solar devices

School of Economic Sciences

FIT OR HIT IN CHOICE MODELS

A MODIFIED MULTINOMIAL LOGIT MODEL OF ROUTE CHOICE FOR DRIVERS USING THE TRANSPORTATION INFORMATION SYSTEM

Interpretation issues in heteroscedastic conditional logit models

Comparison of Complete Combinatorial and Likelihood Ratio Tests: Empirical Findings from Residential Choice Experiments

Choice Probabilities. Logit Choice Probabilities Derivation. Choice Probabilities. Basic Econometrics in Transportation.

Evaluation of influential factors in the choice of micro-generation solar devices: a case study in Cyprus

Solving dynamic portfolio choice problems by recursing on optimized portfolio weights or on the value function?

The Mixed Logit Model: The State of Practice and Warnings for the Unwary

Transport Data Analysis and Modeling Methodologies

The Multinomial Logit Model Revisited: A Semiparametric Approach in Discrete Choice Analysis

Discrete Choice Model for Public Transport Development in Kuala Lumpur

Equity, Vacancy, and Time to Sale in Real Estate.

Correcting for Survival Effects in Cross Section Wage Equations Using NBA Data

Keywords Akiake Information criterion, Automobile, Bonus-Malus, Exponential family, Linear regression, Residuals, Scaled deviance. I.

Introductory Econometrics for Finance

Modelling catastrophic risk in international equity markets: An extreme value approach. JOHN COTTER University College Dublin

Phd Program in Transportation. Transport Demand Modeling. Session 11

EE266 Homework 5 Solutions

Lecture 3: Factor models in modern portfolio choice

M249 Diagnostic Quiz

Estimating Mixed Logit Models with Large Choice Sets. Roger H. von Haefen, NC State & NBER Adam Domanski, NOAA July 2013

The Mixed Logit Model: The State of Practice

Approximating the Confidence Intervals for Sharpe Style Weights

9. Logit and Probit Models For Dichotomous Data

Probits. Catalina Stefanescu, Vance W. Berger Scott Hershberger. Abstract

Estimating Market Power in Differentiated Product Markets

Chapter 2 Uncertainty Analysis and Sampling Techniques

Empirical Methods for Corporate Finance. Panel Data, Fixed Effects, and Standard Errors

A UNIFIED MIXED LOGIT FRAMEWORK FOR MODELING REVEALED AND STATED PREFERENCES: FORMULATION AND APPLICATION TO CONGESTION

Time Invariant and Time Varying Inefficiency: Airlines Panel Data

Introduction to the Maximum Likelihood Estimation Technique. September 24, 2015

to level-of-service factors, state dependence of the stated choices on the revealed choice, and

Multinomial Choice (Basic Models)

Asian Journal of Economic Modelling MEASUREMENT OF THE COST-OF-LIVING INDEX IN THE EASI MODEL: EVIDENCE FROM THE JAPANESE EXPENDITURE DATA

Available online at ScienceDirect. Procedia Environmental Sciences 22 (2014 )

Discrete Choice Modeling William Greene Stern School of Business, New York University. Lab Session 2 Binary Choice Modeling with Panel Data

Multi-Path General-to-Specific Modelling with OxMetrics

In Debt and Approaching Retirement: Claim Social Security or Work Longer?

[D7] PROBABILITY DISTRIBUTION OF OUTSTANDING LIABILITY FROM INDIVIDUAL PAYMENTS DATA Contributed by T S Wright

Fixed Effects Maximum Likelihood Estimation of a Flexibly Parametric Proportional Hazard Model with an Application to Job Exits

sociology SO5032 Quantitative Research Methods Brendan Halpin, Sociology, University of Limerick Spring 2018 SO5032 Quantitative Research Methods

Models and optimal designs for conjoint choice experiments including a no-choice option

The Optimization Process: An example of portfolio optimization

GMM for Discrete Choice Models: A Capital Accumulation Application

Mixed Logit or Random Parameter Logit Model

Valuing Environmental Impacts: Practical Guidelines for the Use of Value Transfer in Policy and Project Appraisal

Testing the Reliability of the Benefit Function Transfer Approach

Economics 430 Handout on Rational Expectations: Part I. Review of Statistics: Notation and Definitions

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29

The Probability of Legislative Shirking: Estimation and Validation

Dependence Structure and Extreme Comovements in International Equity and Bond Markets

Exercise 1. Data from the Journal of Applied Econometrics Archive. This is an unbalanced panel.n = 27326, Group sizes range from 1 to 7, 7293 groups.

MEASURING PORTFOLIO RISKS USING CONDITIONAL COPULA-AR-GARCH MODEL

Volume 29, Issue 2. Measuring the external risk in the United Kingdom. Estela Sáenz University of Zaragoza

CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA

STATISTICAL METHODS FOR CATEGORICAL DATA ANALYSIS

Logit Models for Binary Data

Analyzing the Determinants of Project Success: A Probit Regression Approach

List of tables List of boxes List of screenshots Preface to the third edition Acknowledgements

Soft Budget Constraints in Public Hospitals. Donald J. Wright

Volume 37, Issue 2. Handling Endogeneity in Stochastic Frontier Analysis

Characterization of the Optimum

Available online at ScienceDirect. Transportation Research Procedia 1 (2014 ) 24 35

Portfolio theory and risk management Homework set 2

Assessment on Credit Risk of Real Estate Based on Logistic Regression Model

Student Loan Nudges: Experimental Evidence on Borrowing and. Educational Attainment. Online Appendix: Not for Publication

Are WTP Estimates for Wildfire Risk Reductions Transferrable from Coast to Coast? Results of a Choice Experiment in California and Florida

FS January, A CROSS-COUNTRY COMPARISON OF EFFICIENCY OF FIRMS IN THE FOOD INDUSTRY. Yvonne J. Acheampong Michael E.

An Analysis of the Factors Affecting Preferences for Rental Houses in Istanbul Using Mixed Logit Model: A Comparison of European and Asian Side

The Value of Information in Central-Place Foraging. Research Report

Non-participation in choice models: hurdle and latent class models. Michael Burton and Dan Rigby

The Determinants of Bank Mergers: A Revealed Preference Analysis

Econometrics is. The estimation of relationships suggested by economic theory

Online Appendix: Asymmetric Effects of Exogenous Tax Changes

Let us assume that we are measuring the yield of a crop plant on 5 different plots at 4 different observation times.

Estimating Ordered Categorical Variables Using Panel Data: A Generalised Ordered Probit Model with an Autofit Procedure

The mean-variance portfolio choice framework and its generalizations

Moral hazard in a voluntary deposit insurance system: Revisited

Wealth E ects and Countercyclical Net Exports

What s New in Econometrics. Lecture 11

Solution Guide to Exercises for Chapter 4 Decision making under uncertainty

Appendix to: AMoreElaborateModel

Operational Risk Aggregation

The Consistency between Analysts Earnings Forecast Errors and Recommendations

Supplementary Material for: Belief Updating in Sequential Games of Two-Sided Incomplete Information: An Experimental Study of a Crisis Bargaining

Automobile Ownership Model

Pricing & Risk Management of Synthetic CDOs

Do Random Coefficients and Alternative Specific Constants Improve Policy Analysis? An Empirical Investigation of Model Fit and Prediction

Intro to GLM Day 2: GLM and Maximum Likelihood

Simulating Continuous Time Rating Transitions

RATIONAL BUBBLES AND LEARNING

An experimental investigation of evolutionary dynamics in the Rock- Paper-Scissors game. Supplementary Information

COMPARING NEURAL NETWORK AND REGRESSION MODELS IN ASSET PRICING MODEL WITH HETEROGENEOUS BELIEFS

} Number of floors, presence of a garden, number of bedrooms, number of bathrooms, square footage of the house, type of house, age, materials, etc.

Discrete Choice Modeling

Incorporating Observed and Unobserved Heterogeneity. in Urban Work Travel Mode Choice Modeling. Chandra R. Bhat. Department of Civil Engineering

Transcription:

Revealing Additional Dimensions of Preference Heterogeneity in a Latent Class Mixed Multinomial Logit Model William H. Greene Department of Economics Stern School of Business New York University, New York 10012 wgreene@stern.nyu.edu David A. Hensher Institute of Transport and Logistics Studies Faculty of Economics and Business The University of Sydney NSW 2006 Australia David.Hensher@sydney.edu.au 16 September 2010 Abstract Latent class models offer an alternative perspective to the popular mixed logit form, replacing the continuous distribution with a discrete distribution in which preference heterogeneity is captured by membership of distinct classes of utility description. Within each class, preference homogeneity is usually assumed (i.e., fixed parameters), although interactions with observed contextual effects are permissible. A natural extension of the fixed parameter latent class model is a random parameter latent class model which allows for another layer of preference heterogeneity within each class. This paper sets out the random parameter latent class model, building on the fixed parameter latent class model, and illustrates its applications using a stated choice data set on alternative freight distribution attribute packages pivoted around a recent trip in Australia. Keywords: latent class mixed multinomial logit, random parameters, preference heterogeneity, stated choice experiment, freight distribution

1. Introduction The focus of the majority of discrete choice modelling over the last ten years has been centred on the mixed (or random parameter) logit model, and more recently on extensions of mixed logit to incorporate scale heterogeneity and estimation in willingness to pay space (referred to as generalised mixed logit) (see for example Train and Weeks 2005, Fiebig et al. 2009, Greene and Hensher 2010). The family of mixed logit models assume a continuous distribution for the random parameters across the sampled population, which may be systematically assigned to specific observations through an interaction with observed covariates. Latent class models offer an alternative perspective, replacing the continuous distribution with a discrete distribution in which preference heterogeneity is captured by membership in distinct classes of utility description (see Greene and Hensher 2003 that compares latent class and mixed logit models, and a recent revisit of the Greene- Hensher contribution by Shen 2009). Within each class, preference homogeneity is usually assumed (i.e., fixed parameters), although interactions with observed contextual effects are permissible. A natural extension of the fixed parameter latent class model is a random parameter latent class model which allows for another layer of preference heterogeneity within a class. This paper sets out the random parameter latent class model, known as the latent class mixed multinomial logit model, building on the fixed parameter latent class model. As far as we are aware, this is the first paper to present this extension and application of this method, although Bujosa et al. (2010) 1 have recently also undertaken a very similar extension. We illustrate its applications using an unlabelled stated choice data set on alternative freight distribution trip attribute packages pivoted around a recent trip in Australia. 2. Random Parameter Latent Class Model Latent class modelling provides an alternative approach to accommodating heterogeneity to models such as multinomial logit and mixed logit (see Everitt 1988 and Uebersax 1999). The natural approach assumes that parameter vectors, i are distributed among individuals with a discrete distribution, rather than the continuous distribution that lies behind the mixed logit model. Thus, it is assumed that the population consists of a finite number, Q, of groups of individuals. The groups are heterogeneous, with common parameters, q, for the members of the group, but the groups themselves are different from one another. We assume that the classes are distinguished by the different parameter vectors, though the fundamental data generating process, the probability density for the interesting variable under study, is the same. 1 Chronologically, we developed our model in November 2009 and were made aware of Bujosa et al. in early 2010. Like Bujosa et al., we have a 2-class model; however they use a single observation per respondent revealed preference data, whereas we use stated choice data and allow for correlation amongst the observations common to each respondent. 2

The analyst does not know from the data which observation is in which class, hence the term latent classes. The model assumes that individuals are distributed heterogeneously with a discrete distribution in a population. The full specification of the latent class structure for a generic data generating process is (Greene and Hensher 2003): f(y i x i,class = q) = g(y i x i, q ) (1) Prob(class = q) = q (), q = 1,...,Q. (2) The unconditional probability attached to an observation is obtained by integrating out the heterogeneity due to the distribution across classes, given in (3). f(y i x i ) = Σ q q ()g(y i x i, q ). (3) In this paper, we extend the latent class model to allow for heterogeneity both within and across groups. That is, we allow for variation of the parameter vector within classes as well as between classes. The extended model is a straightforward combination of the mixed logit and latent class models. To accommodate the two layers of heterogeneity, we allow for continuous variation of the parameters within classes. The latent class aspect of the model is given as (4) and (5). f(y i x i,class = q) = g(y i x i, i q ) (4) Prob(class = q) = q (), q = 1,...,Q. (5) The within-class heterogeneity is structured as i q = q + w i q (6) w i q ~ E[w i q X] = 0, Var[w i q X] = Σ q (7) where the use of X indicates that w i q is uncorrelated with all exogenous data in the sample. We will assume below that the underlying distribution for the within-class heterogeneity is normal with mean 0 and covariance matrix Σ. In a given application, it may be appropriate to further assume that certain rows and corresponding columns of Σ q equal zero, indicating that the variation of the corresponding parameter is entirely across classes. The contribution of individual i to the log likelihood for the model is obtained for each individual in the sample by integrating out the within-class heterogeneity and then the class heterogeneity. We will allow for a panel data, or stated preference data setting, hence the observed vector of outcomes is denoted y i and the observed data on exogenous variables are collected in X i = [X i1,..,x iti ]. The individual is assumed to engage in T i choice situations, where T i > 1. The generic model is given in (8). f(y i X i, 1,..., Q,,Σ 1,...,Σ Q ) = y w X w w.(8) Q Ti ( ) [ ( ), ] ( ) 1 q f 1 it q i it h q t i q d i w i 3

We parameterize the class probabilities using a multinomial logit formulation to impose the adding up and positivity restrictions on q (). Thus, exp( q ) q ( ), q = 1,...,Q; Q = 0. (9) exp( ) Q q1 q A useful refinement of the class probabilities model is to allow the probabilities to be dependent on individual data, such as demographics including age and income. The class probability model becomes exp( z ), q = 1,...,Q; Q = 0. (10) exp( ) q i iq ( zi) Q q 1 qz i The model employed in this application is a latent class, mixed multinomial logit (LC_MMNL) model. Individual i chooses among J alternatives with conditional probabilities given as (11). J exp[ j 1 yit, j( q wi) xit, j] it q i it J J j 1 j1 y it, j q wi xit, j f [ y ( w ), X ], j = 1,...,J, (11) exp[ ( ) ] y it,j = 1 for the j corresponding to the alternative chosen and 0 for all others, and x it,j is the vector of attributes of alternative j for individual i in choice situation t. The integrals cannot be evaluated analytically. We use maximum simulated likelihood (along the same lines as mixed logit) to evaluate the terms in the log likelihood expresion. The contribution of individual i to the simulated log likelihood is the log of equation (12). f S Q 1 R T (y i X i, 1,..., Q,,Σ 1,...,Σ Q ) = i ( ) [ ( 1 q f 1 1 it q i, r), it] q R r y w X (12) t w i,r is the rth of R random draws (Halton draws in our implementation) on the random vector w i. Collecting all terms, the simulated log likelihood is given as (13). S N Q 1 R T i log L log ( ) [ ( 1 1 q f 1 1 it q i, r), it] i q R r t y w X (13) The functional forms for q () and f[y it ( q +w i,r ),X it ] are given in (9) and (10), respectively. 3. Illustrative Empirical Context To illustrate the application of the model, we have selected a stated choice (SC) framework (Louviere et al. 2000) within which a freight transporter defined a recent reference trip in terms of its time and cost attributes (detailed below), treating fuel as a separate cost item to the variable user charge (VUC), with the latter being zero at present. An SC design of two alternatives using principles of D-optimality in 4

experimental design (see Hensher et al. 2007, Rose et al. 2008, Sandor and Wedel 2001) was developed to vary the levels of existing attributes around the recent reference trip levels plus introduce a VUC based on distance travelled but with varying rates per kilometre. Puckett et al. (2007) provide extended details of how the empirical study was designed and the data collection strategy. Selecting the set of attributes for the choice sets involved an iterative process of finding candidate attributes and determining how they could fit intuitively into the choice sets. Whilst in-depth interviews and literature reviews revealed myriad attributes that influence freight decision making (see Puckett et al. 2006, Cullinane and Toy 2000, Danielas et al. 2005, Fowkes et al. 2004), we focussed on the subset of these attributes that were most likely to be directly affected by congestion charges. Hence, the attributes that reside within the choice sets are: free-flow travel time, slowed-down travel time, time spent waiting to unload at the final destination, likelihood of on-time arrival, fuel cost and distance-based road user charges. These attributes are either an input into a congestion-charging policy (i.e., changes in fuel taxes, road user charges), or direct functions of such a policy. The levels of the attributes are expressed as deviations from the reference level, which is the exact value specified in the corresponding non-sc questions, unless noted: (1) Free-flow time: -50%, -25%, 0, +25%, +50% (2) Congested time: -50%, -25%, 0, +25%, +50% (3) Waiting time at destination: -50%, -25%, 0, +25%, +50% (4) Probability of on-time arrival: -50%, -25%, 0, +25%, +50%, with the resulting value rounded to the nearest five percent (e.g., a reference value of 75% reduced by 50% would yield a raw figure of 37.5%, which would be rounded to 40%). If the resulting value is 100%, the value is expressed as 99%. If the reference level is greater than 92%, the pivot base is set to 92%. If the pivot base is greater than 66 percent (i.e., if 1.5 times the base would be greater than 100%) let the pivot base equal X, and let the difference between 99% and X equal Y. The range of attribute levels for on-time arrival when X > 66% are (in percentage terms): X-Y, X-.5*Y, X, X+.5*Y, X+Y. This yields five equally-spaced attribute levels between X- Y and 99%. (5) Fuel cost: -50%, -25%, 0, +25%, +50% (representing changes in fuel taxes of -100%, -50%, 0, +50%, +100%) (6) Distance-based charges: Pivot base equals.5*(reference fuel cost), to reflect the amount of fuel taxes paid in the reference alternative. Variations around the pivot base are: -50%, -25%, 0, +25%, +50% The survey was conducted via computer-aided personal interview (CAPI) in which an interviewer asked the questions and the respondents provided a response. In many situations the respondent was the logistics manager, although a driver was present to provide details of a specific reference trip that were used to establish the attribute levels for the reference trip. An example of an SC screen is given in Figure 1. Freight 5

transporters were faced with four choice sets. They were asked to assume that, for each of the choice sets given, the same goods need to be carried for the same client, subject to the same constraints faced when the reference trip was undertaken. Respondents are then informed that the choice sets involve three alternative methods of making the trip: their stated trip and two SC alternatives that involve VUCs. The survey was undertaken in 2005, sampling transporters who were delivering goods on behalf of a single shipper to and/or from the Sydney Metropolitan area. Initially a sample of transporters were selected and screened for participation by a telephone call. Eligibility to participate in the CAPI survey required a respondent having (i) input into the routing or scheduling of freight vehicles used by their organisation (ii) input into the business arrangements made with their organisation s customers, and (iii) their organisation carry truckloads that contain cargo either sent by, or intended for, one single company. The resulting estimation sample is 108 transporters yielding 432 choice sets. 4. Results Figure 1 Main Choice Set Screen In this illustrative application of the LC_MMNL model, we have simplified the model and aggregated free flow time, congested time and time waiting at the destination to unload goods, as well as the fuel costs and variable user charge. From supplementary questions, over 80 percent of respondents indicated that they added up the times and costs respectively before making a choice. A dummy variable was included in the model to indicate the absence of a variable user charge in the total cost. We have assumed to this point that the number of classes, Q, is known. This will rarely be the case, so a question naturally arises, how can the analyst determine Q? Since Q is not a free parameter, a likelihood ratio test is not appropriate, though, in fact, log L will increase when Q increases. Researchers typically use an information criterion, such as AIC, to guide them toward the appropriate value. Heckman and Singer (1984) note a practical guidepost. If the model is fit with too many classes, then estimates will become imprecise, even varying wildly. Signature features of a model that has been overfit will be exceedingly small estimates of the class 6

probabilities, wild values of the structural parameters, and huge estimated standard errors. Statistical inference about the parameters can be made in the familiar fashion. The Wald test or likelihood ratio tests will probably be more convenient. Hypothesis tests across classes are unlikely to be meaningful. For example, suppose we fit a three class model. Tests about the equality of some of the coefficients in one class to those in another would probably be ambiguous, because the classes themselves are indeterminate. It is rare that one can even put a name on the classes, other than, 1, 2, etc. Likewise, testing about the number of classes is an uncertain exercise. If the parameters of the two classes are identical, it would seem that there is a single class. The number of restrictions would seem to be the number of model parameters. However, there remain two class probabilities, π 1 and π 2. If the parameter vectors are the same, then regardless of the values of π 1 and π 2, there is only one class. Thus, the degrees of freedom for this test are ambiguous. The same log likelihood will emerge for any pair of probabilities that sum to one. We found that two classes delivered statistical significance for each parameter estimate in at least one class, in contrast three and four classes had at least one variable that was statistically insignificant in all classes. The findings are presented in Table 1 for the two latent class-mixed MNL model with random parameters 2, together with an MNL model, a mixed logit model and two fixed parameter latent class models. The two versions of the latent class models (M3, M5) and (M5, M6) are distinguished by the inclusion of the freight rate as a source of systematic variation in the class membership probabilities. -0.00177 0.0296-0.0051 Table 1 Summary of Models 432 observations. For mixed logit we used constrained t-distributions 3 and 500 Halton draws MNL Mixed Logit Latent Class (Model M1) (Model M2) Fixed Parameters (M3) Random Parameters (M4) Attributes: Class 1 Class 2 Class 1 Class 2 Total time -0.0056-0.0053 0.0451 -.0082 0.0008 (-3.44) (-2.97) rp (-3.82) (1.73) (1.95) (0.25) On time delivery 0.03161 0.0360-0.0296-0.0005 rp 0.02713 rp (4.75) (4.51) rp (3.35) (-0.55) (-0.03) (2.53) Total cost -0.0030-0.0039 0.0134-0.0135-0.0052 (-4.15) (-3.74) rp (-3.95) (2.09) (-2.13) (-3.33) No variable charge dummy (1,0) 0.9406 (5.71) 0.8798 (4.84) 0.7716 (2.35) 4.2819 (2.18) 1.5933 (4.48) -1.2945 (-2.1) Class membership probability: 0.7137 (14.2) 0.2853 (4.04) BIC 1.8777 1.8651 1.8857 1.6650 Log-likelihood -393.44-390.71-380.01-350.63 0.574 (6.9) 0.4264 (5.14) 2 All models are estimated using Nlogit 5 (Pre-release). 3 The triangular distribution ensures that the sign of the parameter estimate is behaviourally plausible throughout the entire distribution. The triangular distribution was first used for random coefficients by Train and Revelt (2000), later incorporated into Train (2003). Hensher and Greene (2003) also used it and it is increasingly being used in empirical studies. Let c be the centre and s the spread. The density starts at c-s, rises linearly to c, and then drops linearly to c+s. It is zero below c-s and above c+s. The mean and mode are c. The standard deviation is the spread divided by 6 ; hence the spread is the standard deviation times 6. The height of the tent at c is 1/s (such that each side of the tent has area s(1/s)(1/2)=1/2, and both sides have area 1/2+1/2=1, as required for a density). The slope is 1/s 2. 7

Latent Class with decomposition of class membership probability Fixed Parameters (M5) Random Parameters (M6) Attributes: Class 1 Class 2 Class 1 Class 2 Total time (mins) 0.0056 (0.69) -0.00015 (-0.04) -0.0069 (-1.96) 0.0015 (0.56) On time delivery (percent) -0.0062 (-0.21) 0.0279 (2.34) -0.00056 rp (-0.040 0.0276 rp (2.86) Total cost ($) -0.0474 (-3.30) -0.0043 (-1.91) -0.0157 (-3.14) -0.0055 (-4.26) No variable charge dummy 1.4220 (2.84) -2.1567 (-1.34) 1.4109 (4.25) -1.3472 (-2.95) (1,0) Class 0.616 0.384 0.575 0.425 membership probability: constant 1.1741 (3.30) 0 1.8200 (2.79) 0 Freight rate ($) -0.00076 (-2.32) 0-0.0016 (-2.54) 0 BIC 1.7724 1.6498 Log-likelihood -372.85-346.36 Note: Elasticities are not informative for this model given that the alternatives are unlabelled. The overall fit of the LC-MMNL model is a statistically significant improvement over the other models on the BIC index, given the additional two parameters, compared to the fixed parameter latent class model, and three extra parameters compared to the mixed logit model (excluding the membership class probabilities). The addition of the freight rate in M6 as a systematic conditioning source on the probability of class provides the best fit model. The improvement of the LC-MMNL model over all other models is impressive, and supports the presence of preference heterogeneity in on time delivery in one class. Whereas all parameters are statistically significant for the MNL and mixed logit models, this is not the case for the latent class models, especially the random parameter specifications and the fixed parameter version of latent class when we allow for the decomposition of the class membership probability by the freight rate 4. This suggests that, up to a class membership probability, that some attributes have statistical merit for only one latent class. In particular, the LC-MMNL models 4 and 6 have two latent classes in which there is a clear preference for total time and total cost in one class, and on-time delivery and total cost in the other class. This is also the case under the fixed parameter latent class form. Furthermore, when we account for within-class preference heterogeneity at the attribute level, we see a significant change in the mix of membership probability between the two classes; namely 42.64 and 57.4 percent for the LC-MMNL model 4 (and 42.5 and 57.5 percent for model 6) and 71.37 and 28.53 percent for the fixed parameter latent class model 3 (and 61.6 and 38.4 percent for model 5). It should be noted that class 1 and class 2 in the latent class models are not equivalent in any sense across the models and should not be directly compared per se. The only random parameter is on time delivery, which is statistically significant in one class only for all four latent class models, and together with the fixed parameter estimate for total time, suggests that we have one class of transporters who focus on a trade-off between total time and total cost, up to a probability, and another class of 4 Bujosa et al. (2010) also found a number of statistically non-significant parameter estimates in one class. 8

transporters who focus on the trade-off between on time delivery and total cost up to a probability. The marginal utility distribution for on time delivery in class 2 for LC- MMNL Model 6 and the Mixed Logit model (both sources of random parameters) are given in Figures 2 and 3 respectively. The on time delivery parameter is not significant for class 1, suggesting that the mixed logit form confounds the mixture of respondents, up to a probability of latent class membership, with both a significant and a non-significant on time delivery parameter estimate which converges to statistical significance when combined into one class. The most informative evidence in Figures 2 and 3 is the range of estimates, since the two distributions are not directly comparable across the models. Some additional amount of preference heterogeneity is added in when we distinguish between latent classes in contrast to a single class treatment in mixed logit. 20.13 16.10 12.08 Den sity 8.05 4.03.00.000.019.038.058.077.096 OTLD Kernel density estimate for OTLD Figure 2 LC-MMNL Marginal Utility Distribution for On-Time Delivery mean = 0.045, std = 0.0178, min =0.0009, max = 0.0895 26.39 21.11 15.83 Density 10.56 5.28.00.000.015.031.046.061.076 OTLDM Kernel density estimate for OTLDM Figure 3 Mixed Logit Marginal Utility Distribution for On-Time Delivery mean = 0.035, std = 0.0143, min =0.0003, max = 0.0707 Willingness to pay (WTP) estimates for the value of total travel time savings ($/hour) and on time delivery ($/percent point on time) are summarised in Table 2. The mean estimates for both time dimensions are considerably lower for the latent class models compared to MNL and mixed logit, for reasons that are unclear other than the recognition that the MNL and mixed logit results ignore the possibility of the two classes of utility representation (up to a probability). The presence of statistically significant effects in one of the two classes on both time attributes might be reason for concern; however what this tells us is that the WTP estimate is driven by membership 9

of one latent class only. Given the overall fit, under BIC, is considerably better for the latent class models, the WTP estimates associated with MNL and Mixed Logit might be queried, suggesting a significant upward bias in the mean estimates for both total time and on time delivery. Whether this evidence is transferable to other situations can only be confirmed by additional empirical studies, although we note that Bujosa et al. (2010) in a totally different context of recreational trips to forest sites in Mallorca find significantly higher mean estimates for mixed logit and LC models for attributes with random parameters compared to MNL and fixed parameter LC. Table 2 Willingness to Pay Estimates for Total Time and On Time Delivery Values in brackets are standard deviations for random parameter specifications (based on 500 draws) MNL Mixed Logit Latent Class Fixed Parameters Random Parameters WTP M1 M2 M3 M5 M4 M6 Total time ($/hr) 112 81.5 (0.05) 20.8 ns 37.2 21.4 On time delivery ($/percent point) 10.53 17.13 (31.2) 4.76 6.49 5.08 (2.15) 5.17 (2.01) Conclusions This paper extends the latent class model to accommodate preference heterogeneity within each class through the use of random parameters. Although a subtle extension in many ways, in line with the evolution of choice models specified with continuous mixtures, it was only a matter of time before the interest in a latent class model with random parameters surfaced. The evidence herein shows that the introduction of random parameters within class is straightforward, and that this LC-MMNL model, in the context of one data set, delivers the best overall fit in contrast to MNL, mixed logit and fixed parameter latent class models. In the specific application however, this comes at a price, in that researchers interested in willingness to pay estimates, a major output of choice analysis, may find that the cost parameter is not statistically significant in all classes, as is the case herein (as well as Bujosa et al. (2010) and also found in other studies that use fixed latent class models, although not so in the current study), denying calculation of overall weighted average WTP estimates, which are necessary to obtain meaning to the notion of WTP being weighted by the probability of class membership. Ongoing research using a number of data sets will ensure that we gain a greater understanding of the gains to be made from the inclusion of random parameters in a latent class model, which we suspect will be data-set specific. References Bujosa, A., Riera, A. and Hicks, R. (2010 online) Combining discrete and continuous representation of preference heterogeneity: a latent class approach, Environment and Resource Economics, DOI 10.1007/s10640-010-9389-y. Cullinane, K.P.B. and Toy, N.R. (2000) Identifying influential attributes in freight route/mode choice decisions: a content analysis, Transportation Research E, 36 (1) 41-53. 10

Danielis, R., Marucci, E., and Rotaris, L. (2005) Logistics managers stated preferences for freight service attributes, Transportation Research Part E, 41 201-215. Everitt, B. (1988) A finite mixture model for the clustering of mixed-mode data, Statistics and Probability Letters, 6, 305-309. Fiebig, D., Keane, M., Louviere, J., and Wasi, N. (2009) The generalized multinomial logit: accounting for scale and coefficient heterogeneity, Marketing Science, published online before print July 23, DOI:10.1287/mksc.1090.0508 Fowkes, A. S., Firmin, P. E., Tweddle, G. and Whiteing A. E. (2004) How highly does the freight transport industry value journey time reliability and for what reasons International Journal of Logistics: Research and Applications, 7(1), 33-44. Greene, W.H. and Hensher, D.A. (2003) A latest class model for discrete choice analysis: contrasts with mixed logit. Transportation Research Part B, 37, 681-698. Greene, W.H. and Hensher, D.A. (2010) Does scale heterogeneity matter? a comparative assessment of logit models, Transportation, 37 (3), 413-428 Heckman, J. and B. Singer (1984) A method for minimizing the impact of distributional assumptions in econometric models, Econometrica, 52, 271-320. Hensher, D.A. and Greene, W.H. (2003) Mixed logit models: state of practice, Transportation, 30 (2), 133-176. Hensher, D.A., Puckett, S. and Rose, J. (2007) Agency decision making in freight distribution chains: revealing a parsimonious empirical strategy from alternative behavioural structures, Transportation Research B. 41 (9), 924-949. Puckett, S.M., Hensher, D.A. and Battellino, H. (2006) The adjustment of supply chains to new states: a qualitative assessment of decision relationships with special reference to congestion charging, International Journal of Transport Economics, XXXIII (3), October, 313-339. Puckett, S.M., Hensher, D.A., Rose, J. and Collins, A. (2007) Design and development of a stated choice experiment in a two-agent setting: interactions between buyers and sellers of urban freight services Transportation, 34, 429-451 Revelt, D. and K. Train (1998) Mixed Logit with repeated choices: households' choices of appliance efficiency level, Review of Economics and Statistics, 80, 1-11. Rose, J.M., Bliemer, M.C., Hensher and Collins, A. T. (2008) Designing efficient stated choice experiments in the presence of reference alternatives, Transportation Research B 42 (4), 395-406 Sandor, Zsolt and Wedel, M. (2001) Designing conjoint choice experiments using managers prior beliefs, Journal of Marketing Research, 38 (4), 430-444. Shen, J. (2009) Latent class model or mixed logit model? A comparison by transport mode choice data Applied Economics, 41, 2915 2924. Train, K. 2003, Discrete Choice Methods with Simulation, Cambridge University Press, Cambridge. Train, K. and Weeks, M. (2005) Discrete choice models in preference space and willing to-pay space, in R. Scarpa and A. Alberini, eds., Applications of Simulation Methods in Environmental and Resource Economics, Springer Publisher, Dordrecht, chapter 1, 1 16. Uebersax, J. (1999) Probit latent class analysis with dichotomous or ordered category measures: conditional independence/dependence models, Applied Psychological Measurement, 23, 283-297. 11