Comparability in Meaning Cross-Cultural Comparisons Andrey Pavlov

Similar documents
Descriptive Statistics (Devore Chapter One)

Stat3011: Solution of Midterm Exam One

Inflation Targeting and Revisions to Inflation Data: A Case Study with PCE Inflation * Calvin Price July 2011

CABARRUS COUNTY 2008 APPRAISAL MANUAL

Invitational Mathematics Competition. Statistics Individual Test

1 Variables and data types

Chapter 18: The Correlational Procedures

Copyright 2011 Pearson Education, Inc. Publishing as Addison-Wesley.

CHAPTER 2 Describing Data: Numerical

There are a couple of old sayings that could apply to fees paid to asset managers. You get

2c Tax Incidence : General Equilibrium

Citizens Health Care Working Group. Greenville, Mississippi Listening Sessions. April 18, Final Report

Journal of Insurance and Financial Management, Vol. 1, Issue 4 (2016)

CSC Advanced Scientific Programming, Spring Descriptive Statistics

Global Currency Hedging

Alternative VaR Models

8.1 Estimation of the Mean and Proportion

Final Exam Practice Set, STT 315, Section 106

Week 2 Quantitative Analysis of Financial Markets Hypothesis Testing and Confidence Intervals

Kingdom of Saudi Arabia Capital Market Authority. Investment

Topic: Government spending and taxation

THE UNIVERSITY OF TEXAS AT AUSTIN Department of Information, Risk, and Operations Management

Report on the Findings of the Information Commissioner s Office Annual Track Individuals. Final Report

Both the quizzes and exams are closed book. However, For quizzes: Formulas will be provided with quiz papers if there is any need.

COMMENTS ON SESSION 1 AUTOMATIC STABILISERS AND DISCRETIONARY FISCAL POLICY. Adi Brender *

PERCEPTION OF CARD USERS TOWARDS PLASTIC MONEY

Describing Data: One Quantitative Variable

Answers to Chapter 10 Review Questions

Social Studies 201 January 28, 2005 Measures of Variation Overview

Lecture 2 Describing Data

STAT 157 HW1 Solutions

Trends in Financial Literacy

The use of linked administrative data to tackle non response and attrition in longitudinal studies

Data Analysis. BCF106 Fundamentals of Cost Analysis

FACULTY OF SCIENCE DEPARTMENT OF STATISTICS

CHAPTER 5 RESULT AND ANALYSIS

Time for a. New Deal. for Young People. Broadbent Institute poll highlights millennials precarious future and boomers worries.

Exam 1 Review. 1) Identify the population being studied. The heights of 14 out of the 31 cucumber plants at Mr. Lonardo's greenhouse.

CHAPTER V. PRESENTATION OF RESULTS

Canadians abroad: most have travelled beyond the US, but how many would leave Canada for good?

Challenges and Possible Solutions in Enhancing Operational Risk Measurement

Assessment on Credit Risk of Real Estate Based on Logistic Regression Model

Maximum Likelihood Estimation Richard Williams, University of Notre Dame, Last revised January 13, 2018

AMS7: WEEK 4. CLASS 3

A PRIMER ON EXCHANGE RATES AND EXPORTING EM041E

INF385T Data Wrangling: From Excel to SQL 1. Worldwide expenditure on public health Trends based on the GDP and Income Groups of countries

Data Analysis and Statistical Methods Statistics 651

5.- RISK ANALYSIS. Business Plan

CHAPTER 4 DISCRETE PROBABILITY DISTRIBUTIONS

November 5, Very preliminary work in progress

Calculating the Probabilities of Member Engagement

Statistics Class 15 3/21/2012

Power-Law Networks in the Stock Market: Stability and Dynamics

3.4.1 Convert Percents, Decimals, and Fractions

TIM 50 Fall 2011 Notes on Cash Flows and Rate of Return

Article from: Health Watch. May 2012 Issue 69

DATA SUMMARIZATION AND VISUALIZATION

institution Top 10 to 20 undergraduate

Summarising Data. Summarising Data. Examples of Types of Data. Types of Data

The Fallacy of Large Numbers

Lecture 16: Estimating Parameters (Confidence Interval Estimates of the Mean)

ANZ Survey of Adult Financial Literacy in Australia Summary Report

Chapter 10 Estimating Proportions with Confidence

Using Fractals to Improve Currency Risk Management Strategies

Travel Metrics: Consumer Approaches to Travel Insurance and Assistance in Selected Global Markets

TEACHING UNIT. Grade Level: Grade 10 Recommended Curriculum Area: Language Arts Other Relevant Curriculum Area(s): Mathematics

Basic Procedure for Histograms

P2.T5. Market Risk Measurement & Management. Bruce Tuckman, Fixed Income Securities, 3rd Edition

Iowa State University Financial Counseling Clinic Client Report

US Income Tax For Expats

New Evidence on the Value of Financial Advice By Dr. Jon Cockerline, Ph.D.

The Golub Capital Altman Index

INSTITUTE OF ACTUARIES OF INDIA

Bayesian Normal Stuff

But suppose we want to find a particular value for y, at which the probability is, say, 0.90? In other words, we want to figure out the following:

PRIVATE COSTS OF ENFORCEMENT OF IPR

Fatness of Tails in Risk Models

Portfolio Sharpening

Bank Capital, Profitability and Interest Rate Spreads MUJTABA ZIA * This draft version: March 01, 2017

Evaluating the BLS Labor Force projections to 2000

MBF1923 Econometrics Prepared by Dr Khairul Anuar

M249 Diagnostic Quiz

THE IMPORTANCE OF ASSET ALLOCATION vs. SECURITY SELECTION: A PRIMER. Highlights:

Visualizing 360 Data Points in a Single Display. Stephen Few

Numerical Descriptive Measures. Measures of Center: Mean and Median

Thank you very much for your participation. This survey will take you about 15 minutes to complete.

Point Estimation. Stat 4570/5570 Material from Devore s book (Ed 8), and Cengage

chapter 2-3 Normal Positive Skewness Negative Skewness

Paper Series of Risk Management in Financial Institutions

Chapter 5: Summarizing Data: Measures of Variation

The Fallacy behind Investor versus Fund Returns (and why DALBAR is dead wrong)

Edgeworth Binomial Trees

Random Variables and Applications OPRE 6301

Applications of statistical physics distributions to several types of income

Putting Things Together Part 2

International Review of Business Research Papers Vol. 4 No.3 June 2008 Pp

Final Exam: 14 Dec 2004 Econ 200 David Reiley

IOP 201-Q (Industrial Psychological Research) Tutorial 5

XLSTAT TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING

Public spending on health care: how are different criteria related? a second opinion

Transcription:

Introduction Comparability in Meaning Cross-Cultural Comparisons Andrey Pavlov The measurement of abstract concepts, such as personal efficacy and privacy, in a cross-cultural context poses problems of comparability in meaning, i.e. that the same concept could be understood differently by individuals with different cultural background. While this is not a new problem, as comparative quantitative research attests, a novel solution has been recently suggested by a group of researchers working from Harvard. Led by Gary King, a political scientist, the group developed a modeling technique to be used in parametric data in order to correct for the problem of comparability. Respondents are asked to assess their personal stance with regard to the concept at hand, and compare this assessment to a hypothetical situation involving the same concept as depicted in a series of vignettes. Unlike self-assessment, the vignettes provide invariant tool against which to rank the respondent s self assessment and at the same time provides us with means of comparing the rankings among respondents in the sample. Depending on the results, it is then possible to correct for incomparability in meaning. The paper will explore the use of this technique in our GPD International Survey. Two sets of vignettes were used, one deals with individual control of personal information, and the second with the extent to which privacy is respected in international travel. Why do we need to correct responses? To make it easier for the reader to understand the problem, it will be explained in the framework of one of the questions of our survey: Q2 To what extent do you have a say in what happens to your personal information? There are four possible responses to this question ( Refused and Don t know responses are dropped from analysis): Complete say, A lot of say, Some say, and No say. Here are the results of the survey for Q2: Figure 1: Raw response frequencies Brazil Canada China France Hungary Mexico Spain USA No say 27.3% 17.0% 10.3% 9.5% 31.0% 9.3% 15.3% 16.7% Some say 37.2% 51.0% 17.3% 30.3% 41.4% 50.4% 52.0% 53.7% A lot of say 18.2% 22.0% 50.5% 24.5% 13.4% 21.9% 13.4% 17.5% Complete say 17.3% 9.9% 22.0% 35.7% 14.2% 18.4% 19.3% 12.1% This table shows the frequencies of responses falling in each category by country. We see for instance that France has an high percentage of Complete say, China has A lot of say frequency at least twice as high as it is in other countries, while Hungarians reported mostly No say and Some say. Does this mean that, compared to other countries, French and Chinese people have much more say in what happens to their

personal information, and Hungarian people have little say? We should be very careful in interpreting these figures, since what the table actually shows is people s opinion, based on their intuition, rather than verifiable facts. It is clear that the concept being measured, as well as the suggested categorical scale, is abstract and can be understood by people very differently, depending on their cultural background and perhaps other factors, such as age or gender. This creates inconsistencies in responses, and we can t compare them directly. Outline of the model The statistical model suggested by G. King et.al. is an attempt to measure this incomparability. To do this, respondents are asked a series of vignettes. In the vignettes they answer the same question but with regard to other hypothetical people, presented in short stories. They give their assessment on the same categorical scale. Using the answers to the vignette questions, the model corrects the answer for self assessment, and here is how. Figure 2: Statistical model It is assumed by the model that there actually exists a variable (call it μ), measured on a continuous scale, which represents one s amount of say that he/she has in what happens to his/her personal information. We don t observe this variable of course, but we can perceive it and categorize it into one of the four responses. The perception is modeled by a normal error and categorization is done by means of three thresholds (τ s), which divide the real numbers into four segments, corresponding to the four answers. Both the actual amount of say μ and the thresholds depend on one s cultural background. The vignettes, thought of as people, have their own true amount of say (θ s in the diagram). When these θ s are assessed by a respondent, we can estimate his/her thresholds, and then use them to estimate the respondent s μ. Preliminary analysis

Before we present the model for Q2, let s once again look at raw responses, summarized in table 1. To make the comparison of countries easier and following the ideas of the model, we develop a score which would represent how much say an average citizen of each country has in what happens to his/her personal information. As in the model, assume that the amount of say is measured on a continuous scale, and that the four responses are equidistant on that scale. Assign values for the responses in the following way: No say = 0, Some say = 1, A lot of say = 2 and Complete say = 3. Now compute the mean of these values for each country. If we want to classify the resulting means into one of the four categories, we may take the midpoints between our codes as thresholds, that is 0.5, 1.5 and 2.5. Here is what we get: Figure 3: What we might get without correcting incomparability 3 Complete 2 A lot 1 Some 0 No Hungary Canada USA Brazil Spain Mexico China France The error bars in the chart represent 95% confidence intervals for the estimated scores. As we have already observed, France and China received the highest score, while Hungary has the lowest score. Also if we look at our conventional categorical scale, we see that all countries are at least in the Some say region, with China and France scoring high in the A lot of say region, followed by Mexico, which is on the boundary. It is worth noting at this point, to be discussed further, that the response values were chosen somewhat arbitrarily, and so the heights of the bars are not very informative. Rather we should look at the differences, for instance we can note the big gap between China and Hungary or conclude that Canadian score is not significantly different from that of Brazil. Now take a look at the vignette responses. We don t use the model at this point, just look at how people have rated the four vignettes.

Figure 4: Vignettes response frequencies Mike Brazil Canada China France Hungary Mexico Spain USA No say 46.1% 42.0% 50.2% 60.1% 35.0% 12.2% 16.7% 42.7% Some say 27.8% 22.2% 17.0% 23.3% 21.6% 29.4% 27.8% 21.7% A lot of say 12.9% 8.8% 18.9% 11.9% 8.4% 26.7% 11.2% 11.4% Complete say 13.3% 26.9% 14.0% 4.7% 35.0% 31.8% 44.3% 24.2% James Brazil Canada China France Hungary Mexico Spain USA No say 45.5% 46.5% 42.7% 60.0% 30.9% 11.7% 17.3% 42.3% Some say 28.7% 21.7% 16.4% 21.6% 29.3% 27.5% 25.2% 33.1% A lot of say 15.0% 13.0% 28.0% 13.6% 13.2% 26.4% 14.0% 9.6% Complete say 10.9% 18.8% 12.8% 4.9% 26.6% 34.5% 43.5% 15.0% Mary Brazil Canada China France Hungary Mexico Spain USA No say 35.8% 13.5% 7.9% 21.4% 14.5% 7.9% 13.7% 16.7% Some say 19.0% 10.4% 10.6% 12.6% 10.2% 21.9% 22.2% 15.9% A lot of say 18.4% 13.1% 26.8% 29.7% 10.0% 36.0% 12.9% 14.4% Complete say 26.8% 63.0% 54.7% 36.3% 65.4% 34.2% 51.2% 53.0% Rita Brazil Canada China France Hungary Mexico Spain USA No say 34.7% 27.6% 20.7% 39.3% 33.5% 8.8% 14.0% 31.9% Some say 37.4% 41.1% 23.1% 41.6% 27.3% 26.0% 35.8% 39.6% A lot of say 14.4% 13.4% 38.9% 15.4% 12.9% 29.4% 12.5% 11.3% Complete say 13.6% 18.0% 17.3% 3.7% 26.3% 35.7% 37.7% 17.2% By just looking at these frequencies we may see that Complete say and A lot of say grades are rare with respondents from France. Coupled with the fact that the French people rated themselves very high, we may expect indeed that they will have greatest μ estimate. The situation is somewhat different with China. While their self assessment score was as high as that of France, we see that Chinese respondents tend to give considerably more Complete say and A lot of say ratings to all vignettes. This may mean that people in China have lower standards for what it means to them to control one s personal data. So we expect the model to correct for this and rank China s μ lower than France s. Another interesting change is expected with Mexico and Spain, which have a rather high self assessment score but give a lot of Complete say and A lot of say responses to persons in the vignettes.

Fitting the model and discussion of results The statistical model which we are going to use predicts one s amount of say μ and classification thresholds τ based on a group of variables which account for differences in cultural background. In our case we are mostly interested to carry out a cross-country comparison, but along with country we also include age, gender and race of the respondents. In a more elaborate analysis we may also wish to consider education, occupation and income as relevant background variables. Here are the results of fitting a model with mentioned explanatory variables. Figure 5: Regression coefficients for μ "Amount of say" of respondents (μ) Coefficient Standard Error gender -0.0444 0.0363 log-age -0.1298 0.0477 white 0.0377 0.0604 Brazil 0.2896 0.082 Canada -0.1768 0.0785 France 1.2311 0.0796 Hungary -0.5689 0.0798 Mexico -0.8071 0.0892 Spain -0.7737 0.0804 China 0.3485 0.084 As was mentioned above, the location and the units of the continuous scale on which we define μ have no meaning and are chosen arbitrarily. (The model is identified by having no intercept and fixing the variance of the random error to be 1). Thus the coefficients of the model can t be assigned their usual interpretation. However, we still can say which variables have significant contribution to the model and in which direction they influence the value of μ. For example, gender and race (white indicator) don t significantly differ from zero; negative log-age means that older people have, on average, less say; French people have, on average, more say than Americans (USA is the reference country because its indicator is not in the regression). The four vignettes are assumed to have fixed values of θ s, whose estimates are Figure 6: Amount of say in the vignettes (θ s) "Amount of say" of vignettes Value (θ) Standard Error Mike (q29) -1.0194 0.1986 James (q30) -0.9973 0.1979 Mary (q31) 0.4048 0.1993 Rita (q32) -0.6858 0.1972 It is difficult to see from this table how the vignettes compare to the respondents, but we clearly see how the vignettes are ordered. In particular we may notice that Mike

and James are almost identical, so the analysis results would not change if one of these two vignettes was missing. However, the situation may change if we add more background variables to the model. The last component of the model is the thresholds, which people use to categorize their perception of μ. Figure 7: Regression coefficients for thresholds (τ) Thresholds No / Some Some / A lot A lot / Complete Intercept -3.113-0.5875 0.9115 gender -0.0783-0.0371-0.0281 log-age 0.5052 0.165-0.0858 white -0.1164-0.0607-0.0174 Brazil 0.5096 0.204 0.2775 Canada -0.0608-0.2231-0.1538 France 0.6496 0.4171 0.6749 Hungary -0.0928-0.4522-0.6013 Mexico -1.1795-1.1087-0.8138 Spain -0.8199-0.8097-0.9778 China -0.0665-0.7084 0.0171 This table can be useful for comparing standards of classification between different strata. For example, white people have a bit lower standards than non-white; Mexicans and Spanish have considerably lower standards than Americans (that is, if an average American judges someone to have A lot of say, an average Mexican may classify as Complete say ). To visualize the estimated mean amount of say between countries, we fix other parameters of the model, thus picking a reference group, and calculate μ s from the model for that reference group across countries. We can also calculate the thresholds for that reference group and see how each country would be classified by an average representative of the reference group. Note that it is not important how we choose this group when we compare just the respondents, since it will only affect the location of estimated μ s but not the differences between them. If we want to compare respondents with vignettes, the reference group is important because vignettes are fixed values. The chart below uses the following reference group: Age = 30 year old, race= white, gender = male. The thresholds shown are those of the USA part of the reference group.

Figure 8: comparison with the vignette technique How much say do you have in what happens to your private information? Complete A lot Some Mexico Spain Mike (q29) Hungary James (q30) Rita (q32) Canada USA Brazil China Mary (q31) France No We see that, as was intuitively expected in the preliminary analysis, France remains at the head of the list. It is still followed by China, but its difference with France has changed from none to quite big. Another big change has occurred with regard to Mexico, which exhibited very low standards for having much say, and we now see it at the tail of our ranking. The same situation applies to Spain. We still observe that all countries would be classified as having at least Some say, however we now have to add that this is according to American standards (thresholds). If we picked extreme thresholds, like those of Mexico or France, we might see a very different classification picture. Which responses to use? After fitting the vignettes model we can calculate the estimated amount of say μ for each respondent. These values are supposed to provide a score on how much say respondents have in what happens to their private information, consistent in the crosscultural context. The reader may now wonder if he or she should use the original or corrected responses in any subsequent analysis involving Q2. It also becomes a point of argument whether the other questions, where vignettes were not available, provide a valid picture of the concepts they address? The answer is that both original and corrected responses may be appropriate to use, depending on what interpretation the results are given. When we use raw responses, we really only work with people s opinion, and therefore all findings should be stated

accordingly. For example, we have seen that 18.4% of 1071 Mexicans reported having Complete say in what happens to their personal data. It would be incorrect to infer from this that 18.4±0.47% in the population of Mexico have Complete say ; this is rather an estimate of how many people in Mexico would rank themselves into this category. After we have reanalyzed the responses with the vignettes, we can obtain objective results. For example, we can quantify the difference in the amount of say between Mexico and other countries or compute correlation of amount of say with other variables. Appendix: Vignette questions Q29: [Mike] goes to the drug store to buy film, which was advertised to be on sale. He finds out at the store that in order to receive the discount, he must apply for a customer loyalty card, which involves filling out an application form. It requires [Mike] to fill out his home address, occupation, and marital status. He fills the form out to get the special pricing. To what extent does [Mike] have a say in what happens to his personal information? Q30: As part of the concern for national security, assume that the government creates a database to search for terrorist activity. All government records are merged with any available commercial data such as bank records, credit statements, and travel manifests. Citizens, such as [James] are required to provide fingerprints, photographs, and iris scans. [James] does this. To what extent does James have a say in what happens to his personal information? Q31: [Mary] pays cash at a large, crowded department store and provides no information about herself to the cashier. The cashier asks for [Mary s] postal code/zip code; [Mary] refuses and still makes her purchase. To what extent does [Mary] have a say in what happens to her personal information? Q32: Assume that as part of the concern for national security, the government creates a database to search for terrorist activity. The government requires everyone, including [Rita], to submit an annual form containing detailed information about themselves. The form asks for employment information, criminal activity, and any travel abroad by the individual or any family members in the last five years. [Rita] complies, but decides not complete the section on travel. To what extent does [Rita] have a say in what happens to her personal information? References King, Gary, Christopher J. L. Murray, Joshua A. Salomon, and Ajay Tandon, Enhancing the Validity and Cross-Cultural Comparability of Measurement in Survey Research, American Political Science Review, 98(1), 191-207, 2004.