Does fiscally induced migration enhance tax competition? Evidence from Switzerland

Similar documents
Statistical Evidence and Inference

Research Philosophy. David R. Agrawal University of Michigan. 1 Themes

Returns to Education and Wage Differentials in Brazil: A Quantile Approach. Abstract

Conditional Investment-Cash Flow Sensitivities and Financing Constraints

Human capital and the ambiguity of the Mankiw-Romer-Weil model

STOCK RETURNS AND INFLATION: THE IMPACT OF INFLATION TARGETING

Investment is one of the most important and volatile components of macroeconomic activity. In the short-run, the relationship between uncertainty and

Banking Concentration and Fragility in the United States

Lobby Interaction and Trade Policy

Product Di erentiation: Exercises Part 1

The Economics of State Capacity. Ely Lectures. Johns Hopkins University. April 14th-18th Tim Besley LSE

Trade and Synchronization in a Multi-Country Economy

Estimating the Incidences of the Recent Pension Reform in China: Evidence from 100,000 Manufacturers

Mobility and local income redistribution

Electoral Manipulation via Voter-Friendly Spending: Theory and Evidence

Growth and Welfare Maximization in Models of Public Finance and Endogenous Growth

1 Unemployment Insurance

These notes essentially correspond to chapter 13 of the text.

"Inequality, Growth and Investment"

Estimating the Return to Endogenous Schooling Decisions for Australian Workers via Conditional Second Moments

The Elasticity of Taxable Income: Allowing for Endogeneity and Income Effects

The Economics of State Capacity. Weak States and Strong States. Ely Lectures. Johns Hopkins University. April 14th-18th 2008.

Optimal Progressivity

Monetary Economics: Macro Aspects, 19/ Henrik Jensen Department of Economics University of Copenhagen

Problem Set # Public Economics

EconS Advanced Microeconomics II Handout on Social Choice

Empirical Tests of Information Aggregation

For Online Publication Only. ONLINE APPENDIX for. Corporate Strategy, Conformism, and the Stock Market

Online Appendix. Moral Hazard in Health Insurance: Do Dynamic Incentives Matter? by Aron-Dine, Einav, Finkelstein, and Cullen

Effects of Inequality on Redistribution

Conditional Investment-Cash Flow Sensitivities and Financing Constraints

Factors that Affect Fiscal Externalities in an Economic Union

Principles of Econometrics Mid-Term

Partial Centralization as a Remedy for Public-Sector Spillovers: Making Interjurisdictional Transportation a National Responsibility

Supply-side effects of monetary policy and the central bank s objective function. Eurilton Araújo

Using Surveys of Business Perceptions as a Guide to Growth-Enhancing Fiscal Reforms

The E ciency Comparison of Taxes under Monopolistic Competition with Heterogenous Firms and Variable Markups

1 Income Inequality in the US

Problem Set #5 Solutions Public Economics

Spatial interaction in local expenditures among Italian municipalities: evidence from Italy

Companion Appendix for "Dynamic Adjustment of Fiscal Policy under a Debt Crisis"

Inequality and the Process of Development. Lecture III: Inequality and Human Capital Promoting Institutions

1. Cash-in-Advance models a. Basic model under certainty b. Extended model in stochastic case. recommended)

Intergenerational Bargaining and Capital Formation

OPTIMAL INCENTIVES IN A PRINCIPAL-AGENT MODEL WITH ENDOGENOUS TECHNOLOGY. WP-EMS Working Papers Series in Economics, Mathematics and Statistics

Advertising and entry deterrence: how the size of the market matters

Electoral Rules and Income Tax Progressivity

Fuel-Switching Capability

Monetary credibility problems. 1. In ation and discretionary monetary policy. 2. Reputational solution to credibility problems

Optimal Trade Policy and Production Location

Problem Set # Public Economics

Final Exam, section 1

Financial Liberalization and Neighbor Coordination

Cardiff University CARDIFF BUSINESS SCHOOL. Cardiff Economics Working Papers No. 2005/16

Downstream R&D, raising rival s costs, and input price contracts: a comment on the role of spillovers

Labor Economics Field Exam Spring 2014

Political support for the private system to nance political parties

Child Care Subsidies and the Work. E ort of Single Mothers

Organizing the Global Value Chain: Online Appendix

Strategic information acquisition and the. mitigation of global warming

Determinants of Ownership Concentration and Tender O er Law in the Chilean Stock Market

Equilibrium Asset Returns

5. COMPETITIVE MARKETS

How Do Exporters Respond to Antidumping Investigations?

Advanced Industrial Organization I Identi cation of Demand Functions

The Welfare Cost of Asymmetric Information: Evidence from the U.K. Annuity Market

1. Money in the utility function (continued)

Optimal Unemployment Bene ts Policy and the Firm Productivity Distribution

Product Di erentiation. We have seen earlier how pure external IRS can lead to intra-industry trade.

Financial Market Imperfections Uribe, Ch 7

SOLUTION PROBLEM SET 3 LABOR ECONOMICS

Working Paper Series. This paper can be downloaded without charge from:

The long run e ects of taxes and tax competition on top income shares: an empirical investigation ú

Appendix to: The Myth of Financial Innovation and the Great Moderation

Chapters 1 & 2 - MACROECONOMICS, THE DATA

Introducing nominal rigidities.

Unfunded Pension and Labor Supply: Characterizing the Nature of the Distortion Cost

Location Decision of Heterogeneous Multinational Firms

An Allegory of the Political Influence of the Top 1%

Fiscal policy and minimum wage for redistribution: an equivalence result. Abstract

Fiscal Consolidations in Currency Unions: Spending Cuts Vs. Tax Hikes

Executive Summary. Trends in Inequality: Globally and Nationally. How inequality constraints growth

EC202. Microeconomic Principles II. Summer 2011 Examination. 2010/2011 Syllabus ONLY

DEPARTMENT OF ECONOMICS DISCUSSION PAPER SERIES

Consumption-Savings Decisions and State Pricing

Microeconomics, IB and IBP

Multivariate Statistics Lecture Notes. Stephen Ansolabehere

The exporters behaviors : Evidence from the automobiles industry in China

FISCAL FEDERALISM WITH A SINGLE INSTRUMENT TO FINANCE GOVERNMENT. Carlos Maravall Rodríguez 1

Population Economics Field Exam September 2010

Inequality and GDP per capita: The Role of Initial Income

1. Monetary credibility problems. 2. In ation and discretionary monetary policy. 3. Reputational solution to credibility problems

1 Modern Macroeconomics

LABOR SUPPLY RESPONSES TO TAXES AND TRANSFERS: PART I (BASIC APPROACHES) Henrik Jacobsen Kleven London School of Economics

Network Effects of the Productivity of Infrastructure in Developing Countries*

The Long-run Optimal Degree of Indexation in the New Keynesian Model

1 Two Period Production Economy

Ine ciencies in regional commuting policy

Effective Tax Rates and the User Cost of Capital when Interest Rates are Low

Technology, Employment, and the Business Cycle: Do Technology Shocks Explain Aggregate Fluctuations? Comment

Transcription:

Does fiscally induced migration enhance tax competition? Evidence from Switzerland Florence Miguet* This version: January, 2006 Abstract This paper analyzes the problem of strategic tax setting at the local level in a direct democracy using Swiss data for a canton (Vaud). This setting is useful to isolate the determinants of tax rates and to test the predictions of the political economy models as both the scal autonomy of the Swiss municipalities and the internal migration rate are very high. Using spatial econometrics and instrumental variables, I provide strong evidence that municipal income taxes interact positively. However, only a small part of these interactions can be explained by a tax base e ect. These results suggest that concerns about a race to the bottom of tax rates should be minimal and alleviate fears of excessive scal competition induced by migration. JEL classi cation: D72, H2, H3, H7, P16, R23 Key words: Strategic interactions, tax competition, residential choice, spatial econometrics, political economy, Switzerland * Department of Economics, University of Geneva, bd du Pont d Arve 40, 1211 Genève 4, E-mail address: miguet@ecopo.unige.ch I am grateful to Jaime de Melo, Tobias Mueller, and Frédéric Robert-Nicoud for their helpful comments, suggestions and encouragements. Support by the Swiss National Science Foundation (Grant n 612-066094) is gratefully acknowledged.

1 Introduction Between countries as well as within countries, disparities in tax levels are rather large. With ever decreasing migration costs and the opening of borders to migration - as is the case for instance within the EU - fears about a race to the bottom of the tax levels and of the welfare state have become an important subject of debate in many developed countries. Indeed, big di erences in tax rates between jurisdictions might have a signi cant impact on the residential decision of households, hence fostering tax competition between them. Since the beginning of the eighties, there has been an outpouring of literature on the determinants of tax rates. In their in uential 1981 article, Meltzer and Richard have argued that the tax rate set by a jurisdiction governed by a median voter is negatively in uenced by its median to mean income ratio. However, empirical evidence in favor of this prediction is at best mixed. 1 Based on the observation that, in an ever more globalized world, jurisdictions cannot be considered as independent from each other, Case et al. (1993) have opened the way to a strategic interaction modelling of tax rates. At the theoretical as well as at the empirical level, strategic interactions seem to have an important role to play among the determinants of tax rates. 2 Nevertheless, even if many reasons governing tax interactions have been evoked in the literature, there is almost no evidence on the channels through which strategic interaction occur. 3 Finding answers to this question may yet have signi cant implications from a scal federalism point of view. Indeed, if tax interactions occur mainly through spillover e ects, i.e. residents of one jurisdiction consume or pay for the public goods provided by neighboring jurisdictions, or mainly through tax induced migration, the policy implications may be quite di erent. In the case of scally induced migration, the fear of a race to the bottom might have to be taken seriously and decentralization of scal competencies might be ine cient. 1 Meltzer and Richard (1983), Alesina and Rodrik (1994), Lindert (1996), Milanovic (2000), Borge and Rattsø (2005) nd some support for the Meltzer-Richard prediction but Persson and Tabellini (1994), Perotti (1996), Bassett et al. (1999), Rodriguez (1999) nd no support at all. 2 Case et al. (1993), Heyndels and Vuchelen (1998), Figlio et al. (1999), Saaveda (2000), Brueckner and Saaveda (2001), Revelli (2002), Baiker (2005), Solé-Ollé (2006) nd that strategic interactions of tax rates and expenditures levels between geographically close jurisdictions are strong and signi cantly positive. See Brueckner (2003) for an overview. 3 To my knowledge, the few exceptions are Besley and Case (1995), Solé-Ollé (2006) and Brett and Pinkse (2000). The rsts nd that, in a representative democracy, an incumbent politician who can run for reelection considers his neighbor jurisdictions tax rate as a yardstick to determine his own. The second tests local crowding spillovers against bene t spillovers but without considering scally induced migration. The thirds nd some evidence that tax base competition among local jurisdictions is small and that it does not have a detrimental e ect on business property taxes in Canada. 1

The main goal of this paper is to attempt to ll this gap by explicitly considering tax induced migration as a potential vector of strategic tax interaction between jurisdictions in Switzerland. Moreover, relying on a unique data-set comprising many local jurisdictions characteristics, I will also improve upon past empirical evidence by controlling better for the institutional setting governing scal policy making. First, the Swiss political system is built on direct democracy with Swiss citizens having extended rights to participate in the policy-making process and to in uence the political outcome according to their preferences. Hence, compared with most developed countries, Switzerland provides without doubt one of the most adequate settings to test the predictions of the median voter models. Second, using Swiss municipal data reduces considerably the problem of comparability of decentralized scal competencies which usually hampers empirical papers. Indeed, the Swiss municipalities within a canton all have the same public services to provide and the same constraints on the way of setting taxes. Basically, every year, every municipality has to choose a percentage to levy - a tax multiplier - on an exogenously given income tax schedule. This means that the scal competition between municipalities is restricted to an unidimensional choice. However, despite this constraint, di erences in municipal tax multipliers have remained large during the last two decades. For instance, an unmarried individual with a yearly income of 70 000 Frs had to pay up to 10 829 Frs cantonal and communal income taxes in Château d Oex and only 7 784 Frs in Saint-Sulpice (both communes are located in the same canton). Third, as Swiss municipalities rely heavily on income taxes to nance their expenditures, the tax multiplier chosen by a municipality might have a substantial impact on households residential choice. Combined with a high migration rate between municipalities, the database used in this paper should be very informative to isolate the e ect of scally induced migration on local tax rates. Worth noting is that the municipalities chosen are geographically very close together, thus households are supposed to be exposed to the same labor market conditions wherever they choose to reside. As main results, I found no signi cant evidence in favor of the Meltzer and Richard model but a strong and signi cant evidence that taxes interact positively within a given geographical area. Moreover, I was also able to identify that scally induced migration is only a very minor channel through which taxes interact strategically. The remainder of the paper is structured as follows. Section two discusses the political economy model of scal policy and of residential choice developed by Kessler and Lülfesmann (2004) that provides an appropriate model to interpret the empirical framework in section four. Section three then illustrates Switzerland s federal system and its unique direct 2

democracy set-up. It also presents the advantages of using Swiss communal data for testing the tax setting behavior of local communities and summarizes some stylized facts about the main variables entering in the tax setting model. Section four reviews the relations to be estimated and discusses further the empirical methodology to be used. It also presents the main results obtained and analyses them in light of the theoretical predictions. Section ve seeks to validate these results by means of diverse robustness checks. Conclusions follow in section six. 2 The political economy of scal policy in direct democracy: A framework In Switzerland, self-interested nationals can vote on tax levels both in the voting booth - in their residential locality - and with their feet since barriers to mobility are low and internal migration rates are high. Moreover, at a communal level, the median voter s tax choice is unidimensional as he only chooses the tax level and not the shape of the tax schedule. Kessler and Lülfesmann s (2005) model of residential and political choice is a useful framework to analyze the tax setting behavior of local jurisdictions. 2.1 Tax setting in a median voter framework without relocation Following Kessler and Lülfesmann, suppose that each local jurisdiction j nances the supply of its local public services g j exclusively by a proportional income tax t j on its residents. Given that the local jurisdiction s budget has to be balanced, we get: g j = (t j 1 2 t2 j)y j where y j is the mean income in jurisdiction j and t 2 [0; 1] is the local jurisdiction s proportional income tax level (the economy is composed of J jurisdictions which are populated by a continuum of individuals i who can move freely between these jurisdictions and locate only in one). Note that levying public funds entails a cost 4 represented by the term 1 2 t2 j y j. Individuals derive utility from the consumption of a private good c i and from local public services g j. The amount of public services received is supposed to be the same for every 4 Without this assumption, the median voter could vote for a tax rate of 100%. For an overview of the other reasons explaining the limits to redistribution, see Harms and Zink (2003) 3

individual in a given jurisdiction. Moreover, the individuals are heterogeneous with respect to their exogenously given income w i and to their preference intensity for public services i 1. Incomes are supposed to be distributed according to a continuous distribution skewed to the right so that the mean income is greater than the median income. Preferences are supposed to be distributed uniformly and independently from income. Utility of individual i is given by: U i (c i ; g j ; i ) = c i + i g j and his budget constraint is the following: c i = (1 t j )w i Subject to the local jurisdiction budget constraint, the median individual living in jurisdiction j - with income w m j and preference m j - will choose the tax level t j so as to maximize his indirect utility function: V (t j ; g(t j ; y j ); w m j ; m j ) = (1 t j )w m j + m j (t j 1 2 t2 j)y j Under a right-skewed distribution of income - mean income greater than median income -, this maximization problem yields the following solution: t j = 1 w m j =m j y j (1) Hence, the local tax rate is decreasing in the ratio of median to mean income as already identi ed by Meltzer and Richard (1981) 5 i.e. more income inequality increases the tax burden. Not surprisingly, the tax rate is also positively in uenced by the median voter s preference level for public services. Moreover, by supposing that preferences i can either be low or high, Kessler and Lülfesmann have shown that an asymmetric tax equilibrium with (imperfect) sorting of population according to intrinsic preferences exists. 6 In this simple political economy framework on the determination of the equilibrium tax rate, the average income y j in a local community was considered as given by the median voter. This assumption could correspond either to a situation where individuals cannot move freely across jurisdictions after tax rates in every locality have been chosen or to 5 However, in the Meltzer and Richard model, the tax rate depends on the median to mean income through a labor-leisure choice and not through an e ciency cost due to redistributive taxation. 6 The smaller the taste di erential, the stronger the motive for wealthy individuals to escape taxation and the strati cation into rich and poor communities is more pronounced. However, if preferences are identical for all individuals, no sorting equilibrium can exist and hence no migration equilibrium. 4

a myopic voter case - where voters in each jurisdiction ignore the e ects of taxes on migration. However, as both alternatives are quite unrealistic, next section relaxes them by introducing a relocation stage and by considering a non myopic median voter 7 in the framework developed so far. 2.2 Adding relocation Adding a relocation stage to their framework and solving it as two-stage game with perfect information, Kessler and Lülfesmann (2005) show that the same tax equilibrium - with (imperfect) sorting of population - as in the case with no relocation can exist. 8 Hence, adding scally induced migration and a non myopic median voter does not necessarily destroy sorting of population if preferences between individuals are su ciently heterogeneous. 9 For my empirical tests, it implies - by using backward induction - that the average income of a given jurisdiction cannot be assumed to be exogenous anymore 10. A reduced form of this relation can be given by: y j = y j (t j ; t j ) (2) where the subscript j indicates all the communities except j. As the median voter has to choose the tax rate which maximizes his utility taking into account the expected migration response, the local tax level can be determined in the following way: t j = t j (y j ; w m j ; m j ) (3) where y j depends on the outcome of last stage of the game. Finally, given the endogeneity of taxable income y j, the reduced form of the tax setting equation can be written as: t j = t j (t j ; w m j ; m j ) (4) 7 One of the main results of Epple et al. (2001) is that empirical ndings reject myopic voting models. 8 However, the set of equilibria is larger than in the case with no relocation, but it is not possible to characterize the full set of equilibria. 9 Sorting thus remains a robust outcome if one extends the traditional Tiebout multi-community model to a more dynamic framework which allows individuals to migrate again after local policies have been determined. 10 As every individual might move in response to - own and neighbor - tax rates, the tax base of a given jurisdiction also depends on these tax rates. 5

which, in the strategic interaction models, has been referred to as a reaction function and has often been estimated empirically. However, given the framework developed here, this reaction function is supposed to depend only on the mobility of the individuals, i.e. on a tax base e ect. 11 This reduced form framework is su cient to guide my empirical study. Indeed, I will proceed as follows. First, I will estimate equation (1). Next, I will explicitly consider strategic interaction and estimate equation (4) as done, for instance, by Case et al. (1993), Brett and Pinkse (2000), Solé-Ollé (2006). Since by estimating equation (4) in isolation it may be impossible to assert that tax competition arises because of scally induced migration, I go further and estimate equations (2) and (3) rst separately and then jointly. 3 Fiscal autonomy and direct democracy: Considering the Swiss communes Thanks to its federal structure and to its direct democracy, Switzerland provides a unique setting to study the shaping of tax and redistributive policies. The Swiss federal structure is built on three levels of jurisdictions: the confederation (federal level), the cantons (state level composed of 26 jurisdictions) and the communes (local level composed of about 3 000 municipalities), which all have a great level of scal autonomy. Furthermore, in the context of the Swiss direct democracy, at every level of jurisdiction, Swiss citizens have extended rights to participate in scal policy making. 3.1 Fiscal policy in an homogeneous setting In Switzerland, the tax and redistributive competencies are spread almost equally among the three levels of jurisdictions. 12 The local jurisdictions, even if not fully sovereign and submitted to the limits imposed by their cantonal Constitution, 13 must provide public services and have the obligation to nance them, thus giving them a large degree of scal autonomy and implying large disparities in tax rates and in expenditures per capita. 11 As noted by Saavedra (2000), strategic behavior may [also] arise if welfare migration is negligible, but if state o cials think that it occurs. 12 Over the last three decades, the relative income share of the Swiss local jurisdictions was about 27%. 13 See article 50 of the federal Constitution. 6

On the income side of their budget, the Swiss communes rely mainly on personal income taxes, i.e. to about 60% to 70% of their revenues -. 14 However, their ability to raise income taxes is limited to setting an annual tax multiplier, i.e. to levy a surcharge on the cantonal tax schedule. Hence, the level of progressivity of local taxes is totally independent of communal policy. On the expenditure side of their budget, the communes have large expenditures on education (especially primary school), environment, culture and recreation, health and social security. 15 To avoid having to deal with the multidimensional nature of personal income taxes - level and progressivity -, I only focus on the communes located in one canton. Moreover, as the distribution of competencies between the communes and the canton di er from one canton to the other, choosing a unique canton should also help to avoid this source of bias. The canton I choose to use is Vaud. The level of variability in its communal tax multipliers is one of the highest in Switzerland, i.e., in 2000, an unmarried individual with a yearly income of 70 000 Frs had to pay up to 10 829 Frs cantonal and communal income taxes in Château d Oex and only 7 784 Frs in Saint-Sulpice. From a political-economy point of view, as reported by Feld and Kirchgässner (1999), the budget draft, the tax rate and the budget de cits of the communes of the canton Vaud were mainly controlled by the voters in an obligatory referendum, an optional referendum or a local assembly. Moreover, foreign residents are excluded from the political decision-making process, i.e. they don t have the right to vote and to be elected although they pay the same taxes as nationals and are entitled to the same social bene ts. 3.2 Some stylized facts about communal tax multipliers Associated either with spillover e ects, a tax mimicking behavior of authorities or scally induced migration, the tax multiplier set by a given commune is expected to depend on the tax multipliers set by its neighbors. How does this prediction t the data for the communes of Vaud? To rst assess a possible spatial correlation between communal tax multipliers, I represent them in a Moran scatterplot. The Moran scatterplot reports the standardized communal tax multipliers (on the horizontal axis) against the weighted standardized neighbor tax multipliers (on the vertical axis). To obtain the weighted neighbor tax multipliers, I 14 The communes have a rather low dependence - on average 14% of total revenues - on transfer payments. 15 It is noteworthy that, contrary to social security (e.g. old age pensions and disability insurance) which is mainly federal, social assistance, which is highly redistributive in its nature, is under the competency of the cantons and of the communes. 7

considered as neighbor the communes geographically close together and used a geographical weight matrix as described in Appendix II. The Moran scatterplot reported in gure 1 shows that most observations are located on the upper right and on the lower left, suggesting that spatial dependences between communal tax multipliers seem to be strongly present, i.e. a commune with a low (high) tax rate is mostly surrounded by other communes with low (high) tax rates. Figure 1: Moran scatterplot for tax mulipliers, 2000, 15 km distance threshold 2 Moran scatterplot (Moran's I = 0.439) tax_00 1 0 Wz -1-2 -3-4 -3-2 -1 0 1 2 3 z Moreover, as is con rmed by the Moran I 16 and the Geary s C 17 statistics reported in table 1, there is clearly a positive and signi cant spatial autocorrelation between the 16 The Moran I is a spatially weighted autocorrelation measure given by: I(d) = where S 2 = 1 N NP (x i x) 2. Its theoretical mean in the absence of serial autocorrelation is i NP NP i j w ij (x i x)(x j x) S 2 N P i NP w ij J 1 w 0 and it N 1 is supposed to follow a normal law. 17 The Geary s C is a spatially weighted sum of square di erences between observation as given by: C(d) = NP NP (N 1) 2 w ij (x i x j ) 2 i j NP NP. Its theoretical mean in the absence of serial correlation is 1 and it is supposed to w ij (x i x) 2 i j follow a normal law. 8

communal tax multipliers. Note that the Moran I statistic can be represented in gure 1 by a regression line passing through the origin. Table 1: Moran and Geary global statistics Moran I * Geary s C ** 10 km 15 km 20 km 10 km 15 km 20 km 1980 0.399 0.360 0.337 0.582 0.618 0.638 1990 0.513 0.472 0.443 0.476 0.518 0.546 2000 0.480 0.439 0.410 0.510 0.552 0.578 Note: All values are signi cant at a 1% level H 0 : no spatial correlation * H 0 : I = 0 ** H 0 : C = 1 What about the evolution of the tax multipliers from 1980 to 2000? As summarized in table 2, the average communal tax multiplier over the last 20 years has remained quite stable. Table 2: Communal tax multipliers, in percent, by years Mean Std. Min. Max. 1980 96.75 22.81 20 160 1985 94.88 20.89 30 160 1990 95.70 21.08 40 150 1995 96.67 19.26 40 140 2000 96.63 17.33 40 135 Source: Vaud s Statistical O ce (SCRIS) Moreover, the autocorrelation matrix of the communal tax multipliers (available upon request) indicates that changes in communal tax multipliers are rather smooth. Indeed, the autocorrelation coe cients are very close to 1 at a ve-year interval and it decreases to 0.6 at a twenty-year interval. i.e. the ranking of the communes in terms of a standardized tax multiplier level shows some smooth evolution. 9

Given these stylized facts about the evolution of tax multipliers and the persistence of their large cross-sectional variability, it seems that - at rst sight - no race to the bottom occurred. 3.3 Fiscally induced migration, tax base and tax multipliers 18 Two important conditions to be satis ed for tax competition to occur are, rst, that tax rates signi cantly enter the residential choice of an individual (this assumption also needs to be satis ed in the Tiebout-like sorting of population models) and, second, that communes signi cantly take into account changes in their tax base when deciding upon their tax rates. The latter condition implies that tax base e ects may strongly limit the ability of communes to raise revenues and/or to redistribute resources adequately. Given that tax multipliers can in uence the tax base available for a commune and viceversa, both variables should be highly negatively correlated. This is indeed the case since the correlation coe cient, in 2000, is 0:71, i.e., on average, the low tax communities are also the communities with the highest tax base. Moreover, when mapping these variables in a geographical space, clustering of low tax/high income and high tax/low income communities seems to be strongly present. For instance, the low tax and the high income communities are mainly clustered around lake Geneva, especially on its western coast. Finally, is there evidence in favor of scally induced migration in Switzerland? By using alternatively the 26 Swiss cantons and the 137 biggest Swiss communes in 1990, Feld (2000) nds some evidence that the level of taxation in uences signi cantly the distribution of income in a given jurisdiction. The higher the tax rate, the lower the share of high income earners in a jurisdiction, which con rms that scal incentives on average matter in the residential choice of Swiss citizens. 19 This conclusion holds both at the cantonal or at the communal level but it is stronger at the communal level. Furthermore, tax di erentials mainly a ect the residential choice of the high income group. Finally, he nds that scal incentives rather consist of tax rates than of transfer levels. Using a random utility model on individual residential choices for a set of Swiss communes within the same area, Schmidheiny (2005a) establishes that local income taxation is [...] the major source for income sorting of moving households. Given these two pieces of evidence, it is promising to study more thoroughly the interaction between tax multipliers and migration at the communal level. Moreover, based on individual information about the place of residence ve years before 18 Data are issued from the Swiss Federal Statistical O ce (OFS): Public Finances in Switzerland, diverse years. 19 Kirchgässner and Pommerehne (1996) reached the same conclusion using earlier data. 10

the census, I nd that in 2000 about 17% of the population of the canton of Vaud made an intercommunal move and that this rate is much larger than the rate of intercantonal and international moves. 4 Results Following the predictions in the literature, this section starts by estimating the determinants of communal tax multipliers as predicted by a median voter model without relocation. Since the results obtained are not conclusive, I augment my estimations by taking explicitly into account possible strategic interactions between communes. Given the encouraging results obtained, I then decompose this relation into a tax base and a tax setting e ect. The summary statistics and some explanations on the construction of the variables are given in Appendix I. 4.1 Median voter model without strategic interaction To open the discussion about the empirical determinants of the tax setting behavior of communes, I start by estimating a speci cation close to equation (1). Assuming that communal tax rates are independent from each other and that the tax setting behavior of a commune can be approximated by a linear relationship, the empirical counterpart to equation (1) can be written as: t j = wm j y j + X j + " j (5) where t j is the tax multiplier of commune j = 1; :::; J, wm j y j is the ratio of median to mean income, y j is the mean income, X j is a vector of communal characteristics that are arguably exogenous and " j is an error term. Econometric issues To estimate this equation by ordinary least squares (OLS) may not be consistent. Indeed, if scally induced migration is to be observed among communes, the median and the mean income may be endogenous. For instance, if an increase in the tax rate induces an out ow of high earner individuals, the communal mean and median incomes will both decrease, thus inducing a change in the ratio of median to mean income. To control for endogeneity, I 11

use the instrumental variable technique (2SLS) but nding good instruments for explaining the ratio of median to mean income is di cult. Since the own and neighbor geographical characteristics may be important determinants of the residential choice of an individual, I will use them as instruments. Which variables, in addition to the ratio of median to mean income, could determine the communal tax multiplier? As communal expenditures are mainly devoted to nancing proximity services, the communal population s characteristics might be important determinants of tax multipliers. For instance, the proportion of school-age children in a commune may highly in uence its expenditures as one of the commune s main role is to nance public schools. However, as the communal tax multiplier may in uence the residential choices of households, it may also in uence its population characteristics, hence a careful attention will be needed in the choice of these variables. Another set of variables to be considered includes the geographical characteristics of the communes such as their altitude or their share of industrial area. Moreover, in addition to the level of inequality given by the ratio of median to mean income, the level of taxable income per taxpayer may play a very signi cant role in the tax setting equation (5). Indeed, as all the communes are committed to provide the same kind of public services, the size of their tax base may have a very important e ect on their tax level. As already mentioned, given the possible endogeneity of the tax base, I also control for it by using the same set of instruments as for the ratio of median to mean income. Results The results of the estimation of equation (5) are given in table 3. Columns [1], [2] and [4] are estimated by OLS with robust standard errors 20 and columns [3] and [5] by 2SLS with robust standard errors. However, both sets of estimates may be biased. Indeed, if the median to mean income is endogenous, OLS is likely to be inconsistent and if the instruments used are weak - as is the case here 21-2SLS may also be inconsistent. For the estimations in columns [2] to [5], we notice that the ratio of median to mean income has the expected negative sign but it is not signi cant. Given the weakness of my instruments, it is di cult to assess clearly that the predicted negative impact of the ratio of median to mean income on the tax rate is validated by this data. 20 Robust to heteroscedasticity by using White s estimator of the variance. 21 To test the weakness of the instruments, I relied on an F -test on the joint signi cance of the excluded instruments in the 1st stage. 12

Columns [4] and [5] con rm that, as expected, the taxable income per capita has a high explanatory power. The higher the taxable income per taxpayer, the less the commune has to tax its residents in order to the get the su cient amount of money to provide its public services. Table 3: Static tax setting estimations, 2000 [1] OLS [2] OLS [3] 2SLS [4] OLS [5] 2SLS Median/mean income -7.404-137.602-10.035-109.744 [0.49] [1.56] [0.83] [1.37] Taxable income -0.007*** -0.008*** [6.36] [4.82] R-squared 0.45 0.45 0.62 Robust t-statistics in brackets, Other control variables not reported * signi cant at 10%; ** signi cant at 5%, *** signi cant at 1% Spatial interaction tests In order to validate the assumption that, contrary to the static median voter model, communes set their tax level interdependently, I tested for spatial interactions. After every estimation done in the preceding section, I calculated the Lagrange multiplier and the robust Lagrange multiplier test statistics for spatial lag and for spatial error 22 by using a 15 km weight matrix as described in Appendix II. Under the null (H 0 ) of no spatial interaction both test statistics follow a 2 (1). Table 4: Spatial interaction tests, 2000 [1] [2] [3] [4] [5] LM lag 161.974*** 161.680**** 101.543*** 54.788*** 26.484*** RLM lag 69.623*** 69.082*** 39.916*** 42.363*** 20.713*** LM error 105.003*** 105.643*** 69.539*** 12.443*** 5.778** RLM error 12.652*** 13.045*** 7.912*** 0.019 0.064 H 0 (no spatial interaction) rejected: * at 10%; ** at 5%, *** at 1% 22 See Anselin (1996) for a description of these four tests. 13

The results of these tests are given in table 4. For columns [1] to [3], I could reject - at a 1% level - the null hypothesis of no spatial interaction either by spatial lag or by spatial error. For columns [4] and [5], the robust Lagrange multiplier reveals that - even at a 10% level - the errors are not signi cantly spatially correlated. Given that the robust Lagrange multiplier for spatial error is robust to the presence of spatial lag, it clearly encourages us to consider that strategic interactions mostly occur through spatial lag rather than through spatial error. 23 Hence, not including neighbor taxes as a determinant of own tax may seriously bias the estimation. 4.2 Adding strategic interaction Since the Lagrange and robust Lagrange multiplier tests for spatial lag signi cantly indicate that spatial correlation between communal tax multipliers is present, I start by estimating a standard reaction function as given by equation (4). Taking for simplicity a linear approximation, the speci cation to estimate is given by: t j = k6=jw P jk t k + X j + " j (6) where t j is the tax multiplier of commune j = 1; :::; J, w jk represents the weight assigned to the neighbor commune k, X j is a vector of communal characteristics and " j is an error term. Given the large number of weights w jk - 73 536 - to determine, it is not possible to estimate them along with the other parameters of equation (6). This implies that I must assume them to be known and therefore specify them a priori. Supposing that scally induced migration occurs mostly at a very local level and that communes have larger social ties with communes nearby, I consider geographical distance between communes as a good proxy for neighborliness. For all the estimations reported a 15 km threshold is used. 24 The construction of the weights is described in Appendix II Econometric issues The main econometric concern when estimating equation (6) is that the neighbor tax multipliers are endogenous. The reason is that the communal tax multipliers are all jointly 23 Revellli (2002) nds the same kind of pattern. Moreover, robustness checks in order to explicitly take into account the eventual problem of spatially correlated errors have been done but the main results presented in next section are not a ected by this correction. 24 Using a 10 km or a 20 km threshold does not change results signi cantly. 14

determined in exactly the same fashion. As a result, the OLS estimates of the parameters of equation (6) may be a ected by a simultaneity bias. This endogeneity can be addressed either by using an instrumental variable approach or a maximum likelihood method. For my estimations, I rely on the 2SLS technique as used for instance by Besley and Case (1995), Brett and Pinkse (2000), Solé-Ollé (2006). The main advantage of using 2SLS instead of maximum likelihood is that, even in the presence of spatial error dependence, the estimated coe cients remain consistent. 25 Moreover, given the set-up of the model, nding instruments is not too di cult. Indeed, as the communes are considered as setting their tax rate in a perfectly identical way, the most straightforward set of instruments to use is the weighted sum of neighbor characteristics P k6=jw jk X k. Finally, one of the main drawbacks when estimating a standard reaction function is that the only information one can get is if tax multipliers interact strategically. No information on the channel through which tax interactions happen can be extracted from it. A positive could re ect either tax competition for mobile tax base, or public spillover, or a tax mimicking behavior. 26 Taxable income per capita as an omitted variable? Since the e ect of neighbor taxes on own taxes can be the result of a direct e ect - either because of spillover e ects or of a tax mimicking behavior of communes - and of an indirect e ect - through a change in the tax base -, the estimation of equation (6) might be a ected by an omitted variable bias. 27 To get further intuition about this possibility, the estimated coe cient can be split into two components as follows: = 1 + 2 where 1 is the direct e ect of neighbor tax rates on own tax rate and 2 is the indirect e ect. In order to estimate the value of 1, the following equation has to be estimated: P t j = 1 w jk t k + 'y j + X j + " j (7) k6=j where y j is the taxable income per capita in commune j. Intuitively, adding the taxable income as a regressor at the same time as the neighbor tax multiplier should diminish the 25 See Kelejian and Prucha (1998) for a formal proof of this argument. 26 Revelli (2002) explicitly mentions all these e ects but doesn t test for their independent relevance in the tax interactions mechanism. 27 Carlsen et al. (2004) have done the same kind of correction for omitted variables in their estimation of the determinants of infrastructure fee in Norway. 15

estimated size of the coe cient on neighbor tax multiplier - b 1 < b - if strategic interactions are expected to depend on indirect tax base e ects. As already emphasized, the communal tax base (taxable income per taxpayer) is also endogenous. To take it into account, I use the same set of instruments as those used for the neighbor tax rate. Indeed, as the tax base depends on the residential decision of individuals, the neighbor communes characteristics seem to be appropriate instruments to explain individuals choice. Results for the tax setting equation Table 5 summarizes the results of estimating equations (6) and (7). 28 Contrary to the problem encountered with nding the appropriate set of instruments for the ratio of the median to mean income, the instruments used both for the weighted neighbor tax rate and for the taxable income per taxpayer are valid. Furthermore, for any estimation, I could not reject the null that the instruments are uncorrelated with the error term of the second stage estimation. To assert the utility of instrumenting for neighbor tax multipliers, I computed a Hausman test. 29 For any speci cation, I could not reject the null that OLS yield consistent estimates, i.e. endogeneity among the regressors has no deleterious e ect on OLS estimates. Given these results, OLS and 2SLS are both consistent but OLS is more e cient. 30 To start the results interpretation, it is worth noting that the overall t of the estimations is good, the R-squared are quite high ranging 58% to 66%. The result of main interest is that the coe cient on the weighted neighbor tax multiplier is always very signi cant: an increase [decrease] in neighbors tax multipliers leads to an increase [decrease] of own tax multiplier. This result seems to con rm that tax interactions are quite high in Switzerland, thus omitting to take into account spatial lags may strongly bias the estimations. In columns [6] and [7] - not including taxable income per capita as a regressor - the coe cient b is close to 1. However, looking at columns [8] and [9] - including taxable income per capita as a regressor - the coe cient b 1 is less than 1 and very close to the one estimated by Heyndels 28 All these estimations have also been done by explicitly taking into account the ratio median to mean income but - due to the absence of signi cativity of this variable and to the problems encounted to nd valid instruments - I do not report the results here. 29 Under H 0: b OLS and b 2SLS are both consistent but b OLS is more e cient, the Hausman test is given by: ( b 2SLS OLS b ) 0 [var( b 2SLS ) var( b OLS )]( b 2SLS OLS b )! d 2 m where m is the number of instrumented variables. 30 I also tested for the possible endogeneity of communal characteristics and I was never able to reject the null of exogeneity. 16

Table 5: Spatial interaction tax setting estimations, 2000 [6] OLS [7] 2SLS [8] OLS [9] 2SLS [10] 3SLS a [11] OLS [12] 2SLS Neighbor tax 0.872*** 0.880*** 0.550*** 0.696*** 0.612*** 0.927*** 0.947*** [9.66] [9.55] [5.67] [5.10] [4.76] [8.92] [8.69] Taxable income -0.005*** -0.003* -0.003* [6.31] [1.85] [1.82] Age 6 7 12.672 12.847-6.95 1.453 13.84 8.538 8.773 [0.49] [0.50] [0.29] [0.06] [0.62] [0.32] [0.34] 7 < age 6 15 51.427** 51.216** 38.295 43.391* 32.592* 47.041* 46.839* [2.03] [2.06] [1.60] [1.84] [1.68] [1.82] [1.87] Age > 65-31.429* -31.814* -46.035*** -40.607** -17.342-31.622* -32.320* [1.77] [1.84] [2.84] [2.39] [1.15] [1.75] [1.85] Foreigner 1.129 1.698 13.916 9.538-3.56 5.97 6.99 [0.08] [0.12] [1.16] [0.74] [0.34] [0.45] [0.54] Primary 37.878*** 37.788*** 24.814*** 30.076*** 27.889*** 35.612*** 35.568*** [4.45] [4.52] [3.32] [3.53] [3.57] [4.07] [4.19] Public 28.187 27.894 36.861** 32.799* 18.914 31.127 30.720* [1.53] [1.53] [2.09] [1.84] [1.47] [1.64] [1.66] Unemployed 43.633 43.309 26.609 33.134 27.561 33.576 32.777 [1.01] [1.02] [0.70] [0.86] [0.88] [0.78] [0.79] Owner -24.382*** -24.162*** -4.097-12.14-26.172*** -23.517*** -23.173*** [3.38] [3.35] [0.60] [1.36] [3.03] [3.30] [3.29] Density 0.331*** 0.331*** 0.394*** 0.368*** 0.282*** 0.381*** 0.382*** [3.26] [3.33] [4.09] [3.84] [2.64] [3.61] [3.71] Acreage 0.002*** 0.002*** 0.002*** 0.002*** 0.001** 0.002*** 0.002*** [3.22] [3.27] [3.22] [3.27] [2.07] [3.35] [3.46] Altitude 0.009 0.008 0.006 0.007 0.009* 0.012* 0.012* [1.59] [1.60] [1.10] [1.30] [1.76] [1.79] [1.86] Industry -1.377*** -1.385*** -1.535*** -1.482*** -1.432*** -1.351*** -1.368*** [3.27] [3.35] [4.20] [3.88] [4.15] [3.00] [3.11] Agriculture 0.046 0.045 0.086** 0.068* 0.067* 0.054 0.054 [1.16] [1.16] [2.47] [1.84] [1.82] [1.20] [1.24] Lake -6.308*** -6.300*** -4.670*** -5.335*** -5.853*** -6.657*** -6.616*** [3.75] [3.83] [3.17] [3.50] [3.20] [2.93] [3.00] R-squared 0.58 0.66 0.59 Neighbor control no no no no no yes yes Robust t-statistics in brackets, Constants not reported 17 * signi cant at 10%; ** signi cant at 5%, *** signi cant at 1%, a jointly estimated with column [15] table 6

and Vuchelen (1998) for Belgian municipal taxes. 31 Thus, as ( b b 1 ) > 0, one part of the positive interaction between communal tax rates can be explained by a tax base e ect. As expected, the taxable income per taxpayer has a signi cant negative impact on the tax multiplier set by a commune. Among the communal characteristics having a signi cant in uence on the tax multiplier, I nd that the share of young between 8 and 15 has a signi cant positive impact on the tax multiplier. This result con rms that the nancing of primary schools 32 weighs on the budget constraint of the communes and hence compels them to increase their revenues by setting higher taxes. The share of people over 64 has surprisingly a negative impact on tax multiplier. Generally, we would expect citizens over 64 to be on average less mobile than the younger and to need more public services such as health care. However, this negative impact could be explained by older people being also more conservative and hence favoring lower taxes at the booths. I also nd that the share of primary sector workers has a positive impact on tax multipliers. This result is quite in line with the theoretical predictions, as primary sector employees are on average less mobile than secondary or tertiary sector employees. A high owner share in a commune has a signi cant negative impact on its tax multiplier if the taxable income per taxpayer is not included among the explanatory variables. In columns [8] and [9], the coe cient on the owner share is no longer signi cant which may mean that owners are on average richer than renters. Finally, the signi cant positive impact of the share of public employees is certainly associated with a generally more left oriented position of public employees. Relying on the median voter theorem, a high share of foreigners - without voting rights and earning low incomes - should decrease the level of taxes. In the case of Switzerland, since the foreigners are on average either very high or very low skilled, their impact on the income distribution is not clear cut which may explain that the foreigners share has no signi cant in uence on the communal tax multipliers. 33 Among the signi cant geographical variables, the higher the communal density and 31 Looking at literature testing for spatial interaction in the setting of AFDC bene ts or of public spending, Case et al. (1993), Saavedra (1999), Figlio et al. (1999), Baiker (2005) also nd some strong signi cant evidence of positive interaction between own and neighbor bene ts levels. 32 As parents have not the choice to which public primary school to send their children, the share of children between 7 and 15 can be considered as highly correlated with the observed share of primary schoolchildren in a commune. 33 Robustness checks have been done by disentangling the foreigners share according to their level of education and I found that the more numerous the immigrants with a low (high) quali cation, the higher (lower) the taxes. These results contradict a priori the predictions of Razin et al. (2002). 18

size, the higher the tax rate. The result associated with density is not surprising as the communes with a high density are mostly urban and have proportionally more to spend for social security, culture and infrastructure than the communes with a lower density. The communes lying at the side of a lake set signi cantly lower taxes than the other. The lake dummy may capture some unobservable feature such as a high property value - entry barrier - which prevent poorer household to settle in such a commune. Finally, the share of industrial area in a commune has a very signi cant negative in uence on its tax multiplier. One explanation could rely on the complementarity between the taxes collected: for a given level of communal expenditures, if the corporate income taxes collected are high, the personal income taxes can be set at a lower level. Do tax multipliers induce a tax base e ect? Once taxable income per taxpayer is taken into account in the strategic modeling of tax rates, the coe cient on the weighted neighbor tax rate is signi cantly lowered. This result may be explained by taxes in uencing the taxable income available for a commune. 34 One possible way to assess whether tax base e ects are a signi cant channel through which tax interactions occur is to estimate the two following parametric equations: P t j = 1 w jk t k + 'y j + X 1j 1 + " 1j (8) k6=j y j = t j + P w jk t k + X 2j 2 + P jk X k + " 2j (9) k6=j k6=jw where X :j is a vector of commune j characteristics and X k are neighboring - competing - communes k characteristics. These two equations correspond to the empirical counterparts of equations (3) and (2) given by the theoretical model. In the estimation of both equations, endogeneity is again present. In equation (8) the tax base and the weighted neighbor tax rate are endogenous and in equation (9) the own tax rate is endogenous. To estimate these equations, I can either use 2SLS or 3SLS. Contrary to 2SLS, 3SLS estimate both equations jointly by taking into account that the errors between the two equations might be correlated. The 3SLS are constructed according to a three steps procedure. In the rst step, the coe cients of both equations are estimated by 2SLS. In the second step, a matrix of variancescovariances is constructed by using the residuals obtained from the rst stage estimation. 34 As similar assumption has been used by Buettner (2003) for estimating the determinants of the local capital tax base in Germany. 19

The last step consists of reestimating the two equations by GLS by using the matrix of variances-covariances computed in the second step and the instrumented values of the endogenous variables. The main advantage of using 3SLS is an increase in the e ciency of the estimations, but it comes at a cost: as it relies on the consistency of the variances-covariances matrix, if one equation in the system is misspeci ed, all the coe cients estimates will be inconsistent. Results for the tax base equation Table 6 summarizes the results obtained for the estimation of equation (9). It has to be emphasized that column [15] was estimated jointly with column [10] of table 5. The communal tax multiplier has, as expected, a signi cant negative e ect on the communal taxable income per taxpayer but this e ect is rather small. Indeed, for a given commune, a decrease of 10% in its tax multiplier will, on average, increase its annual taxable income per taxpayer by about 5 000 Frs. But, at the same time, since the tax rate applied to the taxable income has decreased for every taxpayer, the nal impact of this change is ambiguous. 35 The only possible bene t a commune can obtain by lowering its tax multiplier is to attract individuals with above average taxable income who will also pay proportionally more taxes due to the underlying progressive tax schedule. As an additional result, I nd that the weighted neighbor tax multiplier has also a signi cant negative e ect on the taxable income per taxpayer. How can it be explained? If an individual chooses rst the region where to locate and then a speci c commune within this region, it may explain this simultaneous negative e ect of own and neighbor taxes on the communal taxable income per taxpayer. Another explanation could be found when looking at the robustness checks reported in section ve. Indeed, when rent is added as an explanatory variable in the taxable income equation, the neighbor taxes loose their signi cance. As neighbor taxes and rents are highly negatively correlated - 0:72 - it may be that neighbor taxes have captured the rent e ect. Indeed, the higher the rents in a commune, the less the poorer individuals are able to reside in such a commune, thus the higher the taxable income per taxpayer. Among the other signi cant results, the lower the industrial area in a commune and the higher its neighbor density, the higher its taxable income per taxpayer. This result can well be explained by high income individuals rather setting in nice area not too industrialized 35 For the commune of Aigle with initial monthly taxable income per taxpayer of 4 240 Frs and tax multiplier 100, a 10% decrease in its tax multiplier while increasing its monthly taxable income per taxpayer by about 450 Frs, lowers - ceteris paribus - its annual tax revenue by 24 Frs per taxpayer, which is negligible. 20

Table 6: Taxable income per taxpayer estimations, 2000 [13] OLS [14] 2SLS [15] 3SLS a Tax -36.780*** -44.907*** -63.942*** [5.55] [4.21] [6.47] Neighbor tax -55.066*** -47.341*** -29.171** [8.18] [3.42] [2.48] Density -10.082-6.121 3.935 [1.47] [0.80] [0.40] Altitude -0.537* -0.472-0.376 [1.68] [1.30] [0.83] Industry -85.022*** -94.287*** -114.698*** [4.43] [3.83] [3.74] Agriculture 2.25 2.425 4.144 [0.75] [0.84] [1.30] Lake -50.273-110.429-243.48 [0.24] [0.66] [1.31] N. density 104.251*** 99.043*** 89.099*** [2.88] [2.93] [3.00] N. altitude 4.443*** 4.438*** 4.545*** [4.29] [4.05] [3.55] N. industry 78.283 80.097 69.073 [0.69] [0.71] [0.67] N. agriculture 26.932*** 27.100*** 23.637*** [4.96] [4.95] [3.64] N. lake 211.228 218.553 202.428 R-squared 0.63 [0.69] [0.75] [0.70] Robust t-statistics in brackets, Constants not reported * signi cant at 10%; ** signi cant at 5%, *** signi cant at 1% a jointly estimated with column [10] table 5 21