Quantal Response Equilibrium with Non-Monotone Probabilities: A Dynamic Approach

Similar documents
Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

PhD Qualifier Examination

Evolution & Learning in Games

Calibration of Interest Rates

Measuring the Benefits from Futures Markets: Conceptual Issues

LEARNING AND NOISY EQUILIBRIUM BEHAVIOR

A Preference Foundation for Fehr and Schmidt s Model. of Inequity Aversion 1

All Equilibrium Revenues in Buy Price Auctions

Supplementary Material for: Belief Updating in Sequential Games of Two-Sided Incomplete Information: An Experimental Study of a Crisis Bargaining

ANASH EQUILIBRIUM of a strategic game is an action profile in which every. Strategy Equilibrium

Answer Key for M. A. Economics Entrance Examination 2017 (Main version)

American Option Pricing Formula for Uncertain Financial Market

STOCHASTIC CALCULUS AND BLACK-SCHOLES MODEL

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

ISSN BWPEF Uninformative Equilibrium in Uniform Price Auctions. Arup Daripa Birkbeck, University of London.

Bidding By Empirical Bayesians In Sealed Bid First Price Auctions

Option Pricing Formula for Fuzzy Financial Market

Barro-Gordon Revisited: Reputational Equilibria with Inferential Expectations

On the evolution of probability-weighting function and its impact on gambling

Repeated Games with Perfect Monitoring

Altruism and Noisy Behavior in One-Shot Public Goods Experiments

Copyright (C) 2001 David K. Levine This document is an open textbook; you can redistribute it and/or modify it under the terms of version 1 of the

ECON 803: MICROECONOMIC THEORY II Arthur J. Robson Fall 2016 Assignment 9 (due in class on November 22)

Game Theory Notes: Examples of Games with Dominant Strategy Equilibrium or Nash Equilibrium

Mean Variance Analysis and CAPM

Ideal Bootstrapping and Exact Recombination: Applications to Auction Experiments

UCLA Department of Economics Ph.D. Preliminary Exam Industrial Organization Field Exam (Spring 2010) Use SEPARATE booklets to answer each question

Game-Theoretic Approach to Bank Loan Repayment. Andrzej Paliński

Replicator Dynamics 1

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 19 11/20/2013. Applications of Ito calculus to finance

Appendix: Common Currencies vs. Monetary Independence

Lecture 3. Sergei Fedotov Introduction to Financial Mathematics. Sergei Fedotov (University of Manchester) / 6

1 The continuous time limit

Department of Agricultural Economics PhD Qualifier Examination January 2005

Chapter 3 PREFERENCES AND UTILITY. Copyright 2005 by South-Western, a division of Thomson Learning. All rights reserved.

Behavioral Finance and Asset Pricing

Stochastic Games and Bayesian Games

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2017

Game Theory: Normal Form Games

Solutions of Bimatrix Coalitional Games

King s College London

Preferences and Utility

Long run equilibria in an asymmetric oligopoly

Exercise Chapter 10

Econ 8602, Fall 2017 Homework 2

G5212: Game Theory. Mark Dean. Spring 2017

Elements of Economic Analysis II Lecture X: Introduction to Game Theory

An Adaptive Learning Model in Coordination Games

Local vs Non-local Forward Equations for Option Pricing

Revenue Equivalence and Income Taxation

Pricing Dynamic Solvency Insurance and Investment Fund Protection

MAS3904/MAS8904 Stochastic Financial Modelling

Sequential Rationality and Weak Perfect Bayesian Equilibrium

Mixed Strategies. Samuel Alizon and Daniel Cownden February 4, 2009

Portfolio Selection with Quadratic Utility Revisited

PAULI MURTO, ANDREY ZHUKOV. If any mistakes or typos are spotted, kindly communicate them to

AUCTIONEER ESTIMATES AND CREDULOUS BUYERS REVISITED. November Preliminary, comments welcome.

Strategy -1- Strategy

Modelling the Term Structure of Hong Kong Inter-Bank Offered Rates (HIBOR)

G5212: Game Theory. Mark Dean. Spring 2017

Department of Agricultural Economics. PhD Qualifier Examination. August 2010

PAULI MURTO, ANDREY ZHUKOV

INCENTIVES IN PUBLIC GOODS EXPERIMENTS: IMPLICATIONS FOR THE ENVIRONMENT

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2017

The test has 13 questions. Answer any four. All questions carry equal (25) marks.

Parallel Accommodating Conduct: Evaluating the Performance of the CPPI Index

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Comparing Allocations under Asymmetric Information: Coase Theorem Revisited

Advanced Stochastic Processes.

Chair of Communications Theory, Prof. Dr.-Ing. E. Jorswieck. Übung 5: Supermodular Games

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

Best response cycles in perfect information games

On Forchheimer s Model of Dominant Firm Price Leadership

Trading Complex Risks

What are the additional assumptions that must be satisfied for Rabin s theorem to hold?

On Existence of Equilibria. Bayesian Allocation-Mechanisms

Elements of Economic Analysis II Lecture XI: Oligopoly: Cournot and Bertrand Competition

1.12 Exercises EXERCISES Use integration by parts to compute. ln(x) dx. 2. Compute 1 x ln(x) dx. Hint: Use the substitution u = ln(x).

Characterization of the Optimum

SIMPLE AGENTS, INTELLIGENT MARKETS*

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

Behavioral Equilibrium and Evolutionary Dynamics

Game theory and applications: Lecture 1

A Note about the Black-Scholes Option Pricing Model under Time-Varying Conditions Yi-rong YING and Meng-meng BAI

Mixed Strategies. In the previous chapters we restricted players to using pure strategies and we

Social preferences I and II

On Effects of Asymmetric Information on Non-Life Insurance Prices under Competition

Microeconomic Theory May 2013 Applied Economics. Ph.D. PRELIMINARY EXAMINATION MICROECONOMIC THEORY. Applied Economics Graduate Program.

Random Search Techniques for Optimal Bidding in Auction Markets

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2015

Asymmetric Information: Walrasian Equilibria, and Rational Expectations Equilibria

Are stylized facts irrelevant in option-pricing?

Valuation of Standard Options under the Constant Elasticity of Variance Model

On the evolution from barter to fiat money

Outline. Simple, Compound, and Reduced Lotteries Independence Axiom Expected Utility Theory Money Lotteries Risk Aversion

Price Setting with Interdependent Values

A Model of Noisy Introspection

An Introduction to Experimental Economics and Insurance Experiments. J. Todd Swarthout

Comparative Risk Sensitivity with Reference-Dependent Preferences

Unraveling versus Unraveling: A Memo on Competitive Equilibriums and Trade in Insurance Markets

Transcription:

Quantal Response Equilibrium with Non-Monotone Probabilities: A Dynamic Approach Suren Basov 1 Department of Economics, University of Melbourne Abstract In this paper I will give an example of a population game and of a locally improving stochastic learning process such that the quantal response equilibrium assigns to the different strategies the probabilities that are non-monotone in the payoffs. Moreover, if the initial state probabilities are payoff-monotone, then the learning can be shown the shrink mistakes in one direction and exacerbate them in the other direction. Keywords: quantal response equilibrium, dynamic probabilistic choice models JEL classification numbers: C7 1 Suren Basov, Department of Economics, The University of Melbourne, Victoria 3010, Australia. e-mail: s.basov@cupid.ecom.unimelb.edu.au Tel. (61 3)-8344-7154, Fax (61 3)-8344-6899 1

1 INTRODUCTION Past decades have witnessed growing empirical evidence that calls into question the utility maximization paradigm. For a description of systematic errors made by experimental subjects, see Arkes and Hammond (1986), Hogarth (1980), Kahneman, Slovic, and Tversky (1982), and Nisbett and Ross (1980), and the survey papers by Payne, Bettman, and Johnson (1982), and by Pitz and Sachs (1984). To capture this type of behavior, Luce (1959) introduced a probabilistic choice model, currently known as the logit model. This and similar probabilistic choice models have already found their application in economics. See, for example, McKelvey and Palfrey (1995, 1998), Chen, Friedman, Thisse (1997),, Offerman, Schram, Sonnemans (1998), and Anderson, Goeree, and Halt (1998, 2001). It is typical in this kind of models to impose some intuitive restrictions on the probability density of choices and study the distributions that satisfy such restrictions. One of the most natural assumptions to make is payoff monotonicity: the choice probabilities are increasing in the payoffs. McKelvey and Palfrey (1995) applied the idea of probabilistic choice to develop a new equilibrium concept: quantal response equilibrium. Assumptions made 2

in McKelvey and Palfrey (1995) on the decision process guarantee that the payoff monotonicity will be satisfied for the equilibrium choice probabilities. However, the interpretation of McKelvey and Palfrey (1995) of quantal response equilibrium requires that the players should be able to calculate their expected payoff, possibly with some mistake. The same is true about all other papers cited in the previous paragraph. One might argue that the ability of the players to calculate their expected payoff does not square well with the assumption of bounded rationality, which motivated introduction of probabilistic choice models in the first place. 2 The problem arises because the approach taken in these papers is static. An alternative would be to require that the individuals adjust their choices based on some simple rule. The adjustment rule will give rise to a stochastic process on the choice set. Then the probability distribution of choices of the static approach will correspond to the steady state of this stochastic process. I call this approach dynamic. Unfortunately, the assumptions usually imposed on the probability density of choices need not be satisfied by the steady state density of the dynamic approach even for reasonable adjustment rules. 2 McKelvey and Palfrey (1995) discuss an alternative view that the players are fully rational but have an additive payoff disturbance associated with each pure strategy. Most authors, however, take a view that quantal response is associated with bounded rationality. 3

In this paper I take a dynamic point of view and give an example of a population game and of a locally improving stochastic learning process such that the quantal response equilibrium assigns to the different strategies the probabilities that are non-monotone in the payoffs. Moreover, if the initial state probabilities are payoff-monotone, then the learning can be shown the shrink mistakes in one direction and exacerbate them in the other direction. Basov (2002) argues that the situation described in this example is generic. 2 A POPULATION GAME Assume a population consists of a continuum of individuals. Each individual selects a vector (x, y) R 2. The payoff to the individual is given by u(x, y) = 1 2 (x αx)2 1 2 (y βy )2, (1) where α 6= 1,β6= 1,andX and Y are population means of x and y. Nash Equilibrium: It is straightforward to find the unique Nash equilib- 4

rium in this game. Indeed, the best response of an individual is given by x = αx y = βy. (2) Taking expectations one concludes that X = Y = 0and (2) implies that x = y =0in the Nash equilibrium. Logit Equilibrium: The logit equilibrium is a version of quantal response equilibrium when deviations of the perceived payoffs from the realized ones has the extreme value distribution. The population density of choices is given by f(x, y) = 1 πσ 2 exp( (x αx)2 +(y βy ) 2 σ 2 ). (3) Using it to calculate the population means one gets X = αx Y = βy. (4) Therefore, X = Y =0and the equilibrium choice density is f(x, y) = 1 πσ 2 exp( x2 + y 2 σ 2 ). (5) 5

Note that the logit equilibrium choice is unbiased with respect to the Nash equilibrium choice and the choice probability is an increasing function of the payoff. The equiprobability curves, i.e. the curves along which the probability density of choices is constant, are circumferences with the center at the origin. Figure 1 An equiprobability curve for logit equilibrium. 3 A DYNAMIC MODEL In the previous Section I described a population game and found its Nash and Logit equilibria. Note that both equilibrium notions used in the previous 6

Section are static, since they do not rely on any explicit learning dynamics. In this Section I am going to assume instead that the players adjust their choices gradually. The expected adjustment vector will be equal to the gradient of their utility. I will also assume that the adjustment is subject to a random mistake. I will call the steady state of this model a dynamic quantal response equilibrium. Formally, assume that dx = (x αx)dt + σ 1 dw 1 dy = (y βy )dt + σ 2 dw 2, (6) where W 1 and W 2 are independent standard Wiener processes. Then the probability density of choices satisfies the following partial differential equation (Ito, 1992) f t x ((x αx)f) y ((y αy )f) =σ2 1 2 f 2 x + σ2 2 2 f 2 2 y. (7) 2 The stationary solution of the equation (7) is given by f(x, y) = 1 (x αx)2 exp( πσ 1 σ 2 σ 2 1 (y βy )2 ). (8) σ 2 2 7

Using it to calculate the population means one gets X = αx Y = βy. (9) Therefore, X = Y =0and the equilibrium choice density is f(x, y) = 1 πσ 1 σ 2 exp( x2 σ 2 1 y2 ). (10) σ 2 2 Note that the dynamic quantal response equilibrium choice is unbiased with respect to the Nash equilibrium choice. It is, however, not monotone in payoffs, unless σ 1 = σ 2. To see this geometrically, note the the equiprobability curves are now ellipses. Figure 2 depicts an equiprobability and an indifference curve corresponding to the equilibrium of the model. Figure 2 8

An indifference curve (circumference) and an equiprobability curve (ellipse) in the dynamic quantal response equilibrium. Let point A belong to the part of the ellipse above the circumference, while point B to the part of the circle left of the ellipse, then u(a) >u(b) (since A is above the indifference curve, while B is below it), while f(a) <f(b) (since A is below the equiprobability curve, while B is above it). The above example demonstrates that the dynamic quantal response equilibrium can result in the steady state choice probabilities that are nonmonotone on payoffs. Moreover, suppose that the initial density of choices is 9

given by f(x, y; t =0)= 1 πσ 2 exp( x2 + y 2 σ 2 ). (11) Note that the initial choices are unbiased with respect to the Nash equilibrium choices and their density is payoff monotone. Now assume that the players adjust their choices according to rule (7) with σ 1 and σ 2 such that σ 1 <σ 2 and σ 2 1 + σ 2 2 =2σ 2. Then the equilibrium steady state density will be given by (10). Our assumptions on σ 1 and σ 2 insure that the total variability of the choices in the dynamic quantal response equilibrium remains the same as in the initial state: Var(x; t = )+Var(y; t = ) =Var(x; t =0)+Var(y; t =0). (12) However, Var(x; t = ) <Var(x; t =0),Var(y; t = ) >Var(y; t =0). (13) Therefore, learning shrinks mistakes in one direction, while exacerbating them in the other. If one interprets stochastic terms in (6) as experimentation, this result will imply that learning will exacerbate mistake in direction in 10

which players experiment too much. If one assumes furthermore that people tend to experiment more aggressively in new unfamiliar environments, this results will imply that learning can lower payoffs exactly in the situations when there is a lot to learn. 4 CONCLUSIONS In this paper I have shown by an example that the probability density of a strategy in a dynamic quantal response equilibrium need not be monotone in payoffs, even for a reasonable (locally improving) adjustment rule. In a recent paper Basov (2002) argued that this situation is generic. Moreover, for some initial conditions learning can lead to diminishing of mistakes in one direction and exacerbation in the other. This calls for a critical re-examination of the conclusions obtained from static equilibrium models and calls for an explicit modelling of a dynamic adjustment process 11

REFERENCES Anderson S. P., Goeree, J. K., Holt C. A, 1998, Rent seeking with bounded rationality: An analysis of all-pay auction. Journal of Political Economy, 106, 828-853. Anderson S. P., Goeree, J. K., Holt C. A., 2001, Minimum-effort coordination games: stochastic potential and logit equilibrium. Games and Economic Behavior, 34, 177-199. Arkes, H. R., Hammond, K. R., 1986, Judgment and decision making: an interdisciplinary reader. Cambridge: Cambridge U. Press. Basov, S., 2002, Bounded rationality: Static versus Dynamic Approaches, The University of Melbourne, Department of Economics Working Paper 864. Chen, H. C., Friedman, J. W., Thisse, J. F., 1997, Boundedly rational Nash equilibrium: a probabilistic choice approach, Games and Economic Behavior, 18, 32-54. Hogarth, R.: Judgment and choice: Psychology of decision, 1980, New York: Wiley. Ito, S., 1992, Diffusion equation. American Mathematical Society. Kahneman, D., Slovic P., Tversky A., 1982, Judgment under uncertainty: 12

heuristic and biases. Cambridge: Cambridge U. Press. Luce, R. D., 1959, Individual choice behavior, New York: Wiley. McKelvey, R. D., Palfrey, T. R., 1995, Quantal response equilibria for normal form games. Games and Economic Behavior, 10, 6-38. McKelvey, R. D., Palfrey, T. R., 1998, Quantal response equilibria for extensive form games. Experimental Economics, 1, 9-41. Nisbett, R., Ross L., 1980, Human inference: Strategies and shortcomings in the social judgment. Englewood Cliffs: Prentice-Hall. Offerman,T.,Schram,A.,Sonnemans,J.,1998,Quantalresponsemodelsin step-level public good games. European Journal of Political Economy, 14, 89-100. Payne, J. W., Bettman, J. R., Johnson E. J., 1992, Behavioral decision research: a constructive processing perspective. Annual Review of Psychology 43, 87-131. Pitz, G., Sachs N. J.: Judgment and decision, 1984, Theory and application. Annual Review of Psychology, 35,139-63. 13