Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Similar documents
Week 8: Basic concepts in game theory

Mixed Strategies. Samuel Alizon and Daniel Cownden February 4, 2009

Elements of Economic Analysis II Lecture X: Introduction to Game Theory

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

Game theory and applications: Lecture 1

Week 8: Basic concepts in game theory

Iterated Dominance and Nash Equilibrium

Problem Set 2 - SOLUTIONS

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Player 2 H T T -1,1 1, -1

Math 135: Answers to Practice Problems

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

Mixed Strategies. In the previous chapters we restricted players to using pure strategies and we

Jianfei Shen. School of Economics, The University of New South Wales, Sydney 2052, Australia

Notes for Section: Week 4

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

S 2,2-1, x c C x r, 1 0,0

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2017

Strategy -1- Strategy

Exercises Solutions: Oligopoly

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

MA200.2 Game Theory II, LSE

Solution to Tutorial 1

Solution to Tutorial /2013 Semester I MA4264 Game Theory

Finding Mixed-strategy Nash Equilibria in 2 2 Games ÙÛ

Can we have no Nash Equilibria? Can you have more than one Nash Equilibrium? CS 430: Artificial Intelligence Game Theory II (Nash Equilibria)

The Ohio State University Department of Economics Second Midterm Examination Answers

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

Elements of Economic Analysis II Lecture XI: Oligopoly: Cournot and Bertrand Competition

Introduction to Game Theory

MAT 4250: Lecture 1 Eric Chung

Mixed Strategy Nash Equilibrium. player 2

Version 01 as of May 12, 2014 Author: Steven Sagona To be submitted to PRL (not really)

On Existence of Equilibria. Bayesian Allocation-Mechanisms

The Ohio State University Department of Economics Econ 601 Prof. James Peck Extra Practice Problems Answers (for final)

Continuing game theory: mixed strategy equilibrium (Ch ), optimality (6.9), start on extensive form games (6.10, Sec. C)!

Notes for Section: Week 7

Thursday, March 3

Game Theory: Additional Exercises

Econ 101A Final exam May 14, 2013.

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

Regret Minimization and Security Strategies

6.896 Topics in Algorithmic Game Theory February 10, Lecture 3

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

This is page 5 Printer: Opaq

Lecture 6 Dynamic games with imperfect information

m 11 m 12 Non-Zero Sum Games Matrix Form of Zero-Sum Games R&N Section 17.6

Economics 171: Final Exam

Simon Fraser University Fall Econ 302 D200 Final Exam Solution Instructor: Songzi Du Wednesday December 16, 2015, 8:30 11:30 AM

MATH 4321 Game Theory Solution to Homework Two

CMPSCI 240: Reasoning about Uncertainty

Regret Minimization and Correlated Equilibria

MS&E 246: Lecture 5 Efficiency and fairness. Ramesh Johari

Game Theory and Economics Prof. Dr. Debarshi Das Department of Humanities and Social Sciences Indian Institute of Technology, Guwahati.

Introductory Microeconomics

PAULI MURTO, ANDREY ZHUKOV

Games of Incomplete Information ( 資訊不全賽局 ) Games of Incomplete Information

Introduction to Multi-Agent Programming

Economics 109 Practice Problems 1, Vincent Crawford, Spring 2002

Repeated Games with Perfect Monitoring

Game Theory and Economics Prof. Dr. Debarshi Das Department of Humanities and Social Sciences Indian Institute of Technology, Guwahati

Game Theory Notes: Examples of Games with Dominant Strategy Equilibrium or Nash Equilibrium

Econ 323 Microeconomic Theory. Practice Exam 2 with Solutions

Introduction. Microeconomics II. Dominant Strategies. Definition (Dominant Strategies)

B w x y z a 4,4 3,3 5,1 2,2 b 3,6 2,5 6,-3 1,4 A c -2,0 2,-1 0,0 2,1 d 1,4 1,2 1,1 3,5

Basic Game-Theoretic Concepts. Game in strategic form has following elements. Player set N. (Pure) strategy set for player i, S i.

PhD Qualifier Examination

Notes on Auctions. Theorem 1 In a second price sealed bid auction bidding your valuation is always a weakly dominant strategy.

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Best-Reply Sets. Jonathan Weinstein Washington University in St. Louis. This version: May 2015

Econ 323 Microeconomic Theory. Chapter 10, Question 1

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

10.1 Elimination of strictly dominated strategies

CS 7180: Behavioral Modeling and Decision- making in AI

G5212: Game Theory. Mark Dean. Spring 2017

Bayesian Nash Equilibrium

Econ 101A Final exam May 14, 2013.

Game Theory. Analyzing Games: From Optimality to Equilibrium. Manar Mohaisen Department of EEC Engineering

Econ 618 Simultaneous Move Bayesian Games

Rationalizable Strategies

We examine the impact of risk aversion on bidding behavior in first-price auctions.

Signaling Games. Farhad Ghassemi

In the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S.

(a) Describe the game in plain english and find its equivalent strategic form.

TTIC An Introduction to the Theory of Machine Learning. Learning and Game Theory. Avrim Blum 5/7/18, 5/9/18

CS 798: Homework Assignment 4 (Game Theory)

Noncooperative Oligopoly

A brief introduction to evolutionary game theory

MA300.2 Game Theory 2005, LSE

CSI 445/660 Part 9 (Introduction to Game Theory)

Problem 3,a. ds 1 (s 2 ) ds 2 < 0. = (1+t)

Name. Answers Discussion Final Exam, Econ 171, March, 2012

Optimal selling rules for repeated transactions.

Microeconomic Theory II Preliminary Examination Solutions

ANASH EQUILIBRIUM of a strategic game is an action profile in which every. Strategy Equilibrium

Exercises Solutions: Game Theory

Transcription:

Chapter 10: Mixed strategies Nash equilibria reaction curves and the equality of payoffs theorem Nash equilibrium: The concept of Nash equilibrium can be extended in a natural manner to the mixed strategies introduced in Lecture 5. First we generalize the idea of a best response to a mixed strategy Definition 1. A mixed strategy σ R is a best response for R to some mixed strategy σ C of C if we have σ R P R σ C σ R P R σ C for all σ R. A mixed strategy σ C is a best response for C to some strategy σ R of R if we have σ R P C σ C σ R P C σ C for all σ C We can then extend the definition to Nash equilbrium Definition 2. The mixed strategies σ R σ R are a Nash equilibrium for a two-player game with payoff matrices P R and P C if σ R is a best response to σ C and σ C is a best response to σ R or in other words σ R P R σ C σ R P R σ C for all σ R. σ R P C σ C σ R P C σ C for all σ R Reaction curves: For games with two strategies one can compute the best responses and the Nash equilbria in terms of the reactions curves. To explain the idea let us start with a example Example: Matching Pennies The payoff matrices are given by ( ( 1 1 1 1 P R P 1 1 R 1 1 and let us write the mixed strategies as σ R (p 1 p σ C (q 1 q To find the best response to σ C we compute ( ( ( p 1 1 q σ R P R σ C 1 p 1 1 1 q ( p 1 p p(2q 1 + (1 p(1 2q (2q 1 + p(4q 2 1 ( 2q 1 1 2q

Since we are computing the best response for R to the strategy of C we consider the payoff (2q 1 + p(4q 2 for fixed q and variable p with 0 p 1. This is a linear function of p and the maximum will depend on the slope of this function (here 4q 2 whether it is positive negative or 0. Best response for R: 4q 2 > 0 (or q < 1/2 The slope is positive so the maximum is at p 1. 4q 2 < 0 (or q > 1/2 The slope is negative so the maximum is at p 0 4q 2 0 (or q 1/2 The slope is 0 so the maximum is at any p between 0 and 1. To find the best response for C we compute ( ( ( p 1 1 q σ R P R σ C 1 p 1 1 1 q p(1 2q + (1 p(2q 1 (2p 1 + q(2 4p ( p 1 p ( 1 2q 2q 1 which we now consider has a function of the variable q and for fixed p. By maximizing over 0 q 1 we find Best response for C: 2 4p > 0 (or p > 1/2 The slope is positive so the maximum is at q 1. 2 4p < 0 (or p < 1/2 The slope is negative so the maximum is at q 0 2 4p 0 (or p 1/2 The slope is 0 so the maximum is at any q between 0 and 1. To find the Nash equilibria we argue as follows. If q > 1/2 then the best response is p 0 but the best response to p 0 is q 0 which contradicts q > 1/2 and this does not lead to a Nash equilbrium. If q < 1/2 then the best response is p 1 but the best response to p 1 is q 1 and again this does not lead to a Nash equilbrium. If q 1/2 then the best response is any p and so if we choose p 1/2 then the best response to p 1/2 is any q in particular q 1/2. 2

Figure 1: Reaction curves for the matching pennies game The conclusion is that there exists one Nash equilibrium σ R (1/2 1/2 σ C (1/2 1/2 A convenient way to put all this information together and to find the Nash equilibria in a a graphical manner is to draw the reaction curves which exhibit the best responses. For the matching pennies they are show in figure?? Example: Battle of the sexes. Writing the payoff as σ R (p 1 p σ C (q 1 q we have for the best response from R to C ( ( ( ( ( p 1 3 q p 3 2q σ R P R σ C 1 p 2 0 1 q 1 p 2q p(3 2q + (1 p2q 2q + p(3 4q (1 Since the fame is symmetric we obtain the payoff for C by exchanging p and q We obtain Best response for R: σ R P C σ C 2p + q(3 4p 3 4q > 0 (or q < 3/4 The slope is positive so the maximum is at p 1. 3 4q < 0 (or q > 3/4 The slope is negative so the maximum is at p 0 3 4q 0 (or q 3/4 The slope is 0 so the maximum is at any p between 0 and 1. and Best response for R: 3 4p > 0 (or p < 3/4 The slope is positive so the maximum is at q 1. 3

Figure 2: Reaction curves for the battle of the sexes game 3 4p < 0 (or p > 3/4 The slope is negative so the maximum is at q 0 3 4p 0 (or p 3/4 The slope is 0 so the maximum is at any q between 0 and 1. The best response curves are given in Figure?? The equality of payoffs theorem The method used above to compute the Nash equilibria works well if there are 2 strategies but is not very useful if three or more strategies are used since the optimization problems become much more complicated. We present here a method which in principle allows to compute all Nash equilibria of a game. But the reader should be warned that computations become quickly quite lengthy and involved. Some games do not have a Nash equilibrium in pure strategies (like rock-paper-scissors or matching pennies but there is always one (and often many if we consider mixed strategies. Theorem 3. (Nash Theorem A game (with a finite number of strategies always has at least one Nash equilibrium ( σ R σ C in mixed strategies. Many interesting examples of games are symmetric. Theorem 4. (Nash Theorem for symmetric games For a symmetric game we have ( σ R σ C is a NE ( σ C σ R is a NE Moroever there always exists at least one symmetric NE ( σ R σ C ( σ σ The computation of Nash equilibria is based on the simple observation. 4

Theorem 5. (Equality of payoff theorems Suppose σ R is a best response to σ C. Then we have 1. If the strategy i and j are played with positive probability for R (that is we have σ R (i > 0 and σ R (j > 0 then the payoff to play i and j against σ C are identical. σ R (i > 0 and σ R (j > 0 P R σ C (i P R σ C (j 2. If the strategy i is played with positive probability (that is we have σ R (i > 0 but the strategy j is played with probability 0 (that is σ R (j 0 then the payoff for R to play j against σ C is less than or equal to the payoff to play i against σ C. σ R (i > 0 and σ R (j 0 P R σ C (i P R σ C (j Proof. (i It is best to argue by contradiction. Suppose that the strategies i and j are played with positive probability for R (that is we have σ R (i > 0 and σ R (j > 0 but the payoffs P R σ C (i and P R σ C (j are not equal. Let us say for example that we have P R σ C (j > P R σ C (i. Then we argue that σ R cannot be a best response to σ C : since it it more favorable to play j than to play i if you choose a new mixed strategy where you play j with greater probability than σ R (i and j with a smaller probability that σ R (j and all the other strategies with the unchanged probabilities then your payoff in this new strategy against σ C will strictly increase. So σ R was not a best response. (ii Argue again by contradiction. Suppose that σ R (i > 0 and σ R (j 0 but that P R σ C (j > P R σ C (i. Then σ R cannot be a best response to σ C. To see this change your strategy σ R into a new strategy where you play now j with positive probability and i with probability 0. In this new strategy the payoff against σ C is greater than for σ R and so σ R was not a best response. It is useful to give a name for the strategies which are played with positive probabilities: Definition 6. The support of a mixed strategies σ is the set of pure strategies which are played with positive probability. We denote the support of σ by S(σ: S(σ {i : σ(i > 0} For example if σ (1/7 2/7 0 0 4/7 then S(σ {1 2 5} that is the mixed strategy σ the strategies played with positive probability are 1 2 and 5. 5

Using the equality of payoff theorem we can devise a method to compute all Nash equilibria: Algorithm to compute Nash equilibria Pick a support for both σ R and σ C. Compute the payoff for R i.e. compute the vector P R σ C and set the payoffs equal to each other if they are in the support of σ R. Solve the equation for σ C. Do the same for the payoff for C i.e. compute P T C σ R and set the payoffs equal to each other if they are in the support of σ C. Solve the equation for σ R. Check that the strategies which are not in the support of σ R and σ C do not offer a greater payoff. Pick another support for both σ R and σ C and repeat the exercise. At each step one tries to solve a system of linear equations which is not easy but the complexity of the algorithm grows very quickly since one needs to check all possible supports. For example if R has 2 strategies and C has 3 strategies there are 3 choices of support for R and 7 for C... It is a bit less tedious to compute symmetric Nash equilibria since one can use the symmetry of the problem. Algorithm to compute symmetric Nash equilibria ( σ σ Pick a support σ. Set all the payoff P R σ(i equal for i S(σ and try to solve for σ. Check that for j / S(σ the payoff P R σ(j is smaller than the payoff for i S(σ. Pick another support for σ and repeat. Let us illustrate the techniques with a number of examples: Example: Consider the game with payoff matrices ( 2 3 P R P 3 5 C ( 1 3 2 1 We try to find a NE in mixed strategies. We denote σ C (p 1 p and σ R (q 1q. The payoff for R are given by ( ( ( 2 3 p 3 5p P R σ C 3 5 1 p 8p 5 6

while the payoff for C are given by ( 1 2 PC T σ R 3 1 Setting the payoff for R to be equal we find ( q 1 q ( 2 q 1 + 2q 3 5p 8p 5 p 8/13 while setting the payoff for C to be equal gives 2 q 1 + 2q q 1/3 So we have a NE ( σ R σ C ((1/3 2/3 (8/13 5/13. Example: Symmetric 2-strategies game Consider the symmetric game with payoff matrices ( ( a b a c P R P c d C b d We already know that the game as 1 pure strategy NE is a > c b > d or a < c b < d and 2 pure strategy NE if a > c b < d or a < c b > d. To find when this game as a mixed Nash equilibrium we use the equality of payoff theorem. Denote σ C (p 1 p the strategy for C then the payoff for R is ( a b c d ( p 1 p Setting the payoff to be equal we find or So we find ( b + p(a b d + p(c d b + p(a b d + p(c d p[(a c + (d b] d b. p (d b (a c + (d b. Since the game is symmetric if we denote that strategy for R by σ R (q 1 q the payoff for C are ( b + q(a b d + q(c d and equating them gives the same solution q. Note that p and q should be (a c+(d b between 0 and 1 and this occurs only if a > c b < d or a < c b > d. (d b 7

( a b A symmetric two strategies game with matrix P R and P c d C PR T has a mixed strategy NE if a > c b < d or a < c b > d. The NE is symmetric and given by ( σ R σ C ( σ σ with ( (d b σ (a c + (d b (a c (a c + (d b Example: Nash equilibria for Rock-scissors-papers The payoff matrices are 0 1 1 0 1 1 P R 1 0 1 P C 1 0 1 1 1 0 1 1 0 Note that the game is a symmetric one so we should find a symmetric Nash equilibrium. The computation of Nash equilibria goes in several steps. Assume that one of the player use all his three pure strategies for example take σ C (p 1 p 2 1 p 1 p 2. Then the payoffs for R against this mixed strategy are given by P R σ C 0 1 1 1 0 1 1 1 0 p 1 p 2 1 p 1 p 2 We set the payoffs to be equal and find two equation 2p 2 + p 1 1 p 1 p 2 p 2 1/3 1 p 2 2p 1 1 p 1 p 2 p 1 1/3 2p 2 + p 1 1 1 p 2 2p 1 p 1 p 2 so we must have σ C (1/3 1/3 1/3. Since the game is symmetric by reversing the roles of R and C we find then σ R (1/3 1/3 1/3 and we have found a (symmetric NE. We try next to find NE where one player plays only 2 of his strategies say let us pick σ C (p 1 p 0. Then the payoff for R is 0 1 1 p 1 p P R σ C 1 0 1 1 p p 1 1 0 0 0 8

If we try to set the payoff equal to each other we find 1 p p which is impossible or p 0 which gives p 0 or 1 p 0 which gives p 1. In both case it means that C is actually using only one of his strategies and not two as assumed. Note that the game is also symmetric in the strategies and so picking another 2 strategies for any of the player will lead to the same conclusion. Finally assume that C use only a pure strategy in his NE for example his first one (Rock. Then we have 0 1 1 1 0 P R σ C 1 0 1 0 1 1 1 0 0 1 and the best response for R is then to play his third strategy (Paper of course. But conversely Rock for C is not a best response to Paper for R and so this does not give a NE. To conclude we find Unique NE σ R (1/3 1/3 1/3 σ C (1/3 1/3 1/3 Example: NE for coordination game: Let us consider the symmetric game where two players shows 1 2 or 3 fingers and win the number of finger they show if both player show the same number of fingers and lose the number of finger they show if the numbers of fingers do not match. This game is a coordination game since it is advantageous for a player to play the same strategy as his opponent. Coordination have typically many Nash equilibria. The payoff matrices are given by P R 1 1 1 2 2 2 3 3 3 P C 1 2 3 1 2 3 1 2 3 The game is symmetric and we shall use this fact to simplify the computations. computation of Nash equilibria goes in several steps. Assume player C use all his three strategies for example σ C (p 1 p 2 1 p 1 p 2. Then the payoffs against this strategies are given by 1 1 1 p 1 2p 1 1 P R σ C 2 2 2 p 2 4p 2 2 3 3 3 1 p 1 p 2 3 6p 1 6p 2 9 The

We set the payoffs to be equal and find or 2p 1 1 3 6p 1 6p 2 4p 2 2 3 6p 1 6p 2 8p 1 + 6p 2 4 6p 1 + 10p 2 5 and after some algebra we find σ C (5/22 8/22 9/22. Since the game is symmetric reversing the role of R and C we also find σ R (5/22 8/22 9/22. Let us assume that C uses only his first two strategies (one finger and two fingers σ C (p 1 p 0. Then we have 1 1 1 p 2p 1 P R σ C 2 2 2 1 p 2 4p 3 3 3 0 0 If we set the payoffs to be equal we find 2p 1 2 4p or p 1/2. For that choice the payoffs are then (0 0 0. So we can assume that R is playing is first two strategies with positive probability. His third strategy yields a payoff which is not bigger. But now if R use his first two strategies using the symmetry of the game we find the same result for C. So we obtain a Nash equilibrium σ R (1/2 1/2 0 σ C (1/2 1/2 0. Let us assume that C uses only his first and third strategies σ C (p 1 p 0. Then we find P R σ C (2p 1 0 3 6p and arguing as in the previous case we find a Nash equilibrium σ R (1/2 0 1/2 σ C (1/2 0 1/2. Let us assume that C uses only his second and third strategies then we obtain in a similar way σ R (0 1/2 1/2 σ C (0 1/2 1/2. Finally let us assume that C plays a pure strategy say 1. Then we have 1 1 1 1 1 P R σ C 2 2 2 2 3 3 3 0 3 and so the best response for R is to play 1. By symmetry we find the Nash equilibrium σ R (1 0 0 σ C (1 0 0. One argues similarly with playing pure strategies 2 and 3 and one finds two more Nash equilbrium in pure strategies. To summarize we have found 7 Nash equilibria all of them symmetric i.e. we have ( σ R σ C ( σ σ with σ (5/22 8/22 9/22 σ (1/2 1/2 0 σ (1/2 0 1/2 σ (0 1/2 1/2 σ (1 0 0 σ (0 1 0 σ (0 0 1. 10