m 11 m 12 Non-Zero Sum Games Matrix Form of Zero-Sum Games R&N Section 17.6

Similar documents
Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

CMPSCI 240: Reasoning about Uncertainty

Introduction to Multi-Agent Programming

ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

CMPSCI 240: Reasoning about Uncertainty

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Game Theory: Minimax, Maximin, and Iterated Removal Naima Hammoud

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

Week 8: Basic concepts in game theory

Solution to Tutorial 1

Solution to Tutorial /2013 Semester I MA4264 Game Theory

S 2,2-1, x c C x r, 1 0,0

Stochastic Games and Bayesian Games

Prisoner s Dilemma. CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma. Prisoner s Dilemma. Prisoner s Dilemma.

Game theory and applications: Lecture 1

Elements of Economic Analysis II Lecture X: Introduction to Game Theory

HE+ Economics Nash Equilibrium

An introduction on game theory for wireless networking [1]

Outline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

Solutions of Bimatrix Coalitional Games

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

Stochastic Games and Bayesian Games

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

G5212: Game Theory. Mark Dean. Spring 2017

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

Economics 109 Practice Problems 1, Vincent Crawford, Spring 2002

Iterated Dominance and Nash Equilibrium

MATH 121 GAME THEORY REVIEW

Microeconomics of Banking: Lecture 5

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

Outline for today. Stat155 Game Theory Lecture 19: Price of anarchy. Cooperative games. Price of anarchy. Price of anarchy

MS&E 246: Lecture 5 Efficiency and fairness. Ramesh Johari

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

PAULI MURTO, ANDREY ZHUKOV

preferences of the individual players over these possible outcomes, typically measured by a utility or payoff function.

Week 8: Basic concepts in game theory

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

Game Theory Week 7, Lecture 7

January 26,

CS711: Introduction to Game Theory and Mechanism Design

CS 7180: Behavioral Modeling and Decision- making in AI

Name. Answers Discussion Final Exam, Econ 171, March, 2012

Microeconomics II. CIDE, MsC Economics. List of Problems

CUR 412: Game Theory and its Applications, Lecture 9

ECO 5341 (Section 2) Spring 2016 Midterm March 24th 2016 Total Points: 100

Regret Minimization and Security Strategies

Notes for Section: Week 7

PAULI MURTO, ANDREY ZHUKOV. If any mistakes or typos are spotted, kindly communicate them to

Chapter 8. Repeated Games. Strategies and payoffs for games played twice

CS711 Game Theory and Mechanism Design

Game Theory with Applications to Finance and Marketing, I

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

Their opponent will play intelligently and wishes to maximize their own payoff.

Game Theory Notes: Examples of Games with Dominant Strategy Equilibrium or Nash Equilibrium

MATH 4321 Game Theory Solution to Homework Two

Outline for Dynamic Games of Complete Information

Game Theory - Lecture #8

Prisoner s dilemma with T = 1

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

Using the Maximin Principle

Game Theory Tutorial 3 Answers

So we turn now to many-to-one matching with money, which is generally seen as a model of firms hiring workers

Economics 171: Final Exam

MAT 4250: Lecture 1 Eric Chung

Epistemic Game Theory

Problem 3 Solutions. l 3 r, 1

LECTURE 4: MULTIAGENT INTERACTIONS

Outline. Risk and Decision Analysis 5. Game Theory. What is game theory? Outline. Scope of game theory. Two-person zero sum games

Introduction to Game Theory

HW Consider the following game:

Solution Problem Set 2

Game Theory and Economics Prof. Dr. Debarshi Das Department of Humanities and Social Sciences Indian Institute of Technology, Guwahati

Static Games and Cournot. Competition

Game theory for. Leonardo Badia.

ORF 307: Lecture 12. Linear Programming: Chapter 11: Game Theory

Chapter 2 Discrete Static Games

Mixed Strategies. Samuel Alizon and Daniel Cownden February 4, 2009

MA300.2 Game Theory 2005, LSE

G5212: Game Theory. Mark Dean. Spring 2017

Games of Incomplete Information

Econ 323 Microeconomic Theory. Practice Exam 2 with Solutions

Rationalizable Strategies

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2017

1. better to stick. 2. better to switch. 3. or does your second choice make no difference?

Econ 323 Microeconomic Theory. Chapter 10, Question 1

TPPE24 Ekonomisk Analys:

Not 0,4 2,1. i. Show there is a perfect Bayesian equilibrium where player A chooses to play, player A chooses L, and player B chooses L.

Name. FINAL EXAM, Econ 171, March, 2015

LINEAR PROGRAMMING. Homework 7

MA200.2 Game Theory II, LSE

Player 2 H T T -1,1 1, -1

Finding Mixed-strategy Nash Equilibria in 2 2 Games ÙÛ

Advanced Microeconomics II Game Theory Fall

TUFTS UNIVERSITY DEPARTMENT OF CIVIL AND ENVIRONMENTAL ENGINEERING ES 152 ENGINEERING SYSTEMS Spring Lesson 16 Introduction to Game Theory

Game Theory: Normal Form Games

Transcription:

Non-Zero Sum Games R&N Section 17.6 Matrix Form of Zero-Sum Games m 11 m 12 m 21 m 22 m ij = Player A s payoff if Player A follows pure strategy i and Player B follows pure strategy j 1

Results so far 2 players, perfect information, zero-sum: The game has always a pure strategy solution given by the minimax procedure Max Min m Rows i Columns 2 players, perfect information, zero-sum: The game has always a mixed strategy solution given by the minimax procedure max min( p m11 + (1 p) m21, p m12 + (1 p) m22) p j ij 2 players, perfect information, zero-sum:??????? Prisoner s Dilemma Two persons (A and B) are arrested with enough evidence for a minor crime, but not enough for a major crime. If they both confess to the crime, they each know that they will serve 5 years in prison. If only one of them testify, he will go free and the other prisoner will serve 10 years. If neither of them confess, they ll each spend 1 year in prison 2

Matrix normal form for non-zerosum games Player B Player A Testify Refuse Testify -5,-5-10,0 Refuse 0,-10-1,-1 Matrix normal form for non-zerosum games Player B Player A Testify Refuse Testify -5,-5-10,0 Player A s payoff for the pair of strategies A:Testify, B: Testify Refuse 0,-10-1,-1 Player B s payoff for the pair of strategies A: Testify, B: Refuse 3

Why this example? Although simple, this example models a huge variety of situations in which participants have similar rewards as in this game. Joint work: Two persons are working on a project. Each person can choose to either work hard or rest. If A works hard then prefers to rest, but the outcome of both working is preferable to the outcome of both resting (the project does not get done). Duopoly: Two firms compete for producing the same product and they both try to maximize profit. They can set two prices, High and Low. If both firms choose High, they both make a profit of $1000. If they both choose Low, they both make a lower profit of $600. Otherwise, the High firm makes a profit of $1200 and the Low firm takes a loss of $200. Arms race, Robot detection, Use of common property Matrix normal form for non-zerosum games Player A Testify Refuse Player B Testify -5,-5-10,0 Refuse 0,-10-1,-1 This not a zero-sum game The interests (payoffs) of the players are no longer opposite of each other What is the best strategy to follow for each player, assuming that they are both rational 4

Player A Dominant Strategies Player B Testify Refuse Testify -5,-5 0,-10 Refuse -10,0-1,-1 Player A s payoff is greater if he testifies than if he refuses, no matter what strategy B chooses Therefore Player A does not need to consider strategy refuse since it cannot possibly yield a higher payoff Player A Dominant Strategies Player B Testify Refuse Testify -5,-5 0,-10 Refuse -10,0-1,-1 The same reasoning can be applied to Player B: Player B s payoff is greater if he testifies than if he refuses, no matter what strategy A chooses Therefore Player B does not need to consider strategy refuse since it cannot possibly yield a higher payoff 5

Player A Dominant Strategies Player B Testify Refuse Testify -5,-5 0,-10 Refuse -10,0-1,-1 We say that a strategy strictly dominates if it yields a higher payoff than any other strategy for every one of the possible actions of the other player. Key result If both players have strictly dominating strategies, they provide a solution for the game (i.e., predict the outcome of the game) a dominant strategy equilibrium Testify is a strictly dominant strategy for A Testify is a strictly dominant strategy for B Therefore (Testify, Testify) is the solution Iterated Elimination of Dominated Strategies I II III IV 3,0 5,3 2,3 3,8 4,1 5,8 8,4 3,1 5,9 9,7 6,2 2,3 5,6 9,0 6,3 4,5 More generally: We can safely remove any strategy that is strictly dominated It will never be selected as a solution for the game Iteratively removing dominated strategies is the first step in simplifying the game toward a solution Is it sufficient? Did we get lucky earlier? 6

Dominant Strategies IA IIA IIIA IB -1,6 6,-1 4,5 IIB 6,-1-1,6 4,5 IIIB 5,4 5,4 7,7 How would the two players play this game? It is not the case that we are guaranteed that both players, or even that one player has a dominant strategy We can still use the rule for simplifying the game: Get rid of the strictly dominated strategies because they will never be selected in the solution However, we need a more general way of finding a solution to the game (i.e., to predict how rational players would play the game) We need a definition that generalizes the earlier definition for zero-sum games (using minimax) IA IB -1,6 IIB 6,-1 IIIB 5,4 IIA IIIA 6,-1 4,5-1,6 4,5 5,4 7,7 u u A B ( IIIA, IIIB) ua( X, IIIB) ( IIIA, IIIB) u ( IIIA, Y) B For any strategy X of Player A For any strategy Y of Player B (IIIA,IIIB) is an equilibrium because: Player A cannot find a better strategy given that Player B uses strategy IIIB Conversely, Player B cannot find a better strategy given that Player A uses strategy IIIA 7

Side Note: More than 2 Players? The formalism extends directly to more than 2 players. If we have n players, we need to define n payoff functions u i, i=1,..,n. Payoff function u i maps a tuple of n strategies to the corresponding payoff for player i u i (s 1,..,s n ) = payoff for player i if players 1,..,n use pure strategy s 1,,s n. Everything else (definition of dominating strategies, etc. remains the same) u i More formal definition A tuple of pure strategies (s 1 *,s 2 *,..,s n *) is a pure equilibrium if, for all i s: * * * * * * * * * ( s, s, s, s, L, s ) u ( s, L, s, s, s, L s ) 1, L i 1 i i+ 1 n i 1 i 1 i i+ 1, n for any strategy s i. In words: Player i cannot find a better strategy than s i * if the other player use the remaining strategies in the equilibrium Technically, called a pure Nash Equilibrium (NE) 8

s * i More formal definition (equivalent) A tuple of pure strategies (s 1 *,s 2 *,..,s n *) is a pure equilibrium if, for all i s: = arg max u s i i ( * * * * s,, s, s, s, L, s ) 1 L i 1 i i+ 1 In words: Player i cannot find a better strategy than s i * if the other player use the remaining strategies in the equilibrium Technically, called a pure Nash Equilibrium (NE) n Examples and Question So, we ve generalized our concepts for solving games to non zero-sum games NEs Basic questions: Is there always a NE? Is it unique? 9

Example with multiple NEs Left Right Left +1,+1-1,-1 Right -1,-1 +1,+1 Two vehicles are driving toward each other they have 2 choices: Move right or move left. Why is having multiple NEs a problem? Example with multiple NEs Hockey Movie Hockey +2,+1 0,0 Movie 0,0 +1,+2 Two friends have different tastes, A likes to watch hockey games but B prefers to go see a movie. Neither likes to go to his preferred choice alone; each would rather go the other s preferred choice rather than go alone to its own. 10

Example with no pure NE I II I II 0,1 1,0 1,0 0,1 Even very simple games may not have a pure strategy equilibrium This is not surprising since we saw earlier that we had a similar problem with zero-sum games, which did not necessarily have a pure strategy solution Solution: Same trick as with zero-sum games Allow the players to randomize and to use mixed strategies Mixed Strategy Equilibrium The concept of equilibrium can be extended to mixed strategies. In that case, a mixed strategy for each player i is a vector of probabilities p i = (p ij ), such that player i chooses pure strategy j with probability p ij A set of mixed strategies (p* 1,..,p* n ) if player i (for any i) gets a lower payoff by changing p* i to any other mixed strategy p i 11

Example Hockey Movie Hockey +2,+1 0,0 Movie 0,0 +1,+2 An example mixed strategy is: A chooses Hockey with probability: p = 2/3 B chooses Hockey with probability: q = 1/3 In fact, this is a mixed strategy equilibrium for this game The expected payoff is 2/3 for both A and B Hockey Movie Example Hockey +2,+1 0,0 Movie 0,0 +1,+2 Let A choose Hockey with probability p and B choose Hockey with probability q The expected payoff for Player A is: u A = (+2)xpq + (+1)x(1-p)x(1-q) = 1-p-q+3pq The expected payoff for Player B is: u B = (+1)xpq + (+2)x(1-p)x(1-q) = 2-2p-2q+3pq At the equilibrium, the derivative of u A with respect to p is zero (because ua(p*,q*) is greater than u A for any other value (p,q*)), therefore: 3q*-1 = 0 q* = 1/3 Similarly, the derivative of u B with respect to q must be 0 at the equilibrium, therefore: 3p*-2 = 0 p* = 2/3 Of course this example is constructed specifically so that these equations can be solved very easily. 12

Key results Theorem: For any game with a finite number of players, there exists at least one equilibrium There might not exists an equilibrium with only pure strategies, but at least one mixed strategy equilibrium exists Any equilibrium survives iterated elimination of dominated strategies (in most of the examples shown in this lecture, we ll use only pure strategies) How to compute the equilibrium: Example The same product is produced by two firms A and B The unit production cost is c, so the cost to produce q A unit for firm A is C = cq A The market price depends on the total number of units produced: P = α (q A + q B ) Therefore firm A s revenue is q A (α c - (q A + q B )) Problem: How to figure out the optimal output for firm A and B? If they produce too much, the price will go down and so would the revenue for each firm If they produce too little, the revenue will be small 13

Example Each possible value of q A is a pure strategy for firm A (and similarly for B). At equilibrium, A s revenue is maximum as we vary q A The derivative of q A (α c - (q A + q B )) with respect to q A is zero at the NE Similarly, B s revenue is maximum as we vary q B The derivative of q B (α c - (q A + q B )) with respect to q B is zero at the NE Therefore (q* A,q* B ) is solution of the system: α c - 2q A - q B = 0 α c - 2q B - q A = 0 With the solution: q* A = q* B = (α c)/3 And revenue for each firm: (α c) 2 /9 Note: We ignored the fact that the price must be set to 0 for q A + q B > α Is NE really the best that the 2 players can do? Suppose that instead of trying to find an equilibrium for A and B independently, we try to maximize the total revenue: The maximum is reached for q A = q B = (α c)/4 (just take the derivative of the total revenue with respect to the total output q A + q B ) This corresponds to a revenue per firm of (α c) 2 /8, which is greater than the revenue we get from the NEs. So the firms are doing better than our wonderful theory predicts. What is going on??? Is there something wrong with the theory?? 14

Coordination vs. No coordination There is nothing wrong with the theory. The reason is that the two firms are cooperating instead of deciding their strategy independently In general, in any game, the players would get a greater payoff if they agree to cooperate (coordinate, communicate). For example, in the prisoner s dilemma, the obvious solution is for the prisoners to both refuse to testify, if they agree in advance to coordinate their actions. We have considered only games without coordination, thus the seemingly paradoxical result. Tragedy of the Commons The previous example is one example of a more general situation, illustrated by the canonical example: n farmers use a common field for grazing goats Because the common field is a finite resource shared among all the farmers, the larger the total number of goats, the less food there is, and their unit value goes down Each individual farmer gets a higher profit if they all cooperate (maximize total profit) than if they use the NE equilibrium, acting rationally In the latter case, they tend to each try to exhaust the common resource. Note: Replace the silly example by changing common field energy resources, communication bandwidth, oil,.. and farmers customers, robots, vehicles, firms, Unit price Total number For those interested in historical references: "The Tragedy of the Commons," Garrett Hardin, Science, 162(1968):1243-1248 15

NE Recipes The NEs are within the strategies that survive iterated removal of dominated strategies Iterated removal is one way to get at the NEs For strategies described by continuous variables (see previous example), the NEs are found by solving the system of equations obtained by writing that the derivative of u i with respect to si * * is zero (i.e., u is an extremum with i ( s1, L, sn ) respect to s i ): u s i i * * ( s, L, s ) 0 1 n =.. and retain solutions that are maxima Summary Matrix form of non-zero-sum games and basic concepts for those games Strict dominance and its use Definition of game equilibrium Key result: Existence of (possibly mixed) equilibrium for any finite game Understand the difference between cooperating and non-cooperating situations Continuous games and corresponding recipes 16