ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

Similar documents
Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

G5212: Game Theory. Mark Dean. Spring 2017

CHAPTER 14: REPEATED PRISONER S DILEMMA

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

The Ohio State University Department of Economics Second Midterm Examination Answers

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Microeconomics II. CIDE, MsC Economics. List of Problems

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

Repeated Games. EC202 Lectures IX & X. Francesco Nava. January London School of Economics. Nava (LSE) EC202 Lectures IX & X Jan / 16

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

Introduction to Game Theory Lecture Note 5: Repeated Games

Warm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games

Chapter 8. Repeated Games. Strategies and payoffs for games played twice

S 2,2-1, x c C x r, 1 0,0

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Answers to Problem Set 4

Université du Maine Théorie des Jeux Yves Zenou Correction de l examen du 16 décembre 2013 (1 heure 30)

Simon Fraser University Spring 2014

February 23, An Application in Industrial Organization

Repeated Games with Perfect Monitoring

HW Consider the following game:

Game Theory. Wolfgang Frimmel. Repeated Games

Answer Key: Problem Set 4

Economics 171: Final Exam

1 Solutions to Homework 4

Game Theory: Additional Exercises

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

M.Phil. Game theory: Problem set II. These problems are designed for discussions in the classes of Week 8 of Michaelmas term. 1

Problem 3 Solutions. l 3 r, 1

Game Theory. Important Instructions

Repeated Games. Econ 400. University of Notre Dame. Econ 400 (ND) Repeated Games 1 / 48

Microeconomic Theory II Preliminary Examination Solutions Exam date: June 5, 2017

Stochastic Games and Bayesian Games

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

University at Albany, State University of New York Department of Economics Ph.D. Preliminary Examination in Microeconomics, June 20, 2017

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2017

Not 0,4 2,1. i. Show there is a perfect Bayesian equilibrium where player A chooses to play, player A chooses L, and player B chooses L.

m 11 m 12 Non-Zero Sum Games Matrix Form of Zero-Sum Games R&N Section 17.6

Eco AS , J. Sandford, spring 2019 March 9, Midterm answers

ECONS 424 STRATEGY AND GAME THEORY MIDTERM EXAM #2 ANSWER KEY

preferences of the individual players over these possible outcomes, typically measured by a utility or payoff function.

EC487 Advanced Microeconomics, Part I: Lecture 9

THE PENNSYLVANIA STATE UNIVERSITY. Department of Economics. January Written Portion of the Comprehensive Examination for

Games of Incomplete Information ( 資訊不全賽局 ) Games of Incomplete Information

REPEATED GAMES. MICROECONOMICS Principles and Analysis Frank Cowell. Frank Cowell: Repeated Games. Almost essential Game Theory: Dynamic.

Stochastic Games and Bayesian Games

is the best response of firm 1 to the quantity chosen by firm 2. Firm 2 s problem: Max Π 2 = q 2 (a b(q 1 + q 2 )) cq 2

Outline for Dynamic Games of Complete Information

Lecture 6 Dynamic games with imperfect information

Repeated Games. September 3, Definitions: Discounting, Individual Rationality. Finitely Repeated Games. Infinitely Repeated Games

Extensive-Form Games with Imperfect Information

Lecture 5 Leadership and Reputation

Answers to Microeconomics Prelim of August 24, In practice, firms often price their products by marking up a fixed percentage over (average)

(a) Describe the game in plain english and find its equivalent strategic form.

CUR 412: Game Theory and its Applications, Lecture 9

Microeconomics I. Undergraduate Programs in Business Administration and Economics

PAULI MURTO, ANDREY ZHUKOV

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5

Simon Fraser University Fall Econ 302 D200 Final Exam Solution Instructor: Songzi Du Wednesday December 16, 2015, 8:30 11:30 AM

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

Bayesian Nash Equilibrium

Prisoner s dilemma with T = 1

Problem Set 2 Answers

Topics in Contract Theory Lecture 1

Introductory Microeconomics

Microeconomics Comprehensive Exam

Microeconomics of Banking: Lecture 5

Economics 502 April 3, 2008

13.1 Infinitely Repeated Cournot Oligopoly

Elements of Economic Analysis II Lecture XI: Oligopoly: Cournot and Bertrand Competition

Infinitely Repeated Games

Solution to Tutorial 1

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Solution to Tutorial /2013 Semester I MA4264 Game Theory

Game Theory Fall 2003

Microeconomic Theory August 2013 Applied Economics. Ph.D. PRELIMINARY EXAMINATION MICROECONOMIC THEORY. Applied Economics Graduate Program

ECO410H: Practice Questions 2 SOLUTIONS

PAULI MURTO, ANDREY ZHUKOV. If any mistakes or typos are spotted, kindly communicate them to

Player 2 H T T -1,1 1, -1

Sequential-move games with Nature s moves.

CS711: Introduction to Game Theory and Mechanism Design

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

MS&E 246: Lecture 5 Efficiency and fairness. Ramesh Johari

The Ohio State University Department of Economics Econ 601 Prof. James Peck Extra Practice Problems Answers (for final)

AS/ECON 2350 S2 N Answers to Mid term Exam July time : 1 hour. Do all 4 questions. All count equally.

1 x i c i if x 1 +x 2 > 0 u i (x 1,x 2 ) = 0 if x 1 +x 2 = 0

Game Theory with Applications to Finance and Marketing, I

Advanced Microeconomic Theory EC104

An introduction on game theory for wireless networking [1]

Microeconomic Theory II Preliminary Examination Solutions

Spring 2017 Final Exam

MATH 4321 Game Theory Solution to Homework Two

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

Department of Agricultural Economics. PhD Qualifier Examination. August 2010

Francesco Nava Microeconomic Principles II EC202 Lent Term 2010

Name. Answers Discussion Final Exam, Econ 171, March, 2012

Transcription:

University of Illinois Spring 01 ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves Due: Reading: Thursday, April 11 at beginning of class Fudendberg and Triole, Sections and 51 See Chapter for examples Ozdaglar lecture notes 1,1,15, & 16 (on extensive form games and repeated games) 1 [Another subgame perfect equilibria in repeated prisoner s dilemma] Consider the repeated game with the stage game being prisoner s dilemma with the payoff c d functions g i given by c 1,1-1, Action c means to cooperate Suppose player 1 has d,-1 0,0 discount factor δ 1 and player has discount factor δ, so the payoff for player i is given by J i (a) = t=0 δt i g i(a(t)), for a = ((a 1 (t), a (t)) : t 0) with a i (t) {c, d} Consider the player 1: c, d, d, c, c, c, c, following pair of scripts, for play beginning at time t = 0 : player : d, d, d, d, c, c, c, Both scripts have players playing c for t Let µ i be the trigger strategy that follows the script for player i as long as both players have followed the script in the past If either player has deviated from the script in the past, player i always plays d Let µ = (µ 1, µ ) (a) Express J i (µ ) in terms of δ i for i {1, } Solution: Under µ both players follow their scripts for all time The payoff sequence for player 1 is 1, 0, 0, 1, 1, 1, 1, so J 1 (µ ) = 1 δ1 + δ 1 The payoff sequence for player is, 0, 0,, 1, 1, 1, so J (µ ) = + δ + δ 1 δ (b) For what values of (δ 1, δ ) is µ a subgame perfect equilibrium (SPE) of the repeated game? Justify your answer using the one-stage-deviation principle Solution: Consider player 1 assuming player uses µ Let k 0, let h k denote a history for time k, and let µ 1 be the strategy (there is only one because the action space for the stage game has only two actions) that differs from µ 1 only for the time and history ) If h k is not equal to the two scripts up to time k 1 then player will be playing d from time k onward Player 1 under µ 1 will play c at time k and then play d onward Therefore, J (k) 1 ( µ 1, µ 1 h k ) J (k) 1 (µ 1, µ 1 h k ) = (δ 1 ) k < 0 It remains to check the cases when h k is given by the scripts up to time k 1 There is one such case for each value of k, and the resulting payoff sequences for µ 1 are listed in the following table deviation time payoff sequence payoff none (ie µ 1 ) 1, 0, 0, 1, 1, 1, 1, 1 δ 1 + δ 1 0 0, 0, 0, 0, 0, 0, 0, 0 1 1, 1, 0, 0, 0, 1 δ 1 < 0 1, 0, 1, 0, 0, 0, 1 δ1 < 0 1, 0, 0, 0, 1 < 0 k 1, 0, 0, 1, 1, 1, 1, }{{}, 0, 0, 0, 1 δ1 + δ 1 δk 1 + δ1 k time k

The payoff for no deviation is greater than or equal to the payoff for deviation at time k = 0 if 1 δ1 + δ 1 0, or δ 1 [076177, 1) The payoffs for deviation at time k with 1 k are negative, so are less than the payoff for deviation at time k = 0 Finally, the payoff for deviation at time k minus the payoff for no deviation is δ1 k( 1 ) which is less than or equal to zero for δ 1 [05, 1) Thus, the conditional payoff for µ 1 is greater than or equal to the conditional payoff for µ for any choice of ) if and only if δ 1 [076177, 1) Similarly, consider player assuming player 1 uses µ 1 Let k 0, let h k denote a history for time k, and let µ be the strategy that differs from µ only for the time and history ) By the same reasoning as above, if h k is not equal to the two scripts up to time k 1 then J (k) ( µ, µ h k ) J (k) (µ, µ h k ) = (δ ) k < 0 It remains to check the cases when h k is given by the scripts up to time k 1 There is one such case for each value of k, and the resulting payoff sequences for µ are listed in the following table deviation time payoff sequence payoff none (ie µ ), 0, 0,, 1, 1, 1, + δ + δ 1 δ 0 1, 0, 0, 0, 0, 0, 0, 1 1, 1, 0, 0, 0, 0, 0, δ, 0, 1, 0, 0, 0, δ, 0, 0, 1, 0, + δ k, 0, 0,, 1, 1, 1, }{{}, 0, 0, 0, + δ + δ δk 1 δ + δ k time k The payoff for k = is larger than for 0 k, so there is no need to consider the cases 0 k further The payoff for deviation at time k minus the payoff for no deviation is δ k ( 1 1 δ ) which is less than or equal to zero for δ [05, 1) Thus, µ be better than µ for any choice of ) if and only if δ [05, 1) Combining the results above, we find µ is an SPE for the repeated game if and only if (δ 1, δ ) [076177, 1) [05, 1) [Feasible payoffs for a trigger strategy in two stage game for a stage game with multiple NEs] Consider a multistage game such that the stage game has the following payoff matrix, where player 1 is the row player and player is the column player: 1 1, 5,,,5 6,6,,, 8,8 (a) Identify two NE for the stage game Solution: Both (1,1) and (,) are NE (b) Consider a two stage game obtained by playing the above stage game twice, with discount factor equal to one, so payoffs for the repeated game are equal to the sum of payoffs for the two stages Describe a subgame perfect equilibrium for the two stage game so that

the payoff vector is (1, 1) Verify that the strategy profile is subgame perfect using the single-deviation principle Solution: Player i uses the following strategy µ i : Play in the first stage If both players play in the first stage, then play in the second stage Else play 1 in the second stage Clearly µ = (µ 1, µ ) has payoff vector (1,1) We show that µ is subgame perfect using the single deviation principle By symmetry, we can focus on the responses of player 1 assuming player uses µ For a subgame beginning in stage, there are two possibilities If both players played in the first stage, then player 1 knows player will play in the second stage, and µ 1 gives the best response to that, namely to play as well If at least one of the players did not play in the first stage, then player 1 knowns that player will play 1 in the second stage, and µ 1 gives the best response to that, namely to play 1 as well So µ 1 gives the best response for the sub games beginning in the second stage, assuming player two uses µ The other subgame to consider for player one is the original game itself, and we need to consider the case player 1 deviates only in the first stage If player 1 were to play either 1 or in the first stage, and follow µ 1 in the second stage, then both players will play 1 in the second stage, getting payoffs each in the second stage Thus, the total payoff of player 1 will be less than or equal to 1 if he deviates from µ 1 in the first stage only Therefore, µ satisfies the single deviation condition, and is therefore an SPE [Repeated play of Cournot game] Consider the repeated game with the stage game being two-player Cournot competition with payoffs g i (s) = s i (a s 1 s c), where a > c > 0 Here s i represents a quantity produced by firm i, the market price is a s 1 s and c is the production cost per unit produced The action space for each player is the interval [0, ) (a) Identify the pure strategy NE of the stage game and the corresponding payoff vector Solution: Each g i is a concave function of s i and it is maximized at the point g i(s) s i = 0 yielding the NE (s 1, s ) = ( a c, a c ) and payoff vector v NE = ( (a c), (a c) ) (b) Identify the maxmin value v i for each player for the stage game Solution: For each player i, since the other player can produce so much that the price is zero (or even negative),the maxmin strategy of player i is s i = 0, so that v i = 0 (c) Identify the feasible payoff vectors for the stage game Solution: The sum of payoffs, (s 1 + s )(a c s 1 s ), ranges over the interval [0, (a c) ] as s 1 + s ranges over the interval [0, a c] For a given nonzero value of the sum s 1 + s, the sum payoff can be split arbitrarily between the two players by adjusting the ratio {(v 1, v ) R + : v 1 + v (a c) s 1 s 1 +s Therefore, the set of nonnegative feasible payoff vectors is } (d) Identify the payoff vectors for the repeated game that are feasible for Nash equilibrium of the repeated game for the discount factor δ sufficiently close to one, guaranteed by Nash s general feasibility theorem Solution: Since the vector of maxmin payoffs is the zero vector, Nash s feasible region for the repeated game is {(v 1, v ) : v 1 > 0, v > 0, v 1 + v (a c) (e) Identify the payoff vectors for the repeated game that are feasible for subgame perfect equilibrium of the repeated game for the discount factor δ sufficiently close to one, guaranteed by Friedman s general feasibility theorem }

Solution: {(v 1, v ) : v i > vi NE, v 1 + v (a c) } (f) Identify the payoff vectors for the repeated game that are feasible for subgame perfect equilibrium of the repeated game for the discount factor δ sufficiently close to one, guaranteed by the Fudenberg/Maskin general feasibility theorem Solution: Same region as in part (d) [A Cournot game with incomplete information] Given c > a > 0 and 0 < p < 1, consider the following version of a Cournot game Suppose the type θ 1 of player 1 is either zero, in which case his production cost is zero, or one, in which case his production cost is c (per unit produced) Player has only one possible type, and has production cost is c Player 1 knows his type θ 1 It is common knowledge that player believes player 1 is type one with probability p Both players sell what they produce at price per unit a q 1 q, where q i is the amount produced by player i A strategy for player 1 has the form (q 1,0, q 1,1 ), where q 1,θ1 is the amount produced by player one if player 1 is type θ 1 A strategy for player is q, the amount produced by player (a) Identify the best responses of each player to strategies of the other player Solution: The best response functions are given by: q 1,0 = arg max q 1 q 1 (a q q 1 ) = (a q ) + q 1,1 = arg max q 1 (a c q q 1 ) = (a c q ) + q 1 { q = arg max (1 p)q (a c q 1,0 q ) + pq (a c q 1,1 q ) } q = arg max q (a c q 1 q ) = (a c q 1) + q where q 1 = (1 p)q 1,0 + pq 1,1 (b) Identify the Bayes-Nash equilibrium of the game For simplicity, assume a c (Note: After finding the equilibrium, it can be shown that the three corresponding payoffs are given by u (q NE (a ( p)c) ) = and u 1 (q NE θ 1 = 0) = (a + ( p)c), u 1 (q NE θ 1 = 1) = 6 (a (1 + p)c) 6 As expected, u (q NE ) is increasing in p, and both u 1 (q NE θ 1 = 0) and u 1 (q NE θ 1 = 1) are decreasing in p with u (q NE ) < u 1 (q NE θ 1 = 1) < u 1 (q NE θ 1 = 0) for 0 < p < 1 In the limit p = 1, u (q NE ) = u 1 (q NE θ 1 = 1) = (a c), as in the original Cournot game with complete information and production cost c for both players) Solution: Substituting the best response functions for player one into the definition of q 1 in part (a), yields q 1 = a pc q

so that we have coupled fixed point equations for q 1 and q Seeking strictly positive solutions we have q = (a c q 1) + which can be solved to get = (a c a pc q ) = a ( p)c + q q = a ( p)c, q 10 = a + ( p)c, q 1,1 = 6 a (1 + p)c, 6 Furthermore, q 1 = a+(1 p)c 5