G5212: Game Theory. Mark Dean. Spring 2017

Similar documents
Introduction to Game Theory Lecture Note 5: Repeated Games

Game Theory. Wolfgang Frimmel. Repeated Games

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

Warm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games

CHAPTER 14: REPEATED PRISONER S DILEMMA

Repeated Games. September 3, Definitions: Discounting, Individual Rationality. Finitely Repeated Games. Infinitely Repeated Games

Repeated Games. EC202 Lectures IX & X. Francesco Nava. January London School of Economics. Nava (LSE) EC202 Lectures IX & X Jan / 16

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5

Repeated Games with Perfect Monitoring

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Prisoner s dilemma with T = 1

Finitely repeated simultaneous move game.

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

Chapter 8. Repeated Games. Strategies and payoffs for games played twice

Problem 3 Solutions. l 3 r, 1

Outline for Dynamic Games of Complete Information

ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

Lecture 5 Leadership and Reputation

Repeated Games. Econ 400. University of Notre Dame. Econ 400 (ND) Repeated Games 1 / 48

REPEATED GAMES. MICROECONOMICS Principles and Analysis Frank Cowell. Frank Cowell: Repeated Games. Almost essential Game Theory: Dynamic.

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

Stochastic Games and Bayesian Games

Stochastic Games and Bayesian Games

Eco AS , J. Sandford, spring 2019 March 9, Midterm answers

Player 2 L R M H a,a 7,1 5,0 T 0,5 5,3 6,6

Early PD experiments

Economics 171: Final Exam

University of Hong Kong ECON6036 Stephen Chiu. Extensive Games with Perfect Information II. Outline

February 23, An Application in Industrial Organization

Answer Key: Problem Set 4

M.Phil. Game theory: Problem set II. These problems are designed for discussions in the classes of Week 8 of Michaelmas term. 1

Introductory Microeconomics

MA300.2 Game Theory 2005, LSE

Infinitely Repeated Games

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

Economics 431 Infinitely repeated games

1 Solutions to Homework 4

SI Game Theory, Fall 2008

IPR Protection in the High-Tech Industries: A Model of Piracy. Thierry Rayna University of Bristol

Topics in Contract Theory Lecture 1

Simon Fraser University Spring 2014

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Optimal selling rules for repeated transactions.

An introduction on game theory for wireless networking [1]

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

Repeated games. Felix Munoz-Garcia. Strategy and Game Theory - Washington State University

EC487 Advanced Microeconomics, Part I: Lecture 9

Sequential-move games with Nature s moves.

Notes for Section: Week 4

MIDTERM ANSWER KEY GAME THEORY, ECON 395

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.

Repeated Games. Debraj Ray, October 2006

Lecture 6 Dynamic games with imperfect information

GAME THEORY: DYNAMIC. MICROECONOMICS Principles and Analysis Frank Cowell. Frank Cowell: Dynamic Game Theory

(a) (5 points) Suppose p = 1. Calculate all the Nash Equilibria of the game. Do/es the equilibrium/a that you have found maximize social utility?

Microeconomics of Banking: Lecture 5

Microeconomic Theory II Preliminary Examination Solutions

Mixed-Strategy Subgame-Perfect Equilibria in Repeated Games

Exercises Solutions: Game Theory

Game Theory. Important Instructions

Answers to Problem Set 4

Spring 2017 Final Exam

Player two accepts or rejects. If player 2 accepts, they split the pie. Otherwise, player 2 gets r 2 and player 1 gets r 1, where r 1 +r 2 < 1.

Other Regarding Preferences

ECON402: Practice Final Exam Solutions

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies

G5212: Game Theory. Mark Dean. Spring 2017

Agenda. Game Theory Matrix Form of a Game Dominant Strategy and Dominated Strategy Nash Equilibrium Game Trees Subgame Perfection

Introduction to Game Theory

14.12 Game Theory Midterm II 11/15/ Compute all the subgame perfect equilibria in pure strategies for the following game:

CUR 412: Game Theory and its Applications, Lecture 9

Game Theory: Additional Exercises

is the best response of firm 1 to the quantity chosen by firm 2. Firm 2 s problem: Max Π 2 = q 2 (a b(q 1 + q 2 )) cq 2

Game Theory Fall 2006

Extensive-Form Games with Imperfect Information

Topics in Contract Theory Lecture 3

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

MA200.2 Game Theory II, LSE

Iterated Dominance and Nash Equilibrium

Online Appendix for Military Mobilization and Commitment Problems

The folk theorem revisited

Problem Set 2 Answers

Game theory and applications: Lecture 1

Game Theory with Applications to Finance and Marketing, I

Solution to Tutorial 1

m 11 m 12 Non-Zero Sum Games Matrix Form of Zero-Sum Games R&N Section 17.6

Group-lending with sequential financing, contingent renewal and social capital. Prabal Roy Chowdhury

1 Solutions to Homework 3

Solution to Tutorial /2013 Semester I MA4264 Game Theory

Dynamic Games. Econ 400. University of Notre Dame. Econ 400 (ND) Dynamic Games 1 / 18

Follow the Leader I has three pure strategy Nash equilibria of which only one is reasonable.

1 x i c i if x 1 +x 2 > 0 u i (x 1,x 2 ) = 0 if x 1 +x 2 = 0

Game Theory Fall 2003

CUR 412: Game Theory and its Applications, Lecture 12

Maintaining a Reputation Against a Patient Opponent 1

Noncooperative Oligopoly

Economics 502 April 3, 2008

Subgame Perfect Cooperation in an Extensive Game

CHAPTER 15 Sequential rationality 1-1

Transcription:

G5212: Game Theory Mark Dean Spring 2017

Bargaining We will now apply the concept of SPNE to bargaining A bit of background Bargaining is hugely interesting but complicated to model It turns out that the outcome depends a lot on the details of the game i.e. the bargaining protocol you assume Has led some to thing that non-cooperative game theory is not the way to go See Nash 1950

Bargaining However, Sequential Bargaining is still a classic framework One side makes an offer The other side can either accept or reject If they reject then either the game ends or they get to make a counter offer We will analyze a finite (two stage) bargaining game Then an infinite game Rubinstein bargaining

Two Stage Bargaining Example Two Stage Bargaining There are two players bargaining over a pie of size 1. There are two rounds: Player 1 first propose a way to split the pie Player 2 either accepts or rejects it. If an offer is accepted, the pie is split accordingly. Otherwise, the pie shrinks by 10% and the game proceed to the next round. Player 2 proposes in the second round, and player 1 either accepts it or rejects it. If the proposal is rejected, the pie disappears. Otherwise, the pie is split accordingly.

Infinite Period Bargaining Example Rubinstein Bargaining. Now suppose there are 2n periods: *In t = 2n, P2 proposes (0, 1) ; *in t = 2n 1, P1 proposes (1 δ, δ) ; *in t = 2n 2, ((1 δ) δ, 1 (1 δ) δ) = ( δ δ 2, 1 δ + δ 2) ; *in t = 2n 3, ( 1 δ ( 1 δ + δ 2), δ ( ( 1 δ + δ 2)) = 1 δ + δ 2 δ 3, δ δ 2 + δ 3). *Inductively, ( we find, in period 1, 1 δ + δ 2 δ 3 +... + δ 2n 2 δ 2n 1, δ δ 2 + δ 3.. ) ( = 2n 1 k=0 ( δ)k, 1 ( ) 2n 1 k=0 ( δ)k) = 1 δ 2n 1+δ, δ+δ2n 1+δ ( ) *If n, the division goes to 1 1+δ, δ 1+δ

Infinite Period Bargaining Infinite Horizon Bargaining (Rubinstein 1982 Ecma). There are two players bargaining over a pie of size 1. Player 1 proposes at t = 1, 3,... and player 2 proposes at t = 2, 4,... They discounts future payoff with a discount factor δ (0, 1). Theorem. There is a unique SPNE: in any period, proposer δ offers 1+δ to the other and keeps 1 1+δ for himself; responder accepts an offer iff it is at least δ 1+δ.

Infinite Period Bargaining Proof of Uniqueness Uniqueness: Let m be the smallest amount a proposer receives over all SPNE in all subgames. Let M be the highest amount. Claim 1. m 1 δm The reason is that the responder today will be the proposer tomorrow this player will get at most M tomorrow. Hence, today s proposer, can ensure an acceptance by offering δm to the receiver. Hence, his payoff must be no smaller than 1 δm. Claim 2. M 1 δm. The responder today has a payoff of at least δm (because he gets at least m tomorrow as a proposer). So the responder will not accept anything less than δm today. If δm is accepted today, then the proposer receives a payoff of 1 δm; if it is rejected, the responder today (i.e., the proposer tomorrow) will not offer more than δm tomorrow (which is only δ 2 M discounted back today). Therefore, M max { 1 δm, δ 2 M }.

Infinite Period Bargaining Proof of Uniqueness (conti.) Putting together the two claims, we have m 1 δm 1 δ (1 δm) = m 1 1 + δ M 1 δm 1 δ (1 δm) = M 1 1 + δ Therefore, m = M = 1 1 + δ.

Infinite Period Bargaining Comments on Rubinstein Bargaining Immediate agreement. First mover advantage Discounting 1 δ is a friction basically the pie shrinks over time. If the friction is larger, i.e., δ smaller, then the proposer gets a larger share. If the friction disappears, i.e., δ 1, then shares 1 2 (this is the Nash bargaining solution).

Repeated Games Think back to the prisoner s dilemma game Is it reasonable to think that people cannot sustain the good outcome of keeping quiet? Perhaps the problem is that people play prisoner s dilemma more than once Can that solve the problem?

Finitely Repeated Games Example C D C 1, 1 1, 2 D 2, 1 0, 0 This game is repeatedly played for T periods,. All previous actions are observable (perfect monitoring). Each player values the sum of payoffs over all periods. Can the repetition help us avoid the Prisoner s Dilemma outcome? No! Use backward induction

Finitely Repeated Games Theorem If the stage game has a unique Nash Equilibrium, then the unique SPNE of the finitely repeated game is for both players to play the Nash Equilibrium in each round Note that this is not the same as saying that finite repetition has no power!

Finitely Repeated Games Example L C R T 1, 1 5, 0 0, 0 M 0, 5 4, 4 0, 0 B 0, 0 0, 0 3, 3 Pure Strategy NE: (T,L), (BR) Suppose the game is played twice? Finite repetition can help Infinite repetition can help more!

Infinitely Repeated Game Stage game: G = {(A i, u i )}. A i is the action space for player i. a = (a 1,..., a n ) is the action profile. The game is played repeatedly at t = 0, 1,... with discount factor δ (0, 1) History up to date t: h t := ( a 0,..., a t 1) A t =: H t ; H 0 = { }. Set of possible histories H := t=0 Ht. Pure strategy of player i: s i : H A i An outcome path induced by s = (s 1,..., s n ) is denoted by a (s) = ( a 0 (s), a 1 (s),... ) Payoff U i (s) = (1 δ) ( t=0 δt u i a t (s) )

Infinitely Repeated Prisoner s Dilemma C D C 1, 1 1, 2 D 2, 1 0, 0 Consider the following strategies of players 1 and 2: play C no matter what: s i ( h t ) = C for any h t H Player 1 s payoff is U 1 (s 1, s 2 ) = (1 δ) t=0 δt u 1 (C, C) = (1 δ) t=0 δt = 1.

Infinitely Repeated Prisoner s Dilemma C D C 1, 1 1, 2 D 2, 1 0, 0 Consider the following strategy of players 1 and 2, grim trigger : Play C if your opponent has never played D Play D if your opponent has ever played D Player 1 s payoff is U 1 (s 1, s 2 ) = (1 δ) ( t=0 δt u 1 a t 1 (s), a t 2 (s) ) = (1 δ) t=0 δt = 1.

Infinitely Repeated Prisoner s Dilemma Is this an equilibrium? How can we check? Use the one shot deviation principle! Definition Player i has a profitable one-shot deviation from his strategy s i : H A i for a fixed profile of his opponents strategies s i, where S j : H A j, if there exists a history h t in which player i can profitably deviate from s i ( h t ) to some other action at h t and reverts to play s i afterwards.

One Shot Deviation Principle We showed that, in finite games we could test for SPNE by testing the one-shot deviation principle Can we do the same thing here? Not necessarily Consider the following (very silly) game One player Pick a or b in every period If always pick a get 1 Otherwise get 0 The strategy of playing b in every period is not SPNE, but survives one shot deviation

One Shot Deviation Principle Theorem For δ < 1,a strategy profile is subgame perfect if there are no profitable one-shot deviations.

Infinitely Repeated Prisoner s Dilemma C D C 1, 1 1, 2 D 2, 1 0, 0 Is play C no matter what immune to one shot deviations? s i ( h t ) = C for any h t H Consider the one shot deviation of playing D in period 0. Call this strategy s 1. Player 1 s payoff from (s 1, s 2 ) is U 1 (s 1, s 2 ) = (1 δ) ( t=0 δt u 1 a t 1 (s 1, s 2 ), a t 2 (s 1, s 2 ) ) = (1 δ) ( 2 + 1 δ + 1 δ 2 +... ) = 2 δ > 1.

Infinitely Repeated Prisoner s Dilemma C D C 1, 1 1, 2 D 2, 1 0, 0 Is Grim Trigger immune to one shot deviations? Consider the following one-shot deviation of player 1: Play D at t = 0 and then follow s 1 afterwards. Call this s 1 Player 1 s payoff from (s 1, s 2 ) is U 1 (s 1, s 2 ) = (1 δ) ( t=0 δt u 1 a t 1 (s 1, s 2 ), a t 2 (s 1, s 2 ) ) = (1 δ) ( 2 + δ1 + δ 2 0... ) = (1 δ) (2 δ). Note that, after the 1 shot deviation will play C next period and will get punished, hence 1 in period 2

Infinitely Repeated Prisoner s Dilemma C D C 1, 1 1, 2 D 2, 1 0, 0 If δ > 0.382 this is not profitable BUT, for SPNE have to check each subgame! Let s imagine that play in the first period was (C, D) According to the grim trigger, Player 1 will play D in period 2 Player 2 will play C in period 2 After which both players will play D forever

Infinitely Repeated Prisoner s Dilemma C D C 1, 1 1, 2 D 2, 1 0, 0 Consider only the payoff from period 2 onwards Payoff of Player 2 playing C in period 2 is (1 δ)( 1 + 0 + 0 +...) Payoff of the 1 shot deviation to D in period 2 is (1 δ)(0 + 0 + 0 +...)

Infinitely Repeated Prisoner s Dilemma C D C 1, 1 1, 2 D 2, 1 0, 0 Grim Trigger not SPNE as stated Need a slightly modified version Play C if no one has ever played D Play D if anyone has ever played D Can show that this is SPNE as long as δ > 1 2

Possible Outcomes in Infinitely Repeated Games We have shown that, in the infinitely repeated prisoner s dilemma we can support the outcome (1, 1) as long as people are patient enough A natural question is: What is the range of possible outcomes that can be supported in a repeated game?

Example C D C 5, 5 0, 6 D 6, 0 1, 1 Can there be a SPNE with payoffs (5.9, 0.1)?

Possible Outcomes in Infinitely Repeated Games It seems that this is unlikely, as by playing D, player 2 can guarantee themselves a payoff of 1 Why would they every accept 0.1? Pure strategy minmax payoff for player i: min max u i (a i, a i ) a i a i We say a payoff for player i is individually rational if it is above their minmax payoff We would not expect to be able to support any payoffs that are not IR Are there any other restrictions?

Possible Outcomes in Infinitely Repeated Games No! every payoff vector that is strictly individually rational for each player can be supported as SPNE if the discount factor is large enough.

Folk Theorem More formally: Feasible payoff: con {(u 1 (a 1, a 2 ), u 2 (a 1, a 2 )) : a 1 A 1, a 2 A 2 }. Folk theorem Every action profile a that gives a payoff that is strictly individually rational can be played on the equilibrium path repeatedly on a SPNE if δ is large enough. Every payoff profile that is feasible and strictly individually rational can be supported as a SPNE payoff if δ is large enough.

Folk Theorem Sketch of proof: 1 All players start by playing a and continue to play a if no deviation occurs. 2 If any one player, say player i, deviated, play the strategy profile m which minmaxes i for N periods. 3 If no players deviated from phase 2, all player j i gets rewarded ε above j s min-max forever after, while player i continues receiving his min-max 4 If player j deviated from phase 2, all players restart phase 2 with j as target.

Comment on Folk Theorem: Bad result: everything is feasible, and hence no predictive power at all. Good result: shows the power of dynamic incentives.