Mixed-Strategy Subgame-Perfect Equilibria in Repeated Games

Similar documents
Construction of Subgame-Perfect Mixed-Strategy Equilibria in Repeated Games

Repeated Games. EC202 Lectures IX & X. Francesco Nava. January London School of Economics. Nava (LSE) EC202 Lectures IX & X Jan / 16

CHAPTER 14: REPEATED PRISONER S DILEMMA

Introduction to Game Theory Lecture Note 5: Repeated Games

Game Theory. Wolfgang Frimmel. Repeated Games

G5212: Game Theory. Mark Dean. Spring 2017

Repeated Games. September 3, Definitions: Discounting, Individual Rationality. Finitely Repeated Games. Infinitely Repeated Games

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5

Repeated Games with Perfect Monitoring

Discounted Stochastic Games with Voluntary Transfers

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Outline for Dynamic Games of Complete Information

Chapter 8. Repeated Games. Strategies and payoffs for games played twice

1 Solutions to Homework 4

The folk theorem revisited

February 23, An Application in Industrial Organization

Repeated Games. Debraj Ray, October 2006

Stochastic Games and Bayesian Games

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

Renegotiation in Repeated Games with Side-Payments 1

Repeated Games. Econ 400. University of Notre Dame. Econ 400 (ND) Repeated Games 1 / 48

Game Theory Fall 2003

Prisoner s dilemma with T = 1

Warm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games

Game Theory Fall 2006

Finite Memory and Imperfect Monitoring

Infinitely Repeated Games

Introduction to Game Theory

Stochastic Games and Bayesian Games

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Player 2 L R M H a,a 7,1 5,0 T 0,5 5,3 6,6

Computing equilibria in discounted 2 2 supergames

IPR Protection in the High-Tech Industries: A Model of Piracy. Thierry Rayna University of Bristol

REPEATED GAMES. MICROECONOMICS Principles and Analysis Frank Cowell. Frank Cowell: Repeated Games. Almost essential Game Theory: Dynamic.

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Microeconomic Theory II Preliminary Examination Solutions

Finite Memory and Imperfect Monitoring

Repeated games with present-biased preferences

Introductory Microeconomics

MS&E 246: Lecture 5 Efficiency and fairness. Ramesh Johari

Topics in Contract Theory Lecture 1

Early PD experiments

Graduate School of Decision Sciences

Repeated Games. Olivier Gossner and Tristan Tomala. December 6, 2007

Un-unraveling in Nearby, Finite-Horizon Games

Switching Costs in Infinitely Repeated Games 1

Not 0,4 2,1. i. Show there is a perfect Bayesian equilibrium where player A chooses to play, player A chooses L, and player B chooses L.

Best-Reply Sets. Jonathan Weinstein Washington University in St. Louis. This version: May 2015

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

Lecture 5 Leadership and Reputation

Outline for today. Stat155 Game Theory Lecture 19: Price of anarchy. Cooperative games. Price of anarchy. Price of anarchy

Game Theory for Wireless Engineers Chapter 3, 4

Relational Incentive Contracts

Advanced Microeconomics

An introduction on game theory for wireless networking [1]

Problem 3 Solutions. l 3 r, 1

Economics 431 Infinitely repeated games

Extensive-Form Games with Imperfect Information

Columbia University. Department of Economics Discussion Paper Series. Repeated Games with Observation Costs

High Frequency Repeated Games with Costly Monitoring

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.

Efficient Outcomes in Repeated Games with Limited Monitoring and Impatient Players 1

REPUTATION WITH LONG RUN PLAYERS

Economics 171: Final Exam

Communication in Repeated Games with Costly Monitoring

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

Lecture 3 Representation of Games

In the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S.

Game Theory. Important Instructions

Department of Agricultural Economics. PhD Qualifier Examination. August 2010

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

Efficient Outcomes in Repeated Games with Limited Monitoring

SI Game Theory, Fall 2008

Online Appendix: Power Dynamics in Organizations, Jin Li, Niko Matouschek, and Michael Powell. Appendix A: Optimal Relational Contract in the Baseline

Unattainable Payoffs for Repeated Games of Private Monitoring

Answer Key: Problem Set 4

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

Finding Equilibria in Games of No Chance

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

Microeconomics of Banking: Lecture 5

Game Theory: Normal Form Games

Economics 502 April 3, 2008

DISCOUNTED STOCHASTIC GAMES WITH VOLUNTARY TRANSFERS. Sebastian Kranz. January 2012 COWLES FOUNDATION DISCUSSION PAPER NO. 1847

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

Microeconomic Theory III Final Exam March 18, 2010 (80 Minutes)

Competing Mechanisms with Limited Commitment

EC487 Advanced Microeconomics, Part I: Lecture 9

The Limits of Reciprocal Altruism

Maintaining a Reputation Against a Patient Opponent 1

When does strategic information disclosure lead to perfect consumer information?

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

Answers to Microeconomics Prelim of August 24, In practice, firms often price their products by marking up a fixed percentage over (average)

M.Phil. Game theory: Problem set II. These problems are designed for discussions in the classes of Week 8 of Michaelmas term. 1

Optimal selling rules for repeated transactions.

Econ 323 Microeconomic Theory. Chapter 10, Question 1

MATH 121 GAME THEORY REVIEW

Coordination Games on Graphs

Transcription:

Mixed-Strategy Subgame-Perfect Equilibria in Repeated Games Kimmo Berg Department of Mathematics and Systems Analysis Aalto University, Finland (joint with Gijs Schoenmakers) July 8, 2014

Outline of the presentation Illustrative example Shows how players may randomize in repeated games Convert into various normal-form games by using different continuation payoffs Abreu-Pierce-Stacchetti fixed-point characterization Extension to behavior strategies Self-supporting sets to find equilibria in behavior strategies Comparison between pure, behavior and correlated strategies

The model Infinitely repeated game Stage game with finitely many actions Discounting (possibly unequal discount factors) Behavior strategies (randomization and history-dependent) Players observe realized pure actions (not randomizations)

The model (2) Finite set of players N = {1,, n} Finite set of pure actions A i, i N, A = i N A i Mixed action q i (a i ) 0, profile q = (q 1,, q N ) Probability of pure action profile a A: π q (a) = j N q j(a j ) Stage game payoff u i (q) = a A u i(a)π q (a) Histories H k = A k for stage k 0, H 0 = Behavior strategy σ i : H Q i Discounted payoff U i (σ) = E [ (1 δ i ) k=0 δk i uk i (σ)]

Payoffs from stage games 4 4 4 3 3 3 2 2 2 1 1 1 0 0 1 2 3 4 0 0 1 2 3 4 0 0 1 2 3 4 3,3 1,3 3,1 1,1 7/3,7/3 1/3,3 11/3,1 1,1 0,0 2,1 1,2 0,0

Prisoner s Dilemma 3,3 (a) 0,4 (b) 4,0 (c) 1,1 (d) What are equilibria in pure, behavior and correlated strategies? Common discount factor δ = 1/3 The pure action profiles are called a, b, c and d

Prisoner s Dilemma (2) 3,3 1/3,3 3,1/3 5/3,5/3 7/3,7/3 1/3,3 3,1/3 1,1 Left: No unilateral deviation, a and d followed by cooperation, b and c by punishment Right: d after all pure action profiles

Prisoner s Dilemma: Pure strategies 4 35 3 25 2 15 1 1 15 2 25 3 35 4 Berg and Kitti (2010): elementary subpaths d,aa,ba,bc,ca,cb Equilibrium paths are compositions of the elementary subpaths, eg, d 7 (bc) 3 a

Prisoner s Dilemma: Correlated strategies 4 3 2 1 0 0 1 2 3 4 All reasonable (feasible and individually rational) payoffs

Prisoner s Dilemma: Behavior strategies 4 3 2 1 0 0 1 2 3 4 Union of rectangle (1, 3) (1, 3) and two lines How do we get these payoffs?

Prisoner s Dilemma: Behavior strategies (2) 3,3 0,4 4,0 1,1 7/3,7/3 1/3,3 11/3,1 1,1 Find follow-up strategies and continuation payoffs so that payoffs correspond to the game on right Action profiles a, b and d are followed by d (SPEP) and c is followed by a (SPEP) ad : (1 δ)(3, 3) + δ(1, 1) = (7/3, 7/3) ca : (1 δ)(4, 0) + δ(3, 3) = (11/3, 1) Produces the red lines of payoffs

Prisoner s Dilemma: Behavior strategies (3) 3,3 0,4 4,0 1,1 3,3 1,3 3,1 1,1 Find continuation payoffs: a (3, 3), b (3, 1), c (1, 3), d (1, 1) (1 δ)(0, 4) + δ(3, 1) = (1, 3) a is followed by a, d is followed by d b is followed by (cb) : (1 δ)(1 δ 2 ) 1 [(4, 0) + δ(0, 4)] = (3, 1) No randomization needed (not as easy in general!) Produces the green rectangle of payoffs

Characterization of Equilibria à la APS Carrier of mixed action Car(q i ) = {a i A i q i (a i ) > 0} Most profitable deviation d i (q) = max u i(a i, q i ) a i A i\car(q i ) Smallest payoff from a set p i (W ) = min{w i, w W } A pair (q, w) is admissible with respect to (w )W if (1 δ)u i (q) + δw i (1 δ)d i (q) + δp i (W ) Each a Car(q) may follow by different continuation play Continuation payoff w = x(a)π q (a), x(a) W a Car(q)

Characterization (2) Stage game payoffs ũ δ (a) = (1 δ)u(a) + δx(a) Set of all equilibrium payoffs M(x) of stage game with ũ V is the set of subgame-perfect equilibrium payoffs Theorem V is the largest fixed point of B: W = B(W ) = where (q, w) admissible, w formed by x, and q equilibrium of stage game with payoffs x x W A M(x),

Comparison to Pure Strategies V P is the set of pure-strategy subgame-perfect equilibrium payoffs Theorem (Abreu-Pearce-Stacchetti 1986/1990) V P is the largest fixed point of B P : W = B P (W ) = (1 δ)u(a) + δw, a A w C a(w ) where C a (W ) = {w W st (a, w) admissible}

Comparison to Pure Strategies (2) Complexity of fixed-point is higher Structure of equilibria different In pure strategies, enough to have high enough continuation payoff Randomization requires exact continuation payoffs

Self-supporting sets Definition S is self-supporting set if S M(x) for x R A and x(a) S for a Car(q(s)), if player i plays an action ã i outside Car(q(s) i ) (an observable deviation), while a i Car(q(s) i ), then x i (ã i, a i ) is player i s punishment payoff if at least two players make an observable deviation, then the continuation payoff is a predetermined equilibrium payoff Strongly self-supporting if x(a) S for all a A

Self-supporting sets (2) Required continuation payoffs are within the set itself Easy way to produce (subsets of) equilibrium payoffs Theorem (Monotonicity in δ) If S is self-supporting set for δ, S is convex, ũ δ (a) = (1 δ)u(a) + δx(a) S for all a Car(q(s)), and p i (V (δ)) is not increasing in δ for all i N Then there exists a self-supporting set S S for δ > δ

Results: Prisoner s Dilemma a, a b, c c, b d, d with c > a > d > b Theorem The rectangle [d, a] [d, a] is a subset of the subgame-perfect equilibrium payoffs for [ c a δ max c d, d b ] a b

Results: Nonmonotonicity Theorem (Nonmonotonicity of payoffs) The set of subgame-perfect equilibrium payoffs are not monotone in the discount factor in the following symmetric game: 3, 3 1 10, 4 10, 10 1, 10 4, 1 10 1, 1 10, 10 10, 10 43 10, 1 10, 10 10, 1 10 10, 10 10, 10 10, 10 10, 10 1 10, 43 10 [1, 3] [1, 3] is a subset of the subgame-perfect equilibrium payoffs when δ = 1/3 but not for a higher discount factor Rectangle gets contracted and relies on outside payoff

Results: Comparison of pure, mixed and correlated Feasible payoffs V = co (v R n : q A st v = u(q)) Reasonable payoffs V (δ) = { v V, v i p i (V (δ)), i N } Critical discount factor δ M = inf { δ : V (δ ) = V (δ ), δ δ } Theorem For all δ, V P (δ) V M (δ) V C (δ) Theorem If p P (V P (δ )) = p(v (δ )) = p C (V C (δ )) for all δ min [ δ P, δ M, δ C], then it holds that δ P δ M δ C

Results: Comparison in Prisoner s Dilemma Theorem In symmetric Prisoner s Dilemma, it holds that [ δ P = δ M c b c a = a + c b d > max c d, d b ] a b when b + c < 2a, and otherwise = δ C, δ P = 2(c d) b + 3c 4d > δm = c b 2(c d) > d b c d = δc,

Conclusion Characterization of equilibria in behavior strategies Self-supporting sets offer easy way to find behavior strategies It is possible to compare equilibria under different assumptions Open problem: punishment strategies in pure and behavior strategies

That s all folks Ma Mb M Mc Md Thank you! Any questions?