Discounted Stochastic Games with Voluntary Transfers

Similar documents
Graduate School of Decision Sciences

DISCOUNTED STOCHASTIC GAMES WITH VOLUNTARY TRANSFERS. Sebastian Kranz. January 2012 COWLES FOUNDATION DISCUSSION PAPER NO. 1847

Repeated Games. EC202 Lectures IX & X. Francesco Nava. January London School of Economics. Nava (LSE) EC202 Lectures IX & X Jan / 16

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

Game Theory. Wolfgang Frimmel. Repeated Games

Introduction to Game Theory Lecture Note 5: Repeated Games

Repeated Games with Perfect Monitoring

Mixed-Strategy Subgame-Perfect Equilibria in Repeated Games

Repeated Games. September 3, Definitions: Discounting, Individual Rationality. Finitely Repeated Games. Infinitely Repeated Games

Relational Contracting, Repeated Negotiations, and Hold-Up

Warm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5

Stochastic Games and Bayesian Games

Player 2 L R M H a,a 7,1 5,0 T 0,5 5,3 6,6

Prisoner s dilemma with T = 1

Relational Incentive Contracts

G5212: Game Theory. Mark Dean. Spring 2017

CHAPTER 14: REPEATED PRISONER S DILEMMA

Infinitely Repeated Games

Answer Key: Problem Set 4

Stochastic Games and Bayesian Games

Outline for Dynamic Games of Complete Information

EconS 424 Strategy and Game Theory. Homework #5 Answer Key

Repeated Games. Econ 400. University of Notre Dame. Econ 400 (ND) Repeated Games 1 / 48

EconS 424 Strategy and Game Theory. Homework #5 Answer Key

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

February 23, An Application in Industrial Organization

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Economics 171: Final Exam

13.1 Infinitely Repeated Cournot Oligopoly

Game Theory. Important Instructions

ECON/MGEC 333 Game Theory And Strategy Problem Set 9 Solutions. Levent Koçkesen January 6, 2011

1 Solutions to Homework 4

M.Phil. Game theory: Problem set II. These problems are designed for discussions in the classes of Week 8 of Michaelmas term. 1

Chapter 8. Repeated Games. Strategies and payoffs for games played twice

Sequential-move games with Nature s moves.

S 2,2-1, x c C x r, 1 0,0

Repeated games. Felix Munoz-Garcia. Strategy and Game Theory - Washington State University

Simon Fraser University Spring 2014

REPEATED GAMES. MICROECONOMICS Principles and Analysis Frank Cowell. Frank Cowell: Repeated Games. Almost essential Game Theory: Dynamic.

Microeconomics I. Undergraduate Programs in Business Administration and Economics

Early PD experiments

6.6 Secret price cuts

Optimal selling rules for repeated transactions.

Competing Mechanisms with Limited Commitment

ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

Problem 3 Solutions. l 3 r, 1

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

Lecture 5 Leadership and Reputation

Not 0,4 2,1. i. Show there is a perfect Bayesian equilibrium where player A chooses to play, player A chooses L, and player B chooses L.

Appendix: Common Currencies vs. Monetary Independence

Game Theory Fall 2003

CHAPTER 15 Sequential rationality 1-1

An introduction on game theory for wireless networking [1]

Economics 431 Infinitely repeated games

Eco AS , J. Sandford, spring 2019 March 9, Midterm answers

Monetary Union with Voluntary Participation

Microeconomic Theory II Preliminary Examination Solutions Exam date: August 7, 2017

Introductory Microeconomics

IPR Protection in the High-Tech Industries: A Model of Piracy. Thierry Rayna University of Bristol

Outline for today. Stat155 Game Theory Lecture 19: Price of anarchy. Cooperative games. Price of anarchy. Price of anarchy

ECON106P: Pricing and Strategy

Econ 711 Homework 1 Solutions

Construction of Subgame-Perfect Mixed-Strategy Equilibria in Repeated Games

ECONS 424 STRATEGY AND GAME THEORY MIDTERM EXAM #2 ANSWER KEY

Econ 101A Final exam May 14, 2013.

Microeconomic Theory II Preliminary Examination Solutions

Topics in Contract Theory Lecture 1

MS&E 246: Lecture 5 Efficiency and fairness. Ramesh Johari

Renegotiation in Repeated Games with Side-Payments 1

is the best response of firm 1 to the quantity chosen by firm 2. Firm 2 s problem: Max Π 2 = q 2 (a b(q 1 + q 2 )) cq 2

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Answers to Microeconomics Prelim of August 24, In practice, firms often price their products by marking up a fixed percentage over (average)

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.

MA300.2 Game Theory 2005, LSE

PRISONER S DILEMMA. Example from P-R p. 455; also 476-7, Price-setting (Bertrand) duopoly Demand functions

Université du Maine Théorie des Jeux Yves Zenou Correction de l examen du 16 décembre 2013 (1 heure 30)

The Ohio State University Department of Economics Econ 601 Prof. James Peck Extra Practice Problems Answers (for final)

TitleNon-cooperative Bargaining for Side. Citation KIER Discussion Paper (2018), 983:

Microeconomics II. CIDE, MsC Economics. List of Problems

Finite Memory and Imperfect Monitoring

Econ 101A Final exam Th 15 December. Do not turn the page until instructed to.

IMPERFECT COMPETITION AND TRADE POLICY

CONTRACT THEORY. Patrick Bolton and Mathias Dewatripont. The MIT Press Cambridge, Massachusetts London, England

High Frequency Repeated Games with Costly Monitoring

Game Theory for Wireless Engineers Chapter 3, 4

Collusion under imperfect monitoring with asymmetric firms

EXTRA PROBLEMS. and. a b c d

Finite Memory and Imperfect Monitoring

Rational Behaviour and Strategy Construction in Infinite Multiplayer Games

1 Solutions to Homework 3

Endogenous Cartel Formation with Differentiated Products and Price Competition

Repeated Games. Debraj Ray, October 2006

Econ 711 Final Solutions

Game Theory Fall 2006

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2017

Transcription:

Discounted Stochastic Games with Voluntary Transfers Sebastian Kranz University of Cologne Slides

Discounted Stochastic Games Natural generalization of infinitely repeated games n players infinitely many periods, common discount factor δ < 1 in every period there is a state x X (finite) Stage game actions a = (a1,...,a n ) A(x) payoffs π(a,x) State can change after every period τ(x x,a): probability that new state is x This talk: Players publicly observe a and x (perfect monitoring)

Example: Cournot Model with Stochastic Reserves Two firms i = 1,2 that operate hydro-electric power plants State x = (x 1,x 2 ) {0,1,..., x} 2 amount of hydro-energy in each firm s water reservoir Firma i can sell in a period a i {0,1,...,x i } units of energy. Stage game profits: π i (a,x) = P(a 1,a 2 )a i New state depends on random rainfall: x i = x i a i + ε i

Solving stochastic games... for Markov perfect equilibria? Most applied literature: Markov perfect equilibria (MPE) actions depend only on current state x Problems with MPE Multiple MPE can exist (e.g. Besanko et. al., 2010) Set of MPE payoffs often unknown Set of MPE payoffs can be very sensitive to state space single state (infinitely repeated game): MPE = repetition of static Nash equilibrium including (almost) payoff irrelevant states, e.g. output in previous period, may allow quite collusive MPE

Solving stochastic games... for subgame perfect equilibria? Set of SPE payoffs hard to characterize Large discount factors (δ 1) & irreducible stochastic game Dutta (1995), Hörner et. al. (2011) Fixed δ: Extending algorithms for repeated games? Abreu, Pearce and Stachetti (1990), Judd, Yeltekin, Conklin (2003), Abreu & Sannikov (2011) Pareto-optimal equilibria don t have in general a simple structure

This paper Considers an economic relevant subset of stochastic games Main results: every SPE payoff can be implemented with a simple class of equilibria methods to analytically find or to compute equilibrium payoff sets

This paper Considers an economic relevant subset of stochastic games Main results: every SPE payoff can be implemented with a simple class of equilibria methods to analytically find or to compute equilibrium payoff sets Stochastic games with voluntary monetary transfers and risk-neutral players repeated games: Levin (2003), Goldluecke und Kranz (2010), Malcomson & McLeod (1989), Doornik (2006), Rayo (2007), Klimenko, Ramey and Watson (2008), Harrington and Skrzypacz (2007),... transfers implemented in several cartels via sales between firms

Structure of a period in game with transfers transfers actions new state x drawn transfers Transfers: players chooses simultaneously amount of money they want to transfer to other players no binding liquidity constraints money burning possible received net amount of money will be added to payoffs πi (a,x)

Simple strategy profiles Basic structure n + 1 phases k {e,1,...,n} equilibrium phase k = e a punishment phase k = i

Simple strategy profiles Play in phase k state x transfers actions draw new state x transfers no transfers α k (x) A(x) p k (x,x,a) exception: upfront transfers in first period

Simple strategy profiles Transition between phases phase only changes after upfront transfer or after transfer at end of period player i unilaterally deviates from his transfer punishment phase k = i no player unilaterally deviates from transfer equilibrium phase k = e punishment have a stick-and-carrot structure (similar to Abreu, 1986)

Main Result Theorem Fix a discount factor δ: Every SPE payoff can be implemented with an equilibrium in simple strategies.

Intuition Incentive compatible monetary transfers can be used for three important functions 1 Distribute joint payoffs (with upfront payments) 2 Balance incentive constraints between players 3 Fines as punishment

1. Distributing with upfront transfers u 2 blue area = equilibrium payoffs without upfront transfers u 1

1. Distributing with upfront transfers u 2 Pareto-frontier if any upfront transfers could be enforced blue area = equilibrium payoffs without upfront transfers u 1

1. Distributing with upfront transfers u 2 Punish players that deviate from upfront transfers with optimal penal codes á la Abreu (1988) blue area = equilibrium payoffs without upfront transfers u 1

1. Distributing with upfront transfers u 2 With upfront money burning can implement every payoff in the triangle (simplex) Payoff set described by n+1 numbers blue area = equilibrium payoffs without upfront transfers u 1

2. Balancing incentive constraints Repeated asymmetric Prisoners dilemma Aim: (C,C) in every period Punishment: (D,D) forever C D C 3,1-1,4 D 4,-1 0,0 Incentive constraints for subgame perfection Player 1: 3δ t = 3 t=0 1 δ 4 δ 1 4 Player 2: δ t = 1 t=0 1 δ 4 δ 3 4 Given asymmetries stationary equilibrium play may not be optimal (without transfers)

2. Balancing incentive constraints Assume pl. 1 transfers 1 unit of money every period on eq. path to pl. 2. No incentives to deviate from (C,C): Player 1: 3 1 1 δ 4 δ 1 2 1+1 Player 2: 1 δ 4 δ 1 2 Consider summed incentive constraints: 3 + 1 1 δ 4 + 4 δ 1 2 General result: if summed incentive constraints hold, one can always find transfers such that no player has incentives to deviate from individual actions or transfers

3. Fines as punishment Allow a player who deviates to avoid punishment actions by paying a fine Punishment actions only necessary if fines not paid After one period of punishment actions, remaining punishment can be settled again with a fine Optimal penal codes can be described by one action profile per state (plus transfer / fine scheme) For mixed action profiles transfer make a player indifferent between all pure actions in the support

Optimal Simple Equilibria & Algorithms Paper develops additional results for finding optimal simple equilibria that can implement every SPE payoff by varying upfront transfers. different numerical algorithms guidance to find closed-form solutions

Solving the model of Cournot Competition with Stochastic Reserves...

Prices under Collusion 20 20 18 Reserves firm 2 15 10 5 16 14 12 10 δ = 2 3 x = 20 P(a) = 20 a 1 a 2 rain: 3 or 4 units 441 states 30 sec. to solve on my notebook 0 0 5 10 15 20 Reserves firm 1 8

Principal-Agent Relationship with a Durable Product State describes whether principal has a durable product: x {0,1} π P (a,x) = x Agent can exert costly effort to build or destroy the product a { 1,0,1} π A (a,x) = c a x = min{max{x + a,0},1} Unique MPE: no transfers, a(x) = 0

Principal-Agent Relationship with a Durable Product State describes whether principal has a durable product: x {0,1} π P (a,x) = x Agent can exert costly effort to build or destroy the product a { 1,0,1} π A (a,x) = c a x = min{max{x + a,0},1} Unique MPE: no transfers, a(x) = 0 grim-trigger-style equilibria: after deviation play MPE forever

Principal-Agent Relationship with a Durable Product State describes whether principal has a durable product: x {0,1} π P (a,x) = x Agent can exert costly effort to build or destroy the product a { 1,0,1} π A (a,x) = c a x = min{max{x + a,0},1} Unique MPE: no transfers, a(x) = 0 grim-trigger-style equilibria: after deviation play MPE forever only zero effort can be implemented

Principal-Agent Relationship with a Durable Product State describes whether principal has a durable product: x {0,1} π P (a,x) = x Agent can exert costly effort to construction or destroy the product a { 1,0,1} x = min{max{x + a,0},1} π A (a,x) = c a Unique MPE: a(x) = 0, no transfers Optimal simple equilibria: construction in equilibrium phase and destruction as punishment whenever (1 δ 2 )c δ 2.

Summary Allow transfers & assume risk-neutrality in discounted stochastic game Every SPE payoff can be implemented with simple equilibria Algorithms to solve for equilibrium payoff sets Results extend to imperfect monitoring of actions

Useful results to find optimal simple equilibria Perfect monitoring, finite action space, set of pure strategy SPE There are optimal transfers for given actions (a k (x)) k,x Computing joint equilibrium payoffs and punishment payoffs U(x a e ) = (1 δ)π(a e (x),x) +δe[u(x a e ) a e (x),x] v i (x a i ) = max ai A(x){(1 δ)π i (a,x) +δe[v i (x a i ) a i,a i i(x),x]} a k (x) can be implemented if and only if n i=1 (1 δ)π(a k (x),x) + δe[u(x a e ) a k (x),x] max {(1 δ)π i(a i,a i(x),x) k + δe[v i (x a i ) a i,a i(x),x]} k a i A(x)

Basic idea of one algorithm Assume in round r all action profiles in A r (x) A(x) can be implemented In round r = 0 all action profiles can be implemented Let U(x A r ) = max a e A r U(x ae ) Markov decision process v i (x A r ) = min a i A r v i(x a i ) Nested Markov decision process Let A r+1 (x) be all profiles that survive joint incentive constraints given U(. A r ) and v i (x A r ) Stop once A r = A r+1

Public Correlation and Non-Optimality of Stationary Equilibrium paths Stage game: A B A 0,0-1,3 B 3,-1 0,0 Mix between (A,B) and (B,A) δ 1 2 Alternate {(A,B),(B,A),(A,B),...} δ 1 3