SI Game Theory, Fall 2008

Similar documents
In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

Game Theory. Wolfgang Frimmel. Repeated Games

Introduction to Game Theory Lecture Note 5: Repeated Games

Warm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games

Repeated Games. September 3, Definitions: Discounting, Individual Rationality. Finitely Repeated Games. Infinitely Repeated Games

Early PD experiments

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

CHAPTER 14: REPEATED PRISONER S DILEMMA

G5212: Game Theory. Mark Dean. Spring 2017

Copyright 2008, Yan Chen

February 23, An Application in Industrial Organization

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5

Repeated games. Felix Munoz-Garcia. Strategy and Game Theory - Washington State University

Prisoner s dilemma with T = 1

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

LECTURE 4: MULTIAGENT INTERACTIONS

Infinitely Repeated Games

Chapter 8. Repeated Games. Strategies and payoffs for games played twice

Repeated Games. Econ 400. University of Notre Dame. Econ 400 (ND) Repeated Games 1 / 48

IPR Protection in the High-Tech Industries: A Model of Piracy. Thierry Rayna University of Bristol

Introductory Microeconomics

Stochastic Games and Bayesian Games

Repeated Games. EC202 Lectures IX & X. Francesco Nava. January London School of Economics. Nava (LSE) EC202 Lectures IX & X Jan / 16

Lecture 5 Leadership and Reputation

Problem 3 Solutions. l 3 r, 1

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

Stochastic Games and Bayesian Games

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

Repeated Games with Perfect Monitoring

REPEATED GAMES. MICROECONOMICS Principles and Analysis Frank Cowell. Frank Cowell: Repeated Games. Almost essential Game Theory: Dynamic.

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

M.Phil. Game theory: Problem set II. These problems are designed for discussions in the classes of Week 8 of Michaelmas term. 1

Answer Key: Problem Set 4

Player 2 L R M H a,a 7,1 5,0 T 0,5 5,3 6,6

The Limits of Reciprocal Altruism

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

1 Solutions to Homework 4

Economics 431 Infinitely repeated games

Spring 2017 Final Exam

Economics and Computation

6 Dynamic Games with Incomplete Information

ECON/MGEC 333 Game Theory And Strategy Problem Set 9 Solutions. Levent Koçkesen January 6, 2011

Game theory and applications: Lecture 1

Economics 171: Final Exam

Iterated Dominance and Nash Equilibrium

Microeconomics of Banking: Lecture 5

Name. Answers Discussion Final Exam, Econ 171, March, 2012

ECO303: Intermediate Microeconomic Theory Benjamin Balak, Spring 2008

Module 3: "Dynamic games of complete information" Lecture 23: "Bertrand Paradox in an Infinitely repeated game" The Lecture Contains:

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

Finitely repeated simultaneous move game.

Eco AS , J. Sandford, spring 2019 March 9, Midterm answers

Noncooperative Oligopoly

CUR 412: Game Theory and its Applications, Lecture 12

Game Theory: Additional Exercises

SI Game Theory, Fall 2008

PRISONER S DILEMMA. Example from P-R p. 455; also 476-7, Price-setting (Bertrand) duopoly Demand functions

ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

Game Theory. Important Instructions

EconS 424 Strategy and Game Theory. Homework #5 Answer Key

Competition and Regulation. Lecture 4 Collusion

Topics in Contract Theory Lecture 1

EC 202. Lecture notes 14 Oligopoly I. George Symeonidis

Sequential-move games with Nature s moves.

Extensive-Form Games with Imperfect Information

Agenda. Game Theory Matrix Form of a Game Dominant Strategy and Dominated Strategy Nash Equilibrium Game Trees Subgame Perfection

Simon Fraser University Spring 2014

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

is the best response of firm 1 to the quantity chosen by firm 2. Firm 2 s problem: Max Π 2 = q 2 (a b(q 1 + q 2 )) cq 2

Algorithms and Networking for Computer Games

CUR 412: Game Theory and its Applications, Lecture 9

An introduction on game theory for wireless networking [1]

preferences of the individual players over these possible outcomes, typically measured by a utility or payoff function.

Outline for Dynamic Games of Complete Information

Econ 711 Homework 1 Solutions

Game Theory (part-2) Week 8, Lecture 8

Introduction to Game Theory

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies

Lecture 6 Dynamic games with imperfect information

13.1 Infinitely Repeated Cournot Oligopoly

Not 0,4 2,1. i. Show there is a perfect Bayesian equilibrium where player A chooses to play, player A chooses L, and player B chooses L.

Advanced Microeconomics

Repeated, Stochastic and Bayesian Games

EconS 424 Strategy and Game Theory. Homework #5 Answer Key

Notes for Section: Week 4

The Ohio State University Department of Economics Econ 601 Prof. James Peck Extra Practice Problems Answers (for final)

MA300.2 Game Theory 2005, LSE

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

Multiagent Systems. Multiagent Systems General setting Division of Resources Task Allocation Resource Allocation. 13.

Introduction to Game Theory

Today. Applications of NE and SPNE Auctions English Auction Second-Price Sealed-Bid Auction First-Price Sealed-Bid Auction

1 Solutions to Homework 3

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

CMPSCI 240: Reasoning about Uncertainty

Repeated Games. Debraj Ray, October 2006

Economic Management Strategy: Hwrk 1. 1 Simultaneous-Move Game Theory Questions.

Mixed-Strategy Subgame-Perfect Equilibria in Repeated Games

Overuse of a Common Resource: A Two-player Example

Microeconomic Theory II Preliminary Examination Solutions Exam date: June 5, 2017

GAME THEORY: DYNAMIC. MICROECONOMICS Principles and Analysis Frank Cowell. Frank Cowell: Dynamic Game Theory

Transcription:

University of Michigan Deep Blue deepblue.lib.umich.edu 2008-09 SI 563 - Game Theory, Fall 2008 Chen, Yan Chen, Y. (2008, November 12). Game Theory. Retrieved from Open.Michigan - Educational Resources Web site: https://open.umich.edu/education/si/si563-fall2008. <http://hdl.handle.net/2027.42/64940> http://hdl.handle.net/2027.42/64940

Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution Non-Commercial 3.0 License. http://creativecommons.org/licenses/by-nc/3.0/ Copyright 2008, Yan Chen You assume all responsibility for use and potential liability associated with any use of the material. Material contains copyrighted content, used in accordance with U.S. law. Copyright holders of content included in this material should contact open.michigan@umich.edu with any questions, corrections, or clarifications regarding the use of content. The Regents of the University of Michigan do not license the use of third party content posted to this site unless such a license is specifically granted in connection with particular content objects. Users of content are responsible for their compliance with applicable law. Mention of specific products in this recording solely represents the opinion of the speaker and does not represent an endorsement by the University of Michigan. For more information about how to cite these materials visit http://michigan.educommons.net/about/terms-of-use 1

SI 563 Lecture 5 Repeated Games and Reputation Professor Yan Chen Fall 2008 Some material in this lecture drawn from http://gametheory.net/lectures/level.pl 2

Finitely repeated games Infinitely repeated games Folk Theorems» Minmax» Nash-threat Fun project: ad auction (Next Class) 3

(Watson Chapter 22) 4

Empirical observations People often interact in ongoing relationships Your behavior today might influence actions of others in the future New dimension: time Questions What if interaction is repeated? What strategies can lead players to cooperate? 5

Repeated game: played over discrete periods of time (period 1, period 2, and so on) t: any given period T: total number of periods In each period, players play a static stage game History of play: sequence of action profiles 6

1 2 X Y Z A 4,3 0,0 1,4 B 0,0 2,1 0,0 Stage game, repeated once (T = 2) Stage game NE: (A, Z), (B, Y) 7

1 2 X Y Z A 5,7 1,4 2,8 B 1,4 3,5 1,4 The subgame following (A,Z), with payoffs (1, 4) 8

v 2 8 7 6 5 4 3 2 1 1 2 3 4 5 6 7 8 v 1 All possible repeated game payoffs: larger set 9

v 2 Select stage game NE: (A, Z), (B, Y) with payoffs (1, 4), (2, 1) 8 7 6 5 4 3 2 1 Select (A, X) in period 1, then (1) If 2 not deviate from X, select (A, Z); (2) Otherwise, play (B, Y) in period 2. 1 2 3 4 5 6 7 8 Result: Any sequence of stage Nash profiles can be supported as the outcome of a SPNE. And there are more SPNE! v 1 10

Reputational equilibrium: Nonstage Nash profile in 1 st period Stage Nash profile in 2 nd period 2 nd period actions contingent on outcome in first period (whether players cheat or not) Example: Select (A, X) in 1 st period If player 2 chooses X in 1 st period, select (A, Z) in 2 nd period If player 2 chooses Y or Z in 1 st period, select (B, Y) in 2 nd period 11

Discounting (δ): future payoffs not as valuable as current payoffs A fixed known chance of game s ending after each round, p Interest rate, r δ =1-p=1/(1+r) 12

Discounting: Present-day value of future profits is less than value of current profits r is the interest rate Invest $1 today get $(1+r) next year Want $1 next year invest $1/(1+r) today Annuity paying $1 today and $1 every year has a net present value of $ 1+1/r 13

1+ or: 1 (1+ r) + 1 (1 + r) 2 + 1 (1 + r) 3 + 1 (1 + r) 4 +... = 1 + 1 r Why? 14

Equilibrium: $54 K Firm 1 Firm 2 Low High Low 54, 54 72, 47 High 47, 72 60, 60 Cooperation: $60 K Source: Mike Shor, gametheory.net 15

Private rationality collective irrationality» The equilibrium that arises from using dominant strategies is worse for every player than the outcome that would arise if every player used her dominated strategy instead Goal:» To sustain mutually beneficial cooperative outcome overcoming incentives to cheat Source: Mike Shor, gametheory.net 16

Source: Mike Shor, gametheory.net Why does the dilemma occur? Interaction» No fear of punishment» Short term or myopic play Firms:» Lack of monopoly power» Homogeneity in products and costs» Overcapacity» Incentives for profit or market share Consumers» Price sensitive» Price aware» Low switching costs 17

Interaction No fear of punishment» Exploit repeated play Short term or myopic play» Introduce repeated encounters» Introduce uncertainty Source: Mike Shor, gametheory.net 18

No last period, so no backward induction Use history-dependent strategies Trigger strategies:» Begin by cooperating» Cooperate as long as the rivals do» Upon observing a defection: immediately revert to a period of punishment of specified length in which everyone plays non-cooperatively 19

Grim trigger strategy Cooperate until a rival deviates Once a deviation occurs, play non-cooperatively for the rest of the game Tit-for-tat Cooperate if your rival cooperated in the most recent period Cheat if your rival cheated in the most recent period 20

Tit-for-Tat is most forgiving shortest memory proportional credible but lacks deterrence Tit-for-tat answers: Is cooperation easy? Grim trigger is least forgiving longest memory adequate deterrence but lacks credibility Grim trigger answers: Is cooperation possible? 21

Cooperate if the present value of cooperation is greater than the present value of defection Firm 2 Firm 1 Low High Low 54, 54 72, 47 High 47, 72 60, 60 Cooperate: 60 today, 60 next year, 60 60 Defect: 72 today, 54 next year, 54 54 22

profit 72 60 cooperate 54 defect t t+1 t+2 t+3 time 23

Cooperate if PV(cooperation) 60 60 60 60 60/(1- δ) 18 δ δ > > > > > PV(defection) 72 54 54 54 72+ 54 δ/(1- δ) 12 2/3 Cooperation is sustainable using grim trigger strategies as long as δ > 2/3 24

profit 72 60 54 47 cooperate defect once defect t t+1 t+2 t+3 time 25

Cooperate if PV(cooperation) PV(cooperation) 60 60 60 60 60+ 60 δ 13 δ δ > and > > > > > PV(defection) PV(defect once) 72 47 60 60 72 + 47 δ 12 12/13 Much harder to sustain than grim trigger Cooperation may not be likely 26

Grim Trigger and Tit-for-Tat are extremes Balance two goals: Deterrence» GTS is adequate punishment» Tit-for-tat might be too little Credibility» GTS hurts the punisher too much» Tit-for-tat is credible 27

R. Axelrod, The Evolution of Cooperation Prisoner s Dilemma repeated 200 times Game theorists submitted strategies Pairs of strategies competed Winner: Tit-for-Tat Reasons:» Forgiving, Nice, Provocable, Clear 28

Not necessarily tit-for-tat» Doesn t always work Don t be envious Don t be the first to cheat Reciprocate opponent s behavior» Cooperation and defection Don t be too clever 29

Cooperation» Struggle between high profits today and a lasting relationship into the future Deterrence» A clear, provocable policy of punishment Credibility» Must incorporate forgiveness Looking ahead:» How to be credible? 30

2 1 C D C D 4,4-2,6 6,-2 0,0 31

Conditions under which cooperation can be sustained: We check whether Grim Trigger can form a SPNE: Suppose j plays GT. If i also plays GT, her payoff is If i defects, she gets 6 in period of defection, and 0 afterwards. Player i has an incentive to cooperate if 32

Players alternate between (C, C) and (D, C) over time, starting with (C, C) If either or both deviates from the alternating strategy, both will revert to the stage Nash profile, (D, D) Can MGT be supported as a SPNE? 33

Suppose 2 plays MGT. If 1 also plays MGT, 1 s payoff is PV 1 If 2 plays MGT, 2 s payoff is PV 2 34

(1) If 2 defects in an odd-numbered period, her payoff is 6 in this round, and 0 after: 2 has no incentive to deviate in any odd-numbered period, if (2) If 2 defects in an even-numbered period, her payoff is 0 in this round, and 0 after: 2 has no incentive to deviate in any even-numbered period, if Therefore, MGT can be supported as SPNE if δ 0.77. 35

Depending on the discount factor, there are many SPNE in the repeated PD (D, D) in every period (GT, GT) (TFT, TFT) etc. 36

v 2 Any payoff inside or on the edges of the diamond can be obtained as an average payoff if players choose the right sequence of actions over time. (C, D) 6 4 (C,C) 2-2 (D, D) 2 4 6 v 1-2 (D, C) 37

v 2 6 Any point on the edges or interior of the shaded area can be supported as an equilibrium average per-period payoff, as long as the players are patient enough 4 2-2 2 4 6 v 1-2 38

The Nash-threat Folk Theorem: For repeated games with stage game G, for any feasible payoffs (M) greater than or equal to the Nash equilibrium payoffs, and for sufficiently large discount factor, there is a SPNE that has payoffs M. 39

Governing the Commons The Evolution of Institutions for Collective Action by Elinor Ostrom International trade agreements ebay s reputation system (Check out Chapter 23) 40

Finitely repeated games Infinitely repeated games Folk theorems Next week: Games with Incomplete Information Fun exercise: ad words auction 41

Chapter 22: #1, 2, 3, 5 42