The Optimality of Regret Matching

Size: px
Start display at page:

Download "The Optimality of Regret Matching"

Transcription

1 The Optimality of Regret Matching Sergiu Hart July 2008 SERGIU HART c 2008 p. 1

2 THE OPTIMALITY OF REGRET MATCHING Sergiu Hart Center for the Study of Rationality Dept of Economics Dept of Mathematics The Hebrew University of Jerusalem SERGIU HART c 2008 p. 2

3 Joint work with Elchanan Ben-Porath SERGIU HART c 2008 p. 3

4 Repeated Games In a repeated game: SERGIU HART c 2008 p. 4

5 Repeated Games In a repeated game: If the opponent plays i.i.d. (independently identically distributed) SERGIU HART c 2008 p. 4

6 Repeated Games In a repeated game: If the opponent plays i.i.d. (independently identically distributed) Then fictitious play is optimal SERGIU HART c 2008 p. 4

7 Repeated Games In a repeated game: If the opponent plays i.i.d. (independently identically distributed) Then fictitious play is optimal If the opponent is OBLIVIOUS SERGIU HART c 2008 p. 4

8 Repeated Games In a repeated game: If the opponent plays i.i.d. (independently identically distributed) Then fictitious play is optimal If the opponent is OBLIVIOUS OBLIVIOUS = does not see / know / use my actions SERGIU HART c 2008 p. 4

9 Repeated Games In a repeated game: If the opponent plays i.i.d. (independently identically distributed) Then fictitious play is optimal If the opponent is OBLIVIOUS Then? OBLIVIOUS = does not see / know / use my actions SERGIU HART c 2008 p. 4

10 Oblivious SERGIU HART c 2008 p. 5

11 Oblivious SERGIU HART c 2008 p. 5

12 Model: One-Shot Game SERGIU HART c 2008 p. 6

13 Model: One-Shot Game The one-shot game Γ = < I, J, u > SERGIU HART c 2008 p. 6

14 Model: One-Shot Game The one-shot game Γ = < I, J, u > I = (finite) set of actions of Player 1 J = (finite) set of actions of Player 2 (the "opponent") SERGIU HART c 2008 p. 6

15 Model: One-Shot Game The one-shot game Γ = < I, J, u > I = (finite) set of actions of Player 1 J = (finite) set of actions of Player 2 (the "opponent") u : I J R = the payoff function of Player 1 SERGIU HART c 2008 p. 6

16 Model: One-Shot Game The one-shot game Γ = < I, J, u > I = (finite) set of actions of Player 1 J = (finite) set of actions of Player 2 (the "opponent") u : I J R = the payoff function of Player 1 without loss of generality assume: 0 u(i, j) 1 for all i I, j J SERGIU HART c 2008 p. 6

17 Model: Repeated Game SERGIU HART c 2008 p. 7

18 Model: Repeated Game The repeated game Γ SERGIU HART c 2008 p. 7

19 Model: Repeated Game The repeated game Γ O Player 2 is restricted to OBLIVIOUS strategies η : t=0 Jt (J) SERGIU HART c 2008 p. 7

20 Model: Repeated Game The repeated game Γ O Player 2 is restricted to OBLIVIOUS strategies η : t=0 Jt (J) Player 1 is unrestricted σ : t=0 (I J)t (I) SERGIU HART c 2008 p. 7

21 Model: Repeated Game The repeated game Γ O Player 2 is restricted to OBLIVIOUS strategies η : t=0 Jt (J) Player 1 is unrestricted σ : t=0 (I J)t (I) Payoffs v t := u(i t, j t ) payoff at time t v T := (1/T) T t=1 v t average payoff up to time T SERGIU HART c 2008 p. 7

22 Model: Repeated Game The repeated game Γ O Player 2 is restricted to OBLIVIOUS strategies η : t=0 Jt (J) Player 1 is unrestricted σ : t=0 (I J)t (I) Without loss of generality: Player 1 uses "self-oblivious" strategies σ : t=0 Jt (I) SERGIU HART c 2008 p. 7

23 Dominance SERGIU HART c 2008 p. 8

24 Dominance If Player 1 has a dominant action i I in Γ then playing i forever is a dominant strategy in the repeated game SERGIU HART c 2008 p. 8

25 Dominance If Player 1 has a dominant action i I in Γ then playing i forever is a dominant strategy in the repeated game The one-shot game Γ is called essential SERGIU HART c 2008 p. 8

26 Dominance If Player 1 has a dominant action i I in Γ then playing i forever is a dominant strategy in the repeated game The one-shot game Γ is called essential if Player 1 does NOT have a dominant action: SERGIU HART c 2008 p. 8

27 Dominance If Player 1 has a dominant action i I in Γ then playing i forever is a dominant strategy in the repeated game The one-shot game Γ is called essential if Player 1 does NOT have a dominant action: for every i I there exists i I with u(i, j) < u(i, j) for some j J SERGIU HART c 2008 p. 8

28 Dominance Proposition. Let Γ be an essential game. SERGIU HART c 2008 p. 9

29 Dominance in Γ O Proposition. Let Γ be an essential game. Then every strategy of Player 1 in Γ O is weakly dominated. SERGIU HART c 2008 p. 9

30 Dominance in Γ O Proposition. Let Γ be an essential game. Then every strategy of Player 1 in Γ O is weakly dominated. That is, for every σ there exists ˆσ such that: For every oblivious strategy η of Player 2 Eˆσ,η [ v T ] E σ,η [ v T ] 1/T for all T 1 SERGIU HART c 2008 p. 9

31 Dominance in Γ O Proposition. Let Γ be an essential game. Then every strategy of Player 1 in Γ O is weakly dominated. That is, for every σ there exists ˆσ such that: For every oblivious strategy η of Player 2 Eˆσ,η [ v T ] E σ,η [ v T ] 1/T for all T 1 There exists an oblivious strategy η 0 of Player 2 and a constant γ > 0 such that Eˆσ,η0 [ v T ] > E σ,η0 [ v T ] + γ for all T 1 SERGIU HART c 2008 p. 9

32 Regret Matching SERGIU HART c 2008 p. 10

33 Regret Matching For every k K, let σ k : t=0 Jt (I) be a (self-oblivious, behavior) strategy of Player 1. SERGIU HART c 2008 p. 10

34 Regret Matching For every k K, let σ k : t=0 Jt (I) be a (self-oblivious, behavior) strategy of Player 1. Theorem. For every finite set K there exists a K-REGRET-MATCHING strategy σ σ K such that E σ,η [ v T ] E σk,η [ v T ] 3 K T for every k K, every time T 1, and every oblivious strategy η of Player 2. SERGIU HART c 2008 p. 10

35 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. SERGIU HART c 2008 p. 11

36 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: SERGIU HART c 2008 p. 11

37 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) SERGIU HART c 2008 p. 11

38 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: SERGIU HART c 2008 p. 11

39 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) (the total payoff of σ k against the realized sequence of actions of Player 2) SERGIU HART c 2008 p. 11

40 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) (the total payoff of σ k against the realized sequence of actions of Player 2) R(k) := [V (k) U] + (the REGRET of k) SERGIU HART c 2008 p. 11

41 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) (the total payoff of σ k against the realized sequence of actions of Player 2) R(k) := [V (k) U] + (the REGRET of k) ρ k := R(k)/ l K R(l) (the normalized regret of k) SERGIU HART c 2008 p. 11

42 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) R(k) := [V (k) U] + (the REGRET of k) ρ k := R(k)/ l K R(l) SERGIU HART c 2008 p. 11

43 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) R(k) := [V (k) U] + (the REGRET of k) ρ k := R(k)/ l K R(l) σ plays like σ k with probability ρ k SERGIU HART c 2008 p. 11

44 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) R(k) := [V (k) U] + (the REGRET of k) ρ k := R(k)/ l K R(l) σ plays like σ k with probability ρ k : σ (h 2 T )(i) = k K ρ k σ k (h 2 T )(i) SERGIU HART c 2008 p. 11

45 Regret Matching SERGIU HART c 2008 p. 12

46 Regret Matching Generalizes Hannan s result: SERGIU HART c 2008 p. 12

47 Regret Matching Generalizes Hannan s result: σ i = "play i forever" for each i I SERGIU HART c 2008 p. 12

48 Regret Matching Generalizes Hannan s result: σ i = "play i forever" for each i I Blackwell approachability (with changing payoffs) SERGIU HART c 2008 p. 12

49 Regret Matching SERGIU HART c 2008 p. 13

50 Regret Matching - Countable Theorem. For every COUNTABLE SET K there exists a K-REGRET-MATCHING strategy σ σ K such that SERGIU HART c 2008 p. 13

51 Regret Matching - Countable Theorem. For every COUNTABLE SET K there exists a K-REGRET-MATCHING strategy σ σ K such that ) E σ,η [ v T ] E σk,η [ v T ] O (T 1/4 for every k K, every time T 1, and every oblivious strategy η of Player 2. SERGIU HART c 2008 p. 13

52 Regret Matching - Countable SERGIU HART c 2008 p. 14

53 Regret Matching - Countable The K-REGRET-MATCHING strategy σ σ K for COUNTABLE K = 1, 2,... : SERGIU HART c 2008 p. 14

54 Regret Matching - Countable The K-REGRET-MATCHING strategy σ σ K for COUNTABLE K = 1, 2,... : For each k = 1, 2,... : BLOCK k has n k = k 3 periods SERGIU HART c 2008 p. 14

55 Regret Matching - Countable The K-REGRET-MATCHING strategy σ σ K for COUNTABLE K = 1, 2,... : For each k = 1, 2,... : BLOCK k has n k = k 3 periods During BLOCK k play the regret-matching strategy for the set {σ 1,..., σ k } SERGIU HART c 2008 p. 14

56 Regret Matching - Countable The K-REGRET-MATCHING strategy σ σ K for COUNTABLE K = 1, 2,... : For each k = 1, 2,... : BLOCK k has n k = k 3 periods During BLOCK k play the regret-matching strategy for the set {σ 1,..., σ k } Regrets are computed on the basis of the current block only SERGIU HART c 2008 p. 14

57 Regret Matching SERGIU HART c 2008 p. 15

58 Regret Matching Countable sets of strategies: SERGIU HART c 2008 p. 15

59 Regret Matching Countable sets of strategies: Automata SERGIU HART c 2008 p. 15

60 Regret Matching Countable sets of strategies: Automata Turing Machines SERGIU HART c 2008 p. 15

61 Regret Matching Countable sets of strategies: Automata Turing Machines A COMPUTABLE SET OF STRATEGIES SERGIU HART c 2008 p. 15

62 Regret Matching Countable sets of strategies: Automata Turing Machines A COMPUTABLE SET OF STRATEGIES INPUT: k h t (index of a strategy) (a history) SERGIU HART c 2008 p. 15

63 Regret Matching Countable sets of strategies: Automata Turing Machines A COMPUTABLE SET OF STRATEGIES INPUT: k h t (index of a strategy) (a history) OUTPUT: σ k (h t ) (I) SERGIU HART c 2008 p. 15

64 Regret SERGIU HART c 2008 p. 16

65 Regret: The End SERGIU HART c 2008 p. 16

Rationalizable Strategies

Rationalizable Strategies Rationalizable Strategies Carlos Hurtado Department of Economics University of Illinois at Urbana-Champaign hrtdmrt2@illinois.edu Jun 1st, 2015 C. Hurtado (UIUC - Economics) Game Theory On the Agenda 1

More information

Repeated Games. September 3, Definitions: Discounting, Individual Rationality. Finitely Repeated Games. Infinitely Repeated Games

Repeated Games. September 3, Definitions: Discounting, Individual Rationality. Finitely Repeated Games. Infinitely Repeated Games Repeated Games Frédéric KOESSLER September 3, 2007 1/ Definitions: Discounting, Individual Rationality Finitely Repeated Games Infinitely Repeated Games Automaton Representation of Strategies The One-Shot

More information

Extensive form games - contd

Extensive form games - contd Extensive form games - contd Proposition: Every finite game of perfect information Γ E has a pure-strategy SPNE. Moreover, if no player has the same payoffs in any two terminal nodes, then there is a unique

More information

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory Strategies and Nash Equilibrium A Whirlwind Tour of Game Theory (Mostly from Fudenberg & Tirole) Players choose actions, receive rewards based on their own actions and those of the other players. Example,

More information

Lecture 1: Normal Form Games: Refinements and Correlated Equilibrium

Lecture 1: Normal Form Games: Refinements and Correlated Equilibrium Lecture 1: Normal Form Games: Refinements and Correlated Equilibrium Albert Banal-Estanol April 2006 Lecture 1 2 Albert Banal-Estanol Trembling hand perfect equilibrium: Motivation, definition and examples

More information

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma Recap Last class (September 20, 2016) Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma Today (October 13, 2016) Finitely

More information

Prisoner s dilemma with T = 1

Prisoner s dilemma with T = 1 REPEATED GAMES Overview Context: players (e.g., firms) interact with each other on an ongoing basis Concepts: repeated games, grim strategies Economic principle: repetition helps enforcing otherwise unenforceable

More information

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

Game Theory. VK Room: M1.30  Last updated: October 22, 2012. Game Theory VK Room: M1.30 knightva@cf.ac.uk www.vincent-knight.com Last updated: October 22, 2012. 1 / 33 Overview Normal Form Games Pure Nash Equilibrium Mixed Nash Equilibrium 2 / 33 Normal Form Games

More information

Chapter 8. Repeated Games. Strategies and payoffs for games played twice

Chapter 8. Repeated Games. Strategies and payoffs for games played twice Chapter 8 epeated Games 1 Strategies and payoffs for games played twice Finitely repeated games Discounted utility and normalized utility Complete plans of play for 2 2 games played twice Trigger strategies

More information

Best-Reply Sets. Jonathan Weinstein Washington University in St. Louis. This version: May 2015

Best-Reply Sets. Jonathan Weinstein Washington University in St. Louis. This version: May 2015 Best-Reply Sets Jonathan Weinstein Washington University in St. Louis This version: May 2015 Introduction The best-reply correspondence of a game the mapping from beliefs over one s opponents actions to

More information

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma CS 331: Artificial Intelligence Game Theory I 1 Prisoner s Dilemma You and your partner have both been caught red handed near the scene of a burglary. Both of you have been brought to the police station,

More information

Introduction to Game Theory Lecture Note 5: Repeated Games

Introduction to Game Theory Lecture Note 5: Repeated Games Introduction to Game Theory Lecture Note 5: Repeated Games Haifeng Huang University of California, Merced Repeated games Repeated games: given a simultaneous-move game G, a repeated game of G is an extensive

More information

Game Theory. Wolfgang Frimmel. Repeated Games

Game Theory. Wolfgang Frimmel. Repeated Games Game Theory Wolfgang Frimmel Repeated Games 1 / 41 Recap: SPNE The solution concept for dynamic games with complete information is the subgame perfect Nash Equilibrium (SPNE) Selten (1965): A strategy

More information

SUCCESSIVE INFORMATION REVELATION IN 3-PLAYER INFINITELY REPEATED GAMES WITH INCOMPLETE INFORMATION ON ONE SIDE

SUCCESSIVE INFORMATION REVELATION IN 3-PLAYER INFINITELY REPEATED GAMES WITH INCOMPLETE INFORMATION ON ONE SIDE SUCCESSIVE INFORMATION REVELATION IN 3-PLAYER INFINITELY REPEATED GAMES WITH INCOMPLETE INFORMATION ON ONE SIDE JULIAN MERSCHEN Bonn Graduate School of Economics, University of Bonn Adenauerallee 24-42,

More information

Game theory and applications: Lecture 1

Game theory and applications: Lecture 1 Game theory and applications: Lecture 1 Adam Szeidl September 20, 2018 Outline for today 1 Some applications of game theory 2 Games in strategic form 3 Dominance 4 Nash equilibrium 1 / 8 1. Some applications

More information

An Approach to Bounded Rationality

An Approach to Bounded Rationality An Approach to Bounded Rationality Eli Ben-Sasson Department of Computer Science Technion Israel Institute of Technology Adam Tauman Kalai Toyota Technological Institute at Chicago Ehud Kalai Kellogg Graduate

More information

In the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S.

In the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S. In the Name of God Sharif University of Technology Graduate School of Management and Economics Microeconomics 2 44706 (1394-95 2 nd term) - Group 2 Dr. S. Farshad Fatemi Chapter 8: Simultaneous-Move Games

More information

Communication in Repeated Games with Costly Monitoring

Communication in Repeated Games with Costly Monitoring Communication in Repeated Games with Costly Monitoring Elchanan Ben-Porath 1 and Michael Kahneman January, 2002 1 The department of Economics and the Center for Rationality, the Hebrew University of Jerusalem,

More information

Introduction. Microeconomics II. Dominant Strategies. Definition (Dominant Strategies)

Introduction. Microeconomics II. Dominant Strategies. Definition (Dominant Strategies) Introduction Microeconomics II Dominance and Rationalizability Levent Koçkesen Koç University Nash equilibrium concept assumes that each player knows the other players equilibrium behavior This is problematic

More information

Microeconomic Theory II Preliminary Examination Solutions

Microeconomic Theory II Preliminary Examination Solutions Microeconomic Theory II Preliminary Examination Solutions 1. (45 points) Consider the following normal form game played by Bruce and Sheila: L Sheila R T 1, 0 3, 3 Bruce M 1, x 0, 0 B 0, 0 4, 1 (a) Suppose

More information

Finitely repeated simultaneous move game.

Finitely repeated simultaneous move game. Finitely repeated simultaneous move game. Consider a normal form game (simultaneous move game) Γ N which is played repeatedly for a finite (T )number of times. The normal form game which is played repeatedly

More information

An introduction on game theory for wireless networking [1]

An introduction on game theory for wireless networking [1] An introduction on game theory for wireless networking [1] Ning Zhang 14 May, 2012 [1] Game Theory in Wireless Networks: A Tutorial 1 Roadmap 1 Introduction 2 Static games 3 Extensive-form games 4 Summary

More information

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5 Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5 The basic idea prisoner s dilemma The prisoner s dilemma game with one-shot payoffs 2 2 0

More information

SF2972 GAME THEORY Infinite games

SF2972 GAME THEORY Infinite games SF2972 GAME THEORY Infinite games Jörgen Weibull February 2017 1 Introduction Sofar,thecoursehasbeenfocusedonfinite games: Normal-form games with a finite number of players, where each player has a finite

More information

Bounded computational capacity equilibrium

Bounded computational capacity equilibrium Available online at www.sciencedirect.com ScienceDirect Journal of Economic Theory 63 (206) 342 364 www.elsevier.com/locate/jet Bounded computational capacity equilibrium Penélope Hernández a, Eilon Solan

More information

Outline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010

Outline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010 May 19, 2010 1 Introduction Scope of Agent preferences Utility Functions 2 Game Representations Example: Game-1 Extended Form Strategic Form Equivalences 3 Reductions Best Response Domination 4 Solution

More information

On Finite Strategy Sets for Finitely Repeated Zero-Sum Games

On Finite Strategy Sets for Finitely Repeated Zero-Sum Games On Finite Strategy Sets for Finitely Repeated Zero-Sum Games Thomas C. O Connell Department of Mathematics and Computer Science Skidmore College 815 North Broadway Saratoga Springs, NY 12866 E-mail: oconnellt@acm.org

More information

ECON/MGEC 333 Game Theory And Strategy Problem Set 9 Solutions. Levent Koçkesen January 6, 2011

ECON/MGEC 333 Game Theory And Strategy Problem Set 9 Solutions. Levent Koçkesen January 6, 2011 Koç University Department of Economics ECON/MGEC 333 Game Theory And Strategy Problem Set Solutions Levent Koçkesen January 6, 2011 1. (a) Tit-For-Tat: The behavior of a player who adopts this strategy

More information

EX-POST STABILITY IN LARGE GAMES (DRAFT, COMMENTS WELCOME)

EX-POST STABILITY IN LARGE GAMES (DRAFT, COMMENTS WELCOME) EX-POST STABILITY IN LARGE GAMES (DRAFT, COMMENTS WELCOME) EHUD KALAI Abstract. The equilibria of strategic games with many semi-anonymous players have a strong ex-post Nash property. Even with perfect

More information

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015. FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.) Hints for Problem Set 3 1. Consider the following strategic

More information

Repeated Games with Perfect Monitoring

Repeated Games with Perfect Monitoring Repeated Games with Perfect Monitoring Mihai Manea MIT Repeated Games normal-form stage game G = (N, A, u) players simultaneously play game G at time t = 0, 1,... at each date t, players observe all past

More information

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts 6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts Asu Ozdaglar MIT February 9, 2010 1 Introduction Outline Review Examples of Pure Strategy Nash Equilibria

More information

REPUTATION WITH LONG RUN PLAYERS AND IMPERFECT OBSERVATION

REPUTATION WITH LONG RUN PLAYERS AND IMPERFECT OBSERVATION REPUTATION WITH LONG RUN PLAYERS AND IMPERFECT OBSERVATION ALP E. ATAKAN AND MEHMET EKMEKCI Abstract. Previous work shows that reputation results may fail in repeated games between two long-run players

More information

Advanced Micro 1 Lecture 14: Dynamic Games Equilibrium Concepts

Advanced Micro 1 Lecture 14: Dynamic Games Equilibrium Concepts Advanced Micro 1 Lecture 14: Dynamic Games quilibrium Concepts Nicolas Schutz Nicolas Schutz Dynamic Games: quilibrium Concepts 1 / 79 Plan 1 Nash equilibrium and the normal form 2 Subgame-perfect equilibrium

More information

Basic Game-Theoretic Concepts. Game in strategic form has following elements. Player set N. (Pure) strategy set for player i, S i.

Basic Game-Theoretic Concepts. Game in strategic form has following elements. Player set N. (Pure) strategy set for player i, S i. Basic Game-Theoretic Concepts Game in strategic form has following elements Player set N (Pure) strategy set for player i, S i. Payoff function f i for player i f i : S R, where S is product of S i s.

More information

Discounted Stochastic Games

Discounted Stochastic Games Discounted Stochastic Games Eilon Solan October 26, 1998 Abstract We give an alternative proof to a result of Mertens and Parthasarathy, stating that every n-player discounted stochastic game with general

More information

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4) Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4) Outline: Modeling by means of games Normal form games Dominant strategies; dominated strategies,

More information

An Approach to Bounded Rationality

An Approach to Bounded Rationality An Approach to Bounded Rationality Eli Ben-Sasson Technion Adam Tauman Kalai TTI Chicago Ehud Kalai Northwestern University July 13, 2006 Abstract A central question in game theory, learning, and other

More information

Lecture 3 Representation of Games

Lecture 3 Representation of Games ecture 3 epresentation of Games 4. Game Theory Muhamet Yildiz oad Map. Cardinal representation Expected utility theory. Quiz 3. epresentation of games in strategic and extensive forms 4. Dominance; dominant-strategy

More information

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219 Repeated Games Basic lesson of prisoner s dilemma: In one-shot interaction, individual s have incentive to behave opportunistically Leads to socially inefficient outcomes In reality; some cases of prisoner

More information

Rational Behaviour and Strategy Construction in Infinite Multiplayer Games

Rational Behaviour and Strategy Construction in Infinite Multiplayer Games Rational Behaviour and Strategy Construction in Infinite Multiplayer Games Michael Ummels ummels@logic.rwth-aachen.de FSTTCS 2006 Michael Ummels Rational Behaviour and Strategy Construction 1 / 15 Infinite

More information

PLAYING GAMES WITHOUT OBSERVING PAYOFFS

PLAYING GAMES WITHOUT OBSERVING PAYOFFS PLAYING GAMES WITHOUT OBSERVING PAYOFFS Michal Feldman Hebrew University & Microsoft Israel R&D Center Joint work with Adam Kalai and Moshe Tennenholtz FLA--BONG-DING FLA BONG DING 鲍步 爱丽丝 Y FLA Y FLA 5

More information

TTIC An Introduction to the Theory of Machine Learning. Learning and Game Theory. Avrim Blum 5/7/18, 5/9/18

TTIC An Introduction to the Theory of Machine Learning. Learning and Game Theory. Avrim Blum 5/7/18, 5/9/18 TTIC 31250 An Introduction to the Theory of Machine Learning Learning and Game Theory Avrim Blum 5/7/18, 5/9/18 Zero-sum games, Minimax Optimality & Minimax Thm; Connection to Boosting & Regret Minimization

More information

Assignment 1: Preference Relations. Decision Theory. Pareto Optimality. Game Types.

Assignment 1: Preference Relations. Decision Theory. Pareto Optimality. Game Types. Simon Fraser University Spring 2010 CMPT 882 Instructor: Oliver Schulte Assignment 1: Preference Relations. Decision Theory. Pareto Optimality. Game Types. The due date for this assignment is Wednesday,

More information

Maximal Revenue with Multiple Goods: Nonmonotonicity and Other Observations

Maximal Revenue with Multiple Goods: Nonmonotonicity and Other Observations Maximal Revenue with Multiple Goods: Nonmonotonicity and Other Observations Sergiu Hart Philip J. Reny November 21, 2013 Abstract Consider the problem of maximizing the revenue from selling a number of

More information

Solutions of Bimatrix Coalitional Games

Solutions of Bimatrix Coalitional Games Applied Mathematical Sciences, Vol. 8, 2014, no. 169, 8435-8441 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2014.410880 Solutions of Bimatrix Coalitional Games Xeniya Grigorieva St.Petersburg

More information

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies Mohammad T. Hajiaghayi University of Maryland Behavioral Strategies In imperfect-information extensive-form games, we can define

More information

Introductory Microeconomics

Introductory Microeconomics Prof. Wolfram Elsner Faculty of Business Studies and Economics iino Institute of Institutional and Innovation Economics Introductory Microeconomics More Formal Concepts of Game Theory and Evolutionary

More information

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015. FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.) Hints for Problem Set 2 1. Consider a zero-sum game, where

More information

On Existence of Equilibria. Bayesian Allocation-Mechanisms

On Existence of Equilibria. Bayesian Allocation-Mechanisms On Existence of Equilibria in Bayesian Allocation Mechanisms Northwestern University April 23, 2014 Bayesian Allocation Mechanisms In allocation mechanisms, agents choose messages. The messages determine

More information

Extensive-Form Games with Imperfect Information

Extensive-Form Games with Imperfect Information May 6, 2015 Example 2, 2 A 3, 3 C Player 1 Player 1 Up B Player 2 D 0, 0 1 0, 0 Down C Player 1 D 3, 3 Extensive-Form Games With Imperfect Information Finite No simultaneous moves: each node belongs to

More information

Chapter 7. Sampling Distributions and the Central Limit Theorem

Chapter 7. Sampling Distributions and the Central Limit Theorem Chapter 7. Sampling Distributions and the Central Limit Theorem 1 Introduction 2 Sampling Distributions related to the normal distribution 3 The central limit theorem 4 The normal approximation to binomial

More information

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2 6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2 Daron Acemoglu and Asu Ozdaglar MIT October 14, 2009 1 Introduction Outline Review Examples of Pure Strategy Nash Equilibria Mixed Strategies

More information

Identification and Estimation of Dynamic Games when Players Belief Are Not in Equilibrium

Identification and Estimation of Dynamic Games when Players Belief Are Not in Equilibrium Identification and Estimation of Dynamic Games when Players Belief Are Not in Equilibrium A Short Review of Aguirregabiria and Magesan (2010) January 25, 2012 1 / 18 Dynamics of the game Two players, {i,

More information

Blackwell Optimality in Markov Decision Processes with Partial Observation

Blackwell Optimality in Markov Decision Processes with Partial Observation Blackwell Optimality in Markov Decision Processes with Partial Observation Dinah Rosenberg and Eilon Solan and Nicolas Vieille April 6, 2000 Abstract We prove the existence of Blackwell ε-optimal strategies

More information

ECON322 Game Theory Half II

ECON322 Game Theory Half II ECON322 Game Theory Half II Part 1: Reasoning Foundations Rationality Christian W. Bach University of Liverpool & EPICENTER Agenda Introduction Rational Choice Strict Dominance Characterization of Rationality

More information

G5212: Game Theory. Mark Dean. Spring 2017

G5212: Game Theory. Mark Dean. Spring 2017 G5212: Game Theory Mark Dean Spring 2017 Bargaining We will now apply the concept of SPNE to bargaining A bit of background Bargaining is hugely interesting but complicated to model It turns out that the

More information

Can we have no Nash Equilibria? Can you have more than one Nash Equilibrium? CS 430: Artificial Intelligence Game Theory II (Nash Equilibria)

Can we have no Nash Equilibria? Can you have more than one Nash Equilibrium? CS 430: Artificial Intelligence Game Theory II (Nash Equilibria) CS 0: Artificial Intelligence Game Theory II (Nash Equilibria) ACME, a video game hardware manufacturer, has to decide whether its next game machine will use DVDs or CDs Best, a video game software producer,

More information

Advanced Microeconomics

Advanced Microeconomics Advanced Microeconomics ECON5200 - Fall 2014 Introduction What you have done: - consumers maximize their utility subject to budget constraints and firms maximize their profits given technology and market

More information

Reactive Synthesis Without Regret

Reactive Synthesis Without Regret Reactive Synthesis Without Regret (Non, rien de rien... ) Guillermo A. Pérez Highlights @ Prague September, 2015 That feeling when... Example: doing your laundry sc s G.A. Pérez (ULB) Reactive Synthesis

More information

Games of Incomplete Information

Games of Incomplete Information Games of Incomplete Information EC202 Lectures V & VI Francesco Nava London School of Economics January 2011 Nava (LSE) EC202 Lectures V & VI Jan 2011 1 / 22 Summary Games of Incomplete Information: Definitions:

More information

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem Chapter 10: Mixed strategies Nash equilibria reaction curves and the equality of payoffs theorem Nash equilibrium: The concept of Nash equilibrium can be extended in a natural manner to the mixed strategies

More information

Reduced Complexity Approaches to Asymmetric Information Games

Reduced Complexity Approaches to Asymmetric Information Games Reduced Complexity Approaches to Asymmetric Information Games Jeff Shamma and Lichun Li Georgia Institution of Technology ARO MURI Annual Review November 19, 2014 Research Thrust: Obtaining Actionable

More information

Warm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games

Warm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games Repeated Games Warm up: bargaining Suppose you and your Qatz.com partner have a falling-out. You agree set up two meetings to negotiate a way to split the value of your assets, which amount to $1 million

More information

An Adaptive Learning Model in Coordination Games

An Adaptive Learning Model in Coordination Games Department of Economics An Adaptive Learning Model in Coordination Games Department of Economics Discussion Paper 13-14 Naoki Funai An Adaptive Learning Model in Coordination Games Naoki Funai June 17,

More information

CS711: Introduction to Game Theory and Mechanism Design

CS711: Introduction to Game Theory and Mechanism Design CS711: Introduction to Game Theory and Mechanism Design Teacher: Swaprava Nath Domination, Elimination of Dominated Strategies, Nash Equilibrium Domination Normal form game N, (S i ) i N, (u i ) i N Definition

More information

Economics 171: Final Exam

Economics 171: Final Exam Question 1: Basic Concepts (20 points) Economics 171: Final Exam 1. Is it true that every strategy is either strictly dominated or is a dominant strategy? Explain. (5) No, some strategies are neither dominated

More information

CHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION

CHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION CHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION Szabolcs Sebestyén szabolcs.sebestyen@iscte.pt Master in Finance INVESTMENTS Sebestyén (ISCTE-IUL) Choice Theory Investments 1 / 65 Outline 1 An Introduction

More information

Epistemic Experiments: Utilities, Beliefs, and Irrational Play

Epistemic Experiments: Utilities, Beliefs, and Irrational Play Epistemic Experiments: Utilities, Beliefs, and Irrational Play P.J. Healy PJ Healy (OSU) Epistemics 2017 1 / 62 Motivation Question: How do people play games?? E.g.: Do people play equilibrium? If not,

More information

Stochastic Games and Bayesian Games

Stochastic Games and Bayesian Games Stochastic Games and Bayesian Games CPSC 532L Lecture 10 Stochastic Games and Bayesian Games CPSC 532L Lecture 10, Slide 1 Lecture Overview 1 Recap 2 Stochastic Games 3 Bayesian Games Stochastic Games

More information

GAME THEORY. Game theory. The odds and evens game. Two person, zero sum game. Prototype example

GAME THEORY. Game theory. The odds and evens game. Two person, zero sum game. Prototype example Game theory GAME THEORY (Hillier & Lieberman Introduction to Operations Research, 8 th edition) Mathematical theory that deals, in an formal, abstract way, with the general features of competitive situations

More information

AUCTIONEER ESTIMATES AND CREDULOUS BUYERS REVISITED. November Preliminary, comments welcome.

AUCTIONEER ESTIMATES AND CREDULOUS BUYERS REVISITED. November Preliminary, comments welcome. AUCTIONEER ESTIMATES AND CREDULOUS BUYERS REVISITED Alex Gershkov and Flavio Toxvaerd November 2004. Preliminary, comments welcome. Abstract. This paper revisits recent empirical research on buyer credulity

More information

INVESTMENT DYNAMICS IN ELECTRICITY MARKETS Alfredo Garcia, University of Virginia joint work with Ennio Stacchetti, New York University May 2007

INVESTMENT DYNAMICS IN ELECTRICITY MARKETS Alfredo Garcia, University of Virginia joint work with Ennio Stacchetti, New York University May 2007 INVESTMENT DYNAMICS IN ELECTRICITY MARKETS Alfredo Garcia, University of Virginia joint work with Ennio Stacchetti, New York University May 2007 1 MOTIVATION We study resource adequacy as an endogenous

More information

Infinitely Repeated Games

Infinitely Repeated Games February 10 Infinitely Repeated Games Recall the following theorem Theorem 72 If a game has a unique Nash equilibrium, then its finite repetition has a unique SPNE. Our intuition, however, is that long-term

More information

Game Theory Fall 2006

Game Theory Fall 2006 Game Theory Fall 2006 Answers to Problem Set 3 [1a] Omitted. [1b] Let a k be a sequence of paths that converge in the product topology to a; that is, a k (t) a(t) for each date t, as k. Let M be the maximum

More information

Multitask, Accountability, and Institutional Design

Multitask, Accountability, and Institutional Design Multitask, Accountability, and Institutional Design Scott Ashworth & Ethan Bueno de Mesquita Harris School of Public Policy Studies University of Chicago 1 / 32 Motivation Multiple executive tasks divided

More information

Decision Making in Uncertain and Changing Environments

Decision Making in Uncertain and Changing Environments Decision Making in Uncertain and Changing Environments Karl H. Schlag Andriy Zapechelnyuk June 2, 2009 Abstract We consider an agent who has to repeatedly make choices in an uncertain and changing environment,

More information

Microeconomics of Banking: Lecture 5

Microeconomics of Banking: Lecture 5 Microeconomics of Banking: Lecture 5 Prof. Ronaldo CARPIO Oct. 23, 2015 Administrative Stuff Homework 2 is due next week. Due to the change in material covered, I have decided to change the grading system

More information

Maintaining a Reputation Against a Patient Opponent 1

Maintaining a Reputation Against a Patient Opponent 1 Maintaining a Reputation Against a Patient Opponent July 3, 006 Marco Celentani Drew Fudenberg David K. Levine Wolfgang Pesendorfer ABSTRACT: We analyze reputation in a game between a patient player and

More information

arxiv: v1 [math.oc] 23 Dec 2010

arxiv: v1 [math.oc] 23 Dec 2010 ASYMPTOTIC PROPERTIES OF OPTIMAL TRAJECTORIES IN DYNAMIC PROGRAMMING SYLVAIN SORIN, XAVIER VENEL, GUILLAUME VIGERAL Abstract. We show in a dynamic programming framework that uniform convergence of the

More information

Martingales. by D. Cox December 2, 2009

Martingales. by D. Cox December 2, 2009 Martingales by D. Cox December 2, 2009 1 Stochastic Processes. Definition 1.1 Let T be an arbitrary index set. A stochastic process indexed by T is a family of random variables (X t : t T) defined on a

More information

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games University of Illinois Fall 2018 ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games Due: Tuesday, Sept. 11, at beginning of class Reading: Course notes, Sections 1.1-1.4 1. [A random

More information

Decision Making in Uncertain and Changing Environments

Decision Making in Uncertain and Changing Environments Decision Making in Uncertain and Changing Environments Karl H. Schlag Andriy Zapechelnyuk June 18, 2009 Abstract We consider an agent who has to repeatedly make choices in an uncertain and changing environment,

More information

Chapter 7. Sampling Distributions and the Central Limit Theorem

Chapter 7. Sampling Distributions and the Central Limit Theorem Chapter 7. Sampling Distributions and the Central Limit Theorem 1 Introduction 2 Sampling Distributions related to the normal distribution 3 The central limit theorem 4 The normal approximation to binomial

More information

Using the Maximin Principle

Using the Maximin Principle Using the Maximin Principle Under the maximin principle, it is easy to see that Rose should choose a, making her worst-case payoff 0. Colin s similar rationality as a player induces him to play (under

More information

Lecture 5 Leadership and Reputation

Lecture 5 Leadership and Reputation Lecture 5 Leadership and Reputation Reputations arise in situations where there is an element of repetition, and also where coordination between players is possible. One definition of leadership is that

More information

Stochastic Games and Bayesian Games

Stochastic Games and Bayesian Games Stochastic Games and Bayesian Games CPSC 532l Lecture 10 Stochastic Games and Bayesian Games CPSC 532l Lecture 10, Slide 1 Lecture Overview 1 Recap 2 Stochastic Games 3 Bayesian Games 4 Analyzing Bayesian

More information

Math 152: Applicable Mathematics and Computing

Math 152: Applicable Mathematics and Computing Math 152: Applicable Mathematics and Computing May 22, 2017 May 22, 2017 1 / 19 Bertrand Duopoly: Undifferentiated Products Game (Bertrand) Firm and Firm produce identical products. Each firm simultaneously

More information

Beliefs and Sequential Rationality

Beliefs and Sequential Rationality Beliefs and Sequential Rationality A system of beliefs µ in extensive form game Γ E is a specification of a probability µ(x) [0,1] for each decision node x in Γ E such that x H µ(x) = 1 for all information

More information

Prisoner s Dilemma. CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma. Prisoner s Dilemma. Prisoner s Dilemma.

Prisoner s Dilemma. CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma. Prisoner s Dilemma. Prisoner s Dilemma. CS 331: rtificial Intelligence Game Theory I You and your partner have both been caught red handed near the scene of a burglary. oth of you have been brought to the police station, where you are interrogated

More information

Solutions to Homework 3

Solutions to Homework 3 Solutions to Homework 3 AEC 504 - Summer 2007 Fundamentals of Economics c 2007 Alexander Barinov 1 Price Discrimination Consider a firm with MC = AC = 2, which serves two markets with demand functions

More information

The Neoclassical Growth Model

The Neoclassical Growth Model The Neoclassical Growth Model 1 Setup Three goods: Final output Capital Labour One household, with preferences β t u (c t ) (Later we will introduce preferences with respect to labour/leisure) Endowment

More information

University of Hong Kong ECON6036 Stephen Chiu. Extensive Games with Perfect Information II. Outline

University of Hong Kong ECON6036 Stephen Chiu. Extensive Games with Perfect Information II. Outline University of Hong Kong ECON6036 Stephen Chiu Extensive Games with Perfect Information II 1 Outline Interpretation of strategy Backward induction One stage deviation principle Rubinstein alternative bargaining

More information

Web Appendix: Proofs and extensions.

Web Appendix: Proofs and extensions. B eb Appendix: Proofs and extensions. B.1 Proofs of results about block correlated markets. This subsection provides proofs for Propositions A1, A2, A3 and A4, and the proof of Lemma A1. Proof of Proposition

More information

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference. 14.126 GAME THEORY MIHAI MANEA Department of Economics, MIT, 1. Existence and Continuity of Nash Equilibria Follow Muhamet s slides. We need the following result for future reference. Theorem 1. Suppose

More information

Games of Incomplete Information ( 資訊不全賽局 ) Games of Incomplete Information

Games of Incomplete Information ( 資訊不全賽局 ) Games of Incomplete Information 1 Games of Incomplete Information ( 資訊不全賽局 ) Wang 2012/12/13 (Lecture 9, Micro Theory I) Simultaneous Move Games An Example One or more players know preferences only probabilistically (cf. Harsanyi, 1976-77)

More information

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z SI 563 Homework 3 Oct 5, 06 Chapter 7 Exercise : ( points) Determine the set of rationalizable strategies for each of the following games. a) U (0,4) (4,0) M (3,3) (3,3) D (4,0) (0,4) X Y U (0,4) (4,0)

More information

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics In the Name of God Sharif University of Technology Graduate School of Management and Economics Microeconomics (for MBA students) 44111 (1393-94 1 st term) - Group 2 Dr. S. Farshad Fatemi Game Theory Game:

More information

From Discrete Time to Continuous Time Modeling

From Discrete Time to Continuous Time Modeling From Discrete Time to Continuous Time Modeling Prof. S. Jaimungal, Department of Statistics, University of Toronto 2004 Arrow-Debreu Securities 2004 Prof. S. Jaimungal 2 Consider a simple one-period economy

More information

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012 Game Theory Lecture Notes By Y. Narahari Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 22 COOPERATIVE GAME THEORY Correlated Strategies and Correlated

More information