The Optimality of Regret Matching
|
|
- Blaze Berry
- 5 years ago
- Views:
Transcription
1 The Optimality of Regret Matching Sergiu Hart July 2008 SERGIU HART c 2008 p. 1
2 THE OPTIMALITY OF REGRET MATCHING Sergiu Hart Center for the Study of Rationality Dept of Economics Dept of Mathematics The Hebrew University of Jerusalem SERGIU HART c 2008 p. 2
3 Joint work with Elchanan Ben-Porath SERGIU HART c 2008 p. 3
4 Repeated Games In a repeated game: SERGIU HART c 2008 p. 4
5 Repeated Games In a repeated game: If the opponent plays i.i.d. (independently identically distributed) SERGIU HART c 2008 p. 4
6 Repeated Games In a repeated game: If the opponent plays i.i.d. (independently identically distributed) Then fictitious play is optimal SERGIU HART c 2008 p. 4
7 Repeated Games In a repeated game: If the opponent plays i.i.d. (independently identically distributed) Then fictitious play is optimal If the opponent is OBLIVIOUS SERGIU HART c 2008 p. 4
8 Repeated Games In a repeated game: If the opponent plays i.i.d. (independently identically distributed) Then fictitious play is optimal If the opponent is OBLIVIOUS OBLIVIOUS = does not see / know / use my actions SERGIU HART c 2008 p. 4
9 Repeated Games In a repeated game: If the opponent plays i.i.d. (independently identically distributed) Then fictitious play is optimal If the opponent is OBLIVIOUS Then? OBLIVIOUS = does not see / know / use my actions SERGIU HART c 2008 p. 4
10 Oblivious SERGIU HART c 2008 p. 5
11 Oblivious SERGIU HART c 2008 p. 5
12 Model: One-Shot Game SERGIU HART c 2008 p. 6
13 Model: One-Shot Game The one-shot game Γ = < I, J, u > SERGIU HART c 2008 p. 6
14 Model: One-Shot Game The one-shot game Γ = < I, J, u > I = (finite) set of actions of Player 1 J = (finite) set of actions of Player 2 (the "opponent") SERGIU HART c 2008 p. 6
15 Model: One-Shot Game The one-shot game Γ = < I, J, u > I = (finite) set of actions of Player 1 J = (finite) set of actions of Player 2 (the "opponent") u : I J R = the payoff function of Player 1 SERGIU HART c 2008 p. 6
16 Model: One-Shot Game The one-shot game Γ = < I, J, u > I = (finite) set of actions of Player 1 J = (finite) set of actions of Player 2 (the "opponent") u : I J R = the payoff function of Player 1 without loss of generality assume: 0 u(i, j) 1 for all i I, j J SERGIU HART c 2008 p. 6
17 Model: Repeated Game SERGIU HART c 2008 p. 7
18 Model: Repeated Game The repeated game Γ SERGIU HART c 2008 p. 7
19 Model: Repeated Game The repeated game Γ O Player 2 is restricted to OBLIVIOUS strategies η : t=0 Jt (J) SERGIU HART c 2008 p. 7
20 Model: Repeated Game The repeated game Γ O Player 2 is restricted to OBLIVIOUS strategies η : t=0 Jt (J) Player 1 is unrestricted σ : t=0 (I J)t (I) SERGIU HART c 2008 p. 7
21 Model: Repeated Game The repeated game Γ O Player 2 is restricted to OBLIVIOUS strategies η : t=0 Jt (J) Player 1 is unrestricted σ : t=0 (I J)t (I) Payoffs v t := u(i t, j t ) payoff at time t v T := (1/T) T t=1 v t average payoff up to time T SERGIU HART c 2008 p. 7
22 Model: Repeated Game The repeated game Γ O Player 2 is restricted to OBLIVIOUS strategies η : t=0 Jt (J) Player 1 is unrestricted σ : t=0 (I J)t (I) Without loss of generality: Player 1 uses "self-oblivious" strategies σ : t=0 Jt (I) SERGIU HART c 2008 p. 7
23 Dominance SERGIU HART c 2008 p. 8
24 Dominance If Player 1 has a dominant action i I in Γ then playing i forever is a dominant strategy in the repeated game SERGIU HART c 2008 p. 8
25 Dominance If Player 1 has a dominant action i I in Γ then playing i forever is a dominant strategy in the repeated game The one-shot game Γ is called essential SERGIU HART c 2008 p. 8
26 Dominance If Player 1 has a dominant action i I in Γ then playing i forever is a dominant strategy in the repeated game The one-shot game Γ is called essential if Player 1 does NOT have a dominant action: SERGIU HART c 2008 p. 8
27 Dominance If Player 1 has a dominant action i I in Γ then playing i forever is a dominant strategy in the repeated game The one-shot game Γ is called essential if Player 1 does NOT have a dominant action: for every i I there exists i I with u(i, j) < u(i, j) for some j J SERGIU HART c 2008 p. 8
28 Dominance Proposition. Let Γ be an essential game. SERGIU HART c 2008 p. 9
29 Dominance in Γ O Proposition. Let Γ be an essential game. Then every strategy of Player 1 in Γ O is weakly dominated. SERGIU HART c 2008 p. 9
30 Dominance in Γ O Proposition. Let Γ be an essential game. Then every strategy of Player 1 in Γ O is weakly dominated. That is, for every σ there exists ˆσ such that: For every oblivious strategy η of Player 2 Eˆσ,η [ v T ] E σ,η [ v T ] 1/T for all T 1 SERGIU HART c 2008 p. 9
31 Dominance in Γ O Proposition. Let Γ be an essential game. Then every strategy of Player 1 in Γ O is weakly dominated. That is, for every σ there exists ˆσ such that: For every oblivious strategy η of Player 2 Eˆσ,η [ v T ] E σ,η [ v T ] 1/T for all T 1 There exists an oblivious strategy η 0 of Player 2 and a constant γ > 0 such that Eˆσ,η0 [ v T ] > E σ,η0 [ v T ] + γ for all T 1 SERGIU HART c 2008 p. 9
32 Regret Matching SERGIU HART c 2008 p. 10
33 Regret Matching For every k K, let σ k : t=0 Jt (I) be a (self-oblivious, behavior) strategy of Player 1. SERGIU HART c 2008 p. 10
34 Regret Matching For every k K, let σ k : t=0 Jt (I) be a (self-oblivious, behavior) strategy of Player 1. Theorem. For every finite set K there exists a K-REGRET-MATCHING strategy σ σ K such that E σ,η [ v T ] E σk,η [ v T ] 3 K T for every k K, every time T 1, and every oblivious strategy η of Player 2. SERGIU HART c 2008 p. 10
35 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. SERGIU HART c 2008 p. 11
36 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: SERGIU HART c 2008 p. 11
37 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) SERGIU HART c 2008 p. 11
38 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: SERGIU HART c 2008 p. 11
39 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) (the total payoff of σ k against the realized sequence of actions of Player 2) SERGIU HART c 2008 p. 11
40 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) (the total payoff of σ k against the realized sequence of actions of Player 2) R(k) := [V (k) U] + (the REGRET of k) SERGIU HART c 2008 p. 11
41 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) (the total payoff of σ k against the realized sequence of actions of Player 2) R(k) := [V (k) U] + (the REGRET of k) ρ k := R(k)/ l K R(l) (the normalized regret of k) SERGIU HART c 2008 p. 11
42 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) R(k) := [V (k) U] + (the REGRET of k) ρ k := R(k)/ l K R(l) SERGIU HART c 2008 p. 11
43 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) R(k) := [V (k) U] + (the REGRET of k) ρ k := R(k)/ l K R(l) σ plays like σ k with probability ρ k SERGIU HART c 2008 p. 11
44 Regret Matching The K-REGRET-MATCHING strategy σ σ K is defined as follows. At time T + 1: U := T t=1 v t (realized total payoff) For each k K: V (k) := T t=1 u(σ k(h 2 t 1 ), j t) R(k) := [V (k) U] + (the REGRET of k) ρ k := R(k)/ l K R(l) σ plays like σ k with probability ρ k : σ (h 2 T )(i) = k K ρ k σ k (h 2 T )(i) SERGIU HART c 2008 p. 11
45 Regret Matching SERGIU HART c 2008 p. 12
46 Regret Matching Generalizes Hannan s result: SERGIU HART c 2008 p. 12
47 Regret Matching Generalizes Hannan s result: σ i = "play i forever" for each i I SERGIU HART c 2008 p. 12
48 Regret Matching Generalizes Hannan s result: σ i = "play i forever" for each i I Blackwell approachability (with changing payoffs) SERGIU HART c 2008 p. 12
49 Regret Matching SERGIU HART c 2008 p. 13
50 Regret Matching - Countable Theorem. For every COUNTABLE SET K there exists a K-REGRET-MATCHING strategy σ σ K such that SERGIU HART c 2008 p. 13
51 Regret Matching - Countable Theorem. For every COUNTABLE SET K there exists a K-REGRET-MATCHING strategy σ σ K such that ) E σ,η [ v T ] E σk,η [ v T ] O (T 1/4 for every k K, every time T 1, and every oblivious strategy η of Player 2. SERGIU HART c 2008 p. 13
52 Regret Matching - Countable SERGIU HART c 2008 p. 14
53 Regret Matching - Countable The K-REGRET-MATCHING strategy σ σ K for COUNTABLE K = 1, 2,... : SERGIU HART c 2008 p. 14
54 Regret Matching - Countable The K-REGRET-MATCHING strategy σ σ K for COUNTABLE K = 1, 2,... : For each k = 1, 2,... : BLOCK k has n k = k 3 periods SERGIU HART c 2008 p. 14
55 Regret Matching - Countable The K-REGRET-MATCHING strategy σ σ K for COUNTABLE K = 1, 2,... : For each k = 1, 2,... : BLOCK k has n k = k 3 periods During BLOCK k play the regret-matching strategy for the set {σ 1,..., σ k } SERGIU HART c 2008 p. 14
56 Regret Matching - Countable The K-REGRET-MATCHING strategy σ σ K for COUNTABLE K = 1, 2,... : For each k = 1, 2,... : BLOCK k has n k = k 3 periods During BLOCK k play the regret-matching strategy for the set {σ 1,..., σ k } Regrets are computed on the basis of the current block only SERGIU HART c 2008 p. 14
57 Regret Matching SERGIU HART c 2008 p. 15
58 Regret Matching Countable sets of strategies: SERGIU HART c 2008 p. 15
59 Regret Matching Countable sets of strategies: Automata SERGIU HART c 2008 p. 15
60 Regret Matching Countable sets of strategies: Automata Turing Machines SERGIU HART c 2008 p. 15
61 Regret Matching Countable sets of strategies: Automata Turing Machines A COMPUTABLE SET OF STRATEGIES SERGIU HART c 2008 p. 15
62 Regret Matching Countable sets of strategies: Automata Turing Machines A COMPUTABLE SET OF STRATEGIES INPUT: k h t (index of a strategy) (a history) SERGIU HART c 2008 p. 15
63 Regret Matching Countable sets of strategies: Automata Turing Machines A COMPUTABLE SET OF STRATEGIES INPUT: k h t (index of a strategy) (a history) OUTPUT: σ k (h t ) (I) SERGIU HART c 2008 p. 15
64 Regret SERGIU HART c 2008 p. 16
65 Regret: The End SERGIU HART c 2008 p. 16
Rationalizable Strategies
Rationalizable Strategies Carlos Hurtado Department of Economics University of Illinois at Urbana-Champaign hrtdmrt2@illinois.edu Jun 1st, 2015 C. Hurtado (UIUC - Economics) Game Theory On the Agenda 1
More informationRepeated Games. September 3, Definitions: Discounting, Individual Rationality. Finitely Repeated Games. Infinitely Repeated Games
Repeated Games Frédéric KOESSLER September 3, 2007 1/ Definitions: Discounting, Individual Rationality Finitely Repeated Games Infinitely Repeated Games Automaton Representation of Strategies The One-Shot
More informationExtensive form games - contd
Extensive form games - contd Proposition: Every finite game of perfect information Γ E has a pure-strategy SPNE. Moreover, if no player has the same payoffs in any two terminal nodes, then there is a unique
More informationStrategies and Nash Equilibrium. A Whirlwind Tour of Game Theory
Strategies and Nash Equilibrium A Whirlwind Tour of Game Theory (Mostly from Fudenberg & Tirole) Players choose actions, receive rewards based on their own actions and those of the other players. Example,
More informationLecture 1: Normal Form Games: Refinements and Correlated Equilibrium
Lecture 1: Normal Form Games: Refinements and Correlated Equilibrium Albert Banal-Estanol April 2006 Lecture 1 2 Albert Banal-Estanol Trembling hand perfect equilibrium: Motivation, definition and examples
More informationDuopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma
Recap Last class (September 20, 2016) Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma Today (October 13, 2016) Finitely
More informationPrisoner s dilemma with T = 1
REPEATED GAMES Overview Context: players (e.g., firms) interact with each other on an ongoing basis Concepts: repeated games, grim strategies Economic principle: repetition helps enforcing otherwise unenforceable
More informationGame Theory. VK Room: M1.30 Last updated: October 22, 2012.
Game Theory VK Room: M1.30 knightva@cf.ac.uk www.vincent-knight.com Last updated: October 22, 2012. 1 / 33 Overview Normal Form Games Pure Nash Equilibrium Mixed Nash Equilibrium 2 / 33 Normal Form Games
More informationChapter 8. Repeated Games. Strategies and payoffs for games played twice
Chapter 8 epeated Games 1 Strategies and payoffs for games played twice Finitely repeated games Discounted utility and normalized utility Complete plans of play for 2 2 games played twice Trigger strategies
More informationBest-Reply Sets. Jonathan Weinstein Washington University in St. Louis. This version: May 2015
Best-Reply Sets Jonathan Weinstein Washington University in St. Louis This version: May 2015 Introduction The best-reply correspondence of a game the mapping from beliefs over one s opponents actions to
More informationCS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma
CS 331: Artificial Intelligence Game Theory I 1 Prisoner s Dilemma You and your partner have both been caught red handed near the scene of a burglary. Both of you have been brought to the police station,
More informationIntroduction to Game Theory Lecture Note 5: Repeated Games
Introduction to Game Theory Lecture Note 5: Repeated Games Haifeng Huang University of California, Merced Repeated games Repeated games: given a simultaneous-move game G, a repeated game of G is an extensive
More informationGame Theory. Wolfgang Frimmel. Repeated Games
Game Theory Wolfgang Frimmel Repeated Games 1 / 41 Recap: SPNE The solution concept for dynamic games with complete information is the subgame perfect Nash Equilibrium (SPNE) Selten (1965): A strategy
More informationSUCCESSIVE INFORMATION REVELATION IN 3-PLAYER INFINITELY REPEATED GAMES WITH INCOMPLETE INFORMATION ON ONE SIDE
SUCCESSIVE INFORMATION REVELATION IN 3-PLAYER INFINITELY REPEATED GAMES WITH INCOMPLETE INFORMATION ON ONE SIDE JULIAN MERSCHEN Bonn Graduate School of Economics, University of Bonn Adenauerallee 24-42,
More informationGame theory and applications: Lecture 1
Game theory and applications: Lecture 1 Adam Szeidl September 20, 2018 Outline for today 1 Some applications of game theory 2 Games in strategic form 3 Dominance 4 Nash equilibrium 1 / 8 1. Some applications
More informationAn Approach to Bounded Rationality
An Approach to Bounded Rationality Eli Ben-Sasson Department of Computer Science Technion Israel Institute of Technology Adam Tauman Kalai Toyota Technological Institute at Chicago Ehud Kalai Kellogg Graduate
More informationIn the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S.
In the Name of God Sharif University of Technology Graduate School of Management and Economics Microeconomics 2 44706 (1394-95 2 nd term) - Group 2 Dr. S. Farshad Fatemi Chapter 8: Simultaneous-Move Games
More informationCommunication in Repeated Games with Costly Monitoring
Communication in Repeated Games with Costly Monitoring Elchanan Ben-Porath 1 and Michael Kahneman January, 2002 1 The department of Economics and the Center for Rationality, the Hebrew University of Jerusalem,
More informationIntroduction. Microeconomics II. Dominant Strategies. Definition (Dominant Strategies)
Introduction Microeconomics II Dominance and Rationalizability Levent Koçkesen Koç University Nash equilibrium concept assumes that each player knows the other players equilibrium behavior This is problematic
More informationMicroeconomic Theory II Preliminary Examination Solutions
Microeconomic Theory II Preliminary Examination Solutions 1. (45 points) Consider the following normal form game played by Bruce and Sheila: L Sheila R T 1, 0 3, 3 Bruce M 1, x 0, 0 B 0, 0 4, 1 (a) Suppose
More informationFinitely repeated simultaneous move game.
Finitely repeated simultaneous move game. Consider a normal form game (simultaneous move game) Γ N which is played repeatedly for a finite (T )number of times. The normal form game which is played repeatedly
More informationAn introduction on game theory for wireless networking [1]
An introduction on game theory for wireless networking [1] Ning Zhang 14 May, 2012 [1] Game Theory in Wireless Networks: A Tutorial 1 Roadmap 1 Introduction 2 Static games 3 Extensive-form games 4 Summary
More informationEconomics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5
Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5 The basic idea prisoner s dilemma The prisoner s dilemma game with one-shot payoffs 2 2 0
More informationSF2972 GAME THEORY Infinite games
SF2972 GAME THEORY Infinite games Jörgen Weibull February 2017 1 Introduction Sofar,thecoursehasbeenfocusedonfinite games: Normal-form games with a finite number of players, where each player has a finite
More informationBounded computational capacity equilibrium
Available online at www.sciencedirect.com ScienceDirect Journal of Economic Theory 63 (206) 342 364 www.elsevier.com/locate/jet Bounded computational capacity equilibrium Penélope Hernández a, Eilon Solan
More informationOutline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010
May 19, 2010 1 Introduction Scope of Agent preferences Utility Functions 2 Game Representations Example: Game-1 Extended Form Strategic Form Equivalences 3 Reductions Best Response Domination 4 Solution
More informationOn Finite Strategy Sets for Finitely Repeated Zero-Sum Games
On Finite Strategy Sets for Finitely Repeated Zero-Sum Games Thomas C. O Connell Department of Mathematics and Computer Science Skidmore College 815 North Broadway Saratoga Springs, NY 12866 E-mail: oconnellt@acm.org
More informationECON/MGEC 333 Game Theory And Strategy Problem Set 9 Solutions. Levent Koçkesen January 6, 2011
Koç University Department of Economics ECON/MGEC 333 Game Theory And Strategy Problem Set Solutions Levent Koçkesen January 6, 2011 1. (a) Tit-For-Tat: The behavior of a player who adopts this strategy
More informationEX-POST STABILITY IN LARGE GAMES (DRAFT, COMMENTS WELCOME)
EX-POST STABILITY IN LARGE GAMES (DRAFT, COMMENTS WELCOME) EHUD KALAI Abstract. The equilibria of strategic games with many semi-anonymous players have a strong ex-post Nash property. Even with perfect
More informationFDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.
FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.) Hints for Problem Set 3 1. Consider the following strategic
More informationRepeated Games with Perfect Monitoring
Repeated Games with Perfect Monitoring Mihai Manea MIT Repeated Games normal-form stage game G = (N, A, u) players simultaneously play game G at time t = 0, 1,... at each date t, players observe all past
More information6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts
6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts Asu Ozdaglar MIT February 9, 2010 1 Introduction Outline Review Examples of Pure Strategy Nash Equilibria
More informationREPUTATION WITH LONG RUN PLAYERS AND IMPERFECT OBSERVATION
REPUTATION WITH LONG RUN PLAYERS AND IMPERFECT OBSERVATION ALP E. ATAKAN AND MEHMET EKMEKCI Abstract. Previous work shows that reputation results may fail in repeated games between two long-run players
More informationAdvanced Micro 1 Lecture 14: Dynamic Games Equilibrium Concepts
Advanced Micro 1 Lecture 14: Dynamic Games quilibrium Concepts Nicolas Schutz Nicolas Schutz Dynamic Games: quilibrium Concepts 1 / 79 Plan 1 Nash equilibrium and the normal form 2 Subgame-perfect equilibrium
More informationBasic Game-Theoretic Concepts. Game in strategic form has following elements. Player set N. (Pure) strategy set for player i, S i.
Basic Game-Theoretic Concepts Game in strategic form has following elements Player set N (Pure) strategy set for player i, S i. Payoff function f i for player i f i : S R, where S is product of S i s.
More informationDiscounted Stochastic Games
Discounted Stochastic Games Eilon Solan October 26, 1998 Abstract We give an alternative proof to a result of Mertens and Parthasarathy, stating that every n-player discounted stochastic game with general
More informationIntroduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)
Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4) Outline: Modeling by means of games Normal form games Dominant strategies; dominated strategies,
More informationAn Approach to Bounded Rationality
An Approach to Bounded Rationality Eli Ben-Sasson Technion Adam Tauman Kalai TTI Chicago Ehud Kalai Northwestern University July 13, 2006 Abstract A central question in game theory, learning, and other
More informationLecture 3 Representation of Games
ecture 3 epresentation of Games 4. Game Theory Muhamet Yildiz oad Map. Cardinal representation Expected utility theory. Quiz 3. epresentation of games in strategic and extensive forms 4. Dominance; dominant-strategy
More informationIn reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219
Repeated Games Basic lesson of prisoner s dilemma: In one-shot interaction, individual s have incentive to behave opportunistically Leads to socially inefficient outcomes In reality; some cases of prisoner
More informationRational Behaviour and Strategy Construction in Infinite Multiplayer Games
Rational Behaviour and Strategy Construction in Infinite Multiplayer Games Michael Ummels ummels@logic.rwth-aachen.de FSTTCS 2006 Michael Ummels Rational Behaviour and Strategy Construction 1 / 15 Infinite
More informationPLAYING GAMES WITHOUT OBSERVING PAYOFFS
PLAYING GAMES WITHOUT OBSERVING PAYOFFS Michal Feldman Hebrew University & Microsoft Israel R&D Center Joint work with Adam Kalai and Moshe Tennenholtz FLA--BONG-DING FLA BONG DING 鲍步 爱丽丝 Y FLA Y FLA 5
More informationTTIC An Introduction to the Theory of Machine Learning. Learning and Game Theory. Avrim Blum 5/7/18, 5/9/18
TTIC 31250 An Introduction to the Theory of Machine Learning Learning and Game Theory Avrim Blum 5/7/18, 5/9/18 Zero-sum games, Minimax Optimality & Minimax Thm; Connection to Boosting & Regret Minimization
More informationAssignment 1: Preference Relations. Decision Theory. Pareto Optimality. Game Types.
Simon Fraser University Spring 2010 CMPT 882 Instructor: Oliver Schulte Assignment 1: Preference Relations. Decision Theory. Pareto Optimality. Game Types. The due date for this assignment is Wednesday,
More informationMaximal Revenue with Multiple Goods: Nonmonotonicity and Other Observations
Maximal Revenue with Multiple Goods: Nonmonotonicity and Other Observations Sergiu Hart Philip J. Reny November 21, 2013 Abstract Consider the problem of maximizing the revenue from selling a number of
More informationSolutions of Bimatrix Coalitional Games
Applied Mathematical Sciences, Vol. 8, 2014, no. 169, 8435-8441 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2014.410880 Solutions of Bimatrix Coalitional Games Xeniya Grigorieva St.Petersburg
More informationCMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies
CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies Mohammad T. Hajiaghayi University of Maryland Behavioral Strategies In imperfect-information extensive-form games, we can define
More informationIntroductory Microeconomics
Prof. Wolfram Elsner Faculty of Business Studies and Economics iino Institute of Institutional and Innovation Economics Introductory Microeconomics More Formal Concepts of Game Theory and Evolutionary
More informationFDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.
FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.) Hints for Problem Set 2 1. Consider a zero-sum game, where
More informationOn Existence of Equilibria. Bayesian Allocation-Mechanisms
On Existence of Equilibria in Bayesian Allocation Mechanisms Northwestern University April 23, 2014 Bayesian Allocation Mechanisms In allocation mechanisms, agents choose messages. The messages determine
More informationExtensive-Form Games with Imperfect Information
May 6, 2015 Example 2, 2 A 3, 3 C Player 1 Player 1 Up B Player 2 D 0, 0 1 0, 0 Down C Player 1 D 3, 3 Extensive-Form Games With Imperfect Information Finite No simultaneous moves: each node belongs to
More informationChapter 7. Sampling Distributions and the Central Limit Theorem
Chapter 7. Sampling Distributions and the Central Limit Theorem 1 Introduction 2 Sampling Distributions related to the normal distribution 3 The central limit theorem 4 The normal approximation to binomial
More information6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2
6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2 Daron Acemoglu and Asu Ozdaglar MIT October 14, 2009 1 Introduction Outline Review Examples of Pure Strategy Nash Equilibria Mixed Strategies
More informationIdentification and Estimation of Dynamic Games when Players Belief Are Not in Equilibrium
Identification and Estimation of Dynamic Games when Players Belief Are Not in Equilibrium A Short Review of Aguirregabiria and Magesan (2010) January 25, 2012 1 / 18 Dynamics of the game Two players, {i,
More informationBlackwell Optimality in Markov Decision Processes with Partial Observation
Blackwell Optimality in Markov Decision Processes with Partial Observation Dinah Rosenberg and Eilon Solan and Nicolas Vieille April 6, 2000 Abstract We prove the existence of Blackwell ε-optimal strategies
More informationECON322 Game Theory Half II
ECON322 Game Theory Half II Part 1: Reasoning Foundations Rationality Christian W. Bach University of Liverpool & EPICENTER Agenda Introduction Rational Choice Strict Dominance Characterization of Rationality
More informationG5212: Game Theory. Mark Dean. Spring 2017
G5212: Game Theory Mark Dean Spring 2017 Bargaining We will now apply the concept of SPNE to bargaining A bit of background Bargaining is hugely interesting but complicated to model It turns out that the
More informationCan we have no Nash Equilibria? Can you have more than one Nash Equilibrium? CS 430: Artificial Intelligence Game Theory II (Nash Equilibria)
CS 0: Artificial Intelligence Game Theory II (Nash Equilibria) ACME, a video game hardware manufacturer, has to decide whether its next game machine will use DVDs or CDs Best, a video game software producer,
More informationAdvanced Microeconomics
Advanced Microeconomics ECON5200 - Fall 2014 Introduction What you have done: - consumers maximize their utility subject to budget constraints and firms maximize their profits given technology and market
More informationReactive Synthesis Without Regret
Reactive Synthesis Without Regret (Non, rien de rien... ) Guillermo A. Pérez Highlights @ Prague September, 2015 That feeling when... Example: doing your laundry sc s G.A. Pérez (ULB) Reactive Synthesis
More informationGames of Incomplete Information
Games of Incomplete Information EC202 Lectures V & VI Francesco Nava London School of Economics January 2011 Nava (LSE) EC202 Lectures V & VI Jan 2011 1 / 22 Summary Games of Incomplete Information: Definitions:
More informationChapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem
Chapter 10: Mixed strategies Nash equilibria reaction curves and the equality of payoffs theorem Nash equilibrium: The concept of Nash equilibrium can be extended in a natural manner to the mixed strategies
More informationReduced Complexity Approaches to Asymmetric Information Games
Reduced Complexity Approaches to Asymmetric Information Games Jeff Shamma and Lichun Li Georgia Institution of Technology ARO MURI Annual Review November 19, 2014 Research Thrust: Obtaining Actionable
More informationWarm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games
Repeated Games Warm up: bargaining Suppose you and your Qatz.com partner have a falling-out. You agree set up two meetings to negotiate a way to split the value of your assets, which amount to $1 million
More informationAn Adaptive Learning Model in Coordination Games
Department of Economics An Adaptive Learning Model in Coordination Games Department of Economics Discussion Paper 13-14 Naoki Funai An Adaptive Learning Model in Coordination Games Naoki Funai June 17,
More informationCS711: Introduction to Game Theory and Mechanism Design
CS711: Introduction to Game Theory and Mechanism Design Teacher: Swaprava Nath Domination, Elimination of Dominated Strategies, Nash Equilibrium Domination Normal form game N, (S i ) i N, (u i ) i N Definition
More informationEconomics 171: Final Exam
Question 1: Basic Concepts (20 points) Economics 171: Final Exam 1. Is it true that every strategy is either strictly dominated or is a dominant strategy? Explain. (5) No, some strategies are neither dominated
More informationCHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION
CHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION Szabolcs Sebestyén szabolcs.sebestyen@iscte.pt Master in Finance INVESTMENTS Sebestyén (ISCTE-IUL) Choice Theory Investments 1 / 65 Outline 1 An Introduction
More informationEpistemic Experiments: Utilities, Beliefs, and Irrational Play
Epistemic Experiments: Utilities, Beliefs, and Irrational Play P.J. Healy PJ Healy (OSU) Epistemics 2017 1 / 62 Motivation Question: How do people play games?? E.g.: Do people play equilibrium? If not,
More informationStochastic Games and Bayesian Games
Stochastic Games and Bayesian Games CPSC 532L Lecture 10 Stochastic Games and Bayesian Games CPSC 532L Lecture 10, Slide 1 Lecture Overview 1 Recap 2 Stochastic Games 3 Bayesian Games Stochastic Games
More informationGAME THEORY. Game theory. The odds and evens game. Two person, zero sum game. Prototype example
Game theory GAME THEORY (Hillier & Lieberman Introduction to Operations Research, 8 th edition) Mathematical theory that deals, in an formal, abstract way, with the general features of competitive situations
More informationAUCTIONEER ESTIMATES AND CREDULOUS BUYERS REVISITED. November Preliminary, comments welcome.
AUCTIONEER ESTIMATES AND CREDULOUS BUYERS REVISITED Alex Gershkov and Flavio Toxvaerd November 2004. Preliminary, comments welcome. Abstract. This paper revisits recent empirical research on buyer credulity
More informationINVESTMENT DYNAMICS IN ELECTRICITY MARKETS Alfredo Garcia, University of Virginia joint work with Ennio Stacchetti, New York University May 2007
INVESTMENT DYNAMICS IN ELECTRICITY MARKETS Alfredo Garcia, University of Virginia joint work with Ennio Stacchetti, New York University May 2007 1 MOTIVATION We study resource adequacy as an endogenous
More informationInfinitely Repeated Games
February 10 Infinitely Repeated Games Recall the following theorem Theorem 72 If a game has a unique Nash equilibrium, then its finite repetition has a unique SPNE. Our intuition, however, is that long-term
More informationGame Theory Fall 2006
Game Theory Fall 2006 Answers to Problem Set 3 [1a] Omitted. [1b] Let a k be a sequence of paths that converge in the product topology to a; that is, a k (t) a(t) for each date t, as k. Let M be the maximum
More informationMultitask, Accountability, and Institutional Design
Multitask, Accountability, and Institutional Design Scott Ashworth & Ethan Bueno de Mesquita Harris School of Public Policy Studies University of Chicago 1 / 32 Motivation Multiple executive tasks divided
More informationDecision Making in Uncertain and Changing Environments
Decision Making in Uncertain and Changing Environments Karl H. Schlag Andriy Zapechelnyuk June 2, 2009 Abstract We consider an agent who has to repeatedly make choices in an uncertain and changing environment,
More informationMicroeconomics of Banking: Lecture 5
Microeconomics of Banking: Lecture 5 Prof. Ronaldo CARPIO Oct. 23, 2015 Administrative Stuff Homework 2 is due next week. Due to the change in material covered, I have decided to change the grading system
More informationMaintaining a Reputation Against a Patient Opponent 1
Maintaining a Reputation Against a Patient Opponent July 3, 006 Marco Celentani Drew Fudenberg David K. Levine Wolfgang Pesendorfer ABSTRACT: We analyze reputation in a game between a patient player and
More informationarxiv: v1 [math.oc] 23 Dec 2010
ASYMPTOTIC PROPERTIES OF OPTIMAL TRAJECTORIES IN DYNAMIC PROGRAMMING SYLVAIN SORIN, XAVIER VENEL, GUILLAUME VIGERAL Abstract. We show in a dynamic programming framework that uniform convergence of the
More informationMartingales. by D. Cox December 2, 2009
Martingales by D. Cox December 2, 2009 1 Stochastic Processes. Definition 1.1 Let T be an arbitrary index set. A stochastic process indexed by T is a family of random variables (X t : t T) defined on a
More informationECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games
University of Illinois Fall 2018 ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games Due: Tuesday, Sept. 11, at beginning of class Reading: Course notes, Sections 1.1-1.4 1. [A random
More informationDecision Making in Uncertain and Changing Environments
Decision Making in Uncertain and Changing Environments Karl H. Schlag Andriy Zapechelnyuk June 18, 2009 Abstract We consider an agent who has to repeatedly make choices in an uncertain and changing environment,
More informationChapter 7. Sampling Distributions and the Central Limit Theorem
Chapter 7. Sampling Distributions and the Central Limit Theorem 1 Introduction 2 Sampling Distributions related to the normal distribution 3 The central limit theorem 4 The normal approximation to binomial
More informationUsing the Maximin Principle
Using the Maximin Principle Under the maximin principle, it is easy to see that Rose should choose a, making her worst-case payoff 0. Colin s similar rationality as a player induces him to play (under
More informationLecture 5 Leadership and Reputation
Lecture 5 Leadership and Reputation Reputations arise in situations where there is an element of repetition, and also where coordination between players is possible. One definition of leadership is that
More informationStochastic Games and Bayesian Games
Stochastic Games and Bayesian Games CPSC 532l Lecture 10 Stochastic Games and Bayesian Games CPSC 532l Lecture 10, Slide 1 Lecture Overview 1 Recap 2 Stochastic Games 3 Bayesian Games 4 Analyzing Bayesian
More informationMath 152: Applicable Mathematics and Computing
Math 152: Applicable Mathematics and Computing May 22, 2017 May 22, 2017 1 / 19 Bertrand Duopoly: Undifferentiated Products Game (Bertrand) Firm and Firm produce identical products. Each firm simultaneously
More informationBeliefs and Sequential Rationality
Beliefs and Sequential Rationality A system of beliefs µ in extensive form game Γ E is a specification of a probability µ(x) [0,1] for each decision node x in Γ E such that x H µ(x) = 1 for all information
More informationPrisoner s Dilemma. CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma. Prisoner s Dilemma. Prisoner s Dilemma.
CS 331: rtificial Intelligence Game Theory I You and your partner have both been caught red handed near the scene of a burglary. oth of you have been brought to the police station, where you are interrogated
More informationSolutions to Homework 3
Solutions to Homework 3 AEC 504 - Summer 2007 Fundamentals of Economics c 2007 Alexander Barinov 1 Price Discrimination Consider a firm with MC = AC = 2, which serves two markets with demand functions
More informationThe Neoclassical Growth Model
The Neoclassical Growth Model 1 Setup Three goods: Final output Capital Labour One household, with preferences β t u (c t ) (Later we will introduce preferences with respect to labour/leisure) Endowment
More informationUniversity of Hong Kong ECON6036 Stephen Chiu. Extensive Games with Perfect Information II. Outline
University of Hong Kong ECON6036 Stephen Chiu Extensive Games with Perfect Information II 1 Outline Interpretation of strategy Backward induction One stage deviation principle Rubinstein alternative bargaining
More informationWeb Appendix: Proofs and extensions.
B eb Appendix: Proofs and extensions. B.1 Proofs of results about block correlated markets. This subsection provides proofs for Propositions A1, A2, A3 and A4, and the proof of Lemma A1. Proof of Proposition
More informationGAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.
14.126 GAME THEORY MIHAI MANEA Department of Economics, MIT, 1. Existence and Continuity of Nash Equilibria Follow Muhamet s slides. We need the following result for future reference. Theorem 1. Suppose
More informationGames of Incomplete Information ( 資訊不全賽局 ) Games of Incomplete Information
1 Games of Incomplete Information ( 資訊不全賽局 ) Wang 2012/12/13 (Lecture 9, Micro Theory I) Simultaneous Move Games An Example One or more players know preferences only probabilistically (cf. Harsanyi, 1976-77)
More informationSI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z
SI 563 Homework 3 Oct 5, 06 Chapter 7 Exercise : ( points) Determine the set of rationalizable strategies for each of the following games. a) U (0,4) (4,0) M (3,3) (3,3) D (4,0) (0,4) X Y U (0,4) (4,0)
More informationIn the Name of God. Sharif University of Technology. Graduate School of Management and Economics
In the Name of God Sharif University of Technology Graduate School of Management and Economics Microeconomics (for MBA students) 44111 (1393-94 1 st term) - Group 2 Dr. S. Farshad Fatemi Game Theory Game:
More informationFrom Discrete Time to Continuous Time Modeling
From Discrete Time to Continuous Time Modeling Prof. S. Jaimungal, Department of Statistics, University of Toronto 2004 Arrow-Debreu Securities 2004 Prof. S. Jaimungal 2 Consider a simple one-period economy
More informationGame Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012
Game Theory Lecture Notes By Y. Narahari Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 22 COOPERATIVE GAME THEORY Correlated Strategies and Correlated
More information