Continuing game theory: mixed strategy equilibrium (Ch ), optimality (6.9), start on extensive form games (6.10, Sec. C)!

Similar documents
G5212: Game Theory. Mark Dean. Spring 2017

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Microeconomics of Banking: Lecture 5

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

Game Theory. Analyzing Games: From Optimality to Equilibrium. Manar Mohaisen Department of EEC Engineering

CUR 412: Game Theory and its Applications, Lecture 9

CMPSCI 240: Reasoning about Uncertainty

Iterated Dominance and Nash Equilibrium

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

CUR 412: Game Theory and its Applications, Lecture 12

Week 8: Basic concepts in game theory

Introduction to Game Theory

CS 7180: Behavioral Modeling and Decision- making in AI

An introduction on game theory for wireless networking [1]

Lecture 12: Introduction to reasoning under uncertainty. Actions and Consequences

Elements of Economic Analysis II Lecture X: Introduction to Game Theory

Simon Fraser University Fall Econ 302 D200 Final Exam Solution Instructor: Songzi Du Wednesday December 16, 2015, 8:30 11:30 AM

Preliminary Notions in Game Theory

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

Now we return to simultaneous-move games. We resolve the issue of non-existence of Nash equilibrium. in pure strategies through intentional mixing.

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Sequential Rationality and Weak Perfect Bayesian Equilibrium

Mixed Strategies. Samuel Alizon and Daniel Cownden February 4, 2009

CMPSCI 240: Reasoning about Uncertainty

Finitely repeated simultaneous move game.

Replicator Dynamics 1

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies

Economic Management Strategy: Hwrk 1. 1 Simultaneous-Move Game Theory Questions.

Extensive-Form Games with Imperfect Information

Noncooperative Oligopoly

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

Introduction to Game Theory

Introduction to Multi-Agent Programming

Week 8: Basic concepts in game theory

G5212: Game Theory. Mark Dean. Spring 2017

Outline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010

Advanced Micro 1 Lecture 14: Dynamic Games Equilibrium Concepts

LECTURE NOTES ON GAME THEORY. Player 2 Cooperate Defect Cooperate (10,10) (-1,11) Defect (11,-1) (0,0)

1 Solutions to Homework 4

ECON 803: MICROECONOMIC THEORY II Arthur J. Robson Fall 2016 Assignment 9 (due in class on November 22)

Introductory Microeconomics

Epistemic Game Theory

TTIC An Introduction to the Theory of Machine Learning. Learning and Game Theory. Avrim Blum 5/7/18, 5/9/18

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

ANASH EQUILIBRIUM of a strategic game is an action profile in which every. Strategy Equilibrium

Math 135: Answers to Practice Problems

PAULI MURTO, ANDREY ZHUKOV

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2017

Rationalizable Strategies

Answers to Problem Set 4

Lecture 6 Dynamic games with imperfect information

A brief introduction to evolutionary game theory

Economics 109 Practice Problems 1, Vincent Crawford, Spring 2002

MA200.2 Game Theory II, LSE

Version 01 as of May 12, 2014 Author: Steven Sagona To be submitted to PRL (not really)

Can we have no Nash Equilibria? Can you have more than one Nash Equilibrium? CS 430: Artificial Intelligence Game Theory II (Nash Equilibria)

Outline for today. Stat155 Game Theory Lecture 19: Price of anarchy. Cooperative games. Price of anarchy. Price of anarchy

January 26,

Economics 51: Game Theory

S 2,2-1, x c C x r, 1 0,0

Lecture 3 Representation of Games

IV. Cooperation & Competition

Notes for Section: Week 4

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

CS 798: Homework Assignment 4 (Game Theory)

Economics 335 March 2, 1999 Notes 6: Game Theory

ECONS 424 STRATEGY AND GAME THEORY HOMEWORK #7 ANSWER KEY

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.

Exercises Solutions: Game Theory

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Game Theory: Minimax, Maximin, and Iterated Removal Naima Hammoud

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

Stochastic Games and Bayesian Games

Advanced Microeconomics

Opinion formation CS 224W. Cascades, Easley & Kleinberg Ch 19 1

Game theory and applications: Lecture 1

Prisoner s dilemma with T = 1

Econ 711 Homework 1 Solutions

d. Find a competitive equilibrium for this economy. Is the allocation Pareto efficient? Are there any other competitive equilibrium allocations?

Evolutionary voting games. Master s thesis in Complex Adaptive Systems CARL FREDRIKSSON

Problem 3 Solutions. l 3 r, 1

CSI 445/660 Part 9 (Introduction to Game Theory)

MIDTERM ANSWER KEY GAME THEORY, ECON 395

Name. Answers Discussion Final Exam, Econ 171, March, 2012

Basic Game-Theoretic Concepts. Game in strategic form has following elements. Player set N. (Pure) strategy set for player i, S i.

Chapter 6. Game Theory

Dynamic Games. Econ 400. University of Notre Dame. Econ 400 (ND) Dynamic Games 1 / 18

Introduction to Game Theory Lecture Note 5: Repeated Games

In the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S.

Chapter 11: Dynamic Games and First and Second Movers

Game Theory. Important Instructions

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

CUR 412: Game Theory and its Applications, Lecture 11

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

Review Best Response Mixed Strategy NE Summary. Syllabus

Problem Set 2 - SOLUTIONS

The Ohio State University Department of Economics Econ 601 Prof. James Peck Extra Practice Problems Answers (for final)

Evolution & Learning in Games

Transcription:

CSC200: Lecture 10!Today Continuing game theory: mixed strategy equilibrium (Ch.6.7-6.8), optimality (6.9), start on extensive form games (6.10, Sec. C)!Next few lectures game theory: Ch.8, Ch.9!Announcements Quiz 2 on Friday, Oct.23: covers Chapter 4 (closures in a socialaffiliation network)! see practice question on web page 1

Mixed Equilibria Matching Pennies!Some games have no pure strategy (deterministic) NE MP: 1 heads 2 heads 1 tails 2 tails 1 heads!solution: randomized or mixed strategies players randomly choose each action with some probability Nash s result: (mixed) equilibria exist for all normal form games 2

Mixed Strategies! Let S 1 be the set of pure strategies of player 1! A mixed strategy σ 1 for player 1 is a probability distribution over S 1 assigns a probability to each s in S 1 (probabilities must sum to one, must be non-negative) player 1 chooses each s with the corresponding probability we say the player is mixing between his pure strategies! Examples (note: pure strategies are a special case): [0.5 heads, 0.5 tails]; [1.0 heads, 0 tails]; [0.01 heads, 0.99 tails] Firm1: [0.5 A, 0.25 B, 0.25 C]; [1.0 A, 0 B, 0 C] Bob: [1.0 uptown, 0 downtown]; [0.2 uptown, 0.8 downtown]! In two-action games, often just state probability p of first action probability of second action is 1-p so set of strategies σ is just set of real numbers between 0 and 1 3

Outcomes of Mixed Strategies! Since players randomize, outcome and payoffs random are too! Consider matching pennies P1 plays heads with probability p (tails, 1-p) (e.g., p=0.3) P2 plays heads with probability q (tails 1-q) (e.g., q=0.8) Assume they randomize independently (can t coordinate!)! Probability of various outcomes and payoffs Outcome Prob e.g. Payoff(1) Payoff(2) (H,H) pq 0.24-1 +1 (H,T) p(1-q) 0.06 +1-1 (T,H) (1-p)q 0.56 +1-1 (T,T) (1-p)(1-q) 0.14-1 +1 4

Payoff of Mixed Profile!Since payoff to each player is random, the value to the player of a profile is her expected payoff simply take the (probabilistic weighted) average of her payoffs Outcome Prob e.g. Payoff(1) Payoff(2) (H,H) pq 0.24-1 +1 (H,T) p(1-q) 0.06 +1-1 (T,H) (1-p)q 0.56 +1-1 (T,T) (1-p)(1-q) 0.14-1 +1! Expected payoff (or expected value, EV) for player 1: 0.24(-1) + 0.06(1) + 0.56(1) + 0.14(-1) = 0.24! Expected payoff for player 2: 0.24(1) + 0.06(-1) + 0.56(-1) + 0.14(1) = 0.24! Hardly a coincidence that one is minus the other 5

Best Responses to Mixed Strategies!Which strategy should P1 choose if P2 plays q? EV(H) = q(-1) + (1-q)(1) = 1-2q EV(T) = q(1) + (1-q)(-1) = 2q -1 EV(p) = p EV(H) + (1-p) EV(T) = p(1-2q) + (1-p)(2q - 1)!So P1 s best response to q is: H is better than T: if 1-2q > 2q-1 (i.e., q<0.5) T is better than H: if 1-2q < 2q-1 (i.e., q>0.5) H and T equally good: if 1-2q = 2q-1 (i.e., q=0.5) Mixed strategy p?! p must be 1 if q<0.5! p must be 0 if q>0.5! p can be anything between 0 and 1 if q=0.5 6

Nash Equilibrium in Mixed Strategies!Define NE exactly as before, but allowing mixing a pair of mixed strategies. s for player 1 and t for player 2, such that s is a best response to t, and t is a best response to s generalize to more than two players as before!what is NE of matching pennies? As before, no pure strategy can be part of NE: unstable So P2 must play a mixed strategy q (q can t be 0 or 1) But we ve seen P1 s best responses are pure unless q = 0.5 So only NE must have q = 0.5 We ve seen P1 s best response to q = 0.5 can be any p Unless p = 0.5, P2 s best response would also be pure So we must have p = 0.5!Only NE: ([0.5,0.5] 1, [0.5, 0.5] 2 ) i.e., p = 0.5 and q=0.5 7

Notes on Mixed Equilibria! If player has a mixed best response that gives positive probability to two (or more) pure strategies, each such pure strategy must be a deterministic best response Suppose opponent plays σ and responses have EV(s 1 ) > EV(s 2 ) Then response [p s 1 ; (1-p) s 2 ] is worse than s 1 unless p = 1 Players can only mix between equally good pure strategies! Why mix? Prevents other player from exploiting you! In 2-player, 2-action games, a mixed strategy equilibrium requires the mixture of each player to make the other player indifferent to playing either of their pure strategies in matching pennies, ensures other player can t exploit what you do this is the basis of some algorithms for computing NE applies to general games (more than two players, two actions)! Nash s result: showed the existence of (mixed) Nash equilibrium for any finite game in normal form 8

Exercise: Rock Paper Scissors!Determine mixed equilibrium of rock-paper-scissors Just a version of matching pennies with more moves Year World Champion Country 2002 Peter Lovering Canada 2003 Rob Krueger Canada 2004 Lee Rammage Canada 2005 Andrew Bergel Canada 2006 Bob Cooper United Kingdom 2007 Andrea Farina USA 2008 Monica Martinez Canada 2009 Tim Conrad USA 9

Exercise: Rock Paper Scissors Lizard Spock!Or try its more interesting variant: rock-paper-scissors-lizard-spock 10

Notes on Mixed Equilibria!Mixed strategies are rarely even mixes Consider run-pass game (American/Canadian football) No pure NE; only mixed NE requires that defense defends pass with q = 2/3; and offence will pass with p = 1/3 Why doesn t offense pass more frequently?! if it did so, defense would always defend against the pass (and offense would get lower expected payoff)! why not run more frequently? Same reason.! threat of passing: defense must defend it more often than it occurs Indifference principle observed (roughly) in NFL! avg yds gained per pass attempt close to that of avg. yds. per run 11

c Computing Mixed Equilibria! We can use principal of indifference to P1 compute mixed Nash equilibria This game has no pure NE, so compute mixed eq: P2 d a 8/6 0 / 10 b 4/4 1 / 2! P2 s mixture (pc, pd) must make P1 indifferent between playing a and b Recall that pd = 1 - pc! P1 s mixture (pa, pb) must make P2 indifferent between playing c and d Recall that pb = 1 - pa 12

Interpretation of Mixed Equilibria!What is meaning of mixed strategies and NE?! Genuine randomization of choices to prevent exploitation soccer penalty kicks, bluffing in poker, many contests and sports,...! Probabilities represent proportions of strategies in a population likelihood of two different types of agents (behaviors, species, competing fashions, ) meeting is given by relative proportions payoff interpreted as successful propagation of that agent type style of analysis used a lot in evolutionary biology to explain emergence of many phenomena (altruism, sexual differentiation, etc.); sociology and anthropology to explain evolution of culture, norms, ideas; etc. see Ch.7 of E&K 13

Interpretation of Mixed Equilibria! Conditions on stable beliefs (or expectations, conjectures) about how opponent will play if I believe you will play q = 0.5, and I believe that you believe I will play p = 0.5, and I believe that you believe that I believe that you will play q = 0.5, and! Learning interpretations: beliefs based on experience with opponent (or related opponents) I ve (P2) seen you (P1) play heads 47 times and tails 53 times: I assume your strategy is p = 0.47 My best response is heads (q = 1.0) But you learn in similar fashion and also adopt best response to what you ve observed In some cases, observed frequencies converge to NE But actually play may often be deterministic (and quite cyclic, not random at all) 14

Pareto Optimality (will come back to this in Ch.8) Quiet Quiet -1 / -1 Confess -10 / 0 Confess 0 / -10-4 / -4!We called some NE undesirable (e.g., Prisoners s dilemma)!need some notion of optimality to compare payoff profiles or strategy profiles!pareto optimality: a strategy profile is Pareto optimal iff no other strategy profile gives some player(s) a higher (expected) payoff without lowering the payoffs of others Everyone would agree to switch from a non-pareto optimal profile to a Pareto-optimal profile (if agreement enforceable )!In PD: the unique NE is not Pareto optimal Cooperative profile is better for all involved (but not enforceable) Note: Q/C and C/Q are also both Pareto optimal (only Quiet player wants to switch) 15

Social Optimality (will come back to this in Ch.8) Quiet Confess Quiet -1 / -1 0 / -10 Confess -10 / 0-4 / -4!Pareto optimality desirable, but weak and not always achievable in noncooperative games cooperative game theory: models commitments/agreements!social optimality: a strategy profile is socially optimal iff it gives rise to the highest sum of expected payoffs Total payoff to entire group (social welfare) is maximized Assumes that summing payoffs in sensible If a profile is socially optimal, it must be Pareto optimal (if one player better off without hurting others, total payoff must increase), but not vice versa!in PD: the unique NE is not socially optimal 16

Extensive Form (Dynamic) Games! Normal form (matrix) games seem limited Players move simultaneously and outcome determined at once No observation, reaction, etc.! Most games have a dynamic structure (turn taking) Chess, tic-tac-toe, cards games, soccer, corporate decisions, markets See what opponent does before making move Sometimes you only see partial information (won t discuss this)! Example: Rogers, Telus competing for market in two remote areas Each firm, Rogers and Telus, can tackle one area only Total revenue in Area A: 12, Area B: 6 If firm is alone in one area, get all of that area s revenue If both firms target same area, first mover gets 2/3, second 1/3 Rogers prepped: makes first move, Telus chooses after Rogers Critical: Telus observes Rogers choice before moving! 17

Game Tree (Extensive Form) Rogers Region A Region B Rogers choice Region A Telus Region B Region A Telus Region B Telus choice 8, 4 12, 6 6, 12 4, 2 R s payoff, T s payoff 18

Strategies Adopted Bwith payoff 6 (better than A) Region A Region A Telus 12, 6 Rogers 12, 6 6, 12 Region B Region A Region B Telus A with payoff 12 better than B With payoff 6 Region B A with payoff 12 (better than B) 8, 4 12, 6 6, 12 4, 2 19

Backward Induction! Start from choice nodes at bottom of tree Player at that node chooses best move E.g., Telus chooses B at node Rogers did A, A at node Rogers did B This dictates which terminal node (payoffs) will be reached We can now assign payoffs to that node (from chosen terminal node)! Work up the tree: compute choices at other nodes once choices/payoffs at all child nodes made Player at that node chooses best move E.g., Rogers chooses A at root node Dictates which child ( Rogers did A ) will be reached, hence which payoff! At end of procedure: Each choice node labelled with action/choice and payoff vector Payoff for the game is the payoff vector at the root node Path through tree given by choices, tells us how the game will unfold 20