CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies

Similar documents
Problem 3 Solutions. l 3 r, 1

Microeconomics of Banking: Lecture 5

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

Answers to Problem Set 4

An introduction on game theory for wireless networking [1]

Notes for Section: Week 4

Stochastic Games and Bayesian Games

Economics 171: Final Exam

Introduction to Game Theory

Extensive-Form Games with Imperfect Information

ECON 803: MICROECONOMIC THEORY II Arthur J. Robson Fall 2016 Assignment 9 (due in class on November 22)

Beliefs and Sequential Rationality

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5

Game Theory. Wolfgang Frimmel. Repeated Games

Rationalizable Strategies

Game Theory. Important Instructions

Stochastic Games and Bayesian Games

Advanced Micro 1 Lecture 14: Dynamic Games Equilibrium Concepts

G5212: Game Theory. Mark Dean. Spring 2017

Simon Fraser University Fall Econ 302 D200 Final Exam Solution Instructor: Songzi Du Wednesday December 16, 2015, 8:30 11:30 AM

Finitely repeated simultaneous move game.

Lecture Note Set 3 3 N-PERSON GAMES. IE675 Game Theory. Wayne F. Bialas 1 Monday, March 10, N-Person Games in Strategic Form

Lecture 6 Dynamic games with imperfect information

Agenda. Game Theory Matrix Form of a Game Dominant Strategy and Dominated Strategy Nash Equilibrium Game Trees Subgame Perfection

Economics 109 Practice Problems 1, Vincent Crawford, Spring 2002

January 26,

Finding Equilibria in Games of No Chance

Not 0,4 2,1. i. Show there is a perfect Bayesian equilibrium where player A chooses to play, player A chooses L, and player B chooses L.

Advanced Microeconomics

1 Solutions to Homework 4

G5212: Game Theory. Mark Dean. Spring 2017

Exercises Solutions: Game Theory

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

Warm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games

CUR 412: Game Theory and its Applications, Lecture 12

1 x i c i if x 1 +x 2 > 0 u i (x 1,x 2 ) = 0 if x 1 +x 2 = 0

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.

Introduction to Game Theory Lecture Note 5: Repeated Games

CMSC 474, Introduction to Game Theory 20. Shapley Values

February 23, An Application in Industrial Organization

Econ 711 Homework 1 Solutions

Game theory and applications: Lecture 1

Prisoner s dilemma with T = 1

Introduction to Multi-Agent Programming

Prisoner s Dilemma. CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma. Prisoner s Dilemma. Prisoner s Dilemma.

(a) (5 points) Suppose p = 1. Calculate all the Nash Equilibria of the game. Do/es the equilibrium/a that you have found maximize social utility?

Topics in Contract Theory Lecture 1

G5212: Game Theory. Mark Dean. Spring 2017

Game Theory Notes: Examples of Games with Dominant Strategy Equilibrium or Nash Equilibrium

ECONS 424 STRATEGY AND GAME THEORY HANDOUT ON PERFECT BAYESIAN EQUILIBRIUM- III Semi-Separating equilibrium

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Chapter 11: Dynamic Games and First and Second Movers

1 R. 2 l r 1 1 l2 r 2

1 Solutions to Homework 3

EC 308 Question Set # 1

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

Economic Management Strategy: Hwrk 1. 1 Simultaneous-Move Game Theory Questions.

CHAPTER 15 Sequential rationality 1-1

MIDTERM ANSWER KEY GAME THEORY, ECON 395

Economics 335 March 2, 1999 Notes 6: Game Theory

Outline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010

Player 2 L R M H a,a 7,1 5,0 T 0,5 5,3 6,6

CMPSCI 240: Reasoning about Uncertainty

MKTG 555: Marketing Models

M.Phil. Game theory: Problem set II. These problems are designed for discussions in the classes of Week 8 of Michaelmas term. 1

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

REPEATED GAMES. MICROECONOMICS Principles and Analysis Frank Cowell. Frank Cowell: Repeated Games. Almost essential Game Theory: Dynamic.

Lecture 5 Leadership and Reputation

March 30, Why do economists (and increasingly, engineers and computer scientists) study auctions?

Games of Incomplete Information ( 資訊不全賽局 ) Games of Incomplete Information

Repeated, Stochastic and Bayesian Games

HW Consider the following game:

Microeconomics II. CIDE, MsC Economics. List of Problems

Name. Answers Discussion Final Exam, Econ 171, March, 2012

Game Theory Problem Set 4 Solutions

Chapter 8. Repeated Games. Strategies and payoffs for games played twice

Repeated Games. Econ 400. University of Notre Dame. Econ 400 (ND) Repeated Games 1 / 48

Extensive form games - contd

Player 2 H T T -1,1 1, -1

Mixed Strategies. Samuel Alizon and Daniel Cownden February 4, 2009

BAYESIAN GAMES: GAMES OF INCOMPLETE INFORMATION

The Ohio State University Department of Economics Second Midterm Examination Answers

arxiv: v1 [cs.gt] 12 Jul 2007

CS711: Introduction to Game Theory and Mechanism Design

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

TR : Knowledge-Based Rational Decisions and Nash Paths

Dynamic Games. Econ 400. University of Notre Dame. Econ 400 (ND) Dynamic Games 1 / 18

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Best-Reply Sets. Jonathan Weinstein Washington University in St. Louis. This version: May 2015

MA300.2 Game Theory 2005, LSE

Follow the Leader I has three pure strategy Nash equilibria of which only one is reasonable.

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

The Ohio State University Department of Economics Econ 601 Prof. James Peck Extra Practice Problems Answers (for final)

4. Beliefs at all info sets off the equilibrium path are determined by Bayes' Rule & the players' equilibrium strategies where possible.

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

Matching Markets and Google s Sponsored Search

Transcription:

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies Mohammad T. Hajiaghayi University of Maryland

Behavioral Strategies In imperfect-information extensive-form games, we can define a new class of strategies called behavioral strategies An agent s (probabilistic) choice at each node is independent of his/her choices at other nodes Consider the imperfect-info game shown here: A behavioral strategy for Agent 1: At node a, {(0.5, A), (0.5, B)} At node g, {(0.3, G), (0.7, H)} Is there an equivalent mixed strategy? What do we mean by equivalent?

Behavioral Strategies In imperfect-information extensive-form games, we can define a new class of strategies called behavioral strategies An agent s (probabilistic) choice at each node is independent of his/her choices at other nodes Consider the imperfect-info game shown here: A behavioral strategy for Agent 1: At node a, {(0.5, A), (0.5, B)} At node g, {(0.3, G), (0.7, H)} Is there an equivalent mixed strategy? What do we mean by equivalent? Two strategies s i and s i ' are equivalent if for every fixed strategy profile s i of the remaining agents, s i and s i ' give us the same probabilities on outcomes An equivalent mixed strategy: {(0.15, (A, G)); (0.35, (A, H)); (0.15, (B, G)); (0.35, (B, H))}

Behavioral vs. Mixed Strategies Consider the following mixed strategy: {(0.6, (A, G)), (0.4, (B, H))} The choices at the two nodes aren t independent Choose A at a choose G at g Choose B at a choose H at g Is there an equivalent behavioral strategy?

Behavioral vs. Mixed Strategies Consider the following mixed strategy: {(0.6, (A, G)), (0.4, (B, H))} The choices at the two nodes aren t independent Choose A at a choose G at g Choose B at a choose H at g Thus not always easy to find an equivalent behavioral strategy.

Behavioral vs. Mixed Strategies In some games, there are mixed strategies that have no equivalent behavioral strategy behavioral strategies that have no equivalent mixed strategy Thus mixed and behavioral strategies can produce different sets of equilibria Consider the game shown here: At both a and b, agent 1 s information set is {a, b} How can this ever happen?

Behavioral vs. Mixed Strategies Mixed strategy {(p, L), (1 p, R)}: agent 1 chooses L or R randomly, but commits to it Choose L the game will end at d Choose R the game will end at f or g The game will never end at node e Behavioral strategy {(q, L), (1 q, R)}: every time agent 1 is in {a, b}, agent 1 re-makes the choice Pr[game ends at e] = q(1 q) Pr[game ends at e] > 0, except when q = 0 or q = 1 Only two cases in which there are equivalent mixed and behavioral strategies If p = q = 0, then both strategies are the pure strategy L If p = q = 1, then both strategies are the pure strategy R In all other cases, the mixed and behavioral strategies produce different probability distributions over the outcomes

Nash Equilibrium in Mixed Strategies Nash equilibrium in mixed strategies: If agent 1 uses a mixed strategy, the game will never end at node e Thus For agent 1, R is strictly dominant For agent 2, D is strictly dominant So (R,D) is the unique Nash equilibrium

Nash Equilibrium in Behavioral Strategies Nash equilibrium in behavioral strategies: For Agent 2, D is strictly dominant Find 1 s best response among behavioral strategies Suppose 1 uses the behavioral strategy {(q, L), (1 q, R)} Then agent 1 s expected payoff is u 1 = 1 q 2 + 100 q(1 q) + 2 (1 q) = 99q 2 + 98q + 2 To find the maximum value of u 1, set du 1 /dq = 0 198q + 98 + 0 = 0, so q = 49/99 So (R,D) is not an equilibrium The equilibrium is ({(49/99, L), (50/99, R)}, D)

Why This Happened The reason the strategies weren t equivalent was because agent 1 could be in the same information set more than once With a mixed-strategy, 1 made the same move both times With a behavioral strategy, 1 could make a different move each time There are games in which this can never happen Games of perfect recall

Games of Perfect Recall In an imperfect-information game G, agent i has perfect recall if i never forgets anything he/she knew earlier In particular, i remembers all his/her own moves G is a game of perfect recall if every agent in G has perfect recall Theorem: For every history in a game of perfect recall, no agent can be in the same information set more than once Proof: Let h be any history for G. Suppose that At one point in h, i s information set is I At another point later in h, i s information set is J Then i must have made at least one move in between If i remembers all his/her moves, then At J, i remembers a longer sequence of moves than at I Thus I and J are different information sets

Games of Perfect Recall Theorem (Kuhn, 1953). In a game of perfect recall, for every mixed strategy s i there is an equivalent behavioral strategy s i ', and vice versa In a game of perfect recall, the set of Nash equilibria doesn t change if we consider behavioral strategies instead of mixed strategies

Sequential Equilibrium For perfect-information games, subgame-perfect equilibria were useful Avoided non-credible threats; could be computed more easily Is there something similar for imperfect-info games? In a subgame-perfect equilibrium, each agent s strategy must be a best response in every subgame We can t use that definition in imperfect-information games No longer have a well-defined notion of a subgame Rather, at each info set, a subforest or a collection of subgames Could we require each player s strategy to be a best response in each of the subgames in the forest? Won t work correctly

Example 2 s information set is {c,d} No strategy is a best response at both c and d But if 1 is rational, then 1 will never choose C So if rationality is common knowledge Then 2 only needs a best response at node d a b c d e f g h

Example 2 s information set is {c,d} No strategy is a best response at both c and d a If 1 is rational, then 1 will never choose L Let 1 s mixed strategy be {(p, C), (1 p, R)}, and 2 s mixed strategy be {(q, U), (1 q, D)} b c d Can show there is one Nash equilibrium, at p = q = ½ But q = ½ is not a best response at either c or d e f g h (4,2) (3,5) (3,5) (4,2)

Sequential Equilibrium This leads to a complicated solution concept called sequential equilibrium A little like a trembling-hand perfect equilibrium (which was already complicated), but with additional complications to deal with the tree structure Every finite game of perfect recall has a sequential equilibrium Every subgame-perfect equilibrium is a sequential equilibrium, but not vice versa We won t discuss it further

Summary Topics covered: information sets behavioral vs. mixed strategies games of perfect recall equivalence between behavioral and mixed strategies in such games Sequential equilibrium instead of subgame-perfect equilibrium