CMPSCI 240: Reasoning about Uncertainty

Similar documents
CMPSCI 240: Reasoning about Uncertainty

Elements of Economic Analysis II Lecture X: Introduction to Game Theory

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

m 11 m 12 Non-Zero Sum Games Matrix Form of Zero-Sum Games R&N Section 17.6

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

preferences of the individual players over these possible outcomes, typically measured by a utility or payoff function.

Outline for today. Stat155 Game Theory Lecture 19: Price of anarchy. Cooperative games. Price of anarchy. Price of anarchy

Can we have no Nash Equilibria? Can you have more than one Nash Equilibrium? CS 430: Artificial Intelligence Game Theory II (Nash Equilibria)

Agenda. Game Theory Matrix Form of a Game Dominant Strategy and Dominated Strategy Nash Equilibrium Game Trees Subgame Perfection

Static Games and Cournot. Competition

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

Introduction to Multi-Agent Programming

Prisoner s dilemma with T = 1

MATH 4321 Game Theory Solution to Homework Two

Stochastic Games and Bayesian Games

Stochastic Games and Bayesian Games

HE+ Economics Nash Equilibrium

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Thursday, March 3

15.053/8 February 28, person 0-sum (or constant sum) game theory

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

1. better to stick. 2. better to switch. 3. or does your second choice make no difference?

Economics 171: Final Exam

S 2,2-1, x c C x r, 1 0,0

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

Game Theory. Important Instructions

CHAPTER 14: REPEATED PRISONER S DILEMMA

An introduction on game theory for wireless networking [1]

G5212: Game Theory. Mark Dean. Spring 2017

Game Theory Tutorial 3 Answers

CS711: Introduction to Game Theory and Mechanism Design

Yao s Minimax Principle

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Game Theory Notes: Examples of Games with Dominant Strategy Equilibrium or Nash Equilibrium

Solution to Tutorial 1

Solution to Tutorial /2013 Semester I MA4264 Game Theory

LECTURE 4: MULTIAGENT INTERACTIONS

MKTG 555: Marketing Models

Test 1. ECON3161, Game Theory. Tuesday, September 25 th

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

Economics 51: Game Theory

Game theory and applications: Lecture 1

G5212: Game Theory. Mark Dean. Spring 2017

Outline. Risk and Decision Analysis 5. Game Theory. What is game theory? Outline. Scope of game theory. Two-person zero sum games

MS&E 246: Lecture 5 Efficiency and fairness. Ramesh Johari

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

ODD. Answers to Odd-Numbered Problems, 4th Edition of Games and Information, Rasmusen PROBLEMS FOR CHAPTER 1

In the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S.

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Game Theory Lecture #16

Introduction to Game Theory

Preliminary Notions in Game Theory

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

Problem 3 Solutions. l 3 r, 1

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

Introduction to Game Theory Lecture Note 5: Repeated Games

Prisoner s Dilemma. CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma. Prisoner s Dilemma. Prisoner s Dilemma.

Chapter 2 Strategic Dominance

M.Phil. Game theory: Problem set II. These problems are designed for discussions in the classes of Week 8 of Michaelmas term. 1

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

Game Theory - Lecture #8

Rationalizable Strategies

Overuse of a Common Resource: A Two-player Example

Exercises Solutions: Game Theory

Economics and Computation

Simon Fraser University Fall Econ 302 D200 Final Exam Solution Instructor: Songzi Du Wednesday December 16, 2015, 8:30 11:30 AM

Iterated Dominance and Nash Equilibrium

Applying Risk Theory to Game Theory Tristan Barnett. Abstract

Game Theory: Minimax, Maximin, and Iterated Removal Naima Hammoud

Noncooperative Oligopoly

Continuing game theory: mixed strategy equilibrium (Ch ), optimality (6.9), start on extensive form games (6.10, Sec. C)!

14.01 Principles of Microeconomics, Fall 2007 Chia-Hui Chen November 26, Lecture 28. Oligopoly

Chapter 8. Repeated Games. Strategies and payoffs for games played twice

LINEAR PROGRAMMING. Homework 7

Math 135: Answers to Practice Problems

CS711 Game Theory and Mechanism Design

Name. Answers Discussion Final Exam, Econ 171, March, 2012

Econ 711 Homework 1 Solutions

ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

Jianfei Shen. School of Economics, The University of New South Wales, Sydney 2052, Australia

TPPE24 Ekonomisk Analys:

Managerial Economics FtA

Microeconomics of Banking: Lecture 5

Solutions of Bimatrix Coalitional Games

IV. Cooperation & Competition

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5

Economics 109 Practice Problems 1, Vincent Crawford, Spring 2002

Mixed Strategies. Samuel Alizon and Daniel Cownden February 4, 2009

Games of Incomplete Information ( 資訊不全賽局 ) Games of Incomplete Information

MIDTERM ANSWER KEY GAME THEORY, ECON 395

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Chapter 2 Discrete Static Games

A brief introduction to evolutionary game theory

1 Solutions to Homework 4

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

Managerial Economics ECO404 OLIGOPOLY: GAME THEORETIC APPROACH

LECTURE NOTES ON GAME THEORY. Player 2 Cooperate Defect Cooperate (10,10) (-1,11) Defect (11,-1) (0,0)

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

Transcription:

CMPSCI 240: Reasoning about Uncertainty Lecture 21: Game Theory Andrew McGregor University of Massachusetts Last Compiled: April 29, 2017

Outline 1 Game Theory 2

Example: Two-finger Morra Alice and Bob play a game Simultaneously Alice picks a {1, 2} and Bob picks b {1, 2} Bob pays Alice $(a + b) if a + b is even Alice pays Bob $(a + b) if a + b is odd Obviously if Bob always plays the same number, Alice can take advantage of this. What if Bob plays different numbers with different probabilities?

Analysis of Two-finger Morra (1/3) Suppose Bob plays 1 with prob. q and 2 with prob. 1 q If Alice plays 1 then Bob has expected return 2q + 3(1 q) = 5q + 3 If Alice plays 2 then Bob has expected return 3q 4(1 q) = 4 + 7q Then no matter what Alice does, Bob expects to get min( 5q + 3, 4 + 7q) If Bob sets q = 7/12 his expected earning is at least: min( 5 7/12 + 3, 4 + 7 7/12) = 1/12

Analysis of Two-finger Morra (2/3) Suppose Alice plays 1 with prob. p and 2 with prob. 1 p If Bob plays 1 then Alice expects to get 2p 3(1 p) = 3 + 5p If Bob plays 2 then Alice expects to get 3p + 4(1 p) = 7p + 4 Then no matter what Bob does, Alice expects to get min( 3 + 5p, 7p + 4) If Alice sets p = 7/12 her expected earning is at least: min( 3 + 5 7/12, 7 7/12 + 4) = 1/12

Analysis of Two-finger Morra (3/3) Conclusion: Alice s expected earning plus Bob s expected earnings always sum up to 0. Bob can ensure his expected earning is at least 1/12; so Alice s expected earning is at best 1/12 Alice can ensure his expected earning is at least 1/12; so Bob s expected earning is at best 1/12. Hence the strategies that ensured Bob got 1/12 in expectation and Alice got 1/12 in expectation are optimal, i.e., both players is play 1 finger with probability 7/12 and 2 fingers with probability 5/12.

Zero-Sum Games Definition A two-player, simultaneous-move, zero-sum game consists of a set of k options for player A, a set of options l for player B, and a k l payoff matrix P. If A is chooses her ith option and B chooses his jth option then A gets P ij and B gets P ij. For two-finger Morra, the payoff matrix is 1 B Finger 2 B Finger 1 A Finger +2 3 2 A Finger 3 +4 where best strategy was for players to show 1 finger with probability 7/12 and two fingers with probability 5/12.

Pure and Mixed Strategies Definition If a player picks one of their options, we call it a pure strategy. If they pick a distribution over their options, we call it a mixed strategy. If one option is better than the other no matter what the other player does, we say the first strategy dominates the second.

Example: Two-finger Morra with Slightly Different Pay-off 1 B Finger 2 B Finger 1 A Finger +2 3 2 A Finger 2 +4 If Alice plays 1 with prob. p and 2 with prob. 1 p then her expected payoff is at least: min(2p 2(1 p), 3p + 4(1 p)) which is at least 2/11 when p = 6/11. If Bob plays 1 with prob. q and 2 with prob. 1 q then his expected payoff is at least: min( 2q + 3(1 q), 2q 4(1 q)) which is at least 2/11 when q = 7/11. Hence, Alice s best strategy is one finger with probability 6/11 and Bob s best strategy is one finger with probability 7/11.

Example: Three-finger Morra Alice and Bob play a game Simultaneously Alice picks a {1, 2, 3} and Bob picks b {1, 2, 3} Bob pays Alice $(a + b) if a + b is even Alice pays Bob $(a + b) if a + b is odd The payoff matrix is 1 B Finger 2 B Finger 3 B Finger 1 A Finger +2 3 +4 2 A Finger 3 +4 5 3 A Finger +4 5 +6

Analysis of Three-finger Morra (1/2) Suppose A plays 1 with probability r, 2 with probability s, and 3 with probability 1 r s If B plays 1 then A s expected reward is 2r 3s + 4(1 r s) = 4 2r 7s If B plays 2 then A s expected reward is 3r + 4s 5(1 r s) = 5 + 2r + 9s If B plays 3 then A s expected reward is 4r 5s + 6(1 r s) = 6 2r 11s Hence, for r = 1/4, s = 1/2, A gets expected return at least 0

Analysis of Three-finger Morra (2/2) Suppose B plays 1 with probability r, 2 with probability s, and 3 with probability 1 r s If A plays 1 then B s expected reward is 2r + 3s 4(1 r s) = 4 + 2r + 7s If A plays 2 then B s expected reward is 3r 4s + 5(1 r s) = 5 2r 9s If A plays 3 then B s expected reward is 4r + 5s 6(1 r s) = 6 + 2r + 11s Hence, for r = 1/4, s = 1/2, B gets expected return at least 0 Hence, best strategy for each player is show 1 finger with probability 1/4 and 2 fingers with probability 1/2.

Outline 1 Game Theory 2

Prisoner s Dilemma Two prisoners are being held pending trial for a crime they are alleged to have committed. The prosecutor offers each a deal: Give evidence against your partner and you ll go free, unless your partner also confesses. If both confess, both get 5 year sentences. If neither confess, both get 1 year sentences. If you don t confess but your partner does, you get 10 years! Can represent this as a game but it s not zero-sum: B Confesses B Stays Mute A Confesses 5, 5 0, 10 A Stays Mute 10, 0 1, 1 In each entry, the first number is A s reward and the second number of B s reward.

Nash Equilibrium Definition A Nash Equilibrium is a set of strategies for each player where no change by one player alone can improve his outcome. For the prisoners dilemma the unique Nash Equilibrium is that both prisoners confess. Theorem (Nash) Every game where each player has a finite number of options, has at least one Nash equilibrium.