CMPSCI 240: Reasoning about Uncertainty

Similar documents
CMPSCI 240: Reasoning about Uncertainty

Introduction to Multi-Agent Programming

Now we return to simultaneous-move games. We resolve the issue of non-existence of Nash equilibrium. in pure strategies through intentional mixing.

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

Elements of Economic Analysis II Lecture X: Introduction to Game Theory

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

preferences of the individual players over these possible outcomes, typically measured by a utility or payoff function.

m 11 m 12 Non-Zero Sum Games Matrix Form of Zero-Sum Games R&N Section 17.6

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

S 2,2-1, x c C x r, 1 0,0

CS711 Game Theory and Mechanism Design

Game Theory. Analyzing Games: From Optimality to Equilibrium. Manar Mohaisen Department of EEC Engineering

Outline for today. Stat155 Game Theory Lecture 19: Price of anarchy. Cooperative games. Price of anarchy. Price of anarchy

Can we have no Nash Equilibria? Can you have more than one Nash Equilibrium? CS 430: Artificial Intelligence Game Theory II (Nash Equilibria)

Introduction to Game Theory

Stochastic Games and Bayesian Games

Game Theory - Lecture #8

Player 2 L R M H a,a 7,1 5,0 T 0,5 5,3 6,6

LECTURE 4: MULTIAGENT INTERACTIONS

Introduction to Game Theory

Prisoner s dilemma with T = 1

Stochastic Games and Bayesian Games

MATH 4321 Game Theory Solution to Homework Two

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Static Games and Cournot. Competition

Game theory and applications: Lecture 1

Game Theory. Important Instructions

Introduction to Game Theory Lecture Note 5: Repeated Games

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

An introduction on game theory for wireless networking [1]

Rationalizable Strategies

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

Chapter 2 Strategic Dominance

Economics 109 Practice Problems 1, Vincent Crawford, Spring 2002

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

1. better to stick. 2. better to switch. 3. or does your second choice make no difference?

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

In the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S.

Introduction to Game Theory

ANASH EQUILIBRIUM of a strategic game is an action profile in which every. Strategy Equilibrium

Continuing game theory: mixed strategy equilibrium (Ch ), optimality (6.9), start on extensive form games (6.10, Sec. C)!

G5212: Game Theory. Mark Dean. Spring 2017

Thursday, March 3

Name. Answers Discussion Final Exam, Econ 171, March, 2012

Regret Minimization and Security Strategies

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

Problem Set 2 - SOLUTIONS

Agenda. Game Theory Matrix Form of a Game Dominant Strategy and Dominated Strategy Nash Equilibrium Game Trees Subgame Perfection

Repeated Games. Econ 400. University of Notre Dame. Econ 400 (ND) Repeated Games 1 / 48

January 26,

Game Theory: Minimax, Maximin, and Iterated Removal Naima Hammoud

Economics 171: Final Exam

Solution to Tutorial 1

Iterated Dominance and Nash Equilibrium

Solution to Tutorial /2013 Semester I MA4264 Game Theory

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

CS711: Introduction to Game Theory and Mechanism Design

Repeated, Stochastic and Bayesian Games

CS 798: Homework Assignment 4 (Game Theory)

CSI 445/660 Part 9 (Introduction to Game Theory)

Player 2 H T T -1,1 1, -1

Yao s Minimax Principle

Lecture 5 Leadership and Reputation

Prisoner s Dilemma. CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma. Prisoner s Dilemma. Prisoner s Dilemma.

HE+ Economics Nash Equilibrium

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

10.1 Elimination of strictly dominated strategies

2 Game Theory: Basic Concepts

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

G5212: Game Theory. Mark Dean. Spring 2017

Mixed Strategies. Samuel Alizon and Daniel Cownden February 4, 2009

Game Theory. Wolfgang Frimmel. Repeated Games

GAME THEORY. Game theory. The odds and evens game. Two person, zero sum game. Prototype example

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

15.053/8 February 28, person 0-sum (or constant sum) game theory

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

Test 1. ECON3161, Game Theory. Tuesday, September 25 th

GAME THEORY. (Hillier & Lieberman Introduction to Operations Research, 8 th edition)

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

BRIEF INTRODUCTION TO GAME THEORY

Sequential-move games with Nature s moves.

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Microeconomics Comprehensive Exam

Econ 323 Microeconomic Theory. Practice Exam 2 with Solutions

Game Theory Lecture #16

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Outline. Risk and Decision Analysis 5. Game Theory. What is game theory? Outline. Scope of game theory. Two-person zero sum games

Their opponent will play intelligently and wishes to maximize their own payoff.

Problem 3 Solutions. l 3 r, 1

Econ 323 Microeconomic Theory. Chapter 10, Question 1

1 Solutions to Homework 3

Week 8: Basic concepts in game theory

Microeconomics of Banking: Lecture 5

Economic Management Strategy: Hwrk 1. 1 Simultaneous-Move Game Theory Questions.

Repeated Games with Perfect Monitoring

Transcription:

CMPSCI 240: Reasoning about Uncertainty Lecture 23: More Game Theory Andrew McGregor University of Massachusetts Last Compiled: April 20, 2017

Outline 1 Game Theory 2 Non Zero-Sum Games and Nash Equilibrium 3 Iterated Prisoners Dilemma

Last Time: Zero-Sum Games Definition A two-player, simultaneous-move, zero-sum game consists of a set of k options for player A, a set of options l for player B, and a k l payoff matrix P. If A is chooses her ith option and B chooses his jth option then A gets P ij and B gets P ij. For two-finger Morra, the payoff matrix is 1 B Finger 2 B Finger 1 A Finger +2 3 2 A Finger 3 +4 where best strategy was for players to show 1 finger with probability 7/12 and two fingers with probability 5/12.

Outline 1 Game Theory 2 Non Zero-Sum Games and Nash Equilibrium 3 Iterated Prisoners Dilemma

Prisoner s Dilemma Two prisoners are being held pending trial for a crime they are alleged to have committed. The prosecutor offers each a deal: Give evidence against your partner and you ll go free, unless your partner also confesses. If both confess, both get 5 year sentences. If neither confess, both get 1 year sentences. If you don t confess but your partner does, you get 10 years! Can represent this as a game but it s not zero-sum: B Confesses B Stays Mute A Confesses 5, 5 0, 10 A Stays Mute 10, 0 1, 1

Nash Equilibrium Definition A Nash Equilibrium is a set of strategies for each player where no change by one player alone can improve his outcome. For the prisoners dilemma the unique Nash Equilibrium is that both prisoners confess. Theorem (Nash) Every game where each player has a finite number of options, has at least one Nash equilibrium.

Hawks and Doves Two birds meet over a piece of food and have to decide whether to act aggressive (hawkish) or passive (dovish) If a hawk meets a dove, the hawk gets the food worth 50 points If two hawks meet they both loose -25 points If two doves meet, they both get 15 points Can represent this as: B is a Hawk B is a Dove A is a Hawk 25, 25 50, 0 A is a Dove 0, 50 15, 15 No Nash Equilibrium where both players play same pure strategies: If A and B are Hawks, both would prefer to switch to Doves If A and B are Doves, both would prefer to switch to Hawks A plays Hawk and B plays Dove is a Nash Equilibria and vice versa

A Mixed Strategy Nash Equilibrium for Hawks and Doves Suppose Alice play Hawk with probability 0 < q < 1 and Bob plays Hawk with probability 0 < p < 1 Alice s expected reward is 25pq + 50q(1 p) + 15(1 p)(1 q) = 60pq + 35q + 15 15p = q(35 60p) + 15 15p When can Alice not improve by changing q? When p = 7/12. Bob s expected reward is 60pq + 35p + 15 15q = p(35 60q) + 15 15q When can Bob not improve by changing p? When q = 7/12. Hence p = q = 7/12 is only Nash Equilibrium with mixed strategies.

Outline 1 Game Theory 2 Non Zero-Sum Games and Nash Equilibrium 3 Iterated Prisoners Dilemma

Iterated Prisoners Dilemma Suppose you are playing an prisoner dilemma game multiple times with someone, what should you do? Recall the sentencing matrix: B Confesses B Stays Mute A Confesses -5,-5 0, -10 A Stays Mute -10, 0-1, -1 Suppose after each game, we play again with probability p. We can model the number of games played as a geometric random variable. Since the stopping condition is the probability of of not continuing the game, we can say that the expected number of games is 1 1 p.

Playing against someone who always confesses If you know the other player will always confess, you minimize your losses by always confessing as well. Let X be a random variable representing the number of rounds for which the game is played. Since you d lose 5 units every round, your expected payoff is 5 E(X ) = 5 1 p

Playing against someone who retaliates Suppose you know your opponent will confess on every turn once you have confessed once, but will stay mute until then. For what values of p should you confess on the first turn? Let X be a random variable representing the number of rounds for which the game is played. If you confess from the first round onwards your payoff is: which is 5/(1 p) + 5. 0 + ( 5) (X 1) If you never confess, your payoff is 1/(1 p). Hence you should confess from the start if i.e., p 1/5. 5/(1 p) + 5 1/(1 p)