Game Theory: introduction and applications to computer networks

Similar documents
Game Theory: introduction and applications to computer networks

Using the Maximin Principle

Game Theory: Minimax, Maximin, and Iterated Removal Naima Hammoud

MATH 121 GAME THEORY REVIEW

An introduction on game theory for wireless networking [1]

CS711: Introduction to Game Theory and Mechanism Design

Game theory for. Leonardo Badia.

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

Game theory and applications: Lecture 1

15.053/8 February 28, person 0-sum (or constant sum) game theory

Econ 323 Microeconomic Theory. Practice Exam 2 with Solutions

Econ 323 Microeconomic Theory. Chapter 10, Question 1

LINEAR PROGRAMMING. Homework 7

CHAPTER 14: REPEATED PRISONER S DILEMMA

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

Exercises Solutions: Game Theory

Introductory Microeconomics

Applying Risk Theory to Game Theory Tristan Barnett. Abstract

CS711 Game Theory and Mechanism Design

Introduction to Game Theory

MAT 4250: Lecture 1 Eric Chung

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Lecture 3 Representation of Games

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

G5212: Game Theory. Mark Dean. Spring 2017

TPPE24 Ekonomisk Analys:

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Thursday, March 3

Math 167: Mathematical Game Theory Instructor: Alpár R. Mészáros

Sequential-move games with Nature s moves.

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

TUFTS UNIVERSITY DEPARTMENT OF CIVIL AND ENVIRONMENTAL ENGINEERING ES 152 ENGINEERING SYSTEMS Spring Lesson 16 Introduction to Game Theory

Microeconomics of Banking: Lecture 5

CMPSCI 240: Reasoning about Uncertainty

Economics 171: Final Exam

Introduction to Multi-Agent Programming

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

Finding Equilibria in Games of No Chance

Can we have no Nash Equilibria? Can you have more than one Nash Equilibrium? CS 430: Artificial Intelligence Game Theory II (Nash Equilibria)

February 23, An Application in Industrial Organization

m 11 m 12 Non-Zero Sum Games Matrix Form of Zero-Sum Games R&N Section 17.6

Algorithms and Networking for Computer Games

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

Repeated, Stochastic and Bayesian Games

Preliminary Notions in Game Theory

Player 2 H T T -1,1 1, -1

Game Theory. Important Instructions

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

1. better to stick. 2. better to switch. 3. or does your second choice make no difference?

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

(a) Describe the game in plain english and find its equivalent strategic form.

Repeated Games with Perfect Monitoring

Solution to Tutorial 1

GAME THEORY. Game theory. The odds and evens game. Two person, zero sum game. Prototype example

Solution to Tutorial /2013 Semester I MA4264 Game Theory

Comparative Study between Linear and Graphical Methods in Solving Optimization Problems

Agenda. Game Theory Matrix Form of a Game Dominant Strategy and Dominated Strategy Nash Equilibrium Game Trees Subgame Perfection

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2015

Rationalizable Strategies

Elements of Economic Analysis II Lecture X: Introduction to Game Theory

IV. Cooperation & Competition

Introduction to game theory LECTURE 2

Epistemic Game Theory

Repeated Games. September 3, Definitions: Discounting, Individual Rationality. Finitely Repeated Games. Infinitely Repeated Games

GAME THEORY. (Hillier & Lieberman Introduction to Operations Research, 8 th edition)

Topic One: Zero-sum games and saddle point equilibriums

The Ohio State University Department of Economics Econ 601 Prof. James Peck Extra Practice Problems Answers (for final)

Game Theory - Lecture #8

Lecture 6 Dynamic games with imperfect information

Cooperative Game Theory. John Musacchio 11/16/04

Week 8: Basic concepts in game theory

Their opponent will play intelligently and wishes to maximize their own payoff.

Player 2 L R M H a,a 7,1 5,0 T 0,5 5,3 6,6

Outline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010

ECO303: Intermediate Microeconomic Theory Benjamin Balak, Spring 2008

Chapter 2 Strategic Dominance

Stochastic Games and Bayesian Games

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.

Warm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games

Economics 109 Practice Problems 1, Vincent Crawford, Spring 2002

S 2,2-1, x c C x r, 1 0,0

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Lecture Note Set 3 3 N-PERSON GAMES. IE675 Game Theory. Wayne F. Bialas 1 Monday, March 10, N-Person Games in Strategic Form

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Simon Fraser University Fall Econ 302 D200 Final Exam Solution Instructor: Songzi Du Wednesday December 16, 2015, 8:30 11:30 AM

The Ohio State University Department of Economics Second Midterm Examination Answers

Chapter 8. Repeated Games. Strategies and payoffs for games played twice

CMPSCI 240: Reasoning about Uncertainty

Iterated Dominance and Nash Equilibrium

Elements of Economic Analysis II Lecture XI: Oligopoly: Cournot and Bertrand Competition

Advanced Microeconomics

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2017

In the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S.

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

MA300.2 Game Theory 2005, LSE

Stochastic Games and Bayesian Games

University at Albany, State University of New York Department of Economics Ph.D. Preliminary Examination in Microeconomics, June 20, 2017

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Department of Agricultural Economics. PhD Qualifier Examination. August 2010

Transcription:

Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up) Giovanni Neglia INRIA EPI Maestro 20 January 2014 Part of the slides are based on a previous course with D. Figueiredo (UFRJ) and H. Zhang (Suffolk University)

Saddle Points main theorem The game has a saddle point iff max v min w u(v,w) = min w max v u(v,w) Rose Colin A B D min w A 12-1 0-1 B 5 1-20 -20 C 3 2 3 2 D -16 0 16-16 max v 12 2 16 Rose C ε argmax min w u(v,w) most cautious strategy for Rose: it secures the maximum worst case gain independently from Colin s action (the game maximin value) Colin B ε argmin max v u(v,w) most cautious strategy for Colin: it secures the minimum worst case loss (the game minimax value)

Saddle Points main theorem Another formulation: The game has a saddle point iff maximin = minimax, This value is called the value of the game

Saddle Points main theorem The game has a saddle point iff N.C. max v min w u(v,w) = min w max v u(v,w) Two preliminary remarks 1. It holds (always) max v min w u(v,w) <= min w max v u(v,w) because min w u(v,w)<=u(v,w)<=max v u(v,w) for all v and w 2. By definition, (x,y) is a saddle point iff u(x,y)<=u(x,w) for all w in S Colin i.e. u(x,y)=min w u(x,w) u(x,y) >= u(v,y) for all v in S Rose i.e. u(x,y)=max v u(v,y)

Saddle Points main theorem The game has a saddle point iff max v min w u(v,w) = min w max v u(v,w) 1. max v min w u(v,w) <= min w max v u(v,w) 2. if (x,y) is a saddle point o u(x,y)=min w u(x,w), u(x,y)=max v u(v,y) N.C. u(x,y)=min w u(x,w)<=max v min w u(v,w)<=min w max v u(v,w)<=max v u(v,y)=u(x,y)

Saddle Points main theorem The game has a saddle point iff max v min w u(v,w) = min w max v u(v,w) S.C. x in argmax min w u(v,w) y in argmin max v u(v,w) We prove that (x,y) is a saddle-point w 0 in argmin w u(x,w) (max v min w u(v,w)=u(x,w 0 )) v 0 in argmax v u(v,y) (min w max v u(v,w)=u(v 0,y)) u(x,w 0 )=min w u(x,w)<=u(x,y)<=max v u(v,y)=u(v 0,y) v 0 w 0 x <= y <= But u(x,w 0 )=u(v 0,y) by hypothesis, then u(x,y) = min w u(x,w) = max v (v,y)

Saddle Points main theorem The game has a saddle point iff max v min w u(v,w) = min w max v u(v,w) Colin A B D min w Rose A 12-1 0-1 B 5 1-20 -20 C 3 2 3 2 D -16 0 16-16 max v 12 2 16 This result provides also another way to find saddle points

Properties Given two saddle points (x 1,y 1 ) and (x 2,y 2 ), they have the same payoff (equivalence property): it follows from previous proof: u(x 1,y 1 ) = max v min w u(v,w) = u(x 2,y 2 ) (x 1,y 2 ) and (x 2,y 1 ) are also saddle points(interchangeability property): as in previous proof y 1 y 2 They make saddle point a very nice solution! x 2 x 1 <= <=

What is left? There are games with no saddle-point! An example? R P S min R R 0-1 1-1 P P 1 0-1 -1 maximin S S -1 1 0-1 max max 1 1 1 minimax maximin <> minimax

What is left? There are games with no saddle-point! An example? An even simpler one A B min A 2 0 0 maximin B -5 3-5 max 2 3 minimax

Some practice: find all the saddle points A B C D A 3 2 4 2 B 2 1 3 0 C 2 2 2 2 A B C A -2 0 4 B 2 1 3 C 3-1 -2 A B C A 4 3 8 B 9 5 1 C 2 7 6

Games with no saddle points Colin A B Rose A 2 0 B -5 3 What should players do? resort to randomness to select strategies

Mixed Strategies Each player associates a probability distribution over its set of strategies Expected value principle: maximize the expected payoff Rose Colin 1/3 2/3 A B A 2 0 B -5 3 Rose s expected payoff when playing A = 1/3*2+2/3*0=2/3 Rose s expected payoff when playing B = 1/3*-5+2/3*3=1/3 How should Colin choose its prob. distribution?

Rose 2x2 game Colin p 1-p A B A 2 0 B -5 3 How should Colin choose its prob. distribution? o o Rose cannot take advantage of p=3/10 0 3/10 1 Rose s exp. gain when playing A = 2p + (1-p)*0 = 2p Rose s expected payoff Rose s exp. gain when playing B = -5*p + (1-p)*3 = 3-8p for p=3/10 Colin guarantees a loss of 3/5, what about Rose s? 3 0 2-5 p

Rose 1-q q 2x2 game Colin A B A 2 0 B -5 3 3-5 0 8/10 1 2 0 Colin s expected loss q Colin s exp. loss when playing A = 2q -5*(1-q) = 7q-5 Colin s exp. loss when playing B = 0*q+3*(1-q) = 3-3q How should Rose choose its prob. distribution? o Colin cannot take advantage of q=8/10 o for q=8/10 Rose guarantees a gain of?

2x2 game Rose 1-q q Colin p 1-p A B A 2 0 B -5 3 3 0 Rose s expected payoff 2 3 Colin s expected loss -5-5 0 3/10 1 p 0 8/10 1 q 2 0 Rose playing the mixed strategy (8/10,2/10) and Colin playing the mixed strategy (3/10,7/10) is the equilibrium of the game o No player has any incentives to change, because any other choice would allow the opponent to gain more o Rose gain 3/5 and Colin loses 3/5

Rose 1-x-y y x mx2 game Colin p 1-p A B A 2 0 B -5 3 C 3-5 3 0-5 0 3/10 1 3 2-5 p Rose s expected payoff By playing p=3/10, Colin guarantees max exp. loss = 3/5 o it loses 3/5 if Rose plays A or B, it wins 13/5 if Rose plays C Rose should not play strategy C

Rose 1-x-y y x mx2 game Colin p 1-p A B A 2 0 B -5 3 C 3-5 3 0 Colin s expected loss 1 y (8/10,2/10,3/5) Then Rose should play mixed strategy(8/10,2/10,0) guaranteeing a gain not less than 3/5 1 x -5

Minimax Theorem Every two-person zero-sum game has a solution, i.e, there is a unique value v (value of the game) and there are optimal (pure or mixed) strategies such that Rose s optimal strategy guarantees to her a payoff >= v (no matter what Colin does) Colin s optimal strategies guarantees to him a payoff <= v (no matter what Rose does) This solution can always be found as the solution of a kxk subgame Proved by John von Neumann in 1928! birth of game theory

How to solve mxm games if all the strategies are used at the equilibrium, the probability vector is such to make equivalent for the opponent all its strategies a linear system with m-1 equations and m-1 variables if it has no solution, then we need to look for smaller subgames Rose 1-x-y y x Colin A B C A 2 0 1 B -5 3-2 C 3-5 3 Example: 2x-5y+3(1-x-y)=0x+3y-5(1-x-y) 2x-5y+3(1-x-y)=1x-2y+3(1-x-y)

How to solve 2x2 games Rose If the game has no saddle point 1-q q calculate the absolute difference of the payoffs achievable with a strategy invert them normalize the values so that they become probabilities Colin p 1-p A B A 2 0 B -5 3 2-0 =2-5-3 =8 8 2 8/10 2/10

How to solve mxn matrix games 1. Eliminate dominated strategies 2. Look for saddle points (solution of 1x1 games), if found stop 3. Look for a solution of all the hxh games, with h=min{m,n}, if found stop 4. Look for a solution of all the (h-1)x(h-1) games, if found stop 5. h+1. Look for a solution of all the 2x2 games, if found stop Remark: when a potential solution for a specific kxk game is found, it should be checked that Rose s m-k strategies not considered do not provide her a better outcome given Colin s mixed strategy, and that Colin s n-k strategies not considered do not provide him a better outcome given Rose s mixed strategy.

Game Theory: introduction and applications to computer networks Two-person non zero-sum games Giovanni Neglia INRIA EPI Maestro Slides are based on a previous course with D. Figueiredo (UFRJ) and H. Zhang (Suffolk University)

Outline Two-person zero-sum games Matrix games Pure strategy equilibria (dominance and saddle points), ch 2 Mixed strategy equilibria, ch 3 Game trees, ch 7 Two-person non-zero-sum games Nash equilibria And its limits (equivalence, interchangeability, Prisoner s dilemma), ch. 11 and 12 Strategic games, ch. 14 Subgame Perfect Nash Equilibria (not in the book) Repeated Games, partially in ch. 12 Evolutionary games, ch. 15 N-persons games

Two-person Non-zero Sum Games Players are not strictly opposed payoff sum is non-zero Player 2 A B Player 1 A 3, 4 2, 0 B 5, 1-1, 2 Situations where interest is not directly opposed players could cooperate communication may play an important role for the moment assume no communication is possible

What do we keep from zero-sum games? Dominance Movement diagram Player 1 pay attention to which payoffs have to be considered to decide movements Player 2 A B A 5, 4 2, 0 B 3, 1-1, 2 Enough to determine pure strategies equilibria but still there are some differences (see after)

What can we keep from zero-sum games? As in zero-sum games, pure strategies equilibria do not always exist Player 1 Player 2 A B A 5, 0-1, 4 B 3, 2 2, 1 but we can find mixed strategies equilibria