OPERATIONS RESEARCH. Game Theory

Similar documents
Static (or Simultaneous- Move) Games of Complete Information

Games and Decisions. Part I: Basic Theorems. Contents. 1 Introduction. Jane Yuxin Wang. 1 Introduction 1. 2 Two-player Games 2

A MODEL OF COMPETITION AMONG TELECOMMUNICATION SERVICE PROVIDERS BASED ON REPEATED GAME

Lecture 7. We now use Brouwer s fixed point theorem to prove Nash s theorem.

Problems to be discussed at the 5 th seminar Suggested solutions

Lecture Note 1: Foundations 1

ECE 586GT: Problem Set 2: Problems and Solutions Uniqueness of Nash equilibria, zero sum games, evolutionary dynamics

Introduction to game theory

Price and Quantity Competition Revisited. Abstract

TCOM501 Networking: Theory & Fundamentals Final Examination Professor Yannis A. Korilis April 26, 2002

Chapter 5 Student Lecture Notes 5-1

Linear Combinations of Random Variables and Sampling (100 points)

Quiz on Deterministic part of course October 22, 2002

Dr. A. Sudhakaraiah* V. Rama Latha E.Gnana Deepika

Random Variables. b 2.

II. Random Variables. Variable Types. Variables Map Outcomes to Numbers

Elements of Economic Analysis II Lecture VI: Industry Supply

Appendix - Normally Distributed Admissible Choices are Optimal

AC : THE DIAGRAMMATIC AND MATHEMATICAL APPROACH OF PROJECT TIME-COST TRADEOFFS

Applications of Myerson s Lemma

references Chapters on game theory in Mas-Colell, Whinston and Green

Lecture Note 2 Time Value of Money

Economic Design of Short-Run CSP-1 Plan Under Linear Inspection Cost

Understanding Annuities. Some Algebraic Terminology.

Ch Rival Pure private goods (most retail goods) Non-Rival Impure public goods (internet service)

15-451/651: Design & Analysis of Algorithms January 22, 2019 Lecture #3: Amortized Analysis last changed: January 18, 2019

Problem Set #4 Solutions

UNIVERSITY OF NOTTINGHAM

Equilibrium in Prediction Markets with Buyers and Sellers

occurrence of a larger storm than our culvert or bridge is barely capable of handling? (what is The main question is: What is the possibility of

- contrast so-called first-best outcome of Lindahl equilibrium with case of private provision through voluntary contributions of households

Elton, Gruber, Brown and Goetzmann. Modern Portfolio Theory and Investment Analysis, 7th Edition. Solutions to Text Problems: Chapter 4

CS 286r: Matching and Market Design Lecture 2 Combinatorial Markets, Walrasian Equilibrium, Tâtonnement

Problem Set 6 Finance 1,

To find a non-split strong dominating set of an interval graph using an algorithm

3: Central Limit Theorem, Systematic Errors

Scribe: Chris Berlind Date: Feb 1, 2010

Chapter 3 Student Lecture Notes 3-1

Jeffrey Ely. October 7, This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License.

CHAPTER 3: BAYESIAN DECISION THEORY

In this appendix, we present some theoretical aspects of game theory that would be followed by players in a restructured energy market.

Tests for Two Correlations

Single-Item Auctions. CS 234r: Markets for Networks and Crowds Lecture 4 Auctions, Mechanisms, and Welfare Maximization

EDC Introduction

Solution of periodic review inventory model with general constrains

General Examination in Microeconomic Theory. Fall You have FOUR hours. 2. Answer all questions

A Utilitarian Approach of the Rawls s Difference Principle

MgtOp 215 Chapter 13 Dr. Ahn

Capacitated Location-Allocation Problem in a Competitive Environment

Mathematical Thinking Exam 1 09 October 2017

Topics on the Border of Economics and Computation November 6, Lecture 2

Economics 1410 Fall Section 7 Notes 1. Define the tax in a flexible way using T (z), where z is the income reported by the agent.

ISyE 2030 Summer Semester 2004 June 30, 2004

Bargaining over Strategies of Non-Cooperative Games

A Network Modeling Approach for the Optimization of Internet-Based Advertising Strategies and Pricing with a Quantitative Explanation of Two Paradoxes

IND E 250 Final Exam Solutions June 8, Section A. Multiple choice and simple computation. [5 points each] (Version A)

Privatization and government preference in an international Cournot triopoly

The Influence of Nature on Outcomes of Three Players Game

Final Exam. 7. (10 points) Please state whether each of the following statements is true or false. No explanation needed.

EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY

INTRODUCTION TO MACROECONOMICS FOR THE SHORT RUN (CHAPTER 1) WHY STUDY BUSINESS CYCLES? The intellectual challenge: Why is economic growth irregular?

Elton, Gruber, Brown, and Goetzmann. Modern Portfolio Theory and Investment Analysis, 7th Edition. Solutions to Text Problems: Chapter 9

YORK UNIVERSITY Faculty of Science Department of Mathematics and Statistics MATH A Test #2 November 03, 2014

COS 511: Theoretical Machine Learning. Lecturer: Rob Schapire Lecture #21 Scribe: Lawrence Diao April 23, 2013

Hedging Greeks for a portfolio of options using linear and quadratic programming

Financial mathematics

Tests for Two Ordered Categorical Variables

Volume 30, Issue 1. Partial privatization in price-setting mixed duopoly. Kazuhiro Ohnishi Institute for Basic Economic Science, Japan

SIMPLE FIXED-POINT ITERATION

Instituto de Engenharia de Sistemas e Computadores de Coimbra Institute of Systems Engineering and Computers INESC - Coimbra

Dynamic Analysis of Knowledge Sharing of Agents with. Heterogeneous Knowledge

A DUAL EXTERIOR POINT SIMPLEX TYPE ALGORITHM FOR THE MINIMUM COST NETWORK FLOW PROBLEM

Data Mining Linear and Logistic Regression

Likelihood Fits. Craig Blocker Brandeis August 23, 2004

Mode is the value which occurs most frequency. The mode may not exist, and even if it does, it may not be unique.

Attorneys' Compensation in Litigation with Bilateral Delegation

Interval Estimation for a Linear Function of. Variances of Nonnormal Distributions. that Utilize the Kurtosis

Clearing Notice SIX x-clear Ltd

Finance 402: Problem Set 1 Solutions

Meaningful cheap talk must improve equilibrium payoffs

Production and Supply Chain Management Logistics. Paolo Detti Department of Information Engeneering and Mathematical Sciences University of Siena

Lecture 8. v i p i if i = ī, p i otherwise.

Notes are not permitted in this examination. Do not turn over until you are told to do so by the Invigilator.

Analogy-Based Expectation Equilibrium

Testing alternative theories of financial decision making: a survey study with lottery bonds

ON THE COMPLEMENTARITY BETWEEN LAND REFORMS AND TRADE REFORMS

Stochastic optimal day-ahead bid with physical future contracts

Stackelberg vs. Nash in Security Games: Interchangeability, Equivalence, and Uniqueness

Consumption Based Asset Pricing

Mutual Funds and Management Styles. Active Portfolio Management

Final Examination MATH NOTE TO PRINTER

A Php 5,000 loan is being repaid in 10 yearly payments. If interest is 8% effective, find the annual payment. 1 ( ) 10) 0.

Chapter 15: Debt and Taxes

Note on Cubic Spline Valuation Methodology

Creating a zero coupon curve by bootstrapping with cubic splines.

Optimal policy for FDI incentives: An auction theory approach

Wages as Anti-Corruption Strategy: A Note

iii) pay F P 0,T = S 0 e δt when stock has dividend yield δ.

Contests with Group-Specific Public-Good Prizes

Wage-rise contract and endogenous timing in international mixed duopoly

Transcription:

OPERATIONS RESEARCH Chapter 2 Game Theory Prof. Bbhas C. Gr Department of Mathematcs Jadavpur Unversty Kolkata, Inda Emal: bcgr.umath@gmal.com

1.0 Introducton Game theory was developed for decson makng under conflctng stuatons where there are one or more opponents (players). The games lke chess, poker, etc. have the characterstcs of competton and are played accordng to some defnte rules. Game theory provdes optmal solutons to such games, assumng that each of the players wants to maxmze hs proft or mnmze hs loss. Game theory has applcatons n a varety of areas ncludng busness and economcs. In 1994, the Nobel Prze for Economc Scences was won by John F. Nash, Jr., John C. Harsany, and Renhard Selton for ther analyss of equlbra n the theory of noncooperatve games. Later, n 2005, Robert J. Aumann and Thomas C. Schellng won the Nobel Prze for Economc Scences for enhancng our understandng of conflct and cooperaton through game theory analyss.

MODULE - 1: Basc Concept and Termnologes, Two-person Zero-sum Game, and Game wth Pure and Mxed Strateges In ths Module, we wll dscuss some basc termnologes used n Game Theory, twoperson zero-sum game and games wth pure and mxed strateges. 1.1 Basc Termnologes The followng termnologes are commonly used n Game theory. Player : Each partcpant (nterested party) of a game s called a player. Strategy : The strategy of a player s the predetermned rule by whch a player decdes hs course of acton from the lst of courses of acton durng the game. A strategy may be of two types: Pure strategy - It s a decson, n advance of all plays, always to choose a partcular course of acton. Mxed strategy - It s a decson, n advance of all plays, to choose a course of acton for each play n accordance wth some partcular probablty dstrbuton. Optmal strategy : The course of acton whch maxmzes the proft of a player or mnmzes hs loss s called an optmal strategy. Payoff : The outcome of playng a game s called payoff. Payoff matrx : When the players select ther partcular strateges, the payoffs (gans or losses) can be represented n the form of a matrx called the payoff matrx. 3

Saddle pont : A saddle pont s an element of the payoff matrx, whch s both the smallest element n ts row and the largest element n ts column. Furthermore, the saddle pont s also regarded as an equlbrum pont n the theory of games. Value of the game : It refers to the expected outcome per play when players follow ther optmal strategy. 1.2 Two-Person Zero-Sum Game A game wth only two players s called a two-person zero-sum game f the losses of one player are equvalent to the gans of the other so that the sum of ther net gans s zero. Ths game also known as rectangular game. In a two-person game, suppose that the player A has m actvtes and the player B has n actvtes. Then a payoff matrx can be formed by adoptng the followng rules: () Row desgnatons for each matrx are actvtes avalable to the player A. () Column desgnatons for each matrx are actvtes avalable to the player B. () Cell entry v s the payment to the player A n A s payoff matrx when A chooses the actvty and B chooses the actvty. (v) For a zero-sum game, the cell entry n the player B s payoff matrx wll be negatve correspondng to the cell entry v n the player A s payoff matrx so that the sum of payoff matrces for the players A and B s ultmately zero, see Tables 1.1 and 1.2. 1 2 n 1 2 n 1 v 11 v 12 v 1n 1 v 11 v 12 v 1n 2 v 21 v 22 v 2n..... 2 v 21 v 22 v 2n..... m v m1 v m2 v mn m v m1 v m2 v mn Table 1.1: s payoff matrx Table 1.2: s payoff matrx Consder a two-person con tossng game. Each player tosses an unbased con smultaneously. Each player selects ether a head H or a tal T. If the outcomes match (.e., (H, H) or (T, T)) then A wns Rs. 4 from B; otherwse, B wns Rs. 3 from A. s

payoff matrx s gven n Table 1.3. Ths game s a two-person zero-sum game, snce the wnnng of one player s taken as losses for the other. Each player has hs choce from amongst two pure strateges H and T. H T H 4-3 T -3 4 Table 1.3 1.3 Pure Strateges (Mnmax and Maxmn Crteron) The smplest type of game s one where the best strateges for both players are pure strateges. Ths s the case f and only f, the payoff matrx contans a saddle pont. Theorem 1.1: Let (v ) be the m n payoff matrx for a two-person zero-sum game. If v denotes the maxmn value and v the mnmax value of the game, then v v. That s, mn [max {v }] max [mn {v }]. Proof: We have max {v } v f or all = 1,2,...,n and mn {v } v f or all = 1,2,...,m Let the above maxmum and mnmum values be attaned at = 1 and = 1, respectvely,.e., Then, we must have max {v } = v 1 and mn {v } = v 1 v 1 v v 1 f or all = 1,2,...,n; f or all = 1,2,...,m. From ths, we get Therefore, mn mn v 1 v max [max {v }] max v 1 f or all = 1,2,...,n; = 1,2,...,m. [mn {v }].

Note: A game s sad to be far, f v = 0 = v and t s sad to be strctly determnable f v = v = v. Example 1.1: Consder a two-person zero-sum game matrx whch represents payoff to the player A, see Table 1.4. Fnd the optmal strategy, f any. I II III IV V I -2 0 0 5 3 II 4 2 1 3 2 III -4-3 0-2 6 IV 5 3-4 2-6 Table 1.4: Payoff matrx for Example 1.1 Soluton: We use the maxmn (mnmax) prncple to determne the optmal strategy. The player A wshes to obtan the largest possble v by choosng one of hs actvtes (I, II, III, IV), whle the player B s determned to make A s gan the mnmum possble by choce of actvtes from hs lst (I, II, III, IV, V). The player A s called the maxmzng player and B, the mnmzng player. If player A chooses the actvty I then t could I II III IV V Row mnmum I -2 0 0 5 3-2 II 4 2 1 3 2 1 Maxmn III -4-3 0-2 6-4 IV 5 3-4 2-6 -6 Column maxmum 5 3 1 5 6 Mnmax Table 1.5: s payoff matrx happen that the player B also chooses hs actvty I. In ths case, the player B can guarantee a gan of at least 2 to player A,.e., mn{ 2,0,0,5,3} = 2. Smlarly, for other choces of player A,.e., actvtes II, III and IV, B can force the player A to gan only 1, 4 and 6, respectvely, by proper choces from (II, III, IV).e., mn{4,2,1,3,2} = 1, mn{ 4, 3,0, 2,6} = 4 and mn{5,3, 4,2, 6} = 6. For player A, mnmum value n each row represents the least gan to hm f he chooses hs partcular strategy. These

are wrtten n Table 1.5 by row mnmum. wll select the strategy that maxmzes hs mnmum gans,.e., max{ 2,1, 4, 6} = 1.e., player A chooses the strategy II. Ths choce of player A s called the maxmn prncple, and the correspondng gan (here 1) s called the maxmn value of the game. In general, the player A should try to maxmze hs least gans or to fnd max mn v = v. For player B, on the other hand, lkes to mnmze hs losses. The maxmum value n each column represents the maxmum loss to hm f he chooses hs partcular strategy. These are wrtten n Table 1.5 by column maxmum. wll then select the strategy that mnmzes hs maxmum losses. Ths choce of player B s called the mnmax prncple, and the correspondng loss s the mnmax value of the game. In ths case, the value s also 1 and player B chooses the strategy III. In general, the player B should try to mnmze hs maxmum loss or to fnd mn max v = v. If the maxmn value equals the mnmax value then the game s sad to have a saddle pont (here (II, III) cell) and the correspondng strateges are called optmum strateges. The amount at the saddle pont s known as the value of the game. Example 1.2: Solve the game whose payoff matrx s gven below: I II III I -2 15-2 II -5-6 -4 III -5 20-8 Table 1.6: A s payoff matrx Soluton: We use the maxmn (mnmax) prncple to determne the optmal strategy. The game has two saddle ponts at postons (1, 1) and (1, 3). I II III Row mnmum I -2 15-2 -2 Maxmn II -5-6 -4-6 III -5 20-8 -8 Column maxmum -2 20 Mnmax -2 Mnmax

() The best strategy for player A s I. () The best strategy for player B s ether I or III. () The value of the game s 2 for player A and +2 for player B. 1.4 Mxed Strategy: Game wthout A Saddle Pont If maxmn value s not equal to mnmax value then the game s sad to have no saddle pont. In such a case, both the players must determne an optmal mxture of strateges to fnd an equlbrum pont. The optmal strategy mxture for each player may be determned by assgnng to each strategy ts probablty of beng chosen. The strateges so determned are called mxed strateges. The value of game obtaned by the use of mxed strateges represents the least payoff whch player A can expect to wn and the least payoff whch player B can expect to lose. The expected payoff to a player n a game wth payoff matrx [v ] m n can be defned as E(p,q) = m =1 =1 n p v q = pvq T where p = (p 1,p 2,...,p m ) and q = (q 1,q 2,...,q n ) denote probabltes or relatve frequency wth whch a strategy s chosen from the lst of strateges assocated wth m strateges of player A and n strateges of player B, respectvely. Obvously, p 0 ( = 1,2,...m), q 0( = 1,2,...,n) and p 1 + p 2 +... + p m = 1; q 1 + q 2 +... + q n = 1. Theorem 1.2: For any 2 2 two-person zero-sum game wthout any saddle pont havng B 1 B 2 A 1 v 11 v 12 the payoff matrx for player A gven by A 2 v 21 v, the optmal mxed strateges 22 A 1 A 2 S A = p 1 p and S B 1 B 2 B = 2 q 1 q are determned by p 1 p 2 = v 22 v 21 v 11 v 12, q 1 q 2 = v 22 v 12 v 11 v 21 where 2 p 1 + p 2 = 1 and q 1 + q 2 = 1. The value v of the game to A s gven by v = v 11v 22 v 21 v 12 v 11 +v 22 (v 12 +v 21 ). A 1 A 2 Proof: Let a mxed strategy for player A be gven by S A = p 1 p, where p 1+p 2 = 1. 2 Thus, f player B moves B 1 then the net expected gan of A wll be E 1 (p) = v 11 p 1 +v 21 p 2 and f B moves B 2, the net expected gan of A wll be E 2 (p) = v 12 p 1 + v 22 p 2. B 1 B 2 Smlarly, f B plays hs mxed strategy S B = q 1 q, where q 1 + q 2 = 1, then B s 2 net expected loss wll be E 1 (q) = v 11 q 1 +v 12 q 2 f A plays A 1, and E 2 (q) = v 21 q 1 +v 22 q 2 f A

plays A 2. The expected gan of player A, when B chooses hs moves wth probabltes q 1 and q 2, s gven by E(p,q) = q 1 [v 11 p 1 + v 21 p 2 ] + q 2 [v 12 p 1 + v 22 p 2 ]. would always try to mx hs moves wth such probabltes so as to maxmze hs expected gan. Now, E(p,q) = q 1 [v 11 p 1 + v 21 (1 p 1 )] + (1 q 1 )[v 12 p 1 + v 22 (1 p 1 )] = [v 11 + v 22 (v 12 + v 21 )]p 1 q 1 + (v 12 v 22 )p 1 + (v 21 v 22 )q 1 + v 22 ( = λ p 1 v )( 22 v 21 q λ 1 v ) 22 v 12 + v 11v 22 v 12 v 21 λ λ where λ = v 11 + v 22 (v 12 + v 21 ). We see that f A chooses p 1 = v 22 v 21 λ, he ensures an expected gan of at least (v 11 v 22 v 12 v 21 )/λ. Smlarly, f B chooses q 1 = v 22 v 12 λ, then he can lmt hs expected loss to at most (v 11 v 22 v 12 v 21 )/λ. These choces of p 1 and q 1 wll thus be optmal to the two players. Thus, we get p 1 = v 22 v 21 v = 22 v 21 λ v 11 + v 22 (v 12 + v 21 ) q 1 = v 22 v 12 v = 22 v 12 λ v 11 + v 22 (v 12 + v 21 ) v and v = 11 v 22 v 12 v 21 v 11 + v 22 (v 12 + v 21 ) Hence, we have v and p 2 = 1 p 1 = 11 v 12 v 11 + v 22 (v 12 + v 21 ) v and q 2 = 1 q 1 = 11 v 21 v 11 + v 22 (v 12 + v 21 ) p 1 p 2 = v 22 v 21 v 11 v 12, q 1 = v 22 v 12 v ; and v = 11 v 22 v 21 v 12 q 2 v 11 v 21 v 11 + v 22 (v 12 + v 21 ). (1.1) Note: The above formula for p 1, p 2, q 1, q 2 and v are vald only for 2 2 games wthout saddle pont. Example 1.3: Suppose that n a game of matchng cons wth two players, one player wns Rs. 2 when there are 2 heads, and gets nothng when there are 2 tals and looses Re 1 when there are one head and one tal. Determne the best strateges for each player and the value of the game. Soluton: The payoff matrx for player A s gven n Table 1.7. The game has no saddle pont. Let the player A plays H wth probablty x and T wth probablty 1 x. Then, f the player B plays H, then A s expected gan s E(A,H) = x(2) + (1 x)( 1) = 3x 1. If the player B plays T, A s expected gan s E(A,T ) = x( 1) + (1 x).0 = x.

H T H 2-1 T -1 0 Table 1.7: s payoff matrx If the player A chooses x such that E(A,H) = E(A,T ) = E(A) say, then ths wll determne best strategy for hm. Thus we have 3x 1 = x or x = 1/4. Therefore, the best strategy for the player A s to play H and T wth probablty 1/4 and 3/4, respectvely. Therefore, the expected gan for player A s E(A) = 1 4 (2) + 3 4 ( 1) = 1 4. The same procedure can be appled for player B. If the probablty of B s choce of H s y and that of T s 1 y then for the best strategy of the player B, E(B,H) = E(B,T ) whch gves y = 1/4. Therefore, 1 y = 3/4. Thus A s optmal strategy s (1/4, 3/4) and B s optmal strategy s (1/4, 3/4). The expected value of the game s 1/4 to the player A. Ths result can also be obtaned drectly by usng the formulae (1.1).