CS 798: Homework Assignment 4 (Game Theory)

Similar documents
Iterated Dominance and Nash Equilibrium

Economics 171: Final Exam

Economics 109 Practice Problems 1, Vincent Crawford, Spring 2002

Econ 323 Microeconomic Theory. Practice Exam 2 with Solutions

Microeconomics of Banking: Lecture 5

Problem Set 2 - SOLUTIONS

Econ 323 Microeconomic Theory. Chapter 10, Question 1

The Ohio State University Department of Economics Second Midterm Examination Answers

S 2,2-1, x c C x r, 1 0,0

Game Theory. Important Instructions

ECE 586BH: Problem Set 5: Problems and Solutions Multistage games, including repeated games, with observed moves

Introduction to Multi-Agent Programming

An introduction on game theory for wireless networking [1]

MATH 4321 Game Theory Solution to Homework Two

Prisoner s dilemma with T = 1

Name. Answers Discussion Final Exam, Econ 171, March, 2012

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

Exercises Solutions: Game Theory

CHAPTER 14: REPEATED PRISONER S DILEMMA

MA200.2 Game Theory II, LSE

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

ECO303: Intermediate Microeconomic Theory Benjamin Balak, Spring 2008

Microeconomic Theory II Preliminary Examination Solutions Exam date: June 5, 2017

PAULI MURTO, ANDREY ZHUKOV

The Nash equilibrium of the stage game is (D, R), giving payoffs (0, 0). Consider the trigger strategies:

2 Game Theory: Basic Concepts

Advanced Microeconomics

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

Using the Maximin Principle

Now we return to simultaneous-move games. We resolve the issue of non-existence of Nash equilibrium. in pure strategies through intentional mixing.

PRISONER S DILEMMA. Example from P-R p. 455; also 476-7, Price-setting (Bertrand) duopoly Demand functions

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

Stochastic Games and Bayesian Games

CUR 412: Game Theory and its Applications, Lecture 12

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2017

Stochastic Games and Bayesian Games

CS711 Game Theory and Mechanism Design

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

CMPSCI 240: Reasoning about Uncertainty

Out of equilibrium beliefs and Refinements of PBE

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

January 26,

Solution to Tutorial 1

Solution to Tutorial /2013 Semester I MA4264 Game Theory

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

Game Theory: Minimax, Maximin, and Iterated Removal Naima Hammoud

Answer Key: Problem Set 4

CS711: Introduction to Game Theory and Mechanism Design

Regret Minimization and Security Strategies

Simon Fraser University Fall Econ 302 D200 Final Exam Solution Instructor: Songzi Du Wednesday December 16, 2015, 8:30 11:30 AM

Economics Honors Exam Review (Micro) Mar Based on Zhaoning Wang s final review packet for Ec 1010a, Fall 2013

CUR 412: Game Theory and its Applications, Lecture 9

Mohammad Hossein Manshaei 1394

Expectations & Randomization Normal Form Games Dominance Iterated Dominance. Normal Form Games & Dominance

preferences of the individual players over these possible outcomes, typically measured by a utility or payoff function.

Game Theory: Normal Form Games

Economics 431 Infinitely repeated games

Introduction to Game Theory Lecture Note 5: Repeated Games

PAULI MURTO, ANDREY ZHUKOV. If any mistakes or typos are spotted, kindly communicate them to

Microeconomics II. CIDE, MsC Economics. List of Problems

Math 167: Mathematical Game Theory Instructor: Alpár R. Mészáros

(a) Describe the game in plain english and find its equivalent strategic form.

Repeated Games with Perfect Monitoring

Solution Problem Set 2

Simon Fraser University Spring 2014

Problem 3 Solutions. l 3 r, 1

Strategy -1- Strategy

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

Introduction to Game Theory

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

CSE 316A: Homework 5

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies

Math 135: Answers to Practice Problems

Chapter 2 Strategic Dominance

Bayesian Nash Equilibrium

Game Theory - Lecture #8

MA300.2 Game Theory 2005, LSE

Finding Mixed-strategy Nash Equilibria in 2 2 Games ÙÛ

Games of Incomplete Information ( 資訊不全賽局 ) Games of Incomplete Information

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

MAT 4250: Lecture 1 Eric Chung

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Game theory for. Leonardo Badia.

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

Econ 101A Final exam May 14, 2013.

In the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S.

Game Theory Notes: Examples of Games with Dominant Strategy Equilibrium or Nash Equilibrium

Economics 502 April 3, 2008

Early PD experiments

Outline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010

Lecture 5 Leadership and Reputation

CS360 Homework 14 Solution

Chapter 11: Dynamic Games and First and Second Movers

Microeconomic Theory II Preliminary Examination Solutions

Exercises Solutions: Oligopoly

Game Theory: Additional Exercises

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

Econ 711 Homework 1 Solutions

Transcription:

0 5 CS 798: Homework Assignment 4 (Game Theory) 1.0 Preferences Assigned: October 28, 2009 Suppose that you equally like a banana and a lottery that gives you an apple 30% of the time and a carrot 70% of the time. Also, you equally like a peach and a lottery that gives you an apple 10% of the time and a carrot 90% of the time. (a) What can you say about your relative preferences for bananas and peaches? If you had a lottery whose payoffs were bananas and carrots, what probability of winning a banana or a carrot would be equally preferred to a peach? Denote apple = A, banana = B, carrot = C, peach = P. We are free to choose utilities as we wish, so let U(A)=0, U(C) = 1. Then, U(B) =.7 and U(P) =.9, so you prefer peaches to bananas. (b) Let P(win B) = p. Then.7p + 1(1-p) =.9, so.3p =.1, so p = 0.33. 2.0 Utility functions Your cable company gives you 10GB of free data transfer a month, and charges $5/GB thereafter. Suppose that your utility from transferring x GB of data is 100(1-e -0.25x ) and that your disutility from paying $1 is 1. How much data should you transfer in a month to maximize your utility? Your net utility from transferring x GB is 100(1-e -0.25x ) if x<10 and 100(1-e -0.25x ) - 5(x-10) otherwise. The plot of these two functions is shown below: It is clear that the maximum occurs at x=10 for a value of 96 approximately 92. So, your utility is maximized by transferring 88 80 exactly 10GB/month. 72 64 56 48 40 32 24 16 8 10 15 20 25 30 35 40 45 50 3.0 Pure and mixed strategies Consider the game of tic-tac-toe. What are the possible actions for the first move of the first player (ignore symmetries)? What would constitute a pure strategy? What would constitute a mixed strategy? Would you ever play a mixed strategy for this game? Why or why not? The only possible first actions are: play corner, play middle, and play center. Depending on which move is played, the second player would have response, and depending on that response the first player would have a response etc. A pure strategy for each player is each valid response to the prior move (whether or not it is rational). A mixed strategy would play one of the pure strategies (i.e the entire sequence) with Page 1 of 5

some probability. It turns out that in tic-tac-toe, with two expert players, a tie is guaranteed with a pure strategy, but a mixed strategy (depending over what you mix) could lose when played against an optimal strategy. So, it never makes sense to mix. In general, every component of a mixed strategy must be a potentially winning strategy. Otherwise, the mixed strategy would improve by discarding a component that can never win. 4.0 Zero-sum game If the payoffs (a, -a) of every outcome of a zero sum game were changed so that the new payoffs were (a+5, -5a), the game would no longer be zero sum. But, would the structure of the game change? No, because utilities are only unique to a affine transformation. 5.0 Representation Represent the Pricing game of Example 7 in Normal form. L M H Y (1,a-1) (2,a-2) (3,a-3) N (0,0) (0,0) (0,0) 6.0 Representation Prove that normal and extensive form are equivalent if information sets are permitted. We need to prove two things (a) if information sets are permitted every normal form game can be represented in extensive form (b) if information sets are permitted every extensive-form game can be represented in normal form. To prove (a): given a normal form game with n players, simply draw a tree of depth n, where all moves by the first player are associated with a node with an edge leading from the root to that node, and all nodes are in the same information set. Then, from each such node, draw an edge for each possible move for the second player, and place each set of nodes in the same information set. Repeat for each successive player, and label the leaves with the payoff from the corresponding array element. To prove (b): given the extensive form game, form paths from the root to each leaf. Decompose the path into moves by t each of the players and find all possible moves by each player each time it is allowed to make a move. Let S i denote the set of moves that player i can move on its t turn. Then the strategy space for player i is the cross product of these sets. Finally, the normal form is an n-dimensional matrix with the ith dimension indexed by the strategy space of the ith player, and the corresponding element having the payoff for these strategies. 7.0 Best response What is the best response for the customer in the Pricing game (Example 7)? The best response depends on the value of a. For each of the strategies of the ISP, i.e., L, M, and H, the best response is Y if a-price > 0, otherwise it is N. Page 2 of 5

8.0 Dominant strategy Suppose that you are not well prepared for a final, and you think you might fail it. If you miss the exam, you will certainly fail it. What is your dominant strategy: attend or miss? Why? If you attend, your payoff is your utility for either pass or fail, but if you miss, your payoff is your utility for fail. Assuming that utility(pass) > utility(fail), your payoff for attending is as good as or better than the payoff for not attending. So, your dominant strategy is to attend. 9.0 Bayesian game Does the Bayesian game in Example 11 have a dominant strategy for the Row player? If so, what is it? It is easy to verify that no matter the type of the Column player (strong or weak signal), the best response for Row if Column plays S is D and if Column plays D is S. Therefore, knowing the type of the Column player does not help Row, and the game does not have a dominant strategy for Row. 10.0 Repeated game Suppose that both players in Prisoner s Dilemma (Example 15) play their dominant strategy in an infinitely repeated game with a discount factor of 0.6. What is their payoff for the repeated game? The one shot payoff is -3 for each, so the repeated payoff is -3* 0.6 i = -3/.4 = -7.5. 11.0 Dominant strategy equilibrium Interpret the meaning of the dominant strategy equilibrium of Example 14. Look up how the 802.11e EDCA protocol solves this problem. It is dominant for both players to send rather than wait. In equilibrium, they always send right away so their packets always collide, and in fact, no progress is made, so that delays are actually infinite. This game illustrates the aphorism: haste makes waste. The EDCA protocol allows higher priority (delay sensitive) stations to wait for a shorter time than lower-priority stations before accessing the medium, therefore making it more probable that they would get access to medium and experience a shorter delay. i = 0 12.0 Iterated deletion Show an example of a pure strategy that is dominated by a mixture of other pure strategies, although none of the strategies in the mixture dominate the pure strategy. Consider the following game, where we only show the payoffs for Row: C1 C2 R1 0 0 R2 1-1 R3-2 2 Page 3 of 5

Neither R2 nor R3 dominate R1. However any mixed strategy of R2 and R3 that plays R3 with a probability greater than 2/3 dominates R1. Therefore, we can delete R1 from the game. 13.0 Maximin What are the maximin equilibria in Examples 10 and 14? In Example 10, Row can get as low as -1 with S, but at least 0 with D, so its maximin strategy is D. Column is assured 1 with S, so its maximin strategy is S, and the equilibrium is DS. In Example 14, Row maximizes its minimum payoff with S. The game is symmetric, so the maximin equilibrium is SS. 14.0 Maximin in a zero-sum game Show that in Example 18 if Row uses any value of p other than 0.5, then it may get a payoff lower than 2.5 if Column plays either pure or mixed strategies. In Figure 3, note that when p is smaller than 0.5, the column player can play pure strategy C1 to reduce Row s payoff below 2.5. Similarly, if p is greater than 0.5, Column can use a pure strategy C2 to reduce Row s payoff. For any value of p, Column can play a mixture qc1 + (1-q) C2 to give Row a payoff of q(p+2) + (1-q)(4-3p). To make this smaller than 2.5, we set q(p+2) + (1-q)(4-3p) < 2.5, or q > (3-6p)/(4-8p). For instance, if p=0, q > 3/4, and if p=1, q>3/4. (The inequality is not valid when p= 0.5.) 15.0 Nash equilibrium Referring to Example 19, assume that if Column plays a mixed strategy with probability qh + (1-q)T instead of its Nash equilibrium strategy. What is Row s mixed strategy best response? Let the row player play ph + (1-p)T. Then, its payoff, given Column s mixed strategy, is p(q-(1- q))+(1-p)(-q+(1-q)) = 4pq -2q -2p +1 =(1-2p)(1-2q). If q < 0.5, p should be 0, otherwise p should be 1. Intuitively, if the Column player is more likely to play T, then Row should play T for sure and vice versa. 16.0 Correlated equilibrium Does the WiFi game of Example 6 have a correlated equilibrium? If so, describe it. Consider an external agency that tells the players to play pds + (1-p)SD. When Row is told to play D, it knows that it will get a payoff of -1 if it deviates. Similarly, when told to play S, it will get 0 if it deviates (instead of 1). So, it will not deviate, independent of the value of p. By symmetry, the same analysis holds for Column, and therefore we have a correlated equilibrium. The external agency can arrange for any desired payoffs to Row and Column by adjusting p. 17.0 Price discrimination Outline the design of a price-discrimination mechanism with n player types (whose valuations are known). Page 4 of 5

Assume that the valuations of each player are v 1,...,v n for minimum quantities of q 1,...,q n. The scheme is essentially to charge v i for q i adjusting for the fact that player i could buy multiples of q j j<i if that minimizes its total cost. 18.0 VCG mechanism The CIO of a company wants to decide how much capacity to buy from its ISP. The cost of capacity is $20/ Mbps/month. There are three departments in the company, who value capacity as follows: department 1 (D1) values capacity xmbps/month at $20(*1-e -0.5x ), D2 values it at $40(*1-e -0.5x ), D3 values it at $80(*1-e - 0.5x ). (a) Assuming the disutility of ISP payment is linear in the amount of payment, what is the overall function that the CIO should maximize? (b) What is type of each department? (c) What is the optimal social choice? (d) What are the Clarke Pivot payments for each department? (e) Is this budget balanced? (a) The overall function is (20+40+80)(1-e -0.5x ) - 20x = 140(1-e -0.5x ) - 20x. (b) The types are the only unknowns in the utility functions, i.e. 20, 40, and 80 respectively. (c) The optimal social choice comes from maximizing the function in (a). Setting f(x) = 140(1-e -0.5x ) - 20x, solve for f (x * )=0, so that x * = 2.5055. (d) To compute x -1, we maximize (120(1-e -0.5x ) - 20x) to get 2.197. Similarly, x -2 = 1.832, and x -3 = 0.8109. Thus, p 1 = v 2 (x -1 ) + v 3 (x -1 ) - (v 2 (x * ) + v 3 (x * )) = (40+80)(1-e -0.5*2.197 ) - (40+80)(1-e -0.5*2.5055 ) = 120*(e - 1.25275- e -1.0985 ) = -5.718. Similarly, p 2 = v 1 (x -2 ) + v 3 (x -2 ) - (v 1 (x * ) + v 3 (x * )) = 100(e -1.25275- e -0.5*1.832 ) = -11.439, p 3 = v 1 (x -3 ) + v 2 (x -3 ) - (v 1 (x * ) + v 2 (x * )) = 60(e -0.5*0.8109 -e -1.25275 )= 60*(e -1.25275- e -0.4055 ) = -22.857. (e) No, the budget is not balanced: the CIO has to pay each department. Page 5 of 5