Prisoner s Dilemma. CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma. Prisoner s Dilemma. Prisoner s Dilemma.

Similar documents
CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

An introduction on game theory for wireless networking [1]

Game Theory: Minimax, Maximin, and Iterated Removal Naima Hammoud

Solution to Tutorial /2013 Semester I MA4264 Game Theory

Solution to Tutorial 1

Introduction to Game Theory

m 11 m 12 Non-Zero Sum Games Matrix Form of Zero-Sum Games R&N Section 17.6

Game theory and applications: Lecture 1

Rationalizable Strategies

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies

Introduction to Multi-Agent Programming

Problem 3 Solutions. l 3 r, 1

CS711: Introduction to Game Theory and Mechanism Design

MS&E 246: Lecture 2 The basics. Ramesh Johari January 16, 2007

Advanced Microeconomics

Outline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010

Regret Minimization and Security Strategies

preferences of the individual players over these possible outcomes, typically measured by a utility or payoff function.

Their opponent will play intelligently and wishes to maximize their own payoff.

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

CS711 Game Theory and Mechanism Design

January 26,

Economics 109 Practice Problems 1, Vincent Crawford, Spring 2002

1. better to stick. 2. better to switch. 3. or does your second choice make no difference?

CMPSCI 240: Reasoning about Uncertainty

Game Theory Lecture #16

LECTURE 4: MULTIAGENT INTERACTIONS

Lecture 3 Representation of Games

Now we return to simultaneous-move games. We resolve the issue of non-existence of Nash equilibrium. in pure strategies through intentional mixing.

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

Econ 711 Homework 1 Solutions

Today. Applications of NE and SPNE Auctions English Auction Second-Price Sealed-Bid Auction First-Price Sealed-Bid Auction

Can we have no Nash Equilibria? Can you have more than one Nash Equilibrium? CS 430: Artificial Intelligence Game Theory II (Nash Equilibria)

Name. FINAL EXAM, Econ 171, March, 2015

TTIC An Introduction to the Theory of Machine Learning. Learning and Game Theory. Avrim Blum 5/7/18, 5/9/18

Week 8: Basic concepts in game theory

Introduction to Game Theory

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

CMPSCI 240: Reasoning about Uncertainty

Game Theory Problem Set 4 Solutions

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

Review Best Response Mixed Strategy NE Summary. Syllabus

Overuse of a Common Resource: A Two-player Example

Game Theory I 1 / 38

MATH 4321 Game Theory Solution to Homework Two

Iterated Dominance and Nash Equilibrium

Notes for Section: Week 4

PAULI MURTO, ANDREY ZHUKOV. If any mistakes or typos are spotted, kindly communicate them to

Game Theory I 1 / 38

Notes for Section: Week 7

Prisoner s dilemma with T = 1

PAULI MURTO, ANDREY ZHUKOV

Week 8: Basic concepts in game theory

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

Games of Incomplete Information

Microeconomics II. CIDE, MsC Economics. List of Problems

ECO303: Intermediate Microeconomic Theory Benjamin Balak, Spring 2008

Game Theory. Wolfgang Frimmel. Repeated Games

S 2,2-1, x c C x r, 1 0,0

Exercises Solutions: Game Theory

Test 1. ECON3161, Game Theory. Tuesday, September 25 th

CSI 445/660 Part 9 (Introduction to Game Theory)

w E(Q w) w/100 E(Q w) w/

CS 7180: Behavioral Modeling and Decision- making in AI

Econ 101A Final exam May 14, 2013.

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

Econ 101A Final exam May 14, 2013.

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Microeconomic Theory II Preliminary Examination Solutions Exam date: August 7, 2017

Algorithms and Networking for Computer Games

Stochastic Games and Bayesian Games

6.1 What is a Game? 166 CHAPTER 6. GAMES

Repeated, Stochastic and Bayesian Games

Player 2 H T T -1,1 1, -1

Econ 323 Microeconomic Theory. Practice Exam 2 with Solutions

Game Theory: Normal Form Games

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

In the Name of God. Sharif University of Technology. Microeconomics 2. Graduate School of Management and Economics. Dr. S.

Economics 101A (Lecture 21) Stefano DellaVigna

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.

MA300.2 Game Theory 2005, LSE

Simon Fraser University Spring 2014

Econ 323 Microeconomic Theory. Chapter 10, Question 1

CEC login. Student Details Name SOLUTIONS

Notes on Auctions. Theorem 1 In a second price sealed bid auction bidding your valuation is always a weakly dominant strategy.

MATH 121 GAME THEORY REVIEW

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2017

Game Theory Week 7, Lecture 7

Economics 171: Final Exam

10.1 Elimination of strictly dominated strategies

Game theory for. Leonardo Badia.

Game Theory: Additional Exercises

Epistemic Game Theory

Game Theory. Analyzing Games: From Optimality to Equilibrium. Manar Mohaisen Department of EEC Engineering

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

Microeconomic Theory II Preliminary Examination Solutions

Transcription:

CS 331: rtificial Intelligence Game Theory I You and your partner have both been caught red handed near the scene of a burglary. oth of you have been brought to the police station, where you are interrogated separately by the police. 1 2 The police present your options: 1. You can testify against your partner 2. You can refuse to testify against your partner (and keep your mouth shut) Here are the consequences of your actions: If you testify against your partner and your partner refuses, you are released and your partner will serve 10 years in jail If you refuse and your partner testifies against you, you will serve 10 years in jail and your partner is released If both of you testify against each other, both of you will serve 5 years in jail If both of you refuse, both of you will only serve 1 year in jail 3 4 Your partner is offered the same deal Remember that you can t communicate with your partner and you don t know what he/she will do Will you testify or refuse? Game Theory Welcome to the world of Game Theory! Game Theory defined as the study of rational decision-making in situations of conflict and/or cooperation dversarial search is part of Game Theory We will now look at a much broader group of games 5 6 1

Types of games we will deal with today Two players Discrete, finite action space Simultaneous moves (or without knowledge of the other player s move) Imperfect information Zero sum games and non-zero sum games Uses of Game Theory gent design: determine the best strategy against a rational player and the expected return for each player Mechanism design: Define the rules of the game to influence the behavior of the agents Real world applications: negotiations, bandwidth sharing, auctions, bankruptcy proceedings, pricing decisions 7 8 ack to Formal definition of Normal Form Normal-form (or matrix-form) representation Players: lice, ob ctions: testify, refuse lice: refuse = -10, = 0 = -1, = -1 Payoffs for each player (non-zero sum game in this example) 9 The normal-form representation of an n- player game specifies: The players strategy spaces S 1,, S n Their payoff functions u 1,,u n where u i : S 1 x S 2 x x S n R ie. a function that maps from the combination of strategies of all the players and returns the payoff for player i 10 Strategies Each player must adopt and execute a strategy Strategy = policy ie. mapping from state to action is a one move game: Strategy is a single action There is only a single state pure strategy is a deterministic policy Other Normal Form Games The game of chicken: two cars (usually driven by macho guys with something to prove) drive at each other on a narrow road. The first one to swerve loses. : Stay : Swerve : Stay = -100, = -100 = 1, = -1 : Swerve = -1, = 1 = 0, = 0 11 12 2

Other Normal Form Games Penalty kick in Soccer: Shooter vs. Goalie. The shooter shoots the ball either to the left or to the right. The goalie dives either left or right. If it s the same side as the ball was shot, the goalie makes the save. Otherwise, the shooter scores. Strategy lice: refuse = -10, = 0 = -1, = -1 Shooter: Left Shooter: Right Goalie: Left Goalie: Right S =-1, G = 1 S = 1, G = -1 S = 1, G = -1 S = -1, G = 1 What is the right pure strategy for lice or ob? (ssume both want to maximize their own expected utility) 13 14 Strategy lice: refuse = -10, = 0 = -1, = -1 lice thinks: If ob testifies, I get 5 years if I testify and 10 years if I don t If ob doesn t testify, I get 0 years if I testify and 1 year if I don t lright I ll testify Strategy lice: refuse = -10, = 0 = -1, = -1 Testify is a dominant strategy for the game (notice how the payoffs for lice are always bigger if she testifies than if she refuses) 15 16 Dominant Strategies Example of Dominant Strategies Suppose a player has two strategies S and S. We say S dominates S if choosing S always yields at least as good an outcome as choosing S. S strictly dominates S if choosing S always gives a better outcome than choosing S (no matter what the other player does) S weakly dominates S if there is one set of opponent s actions for which S is superior, and all other sets of opponent s actions give S and S the same payoff. 17 lice: refuse = -10, = 0 = -1, = -1 lice: refuse = -10, = 0 = 0, = -1 Note testify strongly dominates refuse testify weakly dominates refuse 18 3

Dominated Strategies (The opposite) S is dominated by S if choosing S never gives a better outcome than choosing S, no matter what the other players do S is strictly dominated by S if choosing S always gives a worse outcome than choosing S, no matter what the other player does S is weakly dominated by S if there is at least one set of opponent s actions for which S gives a worse outcome than S, and all other sets of opponent s actions give S and S the same payoff. 19 Dominance It is irrational not to play a strictly dominant strategy (if it exists) It is irrational to play a strictly dominated strategy Since Game Theory assumes players are rational, they will not play strictly dominated strategies 20 Iterated Elimination of Strictly Dominated Strategies lice: refuse = -10, = 0 = -1, = -1 Simplifies to: Iterated Eliminiation of Strictly Dominated Strategies ut in this simplified game, refuse is also a strictly dominated strategy for ob 21 22 Iterated Elimination of Strictly Dominated Strategies ob: testify lice: testify = -5, = -5 Simplifies to: This is the gametheoretic solution to (note that it s worse off than if both players refuse) 23 Dominant Strategy Equilibrium lice: refuse = -10, = 0 = -1, = -1 (testify,testify) is a dominant strategy equilibrium It s an equilibrium because no player can benefit by switching strategies given that the other player sticks with the same strategy n equilibrium is a local optimum in the space of policies 24 4

Pareto Optimal n outcome is Pareto optimal if there is no other outcome that all players would prefer n outcome is Pareto dominated by another outcome if all players would prefer the other outcome If lice and ob both testify, this outcome is Pareto dominated by the outcome if they both refuse. This is why it s called What If No Strategies re Strictly Dominated? S1 = 0, = 4 = 4, = 0 = 5, = 3 S2 = 4, = 0 = 0, = 4 = 5, = 3 S3 = 3, = 5 = 3, = 5 = 6, = 6 How do we find these equilibrium points in the game? 25 26 Nash Equilibrium dominant strategy equilibrium is a special case of a Nash Equilibrium Nash Equilibrium: strategy profile in which no player wants to deviate from his or her strategy. Strategy profile: n assignment of a strategy to each player eg. (testify, testify) in ny Nash Equilibrium will survive iterated elimination of strictly dominated strategies 27 Nash Equilibrium in lice: refuse = -10, = 0 = -1, = -1 If (testify,testify) is a Nash Equilibrium, then: lice doesn t want to change her strategy of testify given that ob chooses testify ob doesn t want to change his strategy of testify given that lice chooses testify 28 How to Spot a Nash Equilibrium S1 = 0, = 4 = 4, = 0 = 5, = 3 S2 = 4, = 0 = 0, = 4 = 5, = 3 S3 = 3, = 5 = 3, = 5 = 6, = 6 How to Spot a Nash Equilibrium S1 = 0, = 4 = 4, = 0 = 5, = 3 S2 = 4, = 0 = 0, = 4 = 5, = 3 S3 = 3, = 5 = 3, = 5 = 6, = 6 29 Go through each square and see: If player gets a higher payoff if she changes her strategy If player gets a higher payoff if he changes his strategy If the answer is no to both of the above, you have a Nash Equilibrium 30 5

How to Spot a Nash Equilibrium S1 = 0, = 4 = 4, = 0 = 5, = 3 S2 = 4, = 0 = 0, = 4 = 5, = 3 S3 = 3, = 5 = 3, = 5 = 6, = 6 won t change his Strategy of S3 Payoff of 6 > 5 (S2) and 6 > 5 (S1) won t change her Strategy of S3 Payoff of 6 > 5 (S2) and 6 > 5 (S1) Formal Definition of Nash Equilibrium (n-player) Notation: S i = Set of strategies for player i s i S i means strategy s i is a member of strategy set S i u i (s 1, s 2,, s n ) = payoff for player i if all the players in the game play their respective strategies s 1, s 2,, s n. s * 1 S 1,s * 2 S 2,, s * n S n are a Nash equilibrium iff: * * * i s arg max u ( s,, s i si i 1 i1, s, s i * i1 *,, s ) n 31 32 Formal Definition of a Nash Equilibrium S1 = 0, = 4 = 4, = 0 = 5, = 3 S2 = 4, = 0 = 0, = 4 = 5, = 3 S3 = 3, = 5 = 3, = 5 = 6, = 6 Using the notation u i ( s strategy, s strategy): u ( S3, max u ( S1,, u ( S2,, u u ( S3, max u ( S3, S1), u ( S3, S2), u ( S3, ( S3, Neat fact If your game has a single Nash Equilibrium, you can announce to your opponent that you will play your Nash Equilibrium strategy If your opponent is rational, he will have no choice but to play his part of the Nash Equilibrium strategy Why? 33 34 6