Epistemic Game Theory

Similar documents
Introduction to game theory LECTURE 2

Logic and Artificial Intelligence Lecture 24

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

Game theory and applications: Lecture 1

Advanced Microeconomics

Logic and Artificial Intelligence Lecture 25

TR : Knowledge-Based Rational Decisions and Nash Paths

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

Outline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010

Rationalizable Strategies

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

An introduction on game theory for wireless networking [1]

Game theory for. Leonardo Badia.

Stochastic Games and Bayesian Games

m 11 m 12 Non-Zero Sum Games Matrix Form of Zero-Sum Games R&N Section 17.6

10.1 Elimination of strictly dominated strategies

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

Best response cycles in perfect information games

Best-Reply Sets. Jonathan Weinstein Washington University in St. Louis. This version: May 2015

TR : Knowledge-Based Rational Decisions

Game Theory: Normal Form Games

PAULI MURTO, ANDREY ZHUKOV. If any mistakes or typos are spotted, kindly communicate them to

Stochastic Games and Bayesian Games

Game Theory. VK Room: M1.30 Last updated: October 22, 2012.

Player 2 H T T -1,1 1, -1

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

G5212: Game Theory. Mark Dean. Spring 2017

Lecture 3 Representation of Games

Basic Game-Theoretic Concepts. Game in strategic form has following elements. Player set N. (Pure) strategy set for player i, S i.

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

CS711 Game Theory and Mechanism Design

PAULI MURTO, ANDREY ZHUKOV

ECON 803: MICROECONOMIC THEORY II Arthur J. Robson Fall 2016 Assignment 9 (due in class on November 22)

Introduction to Multi-Agent Programming

Games of Incomplete Information

Beliefs and Sequential Rationality

Repeated, Stochastic and Bayesian Games

CMPSCI 240: Reasoning about Uncertainty

Using the Maximin Principle

Economics 171: Final Exam

January 26,

CS711: Introduction to Game Theory and Mechanism Design

PhD Qualifier Examination

SI 563 Homework 3 Oct 5, Determine the set of rationalizable strategies for each of the following games. a) X Y X Y Z

ECON322 Game Theory Half II

Introduction to Game Theory

MATH 121 GAME THEORY REVIEW

Microeconomics Comprehensive Exam

CHAPTER 14: REPEATED PRISONER S DILEMMA

Continuing game theory: mixed strategy equilibrium (Ch ), optimality (6.9), start on extensive form games (6.10, Sec. C)!

Sequential Rationality and Weak Perfect Bayesian Equilibrium

Outline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies

Introduction to Game Theory Lecture Note 5: Repeated Games

The Ohio State University Department of Economics Econ 601 Prof. James Peck Extra Practice Problems Answers (for final)

Answers to Problem Set 4

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

Chapter 2 Strategic Dominance

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

Iterated Dominance and Nash Equilibrium

CMPSCI 240: Reasoning about Uncertainty

Introductory Microeconomics

Game Theory. Important Instructions

Epistemic Experiments: Utilities, Beliefs, and Irrational Play

Game Theory: Minimax, Maximin, and Iterated Removal Naima Hammoud

Introduction. Microeconomics II. Dominant Strategies. Definition (Dominant Strategies)

Microeconomic Theory II Preliminary Examination Solutions

Algorithmic Game Theory and Applications. Lecture 11: Games of Perfect Information

On Existence of Equilibria. Bayesian Allocation-Mechanisms

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies

Introduction to Game Theory

Solution to Tutorial 1

Exercises Solutions: Game Theory

Solution to Tutorial /2013 Semester I MA4264 Game Theory

(a) (5 points) Suppose p = 1. Calculate all the Nash Equilibria of the game. Do/es the equilibrium/a that you have found maximize social utility?

Microeconomics of Banking: Lecture 5

M.Phil. Game theory: Problem set II. These problems are designed for discussions in the classes of Week 8 of Michaelmas term. 1

Introduction to Game Theory

Almost essential MICROECONOMICS

Prisoner s Dilemma. CS 331: Artificial Intelligence Game Theory I. Prisoner s Dilemma. Prisoner s Dilemma. Prisoner s Dilemma.

Rational Behaviour and Strategy Construction in Infinite Multiplayer Games

6.207/14.15: Networks Lecture 9: Introduction to Game Theory 1

Introduction to Political Economy Problem Set 3

Problem 3 Solutions. l 3 r, 1

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.

Econ 711 Homework 1 Solutions

Mixed Strategies. In the previous chapters we restricted players to using pure strategies and we

Game Theory Week 7, Lecture 7

Sequential-move games with Nature s moves.

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

G5212: Game Theory. Mark Dean. Spring 2017

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2017

Extensive-Form Games with Imperfect Information

Homework #2 Psychology 101 Spr 03 Prof Colin Camerer

Mixed Strategies. Samuel Alizon and Daniel Cownden February 4, 2009

Finding Equilibria in Games of No Chance

Transcription:

Epistemic Game Theory Lecture 1 ESSLLI 12, Opole Eric Pacuit Olivier Roy TiLPS, Tilburg University MCMP, LMU Munich ai.stanford.edu/~epacuit http://olivier.amonbofis.net August 6, 2012 Eric Pacuit and Olivier Roy 1

The Guessing Game Eric Pacuit and Olivier Roy 2

Plan for the week Eric Pacuit and Olivier Roy 3

Plan for the week 1. Monday Basic Concepts. Basics of Game Theory. The Epistemic View on Games. Basics of Decision Theory Eric Pacuit and Olivier Roy 3

Plan for the week 1. Monday Basic Concepts. 2. Tuesday Epistemics. Logical/qualitative models of beliefs, knowledge and higher-order attitudes. Probabilistic/quantitative models of beliefs, knowledge and higher-order attitudes. Eric Pacuit and Olivier Roy 3

Plan for the week 1. Monday Basic Concepts. 2. Tuesday Epistemics. 3. Wednesday Fundamentals of Epistemic Game Theory. Common knowledge of Rationality and iterated strict dominance in the matrix. Common knowledge of Rationality and backward induction (strict dominance in the tree). Eric Pacuit and Olivier Roy 3

Plan for the week 1. Monday Basic Concepts. 2. Tuesday Epistemics. 3. Wednesday Fundamentals of Epistemic Game Theory. 4. Thursday Puzzles and Paradoxes. Weak dominance and admissibility in the matrix. Russell-style paradoxes in models of higher-order beliefs. (The Brandenburger-Kiesler paradox). Eric Pacuit and Olivier Roy 3

Plan for the week 1. Monday Basic Concepts. 2. Tuesday Epistemics. 3. Wednesday Fundamentals of Epistemic Game Theory. 4. Thursday Puzzles and Paradoxes. 5. Friday Extensions and New Directions. Nash Equilibrium and mixted strategies. Forward Induction. Are the models normative or descriptive? Theory of play. Eric Pacuit and Olivier Roy 3

Practicalities Course Website: ai.stanford.edu/~epacuit/esslli2012/epgmth.html There you ll find handouts, reading material and additional references. In case of problem: Olivier Roy: Olivier.Roy@lmu.de Eric Pacuit: E.J.Pacuit@uvt.nl Eric Pacuit and Olivier Roy 4

Eric Pacuit and Olivier Roy 5

Key Concepts Games in Strategic (matrix) and Extensive (tree) form. Strategies (pure and mixed). Solution Concepts: Iterated Strict Dominance, Iterated Weak Dominance, Nash Equilibrium, Eric Pacuit and Olivier Roy 6

The Matrix: games in strategic forms. Eric Pacuit and Olivier Roy 7

The Matrix: games in strategic forms. Strangelove Alexei Players, Eric Pacuit and Olivier Roy 7

The Matrix: games in strategic forms. Strangelove Disarm Arm Alexei Disarm Arm Players, Actions or Strategies, Strategy profiles, Eric Pacuit and Olivier Roy 7

The Matrix: games in strategic forms. Strangelove Alexei Disarm Arm Disarm 3, 3 Arm 1, 1 Players, Actions or Strategies, Strategy profiles, Payoffs on profiles. Eric Pacuit and Olivier Roy 7

The Matrix: games in strategic forms. Strangelove Alexei Disarm Arm Disarm 3, 3 0, 4 Arm 4, 0 1, 1 Players, Actions or Strategies, Strategy profiles, Payoffs on profiles. Eric Pacuit and Olivier Roy 7

A three players game Strglv Fidel - D Alexei D A D 3, 3, 3 1, 4, 5 A 4, 1, 1 2, 2, 2 Strglv Fidel - A Alexei D A D 3, 3, 2 1, 4, 4 A 4, 1, 0 2, 2, 2 Eric Pacuit and Olivier Roy 8

The Tree: games in extensive forms. S D A A A D A D A 3, 3 1, 4 4, 1 2, 2 Actions, Eric Pacuit and Olivier Roy 9

The Tree: games in extensive forms. S D A A A D A D A 3, 3 1, 4 4, 1 2, 2 Actions, Players, Eric Pacuit and Olivier Roy 9

The Tree: games in extensive forms. S D A A A D A D A 3, 3 1, 4 4, 1 2, 2 Actions, Players, Payoffs on leaves, Eric Pacuit and Olivier Roy 9

The Tree: games in extensive forms. S D A A A D A D A 3, 3 1, 4 4, 1 2, 2 Actions, Players, Payoffs on leaves, Strategies Eric Pacuit and Olivier Roy 9

The Tree: games in extensive forms. S D A A A D A D A 3, 3 1, 4 4, 1 2, 2 Actions, Players, Payoffs on leaves, Strategies Eric Pacuit and Olivier Roy 9

The Tree: games in extensive forms. S D A A A D A D A 3, 3 1, 4 4, 1 2, 2 Actions, Players, Payoffs on leaves, Strategies Eric Pacuit and Olivier Roy 9

The Tree: games in extensive forms. S D A A A D A D A 3, 3 1, 4 4, 1 2, 2 Actions, Players, Payoffs on leaves, Strategies Eric Pacuit and Olivier Roy 9

The Tree: games in extensive forms. S D A A A D A D A 3, 3 1, 4 4, 1 2, 2 Actions, Players, Payoffs on leaves, Strategies Eric Pacuit and Olivier Roy 9

The Tree: games in extensive forms. S D A A A D A D A 3, 3 1, 4 4, 1 2, 2 Actions, Players, Payoffs on leaves, Strategies Eric Pacuit and Olivier Roy 9

The Tree: games in extensive forms. S D A A A D A D A 3, 3 1, 4 4, 1 2, 2 Actions, Players, Payoffs on leaves, Strategies Eric Pacuit and Olivier Roy 9

Extensive and strategic form games are related S A D A D 3, 3 1, 4 A 4,1 2, 2 S D A A A D A D A 3,3 1,4 4,1 2,2 Eric Pacuit and Olivier Roy 10

Extensive and strategic form games are related S A D A D 3, 3 1, 4 A 4,1 2, 2 S D A A A D A D A 3,3 1,4 4,1 2,2 Eric Pacuit and Olivier Roy 10

Some types of non-cooperative games of interest 2 players games. 2 players, zero-sum: if one player wins x then the other looses x. 2 players, win-loose games. Perfect/imperfect information. Eric Pacuit and Olivier Roy 11

Pure and mixed strategies. Eric Pacuit and Olivier Roy 12

Pure and mixed strategies. Strangelove Alexei Head Tail Head 1, -1-1, 1 Tail -1, 1 1, -1 Eric Pacuit and Olivier Roy 12

Pure and mixed strategies. Strangelove Alexei Head Tail Head 1, -1-1, 1 Tail -1, 1 1, -1 Strangelove has two pure strategies: Head and Tail. Eric Pacuit and Olivier Roy 12

Pure and mixed strategies. Strangelove Alexei Head Tail Head 1, -1-1, 1 Tail -1, 1 1, -1 Strangelove has two pure strategies: Head and Tail. A mixed strategy is a probability distribution over the set of pure strategies. For instance: (1/2 Head, 1/2 Tail) (1/3 Head, 2/3 Tail)... Eric Pacuit and Olivier Roy 12

Pure and mixed strategies. Strangelove Alexei Head Tail Head 1, -1-1, 1 Tail -1, 1 1, -1 Strangelove has two pure strategies: Head and Tail. A mixed strategy is a probability distribution over the set of pure strategies. For instance: (1/2 Head, 1/2 Tail) (1/3 Head, 2/3 Tail)... Additional subtleties in extensive games. (mixing at a node vs mixing whole strategies). Eric Pacuit and Olivier Roy 12

Interpretation of mixed strategies Eric Pacuit and Olivier Roy 13

Interpretation of mixed strategies 1. Real randomizations: Side of goal in penalty kicks. Serving side in tennis. Luggage check at the airport. Eric Pacuit and Olivier Roy 13

Interpretation of mixed strategies 1. Real randomizations: Side of goal in penalty kicks. Serving side in tennis. Luggage check at the airport. 2. Epistemic interpretation: Mixed strategies as beliefs of the other player(s) about what you do. Eric Pacuit and Olivier Roy 13

Solution Concepts Eric Pacuit and Olivier Roy 14

Solution Concepts Set of profiles or outcome of the game that are intuitively viewed as rational. Eric Pacuit and Olivier Roy 14

Solution Concepts Set of profiles or outcome of the game that are intuitively viewed as rational. Three well-known solution concepts in the matrix: Nash Equilibrium. Iterated elimitation of: Strictly dominated strategies. Weakly dominated strategies. Eric Pacuit and Olivier Roy 14

Solution Concepts Set of profiles or outcome of the game that are intuitively viewed as rational. Three well-known solution concepts in the matrix: Nash Equilibrium. Iterated elimitation of: Strictly dominated strategies. Weakly dominated strategies. In the tree we will focus on one: Backward induction. Eric Pacuit and Olivier Roy 14

Nash Equilibrium A B a 1, 1 0, 0 b 0, 0 1, 1 The profile aa is a Nash equilibrium of that game. Eric Pacuit and Olivier Roy 15

Nash Equilibrium A B a 1, 1 0, 0 b 0, 0 1, 1 The profile aa is a Nash equilibrium of that game. Definition A strategy profile σ is a Nash equilibrium iff for all i and all s i σ i : u i (σ) u i (s i, σ i ) Eric Pacuit and Olivier Roy 15

Some Facts about Nash Equilibrium Nash equilibria in Pure Strategies do not always exist. Every game in strategic form has a Nash equilibrium in mixed strategies. The proof of this make use of Kakutani s Fixed point thm. Some games have multiple Nash equilibria. Eric Pacuit and Olivier Roy 16

von Neumann s minimax theorem For every two-player zero-sum game with finite strategy sets S 1 and S 2, there is a number v, called the value of the game such that: v = max p (S 1 ) = min q (S 2 ) min u 1(s 1, s 2 ) q (S 2 ) max u 1(s 1, s 2 ) p (S 1 ) Furthermore, a mixed strategy profile (s 1, s 2 ) is a Nash equilibrium if and only if s 1 argmax p (S1 ) min u 1(p, q) q (S 2 ) s 2 argmax q (S2 ) min u 1(p, q) p (S 1 ) Finally, for all mixed Nash equilibria (p, q), u 1 (p, q) = v Eric Pacuit and Olivier Roy 17

Strictly Dominated Strategies Eric Pacuit and Olivier Roy 18

Strictly Dominated Strategies S A D A D 3, 3 1, 4 A 4,1 2, 2 Eric Pacuit and Olivier Roy 18

Strictly Dominated Strategies A B Eric Pacuit and Olivier Roy 19

Strictly Dominated Strategies A B Eric Pacuit and Olivier Roy 19

Strictly Dominated Strategies A B > > > > > Eric Pacuit and Olivier Roy 19

Strictly Dominated Strategies A B > > > > > In general, the idea applies to both mixed and pure strategies. Eric Pacuit and Olivier Roy 19

Iterated Elimination of Strictly Dominated Strategies Bob U L R Ann U 1,2 0,1 U D 0,1 1,0 U Eric Pacuit and Olivier Roy 20

Iterated Elimination of Strictly Dominated Strategies Bob U L R Ann U 1,2 0,1 U D 0,1 1,0 U Eric Pacuit and Olivier Roy 20

Iterated Elimination of Strictly Dominated Strategies Bob U L R Ann U 1,2 0,1 U D 0,1 1,0 U Eric Pacuit and Olivier Roy 20

Iterated Elimination of Strictly Dominated Strategies Bob U L R Ann U 1,2 0,1 U D 0,1 1,0 U Eric Pacuit and Olivier Roy 20

Facts about IESDS The algorithm always terminates on finite games. Intuition: this is a decreasing (in fact, monotonic) function on sub-games. It thus has a fixed-point by the Knaster-Tarski thm. The algorithm is order independent: One can eliminate SDS one player at the time, in difference order, or all simultaneously. The fixed-point of the elimination procedure will always be the same. All Nash equilibria survive IESDS. But not all profile that survive IESDS are Nash equilibria. Eric Pacuit and Olivier Roy 21

Weak Dominance A B Eric Pacuit and Olivier Roy 22

Weak Dominance A B Eric Pacuit and Olivier Roy 22

Weak Dominance A > = > = = B Eric Pacuit and Olivier Roy 22

Weak Dominance A > = > = = B All strictly dominated strategies are weakly dominated. Eric Pacuit and Olivier Roy 22

Iterated Elimination of Weakly Dominated Strategies Bob U L R Ann U 1,2 0,1 U D 0,1 1,1 U Eric Pacuit and Olivier Roy 23

Iterated Elimination of Weakly Dominated Strategies Bob U L R Ann U 1,2 0,1 U D 0,1 1,1 U Eric Pacuit and Olivier Roy 23

Iterated Elimination of Weakly Dominated Strategies Bob U L R Ann U 1,2 1,1 U D 0,1 1,1 U Eric Pacuit and Olivier Roy 23

Iterated Elimination of Weakly Dominated Strategies Bob U L R Ann U 1,2 0,1 U D 0,1 1,1 U Eric Pacuit and Olivier Roy 23

Facts about IEWDS The algorithm always terminates on finite games. The algorithm is order dependent!: Eliminating simultaneously all WDS at each round need not to lead to the same result as eliminating only some of them. Not all Nash equilibria survive IESDS. Eric Pacuit and Olivier Roy 24

The Epistemic View on Games Hey, no, equilibrium is not the way to look at games. Now, Nash equilibrium is king in game theory. Absolutely king. We say: No, Nash equilibrium is an interesting concept, and its an important concept, but its not the most basic concept. The most basic concept should be: to maximise your utility given your information. Its in a game just like in any other situation. Maximise your utility given your information! Robert Aumann, 5 Questions on Epistemic Logic, 2010 Eric Pacuit and Olivier Roy 25

The Epistemic View on Games Hey, no, equilibrium is not the way to look at games. Now, Nash equilibrium is king in game theory. Absolutely king. We say: No, Nash equilibrium is an interesting concept, and its an important concept, but its not the most basic concept. The most basic concept should be: to maximise your utility given your information. Its in a game just like in any other situation. Maximise your utility given your information! Robert Aumann, 5 Questions on Epistemic Logic, 2010 Two views on games: Eric Pacuit and Olivier Roy 25

The Epistemic View on Games Hey, no, equilibrium is not the way to look at games. Now, Nash equilibrium is king in game theory. Absolutely king. We say: No, Nash equilibrium is an interesting concept, and its an important concept, but its not the most basic concept. The most basic concept should be: to maximise your utility given your information. Its in a game just like in any other situation. Maximise your utility given your information! Robert Aumann, 5 Questions on Epistemic Logic, 2010 Two views on games: Based on solution Concepts. Eric Pacuit and Olivier Roy 25

The Epistemic View on Games Hey, no, equilibrium is not the way to look at games. Now, Nash equilibrium is king in game theory. Absolutely king. We say: No, Nash equilibrium is an interesting concept, and its an important concept, but its not the most basic concept. The most basic concept should be: to maximise your utility given your information. Its in a game just like in any other situation. Maximise your utility given your information! Robert Aumann, 5 Questions on Epistemic Logic, 2010 Two views on games: Based on solution Concepts. Classical, decision-theoretic. Eric Pacuit and Olivier Roy 25

The Epistemic View on Games Component of a Game A game in strategic form: Ann/ Bob L R T 1, 1 1, 0 B 0, 0 0, 1 A coordination game: Ann/ Bob L R T 1, 1 0, 0 B 0, 0 1, 1 G = Ag, {(S i, π i ) i Ag } Ag is a finite set of agents. S i is a finite set of strategies, one for each agent i Ag. u i : Π i Ag S i R is a payoff function defined on the set of outcomes of the game. Solutions/recommendations: Nash Equilibrium, Elimination of strictly dominated strategies, of weakly dominated strategies... Eric Pacuit and Olivier Roy 26

The Epistemic View on Games A Decision Problem: Leonard s Omelette Egg Good Egg Rotten Break with other eggs 4 0 Separate bowl 2 1 Eric Pacuit and Olivier Roy 27

The Epistemic View on Games A Decision Problem: Leonard s Omelette Egg Good Egg Rotten Break with other eggs 4 0 Separate bowl 2 1 Agent, actions, states, payoffs, beliefs. Eric Pacuit and Olivier Roy 27

The Epistemic View on Games A Decision Problem: Leonard s Omelette Egg Good Egg Rotten Break with other eggs 4 0 Separate bowl 2 1 Agent, actions, states, payoffs, beliefs. Ex.: Leonard s beliefs: p L (EG) = 1/2, p L (ER) = 1/2. Eric Pacuit and Olivier Roy 27

The Epistemic View on Games A Decision Problem: Leonard s Omelette Egg Good Egg Rotten Break with other eggs 4 0 Separate bowl 2 1 Agent, actions, states, payoffs, beliefs. Ex.: Leonard s beliefs: p L (EG) = 1/2, p L (ER) = 1/2. Solution/recommendations: choice rules. Maximization of Expected Utility, Dominance, Minmax... Eric Pacuit and Olivier Roy 27

The Epistemic View on Games The Epistemic or Bayesian View on Games Traditional game theory: Actions, outcomes, preferences, solution concepts. Decision theory: Actions, outcomes, preferences beliefs, choice rules. Eric Pacuit and Olivier Roy 28

The Epistemic View on Games The Epistemic or Bayesian View on Games Traditional game theory: Actions, outcomes, preferences, solution concepts. Decision theory: Actions, outcomes, preferences beliefs, choice rules. Epistemic game theory: Actions, outcomes, preferences, beliefs, choice rules. Eric Pacuit and Olivier Roy 28

The Epistemic View on Games The Epistemic or Bayesian View on Games Traditional game theory: Actions, outcomes, preferences, solution concepts. Decision theory: Actions, outcomes, preferences beliefs, choice rules. Epistemic game theory: := (interactive) decision problem and choice rule + higher-order information. Eric Pacuit and Olivier Roy 28

Basics of Decision Theory Eric Pacuit and Olivier Roy 29

Basics of Decision Theory A Decision Problem: Leonard s Omelette u i P P A 4 0 B 2 1 p i P P A 1/8 3/8 B 1/8 3/8 Actions, states, payoffs, beliefs. Eric Pacuit and Olivier Roy 30

Basics of Decision Theory A Decision Problem: Leonard s Omelette u i P P A 4 0 B 2 1 p i P P A 1/8 3/8 B 1/8 3/8 Actions, states, payoffs, beliefs. Eric Pacuit and Olivier Roy 30

Basics of Decision Theory A Decision Problem: Leonard s Omelette u i P P A 4 0 B 2 1 p i P P A 1/8 3/8 B 1/8 3/8 Actions, states, payoffs, beliefs. Eric Pacuit and Olivier Roy 30

Basics of Decision Theory A Decision Problem: Leonard s Omelette u i P P A 4 0 B 2 1 p i P P A 1/8 3/8 B 1/8 3/8 Actions, states, payoffs, beliefs. Eric Pacuit and Olivier Roy 30

Basics of Decision Theory A Decision Problem: Leonard s Omelette u i P P A 4 0 B 2 1 p i P P A 1/8 3/8 B 1/8 3/8 Actions, states, payoffs, beliefs. Eric Pacuit and Olivier Roy 30

Basics of Decision Theory A Decision Problem: Leonard s Omelette u i P P A 4 0 B 2 1 p i P P A 1/8 3/8 B 1/8 3/8 Actions, states, payoffs, beliefs. Solution/recommendations: choice rules. Which choice rule is normatively or descriptively appropriate depends on what kind of information are at the agent s disposal, and what kind of attitude she has. Eric Pacuit and Olivier Roy 30

Basics of Decision Theory Decision Under Risk When the agent has probabilistic beliefs, or that her beliefs can be represented probabilistically. u i P P A 4 0 B 2 1 p i P P A 1/8 3/8 B 1/8 3/8 Expected Utility: Given an agent s beliefs and desires, the expected utility of an action leading to a set of outcomes Out is: o Out [ subjective prob. of o] [utility of o] Eric Pacuit and Olivier Roy 31

Basics of Decision Theory Why don t we just give our best guess of wet or dry? Often people want to make a decision, such as whether to put out their washing to dry, and would like us to give a simple yes or no. However, this is often a simplification of the complexities of the forecast and may not be accurate. Eric Pacuit and Olivier Roy 32

Basics of Decision Theory Why don t we just give our best guess of wet or dry? Often people want to make a decision, such as whether to put out their washing to dry, and would like us to give a simple yes or no. However, this is often a simplification of the complexities of the forecast and may not be accurate. By giving PoP we give a more honest opinion of the risk and allow you to make a decision depending on how much it matters to you. Eric Pacuit and Olivier Roy 32

Basics of Decision Theory Why don t we just give our best guess of wet or dry? Often people want to make a decision, such as whether to put out their washing to dry, and would like us to give a simple yes or no. However, this is often a simplification of the complexities of the forecast and may not be accurate. By giving PoP we give a more honest opinion of the risk and allow you to make a decision depending on how much it matters to you. For example, if you are just hanging out your sheets that you need next week you might take the risk at 40% probability of precipitation, whereas if you are drying your best shirt that you need for an important dinner this evening then you might not hang it out at more than 10% probability. Eric Pacuit and Olivier Roy 32

Basics of Decision Theory Why don t we just give our best guess of wet or dry? Often people want to make a decision, such as whether to put out their washing to dry, and would like us to give a simple yes or no. However, this is often a simplification of the complexities of the forecast and may not be accurate. By giving PoP we give a more honest opinion of the risk and allow you to make a decision depending on how much it matters to you. For example, if you are just hanging out your sheets that you need next week you might take the risk at 40% probability of precipitation, whereas if you are drying your best shirt that you need for an important dinner this evening then you might not hang it out at more than 10% probability. PoP allows you to make the decisions that matter to you. http: // www. metoffice. gov. uk/ news/ in-depth/ science-behind-probability-of-precipitation Eric Pacuit and Olivier Roy 32

Basics of Decision Theory Maximization of Expected Utility Let DP = S, O, u, p be a decision problem. S is a finite set of states and O a set of outcomes. An action a : S O is a function from states to outcomes, u i a real-valued utility function on O, and p i a probability measure over S. The expected utility of a A with respect to p i is defined as follows: EU p (a) := Σ s S p(s)u(a(s)) An action a A maximizes expected utility with respect to p i provided for all a A, EU p (a) EU p (a ). In such a case, we also say a is a best response to p in game DP. Eric Pacuit and Olivier Roy 33

Basics of Decision Theory Decision under Ignorance What to do when the agent cannot assign probabilities states? Or when we can t represent his beliefs probabilistically? Many alternatives proposed: Dominance Reasoning Admissibility Minimax... Eric Pacuit and Olivier Roy 34

Basics of Decision Theory Dominance Reasoning A B > > > > > Eric Pacuit and Olivier Roy 35

Basics of Decision Theory Some facts about strict dominance Strict dominance is downward monotonic: If a i is strictly dominated with respect to X S and X X, then a i is strictly dominated with respect to X. Eric Pacuit and Olivier Roy 36

Basics of Decision Theory Some facts about strict dominance Strict dominance is downward monotonic: If a i is strictly dominated with respect to X S and X X, then a i is strictly dominated with respect to X. Intuition: the condition of being strictly dominated can be written down in a first-order formula of the form xϕ(x), where ϕ(x) is quantifier-free. Such formulas are downward monotonic: If M, s = xϕ(x) and M M then M, s = xϕ(x) Eric Pacuit and Olivier Roy 36

Basics of Decision Theory Some facts about strict dominance Relation with MEU: Suppose that G = N, {S i } i N, {u i } i N is a strategic game. A strategy s i S i is strictly dominated (possibly by a mixed strategy) with respect to X S i iff there is no probability measure p (X ) such that s i is a best response with respect to p. Eric Pacuit and Olivier Roy 37

Basics of Decision Theory Some facts about admissibility Admissibility is NOT downward monotonic: If a i is not admissible with respect to X S and X X, it can be that a i is admissible with respect to X. Eric Pacuit and Olivier Roy 38

Basics of Decision Theory Some facts about admissibility Admissibility is NOT downward monotonic: If a i is not admissible with respect to X S and X X, it can be that a i is admissible with respect to X. Intuition: the condition of being inadmissible can be written down in a first-order formula of the form xϕ(x) xψ(x), where ϕ(x) and ψ(x) are quantifier-free. The existential quantifier breaks the downward monotonicity. Eric Pacuit and Olivier Roy 38

Basics of Decision Theory Some facts about admissibility Relation with MEU: Suppose that G = N, {S i } i N, {u i } i N is a strategic game. A strategy s i S i is weakly dominated (possibly by a mixed strategy) with respect to X S i iff there is no full support probability measure p >0 (X ) such that s i is a best response with respect to p. Eric Pacuit and Olivier Roy 39

Road Map again 1. Today Basic Concepts. Basics of Game Theory. The Epistemic View on Games. Basics of Decision Theory Eric Pacuit and Olivier Roy 40

Road Map again 1. Today Basic Concepts. Basics of Game Theory. The Epistemic View on Games. Basics of Decision Theory 2. Tomorrow Epistemics. Logical/qualitative models of beliefs, knowledge and higher-order attitudes. Probabilistic/quantitative models of beliefs, knowledge and higher-order attitudes. Eric Pacuit and Olivier Roy 40

Formal Definitions Strategic Games Definition A game in strategic form G is a tuple A, S i, u i such that : A is a finite set of agents. S i is a finite set of actions or strategies for i. A strategy profile σ Π i A S i is a vector of strategies, one for each agent in I. The strategy s i which i plays in the profile σ is noted σ i. u i : Π i A S i R is an utility function that assigns to every strategy profile σ Π i A S i the utility valuation of that profile for agent i. Eric Pacuit and Olivier Roy 41

Formal Definitions Extensive form games Definition A game in extensive form T is a tuple I, T, τ, {u i } i I such that: T is finite set of finite sequences of actions, called histories, such that: The empty sequence, the root of the tree, is in T. T is prefix-closed: if (a 1,..., a n, a n+1 ) T then (a 1,..., a n ) T. A history h is terminal in T whenever it is the sub-sequence of no other history h T. Z denotes the set of terminal histories in T. τ : (T Z) I is a turn function which assigns to every non-terminal history h the player whose turn it is to play at h. u i : Z R is a payoff function for player i which assigns i s payoff at each terminal history. Eric Pacuit and Olivier Roy 42

Formal Definitions Strategies Definition A strategy s i for agent i is a function that gives, for every history h such that i = τ(h), an action a A(h). S i is the set of strategies for agent i. A strategy profile σ Π i I S i is a combination of strategies, one for each agent, and σ(h) is a shorthand for the action a such that a = σ i (h) for the agent i whose turn it is at h. A history h is reachable or not excluded by the profile σ from h if h = (h, σ(h), σ(h, σ(h)),...) for some finite number of application of σ. We denote u h i (σ) the value of util i at the unique terminal history reachable from h by the profile σ. Eric Pacuit and Olivier Roy 43

Formal Definitions Nash Equilibrium - General Definition Definition A profile of mixed strategy σ is a Nash equilibrium iff for all i and all mixed strategy σ i σ i : EU i (σ i, σ i ) EU i (σ i, σ i ) Where EU i, the expected utility of the strategy σ i against σ i is calculated as follows (σ = (σ i, σ i )): ) EU i (σ) = Σ s Πj S j ((Π j Ag σ j (s j ))u i (s) Eric Pacuit and Olivier Roy 44