Rational Behaviour and Strategy Construction in Infinite Multiplayer Games

Similar documents
Decision Problems for Nash Equilibria in Stochastic Games *

GAME THEORY: DYNAMIC. MICROECONOMICS Principles and Analysis Frank Cowell. Frank Cowell: Dynamic Game Theory

Extensive-Form Games with Imperfect Information

Sequential Rationality and Weak Perfect Bayesian Equilibrium

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

1 x i c i if x 1 +x 2 > 0 u i (x 1,x 2 ) = 0 if x 1 +x 2 = 0

Microeconomic Theory II Preliminary Examination Solutions

Game Theory. Wolfgang Frimmel. Repeated Games

Stochastic Games and Bayesian Games

Finding Equilibria in Games of No Chance

FDPE Microeconomics 3 Spring 2017 Pauli Murto TA: Tsz-Ning Wong (These solution hints are based on Julia Salmi s solution hints for Spring 2015.

ECON 803: MICROECONOMIC THEORY II Arthur J. Robson Fall 2016 Assignment 9 (due in class on November 22)

Subgame Perfect Cooperation in an Extensive Game

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2017

CHAPTER 14: REPEATED PRISONER S DILEMMA

Introduction to Game Theory Lecture Note 5: Repeated Games

GAME THEORY. Department of Economics, MIT, Follow Muhamet s slides. We need the following result for future reference.

Yao s Minimax Principle

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5

Stochastic Games and Bayesian Games

On Existence of Equilibria. Bayesian Allocation-Mechanisms

Problem 3 Solutions. l 3 r, 1

ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games

Evaluating Strategic Forecasters. Rahul Deb with Mallesh Pai (Rice) and Maher Said (NYU Stern) Becker Friedman Theory Conference III July 22, 2017

Algorithmic Game Theory and Applications. Lecture 11: Games of Perfect Information

The Ohio State University Department of Economics Econ 601 Prof. James Peck Extra Practice Problems Answers (for final)

Repeated Games with Perfect Monitoring

Equilibria in Finite Games

Best response cycles in perfect information games

Outline Introduction Game Representations Reductions Solution Concepts. Game Theory. Enrico Franchi. May 19, 2010

Rationalizable Strategies

CHAPTER 15 Sequential rationality 1-1

Notes for Section: Week 4

Equilibria in Finite Games

Game Theory: Normal Form Games

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

Coordination Games on Graphs

G5212: Game Theory. Mark Dean. Spring 2017

Advanced Microeconomics

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

CMSC 474, Introduction to Game Theory 16. Behavioral vs. Mixed Strategies

Duopoly models Multistage games with observed actions Subgame perfect equilibrium Extensive form of a game Two-stage prisoner s dilemma

Game Theory for Wireless Engineers Chapter 3, 4

4 Martingales in Discrete-Time

Economics 502 April 3, 2008

Microeconomics of Banking: Lecture 5

PAULI MURTO, ANDREY ZHUKOV

Preliminary Notions in Game Theory

Reactive Synthesis Without Regret

Commitment in First-price Auctions

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program August 2017

Microeconomics II. CIDE, MsC Economics. List of Problems

Lecture 5 Leadership and Reputation

Outline of Lecture 1. Martin-Löf tests and martingales

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

MA300.2 Game Theory 2005, LSE

A Core Concept for Partition Function Games *

Lecture Note Set 3 3 N-PERSON GAMES. IE675 Game Theory. Wayne F. Bialas 1 Monday, March 10, N-Person Games in Strategic Form

Game Theory Fall 2003

Finitely repeated simultaneous move game.

Finite Memory and Imperfect Monitoring

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Player 2 L R M H a,a 7,1 5,0 T 0,5 5,3 6,6

Dynamic Games. Econ 400. University of Notre Dame. Econ 400 (ND) Dynamic Games 1 / 18

Game Theory Problem Set 4 Solutions

Competitive Outcomes, Endogenous Firm Formation and the Aspiration Core

Sublinear Time Algorithms Oct 19, Lecture 1

Repeated Games. EC202 Lectures IX & X. Francesco Nava. January London School of Economics. Nava (LSE) EC202 Lectures IX & X Jan / 16

Exercises Solutions: Game Theory

Not 0,4 2,1. i. Show there is a perfect Bayesian equilibrium where player A chooses to play, player A chooses L, and player B chooses L.

Warm Up Finitely Repeated Games Infinitely Repeated Games Bayesian Games. Repeated Games

Topics in Contract Theory Lecture 1

Tug of War Game. William Gasarch and Nick Sovich and Paul Zimand. October 6, Abstract

REPEATED GAMES. MICROECONOMICS Principles and Analysis Frank Cowell. Frank Cowell: Repeated Games. Almost essential Game Theory: Dynamic.

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Advanced Micro 1 Lecture 14: Dynamic Games Equilibrium Concepts

Admissibility in Quantitative Graph Games

Bargaining Order and Delays in Multilateral Bargaining with Asymmetric Sellers

M.Phil. Game theory: Problem set II. These problems are designed for discussions in the classes of Week 8 of Michaelmas term. 1

Early PD experiments

Lecture 23: April 10

Optimal Assumptions for Synthesis

INTERIM CORRELATED RATIONALIZABILITY IN INFINITE GAMES

The Core of a Strategic Game *

Epistemic Game Theory

EXTENSIVE AND NORMAL FORM GAMES

Economics 171: Final Exam

G5212: Game Theory. Mark Dean. Spring 2017

MA200.2 Game Theory II, LSE

1 Solutions to Homework 4

Pure stationary optimal strategies in Markov decision processes

Introduction to game theory LECTURE 2

MA200.2 Game Theory II, LSE

Alternating-Offer Games with Final-Offer Arbitration

Extensive form games - contd

Long Term Values in MDPs Second Workshop on Open Games

Beliefs and Sequential Rationality

Lecture 19: March 20

Complexity and Repeated Implementation

Transcription:

Rational Behaviour and Strategy Construction in Infinite Multiplayer Games Michael Ummels ummels@logic.rwth-aachen.de FSTTCS 2006 Michael Ummels Rational Behaviour and Strategy Construction 1 / 15

Infinite Multiplayer Games An infinite multiplayer game G consists of: a finite set of players, a directed graph G = (V, E) (the arena), a partition (V ) assigning each vertex a player, a mapping χ : V C into a finite set C of colours, for each player, a winning condition W C ω. Notation: (G, 0 ) for a game with initial vertex 0. How to play? Intuitively, a token is moved through the graph forming an infinite path (a play). Whenever the token arrives at a vertex V, player has to move the token to a successor vertex. Michael Ummels Rational Behaviour and Strategy Construction 2 / 15

Strategies What is a strategy in such a game? unrestricted: depends on the whole sequence of vertices seen so far (σ : V V V); finite-state: depends only on bounded information about the sequence of states seen so far (finite transducer); positional: depends only on the current vertex (σ : V V). A strategy profile is a collection (σ ) of strategies, one for each player. A strategy profile (σ ) determines a unique play of (G, 0 ), call it (σ ) 0. Michael Ummels Rational Behaviour and Strategy Construction 3 / 15

Nash Equilibria Question: What is the solution of a game? In contrast to so-called two-player zero-sum games, winning strategies do, in general, not exist even for very simple winning condition like reachability! Weaker solutions: Admissibility, Equilibria. Definition: A strategy profile (σ ) is a Nash equilibrium of (G, 0 ) if for all players and all strategies σ of player we have: σ, (σ j ) j \{ } 0 W (σ j ) j 0 W. Remark: For any two-player zero-sum game, a strategy profile (σ 0, σ 1 ) is a Nash equilibrium if either σ 0 or σ 1 is a winning strategy. Michael Ummels Rational Behaviour and Strategy Construction 4 / 15

Nash Equilibria: Example Game with two players 1 (round) and 2 (boxed): 0 3 1 2 Fact: If both players want to reach vertex 2, no player has a winning strategy. However, there are two Nash equilibria: 1 moves from 0 to 3, and 2 moves from 3 to 2. 1 moves from 0 to 1, and 2 stays in 3. But why should player 2 stay in vertex 3? Michael Ummels Rational Behaviour and Strategy Construction 5 / 15

Subgame Perfect Equilibria Problem: Nash equilibria ignore the sequential structure of an infinite game. Solution: Require that strategies are optimal after every possible play prefix! Definition: A strategy profile (σ 1,..., σ k ) is a subgame perfect equilibrium (SPE) of (G, 0 ) if, for every non-empty play prefix h (starting with 0 ), (σ 1 h,..., σ k h ) is a Nash equilibrium of (G h, ) where G h is as G but with winning conditions W h = {α C ω : χ(h) α W }, and σ h : z σ (hz ) is the strategy σ restricted to G h. In particular: Every SPE is a Nash equilibrium (take h = ϵ). Michael Ummels Rational Behaviour and Strategy Construction 6 / 15

Subgame Perfect Equilibria: Example Game with two players 1 (round) and 2 (boxed): 0 3 1 2 Recall: This game has two Nash equilibria if both players want to reach vertex 2. However, the game has a unique SPE: 1 moves from 0 to 3, and 2 moves from 3 to 2. 1 moves from 0 to 1, and 2 stays in 3. Note that staying in 3 is not optimal for play prefix 03. Michael Ummels Rational Behaviour and Strategy Construction 7 / 15

Existence of Nash Equilibria Recall Martin s Theorem: Every two-player zero-sum (quasi-)borel game is determined, i.e. one player has a winning strategy. Theorem (Chatterjee-Jurdziński-Majumdar, CSL 04) Every game with (quasi-)borel winning conditions has a Nash equilibrium. Proof idea: Let each player play an optimal strategy w.r.t. her winning condition. Upon deviation of one player, react with an optimal joint counter-strategy against this player. Drawback: Assume that player j has deviated after history h. It may be the case that player loses the resulting play, although she has a winning strategy in (G h, ). No SPE! Michael Ummels Rational Behaviour and Strategy Construction 8 / 15

Existence of Subgame Perfect Equilibria Theorem Every game with (quasi-)borel winning conditions has a SPE. Proof sketch: Consider the unravelling of G from the initial vertex 0 (the game tree). We prune the tree as follows: If some player has a winning strategy from some of her positions, delete all outgoing edges other than the one taken by this strategy (do this in a uniform way!). Repeat this construction, until we reach a fixed point. Let each player play any (fixed) strategy in the pruned game. Upon deviation, react with an optimal counter strategy in the pruned game. The resulting strategy profile is a subgame perfect equilibrium of the original game. Michael Ummels Rational Behaviour and Strategy Construction 9 / 15

Winning Conditions Abstract winning conditions are nice, but to be of algorithmic use, we have to use finitely representable conditions: Reachability: Given F C, player wins if F is visited. Büchi: Given F C, player wins if F is visited infinitely often. parity: Given a priority function Ω : C ω, player wins if the least priority seen infinitely often is even. ω-regular: Given an MSO (or LTL) formula φ using < and predicates P c for each c C, player wins if (ω, <, (P c ) c C ) = φ. Note: Any game with ω-regular winning conditions can be reduced to a game with parity winning conditions. From now on: Let us only consider games over finite arenas! Michael Ummels Rational Behaviour and Strategy Construction 10 / 15

Computing Equilibria Recall that two-player zero-sum parity games are positionally determined: One of the two players not only has a winning strategy but a positional one. This allows us to perform the construction used in the existence proof directly on the game arena. As positional winning strategies in two-player zero-sum parity games can be computed in exponential time, we get the following theorem. Theorem For any multiplayer parity game (played on a finite arena), we can compute a finite-state subgame perfect equilibrium in exponential time. Michael Ummels Rational Behaviour and Strategy Construction 11 / 15

The Decision Problem SPE Problem: Subgame perfect equilibria are not unique. Indeed, it can be shown that the algorithm may compute an arbitrary poor equilibrium. Definition: The payoff of a strategy profile is the vector {0, 1} def. by ( ) = 1 if player wins the induced play. Goal: Find a SPE with an optimal payoff. The decision problem SPE is defined as follows: Given a multiplayer parity game (G, 0 ) (played on a finite arena) and thresholds, y {0, 1}, decide whether (G, 0 ) has a SPE with payoff and y. If we can solve the problem, we can compute an optimal SPE payoff by using binary search. Michael Ummels Rational Behaviour and Strategy Construction 12 / 15

Solving The Problem Proviso: Choose a suitable encoding of strategy profiles by infinite binary trees. We use tree automata to solve the problem: Construct a nondeterministic Büchi tree automaton accepting precisely those trees that are encodings of strategy profiles. Construct an alternating parity tree automaton accepting those strategy profiles that are SPEs with a desired payoff. Make the latter tree automaton nondeterministic and check the product automaton for nonemptiness. This gives an exponential time algorithm for the problem. Michael Ummels Rational Behaviour and Strategy Construction 13 / 15

Consequences Theorem The decision problem SPE is decidable in exponential time. For games with a bounded number of players and priorities, the problem is in P. Open question: What is the exact complexity of the problem? The proof also reveals that, for any payoff thresholds, y, the set of SPEs with a payoff and y is regular. Theorem For any multiplayer parity game (G, 0 ) (played on a finite arena) and {0, 1}, if (G, 0 ) has a SPE with payoff, then (G, 0 ) has a finite-state SPE with payoff. Remark: Positional strategies do not suffice! Michael Ummels Rational Behaviour and Strategy Construction 14 / 15

Conclusion Summary: Every infinite game with (quasi-)borel winning conditions has a SPE. Deciding whether a parity game played on a finite arena has a SPE with a payoff between two given thresholds can be done in exponential time. Finite-state strategies suffice to implement SPEs in parity games played on finite arenas. Future work: Generalisation to stochastic and concurrent games (in progress). Generalisation to infinite arenas (e.g. pushdown graphs). Michael Ummels Rational Behaviour and Strategy Construction 15 / 15