Sequential allocation of indivisible goods

Similar documents
Yao s Minimax Principle

CS 573: Algorithmic Game Theory Lecture date: 22 February Combinatorial Auctions 1. 2 The Vickrey-Clarke-Groves (VCG) Mechanism 3

CSE 21 Winter 2016 Homework 6 Due: Wednesday, May 11, 2016 at 11:59pm. Instructions

Microeconomics of Banking: Lecture 5

An effective perfect-set theorem

Algorithmic Game Theory and Applications. Lecture 11: Games of Perfect Information

COS 445 Final. Due online Monday, May 21st at 11:59 pm. Please upload each problem as a separate file via MTA.

Lecture 10: The knapsack problem

A relation on 132-avoiding permutation patterns

Notes on Natural Logic

CHAPTER 14: REPEATED PRISONER S DILEMMA

Pareto-Optimal Assignments by Hierarchical Exchange

arxiv: v1 [math.co] 31 Mar 2009

Fair Protocols for Sequential Allocation Problems

Finding Equilibria in Games of No Chance

Value of Flexibility in Managing R&D Projects Revisited

Resource Allocation Algorithms

TR : Knowledge-Based Rational Decisions and Nash Paths

Chapter 10: Mixed strategies Nash equilibria, reaction curves and the equality of payoffs theorem

Integer Programming Models

Lecture 7: Bayesian approach to MAB - Gittins index

CUR 412: Game Theory and its Applications Final Exam Ronaldo Carpio Jan. 13, 2015

1 Solutions to Homework 3

1 Shapley-Shubik Model

Prediction Market, Mechanism Design, and Cooperative Game Theory

Strategies and Nash Equilibrium. A Whirlwind Tour of Game Theory

Introduction to Industrial Organization Professor: Caixia Shen Fall 2014 Lecture Note 5 Games and Strategy (Ch. 4)

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

Markov Decision Processes

Finite Memory and Imperfect Monitoring

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Repeated games OR 8 and 9, and FT 5

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India October 2012

Structural Induction

Extending MCTS

6.896 Topics in Algorithmic Game Theory February 10, Lecture 3

Homework #4. CMSC351 - Spring 2013 PRINT Name : Due: Thu Apr 16 th at the start of class

Counting Basics. Venn diagrams

Can we have no Nash Equilibria? Can you have more than one Nash Equilibrium? CS 430: Artificial Intelligence Game Theory II (Nash Equilibria)

Notes, Comments, and Letters to the Editor. Cores and Competitive Equilibria with Indivisibilities and Lotteries

Game Theory. Analyzing Games: From Optimality to Equilibrium. Manar Mohaisen Department of EEC Engineering

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2015

Microeconomic Theory August 2013 Applied Economics. Ph.D. PRELIMINARY EXAMINATION MICROECONOMIC THEORY. Applied Economics Graduate Program

In the Name of God. Sharif University of Technology. Graduate School of Management and Economics

6.254 : Game Theory with Engineering Applications Lecture 3: Strategic Form Games - Solution Concepts

THE TRAVELING SALESMAN PROBLEM FOR MOVING POINTS ON A LINE

Reinforcement learning and Markov Decision Processes (MDPs) (B) Avrim Blum

Best-Reply Sets. Jonathan Weinstein Washington University in St. Louis. This version: May 2015

Q1. [?? pts] Search Traces

Game Theory: Normal Form Games

IEOR E4004: Introduction to OR: Deterministic Models

TR : Knowledge-Based Rational Decisions

Game Theory Fall 2003

Outline for Dynamic Games of Complete Information

Multirate Multicast Service Provisioning I: An Algorithm for Optimal Price Splitting Along Multicast Trees

Shapley Allocation, Diversification and Services in Operational Risk

Virtual Demand and Stable Mechanisms

Introduction to Game Theory

Efficiency and Herd Behavior in a Signalling Market. Jeffrey Gao

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program August 2017

NOTES ON FIBONACCI TREES AND THEIR OPTIMALITY* YASUICHI HORIBE INTRODUCTION 1. FIBONACCI TREES

Interpolation. 1 What is interpolation? 2 Why are we interested in this?

A Decentralized Learning Equilibrium

MS&E 246: Lecture 5 Efficiency and fairness. Ramesh Johari

COMPUTER SCIENCE 20, SPRING 2014 Homework Problems Recursive Definitions, Structural Induction, States and Invariants

Answer Key: Problem Set 4

Introduction to Game Theory:

Epistemic Game Theory

In reality; some cases of prisoner s dilemma end in cooperation. Game Theory Dr. F. Fatemi Page 219

Issues. Senate (Total = 100) Senate Group 1 Y Y N N Y 32 Senate Group 2 Y Y D N D 16 Senate Group 3 N N Y Y Y 30 Senate Group 4 D Y N D Y 22

Web Appendix: Proofs and extensions.

BAYESIAN GAMES: GAMES OF INCOMPLETE INFORMATION

CMPSCI 311: Introduction to Algorithms Second Midterm Practice Exam SOLUTIONS

Consider the following (true) preference orderings of 4 agents on 4 candidates.

Matching Markets and Google s Sponsored Search

M.Phil. Game theory: Problem set II. These problems are designed for discussions in the classes of Week 8 of Michaelmas term. 1

Strategic Production Game 1

1 Solutions to Tute09

ECO 5341 (Section 2) Spring 2016 Midterm March 24th 2016 Total Points: 100

LECTURE NOTES ON GAME THEORY. Player 2 Cooperate Defect Cooperate (10,10) (-1,11) Defect (11,-1) (0,0)

Parkash Chander and Myrna Wooders

Searching for Multiple Objects in Multiple Locations

COS 511: Theoretical Machine Learning. Lecturer: Rob Schapire Lecture #24 Scribe: Jordan Ash May 1, 2014

Mechanisms for House Allocation with Existing Tenants under Dichotomous Preferences

Lecture 5 January 30

Smoothed Analysis of Binary Search Trees

Hierarchical Exchange Rules and the Core in. Indivisible Objects Allocation

Maximum Contiguous Subsequences

Uncertainty in Equilibrium

Lecture 17: More on Markov Decision Processes. Reinforcement learning

Game Theory for Wireless Engineers Chapter 3, 4

Session 10: Lessons from the Markowitz framework p. 1

Reasoning About Others: Representing and Processing Infinite Belief Hierarchies

Stochastic Games and Bayesian Games

Essays on Some Combinatorial Optimization Problems with Interval Data

Discrete Mathematics for CS Spring 2008 David Wagner Final Exam

Online Shopping Intermediaries: The Strategic Design of Search Environments

Microeconomic Theory III Final Exam March 18, 2010 (80 Minutes)

Algorithmic Game Theory

CUR 412: Game Theory and its Applications, Lecture 9

Econ 711 Homework 1 Solutions

Transcription:

1 / 27 Sequential allocation of indivisible goods Thomas Kalinowski Institut für Mathematik, Universität Rostock Newcastle Tuesday, January 22, 2013

joint work with... 2 / 27 Nina Narodytska Toby Walsh NICTA and UNSW in Sydney

Outline 3 / 27 1 Introduction 2 Sequential allocation policies 3 Maximizing the social welfare

Sequential allocation 4 / 27 A simple example Suppose you are coaching a football team and you want to divide your players into two teams for a practice match.

Sequential allocation 4 / 27 A simple example Suppose you are coaching a football team and you want to divide your players into two teams for a practice match. Nominate two captains and let them take turns in picking team members

Sequential allocation 4 / 27 A simple example Suppose you are coaching a football team and you want to divide your players into two teams for a practice match. Nominate two captains and let them take turns in picking team members What is the best picking order? alternating: 1, 2, 1, 2, 1, 2, 1, 2, 1, 2, 1, 2 alternating and reversing: 1, 2, 2, 1, 1, 2, 2, 1, 1, 2, 2, 1???

Example: Alternating policy 5 / 27 Captain 1 Captain 2 121212

Example: Alternating policy 5 / 27 Captain 1 Captain 2 121212

Example: Alternating policy 5 / 27 Captain 1 Captain 2 121212

Example: Alternating policy 5 / 27 Captain 1 Captain 2 121212

Example: Alternating policy 5 / 27 Captain 1 Captain 2 121212

Example: Alternating policy 5 / 27 Captain 1 Captain 2 121212

Example: Alternating policy 5 / 27 Captain 1 Captain 2 121212

Example: Alternating and reversing policy 6 / 27 Captain 1 Captain 2 122112

Example: Alternating and reversing policy 6 / 27 Captain 1 Captain 2 122112

Example: Alternating and reversing policy 6 / 27 Captain 1 Captain 2 122112

Example: Alternating and reversing policy 6 / 27 Captain 1 Captain 2 122112

Example: Alternating and reversing policy 6 / 27 Captain 1 Captain 2 122112

Example: Alternating and reversing policy 6 / 27 Captain 1 Captain 2 122112

Example: Alternating and reversing policy 6 / 27 Captain 1 Captain 2 122112

The order makes a difference 7 / 27 Preference orders Captain 1 Captain 2 Alternating Captain 1 Captain 2 Alternating and reversing Captain 1 Captain 2

Background 8 / 27 How do we best share resources between competing agents? Best can mean different things (fair, efficient,...) Resources can be divisible (mineral rights, viewing times, etc.) or indivisible (machines, holiday slots, time slots for landing and take-off, etc.)

Background 8 / 27 How do we best share resources between competing agents? Best can mean different things (fair, efficient,...) Resources can be divisible (mineral rights, viewing times, etc.) or indivisible (machines, holiday slots, time slots for landing and take-off, etc.) The allocation of scarce resources is an abundant problem in many economic and social contexts, in engineering, algorithm design, etc. Therefore, it is of great interest to theoretically understand the related phenomena, and develop good allocation mechanisms.

Cake-cutting (fair division) Cut-and-choose Dividing a cake between two persons The first person cuts the cake into two parts The second person chooses which part to take 9 / 27

Cake-cutting (fair division) Cut-and-choose Dividing a cake between two persons The first person cuts the cake into two parts The second person chooses which part to take More agents Different solutions depending on fairness notion Banach, Knaster, Steinhaus 1947 Selfridge; Conway 1960 Brams, Taylor 1995 9 / 27

From fair division to social welfare maximization 10 / 27 To compare division mechanisms the agent s shares have to be evaluated using a utility function. Fair division usually tries to balance utilities: Every agent should be satisfied with the outcome. Game theory studies the effect of strategic decision making. A different aspect A central agency that manages the allocation process might be interested in maximizing a global quality measure, while the opinions of individual agents might be irrelevant.

From fair division to social welfare maximization 10 / 27 To compare division mechanisms the agent s shares have to be evaluated using a utility function. Fair division usually tries to balance utilities: Every agent should be satisfied with the outcome. Game theory studies the effect of strategic decision making. A different aspect A central agency that manages the allocation process might be interested in maximizing a global quality measure, while the opinions of individual agents might be irrelevant. Problem [Bouveret, Lang 2011] Maximize the social welfare over a class of allocation mechanisms.

Formal setup n agents compete for k items Preference order Permutation π of the set [k] = {1,..., k} 11 / 27

Formal setup n agents compete for k items Preference order Permutation π of the set [k] = {1,..., k} Preference profile n-tuple R = (π 1,..., π n ) of preference orders 11 / 27

Formal setup n agents compete for k items Preference order Permutation π of the set [k] = {1,..., k} Preference profile n-tuple R = (π 1,..., π n ) of preference orders Utilities Values k, k 1, k 2,..., 1 11 / 27

Formal setup n agents compete for k items Preference order Permutation π of the set [k] = {1,..., k} Preference profile n-tuple R = (π 1,..., π n ) of preference orders Utilities Values k, k 1, k 2,..., 1 Additivity assumption The utility of a subset A [k] is the sum of the utilities of the elements of A. 11 / 27

Example for n = 2, k = 6, alternating 12 / 27 Available items 1 2 3 4 5 6 Profile (1, 2, 3, 4, 5, 6), (1, 4, 2, 5, 3, 6)

Example for n = 2, k = 6, alternating 12 / 27 Available items 1 2 3 4 5 6 Profile (1, 2, 3, 4, 5, 6), (1, 4, 2, 5, 3, 6) Allocation Agent 1 Agent 2 Utilities Agent 1: Agent 2:

Example for n = 2, k = 6, alternating 12 / 27 Available items 1 2 3 4 5 6 Profile (1, 2, 3, 4, 5, 6), (1, 4, 2, 5, 3, 6) Allocation Agent 1 Agent 2 Utilities Agent 1: 6 Agent 2:

Example for n = 2, k = 6, alternating 12 / 27 Available items 2 3 4 5 6 Profile (1, 2, 3, 4, 5, 6), (1, 4, 2, 5, 3, 6) Allocation Agent 1 Agent 2 Utilities Agent 1: 6 Agent 2: 5

Example for n = 2, k = 6, alternating 12 / 27 Available items 2 3 5 6 Profile (1, 2, 3, 4, 5, 6), (1, 4, 2, 5, 3, 6) Allocation Agent 1 Agent 2 Utilities Agent 1: 6 + 5 Agent 2: 5

Example for n = 2, k = 6, alternating 12 / 27 Available items 3 5 6 Profile (1, 2, 3, 4, 5, 6), (1, 4, 2, 5, 3, 6) Allocation Agent 1 Agent 2 Utilities Agent 1: 6 + 5 Agent 2: 5 + 3

Example for n = 2, k = 6, alternating 12 / 27 Available items 3 6 Profile (1, 2, 3, 4, 5, 6), (1, 4, 2, 5, 3, 6) Allocation Agent 1 Agent 2 Utilities Agent 1: 6 + 5 + 4 = 15 Agent 2: 5 + 3

Example for n = 2, k = 6, alternating 12 / 27 Available items 6 Profile (1, 2, 3, 4, 5, 6), (1, 4, 2, 5, 3, 6) Allocation Agent 1 Agent 2 Utilities Agent 1: 6 + 5 + 4 = 15 Agent 2: 5 + 3 + 1 = 9 = social welfare 15 + 9 = 24

Example for n = 2, k = 6, alternating and reversing 13 / 27 Available items 6 Profile (1, 2, 3, 4, 5, 6), (1, 4, 2, 3, 5, 6) Allocation Agent 1 Agent 2 Utilities Agent 1: 6 + 4 + 2 = 12 Agent 2: 5 + 4 + 1 = 10 = social welfare 12+10 = 22

Allocation policies 14 / 27 Policy p = p 1... p k [n] k In step i agent p i picks an item. Truthful behaviour Among the available items, the agent always picks the best according to her ranking. Individual utilities u i (R, p) Utility of agent i for profile R and policy p Social welfare sw(r, p) = n u i (R, p) i=1

Problem formulation 15 / 27 For a given probability P on the set R of all profiles we consider Expected utilities and social welfare u i (p) = R R P(R)u i (R, p) and sw(p) = R R P(R) sw(r, p) n Linearity of expectation: sw(p) = u i (p). i=1 Here P is always the uniform distribution on R. Conjecture [Bouveret & Lang 2011] The expected social welfare is maximized by the alternating policy p = 12... (n 1)n 12... (n 1)n...... 12... (n 1)n...

Main results 16 / 27 Theorem (K,Narodytska,Walsh 2013+) The expected utilities u i (p) can be computed in linear time. Theorem (K,Narodytska,Walsh 2013+) For a linear utility function and n = 2 agents the expected social welfare is maximized by the alternating policy p = 121212... Theorem (K,Narodytska,Walsh 2013+) For Borda utility and n agents the expected social welfare is nk 2 + O(k) and this is asymptotically optimal. n + 1

The policy tree 17 / 27 1 2 21 11 12 22 121 221 211 111 212 112 122 222 2121 1121 1221 2221 1211 2211 2111 1111 1212 2212 2112 1112 2122 1122 1222 2222 Level k 1 p 1 p 2... p k 1 alternate don t alternate 1 = 2, 2 = 1 Level k p 1 p 1 p 2... p k 1 p 1 p 1 p 2... p k 1

The policy tree 17 / 27 1 2 21 11 12 22 121 221 211 111 212 112 122 222 2121 1121 1221 2221 1211 2211 2111 1111 1212 2212 2112 1112 2122 1122 1222 2222 Reducing symmetry u 1 (1211) = u 2 (2122), u 2 (1211) = u 1 (2122). = sw(1211) = sw(2122)

The policy tree 17 / 27 1 2 21 11 12 22 121 221 211 111 212 112 122 222 2121 1121 1221 2221 1211 2211 2111 1111 1212 2212 2112 1112 2122 1122 1222 2222 Reducing symmetry u 1 (1211) = u 2 (2122), u 2 (1211) = u 1 (2122). = sw(1211) = sw(2122) = It is sufficient to consider the left subtree.

Recursive computation of the expected utilities 18 / 27 With a policy we associate a pair (x, y) where x is the expected utility for the starting agent, y is the expected utility for the other agent. The root node (k = 1): (1, 0) The recursion for k 2 Level k 1 (x, y) Level k ( y + k, k+1 k x) ( x + k, k+1 k y)

Utilities for the first four levels 19 / 27 1 (1, 0) 21 11 121 221 211 111 2121 1121 1221 2221 1211 2211 2111 1111 (1, 0) (0 + 2, 32 1 ) (1 + 2, 32 0 )

Utilities for the first four levels 19 / 27 ( 2, 3 ) 2 21 1 (1, 0) (3, 0) 11 121 221 211 111 2121 1121 1221 2221 1211 2211 2111 1111 (1, 0) (0 + 2, 32 1 ) (1 + 2, 32 0 )

Utilities for the first four levels 19 / 27 ( 2, 3 ) 2 21 1 (1, 0) (3, 0) 11 121 221 211 111 2121 1121 1221 2221 1211 2211 2111 1111 ( 2, 3 ) 2 (3, 0) ( 3 2 + 3, 4 ) ( 3 2 2 + 3, 4 3 3 ) 2 (0 + 3, 43 3 ) (3 + 3, 43 0 )

Utilities for the first four levels 19 / 27 ( 9 ) ( 2, 3 ) 2 21 1 (1, 0) (3, 0) 11 2, 8 3 121 (5, 2) 221 (3, 4) 211 (6, 0) 111 2121 1121 1221 2221 1211 2211 2111 1111 ( 2, 3 ) 2 (3, 0) ( 3 2 + 3, 4 ) ( 3 2 2 + 3, 4 3 3 ) 2 (0 + 3, 43 3 ) (3 + 3, 43 0 )

Utilities for the first four levels 19 / 27 ( 9 ) ( 2, 3 ) 2 21 1 (1, 0) (3, 0) 11 2, 8 3 121 (5, 2) 221 (3, 4) 211 (6, 0) 111 2121 1121 1221 2221 1211 2211 2111 1111 ( 9 2, 8 ) 3 (3, 4) ( 8 3 + 4, 5 4 9 ) ( 9 2 2 + 4, 5 4 8 ) 3 (4 + 4, 54 3 ) (3 + 4, 54 4 )

Utilities for the first four levels 19 / 27 ( 9 ) ( 2, 3 ) 2 21 1 (1, 0) (3, 0) 11 2, 8 3 121 (5, 2) 221 (3, 4) 211 (6, 0) 111 2121 ( 20 3, 45 ) 8 1121 ( 17 2, 10 ) 3 1221 2221 1211 ( ) 8, 15 4 2211 (7, 5) 2111 1111 ( 9 2, 8 ) 3 (3, 4) ( 8 3 + 4, 5 4 9 ) ( 9 2 2 + 4, 5 4 8 ) 3 (4 + 4, 54 3 ) (3 + 4, 54 4 )

Utilities for the first four levels 19 / 27 ( 9 ) ( 2, 3 ) 2 21 1 (1, 0) (3, 0) 11 2, 8 3 121 (5, 2) 221 (3, 4) 211 (6, 0) 111 2121 ( 20 3, 45 ) 8 1121 ( 17 2, 10 ) 3 1221 ( 6, 25 ) 4 2221 ( 9, 5 ) 2 1211 ( 8, 15 ) 4 2211 (7, 5) 2111 ( 4, 15 ) 2 1111 (10, 0) (5, 2) (6, 0) (2 + 4, 54 5 ) (5 + 4, 54 2 ) (0 + 4, 54 6 ) (6 + 4, 54 0 )

Solution for the alternating policy 20 / 27 Let p k denote the alternating policy of length k, p k = 1212... Theorem (K,Narodytska,Walsh 2013+) The expected social welfare for the alternating policy is sw ( p k) k(2k + 1) = + O( k). 3 The expected utility difference between the agents is d k := u 1 ( p k) u 2 ( p k) = k 3 + O( k).

Reformulation 21 / 27 Defect pairs For a policy p we measure the deviation from p k by the pair (x p, y p ) = (u 1 (p) u 1 ( p k), u 2 (p) u 2 ( p k)) sw(p) sw( p k ) = x p + y p The optimality of p k for all k is equivalent to Theorem For all k 1, if (x, y) is the defect pair for a policy of length k then x + y 0.

Defect pairs for k = 10 22 / 27

Small defect pairs for k = 10 23 / 27

Recursion for defect pairs 24 / 27 (0, 0) is the only defect pair for k = 1. Recursion for k 2 Level k 1 (x, y) L k 1 R k 1 Level k ( ) ( ) y, k+1 k x x + d k, k+1 k (y d k)

Recursion for defect pairs 24 / 27 (0, 0) is the only defect pair for k = 1. Recursion for k 2 Level k 1 (x, y) L k 1 R k 1 Level k ( ) ( ) y, k+1 k x x + d k, k+1 k (y d k) where d k is the utility difference for the alternating policy: d k = u 1 ( p k) u 2 ( p k).

Defect pairs in the policy tree 25 / 27 (0, 0) 121 (0, 0) 21 ( ) 1 2, 2 3 221 1 (0, 0) ( 3 2, 4 ) 3 211 ( 1, 3 ) 2 11 ( ) 3 2, 8 3 111 2121 (0, 0) 1121 ( ) 11 6, 55 24 1221 ( 2 3, 5 ) 8 2221 ( ) 7 3, 25 8 1211 ( ) 4 3, 15 8 2211 ( ) 1 3, 5 8 2111 ( 8 3, 15 ) 8 1111 ( ) 10 3, 45 8

Proof strategy 26 / 27 Prove by induction on k the following statement: Proposition If (x, y) is a defect pair for a policy of length k then 1 x + y 0, 2 m 1 : (x, y ) = R k+m 1 R k (x, y) = x + y 0, 3 m 1 : (x, y ) = R k+m 1 R k+1 L k (x, y) = x +y 0.

Outlook 27 / 27 Generalize the optimality result for the alternating policy to more than two agents, convex utility functions, i.e. the utility difference between consecutive items decreases with the rank, different probability distributions on the set of profiles. Study different social welfare measures. What happens if agents behave strategically?