Optimal Control of Batch Service Queues with Finite Service Capacity and General Holding Costs

Similar documents
Dynamic Admission and Service Rate Control of a Queue

Martingales. by D. Cox December 2, 2009

BEHAVIOUR OF PASSAGE TIME FOR A QUEUEING NETWORK MODEL WITH FEEDBACK: A SIMULATION STUDY

arxiv: v1 [math.pr] 6 Apr 2015

Admissioncontrolwithbatcharrivals

Portfolio Optimization with Alternative Risk Measures

Forecast Horizons for Production Planning with Stochastic Demand

An optimal policy for joint dynamic price and lead-time quotation

The ruin probabilities of a multidimensional perturbed risk model

Last Time. Martingale inequalities Martingale convergence theorem Uniformly integrable martingales. Today s lecture: Sections 4.4.1, 5.

Lecture 7: Bayesian approach to MAB - Gittins index

The value of foresight

Optimally Thresholded Realized Power Variations for Lévy Jump Diffusion Models

17 MAKING COMPLEX DECISIONS

Large-Scale SVM Optimization: Taking a Machine Learning Perspective

A Robust Option Pricing Problem

Yao s Minimax Principle

Design of Information Sharing Mechanisms

Stochastic Optimal Control

Arbitrages and pricing of stock options

3 Arbitrage pricing theory in discrete time.

EE266 Homework 5 Solutions

ON COMPETING NON-LIFE INSURERS

X i = 124 MARTINGALES

Risk minimizing strategies for tracking a stochastic target

An Introduction to Stochastic Calculus

Final exam solutions

Lecture 23: April 10

Information aggregation for timing decision making.

Dynamic and Stochastic Knapsack-Type Models for Foreclosed Housing Acquisition and Redevelopment

Slides for Risk Management

The Stigler-Luckock model with market makers

Online Algorithms SS 2013

based on two joint papers with Sara Biagini Scuola Normale Superiore di Pisa, Università degli Studi di Perugia

On the Optimality of FCFS for Networks of Multi-Server Queues

Modelling long term interest rates for pension funds

Optimal Dividend Policy of A Large Insurance Company with Solvency Constraints. Zongxia Liang

The Value of Information in Central-Place Foraging. Research Report

Making Complex Decisions

Introduction to game theory LECTURE 2

EVA Tutorial #1 BLOCK MAXIMA APPROACH IN HYDROLOGIC/CLIMATE APPLICATIONS. Rick Katz

- Introduction to Mathematical Finance -

1 Online Problem Examples

Homework Assignments

IEOR E4703: Monte-Carlo Simulation

Dynamic pricing and scheduling in a multi-class single-server queueing system

The Double Skorohod Map and Real-Time Queues

induced by the Solvency II project

Laws of probabilities in efficient markets

6.231 DYNAMIC PROGRAMMING LECTURE 3 LECTURE OUTLINE

Two hours UNIVERSITY OF MANCHESTER. 23 May :00 16:00. Answer ALL SIX questions The total number of marks in the paper is 90.

Non-Deterministic Search

4 Reinforcement Learning Basic Algorithms

Arbitrage Theory without a Reference Probability: challenges of the model independent approach

Self-organized criticality on the stock market

Optimal Policies for Distributed Data Aggregation in Wireless Sensor Networks

From Discrete Time to Continuous Time Modeling

An Application of Ramsey Theorem to Stopping Games

Liquidation of a Large Block of Stock

Pricing and hedging in incomplete markets

On Complexity of Multistage Stochastic Programs

On the Lower Arbitrage Bound of American Contingent Claims

arxiv: v2 [math.lo] 13 Feb 2014

Martingale Pricing Theory in Discrete-Time and Discrete-Space Models

Dynamic Programming: An overview. 1 Preliminaries: The basic principle underlying dynamic programming

Assets with possibly negative dividends

Stochastic Games with 2 Non-Absorbing States

Output Analysis for Simulations

Lecture Outline. Scheduling aperiodic jobs (cont d) Scheduling sporadic jobs

Pakes (1986): Patents as Options: Some Estimates of the Value of Holding European Patent Stocks

Optimal Stopping. Nick Hay (presentation follows Thomas Ferguson s Optimal Stopping and Applications) November 6, 2008

Corrections to the Second Edition of Modeling and Analysis of Stochastic Systems

Non replication of options

Disclosure Risk Measurement with Entropy in Sample Based Frequency Tables

Lecture 2: Making Good Sequences of Decisions Given a Model of World. CS234: RL Emma Brunskill Winter 2018

Call Admission Control for Preemptive and Partially Blocking Service Integration Schemes in ATM Networks

Asymptotic results discrete time martingales and stochastic algorithms

Decision Theory: Value Iteration

Practical example of an Economic Scenario Generator

Markov Decision Processes: Making Decision in the Presence of Uncertainty. (some of) R&N R&N

From Bayesian Auctions to Approximation Guarantees

Minimal Variance Hedging in Large Financial Markets: random fields approach

16 MAKING SIMPLE DECISIONS

16 MAKING SIMPLE DECISIONS

Applications of Lévy processes

Introduction to Probability Theory and Stochastic Processes for Finance Lecture Notes

MTH6154 Financial Mathematics I Interest Rates and Present Value Analysis

4: SINGLE-PERIOD MARKET MODELS

Are the Azéma-Yor processes truly remarkable?

University of Groningen. Inventory Control for Multi-location Rental Systems van der Heide, Gerlach

Essays on Some Combinatorial Optimization Problems with Interval Data

Optimum Thresholding for Semimartingales with Lévy Jumps under the mean-square error

Lesson 3: Basic theory of stochastic processes

STAT/MATH 395 PROBABILITY II

Dynamic Pricing in Ridesharing Platforms

4 Martingales in Discrete-Time

RECURSIVE VALUATION AND SENTIMENTS

ADVANCED MACROECONOMIC TECHNIQUES NOTE 7b

Dynamic Pricing Policies for an Inventory Model with Random Windows of Opportunities

DRAFT. 1 exercise in state (S, t), π(s, t) = 0 do not exercise in state (S, t) Review of the Risk Neutral Stock Dynamics

Transcription:

Queueing Colloquium, CWI, Amsterdam, February 24, 1999 Optimal Control of Batch Service Queues with Finite Service Capacity and General Holding Costs Samuli Aalto EURANDOM Eindhoven 24-2-99 cwi.ppt 1

Background Ph.D. Thesis: Studies in Queueing Theory, University of Helsinki, Finland, 1998 [1] S. Aalto (1997) Optimal control of batch service queues with Poisson arrivals and finite service capacity, Dep Mathematics, University of Helsinki [2] S. Aalto (1998) Optimal control of batch service queues with compound Poisson arrivals and finite service capacity, Math Meth Oper Res [3] S. Aalto (1998) Characterization of the output rate process for a Markovian storage model, J Appl Prob [4] S. Aalto (1998) Output of a multiplexer loaded by heterogeneous on-off sources, Stoch Models Supervisors: Prof. E. Nummelin and Ph.D. T. Lehtonen 2

Contents Batch service queue Control problem Known results New results Open questions 3

Batch service queue In an ordinary queue customers are served individually In a batch service queue customers are served in batches of varying size Additional parameter needed: Q = service capacity = max nr of customers served in a batch 4

Evolution T 0 T 1 T 2 T 3 Service epochs T n Queue length X(t) X(t) S 1 S 2 S 3 Q Service times S n Service capacity Q t Time t Cost rate Z(t) Z(t) Important: cost rate increasing between service epochs t Time t 5

Queueing models considered M/G(Q)/1 Poisson arrivals generally distributed IID service times single server with service capacity Q M X /G(Q)/1 compound Poisson arrivals generally distributed IID service times single server with service capacity Q 6

An application 1 2 3 4 5 6 7

Contents Batch service queue Control problem Known results New results Open questions 8

Control problem Given arrival process A(t) and service times S n Determine service epochs T n service batches B n Operating policy π = ((T n ),(B n )) should be admissible 9

Optimal control Usual operating policy: after a service completion, a new service is initiated as soon as X ( t) 1 a service batch includes as many customers as possible This is certainly reasonable But what is the optimal operating policy? The answer depends on the cost structure and the objective function 10

Cost structure Holding costs: Z(t) described by the cost rate process Z(t) cost rate depends on the nr of waiting customers X(t), and the times they have been waiting, W 1 (t),,w X(t) (t) called linear if Z ( t) = h( X ( t)) Serving costs: K+cB n K per each service batch c per each customer served 11

Objective function Minimize the long run average cost φ π or the discounted cost V α π Among all the admissible operating policies π 12

Contents Batch service queue Control problem Known results New results Open questions 13

Known results Linear holding costs z=h(x) General holding costs z=h(x,w) Infinite service capacity Q= Case A: - Deb & Serfozo (1973) - Deb (1984) Case C: - Weiss (1979) - Weiss & Pliska (1982) Finite service capacity Q< Case B: - Deb & Serfozo (1973) Case D 14

Cases A and B: linear holding costs A B Deb & Serfozo (1973) Poisson arrivals finite or infinite service capacity average cost & discounted cost Deb (1984) compound Poisson arrivals infinite service capacity discounted cost case only Result: h(x) is uniformly increasing => a queue length threshold policy is optimal Note: Optimal threshold is never greater than Q 15

Queue length threshold policies A B Queue length threshold policy π x with threshold x: after a service completion, a new service is initiated as soon as X ( t) x a service batch includes as many customers as possible Sufficient to watch over the queue length process X(t) Note: the usual operating policy = π 1 16

Declaration for cases A and B A B Linear holding costs cost rate remains constant between arrivals system can be reviewed discretely, just when a service has just been completed or the server is free and a new customer arrives Semi-Markov decision technique can be applied no reason to start a new service until the next customer arrives queue length threshold policies are optimal 17

Case C: general holding costs & infinite service capacity C Weiss (1979), Weiss & Pliska (1982) compound Poisson arrivals infinite service capacity average cost case only Result: Z(t) is increasing (without limits when service is postponed forever) => a cost rate threshold policy is optimal 18

Cost rate threshold policies C Cost rate threshold policy π(z) with threshold z: after a service completion, a new service is initiated as soon as Z ( t) z a service batch includes as many customers as possible infinite capacity => all waiting customers Sufficient to watch over the cost rate process Z(t) If linear and non-decreasing holding costs ( Z(t) = h(x(t)) ), then cost rate threshold policies = queue length threshold policies 19

Declaration for case C C Infinite capacity queue can be emptied at every service epoch no reason to watch over the queue length X(t) each service starts a new regeneration cycle (as regards the stationary policies) General holding costs system needs to be reviewed continuously Semi-Markov decision technique cannot be applied: Cost rate Z(t) non-decreasing (until the next service) cost rate threshold policies are optimal 20

Contents Batch service queue Control problem Known results New results Open questions 21

New results Linear holding costs z=h(x) General holding costs z = h(x,w) Infinite service capacity Q= Case A: - Deb & Serfozo (1973) - Deb (1984) Case C: - Weiss (1979) - Weiss & Pliska (1982) Finite service capacity Q< Case B: - Deb & Serfozo (1973) Case D: - Aalto (1997) [1] - Aalto (1998) [2] 22

Case D1: General holding costs & finite capacity & single arrivals D1 Aalto (1997) [1] Poisson arrivals finite service capacity average cost & discounted cost cases Result: FIFO queueing discipline, consistent holding costs and no serving costs included (K = c = 0) => a cost rate threshold Q-policy is optimal 23

Consistent holding costs D1 Assume that Z ( t) = h( X ( t), W ( t)) where W(t) = (W 1 (t),w 2 (t), ) denotes the vector of the waiting times of the customers waiting at time t (in decreasing order) Definition: Holding costs are consistent if x x' and w w' h( x, w) h( x', w' ) Examples: 1. h(x,w)=x, 2. h(x,w)=w 1 + +w x 24

Cost rate threshold Q-policies D1 Cost rate threshold Q-policy π Q (z) with threshold z: after a service completion, a new service is initiated as soon as Z ( t) z or X ( t) Q a service batch includes as many customers as possible finite capacity => min{x(t),q} Necessary to watch over both the queue length process X(t) and the cost rate process Z(t) If linear and non-decreasing holding costs ( Z(t) = h(x(t)) ), then cost rate threshold Q-policies = queue length threshold Q-policies 25

Declaration D1 Finite capacity queue cannot be emptied at every service epoch First key observation: To minimize the holding costs, it is sufficient to consider the class of Q-policies Second key observation (due to single arrivals): For each Q-policy, the queue becomes empty at every non-trivial service epoch such an epoch starts a new regeneration cycle (as regards the stationary Q-policies) 26

Q-policies D1 An operating policy π is called a Q-policy if after a service completion, a new service is initiated at latest when X ( t) Q a service batch includes as many customers as possible finite capacity => min{x(t),q} 27

Why just Q-policies? D1 For each admissible policy π, it is possible to construct such a Q-policy π Q that π X Q ( t) X ( t) t Due to FIFO principle and consistent holding costs, this implies that π π Z Q ( t) π Z ( t) t 28

Non-trivial service epochs D1 Idea: Find such service completions that leave less than Q customers waiting For each Q-policy π, let π Nk = min{ n > Nk 1 X n ( Tn 1 + Sn ) < Q} Note: N 1 is the same for all Q-policies π Definition: Non-trivial service epochs π ~ T = T π k π π π N k π 29

Single arrivals Decision interval D1 T 0 T 1 Service epochs T n Queue length X 1 (t) S 1 S 2 Q Service times S n Service capacity Q Cost rate Z 1 (t) t Time t Note: cost rate increasing during decision intervals! t Time t 30

Stationary Q-policies D1 Definition: A Q-policy π is called stationary if the non-trivial service epochs constitute a renewal sequence, and the process (X π,w π ) is regenerative w.r.t. this sequence Long run average cost φ π : where π φ ~ C = ~ ~ π E[ C1( T1 )] ~ π E[ T ] 1 t ~ 1 ( t) : = Z1( u) 0 du 31

Average cost optimal stationary Q-policy D1 Denote: π φ = inf{ φ π stationary Q - policies} Theorem: The cost rate threshold Q-policy π Q (φ) is average cost optimal among all stationary Q-policies. Idea of the proof: Show first that the cost rate threshold Q-policy π Q (φ π )isbetterin the average cost sense for any stationary π Then iterate! 32

Discounted cost optimal stationary Q-policy D1 Discounted cost for a stationary Q-policy π: ~ ~ π [ ( )] t π E Dα,1 T1 ~ Vα = ~, where D,1( t) : = e π α 1 E[exp( αt )] Denote: Theorem: V α = inf{ V π stationary Q - policies} The cost rate threshold Q-policy π Q (αv α ) is discounted cost optimal among all stationary Q-policies. Similar proof as in the average cost case 1 π α 0 αu ~ Z 1 ( u) du 33

Discounted cost optimal Q-policy D1 Theorem: The cost rate threshold Q-policy π Q (αv α ) is discounted cost optimal among all Q-policies. Idea of the proof: for any policy π, consider a sequence of policies π* k,whereπ* k is identical to π up to the k th non-trivial service epoch but thereafter changes to the optimal stationary rule the difference between discounted costs of π and π* k canbemade arbitrarily small by taking k great enough π* k is better than π* k+1 in the discounted cost sense π* 0 does not depend on the original policy π but is, in fact, the optimal stationary Q-policy π Q (αv α ) 34

Average cost optimal Q-policy D1 Theorem: The cost rate threshold Q-policy π Q (φ) is average cost optimal among all Q-policies. Idea of the proof: Limitofthediscountedcostcaseasα tends to 0 35

Case D2: General holding costs & finite capacity & group arrivals D2 Aalto (1998) [2] compound Poisson arrivals finite service capacity discounted cost case only Result: FIFO queueing discipline consistent holding costs, no serving costs included (K = c = 0) and bounded arrival batches ( M) => a general threshold Q-policy is optimal 36

General threshold Q-policies D2 General threshold Q-policy π Q (z,ζ) with threshold z and (nondecreasing) value function ζ: after a service completion, a new service is initiated as soon as Z ( t) + ζ ( X ( t)) z or X ( t) Q a service batch includes as many customers as possible finite capacity => min{x(t),q} Necessary to watch over both the queue length process X(t) and the cost rate process Z(t) If linear and non-decreasing holding costs ( Z(t) = h(x(t)) ), then general threshold Q-policies = queue length threshold Q-policies 37

Declaration D2 Finite capacity queue cannot be emptied at every service epoch First key observation: To minimize the holding costs, it is (still) sufficient to consider the class of Q-policies Second key observation (due to FIFO principle): All those customers that remain waiting at a non-trivial service epoch arrived at that time => their waiting times are zero such an epoch starts a new semi-regeneration cycle (as regards the stationary Q-policies) 38

Batch arrivals Decision interval D2 T 0 T 1 Service epochs T n Queue length X 1 (t) S 1 S 2 Q Service times S n Service capacity Q t Time t Cost rate Z 1 (t) Note: cost rate increasing during decision intervals! t Time t 39

Stationary Q-policies D2 Definition: A Q-policy π is called stationary if the non-trivial service epochs together with the number of customers that remain waiting at those epochs (ξ π k )constitutea Markov renewal sequence, and the process (X π,w π ) is semi-regenerative w.r.t. this sequence Discounted cost for a stationary Q-policy π: π V = 6 π V π where 6 π ~ ~ π ~ π v( x) : = Ex [ D1 ( T1 ) + exp( T1 ) v( ξ1 π )] Note: ξ π k <M forallπ and k 40

Discounted cost optimal stationary Q-policy (1) D2 Definition: Let I M = the non - decreasing functions defined on {0,1,, M 1} With each v in I M, associate a general threshold Q-policy π v : v π = π ( αv (0), ) Proposition: For any v in I M and x in {0,1,,M-1} 6 π v Definition: For any v in I M and x in {0,1,,M-1}, let Q π v( x) = inf{ 6 v( x) π Q policies} π Q v 6 v( x) = 6 v( x) v Q ) 41

Discounted cost optimal stationary Q-policy (2) D2 Definition: Let Proposition: Proposition: Theorem: * 6 = 6 IM where IM = { v IM v 6v } 6 * * has a * * M v I M v I 6 unique fixed point w I The general threshold Q-policy π w is discounted cost optimal among all stationary Q-policies. * * M 42

Discounted cost optimal Q-policy D2 Theorem: The general threshold Q-policy π w is discounted cost optimal among all Q-policies. Idea of the proof: for any policy π, consider a sequence of policies π* k,whereπ* k is identical to π up to the k th non-trivial service epoch but thereafter changes to the optimal stationary rule the difference between discounted costs of π and π* k canbemade arbitrarily small by taking k great enough π* k is better than π* k+1 in the discounted cost sense π* 0 does not depend on the original policy π but is, in fact, the optimal stationary Q-policy π w 43

Case B2 (as a special case of D2): Linear holding costs & finite capacity & group arrivals B2 Aalto (1998) [2] compound Poisson arrivals finite service capacity discounted cost case only Corollary (of Case D2): linear holding costs with h(x) non-decreasing, no serving costs included (K = c = 0) and bounded arrival batches => a queue length threshold Q-policy is optimal 44

Contents Batch service queue Control problem Known results New results Open questions 45

Case D1: General holding costs & finite capacity & single arrivals D1 If serving costs are included (K > 0, c > 0), What is the optimal policy in the average cost or discounted cost sense? 46

Case D2: General holding costs & finite capacity & group arrivals D2 How to get rid of the boundedness assumption concerning the arrival batches? If no serving costs are included (K = 0, c = 0), Is it true that similar results are valid in the average cost case as in the discounted cost case? If serving costs are included (K > 0, c > 0), What is the optimal policy in the average cost or discounted cost sense? 47

Case B2 (as a special case of D2): Linear holding costs & finite capacity & group arrivals B2 How to get rid of the boundedness assumption concerning the arrival batches? If no serving costs are included (K = 0, c = 0), Is it true that similar results are valid in the average cost case as in the discounted cost case? If serving costs are included (K > 0, c > 0), What is the optimal policy in the average cost or discounted cost sense? 48

LOPPU 49