ADAPTIVE SIMULATION BUDGET ALLOCATION FOR DETERMINING THE BEST DESIGN. Qi Fan Jiaqiao Hu
|
|
- Marybeth Douglas
- 5 years ago
- Views:
Transcription
1 Proceedings of the 013 Winter Simulation Conference R. Pasupathy, S.-H. Kim, A. Tol, R. Hill, and M. E. Kuhl, eds. ADAPTIVE SIMULATIO BUDGET ALLOCATIO FOR DETERMIIG THE BEST DESIG Qi Fan Jiaqiao Hu Department of Applied Mathematics and Statistics State University of ew Yor Stony Broo, Y 11794, USA ABSTRACT We consider the prolem of allocating a given simulation udget among a set of design alternatives in order to maximize the proaility of correct selection. Prior wor has focused on deriving static rules that predetermine the numer of simulation replications to e allocated to each design. In contrast, we formulate the prolem as a Marov decision process MDP and propose a dynamic myopic scheme to adaptively allocate simulation samples ased on current estimates of the means and variances of the design alternatives. We provide numerical examples to illustrate the performance of the proposed dynamic allocation rule. 1 ITRODUCTIO We consider the prolem of identifying the est design from a finite set of design alternatives. Each design is assumed to involve random uncertainty and requires stochastic simulation for performance estimation. When simulation is expensive and the numer of design alternatives is relatively small, a well-nown class of procedures for solving such prolems is raning and selection, where the goal is to determine the numer of simulation runs to e allocated to each design in order to guarantee a pre-specified correct selection proaility. Examples of raning and selection methods include Rinott s two-stage indifference zone procedure Rinott 1978, the expected value of information procedure Chic and Inoue 001, and the K family of algorithms Kim and elson 006; some reviews and advances in this field can e found in e.g., Goldsman and elson 1998, elson et al. 001, and Kim and elson 007. Chen 1995 approached the prolem from a different perspective y determining the est allocation of a given simulation udget among the designs in order to maximize the proaility of correct selection PCS. In particular, Chen 1996 proposed to use a Bayesian approach to estimate the design performance measures ased on prior sampling information and derived a lower ound for the correct selection proaility. The idea was susequently used in Chen, Chen, and Yücesan 000 and Chen et al. 000 to develop analytical allocation rules called Optimal Computing Budget Allocation OCBA that asymptotically optimize the lower ounds of the proaility of correct selection. More recently, Chen et al. 006 also investigated a dynamic allocation rule ased on perfect information assumption, and suggested that a dynamic scheme could dramatically improve the performance of static allocation rules. In this paper, we consider the setting of OCBA, i.e., maximizing PCS under a simulation udget constraint. However, unlie previous wor, which has primarily focused on static rules that predetermine the numer of simulation replications to e allocated to each design, we investigate a dynamic programming approach that adaptively allocates simulation samples ased on current estimates of the means and variances of various designs. Our wor can e viewed as an extension of that of Chen et al. 006 ased on perfect information. In particular, we model the simulation allocation process as a Marov decision process MDP with a terminal cost function. The state variale of the MDP model consists of the current sample mean of each design and the numer of simulation replications allocated to each of them y assuming that /13/$ IEEE 888
2 design variances are nown. These variances can then e estimated y sample variances. Since analytically solving the MDP model is intractale, we further develop an upper ound to the optimal value function and propose a one-step looahead index policy to myopically minimize the sum of the current one-stage cost and the upper ound of the optimal value function. Our preliminary numerical results indicate competitive performance of our approach with that of OCBA, especially when the simulation udget is small. The rest of the paper is organized as follows: In Section, we define notations and descrie the prolem setting. In Section 3, we formulate the prolem as an MDP, provide an upper ound to the optimal value function, and derive a myopic index policy for simulation allocation. umerical examples are provided in Section 4 to illustrate the performance of our approach. Finally, we conclude the paper in Section 5. PROBLEM SETTIG Consider the following optimization prolem: min J i min EL iξ ] i Θ i Θ where Θ = {1,,...,} is a finite set of design indices and J i is the true performance measure of design i. ote that J i itself is the expectation of the sample performance L i ξ, where the expectation is understood with respect to the distriution of the random variale ξ representing the stochastic uncertainty of the design. We assume that the expectation cannot e evaluated exactly; however, for a given simulation udget t, the performance measure J i can e estimated y the sample mean J t i 1 t i i t L i ξ i j, j=1 where t i represents the numer of replication runs allocated to design i and ξ i j represents the jth realization of ξ simulation sample path from design i. Throughout this paper, we assume that the simulation outputs are independent of each other. We egin y defining some notations. : the true variance of design i, i.e., σ i = VarL i ξ, which can e estimated y its sample variance. t : the index of the design that shows the current est sample performance after t simulation replications have een allocated, i.e., t t min i i t. s t : the index of the design that shows the second est sample performance after t simulation runs have een allocated, i.e., s t t min i t i t. δ t t,i = t t : the difference etween the sample performance of the current est and the ith designs. σi σ t t,i = σ t t t J t i + σ i : the standard deviation of the random variale δ t i t t,i. Define the event of correct selection CS as the event that design t i.e., the one with the current est sample performance is actually the est design. For a given simulation udget, the goal is to find a way to maximize the proaility of correct selection P{CS}. We follow the Bayesian approach introduced in Chen 1996 and assume that the output performance measure L i ξ is normally distriuted for each design. Let J i e a random variale whose distriution is the posterior distriution of design i given the previous sampling information: P{ J i } = P{J i L i ξ i j, j = 1,,, t i }, for i = 1,,,. It can e shown that if no priori nowledge is given on the performance of each design, J i has the normal distriution J i i t, σ i. Thus, a lower ound to P{CS} called approximate proaility t i 889
3 of correct selection APCS can e otained y applying Bonferroni s inequality. P{CS} = P{J t < J i, i t L i ξ i j, j = 1,,,i t,i = 1,,,} = P{ J t < J i, i t }, = P{ J t J i < 0} i t 1 P{ J t J i > 0} i t = 1 i t = APCS, δ t t,i σ t t,i where throughout this paper, we use and φ to denote the c.d.f. and p.d.f. of the standard normal distriution. Since P{CS} is difficult to evaluate, whereas APCS can e computed analytically without resorting to additional simulation effort, Chen et al. 000 proposed to use APCS as an approximation to the true proaility of correct selection. For a given udget T, the goal is to find a simulation udget allocation that solves the following optimization prolem. min 1 T,T,,T s.t. δ T T,i σ T T,i T i T and T i A DYAMIC BUDGET ALLOCATIO PROCEDURE Motivated y the wor of Hu et al. 011 and that of Chen et al. 006, we aim to solve the allocation prolem y modeling the allocation process as an MDP model with a terminal cost function. So instead of allocating all T simulation replications at the eginning as in 1, we derive a dynamic policy that sequentially allocates the udget ased on the estimated performance measure of all designs as well as the current sampling information. 3.1 Modeling the Allocation Process as an MDP Given a total of T simulation samples, we start y assigning at step t = 0 a small numer n 0 simulation replications to each of the designs. Define the state variale w t = i t,t i,i = 1,,,T as a vector containing the current performance estimates of all designs and the numer of times each design has een sampled thus far, where i 0 = n 0 and i 0 = 1 n 0 n 0 j=1 L i ξ i j for all i. ext consider a sequential allocation policy π that determines, at each step t = 1,,,T n 0 ased on w t, whether one more replication run should e allocated to one of the designs or the entire allocation process should e terminated. Let π t w t {0,1,...,} e the action taen at step t, then { i allocate one simulate run to design i, i = 1,,, π t w t = 0 stop the allocation process. 890
4 For every allocation policy π descried aove, it can e seen that {w t } is a Marov chain with the following state transition dynamics: i t+1 = t i i t +Y ii {a=i} t i + I {a=i} for i = 1,,, 3 t+1 i = t i + I {a=i} for i = 1,,,, 4 where I is the indicator function, a is the action taen at step t under π, and Y i is the simulation output performance measure of design i after the additional allocation. Based on 3 and 4, the updating formula for the current est sample mean is given y: t+1 t+1 = where Z + = max{0,z}. Let w = i, i,i = 1,..., T σ + σ i t t +Y t + s t t J s t t t t t t +I if a = t {a=i} t t J t t t aj a+y t a I + {a=i} if a a+i t {a=i} t, e a given state and define = argmin i J i, δ,i = J J i, and σ,i = i. By associating the state action pair w,a with the following one-stage cost function: { 0 if a 0 R t w,a = δ,i σ,i if a = 0 and R s w,a = 0 for all s t + 1 whenever π t w = 0, we otain a T n horizon MDP with the total cost V π T n 0 w = E R t w t,π t w t ] w0 = w, t=0 where the expectation is taen respect to the proaility measure induced y π. For a given initial state w 0 = w, the ojective is to find an optimal simulation allocation policy π to minimize the total cost accumulated efore the allocation process terminates. 3. A Myopic Index Policy Since otaining the exact optimal policy for the MDP model is intractale, we derive a myopic index policy using one-step looahead optimization. The following result provides an upper ound to the optimal value function. Theorem 1 Let V t e the optimal cost-to-go function at stage t of the MDP defined in Section 3.1. For every t = 0,1,,,T n 0 and w = i, i,i = 1,..., T, we have δ,i V t w. 6 Proof. At the final stage t = T n 0, all T replications have een exhausted, so the only option is to stop the allocation process. Therefore, we must have πt n 0 w = 0 and V T n0 w = δ,i. It follows that when t = T n 0 1, σ,i σ,i V T n0 1w = min a E a R T n0 1w,a +V T n0 w ] 5 891
5 where a {0,1,,,} and w = J i, i,i = 1,,,T is the next state generated according to the transition dynamics 3 and 4 when action a is taen, in particular, J i = ij i +Y i I {a=i} i +I and {a=i} i = i + I {a=i}. Therefore, V T n0 1w = mine a R T n0 a 1w,a +V T n0 w ] δ,i = min{,min E a a 0 δ,i σ,i, σ,i σ + σ i i δ,i σ,i ] } where = argmin i J i, δ,i = J i, and σ,i =. ow proceed y induction and assume that V t+1 w δ,i σ,i for all w. Then V t w = mine a R t w,a +V t+1 w ] a δ,i min{,min σ E a,i a 0 δ,i. This completes the proof of the theorem. σ,i δ,i Motivated y Theorem 1, we propose a simple stationary greedy policy that minimizes the sum of the current one-stage cost function and the upper ound of the optimal cost-to-go function at each step: 0 if πw = δ,i σ,i min a 0 E a δ ],i σ,i argmin a 0 E a δ ] 7,i σ otherwise.,i By connecting π to 1, it is not difficult to see that if one more simulation sample is needed, such a policy myopically allocates the next sample in such a way so that the APCS in the next step is maximized after the additional allocation. Since π is an index policy, we can create an index for each action a ased on the current state w. Denote y indexa as the index of action a and let w = J i, i,i = 1,...,T e the sampled next state when action a is taen, we have three different cases ased on the transition dynamics 3, 4, and 5: case 1 If a = 0, case If a =, index = E l=1,l index0 = J s s σ J +Y + σ l J J i + σ J l + σ i i σ,i J s + ]}, 8 J s J +Y σ + σ + J +Y ], 9 89
6 case 3 If a and a 0, indexa = E a l=1,l a J a σ J a +Y a + σ l Fan and Hu + J l J + J a J a +Y a σ + σ a a + a J a +Y a ], 10 where = argmin i J i and σ is the variance of design. Let B s = { J +Y J s }. The + operator can e removed from 9 y conditioning on event B s : index = E l + E l +Y σ J l + σ l J s J l σs s + σ l + 1 ] B s PB s + J s J +Y σ s s + σ B ] c s PB c s. 11 Similarly, y conditioning on B = { J +Y }, the index in case 3 can e otained as indexa = E a l a + E a l a a a +Y a σa + σ l J J l σ + σ l J l + 1 ] B PB + J a J a +Y a σ + σ a B ] c PB c A Dynamic Budget Allocation Algorithm ote that when a = 0 the index in case 1 can e calculated analytically, whereas calculating the performance indices in 11 and 1 require evaluating the expectations with respect to the design distriutions. One natural approach to evaluate/estimate these expectations is to use Taylor expansion. Taing 11 as an example, we can treat each respective term as a function of the sample mean J +Y and perform a first order Taylor expansion of the term around. In addition, y replacing the true mean of the current est design with its sample mean, PB s can e approximated y +1 J s σ. Thus when a =, we can approximate the index of action y the following analytical formula: index = l J φ l σ l φ σ s J J l σ + σ l J J s s + σ J J s σs s + σ + σ l ] σ σ + 1 J s J + l l σs + 1 σ + 1 J s + σ l s + σ l σ s s + σ σ ] + 1 J s φ σ J + 1 J s J s φ σ J σ J
7 Similarly, when a and a 0, indexa can e approximated y ] indexa 1 = + a J l a + 1 a l a σa + σ l l + φ l a φ a J l σa + σ l a σ + a σ + σ a σa a + 1 J J + l σ l a + σ l + σ l a + 1 σ + σ a ] a + 1 φ J a a a + 1 φ J a J a. 14 Finally, y replacing true variances with sample variances, we propose the following algorithm for simulation udget allocation. Dynamic Simulation Budget Allocation DSBA Step 0: Perform n 0 simulation replications for all designs. Calculate the sample mean and sample variance for each design. Step 1: For each action a {0,...,}, compute the index of action a according to 8, 13, and 14. Step : Select the action a with the smallest index value. If a = 0, then stop the allocation process; else if a = i, perform one simulation replication for design i, update the sample mean and sample variance of design i. Increase the numer of simulation replications to design i y 1 and go ac to Step 1 until the given udget is exhausted. 4 UMERICAL RESULTS In this section, we test the proposed DSBA algorithm and compare its performance with that of OCBA on some simple examples. OCBA was derived ased on analytically solving the static optimization prolem 1. It has een shown in Chen et al. 000 that the asymptotically optimal solution to the prolem as T satisfies the following conditions. 1 i σi /δ j =,i σ j /δ, j = σ,i i σi, where i is the numer of samples allocated to design i, δ,i = J i, min i J i, and σ i is the standard deviation of the performance measure for design i, which can e estimated y sample variance. OCBA sequentially allocates a given udget T y splitting it into atches of size. Then at each step, the current computing udget is increased y and a udget allocation is calculated using conditions 1 and ased on the updated computing udget. This allocation is then used to determine the numer of additional simulation runs need to e allocated to each design. The process continues until all udget has een consumed. We consider the following examples in our computational experiment. Example 1 : This is a special example where the est design has a zero variance and the rest two designs have the same performance: X 1 j 0,0,X j 0.4,3,X 3 j 0.4,3 894
8 Example : There are five design alternatives with the est design eing deterministic: X 1 j 0,0,X j 0.4,1.5,X 3 j 0.4,3,X 4 j 1,3,X 5 j,3 Example 3 : This example is an extension of the previous one with the deterministic design removed: X 1 j 0,1.5,X j 0.6,3,X 3 j 1,3,X 4 j,3 Example 4 : This is an example with three alternatives, all of which are random: X 1 j 1,1,X j 1.5,3,X 3 j 1.5,3 In our experiment, the initial numer of replications n 0 is set to 10 for oth DSBA and OCBA. Figure 1 shows the performance of oth algorithms for each of the four respective test cases, where the true P{CS} in each case is estimated y the proportional of times the est design is found y an algorithm out of 10,000 independent experiments. The figure indicates competitive performance of DSBA with that of OCBA in all test cases. In particular, DSBA outperforms OCBA when the simulation udget is small, whereas OCBA shows slightly etter performance when the udget is increased, especially in the last case. Our conjecture is that this is due to the asymptotic optimality of OCBA, whereas DSBA is myopic in nature. 5 COCLUSIO In this paper, we have introduced a dynamic simulation udget allocation procedure for determining the est design from a set of finite design alternatives. The idea is to use a myopic one-step looahead policy to approximately solve an underlying MDP characterizing the udget allocation process. Such a policy gives rise to a stationary index rule that adaptively determines at each step which design should e simulated next in order to myopically maximize the approximate proaility of correction after the additional allocation. Our preliminary numerical results indicate that our approach may provide competitive performance with that of OCBA, especially when the computing udget is small. ACKOWLEDGMETS This wor was supported in part y the Air Force Office of Scientific Research under Grant FA and y the ational Science Foundation under Grant CMMI REFERECES Chen, C.-H An Effective Approach to Smartly Allocate Computing Budget for Discrete Event Simulation. In Prodeedings of the 34th IEEE Conference on Decision and Control, Piscataway, J: IEEE. Chen, C.-H A Low Bound for the Correct Suset-Selection Proaility and Its Application to Discrete-Event System Simulation. IEEE Transaction on Automatic Control 41: Chen, C.-H., D. He, and M. Fu Efficient Dynamic Simulation Allocation in Ordinal Optimization. IEEE Transaction on Automatic Control 51: Chen, C.-H., J. Lin, E. Yücesan, and S. E. Chic Simulation Budget Allocation for Further Enhancing the Efficiency of Ordinal Optimization. Discrete Event Dynamic System: Theory and Applications 10: Chen, H.-C., C.-H. Chen, and E. Yücesan Computing Efforts Allocation for Ordinal Optimization and Discrete Event Simulation. IEEE Transaction on Automatic Control 45: Chic, S. E., and K. Inoue ew Two-Stage and Sequential Procedures for Selecting the Best Simulated System. Operations Research 49: Goldsman, D., and B. L. elson Comparing systems via simulation. In Handoo of simulation, edited y J. Bans, ew Yor: John Wiley. 895
9 Figure 1: Comparison of OCBA and DSBA. Hu, J., H. S. Chang, M. C. Fu, and S. I. Marcus Dynamic Sample Budget Allocation in Model-Based Optimization. Journal of Gloal Optimization 50: Kim, S.-H., and B. L. elson Selecting the Best System. In Handoos in Operations Research and Mangement Science: Simulation, edited y S. G. Henderson and B. L. elson, Chapter 17, Oxford, UK: Elsevier Science. Kim, S.-H., and B. L. elson Recent Advances in Raning and Selection. In Prodeedings of the 007 Winter Simulation Conference, edited y S. Henderson, B. Biler, M.-H. Hsieh, J. Shortle, J. Tew, and R. Barton, Piscataway, J: IEEE. elson, B. L., J. Swann, D. Goldsman, and W. Song Simple Procedures for Selecting the Best Simulated System when the umer of Alternatives Is Large. Operations Research 49: Rinott, Y On Two-Stage Selection Procedures and Related Proaility Inequalities. Communications in Statistics - Theory and Methods 7: AUTHOR BIOGRAPHIES Qi Fan is a Ph.D. student in the Department of Applied Mathematics and Statistic at the State University of ew Yor, Stony Broo. He received the B.S. degree in mathematics from Zhejiang University, China in 011. His research interests include Marov decision processes, optimization and simulation. His address is qfan@ams.sunys.edu. 896
10 JIAQIAO HU is an Associate Professor in the Department of Applied Mathematics and Statistics at the State University of ew Yor, Stony Broo. He received the B.S. degree in automation from Shanghai Jiao Tong University, the M.S. degree in applied mathematics from the University of Maryland, Baltimore County, and the Ph.D. degree in electrical engineering from the University of Maryland, College Par. His research interests include Marov decision processes, applied proaility, and simulation optimization. His address is 897
Provably Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models
Provaly Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models Retsef Levi Sloan School of Management, MIT, Camridge, MA, 02139, USA email: retsef@mit.edu Roin O. Roundy School of
More informationOptimal Production-Inventory Policy under Energy Buy-Back Program
The inth International Symposium on Operations Research and Its Applications (ISORA 10) Chengdu-Jiuzhaigou, China, August 19 23, 2010 Copyright 2010 ORSC & APORC, pp. 526 532 Optimal Production-Inventory
More informationRanking and selection (R&S) with multiple performance measures using incomplete preference information
Ranking and selection (R&S) with multiple performance measures using incomplete Ville Mattila and Kai Virtanen (ville.a.mattila@aalto.fi) Systems Analysis Laboratory Aalto University School of Science
More informationDynamic Portfolio Choice II
Dynamic Portfolio Choice II Dynamic Programming Leonid Kogan MIT, Sloan 15.450, Fall 2010 c Leonid Kogan ( MIT, Sloan ) Dynamic Portfolio Choice II 15.450, Fall 2010 1 / 35 Outline 1 Introduction to Dynamic
More informationLecture 7: Bayesian approach to MAB - Gittins index
Advanced Topics in Machine Learning and Algorithmic Game Theory Lecture 7: Bayesian approach to MAB - Gittins index Lecturer: Yishay Mansour Scribe: Mariano Schain 7.1 Introduction In the Bayesian approach
More informationProceedings of the 2014 Winter Simulation Conference A. Tolk, S. Y. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds.
Proceedings of the 204 Winter Simulation Conference A. Tolk, S. Y. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds. SAMPLE ALLOCATION FOR MULTIPLE ATTRIBUTE SELECTION PROBLEMS Dennis
More informationThe Optimal Choice of Monetary Instruments The Poole Model
The Optimal Choice of Monetary Instruments The Poole Model Vivaldo M. Mendes ISCTE Lison University Institute 06 Novemer 2013 (Vivaldo M. Mendes) The Poole Model 06 Novemer 2013 1 / 27 Summary 1 Tools,
More informationEFFECT OF IMPLEMENTATION TIME ON REAL OPTIONS VALUATION. Mehmet Aktan
Proceedings of the 2002 Winter Simulation Conference E. Yücesan, C.-H. Chen, J. L. Snowdon, and J. M. Charnes, eds. EFFECT OF IMPLEMENTATION TIME ON REAL OPTIONS VALUATION Harriet Black Nembhard Leyuan
More informationSequential Decision Making
Sequential Decision Making Dynamic programming Christos Dimitrakakis Intelligent Autonomous Systems, IvI, University of Amsterdam, The Netherlands March 18, 2008 Introduction Some examples Dynamic programming
More informationA Simple Method for Solving Multiperiod Mean-Variance Asset-Liability Management Problem
Available online at wwwsciencedirectcom Procedia Engineering 3 () 387 39 Power Electronics and Engineering Application A Simple Method for Solving Multiperiod Mean-Variance Asset-Liability Management Problem
More informationOptimal Search for Parameters in Monte Carlo Simulation for Derivative Pricing
Optimal Search for Parameters in Monte Carlo Simulation for Derivative Pricing Prof. Chuan-Ju Wang Department of Computer Science University of Taipei Joint work with Prof. Ming-Yang Kao March 28, 2014
More informationAmerican Barrier Option Pricing Formulae for Uncertain Stock Model
American Barrier Option Pricing Formulae for Uncertain Stock Model Rong Gao School of Economics and Management, Heei University of Technology, Tianjin 341, China gaor14@tsinghua.org.cn Astract Uncertain
More informationMicroeconomics II. CIDE, Spring 2011 List of Problems
Microeconomics II CIDE, Spring 2011 List of Prolems 1. There are three people, Amy (A), Bart (B) and Chris (C): A and B have hats. These three people are arranged in a room so that B can see everything
More informationA GENERALIZED MARTINGALE BETTING STRATEGY
DAVID K. NEAL AND MICHAEL D. RUSSELL Astract. A generalized martingale etting strategy is analyzed for which ets are increased y a factor of m 1 after each loss, ut return to the initial et amount after
More informationA lower bound on seller revenue in single buyer monopoly auctions
A lower bound on seller revenue in single buyer monopoly auctions Omer Tamuz October 7, 213 Abstract We consider a monopoly seller who optimally auctions a single object to a single potential buyer, with
More informationComparative Study between Linear and Graphical Methods in Solving Optimization Problems
Comparative Study between Linear and Graphical Methods in Solving Optimization Problems Mona M Abd El-Kareem Abstract The main target of this paper is to establish a comparative study between the performance
More informationUPDATE ON ECONOMIC APPROACH TO SIMULATION SELECTION PROBLEMS
Proceedings of the 2008 Winter Simulation Conference S. J. Mason, R. R. Hill, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. UPDATE ON ECONOMIC APPROACH TO SIMULATION SELECTION PROBLEMS Stephen E.
More informationJournal of Computational and Applied Mathematics. The mean-absolute deviation portfolio selection problem with interval-valued returns
Journal of Computational and Applied Mathematics 235 (2011) 4149 4157 Contents lists available at ScienceDirect Journal of Computational and Applied Mathematics journal homepage: www.elsevier.com/locate/cam
More informationForecast Horizons for Production Planning with Stochastic Demand
Forecast Horizons for Production Planning with Stochastic Demand Alfredo Garcia and Robert L. Smith Department of Industrial and Operations Engineering Universityof Michigan, Ann Arbor MI 48109 December
More informationOptimal Dam Management
Optimal Dam Management Michel De Lara et Vincent Leclère July 3, 2012 Contents 1 Problem statement 1 1.1 Dam dynamics.................................. 2 1.2 Intertemporal payoff criterion..........................
More informationTHis paper presents a model for determining optimal allunit
A Wholesaler s Optimal Ordering and Quantity Discount Policies for Deteriorating Items Hidefumi Kawakatsu Astract This study analyses the seller s wholesaler s decision to offer quantity discounts to the
More informationRisk Measurement in Credit Portfolio Models
9 th DGVFM Scientific Day 30 April 2010 1 Risk Measurement in Credit Portfolio Models 9 th DGVFM Scientific Day 30 April 2010 9 th DGVFM Scientific Day 30 April 2010 2 Quantitative Risk Management Profit
More informationTHE OPTIMAL ASSET ALLOCATION PROBLEMFOR AN INVESTOR THROUGH UTILITY MAXIMIZATION
THE OPTIMAL ASSET ALLOCATION PROBLEMFOR AN INVESTOR THROUGH UTILITY MAXIMIZATION SILAS A. IHEDIOHA 1, BRIGHT O. OSU 2 1 Department of Mathematics, Plateau State University, Bokkos, P. M. B. 2012, Jos,
More informationMulti-armed bandit problems
Multi-armed bandit problems Stochastic Decision Theory (2WB12) Arnoud den Boer 13 March 2013 Set-up 13 and 14 March: Lectures. 20 and 21 March: Paper presentations (Four groups, 45 min per group). Before
More informationSequential Sampling for Selection: The Undiscounted Case
Sequential Sampling for Selection: The Undiscounted Case Stephen E. Chick 1 Peter I. Frazier 2 1 Technology & Operations Management, INSEAD 2 Operations Research & Information Engineering, Cornell University
More informationOPTIMAL PORTFOLIO CONTROL WITH TRADING STRATEGIES OF FINITE
Proceedings of the 44th IEEE Conference on Decision and Control, and the European Control Conference 005 Seville, Spain, December 1-15, 005 WeA11.6 OPTIMAL PORTFOLIO CONTROL WITH TRADING STRATEGIES OF
More informationSequential Coalition Formation for Uncertain Environments
Sequential Coalition Formation for Uncertain Environments Hosam Hanna Computer Sciences Department GREYC - University of Caen 14032 Caen - France hanna@info.unicaen.fr Abstract In several applications,
More informationSEQUENTIAL DECISION PROBLEM WITH PARTIAL MAINTENANCE ON A PARTIALLY OBSERVABLE MARKOV PROCESS. Toru Nakai. Received February 22, 2010
Scientiae Mathematicae Japonicae Online, e-21, 283 292 283 SEQUENTIAL DECISION PROBLEM WITH PARTIAL MAINTENANCE ON A PARTIALLY OBSERVABLE MARKOV PROCESS Toru Nakai Received February 22, 21 Abstract. In
More informationOptimally Thresholded Realized Power Variations for Lévy Jump Diffusion Models
Optimally Thresholded Realized Power Variations for Lévy Jump Diffusion Models José E. Figueroa-López 1 1 Department of Statistics Purdue University University of Missouri-Kansas City Department of Mathematics
More informationMultistage risk-averse asset allocation with transaction costs
Multistage risk-averse asset allocation with transaction costs 1 Introduction Václav Kozmík 1 Abstract. This paper deals with asset allocation problems formulated as multistage stochastic programming models.
More informationDynamic and Stochastic Knapsack-Type Models for Foreclosed Housing Acquisition and Redevelopment
Proceedings of the 2012 International Conference on Industrial Engineering and Operations Management Istanbul, Turkey, July 3-6, 2012 Dynamic and Stochastic Knapsack-Type Models for Foreclosed Housing
More information6.896 Topics in Algorithmic Game Theory February 10, Lecture 3
6.896 Topics in Algorithmic Game Theory February 0, 200 Lecture 3 Lecturer: Constantinos Daskalakis Scribe: Pablo Azar, Anthony Kim In the previous lecture we saw that there always exists a Nash equilibrium
More informationAn Approximation Algorithm for Capacity Allocation over a Single Flight Leg with Fare-Locking
An Approximation Algorithm for Capacity Allocation over a Single Flight Leg with Fare-Locking Mika Sumida School of Operations Research and Information Engineering, Cornell University, Ithaca, New York
More informationChapter 3. Dynamic discrete games and auctions: an introduction
Chapter 3. Dynamic discrete games and auctions: an introduction Joan Llull Structural Micro. IDEA PhD Program I. Dynamic Discrete Games with Imperfect Information A. Motivating example: firm entry and
More informationEE266 Homework 5 Solutions
EE, Spring 15-1 Professor S. Lall EE Homework 5 Solutions 1. A refined inventory model. In this problem we consider an inventory model that is more refined than the one you ve seen in the lectures. The
More informationA Dynamic Hedging Strategy for Option Transaction Using Artificial Neural Networks
A Dynamic Hedging Strategy for Option Transaction Using Artificial Neural Networks Hyun Joon Shin and Jaepil Ryu Dept. of Management Eng. Sangmyung University {hjshin, jpru}@smu.ac.kr Abstract In order
More informationBasic Concepts and Examples in Finance
Basic Concepts and Examples in Finance Bernardo D Auria email: bernardo.dauria@uc3m.es web: www.est.uc3m.es/bdauria July 5, 2017 ICMAT / UC3M The Financial Market The Financial Market We assume there are
More informationMarkov Decision Processes II
Markov Decision Processes II Daisuke Oyama Topics in Economic Theory December 17, 2014 Review Finite state space S, finite action space A. The value of a policy σ A S : v σ = β t Q t σr σ, t=0 which satisfies
More informationSo we turn now to many-to-one matching with money, which is generally seen as a model of firms hiring workers
Econ 805 Advanced Micro Theory I Dan Quint Fall 2009 Lecture 20 November 13 2008 So far, we ve considered matching markets in settings where there is no money you can t necessarily pay someone to marry
More informationDynamic Replication of Non-Maturing Assets and Liabilities
Dynamic Replication of Non-Maturing Assets and Liabilities Michael Schürle Institute for Operations Research and Computational Finance, University of St. Gallen, Bodanstr. 6, CH-9000 St. Gallen, Switzerland
More informationNo-arbitrage theorem for multi-factor uncertain stock model with floating interest rate
Fuzzy Optim Decis Making 217 16:221 234 DOI 117/s17-16-9246-8 No-arbitrage theorem for multi-factor uncertain stock model with floating interest rate Xiaoyu Ji 1 Hua Ke 2 Published online: 17 May 216 Springer
More informationHandout 4: Deterministic Systems and the Shortest Path Problem
SEEM 3470: Dynamic Optimization and Applications 2013 14 Second Term Handout 4: Deterministic Systems and the Shortest Path Problem Instructor: Shiqian Ma January 27, 2014 Suggested Reading: Bertsekas
More informationAPPROXIMATING FREE EXERCISE BOUNDARIES FOR AMERICAN-STYLE OPTIONS USING SIMULATION AND OPTIMIZATION. Barry R. Cobb John M. Charnes
Proceedings of the 2004 Winter Simulation Conference R. G. Ingalls, M. D. Rossetti, J. S. Smith, and B. A. Peters, eds. APPROXIMATING FREE EXERCISE BOUNDARIES FOR AMERICAN-STYLE OPTIONS USING SIMULATION
More informationResearch Memo: Adding Nonfarm Employment to the Mixed-Frequency VAR Model
Research Memo: Adding Nonfarm Employment to the Mixed-Frequency VAR Model Kenneth Beauchemin Federal Reserve Bank of Minneapolis January 2015 Abstract This memo describes a revision to the mixed-frequency
More informationAlternating-offers bargaining with one-sided uncertain deadlines: an efficient algorithm
Artificial Intelligence 172 (2008) 1119 1157 www.elsevier.com/locate/artint Alternating-offers argaining with one-sided uncertain deadlines: an efficient algorithm Nicola Gatti, Francesco Di Giunta, Stefano
More informationSTOCHASTIC CALCULUS AND BLACK-SCHOLES MODEL
STOCHASTIC CALCULUS AND BLACK-SCHOLES MODEL YOUNGGEUN YOO Abstract. Ito s lemma is often used in Ito calculus to find the differentials of a stochastic process that depends on time. This paper will introduce
More informationMATH 121 GAME THEORY REVIEW
MATH 121 GAME THEORY REVIEW ERIN PEARSE Contents 1. Definitions 2 1.1. Non-cooperative Games 2 1.2. Cooperative 2-person Games 4 1.3. Cooperative n-person Games (in coalitional form) 6 2. Theorems and
More information6.231 DYNAMIC PROGRAMMING LECTURE 3 LECTURE OUTLINE
6.21 DYNAMIC PROGRAMMING LECTURE LECTURE OUTLINE Deterministic finite-state DP problems Backward shortest path algorithm Forward shortest path algorithm Shortest path examples Alternative shortest path
More informationStochastic Differential Equations in Finance and Monte Carlo Simulations
Stochastic Differential Equations in Finance and Department of Statistics and Modelling Science University of Strathclyde Glasgow, G1 1XH China 2009 Outline Stochastic Modelling in Asset Prices 1 Stochastic
More informationLog-Robust Portfolio Management
Log-Robust Portfolio Management Dr. Aurélie Thiele Lehigh University Joint work with Elcin Cetinkaya and Ban Kawas Research partially supported by the National Science Foundation Grant CMMI-0757983 Dr.
More informationKutay Cingiz, János Flesch, P. Jean-Jacques Herings, Arkadi Predtetchinski. Doing It Now, Later, or Never RM/15/022
Kutay Cingiz, János Flesch, P Jean-Jacques Herings, Arkadi Predtetchinski Doing It Now, Later, or Never RM/15/ Doing It Now, Later, or Never Kutay Cingiz János Flesch P Jean-Jacques Herings Arkadi Predtetchinski
More informationMS-E2114 Investment Science Lecture 5: Mean-variance portfolio theory
MS-E2114 Investment Science Lecture 5: Mean-variance portfolio theory A. Salo, T. Seeve Systems Analysis Laboratory Department of System Analysis and Mathematics Aalto University, School of Science Overview
More informationCSCI 1951-G Optimization Methods in Finance Part 00: Course Logistics Introduction to Finance Optimization Problems
CSCI 1951-G Optimization Methods in Finance Part 00: Course Logistics Introduction to Finance Optimization Problems January 26, 2018 1 / 24 Basic information All information is available in the syllabus
More informationMathematical Annex 5 Models with Rational Expectations
George Alogoskoufis, Dynamic Macroeconomic Theory, 2015 Mathematical Annex 5 Models with Rational Expectations In this mathematical annex we examine the properties and alternative solution methods for
More informationBudget Setting Strategies for the Company s Divisions
Budget Setting Strategies for the Company s Divisions Menachem Berg Ruud Brekelmans Anja De Waegenaere November 14, 1997 Abstract The paper deals with the issue of budget setting to the divisions of a
More informationCalibration of Interest Rates
WDS'12 Proceedings of Contributed Papers, Part I, 25 30, 2012. ISBN 978-80-7378-224-5 MATFYZPRESS Calibration of Interest Rates J. Černý Charles University, Faculty of Mathematics and Physics, Prague,
More informationA No-Arbitrage Theorem for Uncertain Stock Model
Fuzzy Optim Decis Making manuscript No (will be inserted by the editor) A No-Arbitrage Theorem for Uncertain Stock Model Kai Yao Received: date / Accepted: date Abstract Stock model is used to describe
More informationDecision Supporting Model for Highway Maintenance
Decision Supporting Model for Highway Maintenance András I. Baó * Zoltán Horváth ** * Professor of Budapest Politechni ** Adviser, Hungarian Development Ban H-1034, Budapest, 6, Doberdo str. Abstract A
More informationTWO-STAGE NEWSBOY MODEL WITH BACKORDERS AND INITIAL INVENTORY
TWO-STAGE NEWSBOY MODEL WITH BACKORDERS AND INITIAL INVENTORY Ali Cheaitou, Christian van Delft, Yves Dallery and Zied Jemai Laboratoire Génie Industriel, Ecole Centrale Paris, Grande Voie des Vignes,
More informationDynamic Programming: An overview. 1 Preliminaries: The basic principle underlying dynamic programming
Dynamic Programming: An overview These notes summarize some key properties of the Dynamic Programming principle to optimize a function or cost that depends on an interval or stages. This plays a key role
More informationVI. Continuous Probability Distributions
VI. Continuous Proaility Distriutions A. An Important Definition (reminder) Continuous Random Variale - a numerical description of the outcome of an experiment whose outcome can assume any numerical value
More informationB. Consider the problem of evaluating the one dimensional integral
Proceedings of the 2008 Winter Simulation Conference S. J. Mason, R. R. Hill, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. MONOTONICITY AND STRATIFICATION Gang Zhao Division of Systems Engineering
More informationElif Özge Özdamar T Reinforcement Learning - Theory and Applications February 14, 2006
On the convergence of Q-learning Elif Özge Özdamar elif.ozdamar@helsinki.fi T-61.6020 Reinforcement Learning - Theory and Applications February 14, 2006 the covergence of stochastic iterative algorithms
More informationImproved Taguchi Method Based Contracted Capacity Optimization for Power Consumer with Self-Owned Generating Units
Proceedings of the 6th WSEAS International Conference on Applications of Electrical Engineering, Istanul, Turey, May 7-9, 007 7 Improved Taguchi Method ased Contracted Capacity Optimization for Consumer
More informationLecture 2: Making Good Sequences of Decisions Given a Model of World. CS234: RL Emma Brunskill Winter 2018
Lecture 2: Making Good Sequences of Decisions Given a Model of World CS234: RL Emma Brunskill Winter 218 Human in the loop exoskeleton work from Steve Collins lab Class Structure Last Time: Introduction
More informationApproximations of Stochastic Programs. Scenario Tree Reduction and Construction
Approximations of Stochastic Programs. Scenario Tree Reduction and Construction W. Römisch Humboldt-University Berlin Institute of Mathematics 10099 Berlin, Germany www.mathematik.hu-berlin.de/~romisch
More informationCasino gambling problem under probability weighting
Casino gambling problem under probability weighting Sang Hu National University of Singapore Mathematical Finance Colloquium University of Southern California Jan 25, 2016 Based on joint work with Xue
More informationLecture 17: More on Markov Decision Processes. Reinforcement learning
Lecture 17: More on Markov Decision Processes. Reinforcement learning Learning a model: maximum likelihood Learning a value function directly Monte Carlo Temporal-difference (TD) learning COMP-424, Lecture
More informationOptimal rebalancing of portfolios with transaction costs assuming constant risk aversion
Optimal rebalancing of portfolios with transaction costs assuming constant risk aversion Lars Holden PhD, Managing director t: +47 22852672 Norwegian Computing Center, P. O. Box 114 Blindern, NO 0314 Oslo,
More informationA Decentralized Learning Equilibrium
Paper to be presented at the DRUID Society Conference 2014, CBS, Copenhagen, June 16-18 A Decentralized Learning Equilibrium Andreas Blume University of Arizona Economics ablume@email.arizona.edu April
More informationStochastic Optimal Control
Stochastic Optimal Control Lecturer: Eilyan Bitar, Cornell ECE Scribe: Kevin Kircher, Cornell MAE These notes summarize some of the material from ECE 5555 (Stochastic Systems) at Cornell in the fall of
More informationKreps & Scheinkman with product differentiation: an expository note
Kreps & Scheinkman with product differentiation: an expository note Stephen Martin Department of Economics Purdue University West Lafayette, IN 47906 smartin@purdueedu April 2000; revised Decemer 200;
More informationSOLVING ROBUST SUPPLY CHAIN PROBLEMS
SOLVING ROBUST SUPPLY CHAIN PROBLEMS Daniel Bienstock Nuri Sercan Özbay Columbia University, New York November 13, 2005 Project with Lucent Technologies Optimize the inventory buffer levels in a complicated
More informationBarrier Options Pricing in Uncertain Financial Market
Barrier Options Pricing in Uncertain Financial Market Jianqiang Xu, Jin Peng Institute of Uncertain Systems, Huanggang Normal University, Hubei 438, China College of Mathematics and Science, Shanghai Normal
More informationEssays on Some Combinatorial Optimization Problems with Interval Data
Essays on Some Combinatorial Optimization Problems with Interval Data a thesis submitted to the department of industrial engineering and the institute of engineering and sciences of bilkent university
More informationGame Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012
Game Theory Lecture Notes By Y. Narahari Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012 Chapter 6: Mixed Strategies and Mixed Strategy Nash Equilibrium
More informationRobust Optimization Applied to a Currency Portfolio
Robust Optimization Applied to a Currency Portfolio R. Fonseca, S. Zymler, W. Wiesemann, B. Rustem Workshop on Numerical Methods and Optimization in Finance June, 2009 OUTLINE Introduction Motivation &
More informationThe Optimization Process: An example of portfolio optimization
ISyE 6669: Deterministic Optimization The Optimization Process: An example of portfolio optimization Shabbir Ahmed Fall 2002 1 Introduction Optimization can be roughly defined as a quantitative approach
More informationSample Path Large Deviations and Optimal Importance Sampling for Stochastic Volatility Models
Sample Path Large Deviations and Optimal Importance Sampling for Stochastic Volatility Models Scott Robertson Carnegie Mellon University scottrob@andrew.cmu.edu http://www.math.cmu.edu/users/scottrob June
More informationSolving real-life portfolio problem using stochastic programming and Monte-Carlo techniques
Solving real-life portfolio problem using stochastic programming and Monte-Carlo techniques 1 Introduction Martin Branda 1 Abstract. We deal with real-life portfolio problem with Value at Risk, transaction
More informationLecture 3: Review of mathematical finance and derivative pricing models
Lecture 3: Review of mathematical finance and derivative pricing models Xiaoguang Wang STAT 598W January 21th, 2014 (STAT 598W) Lecture 3 1 / 51 Outline 1 Some model independent definitions and principals
More informationPh.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program August 2017
Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program August 2017 The time limit for this exam is four hours. The exam has four sections. Each section includes two questions.
More informationMarkov Decision Processes (MDPs) CS 486/686 Introduction to AI University of Waterloo
Markov Decision Processes (MDPs) CS 486/686 Introduction to AI University of Waterloo Outline Sequential Decision Processes Markov chains Highlight Markov property Discounted rewards Value iteration Markov
More informationChapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.
Chapter 14 : Statistical Inference 1 Chapter 14 : Introduction to Statistical Inference Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Data x
More informationOptimal Allocation of Policy Limits and Deductibles
Optimal Allocation of Policy Limits and Deductibles Ka Chun Cheung Email: kccheung@math.ucalgary.ca Tel: +1-403-2108697 Fax: +1-403-2825150 Department of Mathematics and Statistics, University of Calgary,
More informationDynamic Asset and Liability Management Models for Pension Systems
Dynamic Asset and Liability Management Models for Pension Systems The Comparison between Multi-period Stochastic Programming Model and Stochastic Control Model Muneki Kawaguchi and Norio Hibiki June 1,
More informationSocial Common Capital and Sustainable Development. H. Uzawa. Social Common Capital Research, Tokyo, Japan. (IPD Climate Change Manchester Meeting)
Social Common Capital and Sustainable Development H. Uzawa Social Common Capital Research, Tokyo, Japan (IPD Climate Change Manchester Meeting) In this paper, we prove in terms of the prototype model of
More informationOn Existence of Equilibria. Bayesian Allocation-Mechanisms
On Existence of Equilibria in Bayesian Allocation Mechanisms Northwestern University April 23, 2014 Bayesian Allocation Mechanisms In allocation mechanisms, agents choose messages. The messages determine
More informationAsset allocation under regime-switching models
Title Asset allocation under regime-switching models Authors Song, N; Ching, WK; Zhu, D; Siu, TK Citation The 5th International Conference on Business Intelligence and Financial Engineering BIFE 212, Lanzhou,
More informationStock Repurchase with an Adaptive Reservation Price: A Study of the Greedy Policy
Stock Repurchase with an Adaptive Reservation Price: A Study of the Greedy Policy Ye Lu Asuman Ozdaglar David Simchi-Levi November 8, 200 Abstract. We consider the problem of stock repurchase over a finite
More informationHandout 8: Introduction to Stochastic Dynamic Programming. 2 Examples of Stochastic Dynamic Programming Problems
SEEM 3470: Dynamic Optimization and Applications 2013 14 Second Term Handout 8: Introduction to Stochastic Dynamic Programming Instructor: Shiqian Ma March 10, 2014 Suggested Reading: Chapter 1 of Bertsekas,
More informationMath 416/516: Stochastic Simulation
Math 416/516: Stochastic Simulation Haijun Li lih@math.wsu.edu Department of Mathematics Washington State University Week 13 Haijun Li Math 416/516: Stochastic Simulation Week 13 1 / 28 Outline 1 Simulation
More information16 MAKING SIMPLE DECISIONS
247 16 MAKING SIMPLE DECISIONS Let us associate each state S with a numeric utility U(S), which expresses the desirability of the state A nondeterministic action A will have possible outcome states Result
More informationAustralian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model
AENSI Journals Australian Journal of Basic and Applied Sciences Journal home page: wwwajbaswebcom Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model Khawla Mustafa Sadiq University
More informationLecture outline W.B.Powell 1
Lecture outline What is a policy? Policy function approximations (PFAs) Cost function approximations (CFAs) alue function approximations (FAs) Lookahead policies Finding good policies Optimizing continuous
More informationChapter 7: Estimation Sections
1 / 40 Chapter 7: Estimation Sections 7.1 Statistical Inference Bayesian Methods: Chapter 7 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods:
More informationarxiv: v1 [math.pr] 6 Apr 2015
Analysis of the Optimal Resource Allocation for a Tandem Queueing System arxiv:1504.01248v1 [math.pr] 6 Apr 2015 Liu Zaiming, Chen Gang, Wu Jinbiao School of Mathematics and Statistics, Central South University,
More informationAssortment Optimization Over Time
Assortment Optimization Over Time James M. Davis Huseyin Topaloglu David P. Williamson Abstract In this note, we introduce the problem of assortment optimization over time. In this problem, we have a sequence
More informationApproximate Revenue Maximization with Multiple Items
Approximate Revenue Maximization with Multiple Items Nir Shabbat - 05305311 December 5, 2012 Introduction The paper I read is called Approximate Revenue Maximization with Multiple Items by Sergiu Hart
More informationA New Multivariate Kurtosis and Its Asymptotic Distribution
A ew Multivariate Kurtosis and Its Asymptotic Distribution Chiaki Miyagawa 1 and Takashi Seo 1 Department of Mathematical Information Science, Graduate School of Science, Tokyo University of Science, Tokyo,
More information