ADAPTIVE SIMULATION BUDGET ALLOCATION FOR DETERMINING THE BEST DESIGN. Qi Fan Jiaqiao Hu

Size: px
Start display at page:

Download "ADAPTIVE SIMULATION BUDGET ALLOCATION FOR DETERMINING THE BEST DESIGN. Qi Fan Jiaqiao Hu"

Transcription

1 Proceedings of the 013 Winter Simulation Conference R. Pasupathy, S.-H. Kim, A. Tol, R. Hill, and M. E. Kuhl, eds. ADAPTIVE SIMULATIO BUDGET ALLOCATIO FOR DETERMIIG THE BEST DESIG Qi Fan Jiaqiao Hu Department of Applied Mathematics and Statistics State University of ew Yor Stony Broo, Y 11794, USA ABSTRACT We consider the prolem of allocating a given simulation udget among a set of design alternatives in order to maximize the proaility of correct selection. Prior wor has focused on deriving static rules that predetermine the numer of simulation replications to e allocated to each design. In contrast, we formulate the prolem as a Marov decision process MDP and propose a dynamic myopic scheme to adaptively allocate simulation samples ased on current estimates of the means and variances of the design alternatives. We provide numerical examples to illustrate the performance of the proposed dynamic allocation rule. 1 ITRODUCTIO We consider the prolem of identifying the est design from a finite set of design alternatives. Each design is assumed to involve random uncertainty and requires stochastic simulation for performance estimation. When simulation is expensive and the numer of design alternatives is relatively small, a well-nown class of procedures for solving such prolems is raning and selection, where the goal is to determine the numer of simulation runs to e allocated to each design in order to guarantee a pre-specified correct selection proaility. Examples of raning and selection methods include Rinott s two-stage indifference zone procedure Rinott 1978, the expected value of information procedure Chic and Inoue 001, and the K family of algorithms Kim and elson 006; some reviews and advances in this field can e found in e.g., Goldsman and elson 1998, elson et al. 001, and Kim and elson 007. Chen 1995 approached the prolem from a different perspective y determining the est allocation of a given simulation udget among the designs in order to maximize the proaility of correct selection PCS. In particular, Chen 1996 proposed to use a Bayesian approach to estimate the design performance measures ased on prior sampling information and derived a lower ound for the correct selection proaility. The idea was susequently used in Chen, Chen, and Yücesan 000 and Chen et al. 000 to develop analytical allocation rules called Optimal Computing Budget Allocation OCBA that asymptotically optimize the lower ounds of the proaility of correct selection. More recently, Chen et al. 006 also investigated a dynamic allocation rule ased on perfect information assumption, and suggested that a dynamic scheme could dramatically improve the performance of static allocation rules. In this paper, we consider the setting of OCBA, i.e., maximizing PCS under a simulation udget constraint. However, unlie previous wor, which has primarily focused on static rules that predetermine the numer of simulation replications to e allocated to each design, we investigate a dynamic programming approach that adaptively allocates simulation samples ased on current estimates of the means and variances of various designs. Our wor can e viewed as an extension of that of Chen et al. 006 ased on perfect information. In particular, we model the simulation allocation process as a Marov decision process MDP with a terminal cost function. The state variale of the MDP model consists of the current sample mean of each design and the numer of simulation replications allocated to each of them y assuming that /13/$ IEEE 888

2 design variances are nown. These variances can then e estimated y sample variances. Since analytically solving the MDP model is intractale, we further develop an upper ound to the optimal value function and propose a one-step looahead index policy to myopically minimize the sum of the current one-stage cost and the upper ound of the optimal value function. Our preliminary numerical results indicate competitive performance of our approach with that of OCBA, especially when the simulation udget is small. The rest of the paper is organized as follows: In Section, we define notations and descrie the prolem setting. In Section 3, we formulate the prolem as an MDP, provide an upper ound to the optimal value function, and derive a myopic index policy for simulation allocation. umerical examples are provided in Section 4 to illustrate the performance of our approach. Finally, we conclude the paper in Section 5. PROBLEM SETTIG Consider the following optimization prolem: min J i min EL iξ ] i Θ i Θ where Θ = {1,,...,} is a finite set of design indices and J i is the true performance measure of design i. ote that J i itself is the expectation of the sample performance L i ξ, where the expectation is understood with respect to the distriution of the random variale ξ representing the stochastic uncertainty of the design. We assume that the expectation cannot e evaluated exactly; however, for a given simulation udget t, the performance measure J i can e estimated y the sample mean J t i 1 t i i t L i ξ i j, j=1 where t i represents the numer of replication runs allocated to design i and ξ i j represents the jth realization of ξ simulation sample path from design i. Throughout this paper, we assume that the simulation outputs are independent of each other. We egin y defining some notations. : the true variance of design i, i.e., σ i = VarL i ξ, which can e estimated y its sample variance. t : the index of the design that shows the current est sample performance after t simulation replications have een allocated, i.e., t t min i i t. s t : the index of the design that shows the second est sample performance after t simulation runs have een allocated, i.e., s t t min i t i t. δ t t,i = t t : the difference etween the sample performance of the current est and the ith designs. σi σ t t,i = σ t t t J t i + σ i : the standard deviation of the random variale δ t i t t,i. Define the event of correct selection CS as the event that design t i.e., the one with the current est sample performance is actually the est design. For a given simulation udget, the goal is to find a way to maximize the proaility of correct selection P{CS}. We follow the Bayesian approach introduced in Chen 1996 and assume that the output performance measure L i ξ is normally distriuted for each design. Let J i e a random variale whose distriution is the posterior distriution of design i given the previous sampling information: P{ J i } = P{J i L i ξ i j, j = 1,,, t i }, for i = 1,,,. It can e shown that if no priori nowledge is given on the performance of each design, J i has the normal distriution J i i t, σ i. Thus, a lower ound to P{CS} called approximate proaility t i 889

3 of correct selection APCS can e otained y applying Bonferroni s inequality. P{CS} = P{J t < J i, i t L i ξ i j, j = 1,,,i t,i = 1,,,} = P{ J t < J i, i t }, = P{ J t J i < 0} i t 1 P{ J t J i > 0} i t = 1 i t = APCS, δ t t,i σ t t,i where throughout this paper, we use and φ to denote the c.d.f. and p.d.f. of the standard normal distriution. Since P{CS} is difficult to evaluate, whereas APCS can e computed analytically without resorting to additional simulation effort, Chen et al. 000 proposed to use APCS as an approximation to the true proaility of correct selection. For a given udget T, the goal is to find a simulation udget allocation that solves the following optimization prolem. min 1 T,T,,T s.t. δ T T,i σ T T,i T i T and T i A DYAMIC BUDGET ALLOCATIO PROCEDURE Motivated y the wor of Hu et al. 011 and that of Chen et al. 006, we aim to solve the allocation prolem y modeling the allocation process as an MDP model with a terminal cost function. So instead of allocating all T simulation replications at the eginning as in 1, we derive a dynamic policy that sequentially allocates the udget ased on the estimated performance measure of all designs as well as the current sampling information. 3.1 Modeling the Allocation Process as an MDP Given a total of T simulation samples, we start y assigning at step t = 0 a small numer n 0 simulation replications to each of the designs. Define the state variale w t = i t,t i,i = 1,,,T as a vector containing the current performance estimates of all designs and the numer of times each design has een sampled thus far, where i 0 = n 0 and i 0 = 1 n 0 n 0 j=1 L i ξ i j for all i. ext consider a sequential allocation policy π that determines, at each step t = 1,,,T n 0 ased on w t, whether one more replication run should e allocated to one of the designs or the entire allocation process should e terminated. Let π t w t {0,1,...,} e the action taen at step t, then { i allocate one simulate run to design i, i = 1,,, π t w t = 0 stop the allocation process. 890

4 For every allocation policy π descried aove, it can e seen that {w t } is a Marov chain with the following state transition dynamics: i t+1 = t i i t +Y ii {a=i} t i + I {a=i} for i = 1,,, 3 t+1 i = t i + I {a=i} for i = 1,,,, 4 where I is the indicator function, a is the action taen at step t under π, and Y i is the simulation output performance measure of design i after the additional allocation. Based on 3 and 4, the updating formula for the current est sample mean is given y: t+1 t+1 = where Z + = max{0,z}. Let w = i, i,i = 1,..., T σ + σ i t t +Y t + s t t J s t t t t t t +I if a = t {a=i} t t J t t t aj a+y t a I + {a=i} if a a+i t {a=i} t, e a given state and define = argmin i J i, δ,i = J J i, and σ,i = i. By associating the state action pair w,a with the following one-stage cost function: { 0 if a 0 R t w,a = δ,i σ,i if a = 0 and R s w,a = 0 for all s t + 1 whenever π t w = 0, we otain a T n horizon MDP with the total cost V π T n 0 w = E R t w t,π t w t ] w0 = w, t=0 where the expectation is taen respect to the proaility measure induced y π. For a given initial state w 0 = w, the ojective is to find an optimal simulation allocation policy π to minimize the total cost accumulated efore the allocation process terminates. 3. A Myopic Index Policy Since otaining the exact optimal policy for the MDP model is intractale, we derive a myopic index policy using one-step looahead optimization. The following result provides an upper ound to the optimal value function. Theorem 1 Let V t e the optimal cost-to-go function at stage t of the MDP defined in Section 3.1. For every t = 0,1,,,T n 0 and w = i, i,i = 1,..., T, we have δ,i V t w. 6 Proof. At the final stage t = T n 0, all T replications have een exhausted, so the only option is to stop the allocation process. Therefore, we must have πt n 0 w = 0 and V T n0 w = δ,i. It follows that when t = T n 0 1, σ,i σ,i V T n0 1w = min a E a R T n0 1w,a +V T n0 w ] 5 891

5 where a {0,1,,,} and w = J i, i,i = 1,,,T is the next state generated according to the transition dynamics 3 and 4 when action a is taen, in particular, J i = ij i +Y i I {a=i} i +I and {a=i} i = i + I {a=i}. Therefore, V T n0 1w = mine a R T n0 a 1w,a +V T n0 w ] δ,i = min{,min E a a 0 δ,i σ,i, σ,i σ + σ i i δ,i σ,i ] } where = argmin i J i, δ,i = J i, and σ,i =. ow proceed y induction and assume that V t+1 w δ,i σ,i for all w. Then V t w = mine a R t w,a +V t+1 w ] a δ,i min{,min σ E a,i a 0 δ,i. This completes the proof of the theorem. σ,i δ,i Motivated y Theorem 1, we propose a simple stationary greedy policy that minimizes the sum of the current one-stage cost function and the upper ound of the optimal cost-to-go function at each step: 0 if πw = δ,i σ,i min a 0 E a δ ],i σ,i argmin a 0 E a δ ] 7,i σ otherwise.,i By connecting π to 1, it is not difficult to see that if one more simulation sample is needed, such a policy myopically allocates the next sample in such a way so that the APCS in the next step is maximized after the additional allocation. Since π is an index policy, we can create an index for each action a ased on the current state w. Denote y indexa as the index of action a and let w = J i, i,i = 1,...,T e the sampled next state when action a is taen, we have three different cases ased on the transition dynamics 3, 4, and 5: case 1 If a = 0, case If a =, index = E l=1,l index0 = J s s σ J +Y + σ l J J i + σ J l + σ i i σ,i J s + ]}, 8 J s J +Y σ + σ + J +Y ], 9 89

6 case 3 If a and a 0, indexa = E a l=1,l a J a σ J a +Y a + σ l Fan and Hu + J l J + J a J a +Y a σ + σ a a + a J a +Y a ], 10 where = argmin i J i and σ is the variance of design. Let B s = { J +Y J s }. The + operator can e removed from 9 y conditioning on event B s : index = E l + E l +Y σ J l + σ l J s J l σs s + σ l + 1 ] B s PB s + J s J +Y σ s s + σ B ] c s PB c s. 11 Similarly, y conditioning on B = { J +Y }, the index in case 3 can e otained as indexa = E a l a + E a l a a a +Y a σa + σ l J J l σ + σ l J l + 1 ] B PB + J a J a +Y a σ + σ a B ] c PB c A Dynamic Budget Allocation Algorithm ote that when a = 0 the index in case 1 can e calculated analytically, whereas calculating the performance indices in 11 and 1 require evaluating the expectations with respect to the design distriutions. One natural approach to evaluate/estimate these expectations is to use Taylor expansion. Taing 11 as an example, we can treat each respective term as a function of the sample mean J +Y and perform a first order Taylor expansion of the term around. In addition, y replacing the true mean of the current est design with its sample mean, PB s can e approximated y +1 J s σ. Thus when a =, we can approximate the index of action y the following analytical formula: index = l J φ l σ l φ σ s J J l σ + σ l J J s s + σ J J s σs s + σ + σ l ] σ σ + 1 J s J + l l σs + 1 σ + 1 J s + σ l s + σ l σ s s + σ σ ] + 1 J s φ σ J + 1 J s J s φ σ J σ J

7 Similarly, when a and a 0, indexa can e approximated y ] indexa 1 = + a J l a + 1 a l a σa + σ l l + φ l a φ a J l σa + σ l a σ + a σ + σ a σa a + 1 J J + l σ l a + σ l + σ l a + 1 σ + σ a ] a + 1 φ J a a a + 1 φ J a J a. 14 Finally, y replacing true variances with sample variances, we propose the following algorithm for simulation udget allocation. Dynamic Simulation Budget Allocation DSBA Step 0: Perform n 0 simulation replications for all designs. Calculate the sample mean and sample variance for each design. Step 1: For each action a {0,...,}, compute the index of action a according to 8, 13, and 14. Step : Select the action a with the smallest index value. If a = 0, then stop the allocation process; else if a = i, perform one simulation replication for design i, update the sample mean and sample variance of design i. Increase the numer of simulation replications to design i y 1 and go ac to Step 1 until the given udget is exhausted. 4 UMERICAL RESULTS In this section, we test the proposed DSBA algorithm and compare its performance with that of OCBA on some simple examples. OCBA was derived ased on analytically solving the static optimization prolem 1. It has een shown in Chen et al. 000 that the asymptotically optimal solution to the prolem as T satisfies the following conditions. 1 i σi /δ j =,i σ j /δ, j = σ,i i σi, where i is the numer of samples allocated to design i, δ,i = J i, min i J i, and σ i is the standard deviation of the performance measure for design i, which can e estimated y sample variance. OCBA sequentially allocates a given udget T y splitting it into atches of size. Then at each step, the current computing udget is increased y and a udget allocation is calculated using conditions 1 and ased on the updated computing udget. This allocation is then used to determine the numer of additional simulation runs need to e allocated to each design. The process continues until all udget has een consumed. We consider the following examples in our computational experiment. Example 1 : This is a special example where the est design has a zero variance and the rest two designs have the same performance: X 1 j 0,0,X j 0.4,3,X 3 j 0.4,3 894

8 Example : There are five design alternatives with the est design eing deterministic: X 1 j 0,0,X j 0.4,1.5,X 3 j 0.4,3,X 4 j 1,3,X 5 j,3 Example 3 : This example is an extension of the previous one with the deterministic design removed: X 1 j 0,1.5,X j 0.6,3,X 3 j 1,3,X 4 j,3 Example 4 : This is an example with three alternatives, all of which are random: X 1 j 1,1,X j 1.5,3,X 3 j 1.5,3 In our experiment, the initial numer of replications n 0 is set to 10 for oth DSBA and OCBA. Figure 1 shows the performance of oth algorithms for each of the four respective test cases, where the true P{CS} in each case is estimated y the proportional of times the est design is found y an algorithm out of 10,000 independent experiments. The figure indicates competitive performance of DSBA with that of OCBA in all test cases. In particular, DSBA outperforms OCBA when the simulation udget is small, whereas OCBA shows slightly etter performance when the udget is increased, especially in the last case. Our conjecture is that this is due to the asymptotic optimality of OCBA, whereas DSBA is myopic in nature. 5 COCLUSIO In this paper, we have introduced a dynamic simulation udget allocation procedure for determining the est design from a set of finite design alternatives. The idea is to use a myopic one-step looahead policy to approximately solve an underlying MDP characterizing the udget allocation process. Such a policy gives rise to a stationary index rule that adaptively determines at each step which design should e simulated next in order to myopically maximize the approximate proaility of correction after the additional allocation. Our preliminary numerical results indicate that our approach may provide competitive performance with that of OCBA, especially when the computing udget is small. ACKOWLEDGMETS This wor was supported in part y the Air Force Office of Scientific Research under Grant FA and y the ational Science Foundation under Grant CMMI REFERECES Chen, C.-H An Effective Approach to Smartly Allocate Computing Budget for Discrete Event Simulation. In Prodeedings of the 34th IEEE Conference on Decision and Control, Piscataway, J: IEEE. Chen, C.-H A Low Bound for the Correct Suset-Selection Proaility and Its Application to Discrete-Event System Simulation. IEEE Transaction on Automatic Control 41: Chen, C.-H., D. He, and M. Fu Efficient Dynamic Simulation Allocation in Ordinal Optimization. IEEE Transaction on Automatic Control 51: Chen, C.-H., J. Lin, E. Yücesan, and S. E. Chic Simulation Budget Allocation for Further Enhancing the Efficiency of Ordinal Optimization. Discrete Event Dynamic System: Theory and Applications 10: Chen, H.-C., C.-H. Chen, and E. Yücesan Computing Efforts Allocation for Ordinal Optimization and Discrete Event Simulation. IEEE Transaction on Automatic Control 45: Chic, S. E., and K. Inoue ew Two-Stage and Sequential Procedures for Selecting the Best Simulated System. Operations Research 49: Goldsman, D., and B. L. elson Comparing systems via simulation. In Handoo of simulation, edited y J. Bans, ew Yor: John Wiley. 895

9 Figure 1: Comparison of OCBA and DSBA. Hu, J., H. S. Chang, M. C. Fu, and S. I. Marcus Dynamic Sample Budget Allocation in Model-Based Optimization. Journal of Gloal Optimization 50: Kim, S.-H., and B. L. elson Selecting the Best System. In Handoos in Operations Research and Mangement Science: Simulation, edited y S. G. Henderson and B. L. elson, Chapter 17, Oxford, UK: Elsevier Science. Kim, S.-H., and B. L. elson Recent Advances in Raning and Selection. In Prodeedings of the 007 Winter Simulation Conference, edited y S. Henderson, B. Biler, M.-H. Hsieh, J. Shortle, J. Tew, and R. Barton, Piscataway, J: IEEE. elson, B. L., J. Swann, D. Goldsman, and W. Song Simple Procedures for Selecting the Best Simulated System when the umer of Alternatives Is Large. Operations Research 49: Rinott, Y On Two-Stage Selection Procedures and Related Proaility Inequalities. Communications in Statistics - Theory and Methods 7: AUTHOR BIOGRAPHIES Qi Fan is a Ph.D. student in the Department of Applied Mathematics and Statistic at the State University of ew Yor, Stony Broo. He received the B.S. degree in mathematics from Zhejiang University, China in 011. His research interests include Marov decision processes, optimization and simulation. His address is qfan@ams.sunys.edu. 896

10 JIAQIAO HU is an Associate Professor in the Department of Applied Mathematics and Statistics at the State University of ew Yor, Stony Broo. He received the B.S. degree in automation from Shanghai Jiao Tong University, the M.S. degree in applied mathematics from the University of Maryland, Baltimore County, and the Ph.D. degree in electrical engineering from the University of Maryland, College Par. His research interests include Marov decision processes, applied proaility, and simulation optimization. His address is 897

Provably Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models

Provably Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models Provaly Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models Retsef Levi Sloan School of Management, MIT, Camridge, MA, 02139, USA email: retsef@mit.edu Roin O. Roundy School of

More information

Optimal Production-Inventory Policy under Energy Buy-Back Program

Optimal Production-Inventory Policy under Energy Buy-Back Program The inth International Symposium on Operations Research and Its Applications (ISORA 10) Chengdu-Jiuzhaigou, China, August 19 23, 2010 Copyright 2010 ORSC & APORC, pp. 526 532 Optimal Production-Inventory

More information

Ranking and selection (R&S) with multiple performance measures using incomplete preference information

Ranking and selection (R&S) with multiple performance measures using incomplete preference information Ranking and selection (R&S) with multiple performance measures using incomplete Ville Mattila and Kai Virtanen (ville.a.mattila@aalto.fi) Systems Analysis Laboratory Aalto University School of Science

More information

Dynamic Portfolio Choice II

Dynamic Portfolio Choice II Dynamic Portfolio Choice II Dynamic Programming Leonid Kogan MIT, Sloan 15.450, Fall 2010 c Leonid Kogan ( MIT, Sloan ) Dynamic Portfolio Choice II 15.450, Fall 2010 1 / 35 Outline 1 Introduction to Dynamic

More information

Lecture 7: Bayesian approach to MAB - Gittins index

Lecture 7: Bayesian approach to MAB - Gittins index Advanced Topics in Machine Learning and Algorithmic Game Theory Lecture 7: Bayesian approach to MAB - Gittins index Lecturer: Yishay Mansour Scribe: Mariano Schain 7.1 Introduction In the Bayesian approach

More information

Proceedings of the 2014 Winter Simulation Conference A. Tolk, S. Y. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds.

Proceedings of the 2014 Winter Simulation Conference A. Tolk, S. Y. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds. Proceedings of the 204 Winter Simulation Conference A. Tolk, S. Y. Diallo, I. O. Ryzhov, L. Yilmaz, S. Buckley, and J. A. Miller, eds. SAMPLE ALLOCATION FOR MULTIPLE ATTRIBUTE SELECTION PROBLEMS Dennis

More information

The Optimal Choice of Monetary Instruments The Poole Model

The Optimal Choice of Monetary Instruments The Poole Model The Optimal Choice of Monetary Instruments The Poole Model Vivaldo M. Mendes ISCTE Lison University Institute 06 Novemer 2013 (Vivaldo M. Mendes) The Poole Model 06 Novemer 2013 1 / 27 Summary 1 Tools,

More information

EFFECT OF IMPLEMENTATION TIME ON REAL OPTIONS VALUATION. Mehmet Aktan

EFFECT OF IMPLEMENTATION TIME ON REAL OPTIONS VALUATION. Mehmet Aktan Proceedings of the 2002 Winter Simulation Conference E. Yücesan, C.-H. Chen, J. L. Snowdon, and J. M. Charnes, eds. EFFECT OF IMPLEMENTATION TIME ON REAL OPTIONS VALUATION Harriet Black Nembhard Leyuan

More information

Sequential Decision Making

Sequential Decision Making Sequential Decision Making Dynamic programming Christos Dimitrakakis Intelligent Autonomous Systems, IvI, University of Amsterdam, The Netherlands March 18, 2008 Introduction Some examples Dynamic programming

More information

A Simple Method for Solving Multiperiod Mean-Variance Asset-Liability Management Problem

A Simple Method for Solving Multiperiod Mean-Variance Asset-Liability Management Problem Available online at wwwsciencedirectcom Procedia Engineering 3 () 387 39 Power Electronics and Engineering Application A Simple Method for Solving Multiperiod Mean-Variance Asset-Liability Management Problem

More information

Optimal Search for Parameters in Monte Carlo Simulation for Derivative Pricing

Optimal Search for Parameters in Monte Carlo Simulation for Derivative Pricing Optimal Search for Parameters in Monte Carlo Simulation for Derivative Pricing Prof. Chuan-Ju Wang Department of Computer Science University of Taipei Joint work with Prof. Ming-Yang Kao March 28, 2014

More information

American Barrier Option Pricing Formulae for Uncertain Stock Model

American Barrier Option Pricing Formulae for Uncertain Stock Model American Barrier Option Pricing Formulae for Uncertain Stock Model Rong Gao School of Economics and Management, Heei University of Technology, Tianjin 341, China gaor14@tsinghua.org.cn Astract Uncertain

More information

Microeconomics II. CIDE, Spring 2011 List of Problems

Microeconomics II. CIDE, Spring 2011 List of Problems Microeconomics II CIDE, Spring 2011 List of Prolems 1. There are three people, Amy (A), Bart (B) and Chris (C): A and B have hats. These three people are arranged in a room so that B can see everything

More information

A GENERALIZED MARTINGALE BETTING STRATEGY

A GENERALIZED MARTINGALE BETTING STRATEGY DAVID K. NEAL AND MICHAEL D. RUSSELL Astract. A generalized martingale etting strategy is analyzed for which ets are increased y a factor of m 1 after each loss, ut return to the initial et amount after

More information

A lower bound on seller revenue in single buyer monopoly auctions

A lower bound on seller revenue in single buyer monopoly auctions A lower bound on seller revenue in single buyer monopoly auctions Omer Tamuz October 7, 213 Abstract We consider a monopoly seller who optimally auctions a single object to a single potential buyer, with

More information

Comparative Study between Linear and Graphical Methods in Solving Optimization Problems

Comparative Study between Linear and Graphical Methods in Solving Optimization Problems Comparative Study between Linear and Graphical Methods in Solving Optimization Problems Mona M Abd El-Kareem Abstract The main target of this paper is to establish a comparative study between the performance

More information

UPDATE ON ECONOMIC APPROACH TO SIMULATION SELECTION PROBLEMS

UPDATE ON ECONOMIC APPROACH TO SIMULATION SELECTION PROBLEMS Proceedings of the 2008 Winter Simulation Conference S. J. Mason, R. R. Hill, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. UPDATE ON ECONOMIC APPROACH TO SIMULATION SELECTION PROBLEMS Stephen E.

More information

Journal of Computational and Applied Mathematics. The mean-absolute deviation portfolio selection problem with interval-valued returns

Journal of Computational and Applied Mathematics. The mean-absolute deviation portfolio selection problem with interval-valued returns Journal of Computational and Applied Mathematics 235 (2011) 4149 4157 Contents lists available at ScienceDirect Journal of Computational and Applied Mathematics journal homepage: www.elsevier.com/locate/cam

More information

Forecast Horizons for Production Planning with Stochastic Demand

Forecast Horizons for Production Planning with Stochastic Demand Forecast Horizons for Production Planning with Stochastic Demand Alfredo Garcia and Robert L. Smith Department of Industrial and Operations Engineering Universityof Michigan, Ann Arbor MI 48109 December

More information

Optimal Dam Management

Optimal Dam Management Optimal Dam Management Michel De Lara et Vincent Leclère July 3, 2012 Contents 1 Problem statement 1 1.1 Dam dynamics.................................. 2 1.2 Intertemporal payoff criterion..........................

More information

THis paper presents a model for determining optimal allunit

THis paper presents a model for determining optimal allunit A Wholesaler s Optimal Ordering and Quantity Discount Policies for Deteriorating Items Hidefumi Kawakatsu Astract This study analyses the seller s wholesaler s decision to offer quantity discounts to the

More information

Risk Measurement in Credit Portfolio Models

Risk Measurement in Credit Portfolio Models 9 th DGVFM Scientific Day 30 April 2010 1 Risk Measurement in Credit Portfolio Models 9 th DGVFM Scientific Day 30 April 2010 9 th DGVFM Scientific Day 30 April 2010 2 Quantitative Risk Management Profit

More information

THE OPTIMAL ASSET ALLOCATION PROBLEMFOR AN INVESTOR THROUGH UTILITY MAXIMIZATION

THE OPTIMAL ASSET ALLOCATION PROBLEMFOR AN INVESTOR THROUGH UTILITY MAXIMIZATION THE OPTIMAL ASSET ALLOCATION PROBLEMFOR AN INVESTOR THROUGH UTILITY MAXIMIZATION SILAS A. IHEDIOHA 1, BRIGHT O. OSU 2 1 Department of Mathematics, Plateau State University, Bokkos, P. M. B. 2012, Jos,

More information

Multi-armed bandit problems

Multi-armed bandit problems Multi-armed bandit problems Stochastic Decision Theory (2WB12) Arnoud den Boer 13 March 2013 Set-up 13 and 14 March: Lectures. 20 and 21 March: Paper presentations (Four groups, 45 min per group). Before

More information

Sequential Sampling for Selection: The Undiscounted Case

Sequential Sampling for Selection: The Undiscounted Case Sequential Sampling for Selection: The Undiscounted Case Stephen E. Chick 1 Peter I. Frazier 2 1 Technology & Operations Management, INSEAD 2 Operations Research & Information Engineering, Cornell University

More information

OPTIMAL PORTFOLIO CONTROL WITH TRADING STRATEGIES OF FINITE

OPTIMAL PORTFOLIO CONTROL WITH TRADING STRATEGIES OF FINITE Proceedings of the 44th IEEE Conference on Decision and Control, and the European Control Conference 005 Seville, Spain, December 1-15, 005 WeA11.6 OPTIMAL PORTFOLIO CONTROL WITH TRADING STRATEGIES OF

More information

Sequential Coalition Formation for Uncertain Environments

Sequential Coalition Formation for Uncertain Environments Sequential Coalition Formation for Uncertain Environments Hosam Hanna Computer Sciences Department GREYC - University of Caen 14032 Caen - France hanna@info.unicaen.fr Abstract In several applications,

More information

SEQUENTIAL DECISION PROBLEM WITH PARTIAL MAINTENANCE ON A PARTIALLY OBSERVABLE MARKOV PROCESS. Toru Nakai. Received February 22, 2010

SEQUENTIAL DECISION PROBLEM WITH PARTIAL MAINTENANCE ON A PARTIALLY OBSERVABLE MARKOV PROCESS. Toru Nakai. Received February 22, 2010 Scientiae Mathematicae Japonicae Online, e-21, 283 292 283 SEQUENTIAL DECISION PROBLEM WITH PARTIAL MAINTENANCE ON A PARTIALLY OBSERVABLE MARKOV PROCESS Toru Nakai Received February 22, 21 Abstract. In

More information

Optimally Thresholded Realized Power Variations for Lévy Jump Diffusion Models

Optimally Thresholded Realized Power Variations for Lévy Jump Diffusion Models Optimally Thresholded Realized Power Variations for Lévy Jump Diffusion Models José E. Figueroa-López 1 1 Department of Statistics Purdue University University of Missouri-Kansas City Department of Mathematics

More information

Multistage risk-averse asset allocation with transaction costs

Multistage risk-averse asset allocation with transaction costs Multistage risk-averse asset allocation with transaction costs 1 Introduction Václav Kozmík 1 Abstract. This paper deals with asset allocation problems formulated as multistage stochastic programming models.

More information

Dynamic and Stochastic Knapsack-Type Models for Foreclosed Housing Acquisition and Redevelopment

Dynamic and Stochastic Knapsack-Type Models for Foreclosed Housing Acquisition and Redevelopment Proceedings of the 2012 International Conference on Industrial Engineering and Operations Management Istanbul, Turkey, July 3-6, 2012 Dynamic and Stochastic Knapsack-Type Models for Foreclosed Housing

More information

6.896 Topics in Algorithmic Game Theory February 10, Lecture 3

6.896 Topics in Algorithmic Game Theory February 10, Lecture 3 6.896 Topics in Algorithmic Game Theory February 0, 200 Lecture 3 Lecturer: Constantinos Daskalakis Scribe: Pablo Azar, Anthony Kim In the previous lecture we saw that there always exists a Nash equilibrium

More information

An Approximation Algorithm for Capacity Allocation over a Single Flight Leg with Fare-Locking

An Approximation Algorithm for Capacity Allocation over a Single Flight Leg with Fare-Locking An Approximation Algorithm for Capacity Allocation over a Single Flight Leg with Fare-Locking Mika Sumida School of Operations Research and Information Engineering, Cornell University, Ithaca, New York

More information

Chapter 3. Dynamic discrete games and auctions: an introduction

Chapter 3. Dynamic discrete games and auctions: an introduction Chapter 3. Dynamic discrete games and auctions: an introduction Joan Llull Structural Micro. IDEA PhD Program I. Dynamic Discrete Games with Imperfect Information A. Motivating example: firm entry and

More information

EE266 Homework 5 Solutions

EE266 Homework 5 Solutions EE, Spring 15-1 Professor S. Lall EE Homework 5 Solutions 1. A refined inventory model. In this problem we consider an inventory model that is more refined than the one you ve seen in the lectures. The

More information

A Dynamic Hedging Strategy for Option Transaction Using Artificial Neural Networks

A Dynamic Hedging Strategy for Option Transaction Using Artificial Neural Networks A Dynamic Hedging Strategy for Option Transaction Using Artificial Neural Networks Hyun Joon Shin and Jaepil Ryu Dept. of Management Eng. Sangmyung University {hjshin, jpru}@smu.ac.kr Abstract In order

More information

Basic Concepts and Examples in Finance

Basic Concepts and Examples in Finance Basic Concepts and Examples in Finance Bernardo D Auria email: bernardo.dauria@uc3m.es web: www.est.uc3m.es/bdauria July 5, 2017 ICMAT / UC3M The Financial Market The Financial Market We assume there are

More information

Markov Decision Processes II

Markov Decision Processes II Markov Decision Processes II Daisuke Oyama Topics in Economic Theory December 17, 2014 Review Finite state space S, finite action space A. The value of a policy σ A S : v σ = β t Q t σr σ, t=0 which satisfies

More information

So we turn now to many-to-one matching with money, which is generally seen as a model of firms hiring workers

So we turn now to many-to-one matching with money, which is generally seen as a model of firms hiring workers Econ 805 Advanced Micro Theory I Dan Quint Fall 2009 Lecture 20 November 13 2008 So far, we ve considered matching markets in settings where there is no money you can t necessarily pay someone to marry

More information

Dynamic Replication of Non-Maturing Assets and Liabilities

Dynamic Replication of Non-Maturing Assets and Liabilities Dynamic Replication of Non-Maturing Assets and Liabilities Michael Schürle Institute for Operations Research and Computational Finance, University of St. Gallen, Bodanstr. 6, CH-9000 St. Gallen, Switzerland

More information

No-arbitrage theorem for multi-factor uncertain stock model with floating interest rate

No-arbitrage theorem for multi-factor uncertain stock model with floating interest rate Fuzzy Optim Decis Making 217 16:221 234 DOI 117/s17-16-9246-8 No-arbitrage theorem for multi-factor uncertain stock model with floating interest rate Xiaoyu Ji 1 Hua Ke 2 Published online: 17 May 216 Springer

More information

Handout 4: Deterministic Systems and the Shortest Path Problem

Handout 4: Deterministic Systems and the Shortest Path Problem SEEM 3470: Dynamic Optimization and Applications 2013 14 Second Term Handout 4: Deterministic Systems and the Shortest Path Problem Instructor: Shiqian Ma January 27, 2014 Suggested Reading: Bertsekas

More information

APPROXIMATING FREE EXERCISE BOUNDARIES FOR AMERICAN-STYLE OPTIONS USING SIMULATION AND OPTIMIZATION. Barry R. Cobb John M. Charnes

APPROXIMATING FREE EXERCISE BOUNDARIES FOR AMERICAN-STYLE OPTIONS USING SIMULATION AND OPTIMIZATION. Barry R. Cobb John M. Charnes Proceedings of the 2004 Winter Simulation Conference R. G. Ingalls, M. D. Rossetti, J. S. Smith, and B. A. Peters, eds. APPROXIMATING FREE EXERCISE BOUNDARIES FOR AMERICAN-STYLE OPTIONS USING SIMULATION

More information

Research Memo: Adding Nonfarm Employment to the Mixed-Frequency VAR Model

Research Memo: Adding Nonfarm Employment to the Mixed-Frequency VAR Model Research Memo: Adding Nonfarm Employment to the Mixed-Frequency VAR Model Kenneth Beauchemin Federal Reserve Bank of Minneapolis January 2015 Abstract This memo describes a revision to the mixed-frequency

More information

Alternating-offers bargaining with one-sided uncertain deadlines: an efficient algorithm

Alternating-offers bargaining with one-sided uncertain deadlines: an efficient algorithm Artificial Intelligence 172 (2008) 1119 1157 www.elsevier.com/locate/artint Alternating-offers argaining with one-sided uncertain deadlines: an efficient algorithm Nicola Gatti, Francesco Di Giunta, Stefano

More information

STOCHASTIC CALCULUS AND BLACK-SCHOLES MODEL

STOCHASTIC CALCULUS AND BLACK-SCHOLES MODEL STOCHASTIC CALCULUS AND BLACK-SCHOLES MODEL YOUNGGEUN YOO Abstract. Ito s lemma is often used in Ito calculus to find the differentials of a stochastic process that depends on time. This paper will introduce

More information

MATH 121 GAME THEORY REVIEW

MATH 121 GAME THEORY REVIEW MATH 121 GAME THEORY REVIEW ERIN PEARSE Contents 1. Definitions 2 1.1. Non-cooperative Games 2 1.2. Cooperative 2-person Games 4 1.3. Cooperative n-person Games (in coalitional form) 6 2. Theorems and

More information

6.231 DYNAMIC PROGRAMMING LECTURE 3 LECTURE OUTLINE

6.231 DYNAMIC PROGRAMMING LECTURE 3 LECTURE OUTLINE 6.21 DYNAMIC PROGRAMMING LECTURE LECTURE OUTLINE Deterministic finite-state DP problems Backward shortest path algorithm Forward shortest path algorithm Shortest path examples Alternative shortest path

More information

Stochastic Differential Equations in Finance and Monte Carlo Simulations

Stochastic Differential Equations in Finance and Monte Carlo Simulations Stochastic Differential Equations in Finance and Department of Statistics and Modelling Science University of Strathclyde Glasgow, G1 1XH China 2009 Outline Stochastic Modelling in Asset Prices 1 Stochastic

More information

Log-Robust Portfolio Management

Log-Robust Portfolio Management Log-Robust Portfolio Management Dr. Aurélie Thiele Lehigh University Joint work with Elcin Cetinkaya and Ban Kawas Research partially supported by the National Science Foundation Grant CMMI-0757983 Dr.

More information

Kutay Cingiz, János Flesch, P. Jean-Jacques Herings, Arkadi Predtetchinski. Doing It Now, Later, or Never RM/15/022

Kutay Cingiz, János Flesch, P. Jean-Jacques Herings, Arkadi Predtetchinski. Doing It Now, Later, or Never RM/15/022 Kutay Cingiz, János Flesch, P Jean-Jacques Herings, Arkadi Predtetchinski Doing It Now, Later, or Never RM/15/ Doing It Now, Later, or Never Kutay Cingiz János Flesch P Jean-Jacques Herings Arkadi Predtetchinski

More information

MS-E2114 Investment Science Lecture 5: Mean-variance portfolio theory

MS-E2114 Investment Science Lecture 5: Mean-variance portfolio theory MS-E2114 Investment Science Lecture 5: Mean-variance portfolio theory A. Salo, T. Seeve Systems Analysis Laboratory Department of System Analysis and Mathematics Aalto University, School of Science Overview

More information

CSCI 1951-G Optimization Methods in Finance Part 00: Course Logistics Introduction to Finance Optimization Problems

CSCI 1951-G Optimization Methods in Finance Part 00: Course Logistics Introduction to Finance Optimization Problems CSCI 1951-G Optimization Methods in Finance Part 00: Course Logistics Introduction to Finance Optimization Problems January 26, 2018 1 / 24 Basic information All information is available in the syllabus

More information

Mathematical Annex 5 Models with Rational Expectations

Mathematical Annex 5 Models with Rational Expectations George Alogoskoufis, Dynamic Macroeconomic Theory, 2015 Mathematical Annex 5 Models with Rational Expectations In this mathematical annex we examine the properties and alternative solution methods for

More information

Budget Setting Strategies for the Company s Divisions

Budget Setting Strategies for the Company s Divisions Budget Setting Strategies for the Company s Divisions Menachem Berg Ruud Brekelmans Anja De Waegenaere November 14, 1997 Abstract The paper deals with the issue of budget setting to the divisions of a

More information

Calibration of Interest Rates

Calibration of Interest Rates WDS'12 Proceedings of Contributed Papers, Part I, 25 30, 2012. ISBN 978-80-7378-224-5 MATFYZPRESS Calibration of Interest Rates J. Černý Charles University, Faculty of Mathematics and Physics, Prague,

More information

A No-Arbitrage Theorem for Uncertain Stock Model

A No-Arbitrage Theorem for Uncertain Stock Model Fuzzy Optim Decis Making manuscript No (will be inserted by the editor) A No-Arbitrage Theorem for Uncertain Stock Model Kai Yao Received: date / Accepted: date Abstract Stock model is used to describe

More information

Decision Supporting Model for Highway Maintenance

Decision Supporting Model for Highway Maintenance Decision Supporting Model for Highway Maintenance András I. Baó * Zoltán Horváth ** * Professor of Budapest Politechni ** Adviser, Hungarian Development Ban H-1034, Budapest, 6, Doberdo str. Abstract A

More information

TWO-STAGE NEWSBOY MODEL WITH BACKORDERS AND INITIAL INVENTORY

TWO-STAGE NEWSBOY MODEL WITH BACKORDERS AND INITIAL INVENTORY TWO-STAGE NEWSBOY MODEL WITH BACKORDERS AND INITIAL INVENTORY Ali Cheaitou, Christian van Delft, Yves Dallery and Zied Jemai Laboratoire Génie Industriel, Ecole Centrale Paris, Grande Voie des Vignes,

More information

Dynamic Programming: An overview. 1 Preliminaries: The basic principle underlying dynamic programming

Dynamic Programming: An overview. 1 Preliminaries: The basic principle underlying dynamic programming Dynamic Programming: An overview These notes summarize some key properties of the Dynamic Programming principle to optimize a function or cost that depends on an interval or stages. This plays a key role

More information

VI. Continuous Probability Distributions

VI. Continuous Probability Distributions VI. Continuous Proaility Distriutions A. An Important Definition (reminder) Continuous Random Variale - a numerical description of the outcome of an experiment whose outcome can assume any numerical value

More information

B. Consider the problem of evaluating the one dimensional integral

B. Consider the problem of evaluating the one dimensional integral Proceedings of the 2008 Winter Simulation Conference S. J. Mason, R. R. Hill, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. MONOTONICITY AND STRATIFICATION Gang Zhao Division of Systems Engineering

More information

Elif Özge Özdamar T Reinforcement Learning - Theory and Applications February 14, 2006

Elif Özge Özdamar T Reinforcement Learning - Theory and Applications February 14, 2006 On the convergence of Q-learning Elif Özge Özdamar elif.ozdamar@helsinki.fi T-61.6020 Reinforcement Learning - Theory and Applications February 14, 2006 the covergence of stochastic iterative algorithms

More information

Improved Taguchi Method Based Contracted Capacity Optimization for Power Consumer with Self-Owned Generating Units

Improved Taguchi Method Based Contracted Capacity Optimization for Power Consumer with Self-Owned Generating Units Proceedings of the 6th WSEAS International Conference on Applications of Electrical Engineering, Istanul, Turey, May 7-9, 007 7 Improved Taguchi Method ased Contracted Capacity Optimization for Consumer

More information

Lecture 2: Making Good Sequences of Decisions Given a Model of World. CS234: RL Emma Brunskill Winter 2018

Lecture 2: Making Good Sequences of Decisions Given a Model of World. CS234: RL Emma Brunskill Winter 2018 Lecture 2: Making Good Sequences of Decisions Given a Model of World CS234: RL Emma Brunskill Winter 218 Human in the loop exoskeleton work from Steve Collins lab Class Structure Last Time: Introduction

More information

Approximations of Stochastic Programs. Scenario Tree Reduction and Construction

Approximations of Stochastic Programs. Scenario Tree Reduction and Construction Approximations of Stochastic Programs. Scenario Tree Reduction and Construction W. Römisch Humboldt-University Berlin Institute of Mathematics 10099 Berlin, Germany www.mathematik.hu-berlin.de/~romisch

More information

Casino gambling problem under probability weighting

Casino gambling problem under probability weighting Casino gambling problem under probability weighting Sang Hu National University of Singapore Mathematical Finance Colloquium University of Southern California Jan 25, 2016 Based on joint work with Xue

More information

Lecture 17: More on Markov Decision Processes. Reinforcement learning

Lecture 17: More on Markov Decision Processes. Reinforcement learning Lecture 17: More on Markov Decision Processes. Reinforcement learning Learning a model: maximum likelihood Learning a value function directly Monte Carlo Temporal-difference (TD) learning COMP-424, Lecture

More information

Optimal rebalancing of portfolios with transaction costs assuming constant risk aversion

Optimal rebalancing of portfolios with transaction costs assuming constant risk aversion Optimal rebalancing of portfolios with transaction costs assuming constant risk aversion Lars Holden PhD, Managing director t: +47 22852672 Norwegian Computing Center, P. O. Box 114 Blindern, NO 0314 Oslo,

More information

A Decentralized Learning Equilibrium

A Decentralized Learning Equilibrium Paper to be presented at the DRUID Society Conference 2014, CBS, Copenhagen, June 16-18 A Decentralized Learning Equilibrium Andreas Blume University of Arizona Economics ablume@email.arizona.edu April

More information

Stochastic Optimal Control

Stochastic Optimal Control Stochastic Optimal Control Lecturer: Eilyan Bitar, Cornell ECE Scribe: Kevin Kircher, Cornell MAE These notes summarize some of the material from ECE 5555 (Stochastic Systems) at Cornell in the fall of

More information

Kreps & Scheinkman with product differentiation: an expository note

Kreps & Scheinkman with product differentiation: an expository note Kreps & Scheinkman with product differentiation: an expository note Stephen Martin Department of Economics Purdue University West Lafayette, IN 47906 smartin@purdueedu April 2000; revised Decemer 200;

More information

SOLVING ROBUST SUPPLY CHAIN PROBLEMS

SOLVING ROBUST SUPPLY CHAIN PROBLEMS SOLVING ROBUST SUPPLY CHAIN PROBLEMS Daniel Bienstock Nuri Sercan Özbay Columbia University, New York November 13, 2005 Project with Lucent Technologies Optimize the inventory buffer levels in a complicated

More information

Barrier Options Pricing in Uncertain Financial Market

Barrier Options Pricing in Uncertain Financial Market Barrier Options Pricing in Uncertain Financial Market Jianqiang Xu, Jin Peng Institute of Uncertain Systems, Huanggang Normal University, Hubei 438, China College of Mathematics and Science, Shanghai Normal

More information

Essays on Some Combinatorial Optimization Problems with Interval Data

Essays on Some Combinatorial Optimization Problems with Interval Data Essays on Some Combinatorial Optimization Problems with Interval Data a thesis submitted to the department of industrial engineering and the institute of engineering and sciences of bilkent university

More information

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012 Game Theory Lecture Notes By Y. Narahari Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012 Chapter 6: Mixed Strategies and Mixed Strategy Nash Equilibrium

More information

Robust Optimization Applied to a Currency Portfolio

Robust Optimization Applied to a Currency Portfolio Robust Optimization Applied to a Currency Portfolio R. Fonseca, S. Zymler, W. Wiesemann, B. Rustem Workshop on Numerical Methods and Optimization in Finance June, 2009 OUTLINE Introduction Motivation &

More information

The Optimization Process: An example of portfolio optimization

The Optimization Process: An example of portfolio optimization ISyE 6669: Deterministic Optimization The Optimization Process: An example of portfolio optimization Shabbir Ahmed Fall 2002 1 Introduction Optimization can be roughly defined as a quantitative approach

More information

Sample Path Large Deviations and Optimal Importance Sampling for Stochastic Volatility Models

Sample Path Large Deviations and Optimal Importance Sampling for Stochastic Volatility Models Sample Path Large Deviations and Optimal Importance Sampling for Stochastic Volatility Models Scott Robertson Carnegie Mellon University scottrob@andrew.cmu.edu http://www.math.cmu.edu/users/scottrob June

More information

Solving real-life portfolio problem using stochastic programming and Monte-Carlo techniques

Solving real-life portfolio problem using stochastic programming and Monte-Carlo techniques Solving real-life portfolio problem using stochastic programming and Monte-Carlo techniques 1 Introduction Martin Branda 1 Abstract. We deal with real-life portfolio problem with Value at Risk, transaction

More information

Lecture 3: Review of mathematical finance and derivative pricing models

Lecture 3: Review of mathematical finance and derivative pricing models Lecture 3: Review of mathematical finance and derivative pricing models Xiaoguang Wang STAT 598W January 21th, 2014 (STAT 598W) Lecture 3 1 / 51 Outline 1 Some model independent definitions and principals

More information

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program August 2017

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program August 2017 Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program August 2017 The time limit for this exam is four hours. The exam has four sections. Each section includes two questions.

More information

Markov Decision Processes (MDPs) CS 486/686 Introduction to AI University of Waterloo

Markov Decision Processes (MDPs) CS 486/686 Introduction to AI University of Waterloo Markov Decision Processes (MDPs) CS 486/686 Introduction to AI University of Waterloo Outline Sequential Decision Processes Markov chains Highlight Markov property Discounted rewards Value iteration Markov

More information

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same.

Chapter 14 : Statistical Inference 1. Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Chapter 14 : Statistical Inference 1 Chapter 14 : Introduction to Statistical Inference Note : Here the 4-th and 5-th editions of the text have different chapters, but the material is the same. Data x

More information

Optimal Allocation of Policy Limits and Deductibles

Optimal Allocation of Policy Limits and Deductibles Optimal Allocation of Policy Limits and Deductibles Ka Chun Cheung Email: kccheung@math.ucalgary.ca Tel: +1-403-2108697 Fax: +1-403-2825150 Department of Mathematics and Statistics, University of Calgary,

More information

Dynamic Asset and Liability Management Models for Pension Systems

Dynamic Asset and Liability Management Models for Pension Systems Dynamic Asset and Liability Management Models for Pension Systems The Comparison between Multi-period Stochastic Programming Model and Stochastic Control Model Muneki Kawaguchi and Norio Hibiki June 1,

More information

Social Common Capital and Sustainable Development. H. Uzawa. Social Common Capital Research, Tokyo, Japan. (IPD Climate Change Manchester Meeting)

Social Common Capital and Sustainable Development. H. Uzawa. Social Common Capital Research, Tokyo, Japan. (IPD Climate Change Manchester Meeting) Social Common Capital and Sustainable Development H. Uzawa Social Common Capital Research, Tokyo, Japan (IPD Climate Change Manchester Meeting) In this paper, we prove in terms of the prototype model of

More information

On Existence of Equilibria. Bayesian Allocation-Mechanisms

On Existence of Equilibria. Bayesian Allocation-Mechanisms On Existence of Equilibria in Bayesian Allocation Mechanisms Northwestern University April 23, 2014 Bayesian Allocation Mechanisms In allocation mechanisms, agents choose messages. The messages determine

More information

Asset allocation under regime-switching models

Asset allocation under regime-switching models Title Asset allocation under regime-switching models Authors Song, N; Ching, WK; Zhu, D; Siu, TK Citation The 5th International Conference on Business Intelligence and Financial Engineering BIFE 212, Lanzhou,

More information

Stock Repurchase with an Adaptive Reservation Price: A Study of the Greedy Policy

Stock Repurchase with an Adaptive Reservation Price: A Study of the Greedy Policy Stock Repurchase with an Adaptive Reservation Price: A Study of the Greedy Policy Ye Lu Asuman Ozdaglar David Simchi-Levi November 8, 200 Abstract. We consider the problem of stock repurchase over a finite

More information

Handout 8: Introduction to Stochastic Dynamic Programming. 2 Examples of Stochastic Dynamic Programming Problems

Handout 8: Introduction to Stochastic Dynamic Programming. 2 Examples of Stochastic Dynamic Programming Problems SEEM 3470: Dynamic Optimization and Applications 2013 14 Second Term Handout 8: Introduction to Stochastic Dynamic Programming Instructor: Shiqian Ma March 10, 2014 Suggested Reading: Chapter 1 of Bertsekas,

More information

Math 416/516: Stochastic Simulation

Math 416/516: Stochastic Simulation Math 416/516: Stochastic Simulation Haijun Li lih@math.wsu.edu Department of Mathematics Washington State University Week 13 Haijun Li Math 416/516: Stochastic Simulation Week 13 1 / 28 Outline 1 Simulation

More information

16 MAKING SIMPLE DECISIONS

16 MAKING SIMPLE DECISIONS 247 16 MAKING SIMPLE DECISIONS Let us associate each state S with a numeric utility U(S), which expresses the desirability of the state A nondeterministic action A will have possible outcome states Result

More information

Australian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model

Australian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model AENSI Journals Australian Journal of Basic and Applied Sciences Journal home page: wwwajbaswebcom Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model Khawla Mustafa Sadiq University

More information

Lecture outline W.B.Powell 1

Lecture outline W.B.Powell 1 Lecture outline What is a policy? Policy function approximations (PFAs) Cost function approximations (CFAs) alue function approximations (FAs) Lookahead policies Finding good policies Optimizing continuous

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections 1 / 40 Chapter 7: Estimation Sections 7.1 Statistical Inference Bayesian Methods: Chapter 7 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods:

More information

arxiv: v1 [math.pr] 6 Apr 2015

arxiv: v1 [math.pr] 6 Apr 2015 Analysis of the Optimal Resource Allocation for a Tandem Queueing System arxiv:1504.01248v1 [math.pr] 6 Apr 2015 Liu Zaiming, Chen Gang, Wu Jinbiao School of Mathematics and Statistics, Central South University,

More information

Assortment Optimization Over Time

Assortment Optimization Over Time Assortment Optimization Over Time James M. Davis Huseyin Topaloglu David P. Williamson Abstract In this note, we introduce the problem of assortment optimization over time. In this problem, we have a sequence

More information

Approximate Revenue Maximization with Multiple Items

Approximate Revenue Maximization with Multiple Items Approximate Revenue Maximization with Multiple Items Nir Shabbat - 05305311 December 5, 2012 Introduction The paper I read is called Approximate Revenue Maximization with Multiple Items by Sergiu Hart

More information

A New Multivariate Kurtosis and Its Asymptotic Distribution

A New Multivariate Kurtosis and Its Asymptotic Distribution A ew Multivariate Kurtosis and Its Asymptotic Distribution Chiaki Miyagawa 1 and Takashi Seo 1 Department of Mathematical Information Science, Graduate School of Science, Tokyo University of Science, Tokyo,

More information