Lecture 21. Dynamic Programming. Dynamic programming is a method to solve optimal control problems. Here we
|
|
- Chester Gibbs
- 5 years ago
- Views:
Transcription
1 Lectre 21 Dynamic Programming Karl Henrik Johansson Dynamic programming is a method to solve optimal control problems. Here we introdce the notion by discssing dynamic programming for a combinatorial problem and dynamic programming for continos-time systems. 1 Discrete-Time Systems Consider the problem of traveling by minimm cost from point A to point in the following graph: A The weights on the edges denote the cost for taking a particlar way between two vertices. We assme that it is only possible to move north-east and soth-east in the graph. One way of solving this problem is, of corse, to calclate the total cost for all possible paths going from A and. A more ecient way (in particlar, if the nmber of vertices is large) is to se dynamic programming. To illstrate the idea, let s introdce some notation. Let x(t) 2 f1; : : :; 4g f1; : : :; 4g denote the state at (discrete) time t 2 f0; : : :; g, where each time step corresponds to moving one step to the right in the graph. The vertices are labeled after the rows (pointing soth-east) and the colmns (pointing north-east) in the graph, sch that the initial state A is x(0) = (1; 1) and the nal state is x() = (4; 4). At each time t 2 f0; : : :; 5g, a control action (t) 2 f1g is to be chosen, which decides how next step shold be taken ( = +1 for north-east and = 1 for
2 EECS291E Hybrid Systems Lectre 21 soth-east). Note that for some states the control is constrained, for example, for x(4) = (4; 1) only (4) = +1 is possible. Dene the cost fnction J(()) = P 5 k=0 W (x(k); x(k + 1)) for a path from A to, where W (x(k); x(k + 1)) is the weight on the edge (x(k); x(k + 1)). 1 Let the cost-to-go fnction J (x(t); t) denote the optimal (minimm) cost fnction starting in x(t) at time t, i.e., J (x(t); t) = min (t);:::;(5) J(()): The dynamic programming soltion to the traveling problem is based on deriving the cost-to-go fnction for all states, starting with the nal state and then going backwards ntil the initial state is reached. We do the bookkeeping by specifying the vale J (x(t); t) at the vertex corresponding to (x(t); t). At the nal state, we obviosly have J ((4; 4); ) = 0. At (3; 4), we have J ((3; 4); 5) = J ((4; 4); ) + W ((3; 4); (4; 4)) = 0 + = and similarly we have J ((4; 3); 5) =, as indicated in the following detail of the graph: 0 The arrows indicate the optimal control, i.e., the optimal action to take at a certain state. For (3; 4) and (4; 3) the optimal control is, of corse, the trivial choices shown above. Deriving J (; 4) yields the following detail of the graph: Contining like this gives the graph 1 Here x() denotes a fnction in contrast to a single point, x(k), say. Note also that, of corse, x() depends on () throgh the dynamics imposed by the graph. 2
3 EECS291E Hybrid Systems Lectre A From this, we may easily depict the optimal soltion going from A to by jst following the arrows: A This is the dynamic programming soltion to the graph traveling problem. In general, we have J (x(t); t) = J (x(t + 1); t + 1) + W (x(t); x(t + 1)): Hence, the cost-to-go fnction at time t depends only on the cost-to-go fnction at time t + 1 and the optimal cost going from x(t) to x(t + 1). This is a simple, bt fndamental, fact, which is called the principle of optimality. The principle of optimality states that if gives the optimal soltion (i); (i + 1); : : : ; (j); : : :; (k) x(i); x(i + 1); : : : ; x(j); : : : ; x(k); 3
4 EECS291E Hybrid Systems Lectre 21 then the trncated control (j); : : : ; (k) gives the optimal soltion from x(j). It ths follows that it is not necessary to solve the whole optimal control at once, bt instead we may solve individal pieces going backwards from the nal state and then patch these pieces together, as was done in the example. Comparing dynamic programming with the approach of deriving all possible paths from A and, we see that the dynamic programming leads to the calclation of 15 nmbers while there are 20 possible paths. In general, for a graph with N N vertices, we have N 2 1 cost-to-go fnctions to calclated compared to (2(N 1))!=((N 1)!) 2 paths. For example, for N = 8, we get 3 compared to Dynamic programming atomatically leads to a feedback soltion: to each state (vertex) there is a dedicated control action (arrow). This means that the soltion is robst, for instance, if an implse distrbance moves the state from one location to another, still the optimal control is applied. Dynamic programming is particlarly ecient in mlti-stage decision problems, when there are few control choices at each stage. In the traveling problem, we have only two choices at each time step. If the dimensions of the control space and the state space are large, dynamic programming may be less appealing. The phrase \crse of dimensionality" was given to the problem of exponential growth of a hyper-volme as a fnction of dimensionality Dynamic programming may also be applied to discrete-time systems specied as dierence eqations. Next, however, we consider the continos-time set-p. 2 Continos-Time Systems Consider the continos-time control system _x(t) = f(x(t); (t); t); x(t0) = x0; (1) where t 2 [t0; t f ], x(t) 2 R n, and (t) 2 R. Assme that f is smooth in all its argments and that is piecewise continos. Dene the cost fnction J(()) = Z tf t0 L(x(s); (s); s) ds + (x(t f ); t f ); (2) where the rnning cost L and the terminal cost are smooth fnctions. oth the initial time t0 and the nal time t f > t0 are assmed to be xed. The optimal control problem is to minimize J with respect to : [t0; t f ]! R sbject to (1). The optimal control is denoted : [t0; t f ]! R and the corresponding trajectory x : [t0; t f ]! R n. The cost-to-go fnction is dened as the minimm cost to go from any state x 2 R n at time t 2 [t0; t f ] to x(t f ) and is given by Z tf J (x; t) = min L(x(s); (s); s) ds + (x(t f ); t f ): () t 4
5 EECS291E Hybrid Systems Lectre 21 The cost-to-go fnction at t = t0 is hence the vale of the cost fnction for the optimal control, i.e., J (x (t0); t0) = J( ()): The principle of optimality for continos-time control systems reads as follows. Let x and be the optimal trajectory and optimal control as dened above. Assme x0 = x ( t0 ) for some t0 2 (t0; t f ). Then, the optimal control for (1) with respect to (2), where t0 is replaced by t0 and x0 by x0, is given by the trncated control : [ t0 ; t f ]! R with (t) = (t) for all t 2 [ t0 ; t f ]. This means, similar to the discrete case, that the optimal control problem can be solved by going backwards from the nal state, by rst solving the optimal control problem on some interval [t f ; t f ], then on [t f 2; t f ] etc, and nally patching the soltions together to get the soltion on [t0; t f ]. Hamilton-Jacobi-ellman Using the principle of optimality, we do a heristic derivation of the Hamilton- Jacobi-ellman eqation. Consider an optimal trajectory going from x (t0) to x (t f ), shown as a solid line the following gre: x (t0) x 00 = x(t 0 + t) x 0 = x(t 0 ) x(t f ) x (t f ) Consider a non-optimal trajectory deviating from the optimal only between x(t 0 ) = x 0 and x(t 0 + t) = x 00, and from x 00 following an optimal trajectory to the nal state at t = t f. The deviation is shown with a dashed line above. Denote the cost for starting in x 0 at t 0 and passing throgh x 00 by J 0 (x 0 ; t 0 ). Clearly, J 0 (x 0 ; t 0 ) J (x 0 ; t 0 ) with eqality only for t = 0, i.e., only if (t) = (t) for all t 2 (t 0 ; t f ). If t > 0 is small, then J 0 (x 0 ; t 0 ) J (x 0 ; t 0 + t) + L(x 0 ; 0 ; t 0 )t; where we sed the notation 0 = (t 0 ). Hence, assming that the following minima are attained, we get J (x 0 ; t 0 ) = min J (x 00 ; t 0 + t) + L(x 0 ; 0 ; t 0 )t = min J (x 0 + f(x 0 ; 0 ; t 0 )t; t 0 + t) + L(x 0 ; 0 ; t 0 )t = min J (x 0 ; t 0 (x0 ; t 0 )f(x 0 ; 0 ; t 0 (x0 ; t 0 )t + L(x 0 ; 0 ; t 0 )t : 5
6 EECS291E Hybrid Systems Lectre 21 Note that the minima are taken pointwise, i.e., 2 R. This yields the Hamilton- (x; t) (x; t)f(x; ; t) + L(x; ; t) ; J (x; t f ) = (x; t f ): This is a partial dierential eqation with bondary condition given by the terminal cost. Under certain conditions, the soltion of this eqation corresponds to the optimal cost. The optimal control is eqal (t) = arg (x; t)f(x; ; t) + L(x; ; t) : Note that this pointwise minimization gives a feedback control law: given the fnction J, the control is derived from the crrent state x(t). We rephrase the Hamilton-Jacobi-ellman eqation in a slightly more general setting in the following theorem. Here the nal time t f is a variable and x(t f ) is constrained by the terminal condition (x(t f ); t f ) = 0, where is a smooth fnction. Theorem 1 (Hamilton-Jacobi-ellman Eqation). Assme the C 1 fnction V satises the Hamilton-Jacobi-ellman eqation (x; t) = (x; t)f(x; ; t) + L(x; ; t) ; 8x; t V (x; t f ) = (x; t f ); 8x 2 fz : (z; t f ) = 0g: Also, assme the minimm is attained for all x and t for some : R n [t0; t f ]! R, which is piecewise C 0 in t. Finally, assme that the soltion to _x(t) = f(x(t); (x(t); t); t) is niqe for all initial states. Then, V is the niqe soltion of the Hamilton- Jacobi-ellman eqation and it is eqal to the optimal cost-to-go fnction, i.e., V (x; t) = J (x; t); 8x; t: Frthermore, the optimal control is given by (t) = (x(t); t) = arg (x(t); t)f(x(t); ; t) + L(x(t); ; t) : Linear Qadratic Control Consider the optimal control problem given by the linear time-invariant system _x(t) = Ax(t) + (t)
7 EECS291E Hybrid Systems Lectre 21 and cost fnction J(()) = Z tf t0 x(t)t Qx(t) + (t) 2 dt + x(t f ) T Q f x(t f ); where t0 and t f are xed, Q and Q f are positive semidenite matrices, and > 0. The Hamilton-Jacobi-ellman eqation is then eqal (x; t) = (x; t)(ax + ) + xt Qx + 2 with bondary condition V (x; t f ) = x T Q f x: Let s try a soltion of the form V (x; t) = x T P (t)x with P (t) = P (t) T. = _ xt P = 2xT P (t); which gives x T P(t)x _ = min 2xT P (t)ax + 2x T P (t) + x T Qx + 2 : The minimm is attained for = 1 T P (t)x. Sbstitting this into the Hamilton-Jacobi-ellman eqation gives x T P(t)x _ = 2x T P (t)ax 2 1 x T P (t) T P (t)x+x T Qx+x T P (t) 1 T P (t)x; so P (t) mst satisfy the continos-time Riccati eqation _ P (t) = P (t)a A T P (t) + P (t) 1 T P (t) Q; P (t f ) = Q f : Solving this matrix dierential eqation ths gives the optimal linear qadratic control as (t) = 1 T P (t)x(t); which is a time-varying linear state-feedback control law. Note that it follows from Theorem 1 that this control is niqe, so there exist no nonlinear controller that performs eqally well or better. ackgrond The term dynamic programming was coined by ellman [1]. There are several good textbooks on optimal control and dynamic programming, e.g., [2{5]. The example in Section 1 is from [3].
8 EECS291E Hybrid Systems Lectre 21 References 1. R. ellman. Dynamic Prograggmin. Princeton University Press, Princeton, NJ, D. P. ertsekas. Dynamic Programming and Optimal Control: Volmes I{II. Athena Scientic, elmont, MA, A. E. ryson and Y.-C. Ho. Applied Optimal Control. Hemisphere Pblishing Corporation, G. Leitmann. An Introdction to Optimal Control. McGrawHill, New York, NY, L. C. Yong. Optimal Control Theory. Chelsea,
Finance: Risk Management Module II: Optimal Risk Sharing and Arrow-Lind Theorem
Institte for Risk Management and Insrance Winter 00/0 Modle II: Optimal Risk Sharing and Arrow-Lind Theorem Part I: steinorth@bwl.lm.de Efficient risk-sharing between risk-averse individals Consider two
More informationSpecialization, Matching Intensity and Income Inequality of Sellers
MPRA Mnich Personal RePEc Archive Specialization, Matching Intensity and Income Ineqality of Sellers Konstantinos Eleftherio and Michael Polemis University of Piraes 1 October 016 Online at https://mpra.b.ni-menchen.de/74579/
More informationDifferentiation of some functionals of multidimensional risk processes and determination of optimal reserve allocation
Differentiation of some fnctionals of mltidimensional risk processes and determination of optimal reserve allocation Stéphane Loisel Laboratoire de Sciences Actarielle et Financière Université Lyon 1,
More informationRight-cancellability of a family of operations on binary trees
Right-cancellability of a family of operations on binary trees Philippe Dchon LaBRI, U.R.A. CNRS 1304, Université Bordeax 1, 33405 Talence, France We prove some new reslts on a family of operations on
More information)] = q * So using this equation together with consumer 1 s best response function, we can solve for Consumer 1 s best response becomes:
Econ8: Introdction to Game Theory ASWERS T PRBLEM SET a If yo decide to bring q i coins to the market, yor payoff is determined as: i P qi 60 qi [8 0.5q q q 5 ]q i 60 q i If firm i sbscript confses yo,
More informationComparing allocations under asymmetric information: Coase Theorem revisited
Economics Letters 80 (2003) 67 71 www.elsevier.com/ locate/ econbase Comparing allocations nder asymmetric information: Coase Theorem revisited Shingo Ishigro* Gradate School of Economics, Osaka University,
More informationEE365: Risk Averse Control
EE365: Risk Averse Control Risk averse optimization Exponential risk aversion Risk averse control 1 Outline Risk averse optimization Exponential risk aversion Risk averse control Risk averse optimization
More informationCrank Nicolson Scheme
Crank Nicolson Scheme De to some limitations over Explicit Scheme, mainly regarding convergence and stability, another schemes were developed which have less trncation error and which are nconditionally
More informationWinter 2015/16. Insurance Economics. Prof. Dr. Jörg Schiller.
Winter 15/16 Insrance Economics Prof. Dr. Jörg Schiller j.schiller@ni-hohenheim.de Yo ill find frther information on or ebpage: http://.insrance.ni-hohenheim.de and on https://ilias.ni-hohenheim.de Agenda
More informationApplication of US GAAP training programme
Application of US GAAP training programme 8-day comprehensive programme to prepare yo for applying acconting rles and procedres that comprise US GAAP Client Relations Officer Małgorzata Tryc tel. +48 22
More informationThe Economics of Climate Change C 175 Christian Traeger Part 3: Policy Instruments continued. Standards and Taxes
The Economics of Climate Change C 75 The Economics of Climate Change C 75 Christian Traeger Part 3: Policy Instrments contined Standards and Taxes Lectre 0 Read: Parry, I.W.H. & W.A. Pier (007), Emissions
More informationPreserving Privacy in Collaborative Filtering through Distributed Aggregation of Offline Profiles
Preserving Privacy in Collaborative Filtering throgh Distribted Aggregation of Offline Profiles Reza Shokri, Pedram Pedarsani, George Theodorakopolos, and Jean-Pierre Hbax Laboratory for Compter Commnications
More informationFORMULAS FOR STOPPED DIFFUSION PROCESSES WITH STOPPING TIMES BASED ON DRAWDOWNS AND DRAWUPS
FORMULAS FOR STOPPED DIFFUSION PROCESSES WITH STOPPING TIMES BASED ON DRAWDOWNS AND DRAWUPS By Libor Pospisil, Jan Vecer, and Olympia Hadjiliadis, Colmbia University and Brooklyn College (C.U.N.Y.) This
More informationMethod of Characteristics
Method of Characteristics A. Salih Department of Aerospace Engineering Indian Institte of Space Science and Technology, Thirvananthapram 1 Jly 2016 1 Introction Let s recall a simple fact from the theory
More informationLevedahl s explanation for the cashout puzzle in the U.S. Food Stamp Program: A Comment *
Levedahl s explanation for the cashot pzzle in the U.S. Food Stamp Program: A Comment * Robert V. Brenig and Indraneel Dasgpta Address for Correspondence: Robert V. Brenig Centre for Economic Policy Research,
More informationRight-cancellability of a family of operations on binary trees
Discrete Mathematics and Theoretical Compter Science 2, 1998, 27 33 Right-cancellability of a family of operations on binary trees Philippe Dchon LaBRI, U.R.A. CNRS 1304, Université Bordeax 1, 33405 Talence,
More informationMaster the opportunities
TM MasterDex 5 Annity Master the opportnities A fixed index annity with point-to-point monthly crediting and a premim bons Allianz Life Insrance Company of North America CB50626-CT Page 1 of 16 Discover
More informationPrice Postponement in a Newsvendor Model with Wholesale Price-Only Contracts
Prde University Prde e-pbs Prde CIER Working Papers Krannert Gradate School of anagement 1-1-011 Price Postponement in a Newsvendor odel with Wholesale Price-Only Contracts Yanyi X Shanghai University
More informationSwitch Bound Allocation for Maximizing Routability in. Department of Computer Sciences. University of Texas at Austin. Austin, Texas
Switch Bond Allocation for Maximizing Rotability in Timing-Drien Roting of FPGAs Kai Zh and D.F. Wong Department of ompter Sciences Uniersity of Texas at Astin Astin, Texas 787-88 Abstract In segmented
More informationPORTFOLIO OPTIMIZATION AND EXPECTED SHORTFALL MINIMIZATION FROM HISTORICAL DATA
PORTFOLIO OPTIMIZATION AND EXPECTED SHORTFALL MINIMIZATION FROM HISTORICAL DATA We begin by describing the problem at hand which motivates our results. Suppose that we have n financial instruments at hand,
More informationThe Theory of the Consumer. Correction from Last Week. Consumer Preferences. Preference Orderings Example. Axioms of Consumer Preference
Professor Jay Bhattacharya Sring 00 Correction from Last Week The algebraic reresentation of the bdget constraint for stams is: 0 hose = I hose + hose hose = I 0 if < 0 if The Theory of the Consmer Model
More informationTHE TRAVELING SALESMAN PROBLEM FOR MOVING POINTS ON A LINE
THE TRAVELING SALESMAN PROBLEM FOR MOVING POINTS ON A LINE GÜNTER ROTE Abstract. A salesperson wants to visit each of n objects that move on a line at given constant speeds in the shortest possible time,
More informationDynamic Programming (DP) Massimo Paolucci University of Genova
Dynamic Programming (DP) Massimo Paolucci University of Genova DP cannot be applied to each kind of problem In particular, it is a solution method for problems defined over stages For each stage a subproblem
More informationYao s Minimax Principle
Complexity of algorithms The complexity of an algorithm is usually measured with respect to the size of the input, where size may for example refer to the length of a binary word describing the input,
More informationThe Binomial Model. Chapter 3
Chapter 3 The Binomial Model In Chapter 1 the linear derivatives were considered. They were priced with static replication and payo tables. For the non-linear derivatives in Chapter 2 this will not work
More informationA Note on Correlated Uncertainty and Hybrid Environmental Policies
This version: April 2014 A Note on Correlated Uncertainty and Hybrid Environmental Policies John K. Stranlnd* Department of Resorce Economics University of Massachsetts, Amherst Abstract: This note examines
More informationLecture25: Simultaneous Equation Modeling(Contd.)
P age Modle 8: Simltaneos Eqation Modeling Lectre5: Simltaneos Eqation Modeling(Contd.) Rdra P. Pradhan Vinod Gpta School of Management Indian Institte of Technolog Kharagpr, India Email: rdrap@vgsom.iitkgp.ernet
More informationSemantic Path based Personalized Recommendation on Weighted Heterogeneous Information Networks
Semantic Path based Personalized Recommendation on Weighted Heterogeneos Information Networks Chan Shi Beijing University of Posts and Telecommnications Beijing, China shichan@bpt.ed.cn Philip S. Y University
More informationECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games
University of Illinois Fall 2018 ECE 586GT: Problem Set 1: Problems and Solutions Analysis of static games Due: Tuesday, Sept. 11, at beginning of class Reading: Course notes, Sections 1.1-1.4 1. [A random
More informationSingle-Year and Multi-year Insurance Policies in a Competitive Market
University of Pennsylvania ScholarlyCommons Operations, Information and Decisions Papers Wharton Faclty Research 8-01 Single-Year and Mlti-year Insrance Policies in a Competitive Market Pal R. Kleindorfer
More informationDynamic Programming: An overview. 1 Preliminaries: The basic principle underlying dynamic programming
Dynamic Programming: An overview These notes summarize some key properties of the Dynamic Programming principle to optimize a function or cost that depends on an interval or stages. This plays a key role
More informationIEOR E4004: Introduction to OR: Deterministic Models
IEOR E4004: Introduction to OR: Deterministic Models 1 Dynamic Programming Following is a summary of the problems we discussed in class. (We do not include the discussion on the container problem or the
More informationHandout 4: Deterministic Systems and the Shortest Path Problem
SEEM 3470: Dynamic Optimization and Applications 2013 14 Second Term Handout 4: Deterministic Systems and the Shortest Path Problem Instructor: Shiqian Ma January 27, 2014 Suggested Reading: Bertsekas
More informationOwnership structure and rm performance: evidence from the UK nancial services industry
Applied Financial Economics, 1998, 8, 175Ð 180 Ownership strctre and rm performance: evidence from the UK nancial services indstry RAM MUD AMB I* and C ARMELA NI COS IA *ISMA Centre, University of Reading,
More information. COVERS OF D-TYPE ARTIN GROUPS. 1. Introduction
. COVERS OF D-TYPE ARTIN GROUPS MEIRAV AMRAM 1, ROBERT SHWARTZ 1 AND MINA TEICHER Abstract. We stdy certain qotients of Generalized Artin grops which hae natral map onto D-type Artin grops. In particlar
More informationMartingale Pricing Theory in Discrete-Time and Discrete-Space Models
IEOR E4707: Foundations of Financial Engineering c 206 by Martin Haugh Martingale Pricing Theory in Discrete-Time and Discrete-Space Models These notes develop the theory of martingale pricing in a discrete-time,
More informationOperational Planning of Thermal Generators with Factored Markov Decision Process Models
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Operational Planning of Thermal Generators with Factored Markov Decision Process Models Nikovski, D. TR03-048 Jne 03 Abstract We describe a
More information1 Mathematics in a Pill 1.1 PROBABILITY SPACE AND RANDOM VARIABLES. A probability triple P consists of the following components:
1 Mathematics in a Pill The purpose of this chapter is to give a brief outline of the probability theory underlying the mathematics inside the book, and to introduce necessary notation and conventions
More informationMarkowitz portfolio theory
Markowitz portfolio theory Farhad Amu, Marcus Millegård February 9, 2009 1 Introduction Optimizing a portfolio is a major area in nance. The objective is to maximize the yield and simultaneously minimize
More informationIntersectoral Adjustment and Policy Intervention: the Importance of General-Equilibrium Effects
Review of International Economics, 3(), 330 355, 005 Intersectoral Adjstment and Policy Intervention: the Importance of General-Eqilibrim Effects Larry Karp and Thierry Pal* Abstract We model adjstment
More informationthe effort and sales should be higher.
10.3 An Example of Postcontractal Hidden Knowledge: The Salesman Game ð If the cstomer type is a Pshoer, the efficient sales effort is low and sales shold be moderate. ð If the cstomer type is a Bonanza,
More informationOutline for today. Stat155 Game Theory Lecture 13: General-Sum Games. General-sum games. General-sum games. Dominated pure strategies
Outline for today Stat155 Game Theory Lecture 13: General-Sum Games Peter Bartlett October 11, 2016 Two-player general-sum games Definitions: payoff matrices, dominant strategies, safety strategies, Nash
More information3. Prove Lemma 1 of the handout Risk Aversion.
IDEA Economics of Risk and Uncertainty List of Exercises Expected Utility, Risk Aversion, and Stochastic Dominance. 1. Prove that, for every pair of Bernouilli utility functions, u 1 ( ) and u 2 ( ), and
More informationJob Assist. Employee Information IT58. Ver RPC001386_EN_WB_L_3
IT58 Job Assist Employee Information Ver. 11.09 RPC001386_EN_WB_L_3 HAVE YOU BEEN OUT OF WORK FOR SOME TIME, AND ARE NOW TAKING UP A JOB? If the answer is YES, then yo may be able to claim additional tax
More informationIteration. The Cake Eating Problem. Discount Factors
18 Value Function Iteration Lab Objective: Many questions have optimal answers that change over time. Sequential decision making problems are among this classification. In this lab you we learn how to
More information1 Introduction and Motivation Time and uncertainty are central elements in nance theory. Pricing theory, ecient market theory, portfolio selection the
Stochastic Programming Tutorial for Financial Decision Making The Saddle Property of Optimal Prots Karl Frauendorfer Institute of Operations Research, University of St. Gallen Holzstr. 15, 9010 St. Gallen,
More informationFBT 2016 Supplement FBT Update. General FBT update. GST and taxable value of fringe benefits. Two FBT gross-up rates. FBT Supplement 2015/16
FBT 2016 Spplement April 2016 FBT Spplement 2015/16 2016 FBT Update The following is an pdate on rates, declarations and other precedent forms that may assist in the preparation of clients 2016 FBT retrns.
More information1. Trinomial model. This chapter discusses the implementation of trinomial probability trees for pricing
TRINOMIAL TREES AND FINITE-DIFFERENCE SCHEMES 1. Trinomial model This chapter discusses the implementation of trinomial probability trees for pricing derivative securities. These models have a lot more
More informationVersion A. Problem 1. Let X be the continuous random variable defined by the following pdf: 1 x/2 when 0 x 2, f(x) = 0 otherwise.
Math 224 Q Exam 3A Fall 217 Tues Dec 12 Version A Problem 1. Let X be the continuous random variable defined by the following pdf: { 1 x/2 when x 2, f(x) otherwise. (a) Compute the mean µ E[X]. E[X] x
More informationMAT 4250: Lecture 1 Eric Chung
1 MAT 4250: Lecture 1 Eric Chung 2Chapter 1: Impartial Combinatorial Games 3 Combinatorial games Combinatorial games are two-person games with perfect information and no chance moves, and with a win-or-lose
More informationApproximate Confidence Intervals for a Parameter of the Negative Hypergeometric Distribution
Approximate Confidence Intervals for a Parameter of the Negative Hypergeometric Distribtion Lei Zhang 1, William D. Johnson 2 1. Office of Health Data and Research, Mississippi State Department of Health,
More informationChapter 7: Portfolio Theory
Chapter 7: Portfolio Theory 1. Introduction 2. Portfolio Basics 3. The Feasible Set 4. Portfolio Selection Rules 5. The Efficient Frontier 6. Indifference Curves 7. The Two-Asset Portfolio 8. Unrestriceted
More informationDo you have to comply?
isolved ACA The reporting reqirements that the Affordable Care Act (ACA) imposed were difficlt for many employers to meet. If yor ACA process last year didn t go as smoothly as it cold have, now is the
More informationNBER WORKING PAPER SERIES CURRENCY RETURNS, INSTITUTIONAL INVESTOR FLOWS, AND EXCHANGE RATE FUNDAMENTALS. Kenneth A. Froot Tarun Ramadorai
NBER WORKING PAPER SERIES CURRENCY RETURNS, INSTITUTIONAL INVESTOR FLOWS, AND EXCHANGE RATE FUNDAMENTALS Kenneth A. Froot Tarn Ramadorai Working Paper 9080 http://www.nber.org/papers/w9080 NATIONAL BUREAU
More informationCrash Modelling, Value at Risk and Optimal Hedging
Crash Modelling, Vale at Risk and Optimal Hedging y Philip Ha (Bankers Trst and Imperial College, London) Pal Wilmott (Oxford University and Imperial College, London) First draft: Jly 1996 ddresses for
More informationOptimal Security Liquidation Algorithms
Optimal Security Liquidation Algorithms Sergiy Butenko Department of Industrial Engineering, Texas A&M University, College Station, TX 77843-3131, USA Alexander Golodnikov Glushkov Institute of Cybernetics,
More information6.231 DYNAMIC PROGRAMMING LECTURE 3 LECTURE OUTLINE
6.21 DYNAMIC PROGRAMMING LECTURE LECTURE OUTLINE Deterministic finite-state DP problems Backward shortest path algorithm Forward shortest path algorithm Shortest path examples Alternative shortest path
More information6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2
6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2 Daron Acemoglu and Asu Ozdaglar MIT October 14, 2009 1 Introduction Outline Review Examples of Pure Strategy Nash Equilibria Mixed Strategies
More informationChapter 15: Dynamic Programming
Chapter 15: Dynamic Programming Dynamic programming is a general approach to making a sequence of interrelated decisions in an optimum way. While we can describe the general characteristics, the details
More informationRemarks: 1. Often we shall be sloppy about specifying the ltration. In all of our examples there will be a Brownian motion around and it will be impli
6 Martingales in continuous time Just as in discrete time, the notion of a martingale will play a key r^ole in our continuous time models. Recall that in discrete time, a sequence ; 1 ;::: ; n for which
More informationBargaining in Monetary Economies.
Bargaining in Monetary Economies. Gillame Rochetea Christopher Waller Jly 24 Abstract Search models of monetary exchange have typically relied on Nash (195) bargaining, or eqivalent strategic soltions,
More informationThe Dynamic Power Law Model
The Dynamic Power Law Model Bryan Kelly Extremely Preliminary and Incomplete Abstract I propose a new measre of common, time varying tail risk in large cross sections. It is motivated by asset pricing
More informationCollateral and Debt Capacity in the Optimal Capital Structure
IRES2011-010 IRES Working Paper Series Collateral and Debt Capacity in the Optimal Capital Strctre Erasmo Giambona Antonio Mello Timothy Riddiogh May 2011 Collateral and Debt Capacity in the Optimal Capital
More informationClemson University PCard Training Course. January 2018
Clemson University PCard Training Corse Janary 2018 Introdction The Prchasing Card is a Chip and Pin VISA credit card issed by Bank of America. The Prchasing Card (PCard) is a method of payment offered
More informationREVENUE FROM CONTRACTS WITH CUSTOMERS MANUFACTURING INDUSTRY
INSIGHTS FROM THE BDO MANUFACTURING PRACTICE REVENUE FROM CONTRACTS WITH CUSTOMERS MANUFACTURING INDUSTRY OVERVIEW Companies have started gearing p to implement Acconting Standards Codification (ASC) Topic
More informationCS134: Networks Spring Random Variables and Independence. 1.2 Probability Distribution Function (PDF) Number of heads Probability 2 0.
CS134: Networks Spring 2017 Prof. Yaron Singer Section 0 1 Probability 1.1 Random Variables and Independence A real-valued random variable is a variable that can take each of a set of possible values in
More informationMarch 1, 2017 Due Tue, Mar. 7 by 5 pm.
CHMY 564-17 March 1, 017 De Te, Mar. 7 by 5 pm. Homework # 4 SOLUTIONS 1. (a) How many nodal srfaces of each type will an 11k orbital with m 3 have. Skippin e and j, k means l 7, bt l 8 was intended. Ths,
More informationIN THIS paper, we address the problem of spectrum sharing
622 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 25, NO. 3, APRIL 2007 Pblic Safety and Commercial Spectrm Sharing via Network Pricing and Admission Control Qi Wang and Timothy X Brown Abstract
More informationIS TAX SHARING OPTIMAL? AN ANALYSIS IN A PRINCIPAL-AGENT FRAMEWORK
IS TAX SHARING OPTIMAL? AN ANALYSIS IN A PRINCIPAL-AGENT FRAMEWORK BARNALI GUPTA AND CHRISTELLE VIAUROUX ABSTRACT. We study the effects of a statutory wage tax sharing rule in a principal - agent framework
More informationWindow Width Selection for L 2 Adjusted Quantile Regression
Window Width Selection for L 2 Adjusted Quantile Regression Yoonsuh Jung, The Ohio State University Steven N. MacEachern, The Ohio State University Yoonkyung Lee, The Ohio State University Technical Report
More informationExact Simulation of Stochastic Volatility and. other Affine Jump Diffusion Processes
Exact Simlation of Stochastic Volatility and other Affine Jmp Diffsion Processes Mark Broadie Colmbia University, Gradate School of Bsiness, 415 Uris Hall, 322 Broadway, New York, NY, 127-692, mnb2@colmbia.ed
More informationSuppose a cereal manufacturer puts pictures of famous athletes on cards in boxes of
CHAPTER 17 Probability Models Sppose a cereal manfactrer pts pictres of famos athletes on cards in boxes of cereal, in the hope of increasing sales. The manfactrer annonces that 20% of the boxes contain
More informationWorking Paper Series. Government guarantees and financial stability. No 2032 / February 2017
Working Paper Series Franklin Allen, Elena Carletti, Itay Goldstein, Agnese Leonello Government garantees and financial stability No 2032 / Febrary 2017 Disclaimer: This paper shold not be reported as
More informationEconomic optimization in Model Predictive Control
Economic optimization in Model Predictive Control Rishi Amrit Department of Chemical and Biological Engineering University of Wisconsin-Madison 29 th February, 2008 Rishi Amrit (UW-Madison) Economic Optimization
More informationarxiv: v1 [q-fin.rm] 1 Jan 2017
Net Stable Funding Ratio: Impact on Funding Value Adjustment Medya Siadat 1 and Ola Hammarlid 2 arxiv:1701.00540v1 [q-fin.rm] 1 Jan 2017 1 SEB, Stockholm, Sweden medya.siadat@seb.se 2 Swedbank, Stockholm,
More informationTopic 6 Borrowing products
Topic 6 Borrowing prodcts Learning otcomes After stdying this topic, stdents will be able to: otline the key featres of the financial services prodcts for borrowing; identify the key featres of the costs
More informationOutline. 1 Introduction. 2 Algorithms. 3 Examples. Algorithm 1 General coordinate minimization framework. 1: Choose x 0 R n and set k 0.
Outline Coordinate Minimization Daniel P. Robinson Department of Applied Mathematics and Statistics Johns Hopkins University November 27, 208 Introduction 2 Algorithms Cyclic order with exact minimization
More information6.231 DYNAMIC PROGRAMMING LECTURE 10 LECTURE OUTLINE
6.231 DYNAMIC PROGRAMMING LECTURE 10 LECTURE OUTLINE Rollout algorithms Cost improvement property Discrete deterministic problems Approximations of rollout algorithms Discretization of continuous time
More informationConstructing Markov models for barrier options
Constructing Markov models for barrier options Gerard Brunick joint work with Steven Shreve Department of Mathematics University of Texas at Austin Nov. 14 th, 2009 3 rd Western Conference on Mathematical
More informationGPS integer ambiguity resolution by various decorrelation methods
Lo/Grafarend GPS integer ambigity resoltion by varios decorrelation methods Fachbeiträge GPS integer ambigity resoltion by varios decorrelation methods Lizhi Lo and Erik W Grafarend Smmary In order to
More informationNBER WORKING PAPER SERIES HORSES AND RABBITS? OPTIMAL DYNAMIC CAPITAL STRUCTURE FROM SHAREHOLDER AND MANAGER PERSPECTIVES
NBER WORKING PAPER ERIE HORE AND RABBI? OPIMAL DYNAMIC CAPIAL RUCURE FROM HAREHOLDER AND MANAGER PERPECIVE Nengji J Robert Parrino Allen M. Poteshman Michael. Weisbach Working Paper 937 http://www.nber.org/papers/w937
More informationHandout 8: Introduction to Stochastic Dynamic Programming. 2 Examples of Stochastic Dynamic Programming Problems
SEEM 3470: Dynamic Optimization and Applications 2013 14 Second Term Handout 8: Introduction to Stochastic Dynamic Programming Instructor: Shiqian Ma March 10, 2014 Suggested Reading: Chapter 1 of Bertsekas,
More informationThe Duration Derby: A Comparison of Duration Based Strategies in Asset Liability Management
The Duration Derby: A Comparison of Duration Based Strategies in Asset Liability Management H. Zheng Department of Mathematics, Imperial College London SW7 2BZ, UK h.zheng@ic.ac.uk L. C. Thomas School
More informationTwo Equivalent Conditions
Two Equivalent Conditions The traditional theory of present value puts forward two equivalent conditions for asset-market equilibrium: Rate of Return The expected rate of return on an asset equals the
More informationFiscal policy: Ricardian Equivalence, the e ects of government spending, and debt dynamics
Roberto Perotti November 20, 2013 Version 02 Fiscal policy: Ricardian Equivalence, the e ects of government spending, and debt dynamics 1 The intertemporal government budget constraint Consider the usual
More informationParticipating in Electricity Markets: The Generator s Perspective. D. Kirschen
Participating in Electricity Markets: The Generator s Perspective D. Kirschen 2006 1 Market Strctre Monopoly Oligopoly Perfect Competition Monopoly: Monopolist sets the price at will Mst be reglated Perfect
More informationStochastic Optimal Control
Stochastic Optimal Control Lecturer: Eilyan Bitar, Cornell ECE Scribe: Kevin Kircher, Cornell MAE These notes summarize some of the material from ECE 5555 (Stochastic Systems) at Cornell in the fall of
More informationHedging Derivative Securities with VIX Derivatives: A Discrete-Time -Arbitrage Approach
Hedging Derivative Securities with VIX Derivatives: A Discrete-Time -Arbitrage Approach Nelson Kian Leong Yap a, Kian Guan Lim b, Yibao Zhao c,* a Department of Mathematics, National University of Singapore
More informationConsistent Staffing for Long-Term Care Through On-Call Pools
Consistent Staffing for Long-Term Care Throgh On-Call Pools Athors names blinded for peer review Nrsing homes managers are increasingly striving to ensre consistency of care, defined as minimizing the
More informationOn Solving Integral Equations using. Markov Chain Monte Carlo Methods
On Solving Integral quations using Markov Chain Monte Carlo Methods Arnaud Doucet Department of Statistics and Department of Computer Science, University of British Columbia, Vancouver, BC, Canada mail:
More informationHomework #4. CMSC351 - Spring 2013 PRINT Name : Due: Thu Apr 16 th at the start of class
Homework #4 CMSC351 - Spring 2013 PRINT Name : Due: Thu Apr 16 th at the start of class o Grades depend on neatness and clarity. o Write your answers with enough detail about your approach and concepts
More informationThursday, March 3
5.53 Thursday, March 3 -person -sum (or constant sum) game theory -dimensional multi-dimensional Comments on first midterm: practice test will be on line coverage: every lecture prior to game theory quiz
More informationECON Answers Homework #3
ECON 331 - Answers Homework #3 Exercise 1: (a) First, I calculate the derivative of y with respect to t. Then, to get the growth rate, I calculate the ratio of this derive and the function: (b) dy dt =
More information1 The multi period model
The mlti perio moel. The moel setp In the mlti perio moel time rns in iscrete steps from t = to t = T, where T is a fixe time horizon. As before we will assme that there are two assets on the market, a
More informationOptimization Methods. Lecture 16: Dynamic Programming
15.093 Optimization Methods Lecture 16: Dynamic Programming 1 Outline 1. The knapsack problem Slide 1. The traveling salesman problem 3. The general DP framework 4. Bellman equation 5. Optimal inventory
More informationFig. 1. Markov Chain. Fig. 2. Noise. Fig. 3. Noisy Observations Y (k)
Preface The term hidden Markov model (HMM) is more familiar in the speech signal processing commnity and commnication systems bt recently it is gaining acceptance in finance, economics and management science.
More informationInterpolation. 1 What is interpolation? 2 Why are we interested in this?
Interpolation 1 What is interpolation? For a certain function f (x we know only the values y 1 = f (x 1,,y n = f (x n For a point x different from x 1,,x n we would then like to approximate f ( x using
More informationOptimal prepayment of Dutch mortgages*
137 Statistica Neerlandica (2007) Vol. 61, nr. 1, pp. 137 155 Optimal prepayment of Dutch mortgages* Bart H. M. Kuijpers ABP Investments, P.O. Box 75753, NL-1118 ZX Schiphol, The Netherlands Peter C. Schotman
More informationJournal of Computational and Applied Mathematics. The mean-absolute deviation portfolio selection problem with interval-valued returns
Journal of Computational and Applied Mathematics 235 (2011) 4149 4157 Contents lists available at ScienceDirect Journal of Computational and Applied Mathematics journal homepage: www.elsevier.com/locate/cam
More informationOPTIMAL PORTFOLIO CONTROL WITH TRADING STRATEGIES OF FINITE
Proceedings of the 44th IEEE Conference on Decision and Control, and the European Control Conference 005 Seville, Spain, December 1-15, 005 WeA11.6 OPTIMAL PORTFOLIO CONTROL WITH TRADING STRATEGIES OF
More information