Information Relaxations and Duality in Stochastic Dynamic Programs
|
|
- Ashlynn French
- 5 years ago
- Views:
Transcription
1 Information Relaxations and Duality in Stochastic Dynamic Programs David Brown, Jim Smith, and Peng Sun Fuqua School of Business Duke University February 28 1/39
2 Dynamic programming is widely applicable in principle, but often difficult to apply in practice. Dynamic programming is widely applicable because the world is full of sequential decision problems with uncertainty Dynamic programming is often difficult to apply because of: The curse of dimensionality the state space grows exponentially in the number of variables The challenge of modeling the gradual resolution of uncertainty Monte Carlo simulation (MCS) is easy to apply with large problems: Can simulate to find value with a given policy Hard to find an optimal policy or know how a given policy compares to the optimal policy 2/39
3 Our goal is to develop tools for studying large-scale DPs. Modern approaches to large-scale DPs include: Neurodynamic programming: Use simulation methods and linear approximations of value functions Bertsekas and Tsitsiklis (1996); Powell (27) - General theory Van Roy and Tsitsiklis (21), Longstaff and Schwartz (21) - Option valuation Linear programming techniques with constraint generation: de Farias and B. Van Roy (23, 24,...) - General theory Adelman (forthcoming) - Revenue management 3/39
4 We consider a dual approach based on information relaxations. A flexible framework that allows use of MCS with DPs: Provides upper bounds on DP values Users can control the computational effort vs. accuracy tradeoff Use with lower bounds from approximate policies to bracket optimal solution Inspired by Haugh and Kogan (24) who developed a similar dual approach for option pricing problems; see also Rogers (22) and Anderson and Broadie (24). We generalize to consider: General DPs Arbitrary information relaxations Linear programming duality rather than martingales 4/39
5 The Basic Idea: A dynamic program, not a good structure for MCS: Decision 1 Chance 1 Decision 2 Chance 2 Decision 3 Chance 3 Decision 4 Chance 4 Alt A Low Alt A Low Alt A Low Alt A Low Alt B Nominal Alt B Nominal Alt B Nominal Alt B Nominal Alt C High Alt C High Alt C High Alt C High The DP with additional information and a penalty: Chance 1 Chance 2 Chance 3 Chance 4 Decision 1 Decision 2 Decision 3 Decision 4 Low Low Low Low Alt A Alt A Alt A Alt A Nominal Nominal Nominal Nominal Alt B Alt B Alt B Alt B High High High High Alt C Alt C Alt C Alt C + Penalty 5/39
6 Related Literature: Relaxations of temporal feasibility constraints have been considered in the stochastic programming literature: E.g., Rockafellar and Wets (1991); Shapiro and Ruszczyński (27) Requires concave reward functions and convex action spaces Considers perfect info relaxations and linear penalties Uses sophisticated convex duality arguments Rogers (27) Pathwise Stochastic Optimal Control Perfect information relaxation Restricted forms for penalties Our results are simpler and more general General rewards and penalties Simple direct proofs 6/39
7 Outline: Introduction Basic setup: The primal DP Duality results: Penalties and relaxations Linear programming duality results Good penalties Example: Inventory control Example: Option pricing with stochastic volatility Example: Sequential exploration Conclusions 7/39
8 Basic Setup Discrete time; finite horizon: t =,...,T Uncertainty is described by a probability space (Ω, F, P) DM s information described by a filtration F = {Ft} where Ft describes the DM s information at the beginning of period t No forgetting: F F1... FT F DM will choose an action at in period t from At; A A AT = the set of all possible action vectors a Rewards {rt(a, ω)} depend on the actions a A and state ω Ω; Total reward: r(a, ω) = T t= r t(a, ω) (r(a) is a random variable.) 8/39
9 The Primal DP A policy α in A selects an action vector a A for each state ω Ω. In the primal DP, policies must be temporally feasible or F-adapted: Choice of action in period t can only depend on what is known then Let AF Adenote the set of F-adapted policies TheprimalDP: sup α AF E [r(α)] ( = sup α AF E [r(α( ω), ω)] ) If we allow mixed policies, the primal DP is a linear program with mixing probabilities as decision variables. 9/39
10 The Dual Approach: Information Relaxations AfiltrationG = {Gt} is a relaxation of filtration F = {Ft} if, for each t, Ft Gt F. The DM knows more in every period under G than under F. Perfect information relaxation: I = {It} where It = F Let AG = the set of G-adapted policies. For any relaxation G of F, AF AG AI = A As we relax the filtration, we expand the set of feasible policies. 1 / 39
11 The Dual Approach: Penalties Penalties z(a, ω), like rewards, depend on the choice of actions a and the state ω Penalties will be used to punish policies that use the additional information in G. Penalized objective function: E [r(α G ) z(α G )] Apenaltyz is dual feasible if E [z(α F )] for all α F in AF. A dual feasible penalty assigns zero (or negative) expected penalty to any temporally feasible policy Let ZF = set of all dual feasible penalties 11 / 39
12 The Dual Approach: Weak Duality Our main result can be viewed as a version of the weak duality lemma of linear programming. Lemma. If α F and z are primal and dual feasible respectively and G is a relaxation of F, then E [r(α F )] sup αg AG E [r(α G ) z(α G )]. Proof: E [r(α F )] E [r(α F ) z(α F )] (E [z(α F )] because z ZF and α F AF) sup αg AG E [r(α G ) z(α G )] (because α F AF and AF AG) 12 / 39
13 The Dual Approach: Weak Duality With the perfect information relaxation, weak duality implies that for any α F in AF and z ZF, E [r(α F )] E [ sup a A {r(a) z(a)} ]. This form is convenient for Monte Carlo simulation: In each trial, Randomly generate state ω Solve a deterministic inner problem of choosing a vector of actions a to maximize r(a, ω) z(a, ω) With imperfect information relaxations, we can often use MCS but the inner problems may be stochastic 13 / 39
14 The Dual Approach: Strong Duality In principle, we can find the optimal value by choosing the right penalty. Theorem. Let G be a relaxation of F. Then { sup αf AF E [r(α F )] = inf z ZF sup αg AG E [r(α G ) z(α G )] }. If the primal problem on the left is bounded, the dual problem on the right has an optimal solution z ZF that achieves this bound. Proof :Let z (a) =r(a) v where v = sup αf AF Then z is dual feasible and r(a) z (a) =v. E [r(α F )]. 14 / 39
15 The Dual Approach: Complementary Slackness The complementary slackness condition further characterizes optimal policies and penalties. Theorem. Let α F and z be feasible solutions for the primal and dual problems respectively with information relaxation G. A necessary and sufficient condition for these to be optimal solutions for their respective problems is: E [z (α F)] = and E [r(α F) z (α F)] = sup αg AG {E [r(α G ) z (α G )]} With an optimal penalty z, α F has zero expected penalty and solves the dual problem. 15 / 39
16 The Dual Approach: Structured Policies If the optimal policies in the primal problem have some particular structure (thresholds, Markovian, stationary,...), we can focus on policies with this structure in the dual. Proposition. Let S Aand suppose α F AF S is a primal optimal solution. Then weak and strong duality hold within this restricted policy set S. Moreover, for a fixed penalty z in Z S F = {z Z: E [z(α F)] α F AF S} the bound with the policy restriction to S is weakly tighter than the corresponding bound without the restriction. Structural constraints on policies lead to tighter bounds and more efficient computation. 16 / 39
17 How to Create Good Penalties: Let G be a relaxation of F; let{wt(a, ω)} be a sequence of generating functions where each wt depends only on the first t +1actions (a,..., at) of a. Define zt(a) =E [wt(a) Gt] E [wt(a) Ft] and z(a) = T t= z t(a). Proposition. For these penalties z: (i) E [z(α F )] = for all α F in AF (Dual feasible; potentially optimal); (ii) {zt(a)} is adapted to G and zt depends only on the first t +1 actions (a,..., at) of a (Leads to DP structure on G); and (iii) z is orthogonal to F in that E [zt(a) H] =whenever H is in Ft. (Depends only on the additional information in G) 17 / 39
18 Examples of Good Penalties: Define the DP value function: Let A t F (a) be the α in A F with the first t actions (a,...,at 1) constrained to match the first t actions in a Define Vt(a) = sup E [r(α) Ft] α A t F (a) An optimal penalty: Take z t (a) =E [Vt+1(a) Gt] E [Vt+1(a) Ft] z t (a) cancels the benefit of the knowledge in G in each period Alternatively, replace V with: An approximate value function (Haugh and Kogan) A value function based on a given policy (Anderson and Broadie) Some simple approximation 18 / 39
19 Properties of Penalties: Penalties that are close to each other (or close to optimal) give bounds that are close to each other (or close to optimal): sup α G AG E [r(α G ) z1(α G )] sup α G AG + sup α G AG E [r(α G ) z2(α G )] E [z2(αg) z1(αg)] In practice, there will typically be a tradeoff between: The accuracy of the bound and The computational effort required to compute it. We can control this tradeoff through our choice of information relaxation G and penalty z. 19 / 39
20 Outline: Introduction Basic setup: The primal DP Duality results: Penalties and relaxations Linear programming duality results Good penalties Example: Inventory control Example: Option pricing with stochastic volatility Example: Sequential exploration Conclusions 2 / 39
21 Example: Standard Inventory Control Model Discrete time: t =,...,T Order at units of good in period t; at Demand in period t, dt, is uncertain; revealed after ordering Inventory at the end of period t given by xt+1 = xt + at dt (Assumes unmet demand is backordered.) Costs: t= (r t(a, d) zt(a, rt(a, d) =ct(at)+ft(xt+1) Primal DP: inf E t= r t(α F, d) [ T ] αf AF [ { T d))} ] Perfect Information Dual DP: E inf a A 21 / 39
22 Example: Inventory Control Penalties Perfect information bound: zt = Inner problem is a deterministic dynamic lot sizing problem: inf a A T t= (ct(at)+ft(xt+1)) Easy to solve, esp. with linear ordering, holding and shortage costs Smooth bound: zt(a, d) =rt(a, d) E [rt(a, d) Ft] Inner problem as before but with a smoothed cost function: inf a A T t= (ct(at)+e [ft(xt+1) Ft]) With linear costs, if we drop at constraints, we get a weaker bound that decomposes into a series of simple newsvendor problems. 22 / 39
23 Example: Inventory Control Numerical Example; Assumptions From Zipkin (2) Linear costs; 5 Periods Detailed Assumptions: Nonstationary Demand Stationary Demand and and Stationary Costs Nonstationary Costs Period t Ordering costs kt Holding costs ht Backorder costs pt Mean demand E [dt] We consider Poisson and geometric demand distributions 23 / 39
24 Example: Inventory Control Numerical Example; Results Poisson Demand Distributions Nonstationary Demand Stationary Demand and and Stationary Costs Nonstationary Costs Cost Std. Err. % of Opt. Cost Std. Err. % of Opt. Optimal value Smooth bound % % Newsvendor bound % % Perfect info. bound % % Geometric Demand Distributions Nonstationary Demand Stationary Demand and and Stationary Costs Nonstationary Costs Cost Std. Err. % of Opt. Cost Std. Err. % of Opt. Optimal value Smooth bound % % Newsvendor bound % % Perfect info. bound % % 24 / 39
25 Example: Inventory Control Dual Policies (Poisson) Y-axis = Period-t order quantity, X-axis = Period-t inventory level; Red line = Optimal Policy Period Period1 Period2 Period3 Period Period Period1 Period2 Period3 Period Nonstationary Demand Stationary Demand and and Stationary Costs Nonstationary Costs Period t Ordering costs kt Holding costs ht Backorder costs pt Mean demand E [dt] / 39
26 Example: Inventory Control Dual Policies (Geometric) Y-axis = Period-t order quantity, X-axis = Period-t inventory level; Red line = Optimal Policy Period Period1 Period2 Period3 Period Period Period1 Period2 Period3 Period Nonstationary Demand Stationary Demand and and Stationary Costs Nonstationary Costs Period t Ordering costs kt Holding costs ht Backorder costs pt Mean demand E [dt] / 39
27 Example: Inventory Control World Model ; Assumptions Suppose ordering costs and mean demand evolve stochastically, but are observed before ordering: Mean Demands E[dt] Ordering Costs kt With a perfect info relaxation, the perfect information, smooth bounds, and newsvendor bounds carry over unchanged We also consider an imperfect info relaxation where G provides info about the state but not demand; no penalty. Inner problem a simpler stochastic DP. 27 / 39
28 Example: Inventory Control World Model ; Results Poisson Demands Geometric Demands Cost Std. Err. % of Opt. Cost Std. Err. % of Opt. Optimal value Imperfect info. bound % % Smooth bound % % Newsvendor bound % % Perfect info. bound % % Bounds also apply to models where the world is not observed. Partially observed MDP model is difficult to model Dual bounds are no more complicated 28 / 39
29 Example: Valuing an Option with Stochastic Volatility We will value put and call options on a dividend-paying stock Heston s (1993) stochastic volatility model, plus dividends: Stock Price St : dst = μstdt + vtstdξt Instant. Var. vt : dvt = κ ( v vt) dt + σ vtdνt Dividends Dti : S ti = S t Dti i where dξt and dνt are increments of a Brownian motion process with correlation ρ (Risk-neutral process) Base filtration F assumes: St and vt are known at time t (Could also take vt to be unknown) Dividend times (t1,...,tm) and amounts Dti are given 29 / 39
30 Example: Valuing an Option with Stochastic Volatility Actions: at = exercise or not; can exercise at most once Call option rewards: { rt(at,st,vt) = e rt (St K) if at = exercise otherwise where K is the strike price and r the risk-free interest rate. (Rewards for a put option are the negative of this.) Primal DP: Fix a finite set of possible exercise dates {t,t1,...,tt }: sup αf AF E [ T i= rti (α F,Sti,v ti ) ] 3 / 39
31 Example: Valuing an Option with Stochastic Volatility We consider a relaxation G where all volatilities are known in advance. Solved by simulation: Generate all volatilities v = {vt} T t= Given v, future stock prices are log-normally distributed with E [St2 St1, v] =St1e μ(t 2 t1) exp (ρ t 2 t=t1 v tdνt ρ2 2 t 2 t=t1 vtdt ). Solve inner problem: One-dimensional option valuation problem with time-varying volatility; we used a trinomial lattice. Structural restriction for call options: Under F, early exercise occurs only before dividends. Not true under G, but we can require this in the dual. 31 / 39
32 Example: Valuing an Option with Stochastic Volatility Penalties Generating function for period i (time t1 to t2) { if a i = exercise wi(a) = ( ) Δ(St1,t1) e r(t 2 t1) S t2 if ai = do not exercise Δ(St,t) is the delta for an option with constant volatility ( v) Easy to compute in the trinomial lattice; calculate once. When ai = exercise, zi(a) =;Whenai = do not exercise, zi(a) = E [wi(a) Gt i ] E [wi(a) Ft i ] ( ) = Δ(St1,t1) e r(t 2 t1) E [S t2 St1, v] St1 Approximately cancels the effect of knowing future volatilities on the stock price expectations 32 / 39
33 Example: Valuing an Option with Stochastic Volatility Results Assumptions: Put and call options, expire in 1 year S = K = $1; κ =.2; σ = 1%; r = μ =%; v = v = 25% One $2.5 dividend after 6 months sometimes optimal to exercise Lower bounds found by simulating with a fixed exercise threshold ρ = ρ =.25 ρ =.5 Mean Std Error Mean Std Error Mean Std Error Upper Bounds Call With No Penalty Options With Penalty Lower Bound Upper Bounds Put With No Penalty Options With Penalty Lower Bound Stochastic volatility affects option values, but you don t need to solve a two-dimensional DP to pin down the value. 33 / 39
34 Example: Valuing an Option with Stochastic Volatility Dual Policies Call option with ρ =.5; Actions chosen just before dividend in dual problem, by scenario: 35% With No Penalty 35% With Penalty 33% 33% 31% 31% 29% 29% 27% 25% 23% 21% 27% 25% 23% 21% Volatility at T =.5 Volatility at T =.5 19% 17% Threshold from Fixed Volatility Model 19% 17% Threshold from Fixed Volatility Model 15% 15% Stock Price at T =.5 Stock Price at T =.5 Pink = Exercise; Blue = Wait 34 / 39
35 Example: Sequential Exploration Given n (dependent) wells, what is the optimal drilling strategy? Drill 1 Drill 1 Drill 2 Drill 3 Drill 4 Drill 5 Quit Well 2 Rock Well 2 Charge Well 2 No Rock Well 2 Rock Well 2 No Charge Well 2 No Rock Well 5 Rock Well 5 Charge Well 5 No Rock Well 5 Rock Well 5 No Charge Well 5 No Rock Drill 3 Drill 4 Drill 5 Quit Well 5 Rock Well 5 Charge Well 5 No Rock Well 5 Rock Well 5 No Charge Well 5 No Rock Wells must succeed on each of k underlying factors to be wet From Bickel and Smith (26); Bickel, Smith and Meyer (27) 35 / 39
36 Example: Sequential Exploration The primal problem gets complex quickly. Size of Primal Problem Size of Inner Factors (k) Dual Problem Wells (Any k) (n) Actions States Actions States Actions States Actions States Actions States ,78 4, , ,477 6,561 13,173 83, ,25 3,125 91,854 59,49 1,837, E , ,375 15, E+5 531, E E ,29 2, ,5 78, E E E+8 4.1E ,57 6, E+6 39, E+7 4.3E+7 1.3E E+9 1, ,732 19, E E E E E E+11 2, ,879 59, E E E E+9 3.2E E+12 6,144 1,24 Perfect information dual: Randomly generate well results, then choose drilling order. Inner problem grows exponentially with the number of wells n (albeit slowly), but is independent of the number of factors k. 36 / 39
37 Example: Sequential Exploration Relaxations and Penalties We consider a numerical example with 5 wells and 3 factors 59,49 states; 91,854 actions Numeric assumptions from Bickel, Smith and Meyer (27) We consider information relaxations that provide perfect information in periods, 1,...5. Penalties: No Penalty: zt(a) = Myopic Penalty: zt(a) =rt(a) E [rt(a) Ft] [ Binary VF Penalty: zt(a) = ˆVt(a) E ˆVt(a) Ft ] where ˆVt is the value function for a binary version of the problem. Lower bound from using optimal policy from binary version of model. 37 / 39
38 Example: Sequential Exploration Results The bounds get tighter as we delay the provision of perfect information and treat more periods exactly. Some penalty is required to get reasonably tight bounds. Filtration Gi Upper Bounds: No Penalty With Myopic Penalty With Binary VF Penalty Exact Value Lower Bound with Binary Policy / 39
39 Summary and Conclusions Our dual approach is part of an iterative strategy for studying DPs: Lower bounds from MCS with a given policy Upper bounds from information relaxations and penalties If bounds are too broad, refine. Can control effort and avoid: Exactsolutionoflarge-scaleDPs Detailed modeling of uncertainty resolution (e.g., when is the state of the world known? is volatility observed?) Future research: Infinite horizon and/or continuous time Automatic generation of upper and lower bounds Careful analysis of the complexity vs. accuracy tradeoff 39 / 39
A Robust Option Pricing Problem
IMA 2003 Workshop, March 12-19, 2003 A Robust Option Pricing Problem Laurent El Ghaoui Department of EECS, UC Berkeley 3 Robust optimization standard form: min x sup u U f 0 (x, u) : u U, f i (x, u) 0,
More informationThe value of foresight
Philip Ernst Department of Statistics, Rice University Support from NSF-DMS-1811936 (co-pi F. Viens) and ONR-N00014-18-1-2192 gratefully acknowledged. IMA Financial and Economic Applications June 11, 2018
More informationMartingale Pricing Theory in Discrete-Time and Discrete-Space Models
IEOR E4707: Foundations of Financial Engineering c 206 by Martin Haugh Martingale Pricing Theory in Discrete-Time and Discrete-Space Models These notes develop the theory of martingale pricing in a discrete-time,
More informationApproximate Dynamic Programming for the Merchant Operations of Commodity and Energy Conversion Assets
Approximate Dynamic Programming for the Merchant Operations of Commodity and Energy Conversion Assets Selvaprabu (Selva) Nadarajah, (Joint work with François Margot and Nicola Secomandi) Tepper School
More informationEARLY EXERCISE OPTIONS: UPPER BOUNDS
EARLY EXERCISE OPTIONS: UPPER BOUNDS LEIF B.G. ANDERSEN AND MARK BROADIE Abstract. In this article, we discuss how to generate upper bounds for American or Bermudan securities by Monte Carlo methods. These
More informationImproved Lower and Upper Bound Algorithms for Pricing American Options by Simulation
Improved Lower and Upper Bound Algorithms for Pricing American Options by Simulation Mark Broadie and Menghui Cao December 2007 Abstract This paper introduces new variance reduction techniques and computational
More informationOnline Appendix: Extensions
B Online Appendix: Extensions In this online appendix we demonstrate that many important variations of the exact cost-basis LUL framework remain tractable. In particular, dual problem instances corresponding
More informationOptimal stopping problems for a Brownian motion with a disorder on a finite interval
Optimal stopping problems for a Brownian motion with a disorder on a finite interval A. N. Shiryaev M. V. Zhitlukhin arxiv:1212.379v1 [math.st] 15 Dec 212 December 18, 212 Abstract We consider optimal
More information17 MAKING COMPLEX DECISIONS
267 17 MAKING COMPLEX DECISIONS The agent s utility now depends on a sequence of decisions In the following 4 3grid environment the agent makes a decision to move (U, R, D, L) at each time step When the
More informationMONTE CARLO METHODS FOR AMERICAN OPTIONS. Russel E. Caflisch Suneal Chaudhary
Proceedings of the 2004 Winter Simulation Conference R. G. Ingalls, M. D. Rossetti, J. S. Smith, and B. A. Peters, eds. MONTE CARLO METHODS FOR AMERICAN OPTIONS Russel E. Caflisch Suneal Chaudhary Mathematics
More informationMonte Carlo Methods in Structuring and Derivatives Pricing
Monte Carlo Methods in Structuring and Derivatives Pricing Prof. Manuela Pedio (guest) 20263 Advanced Tools for Risk Management and Pricing Spring 2017 Outline and objectives The basic Monte Carlo algorithm
More informationAMH4 - ADVANCED OPTION PRICING. Contents
AMH4 - ADVANCED OPTION PRICING ANDREW TULLOCH Contents 1. Theory of Option Pricing 2 2. Black-Scholes PDE Method 4 3. Martingale method 4 4. Monte Carlo methods 5 4.1. Method of antithetic variances 5
More informationStochastic Processes and Stochastic Calculus - 9 Complete and Incomplete Market Models
Stochastic Processes and Stochastic Calculus - 9 Complete and Incomplete Market Models Eni Musta Università degli studi di Pisa San Miniato - 16 September 2016 Overview 1 Self-financing portfolio 2 Complete
More informationThe Irrevocable Multi-Armed Bandit Problem
The Irrevocable Multi-Armed Bandit Problem Ritesh Madan Qualcomm-Flarion Technologies May 27, 2009 Joint work with Vivek Farias (MIT) 2 Multi-Armed Bandit Problem n arms, where each arm i is a Markov Decision
More informationHedging under Arbitrage
Hedging under Arbitrage Johannes Ruf Columbia University, Department of Statistics Modeling and Managing Financial Risks January 12, 2011 Motivation Given: a frictionless market of stocks with continuous
More informationAsset-Liability Management
Asset-Liability Management John Birge University of Chicago Booth School of Business JRBirge INFORMS San Francisco, Nov. 2014 1 Overview Portfolio optimization involves: Modeling Optimization Estimation
More informationPolicy iterated lower bounds and linear MC upper bounds for Bermudan style derivatives
Finance Winterschool 2007, Lunteren NL Policy iterated lower bounds and linear MC upper bounds for Bermudan style derivatives Pricing complex structured products Mohrenstr 39 10117 Berlin schoenma@wias-berlin.de
More informationEquity correlations implied by index options: estimation and model uncertainty analysis
1/18 : estimation and model analysis, EDHEC Business School (joint work with Rama COT) Modeling and managing financial risks Paris, 10 13 January 2011 2/18 Outline 1 2 of multi-asset models Solution to
More informationSimple Improvement Method for Upper Bound of American Option
Simple Improvement Method for Upper Bound of American Option Koichi Matsumoto (joint work with M. Fujii, K. Tsubota) Faculty of Economics Kyushu University E-mail : k-matsu@en.kyushu-u.ac.jp 6th World
More informationEE266 Homework 5 Solutions
EE, Spring 15-1 Professor S. Lall EE Homework 5 Solutions 1. A refined inventory model. In this problem we consider an inventory model that is more refined than the one you ve seen in the lectures. The
More informationMYOPIC INVENTORY POLICIES USING INDIVIDUAL CUSTOMER ARRIVAL INFORMATION
Working Paper WP no 719 November, 2007 MYOPIC INVENTORY POLICIES USING INDIVIDUAL CUSTOMER ARRIVAL INFORMATION Víctor Martínez de Albéniz 1 Alejandro Lago 1 1 Professor, Operations Management and Technology,
More information16 MAKING SIMPLE DECISIONS
247 16 MAKING SIMPLE DECISIONS Let us associate each state S with a numeric utility U(S), which expresses the desirability of the state A nondeterministic action A will have possible outcome states Result
More informationModern Methods of Option Pricing
Modern Methods of Option Pricing Denis Belomestny Weierstraß Institute Berlin Motzen, 14 June 2007 Denis Belomestny (WIAS) Modern Methods of Option Pricing Motzen, 14 June 2007 1 / 30 Overview 1 Introduction
More informationBasic Arbitrage Theory KTH Tomas Björk
Basic Arbitrage Theory KTH 2010 Tomas Björk Tomas Björk, 2010 Contents 1. Mathematics recap. (Ch 10-12) 2. Recap of the martingale approach. (Ch 10-12) 3. Change of numeraire. (Ch 26) Björk,T. Arbitrage
More informationChapter 15: Jump Processes and Incomplete Markets. 1 Jumps as One Explanation of Incomplete Markets
Chapter 5: Jump Processes and Incomplete Markets Jumps as One Explanation of Incomplete Markets It is easy to argue that Brownian motion paths cannot model actual stock price movements properly in reality,
More informationLecture 7: Bayesian approach to MAB - Gittins index
Advanced Topics in Machine Learning and Algorithmic Game Theory Lecture 7: Bayesian approach to MAB - Gittins index Lecturer: Yishay Mansour Scribe: Mariano Schain 7.1 Introduction In the Bayesian approach
More informationEconomics 2010c: Lecture 4 Precautionary Savings and Liquidity Constraints
Economics 2010c: Lecture 4 Precautionary Savings and Liquidity Constraints David Laibson 9/11/2014 Outline: 1. Precautionary savings motives 2. Liquidity constraints 3. Application: Numerical solution
More informationMATH3075/3975 FINANCIAL MATHEMATICS TUTORIAL PROBLEMS
MATH307/37 FINANCIAL MATHEMATICS TUTORIAL PROBLEMS School of Mathematics and Statistics Semester, 04 Tutorial problems should be used to test your mathematical skills and understanding of the lecture material.
More informationLast Time. Martingale inequalities Martingale convergence theorem Uniformly integrable martingales. Today s lecture: Sections 4.4.1, 5.
MATH136/STAT219 Lecture 21, November 12, 2008 p. 1/11 Last Time Martingale inequalities Martingale convergence theorem Uniformly integrable martingales Today s lecture: Sections 4.4.1, 5.3 MATH136/STAT219
More informationLECTURE 4: BID AND ASK HEDGING
LECTURE 4: BID AND ASK HEDGING 1. Introduction One of the consequences of incompleteness is that the price of derivatives is no longer unique. Various strategies for dealing with this exist, but a useful
More informationMONTE CARLO BOUNDS FOR CALLABLE PRODUCTS WITH NON-ANALYTIC BREAK COSTS
MONTE CARLO BOUNDS FOR CALLABLE PRODUCTS WITH NON-ANALYTIC BREAK COSTS MARK S. JOSHI Abstract. The pricing of callable derivative products with complicated pay-offs is studied. A new method for finding
More informationIEOR E4703: Monte-Carlo Simulation
IEOR E4703: Monte-Carlo Simulation Other Miscellaneous Topics and Applications of Monte-Carlo Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com
More informationMartingale Pricing Applied to Dynamic Portfolio Optimization and Real Options
IEOR E476: Financial Engineering: Discrete-Time Asset Pricing c 21 by Martin Haugh Martingale Pricing Applied to Dynamic Portfolio Optimization and Real Options We consider some further applications of
More informationRisk Neutral Valuation
copyright 2012 Christian Fries 1 / 51 Risk Neutral Valuation Christian Fries Version 2.2 http://www.christian-fries.de/finmath April 19-20, 2012 copyright 2012 Christian Fries 2 / 51 Outline Notation Differential
More informationChanges of the filtration and the default event risk premium
Changes of the filtration and the default event risk premium Department of Banking and Finance University of Zurich April 22 2013 Math Finance Colloquium USC Change of the probability measure Change of
More information,,, be any other strategy for selling items. It yields no more revenue than, based on the
ONLINE SUPPLEMENT Appendix 1: Proofs for all Propositions and Corollaries Proof of Proposition 1 Proposition 1: For all 1,2,,, if, is a non-increasing function with respect to (henceforth referred to as
More information16 MAKING SIMPLE DECISIONS
253 16 MAKING SIMPLE DECISIONS Let us associate each state S with a numeric utility U(S), which expresses the desirability of the state A nondeterministic action a will have possible outcome states Result(a)
More informationOptimization Models in Financial Mathematics
Optimization Models in Financial Mathematics John R. Birge Northwestern University www.iems.northwestern.edu/~jrbirge Illinois Section MAA, April 3, 2004 1 Introduction Trends in financial mathematics
More informationRobust Hedging of Options on a Leveraged Exchange Traded Fund
Robust Hedging of Options on a Leveraged Exchange Traded Fund Alexander M. G. Cox Sam M. Kinsley University of Bath Recent Advances in Financial Mathematics, Paris, 10th January, 2017 A. M. G. Cox, S.
More informationDuality Theory and Simulation in Financial Engineering
Duality Theory and Simulation in Financial Engineering Martin Haugh Department of IE and OR, Columbia University, New York, NY 10027, martin.haugh@columbia.edu. Abstract This paper presents a brief introduction
More informationMultistage Stochastic Programming
Multistage Stochastic Programming John R. Birge University of Michigan Models - Long and short term - Risk inclusion Approximations - stages and scenarios Computation Slide Number 1 OUTLINE Motivation
More informationSlides for Risk Management
Slides for Risk Management Introduction to the modeling of assets Groll Seminar für Finanzökonometrie Prof. Mittnik, PhD Groll (Seminar für Finanzökonometrie) Slides for Risk Management Prof. Mittnik,
More informationIntroduction to Probability Theory and Stochastic Processes for Finance Lecture Notes
Introduction to Probability Theory and Stochastic Processes for Finance Lecture Notes Fabio Trojani Department of Economics, University of St. Gallen, Switzerland Correspondence address: Fabio Trojani,
More information1 Dynamic programming
1 Dynamic programming A country has just discovered a natural resource which yields an income per period R measured in terms of traded goods. The cost of exploitation is negligible. The government wants
More informationSOLVING ROBUST SUPPLY CHAIN PROBLEMS
SOLVING ROBUST SUPPLY CHAIN PROBLEMS Daniel Bienstock Nuri Sercan Özbay Columbia University, New York November 13, 2005 Project with Lucent Technologies Optimize the inventory buffer levels in a complicated
More informationRelaxations of Approximate Linear Programs for the Real Option Management of Commodity Storage
Relaxations of Approximate Linear Programs for the Real Option Management of Commodity Storage Selvaprabu Nadarajah, François Margot, Nicola Secomandi Tepper School of Business, Carnegie Mellon University,
More informationEE365: Risk Averse Control
EE365: Risk Averse Control Risk averse optimization Exponential risk aversion Risk averse control 1 Outline Risk averse optimization Exponential risk aversion Risk averse control Risk averse optimization
More informationContinuous Time Finance. Tomas Björk
Continuous Time Finance Tomas Björk 1 II Stochastic Calculus Tomas Björk 2 Typical Setup Take as given the market price process, S(t), of some underlying asset. S(t) = price, at t, per unit of underlying
More informationOptimizing S-shaped utility and risk management
Optimizing S-shaped utility and risk management Ineffectiveness of VaR and ES constraints John Armstrong (KCL), Damiano Brigo (Imperial) Quant Summit March 2018 Are ES constraints effective against rogue
More informationPricing in markets modeled by general processes with independent increments
Pricing in markets modeled by general processes with independent increments Tom Hurd Financial Mathematics at McMaster www.phimac.org Thanks to Tahir Choulli and Shui Feng Financial Mathematics Seminar
More informationTangent Lévy Models. Sergey Nadtochiy (joint work with René Carmona) Oxford-Man Institute of Quantitative Finance University of Oxford.
Tangent Lévy Models Sergey Nadtochiy (joint work with René Carmona) Oxford-Man Institute of Quantitative Finance University of Oxford June 24, 2010 6th World Congress of the Bachelier Finance Society Sergey
More informationHedging Under Jump Diffusions with Transaction Costs. Peter Forsyth, Shannon Kennedy, Ken Vetzal University of Waterloo
Hedging Under Jump Diffusions with Transaction Costs Peter Forsyth, Shannon Kennedy, Ken Vetzal University of Waterloo Computational Finance Workshop, Shanghai, July 4, 2008 Overview Overview Single factor
More informationPricing American Options: A Duality Approach
Pricing American Options: A Duality Approach Martin B. Haugh and Leonid Kogan Abstract We develop a new method for pricing American options. The main practical contribution of this paper is a general algorithm
More informationLecture 17: More on Markov Decision Processes. Reinforcement learning
Lecture 17: More on Markov Decision Processes. Reinforcement learning Learning a model: maximum likelihood Learning a value function directly Monte Carlo Temporal-difference (TD) learning COMP-424, Lecture
More informationRobust Portfolio Choice and Indifference Valuation
and Indifference Valuation Mitja Stadje Dep. of Econometrics & Operations Research Tilburg University joint work with Roger Laeven July, 2012 http://alexandria.tue.nl/repository/books/733411.pdf Setting
More informationBasic Concepts in Mathematical Finance
Chapter 1 Basic Concepts in Mathematical Finance In this chapter, we give an overview of basic concepts in mathematical finance theory, and then explain those concepts in very simple cases, namely in the
More informationRobust Portfolio Decisions for Financial Institutions
Robust Portfolio Decisions for Financial Institutions Ioannis Baltas 1,3, Athanasios N. Yannacopoulos 2,3 & Anastasios Xepapadeas 4 1 Department of Financial and Management Engineering University of the
More informationForecast Horizons for Production Planning with Stochastic Demand
Forecast Horizons for Production Planning with Stochastic Demand Alfredo Garcia and Robert L. Smith Department of Industrial and Operations Engineering Universityof Michigan, Ann Arbor MI 48109 December
More informationSolving real-life portfolio problem using stochastic programming and Monte-Carlo techniques
Solving real-life portfolio problem using stochastic programming and Monte-Carlo techniques 1 Introduction Martin Branda 1 Abstract. We deal with real-life portfolio problem with Value at Risk, transaction
More informationThe Pennsylvania State University. The Graduate School. Department of Industrial Engineering AMERICAN-ASIAN OPTION PRICING BASED ON MONTE CARLO
The Pennsylvania State University The Graduate School Department of Industrial Engineering AMERICAN-ASIAN OPTION PRICING BASED ON MONTE CARLO SIMULATION METHOD A Thesis in Industrial Engineering and Operations
More informationOptimal Dam Management
Optimal Dam Management Michel De Lara et Vincent Leclère July 3, 2012 Contents 1 Problem statement 1 1.1 Dam dynamics.................................. 2 1.2 Intertemporal payoff criterion..........................
More informationRisk Neutral Measures
CHPTER 4 Risk Neutral Measures Our aim in this section is to show how risk neutral measures can be used to price derivative securities. The key advantage is that under a risk neutral measure the discounted
More informationPricing Volatility Derivatives with General Risk Functions. Alejandro Balbás University Carlos III of Madrid
Pricing Volatility Derivatives with General Risk Functions Alejandro Balbás University Carlos III of Madrid alejandro.balbas@uc3m.es Content Introduction. Describing volatility derivatives. Pricing and
More informationMulti-period mean variance asset allocation: Is it bad to win the lottery?
Multi-period mean variance asset allocation: Is it bad to win the lottery? Peter Forsyth 1 D.M. Dang 1 1 Cheriton School of Computer Science University of Waterloo Guangzhou, July 28, 2014 1 / 29 The Basic
More informationQuasi-Convex Stochastic Dynamic Programming
Quasi-Convex Stochastic Dynamic Programming John R. Birge University of Chicago Booth School of Business JRBirge SIAM FM12, MSP, 10 July 2012 1 General Theme Many dynamic optimization problems dealing
More informationIEOR E4703: Monte-Carlo Simulation
IEOR E4703: Monte-Carlo Simulation Simulating Stochastic Differential Equations Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com
More informationConditional Certainty Equivalent
Conditional Certainty Equivalent Marco Frittelli and Marco Maggis University of Milan Bachelier Finance Society World Congress, Hilton Hotel, Toronto, June 25, 2010 Marco Maggis (University of Milan) CCE
More informationThe Uncertain Volatility Model
The Uncertain Volatility Model Claude Martini, Antoine Jacquier July 14, 008 1 Black-Scholes and realised volatility What happens when a trader uses the Black-Scholes (BS in the sequel) formula to sell
More informationBandit Problems with Lévy Payoff Processes
Bandit Problems with Lévy Payoff Processes Eilon Solan Tel Aviv University Joint with Asaf Cohen Multi-Arm Bandits A single player sequential decision making problem. Time is continuous or discrete. The
More informationReinforcement Learning. Slides based on those used in Berkeley's AI class taught by Dan Klein
Reinforcement Learning Slides based on those used in Berkeley's AI class taught by Dan Klein Reinforcement Learning Basic idea: Receive feedback in the form of rewards Agent s utility is defined by the
More informationExtraction capacity and the optimal order of extraction. By: Stephen P. Holland
Extraction capacity and the optimal order of extraction By: Stephen P. Holland Holland, Stephen P. (2003) Extraction Capacity and the Optimal Order of Extraction, Journal of Environmental Economics and
More informationIEOR E4602: Quantitative Risk Management
IEOR E4602: Quantitative Risk Management Basic Concepts and Techniques of Risk Management Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com
More informationLecture 2: Making Good Sequences of Decisions Given a Model of World. CS234: RL Emma Brunskill Winter 2018
Lecture 2: Making Good Sequences of Decisions Given a Model of World CS234: RL Emma Brunskill Winter 218 Human in the loop exoskeleton work from Steve Collins lab Class Structure Last Time: Introduction
More informationLecture 8: Introduction to asset pricing
THE UNIVERSITY OF SOUTHAMPTON Paul Klein Office: Murray Building, 3005 Email: p.klein@soton.ac.uk URL: http://paulklein.se Economics 3010 Topics in Macroeconomics 3 Autumn 2010 Lecture 8: Introduction
More informationValuation of derivative assets Lecture 8
Valuation of derivative assets Lecture 8 Magnus Wiktorsson September 27, 2018 Magnus Wiktorsson L8 September 27, 2018 1 / 14 The risk neutral valuation formula Let X be contingent claim with maturity T.
More informationBounding Optimal Expected Revenues for Assortment Optimization under Mixtures of Multinomial Logits
Bounding Optimal Expected Revenues for Assortment Optimization under Mixtures of Multinomial Logits Jacob Feldman School of Operations Research and Information Engineering, Cornell University, Ithaca,
More informationAdvanced Topics in Derivative Pricing Models. Topic 4 - Variance products and volatility derivatives
Advanced Topics in Derivative Pricing Models Topic 4 - Variance products and volatility derivatives 4.1 Volatility trading and replication of variance swaps 4.2 Volatility swaps 4.3 Pricing of discrete
More informationCharacterization of the Optimum
ECO 317 Economics of Uncertainty Fall Term 2009 Notes for lectures 5. Portfolio Allocation with One Riskless, One Risky Asset Characterization of the Optimum Consider a risk-averse, expected-utility-maximizing
More informationIlliquidity, Credit risk and Merton s model
Illiquidity, Credit risk and Merton s model (joint work with J. Dong and L. Korobenko) A. Deniz Sezer University of Calgary April 28, 2016 Merton s model of corporate debt A corporate bond is a contingent
More informationConvergence Analysis of Monte Carlo Calibration of Financial Market Models
Analysis of Monte Carlo Calibration of Financial Market Models Christoph Käbe Universität Trier Workshop on PDE Constrained Optimization of Certain and Uncertain Processes June 03, 2009 Monte Carlo Calibration
More informationApproximate Revenue Maximization with Multiple Items
Approximate Revenue Maximization with Multiple Items Nir Shabbat - 05305311 December 5, 2012 Introduction The paper I read is called Approximate Revenue Maximization with Multiple Items by Sergiu Hart
More informationModel-independent bounds for Asian options
Model-independent bounds for Asian options A dynamic programming approach Alexander M. G. Cox 1 Sigrid Källblad 2 1 University of Bath 2 CMAP, École Polytechnique University of Michigan, 2nd December,
More informationExponential utility maximization under partial information
Exponential utility maximization under partial information Marina Santacroce Politecnico di Torino Joint work with M. Mania AMaMeF 5-1 May, 28 Pitesti, May 1th, 28 Outline Expected utility maximization
More informationValue at Risk Ch.12. PAK Study Manual
Value at Risk Ch.12 Related Learning Objectives 3a) Apply and construct risk metrics to quantify major types of risk exposure such as market risk, credit risk, liquidity risk, regulatory risk etc., and
More informationMultilevel Monte Carlo Simulation
Multilevel Monte Carlo p. 1/48 Multilevel Monte Carlo Simulation Mike Giles mike.giles@maths.ox.ac.uk Oxford University Mathematical Institute Oxford-Man Institute of Quantitative Finance Workshop on Computational
More informationRisk management. Introduction to the modeling of assets. Christian Groll
Risk management Introduction to the modeling of assets Christian Groll Introduction to the modeling of assets Risk management Christian Groll 1 / 109 Interest rates and returns Interest rates and returns
More informationMaking Decisions. CS 3793 Artificial Intelligence Making Decisions 1
Making Decisions CS 3793 Artificial Intelligence Making Decisions 1 Planning under uncertainty should address: The world is nondeterministic. Actions are not certain to succeed. Many events are outside
More informationEvaluating Strategic Forecasters. Rahul Deb with Mallesh Pai (Rice) and Maher Said (NYU Stern) Becker Friedman Theory Conference III July 22, 2017
Evaluating Strategic Forecasters Rahul Deb with Mallesh Pai (Rice) and Maher Said (NYU Stern) Becker Friedman Theory Conference III July 22, 2017 Motivation Forecasters are sought after in a variety of
More informationIntroduction to Game-Theoretic Probability
Introduction to Game-Theoretic Probability Glenn Shafer Rutgers Business School January 28, 2002 The project: Replace measure theory with game theory. The game-theoretic strong law. Game-theoretic price
More information3.2 No-arbitrage theory and risk neutral probability measure
Mathematical Models in Economics and Finance Topic 3 Fundamental theorem of asset pricing 3.1 Law of one price and Arrow securities 3.2 No-arbitrage theory and risk neutral probability measure 3.3 Valuation
More informationCS 774 Project: Fall 2009 Version: November 27, 2009
CS 774 Project: Fall 2009 Version: November 27, 2009 Instructors: Peter Forsyth, paforsyt@uwaterloo.ca Office Hours: Tues: 4:00-5:00; Thurs: 11:00-12:00 Lectures:MWF 3:30-4:20 MC2036 Office: DC3631 CS
More informationFast Convergence of Regress-later Series Estimators
Fast Convergence of Regress-later Series Estimators New Thinking in Finance, London Eric Beutner, Antoon Pelsser, Janina Schweizer Maastricht University & Kleynen Consultants 12 February 2014 Beutner Pelsser
More informationReinforcement Learning
Reinforcement Learning Basic idea: Receive feedback in the form of rewards Agent s utility is defined by the reward function Must (learn to) act so as to maximize expected rewards Grid World The agent
More informationIntroduction to Real Options
IEOR E4706: Foundations of Financial Engineering c 2016 by Martin Haugh Introduction to Real Options We introduce real options and discuss some of the issues and solution methods that arise when tackling
More informationMultistage risk-averse asset allocation with transaction costs
Multistage risk-averse asset allocation with transaction costs 1 Introduction Václav Kozmík 1 Abstract. This paper deals with asset allocation problems formulated as multistage stochastic programming models.
More informationLecture 17. The model is parametrized by the time period, δt, and three fixed constant parameters, v, σ and the riskless rate r.
Lecture 7 Overture to continuous models Before rigorously deriving the acclaimed Black-Scholes pricing formula for the value of a European option, we developed a substantial body of material, in continuous
More informationReasoning with Uncertainty
Reasoning with Uncertainty Markov Decision Models Manfred Huber 2015 1 Markov Decision Process Models Markov models represent the behavior of a random process, including its internal state and the externally
More informationThe stochastic calculus
Gdansk A schedule of the lecture Stochastic differential equations Ito calculus, Ito process Ornstein - Uhlenbeck (OU) process Heston model Stopping time for OU process Stochastic differential equations
More informationKØBENHAVNS UNIVERSITET (Blok 2, 2011/2012) Naturvidenskabelig kandidateksamen Continuous time finance (FinKont) TIME ALLOWED : 3 hours
This question paper consists of 3 printed pages FinKont KØBENHAVNS UNIVERSITET (Blok 2, 211/212) Naturvidenskabelig kandidateksamen Continuous time finance (FinKont) TIME ALLOWED : 3 hours This exam paper
More information1.1 Basic Financial Derivatives: Forward Contracts and Options
Chapter 1 Preliminaries 1.1 Basic Financial Derivatives: Forward Contracts and Options A derivative is a financial instrument whose value depends on the values of other, more basic underlying variables
More information