Approximations of Stochastic Programs. Scenario Tree Reduction and Construction

Similar documents
Scenario reduction and scenario tree construction for power management problems

Scenario tree generation for stochastic programming models using GAMS/SCENRED

Energy Systems under Uncertainty: Modeling and Computations

Approximation of Continuous-State Scenario Processes in Multi-Stage Stochastic Optimization and its Applications

Scenario Reduction and Scenario Tree Construction for Power Management Problems

Dynamic Risk Management in Electricity Portfolio Optimization via Polyhedral Risk Functionals

Scenario Reduction and Scenario Tree Construction for Power Management Problems

Scenario Generation for Stochastic Programming Introduction and selected methods

Multistage risk-averse asset allocation with transaction costs

Robust Dual Dynamic Programming

On Complexity of Multistage Stochastic Programs

Dynamic Asset and Liability Management Models for Pension Systems

Optimally Thresholded Realized Power Variations for Lévy Jump Diffusion Models

DASC: A DECOMPOSITION ALGORITHM FOR MULTISTAGE STOCHASTIC PROGRAMS WITH STRONGLY CONVEX COST FUNCTIONS

Risk aversion in multi-stage stochastic programming: a modeling and algorithmic perspective

Assessing Policy Quality in Multi-stage Stochastic Programming

Worst-case-expectation approach to optimization under uncertainty

Yao s Minimax Principle

Scenario Construction and Reduction Applied to Stochastic Power Generation Expansion Planning

Essays on Some Combinatorial Optimization Problems with Interval Data

Multistage Stochastic Programs

Optimal Security Liquidation Algorithms

VaR vs CVaR in Risk Management and Optimization

Equity correlations implied by index options: estimation and model uncertainty analysis

Worst-Case Value-at-Risk of Non-Linear Portfolios

Stochastic Dynamic Programming Using Optimal Quantizers

Implementing Models in Quantitative Finance: Methods and Cases

Stochastic Dual Dynamic Programming

Optimum Thresholding for Semimartingales with Lévy Jumps under the mean-square error

Optimal Order Placement

Dynamic Replication of Non-Maturing Assets and Liabilities

Progressive Hedging for Multi-stage Stochastic Optimization Problems

Stochastic Dual Dynamic Programming Algorithm for Multistage Stochastic Programming

Optimal construction of a fund of funds

A Multi-Stage Stochastic Programming Model for Managing Risk-Optimal Electricity Portfolios. Stochastic Programming and Electricity Risk Management

MATH3075/3975 FINANCIAL MATHEMATICS TUTORIAL PROBLEMS

Computational Finance. Computational Finance p. 1

Order book resilience, price manipulations, and the positive portfolio problem

2.1 Mathematical Basis: Risk-Neutral Pricing

Report for technical cooperation between Georgia Institute of Technology and ONS - Operador Nacional do Sistema Elétrico Risk Averse Approach

Utility Indifference Pricing and Dynamic Programming Algorithm

Multistage Stochastic Programming

Electricity Swing Options: Behavioral Models and Pricing

Forecast Horizons for Production Planning with Stochastic Demand

CONVERGENCE OF OPTION REWARDS FOR MARKOV TYPE PRICE PROCESSES MODULATED BY STOCHASTIC INDICES

Dynamic Portfolio Choice II

Valuing American Options by Simulation

Stochastic Programming in Gas Storage and Gas Portfolio Management. ÖGOR-Workshop, September 23rd, 2010 Dr. Georg Ostermaier

Multistage Stochastic Mixed-Integer Programs for Optimizing Gas Contract and Scheduling Maintenance

Solving real-life portfolio problem using stochastic programming and Monte-Carlo techniques

Dynamic Programming: An overview. 1 Preliminaries: The basic principle underlying dynamic programming

Mortgage Loan Portfolio Optimization Using Multi-Stage Stochastic Programming

6.231 DYNAMIC PROGRAMMING LECTURE 3 LECTURE OUTLINE

Quality Evaluation of Scenario-Tree Generation Methods for Solving Stochastic Programming Problem

Multistage Stochastic Demand-side Management for Price-Making Major Consumers of Electricity in a Co-optimized Energy and Reserve Market

MULTI-STAGE STOCHASTIC ELECTRICITY PORTFOLIO OPTIMIZATION IN LIBERALIZED ENERGY MARKETS

IMPA Commodities Course : Forward Price Models

Asset-Liability Management

Handout 4: Deterministic Systems and the Shortest Path Problem

3.4 Copula approach for modeling default dependency. Two aspects of modeling the default times of several obligors

Fast Convergence of Regress-later Series Estimators

Is intertemporal choice theory testable?

Investigation of the and minimum storage energy target levels approach. Final Report

Binomial model: numerical algorithm

European Contingent Claims

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Risk Management for Chemical Supply Chain Planning under Uncertainty

Credit Risk Models with Filtered Market Information

Lecture 7: Bayesian approach to MAB - Gittins index

A Structural Model for Carbon Cap-and-Trade Schemes

Policy iterated lower bounds and linear MC upper bounds for Bermudan style derivatives

Dynamic Appointment Scheduling in Healthcare

Regression estimation in continuous time with a view towards pricing Bermudan options

Homework #4. CMSC351 - Spring 2013 PRINT Name : Due: Thu Apr 16 th at the start of class

An effective perfect-set theorem

PORTFOLIO OPTIMIZATION AND EXPECTED SHORTFALL MINIMIZATION FROM HISTORICAL DATA

A New Scenario-Tree Generation Approach for Multistage Stochastic Programming Problems Based on a Demerit Criterion

Optimal Dividend Policy of A Large Insurance Company with Solvency Constraints. Zongxia Liang

DRAFT. 1 exercise in state (S, t), π(s, t) = 0 do not exercise in state (S, t) Review of the Risk Neutral Stock Dynamics

IMPLEMENTING THE SPECTRAL CALIBRATION OF EXPONENTIAL LÉVY MODELS

Portfolio Management and Optimal Execution via Convex Optimization

BAYESIAN NONPARAMETRIC ANALYSIS OF SINGLE ITEM PREVENTIVE MAINTENANCE STRATEGIES

Interest-Sensitive Financial Instruments

Martingales. by D. Cox December 2, 2009

From Discrete Time to Continuous Time Modeling

e-companion ONLY AVAILABLE IN ELECTRONIC FORM

Market Design for Emission Trading Schemes

Problem Set 3. Thomas Philippon. April 19, Human Wealth, Financial Wealth and Consumption

Vladimir Spokoiny (joint with J.Polzehl) Varying coefficient GARCH versus local constant volatility modeling.

The Irrevocable Multi-Armed Bandit Problem

Introduction to modeling using stochastic programming. Andy Philpott The University of Auckland

Lecture 10: The knapsack problem

The Stigler-Luckock model with market makers

Discrete time interest rate models

Risk Measurement in Credit Portfolio Models

Introduction to Probability Theory and Stochastic Processes for Finance Lecture Notes

Contagion models with interacting default intensity processes

Reinforcement Learning and Simulation-Based Search

Deterministic Income under a Stochastic Interest Rate

Parametric Inference and Dynamic State Recovery from Option Panels. Torben G. Andersen

Transcription:

Approximations of Stochastic Programs. Scenario Tree Reduction and Construction W. Römisch Humboldt-University Berlin Institute of Mathematics 10099 Berlin, Germany www.mathematik.hu-berlin.de/~romisch (J. Dupačová, N. Gröwe-Kuska, H. Heitsch) GAMS Workshop, Heidelberg, Sept. 1-3, 2003 1

1 Introduction Let {ξ t } T t=1 be a discrete-time stochastic data process defined on some probability space (Ω, F, P ) and with ξ 1 deterministic. The stochastic decision x t at period t is assumed to depend only on (ξ 1,..., ξ t ) (nonanticipativity). Typical financial and production planning model: min{ie[ T c t (ξ t, x t )] : x t X t, x t nonanticipative, t=1 A tt (ξ t )x t + A t,t 1 (ξ t )x t 1 g t (ξ t )} Alternative for the minimization of expected costs: Minimizing some risk measure IF of the stochastic cost process {c t (ξ t, x t )} T t=1 (risk management). First step of its numerical solution: Approximation of {ξ t } T t=1 by finitely many scenarios with certain probabilities. Nonanticipativity leads to a scenario tree structure of the approximation. 2

2 Data process approximation by scenario trees The data process ξ = {ξ t } T t=1 is approximated by a process forming a scenario tree which is based on a finite set N of nodes. n = 1 n n N T N + (n) t = 1 t 1 t(n) T Scenario tree with t 1 = 2, T = 5, N = 23 and 11 leaves The root node n = 1 stands for period t = 1. Every other node n has a unique predecessor n and a set N + (n) of successors. Let path(n) be the set {1,..., n, n} of nodes from the root to node n, t(n) := path(n) and N T := {n N : N + (n) = } the set of leaves. A scenario corresponds to path(n) for some n N T. With the given scenario probabilities {π n } n NT, we define recursively node probabilities π n := n + N + (n) π n +, n N. 3

3 Generation of scenario trees (i) Development of a stochastic model for the data process ξ (parametric [e.g. time series model], nonparametric [e.g. resampling]) Load 8000 7000 6000 5000 4000 3000 2000 0 25 50 75 100 125 150 Hours Scenarios for the weekly electrical load and generation of simulation scenarios; (ii) Construction of a scenario tree out of the stochastic model or of the simulation scenarios; (iii) optional scenario tree reduction. 4

Approaches for (ii): (1) Barycentric scenario trees (conditional expectations w.r.t. a decomposition of the support into simplices) (Frauendorfer 96,...); (2) Fitting of trees with prescribed structure to given moments (Hoyland/Wallace 01, Hoyland/Kaut/Wallace 03); (3) Conditional sampling by (Quasi) Monte Carlo methods (QMC means low discrepancy sequences) (Morton 03, Koivu/Pennanen 02, 03); (4) Clustering methods for bundling scenarios (Philpott/Craddock/Waterer 00); (5) Scenario tree construction based on optimal approximations w.r.t. certain probability metrics (Pflug 01, Hochreiter/Pflug 02, Gröwe-Kuska/Heitsch/Römisch 03). Recent reference: Kaut/Wallace 03 5

Example: (Hochreiter/Pflug 02) Let P denote the uniform distribution on [ 3, 3] and P be the distribution of Z := c 1 Z 1 + c 2 Z 2, where Z 1 is discrete with two equally probable scenarios 1 and 1, Z 2 is standard normal, i.e., Z 2 N(0, 1), and c 1 and c 2 are normalizing constants (c 1 := 4 3 5, c 3 2 := 1 5 ). Then the first four (central) moments coincide 9 IR ξi P (dξ) = IR ξi P (dξ) = 0, 1, 0,, i = 1, 2, 3, 4. 5 However, the densities of P and P have the following form 0.45 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 5 4 3 2 1 0 1 2 3 4 5 and, thus, are quite different. 6

4 Distances of probability distributions Let P denote the probability distribution of the stochastic process {ξ t } T t=1 with ξ t in IR r, i.e., P has support Ξ IR rt = IR s. The Kantorovich functional or transportation metric takes the form µ c (P, Q) := inf{ c(ξ, ξ)η(dξ, d ξ) : π 1 η = P, π 2 η = Q}, Ξ Ξ where c : Ξ Ξ IR is a certain cost function and the minimum ist taken w.r.t. all probability measures η on Ξ Ξ having (fixed) marginals P and Q. Example: c p (ξ, ξ) := max{1, ξ ξ 0 p 1, ξ ξ 0 p 1 } ξ ξ (p 1, ξ 0 Ξ fixed) We consider the following convex stochastic program min{ f 0 (x, ξ)p (dξ) : x X} Ξ with a normal convex integrand f 0 and denote by v(p ) := inf f 0 (x, ξ)p (dξ) and S(P ) := arg min x X Ξ x X its optimal value and solution set, respectively. Ξ f 0 (x, ξ)p (dξ) 7

We choose c such that the property f 0 (x, ξ) f 0 (x, ξ) L( x )c(ξ, ξ), ξ, ξ Ξ, x X, holds with some function L( ) depending on x. This means that c plays the role of a continuity modulus of the function f 0 (x, ) from Ξ to IR (for each x X). Typically, f 0 is continuous and piecewise polynomial. Theorem: (Stability) Under weak conditions on the stochastic program the optimal values are Lipschitz continuous w.r.t. µ c, i.e., v(p ) v(q) ˆLµ c (P, Q), and the solution sets are upper semicontinuous. In particular, if S(P ) = { x} any element of the approximate solution set S(Q) is close to x if µ c (P, Q) is small. (Rachev/Römisch 02, Römisch 03) 8

Choice of p 1 in c = c p : two-stage with random right-hand side: p = 1. general two-stage with fixed recourse: p = 2. multi-stage with random right-hand side: p = 1. general multi-stage with T stages: p = T. ( present conjecture valid under appropriate assumptions on the dependence structure; not valid for mixed-integer models; in that case f 0 is piecewise continuous!) Approach: Select a probability metric a function c : Ξ Ξ IR such that the underlying stochastic optimization model is stable w.r.t. µ c. Given P and a tolerance ε > 0, determine a scenario tree such that its probability distribution P tr has the property µ c (P, P tr ) ε. 9

Distances of discrete distributions P : scenarios ξ i with probabilities p i, i = 1,..., N, Q: scenarios ξ j with probabilities q j, j = 1,..., M. Then N µ c (P, Q) = sup{ p i u i + i=1 M q j v j : u i + v j c(ξ i, ξ j ) i, j} j=1 = inf{ i,j η ij c(ξ i, ξ j ) : η ij 0, j η ij = p i, i η ij = q j } (optimal value of linear transportation problems) (a) Distances of distributions can be computed by solving specific linear programs. (b) The principle of optimal scenario generation can be formulated as a best approximation problem with respect to µ c. However, it is nonconvex and difficult to solve. (c) The best approximation problem simplifies considerably if the scenarios are taken from a specified finite set. 10

5 Scenario Reduction We consider discrete distributions P with scenarios ξ i and probabilities p i, i = 1,..., N, and Q having a subset of scenarios ξ j, j J {1,..., N}, of P, but different probabilities q j, j J. Optimal reduction of a given scenario set J: The best approximation of P with respect to µ c by such a distribution Q exists and is denoted by Q. It has the distance D J = µ c (P, Q) = p i min c(ξ i, ξ j ) j J i J and the probabilities q j = p j + p i, j J, where J j := i J j {i J : j = j(i)} and j(i) arg min c(ξ i, ξ j ), i j J J, i.e., the optimal redistribution consists in adding the deleted scenario weight to that of some of the closest scenarios. However, finding the optimal scenario set with a fixed number n of scenarios is a combinatorial optimization problem. 11

6 Fast reduction heuristics Starting point (n = N 1): min p l min c(ξ l, ξ j ) l {1,...,N} j l Algorithm 1: (Simultaneous backward reduction) Step [0]: Step [i]: Step [N-n+1]: Sorting of {c(ξ j, ξ k ) : j}, k, J [0] :=. l i arg min l J [i 1] k J [i 1] {l} J [i] := J [i 1] {l i }. Optimal redistribution. p k min c(ξ k, ξ j ). j J [i 1] {l} 12

Starting point (n = 1): min u {1,...,N} k=1 N p k c(ξ k, ξ u ) Algorithm 2: (Fast forward selection) Step [0]: Compute c(ξ k, ξ u ), k, u = 1,..., N, Step [i]: Step [n+1]: J [0] := {1,..., N}. u i arg min u J [i 1] J [i] := J [i 1] \ {u i }. k J [i 1] \{u} Optimal redistribution. p k min c(ξ k, ξ j ), j J [i 1] \{u} 13

1000 Original load scenario tree 500 0-500 -1000 0 25 50 75 100 125 150 1000 Reduced load scenario tree / backward 1000 500 500 0 0-500 -500-1000 -1000 1000 500 24 48 72 96 120 144 168 Reduced load scenario tree / forward 24 48 72 96 120 144 168 500 0 0-500 -500-1000 24 48 72 96 120 144 168-1000 24 48 72 96 120 144 168 1 1 14

Binary test scenario tree Let a binary scenario tree have N := 2 T 1 scenarios ξ i = (ξi 1,..., ξt i ), i = 1,..., N, with equal probabilities p i = 1 N, i = 1,..., N, and ξ1 1 =... = ξn 1 as its root node. Such a scenario tree is called regular if, for each t {1,..., T }, δ1 t := δ t and δ2 t := δ t with δ t IR + and ξ t i = t δi τ τ (t {1,..., T }) τ=1 where to each index i = 1,..., N there corresponds a T - tupel of indices (i 1,..., i T ) {1, 2} T. Proposition: Let a regular binary scenario tree with N = 2 T 1 scenarios and T 4 be given. Let t 0 arg min 2 t T δ t, t 0 T 2 and max{δ t 0+1, δ t 0+2 } 2δ t 0. Then it holds for each n IN with N 4 n < N: D opt n := min{d J : #J = N n} = N n N 2δt 0. Here, c is defined by c(ξ, ξ) := ξ ξ (ξ, ξ Ξ). 15

Example: (regular binary scenario tree) T = 11, N = 2 10 = 1024, (δ 1,..., δ 11 ) = (0, 0.5, 0.6, 0.7, 0.9, 1.1, 1.3, 1.6, 1.9, 2.3, 2.7), D opt n = N n N for each N 4 n < N. 10 5 0-5 -10 Relative accuracy: µ rel c (P, Q) := µ c(p, Q) µ c (P, δ ξl ) µ c (P, δ ξl ) = min{d J : #J = N 1} and c(, ) :=. 16

Number Backward of Simultaneous Fast Minimal n of Scenario Sets Backward Forward Distance Scenarios ζc rel Time ζc rel Time ζc rel Time 1 116.01 % 2 s 111.93 % 96 s 100.00 % 2 s 100.00 % 2 102.86 % 2 s 75.45 % 96 s 79.16 % 2 s * 3 78.54 % 2 s 66.54 % 96 s 63.96 % 2 s * 4 66.35 % 2 s 61.69 % 96 s 59.04 % 3 s * 5 64.81 % 2 s 57.95 % 96 s 54.51 % 3 s * 10 53.68 % 2 s 48.21 % 95 s 44.39 % 4 s * 20 39.16 % 2 s 40.15 % 95 s 35.84 % 7 s * 30 35.61 % 2 s 34.70 % 94 s 31.56 % 10 s * 50 31.55 % 2 s 29.11 % 93 s 26.75 % 15 s * 100 22.68 % 2 s 21.73 % 89 s 20.97 % 27 s * 150 18.48 % 2 s 18.16 % 85 s 18.02 % 38 s * 200 16.70 % 2 s 16.50 % 81 s 16.11 % 48 s * 250 15.23 % 2 s 15.21 % 76 s 14.55 % 56 s * 260 14.97 % 2 s 14.97 % 75 s 14.26 % 58 s 14.04 % 270 14.75 % 2 s 14.75 % 74 s 14.00 % 60 s 13.86 % 280 14.53 % 2 s 14.53 % 72 s 13.76 % 61 s 13.67 % 290 14.30 % 2 s 14.30 % 71 s 13.54 % 63 s 13.49 % 300 14.08 % 2 s 14.08 % 70 s 13.32 % 64 s 13.30 % 350 12.98 % 2 s 12.98 % 64 s 12.39 % 71 s 12.39 % 400 11.88 % 2 s 11.88 % 57 s 11.47 % 76 s 11.47 % 450 10.78 % 2 s 10.78 % 51 s 10.55 % 81 s 10.55 % 500 9.67 % 2 s 9.67 % 45 s 9.63 % 85 s 9.63 % 600 7.79 % 2 s 7.79 % 33 s 7.79 % 91 s 7.79 % 700 5.95 % 2 s 5.95 % 22 s 5.95 % 95 s 5.95 % 800 4.12 % 2 s 4.12 % 12 s 4.12 % 97 s 4.12 % Computational results for the binary scenario tree 17

7 Constructing scenario trees from data scenarios Let a fan of data scenarios ξ i = (ξ1, i..., ξt i ) with probabilities π i, i = 1,..., N, be given, i.e., all scenarios coincide at the starting point t = 1, i.e., ξ1 1 =... = ξ1 N =: ξ1. Hence, it has the form t = 1 may be regarded as the root node of the scenario tree consisting of N scenarios (leaves). Now, P is the (discrete) probability distribution of ξ. Let c be adapted to the underlying stochastic program containing P. We describe an algorithm that may produce, for each ε > 0, a scenario tree with distribution P ε, root node ξ 1, less nodes than P and µ c (P, P ε ) < ε. 18

Recursive reduction algorithm: Let ε t > 0, t = 1,..., T, be given such that T t=1 ε t ε, set t := T, I T +1 := {1,..., N}, πt i +1 := πi and P T +1 := P. For t = T,..., 2: Step t: Determine an index set I t I t+1 such that µ ct (P t, P t+1 ) < ε t, where {ξ i } i It is the support of P t and c t is defined by c t (ξ, ξ) := c((ξ 1,..., ξ t, 0,..., 0), ( ξ 1,..., ξ t, 0,..., 0)); (scenario reduction w.r.t. the time horizon [1, t]) Step 1: Determine a probability measure P ε such that its marginal distributions P ε Π 1 t are δ ξ 1 for t = 1 and P ε Π 1 t = πtδ i ξ i and π t t i := πt+1 i + π j t+1, i I t j J t,i where J t,i := {j I t+1 \ I t : i t (j) = i}, i t (j) arg min c t (ξ j, ξ i )} are the index sets according to the redistribution i I t rule. 19

Blue: compute c-distances of scenarios; delete the green scenario & add its weight to the red one 20

Application: ξ is the bivariate weekly data process having the components a) electrical load, b) hourly electricity spot prices (at EEX). Data scenarios are obtained from a stochastic model calibrated to the historical load data of a (small) German power utility and historical price data of the European Energy Exchange (EEX) at Leipzig. We choose N = 50, T = 7, ε = 0.05, ε t = ε T, and arrive at a tree with 4608 nodes (instead of 8400 nodes of the original fan). t hours I t 1 1 24 1 2 25 48 12 3 49 72 23 4 73 96 31 5 97 120 37 6 121 144 42 7 145 168 46 21

400 Scenario tree for the electrical load 350 300 Load 250 200 150 0 25 50 75 100 125 150 Hours 100 Scenario tree for hourly spot prices 80 60 Spot price 40 20 0 25 50 75 100 125 150 Hours 22

8 GAMS/SCENRED GAMS/SCENRED introduced to GAMS Distribution 20.6 (May 2002) SCENRED is a collection of C++ routines for the optimal reduction of scenarios or scenario trees GAMS/SCENRED provides the link from GAMS programs to the scenario reduction algorithms. The reduced problems can then be solved by a deterministic optimization algorithm provided by GAMS. SCENRED contains three reduction algorithms: - FAST BACKWARD method - Mix of FAST BACKWARD/FORWARD methods - Mix of FAST BACKWARD/BACKWARD methods Automatic selection (best expected performance w.r.t. running time) Details: www.scenred.de, www.scenred.com 23