EC316a: Advanced Scientific Computation, Fall Discrete time, continuous state dynamic models: solution methods

Similar documents
A simple wealth model

Idiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective

1.1 Some Apparently Simple Questions 0:2. q =p :

Economics 2010c: Lecture 4 Precautionary Savings and Liquidity Constraints

Applied Stochastic Processes and Control for Jump-Diffusions

EE266 Homework 5 Solutions

17 MAKING COMPLEX DECISIONS

Random Tree Method. Monte Carlo Methods in Financial Engineering

Modelling Returns: the CER and the CAPM

Implementing Models in Quantitative Finance: Methods and Cases

6.231 DYNAMIC PROGRAMMING LECTURE 10 LECTURE OUTLINE

Market Risk Analysis Volume I

Lecture 17: More on Markov Decision Processes. Reinforcement learning

Stochastic Optimal Control

STOCHASTIC PROGRAMMING FOR ASSET ALLOCATION IN PENSION FUNDS

Reasoning with Uncertainty

Final exam solutions

INTERTEMPORAL ASSET ALLOCATION: THEORY

Solving dynamic portfolio choice problems by recursing on optimized portfolio weights or on the value function?

Making Complex Decisions

Monte Carlo Methods in Financial Engineering

Markov Decision Processes

Lecture 1: Lucas Model and Asset Pricing

Contents Critique 26. portfolio optimization 32

1 Introduction. Term Paper: The Hall and Taylor Model in Duali 1. Yumin Li 5/8/2012

Chapter 6 Forecasting Volatility using Stochastic Volatility Model

An Introduction to Dynamic Macroeconomic Models. Part One: Basic Models And Solution Methods

16 MAKING SIMPLE DECISIONS

Parameter estimation in SDE:s

MACROECONOMICS. Prelim Exam

Shape-Preserving Dynamic Programming

Chapter 2 Uncertainty Analysis and Sampling Techniques

1 Precautionary Savings: Prudence and Borrowing Constraints

Solving Asset-Pricing Models with Recursive Preferences

Fast Convergence of Regress-later Series Estimators

Energy Systems under Uncertainty: Modeling and Computations

An Empirical Reassessment of the Commodity Storage Model

Interest Rate Curves Calibration with Monte-Carlo Simulatio

How Costly is External Financing? Evidence from a Structural Estimation. Christopher Hennessy and Toni Whited March 2006

Math 416/516: Stochastic Simulation

E ects of di erences in risk aversion on the. distribution of wealth

Estimation of dynamic term structure models

Reinforcement Learning and Simulation-Based Search

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29

Strategies for Improving the Efficiency of Monte-Carlo Methods

Proxy Function Fitting: Some Implementation Topics

Complex Decisions. Sequential Decision Making

Consumption and Portfolio Decisions When Expected Returns A

Lecture 2 Dynamic Equilibrium Models: Three and More (Finite) Periods

A No-Arbitrage Theorem for Uncertain Stock Model

6.231 DYNAMIC PROGRAMMING LECTURE 8 LECTURE OUTLINE

Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMSN50)

Pakes (1986): Patents as Options: Some Estimates of the Value of Holding European Patent Stocks

Introduction to Sequential Monte Carlo Methods

Dynamic Programming: An overview. 1 Preliminaries: The basic principle underlying dynamic programming

Hedging Derivative Securities with VIX Derivatives: A Discrete-Time -Arbitrage Approach

Smile in the low moments

Monte Carlo Methods in Structuring and Derivatives Pricing

What can we do with numerical optimization?

4 Reinforcement Learning Basic Algorithms

Risk shocks and monetary policy in the new normal

Monte Carlo Methods for Uncertainty Quantification

Markov Decision Processes: Making Decision in the Presence of Uncertainty. (some of) R&N R&N

MODELLING VOLATILITY SURFACES WITH GARCH

Machine Learning for Quantitative Finance

Pension Funds Performance Evaluation: a Utility Based Approach

Dynamic Portfolio Choice II

King s College London

Monte Carlo Based Numerical Pricing of Multiple Strike-Reset Options

The Pennsylvania State University. The Graduate School. Department of Industrial Engineering AMERICAN-ASIAN OPTION PRICING BASED ON MONTE CARLO

An Implementation of Markov Regime Switching GARCH Models in Matlab

Approximation of Continuous-State Scenario Processes in Multi-Stage Stochastic Optimization and its Applications

10. Monte Carlo Methods

Valuation of Forward Starting CDOs

16 MAKING SIMPLE DECISIONS

King s College London

TOBB-ETU, Economics Department Macroeconomics II (ECON 532) Practice Problems III

Asian Option Pricing: Monte Carlo Control Variate. A discrete arithmetic Asian call option has the payoff. S T i N N + 1

Option Pricing for Discrete Hedging and Non-Gaussian Processes

Fast and accurate pricing of discretely monitored barrier options by numerical path integration

THE OPTIMAL ASSET ALLOCATION PROBLEMFOR AN INVESTOR THROUGH UTILITY MAXIMIZATION

Non-Deterministic Search

Identifying Long-Run Risks: A Bayesian Mixed-Frequency Approach

MFE Course Details. Financial Mathematics & Statistics

Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty

STATE UNIVERSITY OF NEW YORK AT ALBANY Department of Economics. Ph. D. Comprehensive Examination: Macroeconomics Fall, 2010

Markov-Chain Approximations for Life-Cycle Models

Asset Prices and the Return to Normalcy

Multistage risk-averse asset allocation with transaction costs

Chapter 9 Dynamic Models of Investment

Sequential Decision Making

Solving Nonlinear Rational Expectations Models by Approximating the Stochastic Equilibrium System. Michael P. Evers (Bonn University)

Idiosyncratic risk and the dynamics of aggregate consumption: a likelihood-based perspective

Simple Robust Hedging with Nearby Contracts

1 Dynamic programming

BSc (Hons) Software Engineering BSc (Hons) Computer Science with Network Security

The Method of Moderation

Chapter 5 Portfolio. O. Afonso, P. B. Vasconcelos. Computational Economics: a concise introduction

Portfolio Choice and Permanent Income

HIGH ORDER DISCONTINUOUS GALERKIN METHODS FOR 1D PARABOLIC EQUATIONS. Ahmet İzmirlioğlu. BS, University of Pittsburgh, 2004

Transcription:

EC316a: Advanced Scientific Computation, Fall 2003 Notes Section 4 Discrete time, continuous state dynamic models: solution methods We consider now solution methods for discrete time models in which decisions are made over continuous state variables. These models give rise to functional equations whose unknowns are entire functions defined on a subset of Euclidean space: for instance, the unknown of the Bellman equation V (s) = max {f(s, x) + δev (g(s, x, ɛ))} x X(s) is the value function V (. ). In most applications, these functional equations lack closed form solutions, and can only be solved by numerical approximation methods. One numerical method widely used by economists is linear quadratic [Gaussian] (LQG) approximation, but

it is often inadequate to deal with the sorts of models which may be encountered. A more generally useful method, developed in the physical sciences, is that of collocation, which is flexible, accurate, and numerically efficient. We will now describe the collocation method, after presenting the LQG method. Linear quadratic [Gaussian] control The linear quadratic control model is an unconstrained Markov decision model with a quadratic reward function f(s, x) and a linear state transition function g(s, x, ɛ). It is of special importance because it is one of the few continuous state Markov decision models known to have a finite dimensional solution: the optimal policy and shadow price functions of the infinite horizon LQ control problem are both linear in the state variable. The parameters of the shadow price function are characterized by

nonlinear fixed point Riccati equations, which may be solved for each period with backward recursion. Thus standard nonlinear equation solution methods may be used to solve a LQ problem. The LQ model may also be applied in a stochastic context, with well-behaved (Gaussian) errors, since the shadow price and optimal policy functions depend only on the mean (or expected value) of the shock to each state, rather than on the variance or higher moments of the shock process. This gives rise to certainty equivalence, in which one may replace the expectation of the shock with its mean, and solving the resulting deterministic problem. The problem is then known as an LQG problem. The LQ methodology is often applied to more complex models (e.g. those not possessing a

linear state transition function: for instance, a macroeconomic model that cannot be linearized) by replacing the nonlinear f and g functions with linear and quadratic approximants, and solving the resulting LQ problem. These approximations are characteristically derived by forming first and second order Taylor expansions around the certainty equivalent steady state of the model. However, that methodology requires that the steady state of the model is computable, and that Taylor expansions around that steady state will not experience too much curvature in the relevant functions. Likewise, this linearization process usually discards any constraints on the states and actions. If those constraints are binding in the neighborhood of the steady state (e.g., the Federal Reserve s ability to lower interest rates at this point in time is limited), then the LQ approximation will be particularly poor. Therefore, methods which do not rely on these Taylor approximations are to be preferred.

Bellman equation collocation methods Consider an infinite horizon discrete time model with a one dimensional state and action space and univariate shocks. In a continuous state context, this model has the Bellman equation V (s) = max {f(s, x) + δev (g(s, x, ɛ))} x X(s) Assume that the state space is a bounded interval S of the real line, and that the actions are either discrete or continuous and subject to simple bounds a(s) x b(s) that are continuous functions of the state. To compute an approximate solution using collocation methods, we first write the value function approximant as a linear combination of n known basis functions φ 1, φ 2,..., φ n on S with undetermined coefficients: V (s) n j=1 c j φ j (s)

Second, fix the basis function coefficients c 1, c 2,..., c n by requiring the value function to satisfy the Bellman equation at n collocation nodes s 1, s 2,..., s n. This strategy replaces the Bellman functional equation with a system of n nonlinear equations in n unknowns: n j=1 c j φ j (s i ) = max {f(s i, x) + δe x X(s i ) n j=1 c j φ j (g(s i, x, ɛ))} which may be compactly expressed in vector form as the collocation equation Φ C = v(c) where Φ C, the collocation matrix, is an n x n matrix whose typical i j element is the j th basis function evaluated at the i th collocation node. When v, the collocation function, is evaluated at a particular vector of basis coefficients c, it yields a vector whose i th entry is the value obtained by solving the optimization problem embedded in the Bellman equation at the i th

collocation node s i, replacing the value function V with its approximant j c j φ j. In principle, the collocation equation may be solved with any nonlinear equation solution method: as a fixed point problem c = Φ 1 v(c), or as a root finding problem Φ C v(c) = 0, solving for c using Newton s method or a quasi Newton method. If the problem is stochastic, expectations must be computed in a practical manner: for instance, by replacing the continuous random variable ɛ in the state transition function g with a discrete approximant: for instance, one that assumes values ɛ 1, ɛ 2,..., ɛ k with probabilities w 1, w 2,..., w k respectively. This is known as a numerical quadrature method, which computes discrete integrals as approximations to the continuous probability density function.

A number of practical decisions must be made when applying the collocation method. The basis functions (interpolators, such as spline functions) and collocation nodes must be selected, and an algorithm for solving the collocation equation chosen. Last, an appropriate quadrature technique for dealing with expectations must be selected. The choice of basis node scheme will depend on the curvature of the value function. The larger the number of basis functions and collocation nodes, the greater the computational burden, so the researcher will want to experiment with various basis node schemes and dimensions of the problem to render it computationally efficient. Collocation methods address many of the shortcomings of LQ approximations, since they employ global (rather than local) function approximation schemes, and are not limited to the first and second degree approximations

afforded by LQ methods. However, polynomial and spline approximants used in collocation methods can behave strangely, especially in the presence of nondifferentiabilities in the value function and binding constraints on the action variables (which may cause problems of nonconvergence). Although we have discussed collocation methods for a very simple problem, the routines included in the CEtools MATLAB toolbox will support solution of models with multidimensional states, actions, and shocks. The major computational challenge in implementing these methods for any model is the implementation of the vmax(s,c) function: a function that solves the optimization problem embedded in the Bellman equation at the collocation nodes and returns the collocation function values and derivatives. The vmax function will return several objects: an n x 1 vector v

of optimal values at the collocation nodes, an n x 1 vector x of associated optimal actions at the nodes, and an n x n matrix vjac, the Jacobian of the collocation function evaluated at the basis coefficients c. After the collocation method has rendered a solution, the residual function should be computed to evaluate the quality of the approximation. This function measures the difference between the left and right sides of the Bellman equation at arbitrary states s when thee value function is replaced with its approximant (and the optimal basis coefficients c). It would be zero for all states in an exact solution, and will be zero at the collocation nodes for any solution. If the approximation is adequate, the residual function will not depart too far from zero for any arbitrary value of the state in the interval S. If large residuals are obtained, the problem should be re solved using a different basis node scheme.

We now consider numerical solutions via collocation methods for several of the models discussed in the last section. In the asset replacement problem of 8.3.1, the stochastic element of the problem is taken to be the replacement cost k: k t+1 = k + γ(k t k) + ɛ t+1 where ɛ is an i.i.d. normal shock with mean zero and variance σ 2. In the implementation of the collocation method, the shock is discretized using a five node Gaussian quadrature scheme. Solution to the problem (demdp01) demonstrates that for a given asset age, the value of the firm is a downward sloping function of the replacement cost. The function is kinked at the critical replacement cost, below which the asset is to be replaced. The younger the asset, the greater the value of the firm. In the economic growth example of 8.4.1, the model is operationalized by assuming a social

benefit function u(c) = c 1 α /(1 α), with α=0.2, and an aggregate production function h(x) = x β, with β=0.5. The shock process, which modifies the value of production, is taken to be lognormal with variance σ 2 =0.01. The model is coded to incorporate the constraints on the action variable: in this case, to specify that investment x must be non negative and no greater than s. The model function must also specify the reward function value, the state transition function, and the analytical first and second derivatives of those functions. The lognormal production shock is discretized using a three node Gaussian quadrature scheme, and a polynomial basis is used on the interval [5,10] for the space of expected wealth. As we see from the graphs (demdp07) of a Monte Carlo simulation of this model, the steady state distribution is centered on the value of the certainty equivalent path of expected wealth, which converges asymptotically to about 7.5 units after 10 years or so.

In the continuous state mine management problem of 8.4.3, the model is solved using an inverse demand function p(x) = a 1 a 2 x (that is, linear demand) and a cost of extraction function c(s, x) = b 1 x 0.5b 2 x(2s x) which causes the cost of extraction to rise with the depletion of the mine. Constraints, as in the problem above, are placed on the action space to indicate that the extraction (or harvest ) must be non negative and no greater than s. Solution of the model indicates a shadow price function with a kink at two units of remaining stock: that is, beyond that point, extraction will never be the optimal strategy. This model is illustrated in demdp09. Finally, in the production inventory example of 8.4.7, the model is operationalized with quadratic production and cost of storage functions: c(q) = c 1 q +0.5c 2 q 2 and k(x) = k 1 x+0.5k 2 x 2, respectively. The evolution of the market price is

governed by p t+1 = p + ρ(p t p) + ɛ where the latter is an i.i.d. normal shock. The model is considerably more complex in its solution since it is characterized by two states and two actions, requiring two dimensional grids in each of these spaces. Since the price process is mean reverting, the optimal inventory policy will be to store nothing if the price is sufficiently high, since it is likely to fall, and the cost of storage will exceed expected appreciation of the good. For sufficiently low prices, it will be economical to hold inventories, since in that instance the expected appreciation of the good will exceed the cost of storage. The value of the firm is an increasing function of both the market price and beginning inventories. With low prices, a simulation (demdp13) reveals that the firm will obtain substantial stocks at the outset, but is expected to gradually reduce those stocks over time, reaching a small steady state mean value of inventories.