Calibrating Financial Models Using Consistent Bayesian Estimators

Similar documents
Equity correlations implied by index options: estimation and model uncertainty analysis

FX Smile Modelling. 9 September September 9, 2008

Calibration Lecture 4: LSV and Model Uncertainty

Heinz W. Engl. Industrial Mathematics Institute Johannes Kepler Universität Linz, Austria

Chapter 7: Estimation Sections

European option pricing under parameter uncertainty

Dynamic Hedging in a Volatile Market

THAT COSTS WHAT! PROBABILISTIC LEARNING FOR VOLATILITY & OPTIONS

Monte Carlo Methods in Financial Engineering

Tangent Lévy Models. Sergey Nadtochiy (joint work with René Carmona) Oxford-Man Institute of Quantitative Finance University of Oxford.

Numerics for SLV models in FX markets

Pricing Implied Volatility

IEOR E4703: Monte-Carlo Simulation

King s College London

Application of MCMC Algorithm in Interest Rate Modeling

Machine Learning for Quantitative Finance

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior

Unobserved Heterogeneity Revisited

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties

Economics 2010c: Lecture 4 Precautionary Savings and Liquidity Constraints

Model Risk Embedded in Yield-Curve Construction Methods

Calibration of Interest Rates

Pricing Variance Swaps under Stochastic Volatility Model with Regime Switching - Discrete Observations Case

Simple Robust Hedging with Nearby Contracts

Chapter 7: Estimation Sections

Optimal Hedging of Variance Derivatives. John Crosby. Centre for Economic and Financial Studies, Department of Economics, Glasgow University

Optimally Thresholded Realized Power Variations for Lévy Jump Diffusion Models

RISK-NEUTRAL VALUATION AND STATE SPACE FRAMEWORK. JEL Codes: C51, C61, C63, and G13

Boundary conditions for options

Advanced Topics in Derivative Pricing Models. Topic 4 - Variance products and volatility derivatives

Sample Path Large Deviations and Optimal Importance Sampling for Stochastic Volatility Models

A Consistent Pricing Model for Index Options and Volatility Derivatives

Implementing Models in Quantitative Finance: Methods and Cases

The Uncertain Volatility Model

Local Variance Gamma Option Pricing Model

Accelerated Option Pricing Multiple Scenarios

Local vs Non-local Forward Equations for Option Pricing

Extrapolation analytics for Dupire s local volatility

MODELLING VOLATILITY SURFACES WITH GARCH

Pricing of a European Call Option Under a Local Volatility Interbank Offered Rate Model

Simulating Stochastic Differential Equations

Which GARCH Model for Option Valuation? By Peter Christoffersen and Kris Jacobs

Heston Stochastic Local Volatility Model

Estimating the Greeks

The Black-Scholes Model

King s College London

Hedging Under Jump Diffusions with Transaction Costs. Peter Forsyth, Shannon Kennedy, Ken Vetzal University of Waterloo

2.1 Mathematical Basis: Risk-Neutral Pricing

FE610 Stochastic Calculus for Financial Engineers. Stevens Institute of Technology

Hedging Barrier Options through a Log-Normal Local Stochastic Volatility Model

TEST OF BOUNDED LOG-NORMAL PROCESS FOR OPTIONS PRICING

1 Bayesian Bias Correction Model

Financial Risk Management

ANALYSIS OF THE BINOMIAL METHOD

Rapid computation of prices and deltas of nth to default swaps in the Li Model

Robust Longevity Risk Management

Computational Finance Binomial Trees Analysis

Simple Robust Hedging with Nearby Contracts

Understanding Index Option Returns

Pricing Barrier Options under Local Volatility

UQ, STAT2201, 2017, Lectures 3 and 4 Unit 3 Probability Distributions.

Financial Risk Management

Module 10:Application of stochastic processes in areas like finance Lecture 36:Black-Scholes Model. Stochastic Differential Equation.

Extracting Information from the Markets: A Bayesian Approach

Lecture 17: More on Markov Decision Processes. Reinforcement learning

Interest-Sensitive Financial Instruments

Dynamic Asset Pricing Models: Recent Developments

Practical example of an Economic Scenario Generator

A radial basis function approach to reconstructing the local volatility surface of European options. James Glover

IMPLEMENTING THE SPECTRAL CALIBRATION OF EXPONENTIAL LÉVY MODELS

From Discrete Time to Continuous Time Modeling

FINITE DIFFERENCE METHODS

Vega Maps: Predicting Premium Change from Movements of the Whole Volatility Surface

Adjusting the Black-Scholes Framework in the Presence of a Volatility Skew

On modelling of electricity spot price

EFFICIENT MONTE CARLO ALGORITHM FOR PRICING BARRIER OPTIONS

IEOR E4703: Monte-Carlo Simulation

Introduction Random Walk One-Period Option Pricing Binomial Option Pricing Nice Math. Binomial Models. Christopher Ting.

The Black-Scholes Model

The histogram should resemble the uniform density, the mean should be close to 0.5, and the standard deviation should be close to 1/ 12 =

Optimum Thresholding for Semimartingales with Lévy Jumps under the mean-square error

The Yield Envelope: Price Ranges for Fixed Income Products

STA 532: Theory of Statistical Inference

Stochastic Differential Equations in Finance and Monte Carlo Simulations

Regression estimation in continuous time with a view towards pricing Bermudan options

University of Oxford. Robust hedging of digital double touch barrier options. Ni Hao

Chapter 4: Asymptotic Properties of MLE (Part 3)

MFE/3F Questions Answer Key

Hedging Default Risks of CDOs in Markovian Contagion Models

START HERE: Instructions. 1 Exponential Family [Zhou, Manzil]

Market Risk Analysis Volume I

Pricing with a Smile. Bruno Dupire. Bloomberg

Valuation of Volatility Derivatives. Jim Gatheral Global Derivatives & Risk Management 2005 Paris May 24, 2005

Distributed Computing in Finance: Case Model Calibration

Model Estimation. Liuren Wu. Fall, Zicklin School of Business, Baruch College. Liuren Wu Model Estimation Option Pricing, Fall, / 16

Rho and Delta. Paul Hollingsworth January 29, Introduction 1. 2 Zero coupon bond 1. 3 FX forward 2. 5 Rho (ρ) 4. 7 Time bucketing 6

Chapter 2 Uncertainty Analysis and Sampling Techniques

Stochastic Volatility (SV) Models

Rough volatility models: When population processes become a new tool for trading and risk management

Modeling skewness and kurtosis in Stochastic Volatility Models

Transcription:

Calibrating Financial Models Using Consistent Bayesian Estimators Christoph Reisinger Joint work with Alok Gupta June 25, 2010

Example model uncertainty A local volatility model, jump diffusion model, and (Heston) stochastic volatility model calibrated to 60 observed European calls for different strike/maturity pairs within 3 basis points. 12 0.4 10! = 0.10! (volatility) 0.3 0.2 0.1 1 0 0.5 t (time in years) 0 50 200 150 100 S (asset price) frequency 8 6 4 2 0 1 0.5 0 0.5 1 jump size Parameter rate of long run volatility of correlation initial reversion variance volatility variance Value 0.0745 0.1415 0.1038-0.2127 0.0167 The value of an up-and-out barrier call with strike 90% and barrier 110% of the spot varies by 177 basis points.

3 Example parameter uncertainty Three different local volatility models calibrated to 60 observed European calls for different strike/maturity pairs within 3 basis points. See also Hamida and Cont (2005).! (volatility) 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 1 0.8 0.6 0.4 0.2 100 150 200 t (time in years) 0 50 S (asset price) The value of an up-and-out barrier call with strike 90% and barrier 110% of the spot varies by 26 basis points.

Derivative pricing models Model choice: Assume a model θ; model value of a derivative V (θ). Calibration: Find θ s.t. V (θ ) = V the market price of liquid contracts. Pricing and hedging: Solve a pricing equation for a new (exotic) derivative, A(θ ) V (θ ) = 0; hedge with sensitivities derived from V (θ ).

An ill-posed problem Remidies for this model ambiguity. Regularisation: market fit(θ) + regularity measure(θ) min θ Worst-case replication approach: sup A(θ)V (θ) = 0, s.t. V (θ) = V for calibration products θ Bayesian framework: prior information encapsulated in p(θ) likelihood of market prices p(v θ) posterior distribution p(θ V )

6 Motivation Model ambiguity and over-parametrisation lead to uncertainty in the pricing model and the need to quantify and risk-manage the resulting risk. A Bayesian perspective seems well-suited to these objectives. It combines prior and historical information ( regularisation ) with currently observed prices ( calibration ). Consistency guarantees that parameter estimates are not led astray by prior assumptions.

7 Outline Calibration problems in financial engineering and their ill-posedness Bayesian approach to the calibration problem Consistency of Bayesian estimators Practical construction of posteriors and examples Related work: measuring model uncertainty, robust hedging Conclusions

8 Model for underlying Assume price process S = (S t ) t 0 s.t. (by abuse of notation) S t = S(t, (Z u ) 0 u t, θ) a function of time t, some standard process Z = (Zt ) t 0, and parameter(s) θ Θ. Assume henceforth that θ is a finite dimensional vector: Θ R M. We are specifically interested in applications where this parameter is the discretisation of a functional parameter, for example representing a local volatility function.

9 Model price Now consider an option over a finite time horizon [0, T ] written on S and with payoff function h, and the time t value of this option written as f t (θ) = E Q [B(t, T )h(s(θ)) F t ] with respect to some risk-neutral measure Q, where B(t, T ) is the discount factor for the time interval [t, T ].

10 Observations Denote θ the true parameter. Suppose at time t [0, T ] we observe a set of such option prices {f (i) t (θ) : i I t }, with additive noise {e (i) t : i I t }, i.e. we observe V (i) t = f (i) t (θ ) + e (i) t. The calibration problem is to find the value of θ that best reproduces the observed prices V = {V (i) t : i I t, t Υ n ([0, T ])}. Here Υ n ([0, T ]) = {t 1,..., t n : 0 = t 1 < t 2 <... < t n T } is a partition of the interval [0, T ] into n parts.

11 Bayesian framework Assume we have some prior information for θ, e.g. it belongs to a particular subspace of the parameter space, or is positive, or represents a smooth function, summarised by a prior density p(θ) for θ. p(v θ) is the likelihood of observing the data V given θ. Bayes rule gives the posterior density of θ, p(θ V ) = p(v θ) p(θ), p(v ) where p(v ) is given by p(v ) = p(v θ) p(θ) dθ.

Bayesian literature Consistency of Bayesian estimators: Doob (1953), Schwartz (1965) Le Cam (1953): relation to maximum likelihood estimators Fitzpatrick (1991): relation to regularisation Wasserman (1998 ), Barron, Schervish, and Wasserman (1999), Shen and Wasserman (2001), Goshal (1998), Goshal, Gosh, and van der Vaart (2000): properties, convergence rates All assume i.i.d. data. Here: observations of different functions of the parameter.

13 Example Black-Scholes model with σ = 0.2; observe prices each week for the first 52 weeks of a two year at-the-money call option; S 0 = 100 and the interest rate r = 0.05, s.t. f 0 (σ ) = 16.13; uniform prior p(σ) on [0.18,0.22]; mean-zero Gaussian noise e t of standard deviation 5% of the true option price, i.e. e t N(0, 1 20 f t(σ )). See also Jacquier and Jarrow (2000).

14 Example 2500 2000 n=0 n=1 n=4 n=12 n=26 n=52 1500 p(! V) 1000 500 0 0.18 0.19 0.2 0.21 0.22! Posterior densities after n observations. Notice that most of the probability measure collects around the true value of σ = 0.2.

Convergence in probability Assumptions on the prior: The prior p has compact support Θ, p is bounded, continuous at θ (true parameter) with p(θ ) > 0. Assumptions on the observations: F tn G tm for all (n, m), i.e. the driving process of the underlying is independent from the market noise, Gaussian noise with variance ɛ 2 t, and t, θ θ Θ 1 ε t f t(θ) f t(θ ) θ θ k > 0. Then: θ n (V) := θ F tn G tn P θ.

16 Estimators A function L : R 2M R is a loss function L(θ, θ ) iff { L(θ, θ ) = 0 if θ = θ R M L(θ, θ ) > 0 if θ θ. The corresponding Bayes estimator θ L (V ) is { } θ L (V ) = arg min L(θ, θ ) p(θ V ) dθ. θ Θ Θ Examples: L1 (θ, θ ) = θ θ 2 gives Bayes estimator θ L1 (Y ) = E[θ V ] (the mean value of θ with respect to the Bayesian posterior density p(θ V )) θmap (V ) = arg max{p(θ V )}, the maximum a posteriori (MAP) estimator

17 Consistency result p(θ n (V )), the posterior density of θ after n observations, is p(θ n (V )) = p n(v θ) p(θ) p n (V ) = t Υ n 1 2πεt exp = p(v t 1 θ)... p(v tn θ) p(θ) p n (V ) { 1 (V 2ε 2 t f t (θ)) 2} p(θ) t p n(v ). Define the sequence of Bayes estimators ˆθ by, g(θ n (V ), θ ) = E[L(θ n (V ), θ )] = L(θ, θ ) p n (θ V ) dθ ˆθ n (V ) = arg min θ Θ {g(θ n(v ), θ )}. Then, under the assumptions from earlier, and for L bounded and continuous on Θ, ˆθ n (V ) is consistent. Θ

18 Multiple observations Suppose multiple observations f (i) t per time, i I t, with similar assumptions as above for all i. Deduce the Bayes estimator ˆθ n (V ) is consistent. Speeds up convergence. Taken to the extreme, can construct a consistent estimator by gathering a large number of observations of different functions (options with different strikes, maturities) of θ at time 0. We give an example of this later.

19 Non-scalar parameter Take the case when θ is not scalar but a finite-dimensional parameter, θ R M. Replace the monotonicity assumption on the observations by: K > k > 0 θ Θ K 2 1 n 1 f t (θ) f t (θ ) 2 ε 2 t Υ t θ θ 2 k 2 n For all L bounded and continuous on θ, the non-scalar Bayes estimator ˆθ n (V ) is consistent.

20 Discussion of assumptions Let f t (θ) be smooth in t and θ, and ɛ t = ɛ constant. Then the above assumption can only be violated if either 1. θ θ t f t (θ) = f t (θ ), or 2. θ θ t (θ θ ) θ f t (θ ) = 0. 1. Under 1., it is clearly impossible to identify which parameter gave rise to the observations. 2. Under 2., perturbations of the parameter in directions orthogonal to the gradient are overshadowed by the noise. This confirms an intuitive rule for a good choice of observation variables (calibration products) as those which are most sensitive to the parameters.

21 Local volatility The (discretised) local volatility model is a good example: Complete market model. Used by traders in some markets. Large (infinite) number of parameters. Ill-conditioned (ill-posed) calibration. Dynamically inconsistent.

22 Inverse problems literature Identification of local volatility: [Dupire (1994)] Lagnado and Osher (1997) Jackson, Süli, and Howison (1999) Chiarella, Craddock, and El-Hassan (2000) Coleman, Li, and Verma (2001) Berestycki, Busca, and Florent (2002) Egger and Engl (2005) Achdou and Pironneau (2004) Zubelli, Scherzer, and De Cezaro (2010)

23 Bayesian prior We incorporate: positivity the a-t-m vol smoothness Use the natural Gaussian prior p(θ) exp { 1 λ θ 2 θ 0 2} 1/ λ can be thought of as the prior variance of θ Example: p lv (σ) exp { 1 2 λ p log(σ) log(σ atm ) 2 } κ where u 2 κ = (1 κ) u 2 2 + κ u 2 2

24 Likelihood Recall V (i) t the market observed price at t of a European call with strike K i, maturity T i ; f (i) t (θ) the theoretical price when the model parameter is θ; define the basis point square-error function as G t (θ) = 108 S 2 t define δ i = 104 S 0 V (i)ask t i I w i f (i) t (θ) V (i) t 2 V (i) t = 1 (i)bid 2 (V t + V (i)ask t ); V (i)bid t a basis point bid-ask spread. As in Hamida and Cont (2005) demand G(θ) δ 2, then p(v θ) 1 G(θ) δ 2 exp { 1 2δ 2 G(θ) }.

25 Construction of posterior Then the posterior is p(θ V ) 1 G(θ) δ 2 exp { 1 2δ 2 [ λ θ θ0 2 + G(θ) ]}. Note: maximising the posterior is equivalent to specific Tikhonov regularisations (e.g. Fitzpatrick (1991)).

26 Two datasets 1. Simulated data-set: We price European calls with 11 strikes and 6 maturities on the surface given in Jackson, Süli and Howison (1999). Similar to there, we take S0 = 5000, r = 0.05, d = 0.03. To each of the prices we add Gaussian noise with mean zero and standard deviation 0.1% as in Hamida and Cont (2005) and treat these as the observed prices. We take the calibration error acceptance level as δ = 3 basis points following the results of Jackson et al (1999). 2. Market data: We take real S&P 500 implied volatility data used in Coleman, Li and Verma (2001) to price corresponding European calls. 70 European call prices are calculated from implied volatilities with 10 strikes and 7 maturities. Spot price of the underlying at time 0 is S 0 = $590, interest rate is r = 0.060 and dividend rate is d = 0.026.

27 Parameter discretisation 1. For the first example, we take grid nodes s = 2500, 4500, 4750, 5000, 5250, 5500, 7000, 10000, t = 0.0, 0.5, 1.0, so a total of M = 27 parameters (cf 66 calibration prices). 2. For the second example, s = 300, 500, 560, 590, 620, 670, 800, 1200, t = 0.0, 0.5, 1.0, 2.0, so a total of M = 32 parameters (cf 70 calibration prices). Interpolate with cubic splines in S, linear in t.

28 Computational issues Sample from the posterior using Markov Chain Monte Carlo, see e.g. Beskos and Stuart (2009): 1. Select a starting point θ 0 for which g(θ 0 V ) > 0. 2. For r = 1,..., n, sample a proposal θ # from a symmetric jumping distribution J(θ # θ r 1 ) and set { { } θ θ r = # with probability min g(θ # V ) g(θ r 1 V ), 1 otherwise. θ r 1 Then the sequence of iterations θ 1,..., θ n converges to the target distribution g(θ V ). Speed up by thinning, and eliminate burn-in. Monitor potential scale reduction factor for convergence.

Sampling the posterior For the simulated dataset: 479 surfaces sampled from the posterior distribution, the true surface in opaque black. 29

30 Pointwise confidence intervals For the simulated dataset: 95% and 68% pointwise confidence intervals for volatility of paths, the true surface in opaque black.

Re-calibration Now a path is simulated on the true local volatility surface and the Bayesian posterior is updated using the newly observed prices each week for 12 weeks (plotted: weeks 3,6,9,12). The transparency of each surface reflects the Bayesian weight of the surface. 31

32 Pricing a barrier option 0.25 0.2 pdf Bayes MAP true bid ask posterior probability 0.15 0.1 0.05 0 80 82 84 86 88 90 92 94 96 98 100 102 price For simulated dataset: prices for up-and-out barrier calls with strike 5000 (S 0 = 5000), barrier 5500, maturity 3 months. Included are the true price with its bid-ask spread, the MAP price, and the Bayes price with its associated posterior pdf.

For the simulated dataset: prices for American puts with strike 5000 (S 0 = 5000) and maturity 1 year. Included are the true price with its bid-ask spread, the MAP price, and the Bayes price with its associated posterior pdf. 33 Pricing an American option 0.25 0.2 pdf Bayes MAP true bid ask posterior probability 0.15 0.1 0.05 0 116 118 120 122 124 126 128 price

Market data For S&P 500 dataset: using Metropolis sampling, 600 surfaces from the posterior distribution. 34

35 Pricing an American option 0.18 0.16 pdf Bayes MAP 0.14 0.12 posterior probability 0.1 0.08 0.06 0.04 0.02 0 11.8 12 12.2 12.4 12.6 12.8 13 13.2 13.4 13.6 13.8 price For S&P 500 dataset: prices for American put option with strike $590 (S 0 = $590) and maturity 1 year. Included are the MAP price and the Bayes price with its associated posterior pdf of prices.

Model uncertainty measures Bayesian model uncertainty measures: Branger and Schlag (2004) Gupta and R. (2010) This is in contrast to worst-case measures: Price-based : Cont (2006) Risk-differencing : Kerkhof, Melenberg, Schumacher (2002) Hedging-based : uncertain parameter models, e.g. Avellaneda, Lévy, and Paras (1995)

37 Discussion and extensions Construction of Bayesian posteriors using prior information and market data Consistency would also like negative result Gives model uncertainty measures Potentially useful for robust hedging