Econ 582 Nonlinear Regression

Similar documents
Amath 546/Econ 589 Univariate GARCH Models

12 The Bootstrap and why it works

Parameterized Expectations

GMM for Discrete Choice Models: A Capital Accumulation Application

A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples

1 The Solow Growth Model

Lecture Quantitative Finance Spring Term 2015

Log-Robust Portfolio Management

Infinite Reload Options: Pricing and Analysis

Lecture 17: More on Markov Decision Processes. Reinforcement learning

1 Residual life for gamma and Weibull distributions

LECTURE NOTES 3 ARIEL M. VIALE

Chapter 7 One-Dimensional Search Methods

DEPARTMENT OF ECONOMICS

Final Exam Suggested Solutions

Financial Econometrics

Log-linear Modeling Under Generalized Inverse Sampling Scheme

CS 3331 Numerical Methods Lecture 2: Functions of One Variable. Cherung Lee

Agricultural and Applied Economics 637 Applied Econometrics II

Computational Finance Least Squares Monte Carlo

Australian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model

Func%on Approxima%on. Pieter Abbeel UC Berkeley EECS

arxiv: v1 [math.st] 18 Sep 2018

Solving dynamic portfolio choice problems by recursing on optimized portfolio weights or on the value function?

Solution of Equations

Investing through Economic Cycles with Ensemble Machine Learning Algorithms

INTERTEMPORAL ASSET ALLOCATION: THEORY

EC316a: Advanced Scientific Computation, Fall Discrete time, continuous state dynamic models: solution methods

Outline. 1 Introduction. 2 Algorithms. 3 Examples. Algorithm 1 General coordinate minimization framework. 1: Choose x 0 R n and set k 0.

12. Conditional heteroscedastic models (ARCH) MA6622, Ernesto Mordecki, CityU, HK, 2006.

Introduction to Numerical Methods (Algorithm)

Markov Decision Processes II

1 Explicit Euler Scheme (or Euler Forward Scheme )

R&D, International Sourcing and the Joint Impact on Firm Performance: Online Appendix

Chapter 7: Estimation Sections

Budget Management In GSP (2018)

GLOBAL CONVERGENCE OF GENERAL DERIVATIVE-FREE TRUST-REGION ALGORITHMS TO FIRST AND SECOND ORDER CRITICAL POINTS

Sequential Decision Making

Lecture 8: Linear Prediction: Lattice filters

Econ 424/CFRM 462 Portfolio Risk Budgeting

Web-based Supplementary Materials for. A space-time conditional intensity model. for invasive meningococcal disease occurence

Monte-Carlo Methods in Financial Engineering

1 Explicit Euler Scheme (or Euler Forward Scheme )

Comparing the Means of. Two Log-Normal Distributions: A Likelihood Approach

Chapter 6 Forecasting Volatility using Stochastic Volatility Model

Problem set 5. Asset pricing. Markus Roth. Chair for Macroeconomics Johannes Gutenberg Universität Mainz. Juli 5, 2010

A Two-Step Estimator for Missing Values in Probit Model Covariates

Numerical simulations of techniques related to utility function and price elasticity estimators.

Overview of Structural Estimation

What can we do with numerical optimization?

Exercise. Show the corrected sample variance is an unbiased estimator of population variance. S 2 = n i=1 (X i X ) 2 n 1. Exercise Estimation

Trade and Labor Market: Felbermayr, Prat, Schmerer (2011)

Chapter 3: Black-Scholes Equation and Its Numerical Evaluation

Estimating Term Structure of U.S. Treasury Securities: An Interpolation Approach

VARIANCE ESTIMATION FROM CALIBRATED SAMPLES

1 Consumption and saving under uncertainty

GMM Estimation. 1 Introduction. 2 Consumption-CAPM

Technical Report Doc ID: TR April-2009 (Last revised: 02-June-2009)

Statistical Models and Methods for Financial Markets

Estimation of a parametric function associated with the lognormal distribution 1

Monetary Economics Final Exam

X ln( +1 ) +1 [0 ] Γ( )

Growth model with Epstein-Zin preferences and stochastic volatility

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior

Mengdi Wang. July 3rd, Laboratory for Information and Decision Systems, M.I.T.

Estimating Pricing Kernel via Series Methods

Extended DSGE Model of the Czech Economy

Making Complex Decisions

Amath 546/Econ 589 Univariate GARCH Models: Advanced Topics

Trust Region Methods for Unconstrained Optimisation

Introduction to Computational Finance and Financial Econometrics Introduction to Portfolio Theory

A distributed Laplace transform algorithm for European options

Seminar on Public Finance

A Non-Parametric Technique of Option Pricing

Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty

A Stochastic Levenberg-Marquardt Method Using Random Models with Application to Data Assimilation

SYLLABUS AND SAMPLE QUESTIONS FOR MSQE (Program Code: MQEK and MQED) Syllabus for PEA (Mathematics), 2013

Vladimir Spokoiny (joint with J.Polzehl) Varying coefficient GARCH versus local constant volatility modeling.

The Design of Optimal Education Policies

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29

Chapter 4: Asymptotic Properties of MLE (Part 3)

Financial Econometrics Jeffrey R. Russell. Midterm 2014 Suggested Solutions. TA: B. B. Deng

Regression Review and Robust Regression. Slides prepared by Elizabeth Newton (MIT)

Financial Optimization ISE 347/447. Lecture 15. Dr. Ted Ralphs

Chapter 8: CAPM. 1. Single Index Model. 2. Adding a Riskless Asset. 3. The Capital Market Line 4. CAPM. 5. The One-Fund Theorem

Tangent Lévy Models. Sergey Nadtochiy (joint work with René Carmona) Oxford-Man Institute of Quantitative Finance University of Oxford.

ECON 4325 Monetary Policy and Business Fluctuations

A THREE-FACTOR CONVERGENCE MODEL OF INTEREST RATES

Machine Learning for Quantitative Finance

Multi-armed bandits in dynamic pricing

Introduction Dickey-Fuller Test Option Pricing Bootstrapping. Simulation Methods. Chapter 13 of Chris Brook s Book.

Lecture 5: Iterative Combinatorial Auctions

As an example, we consider the following PDE with one variable; Finite difference method is one of numerical method for the PDE.

Experience with the Weighted Bootstrap in Testing for Unobserved Heterogeneity in Exponential and Weibull Duration Models

1. You are given the following information about a stationary AR(2) model:

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS001) p approach

Chapter 5 Finite Difference Methods. Math6911 W07, HM Zhu

4 Reinforcement Learning Basic Algorithms

Systems of Ordinary Differential Equations. Lectures INF2320 p. 1/48

A NEW APPROACH TO MERTON MODEL DEFAULT AND PREDICTIVE ANALYTICS WITH APPLICATIONS TO RECESSION ECONOMICS TOMMY LEWIS

Transcription:

Econ 582 Nonlinear Regression Eric Zivot June 3, 2013

Nonlinear Regression In linear regression models = x 0 β (1 )( 1) + [ x ]=0 [ x = x] =x 0 β = [ x = x] [ x = x] x = β it is assumed that the regression function (x) = [ x = x] =x 0 β is a linear (in x) function of the 1 vector β. In parametric nonlinear regression, the regression function (x θ) is a nonlinear function of parameters θ = (x θ)+ [ x = x] = (x θ) = [ x = x]

Examples of nonlinear regression functions ( θ) = 1 + 2 =1 =3 1+ 3 ( θ) = 1 + 2 3 =1 =3 ( θ) = 1 + 2 exp( 3 ) =1 =3 ( θ) = 1 + 2 + 3 ( 4 )1 ( 4 ) (x θ) = (x 0 θ) known, = Ã! (x θ) = θ 0 1 x 1 +(θ 0 2 x 2 3 1)Φ (x θ) = ³ θ 0 1 x 1 1( 2 3 )+(θ 0 2 x 1)1 ( 2 3 ) 4

Remarks Typically (x θ) is a continuous and differentiable function of θ In the switching examples with the indicator function, (x θ) is not differentiable in θ The form of (x θ) is sometimes motivated by economic theory (e.g. cost function estimation) Sometimes the form of (x θ) is adopted as a flexible approximation to an unknown regression function (e.g. (x θ) =polynomial in x)

Nonlinear Least Squares Estimation = ( x θ )+ ( 1) ( 1) Example (Cobb-Douglas production function) = 1 2 1 3 2 + = 2 =3 The nonlinear least squares estimator solves min (θ) = ( (x θ)) 2 = 2

Assume (x θ) is a continuous and differentiable function of θ The FOCs for a minimum are where Note 2 (θ) = 2 = 2 m θ (x θ) 1 ( (x ˆθ)) (x ˆθ) ( (x ˆθ))m θ (x ˆθ) =0 = (x θ) = ( (x ˆθ))m θ (x ˆθ) = 2 (x ) 1. (x ) ˆ m θ (x ˆθ)

Matrix Notation y 1 = 1. x 0 1 m(x θ) 1 = (x 1 θ). (x θ) 1 X = ε =. x 0 1 (θ) = (y m(x θ)) 0 (y m(x θ)) m θ (X θ) = m(x θ) 0 = (x 1 ) (x 1 ) 1..... (x ) 1 (x )

FOCs (θ) = 2 m(x ˆθ) 0 0 (y m(x ˆθ)) = 2m θ (X ˆθ) 0 (y m(x ˆθ)) = 2m θ (X ˆθ) 0ˆε = 0 Note: In general we have nonlinear equations in unknown and there is no analytical solution. Hence, ˆθ must be found numerically using an iterative algorithm. The most commonly used algorithm is Gauss-Newton iteration.

Gauss-Newton (GM) Algorithm The GN algorithm can be motivated as follows. Consider a 1st order Taylor series approximation to m(x θ) at θ = θ 1 (starting value) m(x θ) ( 1) = m(x θ 1 ) ( 1) + m θ (X θ 1 )(θ θ 1 ) ( ) ( 1) + error Approximate the nonlinear regression using the TS approximation y = m(x θ)+ε m(x θ 1 )+m θ (X θ 1 )(θ θ 1 )+ε

Using m(x θ 1 )+m θ (X θ 1 )(θ θ 1 ) = [m(x θ 1 ) m θ (X θ 1 )θ 1 ]+m θ (X θ 1 )θ rewrite the model as y m(x θ 1 )+m θ (X θ 1 )θ 1 = m θ (X θ 1 )θ + ε or ȳ(θ 1 ) = m θ (X θ 1 )θ + ε ȳ(θ 1 ) = y m(x θ 1 )+m θ (X θ 1 )θ 1 This approximate model is linear in θ

Estimate the approximate linear model by least squares min θ 1 (θ) = (ȳ(θ 1 ) m θ (X θ 1 )θ) 0 (ȳ(θ 1 ) m θ (X θ 1 )θ) θ 2 = ³ m θ (X θ 1 ) 0 m θ (X θ 1 ) 1 mθ (X θ 1 ) 0 ȳ(θ 1 ) Then repeat estimation of approximate linear model using updated estimate θ 2 min θ 2 (θ) = (ȳ(θ 2 ) m θ (X θ 2 )θ) 0 (ȳ(θ 2 ) m θ (X θ 2 )θ) θ 3 = ³ m θ (X θ 2 ) 0 m θ (X θ 2 ) 1 mθ (X θ 2 ) 0 ȳ(θ 2 )

At iteration we have θ +1 = ³ m θ (X θ ) 0 m θ (X θ ) 1 mθ (X θ ) 0 ȳ(θ ) Substituting in ȳ(θ )=y m(x )+m θ (X θ )θ we have θ +1 = ³ m θ (X θ ) 0 m θ (X θ ) 1 mθ (X θ ) 0 [y m(x θ )+m θ (X θ )θ ] = ³ m θ (X θ ) 0 m θ (X θ ) 1 mθ (X θ ) 0 [y m(x θ )] + ³ m θ (X θ ) 0 m θ (X θ ) 1 mθ (X θ ) 0 m θ (X θ )θ = θ + ³ m θ (X θ ) 0 m θ (X θ ) 1 mθ (X θ ) 0 [y m(x θ )]

Note: Using (θ ) θ +1 = θ 1 2 = 2m θ (X θ ) 0 [y m(x θ )] we have ³ mθ (X θ ) 0 m θ (X θ ) 1 (θ ) Provided that m θ (X θ ) 0 m θ (X θ ) is pd m θ (X θ ) is full rank Then the FOCs are satisfied if θ +1 θ That is, (θ ) = m θ (X θ ) 0 [y m(x θ )] 0

Common Convergence Criteria Stop when θ +1 θ 10 6 x = ³ 2 1 + + 2 1 2 To avoid issues with the units of θ it is better to stop when θ +1 θ θ + 10 5 Stop when (θ )

Stop when (θ +1 ) (θ ) (θ )+

Remarks The solution to the FOCs can be a local minimum, local maximum or a global minimum. The GN iteration scheme always leads in the direction of a minimum instead of a maximum provided m θ (X θ ) 0 m θ (X θ ) is pd θ +1 = θ + ³ m θ (X θ ) 0 m θ (X θ ) 1 mθ (X θ ) 0 [y m(x θ )] = θ 1 ³ mθ (X θ ) 0 (θ ) m 2 θ (X θ ) 1 If (θ ) 0 then θ +1 θ ; if (θ ) 0 then θ +1 θ

The GN iteration scheme can overshoot the global minimum. To guard against this a step-length correction, is often added to the algorithm θ +1 = θ ³ mθ (X θ ) 0 m θ (X θ ) 1 (θ ) where is chosen such that (θ +1 ) (θ ) To guard against getting stuck at a local minimum it is often suggested that different starting values be used.

Asymptotic Distribution of NLS estimator (Homoskedastic Case) = (x θ)+ [ 2 ]= 2 Consider the linear approximation evaluated at the true value of θ (θ) =m θ (x θ) 0 θ+ Assuming 1 1 m θ (x θ) 0 m θ (x θ)m θ (x θ) 0 h mθ (x θ)m θ (x θ) 0i = M θθ 1 X m θ (x θ) (0 2 M θθ ) Then (ˆθ θ) (0 2 M 1 θθ )

Equivalently, ˆθ Theasymptoticvariance 2 M 1 θθ with ˆ 2 = cm θθ = 1 1 µ 1 2 M 1 θθ can be consistently estimated using ˆ 2 c M 1 θθ ˆ 2 m θ (x ˆθ)m θ (x ˆθ) 0

Asymptotic Distribution of NLS estimator (Heteroskedastic Case) = (x θ)+ [ 2 x ]= 2 (x ) Then (ˆθ θ) (0 M 1 V = h m θ (x θ)m θ (x θ) 0 2 θθ VM 1 i θθ ) The matrix V can be consistently estimated using the White-type HC estimator ˆV = 1 m θ (x ˆθ)m θ (x ˆθ) 0ˆ 2