Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University

Size: px
Start display at page:

Download "Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University"

Transcription

1 Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University Possible Model SBMD i,j is spinal bone mineral density on ith subject at age equal to age i,j lide January 24 Joint work with Babette Brumback, Ray Carroll, Brent Coull, Ciprian Crainiceanu, Matt Wand, Yan Yu, and others Slide 3 SBMD i,j = U i + m(age i,j ) + ɛ i,j, i =,, m = 23, j = i,, n i U i is the random intercept for subject i {U i } are assumed iid N(, σ 2 U ) Example (data from Hastie and James, this analysis in RWC) lide 2 spinal bone mineral density Slide 4 Underlying philosophy minimalist statistics keep it as simple as possible 2 build on classical parametric statistics 3 modular methodology age (years)

2 3 2 lide 5 Reference Semiparametric Regression by Ruppert, Wand, and Carroll (23) Slide 7 IQ 9 Quadratic Lots of examples from biostatistics 8 7 Spline lead (microgram/deciliter) Thanks to Rich Canfield for data and estimates Recent Example April 7, 23 Canfield et al (23) Intellectual impairment and blood lead Semiparametric regression Partial linear or partial spline model: Y i = W T i β W + m(x i ) + ɛ i lide 6 longitudinal (mixed model) nine covariates (modelled linearly) effect of lead modelled as a spline (semiparametric model) disturbing conclusion Slide 8 Eg, m(x) = X T i β X + B T (x)b B T (x) = ( B (x) B K (x) ) X T i = ( X i X p i ) B T (x) = { (x κ ) p + (x κ K ) p + }

3 Fitting LIDAR data with plus functions lide 9 Example m(x) = β + β x + b (x κ ) b K (x κ K ) + slope jumps by b k at κ k Slide log ratio range Linear plus function 2 plus fn derivative Generalization ide 2 8 Slide 2 m(x) = β +β x+ +β p x p +b (x κ ) p ++ +b K (x κ K ) p + pth derivative jumps by p! b k at κ k 6 4 first p derivatives are continuous

4 Quadratic plus function ide plus fn derivative 2nd derivative Slide 5 Penalized least-squares Minimize n { Y (W T i β W + X T i β X + B T (X i )b) } 2 + λ b T Db i= Eg, D = I Ordinary Least Squares Penalized Least Squares Raw Data 2 knots 3 knots 5 knots Raw Data 2 knots 3 knots 5 knots ide Slide knots 2 knots 5 knots knots knots 2 knots 5 knots knots

5 Ridge Regression From previous slides: From previous slide: n { Y (W T i β W + X T i β X + B T (X i )b) } 2 + λ b T Db i= Let X have row ( Wi T X T i B T (X i ) ) Then β W = { X T X + λ blockdiag(,, D) } X T Y β X b ide 7 Let X have row ( Wi T X T i B T (X i ) ) Then β W = { X T X + λ blockdiag(,, D) } X T Y β X b Also, a BLUP in a mixed model and an empirical Bayes estimator Slide 9 Linear mixed model: ( ) ( ) β X T X X T Z ( ) X T Y = b Z T X Z T Z + λσ b Z T Y { = ( X Z ) T ( X Z ) + λ blockdiag(, Σ b )} ( X Z ) T Y Linear Mixed Models Y = Xβ + Zb + ε ide 8 where b is N(, σ 2 b Σ b ) Xβ are the fixed effects and Zb are the random effects Henderson s equations ( ) ( β X T X X T Z = b Z T X Z T Z + λσ b ) ( ) X T Y Z T Y Slide 2 Selecting λ cross-validation (CV) 2 generalized cross-validation (GCV) 3 ML or REML in mixed model framework λ = σ2 ɛ σb 2

6 Selecting the Number of Knots (a) SpaHet, j = 3, typical data set 5 5 (b) MASE comparisons x 4 ide 2 y 5 5 True full search relative MASE 5 fixed nknots myopic full search K 25 Slide 23 MSE 2 Variance MSE frequency number of knots (coded) ASE K=4 n = ASE K=5 Bias df (λ) fit Optimal n =,, 2 knots, quadratic spline (a) SpaHetLS, j = 3, n = 2, 5 5 (b) MASE comparisons Return to spinal bone mineral density study ide 22 y frequency True full search number of knots (coded) relative MASE 5 fixed nknots myopic full search ASE K=4 n = 2, 5 x 3 5 K 5 5 ASE K=5 x 3 Slide 24 spinal bone mineral density age (years) SBMD i,j = U i + m(age i,j ) + ɛ i,j, i =,, m = 23, j = i,, n i

7 ide 25 X = age age n age m age mnm ide 26 Z = (age κ ) + (age κ K ) + (age n κ ) + (age n κ K ) + (age m κ ) + (age m κ K ) + (age mnm κ ) + (age mnm κ K ) + Slide 27 u = U U m b b K Slide 28 age (years) spinal bone mineral density Variability bars on m and estimated density of U i

8 ide 29 spinal bone mineral density Broken down by ethnicity Hispanic Asian White Black Slide 3 Only requires an expansion of the fixed effects by adding the columns black hispanic white black hispanic white black m hispanic m white m age (years) black m hispanic m white m ide 3 Model with ethnicity effects SBMD ij = U i + m(age ij ) + β black i + β 2 hispanic i +β 3 white i + ε ij, j n i, i m Asian is the reference group Slide 32 contrast with Asian subjects 5 5 Black Hispanic White

9 ide 33 In this model, the age effects curve for the four ethnic groups are parallel Could we model them as non-parallel? Might be problematic in this example because of the small values of the n i Slide 35 Penalized Splines and Additive Models Additive model: Y i = m (X,i ) + + m P (X P,i ) + ɛ i But the methodology should be useful in other contexts Add interactions between age and black, hispanic, and white Bivariate additive spline model ide 34 These are fixed effects Then add interactions between black, hispanic, white, and asian and the linear plus functions in age These are mean-zero random effects with their own variance component This variance component control the amount of shrinkage of the enthicity-specific curves to the overall effect Slide 36 Y i = β +β x, X i + b x, (X i κ x, ) b x,k (X i κ x,kx ) + + β z, Z i + b z, (Z i κ z, ) b z,k (Z i κ z,kz ) + + ɛ i no need for backfitting computation very rapid no identifiability issues inference is simple

10 The Bias-Variance Trade-off and Confidence Bands lambda= lambda= ide 37 Bayesian methods The linear mixed model is half-bayesian The random effects have a prior The parameters without a prior are: fixed effects give them diffuse normal priors variance components give them diffuse inverse gamma priors Slide 39 log ratio -8-4 log ratio range lambda=3 log ratio -8-4 log ratio range lambda= range range Bayesian methods ide 38 Can be easily implemented in WinBUGS or programmed in, say, MATLAB Allows Bayes rather than empirical Bayes inference Uncertainty due to smoothing parameter selection is taken into account Slide 4 How does one adjust confidence intervals for bias? undersmooth so variance dominates and bias can be safetly ignored

11 x n=, 2 knots σ=3 Wahba/Nychka Bayesian Intervals [ ] u [ σ 2 u I ] ide 4 MSE MSE Slide 43 y = Xβ + Zu + ε, Cov ε = σ 2 εi, 5 5 Variance Bias 2 C = ( X Z ) β and ũ are BLUPs log(λ) 3 optimal 2 ide 42 Adjustment for bias continued estimate bias by a higher order method and subtract off bias (essentially the same as above) Wahba/Nychka Bayesian intervals bias is random so adds to posterior variance interval is widened but there is no offset Slide 44 Cov ([ β ũ ] u ) = σε(c 2 T C+ σ2 ε D) C T C(C T C+ σ2 σu 2 ε D) σu 2 (Frequentist variance Ignores bias) ([ ]) Cov β ũ u = σ 2 ε(c T C + σ2 ε σ 2 u D) (Bayesian posterior variance Takes bias into account)

12 Correction for measurement error ide 45 strontium ratio age (million years) Slide 47 Relatively little research in this area Fan and Truong (993): deconvolution kernels first work inefficient in finite-sample studies no inference strictly for -dimensional smoothing Carroll, Maca, Ruppert functional SIMEX methods and structural spline methods more efficient than Fan and Truong Effect of measurement error Berry, Carroll, and Ruppert (JASA, 22) fully Bayesian 2 smoothing or penalized splines ide 46 y Slide 48 rather efficient in finite-sample studies inference available scales up semiparametric inference is easy structural x plus error W = X + error and Var(X) = Var(error)

13 Berry, Carroll, and Ruppert Correction for measurement error starts with mixed-model spline formulation 8 but fully Bayesian conjugate priors ide 49 true covariates are iid normal but surprisingly robust Slide normal measurement error in Gibbs, only sampling of true (unknown) covariates requires a Hastings-Metropolis step Solid: true Dotted: uncorrected Dashed: corrected Effect of measurement error Measurement Error, continued Ganguli, Staudenmayer, Wand: ide 5 y Slide 52 EM maximum likelihood estimation in BCR model Works about as well as the fully Bayesian approach Extension to additive models x plus error W = X + error and Var(X) = Var(error)

14 Generalized Regression Extension to non-gaussian responses is conceptually easy ide 53 Get a GLLM However, GLIM s are not trivial Can use: Monte Carlo EM Or MCMC Single-Index Models Y i = g(x T i θ) + Z T i β + ɛ i Yu and Ruppert (22, JASA) ide 54 Let g(x) = γ + γ x + + γ p x p +c (x κ ) p c K (x κ K ) p + Becomes a nonlinear regression model Y i = m(x i, Z i, θ, β, γ, c) + ɛ i

Semiparametric Modeling, Penalized Splines, and Mixed Models

Semiparametric Modeling, Penalized Splines, and Mixed Models Semi 1 Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University http://wwworiecornelledu/~davidr January 24 Joint work with Babette Brumback, Ray Carroll, Brent Coull,

More information

Estimating the Interest Rate Term Structures of Treasury and Corporate Debt with Bayesian Penalized Splines

Estimating the Interest Rate Term Structures of Treasury and Corporate Debt with Bayesian Penalized Splines Journal of Data Science 3(25), 223-24 Estimating the Interest Rate Term Structures of Treasury and Corporate Debt with Bayesian Penalized Splines Min Li 1 and Yan Yu 2 1 California State University, Sacramento

More information

Machine Learning for Quantitative Finance

Machine Learning for Quantitative Finance Machine Learning for Quantitative Finance Fast derivative pricing Sofie Reyners Joint work with Jan De Spiegeleer, Dilip Madan and Wim Schoutens Derivative pricing is time-consuming... Vanilla option pricing

More information

Computational Statistics Handbook with MATLAB

Computational Statistics Handbook with MATLAB «H Computer Science and Data Analysis Series Computational Statistics Handbook with MATLAB Second Edition Wendy L. Martinez The Office of Naval Research Arlington, Virginia, U.S.A. Angel R. Martinez Naval

More information

1. You are given the following information about a stationary AR(2) model:

1. You are given the following information about a stationary AR(2) model: Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4

More information

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is: **BEGINNING OF EXAMINATION** 1. You are given: (i) A random sample of five observations from a population is: 0.2 0.7 0.9 1.1 1.3 (ii) You use the Kolmogorov-Smirnov test for testing the null hypothesis,

More information

Calibration of Interest Rates

Calibration of Interest Rates WDS'12 Proceedings of Contributed Papers, Part I, 25 30, 2012. ISBN 978-80-7378-224-5 MATFYZPRESS Calibration of Interest Rates J. Černý Charles University, Faculty of Mathematics and Physics, Prague,

More information

Application of MCMC Algorithm in Interest Rate Modeling

Application of MCMC Algorithm in Interest Rate Modeling Application of MCMC Algorithm in Interest Rate Modeling Xiaoxia Feng and Dejun Xie Abstract Interest rate modeling is a challenging but important problem in financial econometrics. This work is concerned

More information

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018

Subject CS1 Actuarial Statistics 1 Core Principles. Syllabus. for the 2019 exams. 1 June 2018 ` Subject CS1 Actuarial Statistics 1 Core Principles Syllabus for the 2019 exams 1 June 2018 Copyright in this Core Reading is the property of the Institute and Faculty of Actuaries who are the sole distributors.

More information

Introduction to Sequential Monte Carlo Methods

Introduction to Sequential Monte Carlo Methods Introduction to Sequential Monte Carlo Methods Arnaud Doucet NCSU, October 2008 Arnaud Doucet () Introduction to SMC NCSU, October 2008 1 / 36 Preliminary Remarks Sequential Monte Carlo (SMC) are a set

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections 1 / 40 Chapter 7: Estimation Sections 7.1 Statistical Inference Bayesian Methods: Chapter 7 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods:

More information

2D penalized spline (continuous-by-continuous interaction)

2D penalized spline (continuous-by-continuous interaction) 2D penalized spline (continuous-by-continuous interaction) Two examples (RWC, Section 13.1): Number of scallops caught off Long Island Counts are made at specific coordinates. Incidence of AIDS in Italian

More information

Chapter 8: Sampling distributions of estimators Sections

Chapter 8: Sampling distributions of estimators Sections Chapter 8 continued Chapter 8: Sampling distributions of estimators Sections 8.1 Sampling distribution of a statistic 8.2 The Chi-square distributions 8.3 Joint Distribution of the sample mean and sample

More information

Ridge, Bayesian Ridge and Shrinkage

Ridge, Bayesian Ridge and Shrinkage Readings Chapter 15 Christensen Merlise Clyde October 1, 2015 Ridge Trace t(x$coef) 2 0 2 4 6 8 0.00 0.02 0.04 0.06 0.08 0.10 x$lambda Generalized Cross-validation > select(lm.ridge(employed ~., data=longley,

More information

Statistical Models and Methods for Financial Markets

Statistical Models and Methods for Financial Markets Tze Leung Lai/ Haipeng Xing Statistical Models and Methods for Financial Markets B 374756 4Q Springer Preface \ vii Part I Basic Statistical Methods and Financial Applications 1 Linear Regression Models

More information

CS340 Machine learning Bayesian model selection

CS340 Machine learning Bayesian model selection CS340 Machine learning Bayesian model selection Bayesian model selection Suppose we have several models, each with potentially different numbers of parameters. Example: M0 = constant, M1 = straight line,

More information

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29 Chapter 5 Univariate time-series analysis () Chapter 5 Univariate time-series analysis 1 / 29 Time-Series Time-series is a sequence fx 1, x 2,..., x T g or fx t g, t = 1,..., T, where t is an index denoting

More information

Window Width Selection for L 2 Adjusted Quantile Regression

Window Width Selection for L 2 Adjusted Quantile Regression Window Width Selection for L 2 Adjusted Quantile Regression Yoonsuh Jung, The Ohio State University Steven N. MacEachern, The Ohio State University Yoonkyung Lee, The Ohio State University Technical Report

More information

(5) Multi-parameter models - Summarizing the posterior

(5) Multi-parameter models - Summarizing the posterior (5) Multi-parameter models - Summarizing the posterior Spring, 2017 Models with more than one parameter Thus far we have studied single-parameter models, but most analyses have several parameters For example,

More information

Quantitative Risk Management

Quantitative Risk Management Quantitative Risk Management Asset Allocation and Risk Management Martin B. Haugh Department of Industrial Engineering and Operations Research Columbia University Outline Review of Mean-Variance Analysis

More information

Bayesian Inference for Random Coefficient Dynamic Panel Data Models

Bayesian Inference for Random Coefficient Dynamic Panel Data Models Bayesian Inference for Random Coefficient Dynamic Panel Data Models By Peng Zhang and Dylan Small* 1 Department of Statistics, The Wharton School, University of Pennsylvania Abstract We develop a hierarchical

More information

2 Control variates. λe λti λe e λt i where R(t) = t Y 1 Y N(t) is the time from the last event to t. L t = e λr(t) e e λt(t) Exercises

2 Control variates. λe λti λe e λt i where R(t) = t Y 1 Y N(t) is the time from the last event to t. L t = e λr(t) e e λt(t) Exercises 96 ChapterVI. Variance Reduction Methods stochastic volatility ISExSoren5.9 Example.5 (compound poisson processes) Let X(t) = Y + + Y N(t) where {N(t)},Y, Y,... are independent, {N(t)} is Poisson(λ) with

More information

Practice Exam 1. Loss Amount Number of Losses

Practice Exam 1. Loss Amount Number of Losses Practice Exam 1 1. You are given the following data on loss sizes: An ogive is used as a model for loss sizes. Determine the fitted median. Loss Amount Number of Losses 0 1000 5 1000 5000 4 5000 10000

More information

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties Posterior Inference Example. Consider a binomial model where we have a posterior distribution for the probability term, θ. Suppose we want to make inferences about the log-odds γ = log ( θ 1 θ), where

More information

Market Risk Analysis Volume I

Market Risk Analysis Volume I Market Risk Analysis Volume I Quantitative Methods in Finance Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume I xiii xvi xvii xix xxiii

More information

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior (5) Multi-parameter models - Summarizing the posterior Models with more than one parameter Thus far we have studied single-parameter models, but most analyses have several parameters For example, consider

More information

STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved.

STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved. STAT 509: Statistics for Engineers Dr. Dewei Wang Applied Statistics and Probability for Engineers Sixth Edition Douglas C. Montgomery George C. Runger 7 Point CHAPTER OUTLINE 7-1 Point Estimation 7-2

More information

Idiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective

Idiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective Idiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective Alisdair McKay Boston University June 2013 Microeconomic evidence on insurance - Consumption responds to idiosyncratic

More information

Consistent estimators for multilevel generalised linear models using an iterated bootstrap

Consistent estimators for multilevel generalised linear models using an iterated bootstrap Multilevel Models Project Working Paper December, 98 Consistent estimators for multilevel generalised linear models using an iterated bootstrap by Harvey Goldstein hgoldstn@ioe.ac.uk Introduction Several

More information

Weight Smoothing with Laplace Prior and Its Application in GLM Model

Weight Smoothing with Laplace Prior and Its Application in GLM Model Weight Smoothing with Laplace Prior and Its Application in GLM Model Xi Xia 1 Michael Elliott 1,2 1 Department of Biostatistics, 2 Survey Methodology Program, University of Michigan National Cancer Institute

More information

Non-informative Priors Multiparameter Models

Non-informative Priors Multiparameter Models Non-informative Priors Multiparameter Models Statistics 220 Spring 2005 Copyright c 2005 by Mark E. Irwin Prior Types Informative vs Non-informative There has been a desire for a prior distributions that

More information

SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION

SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION Vol. 6, No. 1, Summer 2017 2012 Published by JSES. SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN Fadel Hamid Hadi ALHUSSEINI a Abstract The main focus of the paper is modelling

More information

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5]

High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5] 1 High-Frequency Data Analysis and Market Microstructure [Tsay (2005), chapter 5] High-frequency data have some unique characteristics that do not appear in lower frequencies. At this class we have: Nonsynchronous

More information

High Dimensional Bayesian Optimisation and Bandits via Additive Models

High Dimensional Bayesian Optimisation and Bandits via Additive Models 1/20 High Dimensional Bayesian Optimisation and Bandits via Additive Models Kirthevasan Kandasamy, Jeff Schneider, Barnabás Póczos ICML 15 July 8 2015 2/20 Bandits & Optimisation Maximum Likelihood inference

More information

Objective Bayesian Analysis for Heteroscedastic Regression

Objective Bayesian Analysis for Heteroscedastic Regression Analysis for Heteroscedastic Regression & Esther Salazar Universidade Federal do Rio de Janeiro Colóquio Inter-institucional: Modelos Estocásticos e Aplicações 2009 Collaborators: Marco Ferreira and Thais

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections 1 / 31 : Estimation Sections 7.1 Statistical Inference Bayesian Methods: 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods: 7.5 Maximum Likelihood

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections Chapter 7: Estimation Sections 7.1 Statistical Inference Bayesian Methods: 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions Frequentist Methods: 7.5 Maximum Likelihood Estimators

More information

Robust Regression for Capital Asset Pricing Model Using Bayesian Approach

Robust Regression for Capital Asset Pricing Model Using Bayesian Approach Thai Journal of Mathematics : 016) 71 8 Special Issue on Applied Mathematics : Bayesian Econometrics http://thaijmath.in.cmu.ac.th ISSN 1686-009 Robust Regression for Capital Asset Pricing Model Using

More information

IMPLEMENTING THE SPECTRAL CALIBRATION OF EXPONENTIAL LÉVY MODELS

IMPLEMENTING THE SPECTRAL CALIBRATION OF EXPONENTIAL LÉVY MODELS IMPLEMENTING THE SPECTRAL CALIBRATION OF EXPONENTIAL LÉVY MODELS DENIS BELOMESTNY AND MARKUS REISS 1. Introduction The aim of this report is to describe more precisely how the spectral calibration method

More information

Estimating the Interest Rate Term Structure of Corporate Debt. with a Semiparametric Penalized Spline Model. Robert Jarrow, David Ruppert, and Yan Yu

Estimating the Interest Rate Term Structure of Corporate Debt. with a Semiparametric Penalized Spline Model. Robert Jarrow, David Ruppert, and Yan Yu Estimating the Interest Rate Term Structure of Corporate Debt with a Semiparametric Penalized Spline Model Robert Jarrow, David Ruppert, and Yan Yu Oct 22, 2003 Abstract This paper provides a new methodology

More information

European option pricing under parameter uncertainty

European option pricing under parameter uncertainty European option pricing under parameter uncertainty Martin Jönsson (joint work with Samuel Cohen) University of Oxford Workshop on BSDEs, SPDEs and their Applications July 4, 2017 Introduction 2/29 Introduction

More information

Therefore, statistical modelling tools are required which make thorough space-time analyses of insurance regression data possible and allow to explore

Therefore, statistical modelling tools are required which make thorough space-time analyses of insurance regression data possible and allow to explore Bayesian space time analysis of health insurance data Stefan Lang, Petra Kragler, Gerhard Haybach and Ludwig Fahrmeir University of Munich, Ludwigstr. 33, 80539 Munich email: lang@stat.uni-muenchen.de

More information

Generalized Additive Modelling for Sample Extremes: An Environmental Example

Generalized Additive Modelling for Sample Extremes: An Environmental Example Generalized Additive Modelling for Sample Extremes: An Environmental Example V. Chavez-Demoulin Department of Mathematics Swiss Federal Institute of Technology Tokyo, March 2007 Changes in extremes? Likely

More information

RESEARCH ARTICLE. The Penalized Biclustering Model And Related Algorithms Supplemental Online Material

RESEARCH ARTICLE. The Penalized Biclustering Model And Related Algorithms Supplemental Online Material Journal of Applied Statistics Vol. 00, No. 00, Month 00x, 8 RESEARCH ARTICLE The Penalized Biclustering Model And Related Algorithms Supplemental Online Material Thierry Cheouo and Alejandro Murua Département

More information

CS340 Machine learning Bayesian statistics 3

CS340 Machine learning Bayesian statistics 3 CS340 Machine learning Bayesian statistics 3 1 Outline Conjugate analysis of µ and σ 2 Bayesian model selection Summarizing the posterior 2 Unknown mean and precision The likelihood function is p(d µ,λ)

More information

درس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی

درس هفتم یادگیري ماشین. (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی یادگیري ماشین توزیع هاي نمونه و تخمین نقطه اي پارامترها Sampling Distributions and Point Estimation of Parameter (Machine Learning) دانشگاه فردوسی مشهد دانشکده مهندسی رضا منصفی درس هفتم 1 Outline Introduction

More information

Extracting Information from the Markets: A Bayesian Approach

Extracting Information from the Markets: A Bayesian Approach Extracting Information from the Markets: A Bayesian Approach Daniel Waggoner The Federal Reserve Bank of Atlanta Florida State University, February 29, 2008 Disclaimer: The views expressed are the author

More information

A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples

A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples 1.3 Regime switching models A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples (or regimes). If the dates, the

More information

Relevant parameter changes in structural break models

Relevant parameter changes in structural break models Relevant parameter changes in structural break models A. Dufays J. Rombouts Forecasting from Complexity April 27 th, 2018 1 Outline Sparse Change-Point models 1. Motivation 2. Model specification Shrinkage

More information

Outline. Review Continuation of exercises from last time

Outline. Review Continuation of exercises from last time Bayesian Models II Outline Review Continuation of exercises from last time 2 Review of terms from last time Probability density function aka pdf or density Likelihood function aka likelihood Conditional

More information

Statistics and Finance

Statistics and Finance David Ruppert Statistics and Finance An Introduction Springer Notation... xxi 1 Introduction... 1 1.1 References... 5 2 Probability and Statistical Models... 7 2.1 Introduction... 7 2.2 Axioms of Probability...

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 14th February 2006 Part VII Session 7: Volatility Modelling Session 7: Volatility Modelling

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #6 EPSY 905: Maximum Likelihood In This Lecture The basics of maximum likelihood estimation Ø The engine that

More information

Bayesian Normal Stuff

Bayesian Normal Stuff Bayesian Normal Stuff - Set-up of the basic model of a normally distributed random variable with unknown mean and variance (a two-parameter model). - Discuss philosophies of prior selection - Implementation

More information

Chapter 8: Sampling distributions of estimators Sections

Chapter 8: Sampling distributions of estimators Sections Chapter 8: Sampling distributions of estimators Sections 8.1 Sampling distribution of a statistic 8.2 The Chi-square distributions 8.3 Joint Distribution of the sample mean and sample variance Skip: p.

More information

A New Hybrid Estimation Method for the Generalized Pareto Distribution

A New Hybrid Estimation Method for the Generalized Pareto Distribution A New Hybrid Estimation Method for the Generalized Pareto Distribution Chunlin Wang Department of Mathematics and Statistics University of Calgary May 18, 2011 A New Hybrid Estimation Method for the GPD

More information

Fitting financial time series returns distributions: a mixture normality approach

Fitting financial time series returns distributions: a mixture normality approach Fitting financial time series returns distributions: a mixture normality approach Riccardo Bramante and Diego Zappa * Abstract Value at Risk has emerged as a useful tool to risk management. A relevant

More information

Using MCMC and particle filters to forecast stochastic volatility and jumps in financial time series

Using MCMC and particle filters to forecast stochastic volatility and jumps in financial time series Using MCMC and particle filters to forecast stochastic volatility and jumps in financial time series Ing. Milan Fičura DYME (Dynamical Methods in Economics) University of Economics, Prague 15.6.2016 Outline

More information

A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims

A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims International Journal of Business and Economics, 007, Vol. 6, No. 3, 5-36 A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims Wan-Kai Pang * Department of Applied

More information

Exam STAM Practice Exam #1

Exam STAM Practice Exam #1 !!!! Exam STAM Practice Exam #1 These practice exams should be used during the month prior to your exam. This practice exam contains 20 questions, of equal value, corresponding to about a 2 hour exam.

More information

discussion Papers Some Flexible Parametric Models for Partially Adaptive Estimators of Econometric Models

discussion Papers Some Flexible Parametric Models for Partially Adaptive Estimators of Econometric Models discussion Papers Discussion Paper 2007-13 March 26, 2007 Some Flexible Parametric Models for Partially Adaptive Estimators of Econometric Models Christian B. Hansen Graduate School of Business at the

More information

The Monte Carlo Method in High Performance Computing

The Monte Carlo Method in High Performance Computing The Monte Carlo Method in High Performance Computing Dieter W. Heermann Monte Carlo Methods 2015 Dieter W. Heermann (Monte Carlo Methods)The Monte Carlo Method in High Performance Computing 2015 1 / 1

More information

Which GARCH Model for Option Valuation? By Peter Christoffersen and Kris Jacobs

Which GARCH Model for Option Valuation? By Peter Christoffersen and Kris Jacobs Online Appendix Sample Index Returns Which GARCH Model for Option Valuation? By Peter Christoffersen and Kris Jacobs In order to give an idea of the differences in returns over the sample, Figure A.1 plots

More information

Understanding Tail Risk 1

Understanding Tail Risk 1 Understanding Tail Risk 1 Laura Veldkamp New York University 1 Based on work with Nic Kozeniauskas, Julian Kozlowski, Anna Orlik and Venky Venkateswaran. 1/2 2/2 Why Study Information Frictions? Every

More information

Master s in Financial Engineering Foundations of Buy-Side Finance: Quantitative Risk and Portfolio Management. > Teaching > Courses

Master s in Financial Engineering Foundations of Buy-Side Finance: Quantitative Risk and Portfolio Management.  > Teaching > Courses Master s in Financial Engineering Foundations of Buy-Side Finance: Quantitative Risk and Portfolio Management www.symmys.com > Teaching > Courses Spring 2008, Monday 7:10 pm 9:30 pm, Room 303 Attilio Meucci

More information

A Practical Implementation of the Gibbs Sampler for Mixture of Distributions: Application to the Determination of Specifications in Food Industry

A Practical Implementation of the Gibbs Sampler for Mixture of Distributions: Application to the Determination of Specifications in Food Industry A Practical Implementation of the for Mixture of Distributions: Application to the Determination of Specifications in Food Industry Julien Cornebise 1 Myriam Maumy 2 Philippe Girard 3 1 Ecole Supérieure

More information

1 Bayesian Bias Correction Model

1 Bayesian Bias Correction Model 1 Bayesian Bias Correction Model Assuming that n iid samples {X 1,...,X n }, were collected from a normal population with mean µ and variance σ 2. The model likelihood has the form, P( X µ, σ 2, T n >

More information

Two-step conditional α-quantile estimation via additive models of location and scale 1

Two-step conditional α-quantile estimation via additive models of location and scale 1 Two-step conditional α-quantile estimation via additive models of location and scale 1 Carlos Martins-Filho Department of Economics IFPRI University of Colorado 2033 K Street NW Boulder, CO 80309-0256,

More information

From Financial Engineering to Risk Management. Radu Tunaru University of Kent, UK

From Financial Engineering to Risk Management. Radu Tunaru University of Kent, UK Model Risk in Financial Markets From Financial Engineering to Risk Management Radu Tunaru University of Kent, UK \Yp World Scientific NEW JERSEY LONDON SINGAPORE BEIJING SHANGHAI HONG KONG TAIPEI CHENNAI

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning Monte Carlo Methods Mark Schmidt University of British Columbia Winter 2018 Last Time: Markov Chains We can use Markov chains for density estimation, p(x) = p(x 1 ) }{{} d p(x

More information

An Introduction to Bayesian Inference and MCMC Methods for Capture-Recapture

An Introduction to Bayesian Inference and MCMC Methods for Capture-Recapture An Introduction to Bayesian Inference and MCMC Methods for Capture-Recapture Trinity River Restoration Program Workshop on Outmigration: Population Estimation October 6 8, 2009 An Introduction to Bayesian

More information

is the bandwidth and controls the level of smoothing of the estimator, n is the sample size and

is the bandwidth and controls the level of smoothing of the estimator, n is the sample size and Paper PH100 Relationship between Total charges and Reimbursements in Outpatient Visits Using SAS GLIMMIX Chakib Battioui, University of Louisville, Louisville, KY ABSTRACT The purpose of this paper is

More information

Estimating Term Structure of U.S. Treasury Securities: An Interpolation Approach

Estimating Term Structure of U.S. Treasury Securities: An Interpolation Approach Estimating Term Structure of U.S. Treasury Securities: An Interpolation Approach Feng Guo J. Huston McCulloch Our Task Empirical TS are unobservable. Without a continuous spectrum of zero-coupon securities;

More information

Equity correlations implied by index options: estimation and model uncertainty analysis

Equity correlations implied by index options: estimation and model uncertainty analysis 1/18 : estimation and model analysis, EDHEC Business School (joint work with Rama COT) Modeling and managing financial risks Paris, 10 13 January 2011 2/18 Outline 1 2 of multi-asset models Solution to

More information

Analyzing Oil Futures with a Dynamic Nelson-Siegel Model

Analyzing Oil Futures with a Dynamic Nelson-Siegel Model Analyzing Oil Futures with a Dynamic Nelson-Siegel Model NIELS STRANGE HANSEN & ASGER LUNDE DEPARTMENT OF ECONOMICS AND BUSINESS, BUSINESS AND SOCIAL SCIENCES, AARHUS UNIVERSITY AND CENTER FOR RESEARCH

More information

Bayesian Linear Model: Gory Details

Bayesian Linear Model: Gory Details Bayesian Linear Model: Gory Details Pubh7440 Notes By Sudipto Banerjee Let y y i ] n i be an n vector of independent observations on a dependent variable (or response) from n experimental units. Associated

More information

Adaptive Experiments for Policy Choice. March 8, 2019

Adaptive Experiments for Policy Choice. March 8, 2019 Adaptive Experiments for Policy Choice Maximilian Kasy Anja Sautmann March 8, 2019 Introduction The goal of many experiments is to inform policy choices: 1. Job search assistance for refugees: Treatments:

More information

Estimation after Model Selection

Estimation after Model Selection Estimation after Model Selection Vanja M. Dukić Department of Health Studies University of Chicago E-Mail: vanja@uchicago.edu Edsel A. Peña* Department of Statistics University of South Carolina E-Mail:

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning Monte Carlo Methods Mark Schmidt University of British Columbia Winter 2019 Last Time: Markov Chains We can use Markov chains for density estimation, d p(x) = p(x 1 ) p(x }{{}

More information

An Improved Skewness Measure

An Improved Skewness Measure An Improved Skewness Measure Richard A. Groeneveld Professor Emeritus, Department of Statistics Iowa State University ragroeneveld@valley.net Glen Meeden School of Statistics University of Minnesota Minneapolis,

More information

Small Sample Performance of Instrumental Variables Probit Estimators: A Monte Carlo Investigation

Small Sample Performance of Instrumental Variables Probit Estimators: A Monte Carlo Investigation Small Sample Performance of Instrumental Variables Probit : A Monte Carlo Investigation July 31, 2008 LIML Newey Small Sample Performance? Goals Equations Regressors and Errors Parameters Reduced Form

More information

Can Rare Events Explain the Equity Premium Puzzle?

Can Rare Events Explain the Equity Premium Puzzle? Can Rare Events Explain the Equity Premium Puzzle? Christian Julliard and Anisha Ghosh Working Paper 2008 P t d b J L i f NYU A t P i i Presented by Jason Levine for NYU Asset Pricing Seminar, Fall 2009

More information

Personalized screening intervals for biomarkers using joint models for longitudinal and survival data

Personalized screening intervals for biomarkers using joint models for longitudinal and survival data Personalized screening intervals for biomarkers using joint models for longitudinal and survival data Dimitris Rizopoulos, Jeremy Taylor, Joost van Rosmalen, Ewout Steyerberg, Hanneke Takkenberg Department

More information

Probabilistic Meshless Methods for Bayesian Inverse Problems. Jon Cockayne July 8, 2016

Probabilistic Meshless Methods for Bayesian Inverse Problems. Jon Cockayne July 8, 2016 Probabilistic Meshless Methods for Bayesian Inverse Problems Jon Cockayne July 8, 2016 1 Co-Authors Chris Oates Tim Sullivan Mark Girolami 2 What is PN? Many problems in mathematics have no analytical

More information

UNIVERSITY OF OSLO. Please make sure that your copy of the problem set is complete before you attempt to answer anything.

UNIVERSITY OF OSLO. Please make sure that your copy of the problem set is complete before you attempt to answer anything. UNIVERSITY OF OSLO Faculty of Mathematics and Natural Sciences Examination in: STK4540 Non-Life Insurance Mathematics Day of examination: Wednesday, December 4th, 2013 Examination hours: 14.30 17.30 This

More information

A Multivariate Analysis of Intercompany Loss Triangles

A Multivariate Analysis of Intercompany Loss Triangles A Multivariate Analysis of Intercompany Loss Triangles Peng Shi School of Business University of Wisconsin-Madison ASTIN Colloquium May 21-24, 2013 Peng Shi (Wisconsin School of Business) Intercompany

More information

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom

Review for Final Exam Spring 2014 Jeremy Orloff and Jonathan Bloom Review for Final Exam 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom THANK YOU!!!! JON!! PETER!! RUTHI!! ERIKA!! ALL OF YOU!!!! Probability Counting Sets Inclusion-exclusion principle Rule of product

More information

A RIDGE REGRESSION ESTIMATION APPROACH WHEN MULTICOLLINEARITY IS PRESENT

A RIDGE REGRESSION ESTIMATION APPROACH WHEN MULTICOLLINEARITY IS PRESENT Fundamental Journal of Applied Sciences Vol. 1, Issue 1, 016, Pages 19-3 This paper is available online at http://www.frdint.com/ Published online February 18, 016 A RIDGE REGRESSION ESTIMATION APPROACH

More information

Missing Data. EM Algorithm and Multiple Imputation. Aaron Molstad, Dootika Vats, Li Zhong. University of Minnesota School of Statistics

Missing Data. EM Algorithm and Multiple Imputation. Aaron Molstad, Dootika Vats, Li Zhong. University of Minnesota School of Statistics Missing Data EM Algorithm and Multiple Imputation Aaron Molstad, Dootika Vats, Li Zhong University of Minnesota School of Statistics December 4, 2013 Overview 1 EM Algorithm 2 Multiple Imputation Incomplete

More information

Supplementary Appendix to The Risk Premia Embedded in Index Options

Supplementary Appendix to The Risk Premia Embedded in Index Options Supplementary Appendix to The Risk Premia Embedded in Index Options Torben G. Andersen Nicola Fusari Viktor Todorov December 214 Contents A The Non-Linear Factor Structure of Option Surfaces 2 B Additional

More information

Monte Carlo Methods in Financial Engineering

Monte Carlo Methods in Financial Engineering Paul Glassennan Monte Carlo Methods in Financial Engineering With 99 Figures

More information

A Test of the Normality Assumption in the Ordered Probit Model *

A Test of the Normality Assumption in the Ordered Probit Model * A Test of the Normality Assumption in the Ordered Probit Model * Paul A. Johnson Working Paper No. 34 March 1996 * Assistant Professor, Vassar College. I thank Jahyeong Koo, Jim Ziliak and an anonymous

More information

Omitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations

Omitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations Journal of Statistical and Econometric Methods, vol. 2, no.3, 2013, 49-55 ISSN: 2051-5057 (print version), 2051-5065(online) Scienpress Ltd, 2013 Omitted Variables Bias in Regime-Switching Models with

More information

COS 513: Gibbs Sampling

COS 513: Gibbs Sampling COS 513: Gibbs Sampling Matthew Salesi December 6, 2010 1 Overview Concluding the coverage of Markov chain Monte Carlo (MCMC) sampling methods, we look today at Gibbs sampling. Gibbs sampling is a simple

More information

Bayesian Hierarchical/ Multilevel and Latent-Variable (Random-Effects) Modeling

Bayesian Hierarchical/ Multilevel and Latent-Variable (Random-Effects) Modeling Bayesian Hierarchical/ Multilevel and Latent-Variable (Random-Effects) Modeling 1: Formulation of Bayesian models and fitting them with MCMC in WinBUGS David Draper Department of Applied Mathematics and

More information

9. Logit and Probit Models For Dichotomous Data

9. Logit and Probit Models For Dichotomous Data Sociology 740 John Fox Lecture Notes 9. Logit and Probit Models For Dichotomous Data Copyright 2014 by John Fox Logit and Probit Models for Dichotomous Responses 1 1. Goals: I To show how models similar

More information

Nonresponse Adjustment of Survey Estimates Based on. Auxiliary Variables Subject to Error. Brady T. West. University of Michigan, Ann Arbor, MI, USA

Nonresponse Adjustment of Survey Estimates Based on. Auxiliary Variables Subject to Error. Brady T. West. University of Michigan, Ann Arbor, MI, USA Nonresponse Adjustment of Survey Estimates Based on Auxiliary Variables Subject to Error Brady T West University of Michigan, Ann Arbor, MI, USA Roderick JA Little University of Michigan, Ann Arbor, MI,

More information

Regret-based Selection

Regret-based Selection Regret-based Selection David Puelz (UT Austin) Carlos M. Carvalho (UT Austin) P. Richard Hahn (Chicago Booth) May 27, 2017 Two problems 1. Asset pricing: What are the fundamental dimensions (risk factors)

More information

A Multifrequency Theory of the Interest Rate Term Structure

A Multifrequency Theory of the Interest Rate Term Structure A Multifrequency Theory of the Interest Rate Term Structure Laurent Calvet, Adlai Fisher, and Liuren Wu HEC, UBC, & Baruch College Chicago University February 26, 2010 Liuren Wu (Baruch) Cascade Dynamics

More information