Small Area Estimation of Poverty Indicators using Interval Censored Income Data

Size: px
Start display at page:

Download "Small Area Estimation of Poverty Indicators using Interval Censored Income Data"

Transcription

1 Small Area Estimation of Poverty Indicators using Interval Censored Income Data Paul Walter 1 Marcus Groß 1 Timo Schmid 1 Nikos Tzavidis 2 1 Chair of Statistics and Econometrics, Freie Universit?t Berlin 2 Department of Social Statistics & Demography, University of Southampton ITACOSM Bologna June 2017 Paul Walter 1 (26)

2 Motivation In order to fight poverty, it is essential to have knowledge about its spatial distribution. Small area estimation (SAE) methods enable the estimation of poverty indicators at a geographical level where direct estimation is either not possible, due to a lack of sample size or very imprecise (Rao & Molina, 2015). One commonly used SAE method is the empirical best predictor (EBP), which is based on a mixed model (Molina & Rao, 2010). Estimation becomes imprecise, when due to confidentially or other reasons the dependent variable in the underlying mixed model, such as income, is censored to particular intervals. Paul Walter 2 (26)

3 Motivation To get more precise estimates, two methodologies are introduced, one based on the expectation maximization algorithm (EM) (Dempster et al., 1977) (Stewart, 1983) and one based on the stochastic expectation maximization (SEM) algorithm (Caleux, 1985). How do the proposed methods assist in improving the precision of small area prediction when the dependent variable is censored to particular intervals? Paul Walter 3 (26)

4 Outline I Under Normality The EBP Approach The EM- and SEM-Algorithm MSE Estimation using Interval Censored Data Simulation Results II Paul Walter 4 (26)

5 The EBP Approach (1) Nested error linear regression model y ij = x T ij β + u i + e ij, j = 1,..., n i, i = 1,..., D, u i iid N(0, σ 2 u ), the random area-specific effects (1) e ij iid N(0, σ 2 e ), the unit-level error terms where y ij is unknown and only observed to fall into a certain interval (A k 1, A k ) on a continuous scale and k ij (1 k ij K) is the indicator into which of the intervals y ij falls. Paul Walter 5 (26)

6 The EBP Approach (2) Nested error linear regression model y ij = x T ij β + u i + e ij, j = 1,..., n i, i = 1,..., D, Use sample data to estimate β, σ u, σ e, u i, with the EM or SEM-algorithm. Micro-simulating a synthetic population: Generate a synthetic population under the model a large number of times each time estimating the target parameter. Linear and non-linear poverty indicators can be computed. Paul Walter 6 (26)

7 Methodology Reconstructing the distribution of the unknown y ij is necessary to estimate the parameters of model (1). From Bayes theorem it follows that: f (y ij x ij, k ij, u i ) f (k ij y ij, x ij, u i )f (y ij x ij, u i ) with f (k ij y ij, x ij, u i ) = { 1, if A k 1 y ij A k 0, else and from the assumption implied by model (1) f (y ij x ij, u i ) N(x T ij β + u i, σ 2 e). Paul Walter 7 (26)

8 Estimation and Computational Details (SEM) 1. Estimate ˆθ = ( ˆβ, û i, ˆσ 2 e, ˆσ 2 u) from model (1) using the midpoints of the intervals as a substitute for the unknown y ij. 2. Sample from the conditional distribution f (y ij x ij, u i ) by drawing randomly from N(xij T ˆβ + û i, ˆσ e) 2 within the given interval A k 1 y ij, A k. Obtain (ỹ ij, x ij ) for j = 1,... n i and i = 1,..., D. 3. Re-estimate ˆθ from model (1) by using the pseudo sample (ỹ ij, x ij ) obtained in step Iterate steps B + M times, with B burn-in iterations and M additional iterations. 5. Discard the burn-in iterations and estimate ˆθ by averaging the obtained M estimates. Paul Walter 8 (26)

9 Estimation and Computational Details (EM) 1. Estimate ˆθ = ( ˆβ, û i, ˆσ 2 e, ˆσ 2 u) from model (1) using the midpoints of the intervals as a substitute for the unknown y ij. 2. Estimate E[I (A k 1 y ij A k ) f (y ij x ij, u i )], the expected value of a two sided truncated normal distributed variable as pseudo ỹ ij : ỹ ij = E[I (A k 1 y ij A k ) f (y ij x ij, u i )] = (x T ij ˆβ + û i ) + ˆσ e φ(z k 1 ) φ(z k ) Φ(Z k ) Φ(Z k 1 ), obtain (ỹ ij, x ij ) for j = 1,... n i and i = 1,..., D. The conditional variance is given by the variance of a two sided truncated normal distributed variable as: {[ ] [ ] 2 } Var(ỹ ij x ij, k ij, u i ) = ˆσ 2 Zk 1 φ(z k 1 ) Z k φ(z k ) φ(zk 1 ) φ(z k 1 ) e Φ(Z k ) Φ(Z k 1 ) Φ(Z k ) Φ(Z k 1 ) } {{ } :=s ij with Z k = (A k (x T ij ˆβ + û i ))/ˆσ e. Paul Walter 9 (26)

10 Estimation and Computational Details (EM) 3. Re-estimate ˆθ from model (1) by using the pseudo sample (ỹ ij, x ij ) obtained in step 2. The variance ˆσ 2 e is given by: ˆσ 2 e = ni j=1 4. Iterate steps until convergence. 5. Obtain ˆθ from the last iteration step. D i=1 (ỹ ij (xij T ˆβ + û i )) 2 ni D j=1 i=1 (1 s ij) Paul Walter 10 (26)

11 MSE Estimation 1. Use the sample estimates ˆθ = ( ˆβ, ˆσ u, 2 ˆσ e 2 ) obtained by the EM- or SEM-algorithm to generate u (b) iid i N(0, ˆσ 2 u ) and e (b) iid ij N(0, ˆσ 2 e ) and to simulate a bootstrap superpopulation model = xij T ˆβ + u (b) i + e (b) ij. y (b) ij 2. Estimate the population indicator I i,b using y (b) ij. 3. Extract a bootstrap sample from y (b) ij, group it according to the K intervals (A k 1, A k ) and apply the EBP method using only the interval informations and treating y (b) ij as unknown. 4. Obtain Î EBP i,b. 5. Iterate steps 1-4, b = 1,..., B times. The MSE-estimate for each area i given by: MSE(Î EBP i ) = B 1 B (Î EBP i,b I i,b ) 2. (2) b=1 Paul Walter 11 (26)

12 Model-based Simulation: Normal Scenarios Finite population U of size N = 10000, partitioned into D = 50 regions U 1, U 2,..., U D of sizes N i = 200 Consider an unbalanced design with sample sizes n i between 8 n i 29 leading to a sample size of D i=1 n i = 921 The following super-population model is used to simulate M = 100 Monte Carlo populations: y ij = x ij + u i + e ij, x ij N(µ i, 3) µ i = U[ 3, 3], u i iid N(0, ), e ij iid N(0, ) j = 1,..., n i, i = 1,..., D. Paul Walter 12 (26)

13 Model-based Simulation: Normal Scenarios The following methods are applied for parameter estimation of model (1): LME - Estimate the model parameters with the true y ij to evaluate the performance of the estimation methods relying on the interval censored y ij. EM - Estimation based on the generated pseudo ỹ ij. SEM - Estimation based on the drawn pseudo ỹ ij, with 40 burn-ins and 200 iterations. Paul Walter 13 (26)

14 Model-based Simulation: Normal Scenarios Income distribution for two arbitrary chosen populations: Normal scenario 1: 7 intervals Interval Frequencies [1, 2000) 970 [2000, 3000) 1367 [3000, 4000) 2063 [4000, 5000) 2266 [5000, 6000) 1767 [6000, 7500) 1265 [7500, Inf ) 302 Normal scenario 2: 4 intervals Interval Frequencies [1, 3000) 2337 [3000, 5000) 4329 [5000, 7500) 3032 [7500, Inf ) 302 Paul Walter 14 (26)

15 Model-based Simulation: Quality Measures To evaluate the performance of the EBPs, the root mean squared error () of any parameter estimate Î EBP is estimated in each area i: ( [ Îi EBP ) 1 = M M m=1 (Î EBP(m) i ] I (m) ) 1/2 2 i, (3) where M corresponds to the number of Monte Carlo populations. Paul Walter 15 (26)

16 Under Normality Model-based Simulation: Normal Scenario 1 Mean Gini HCR LME LMEBox EM EMBox SEM SEMBox Method LME LMEBox EM EMBox SEM SEMBox Method LME LMEBox EM EMBox SEM SEMBox Method LME LMEBox EM EMBox SEM SEMBox Mean Gini HCR Paul Walter 16 (26)

17 Model-based Simulation: Normal Scenario 1 Mean (LME) Type Empirical Estimated Mean (EM) Domain Type Empirical Estimated 200 Mean (SEM) Domain Type Empirical Estimated 200 Domain Paul Walter 17 (26)

18 Model-based Simulation: Normal Scenario Gini (LME) Type Empirical Estimated Gini (EM) Domain Type Empirical Estimated Gini (SEM) Domain Type Empirical Estimated Domain Paul Walter 18 (26)

19 Model-based Simulation: Normal Scenario 1 Density plot of ŷ from a particular simulation run: Density Method EM_Prediction LME_Prediction SEM_Prediction Y Paul Walter 19 (26)

20 Under Normality Model-based Simulation: Normal Scenario 2 Mean Gini HCR LME LMEBox EM EMBox SEM SEMBox Method LME LMEBox EM EMBox SEM SEMBox Method LME LMEBox EM EMBox SEM SEMBox Method LME LMEBox EM EMBox SEM SEMBox Mean Gini HCR Paul Walter 20 (26)

21 Model-based Simulation: Normal Scenario 2 Mean (LME) Type Empirical Estimated Mean (EM) Domain Type Empirical Estimated 200 Mean (SEM) Domain Type Empirical Estimated 200 Domain Paul Walter 21 (26)

22 Model-based Simulation: Normal Scenario Gini (LME) Type Empirical Estimated Gini (EM) Domain Type Empirical Estimated Gini (SEM) Domain Type Empirical Estimated Domain Paul Walter 22 (26)

23 Model-based Simulation: Normal Scenario 2 Density plot of ŷ from a particular simulation run: Density Method EM_Prediction LME_Prediction SEM_Prediction Y Paul Walter 23 (26)

24 Previous research has shown, that whenever the dependent variable is censored to certain intervals, the EM- and SEM-algorithm outperform naive estimation procedures (regression on the midpoints of the intervals) or direct estimation, in terms of, in the EBPs. Simulation results show, that the accuracy loss in the EBPs, using the SEM- or EM-algorithm compared to the use of uncensored data, strongly depends on the number of intervals. The performance of the SEM- and EM- algorithm is in most scenarios quite similar. Paul Walter 24 (26)

25 Since the EM and SEM-algorithm strongly relies on the Gaussian assumption of the error terms, which can not accurately be tested whenever the dependent variable is grouped, two transformations are incorporated into the algorithms to handle departures from normality. The SEM-algorithm under Box-Cox transformation is outperforming the EM-algorithm under transformation in most scenarios and also performs well in the model based normal scenarios. The use of the SEM-algorithm under Box-Cox transformation is suggest as preferred estimation procedure. Paul Walter 25 (26)

26 Bibliography [1] Rao, J.N.K. & Molina, I. (2015), Small area estimation. John Wiley & Sons. [3] Molina, I. & Rao, J.N.K. (2010), Small area estimation of poverty indicators. Canadian Journal of Statistics, 38(3), [4] Caleux, G. & Dieboldt, J. (1985), The sem algorithm: a probalistic teacher algorithm derived from the em algorithm for the mixture problem. Computational Statistics Quarterly, 2: [5] Stewart, M. B. (1983), On least square estimation when the dependent variable is grouped. Review of Economic Studies, 50(4): [6] Dempster, A., Laird, N., & Rubin, D. (1977), Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1):1-38. [7] Gonzalez-Manteiga, W., Lombarda, M. J., Molina, I., Morales, D., and Santamara, L. (2008). Analytic and bootstrap approximations of prediction errors under a multivariate fayherriot model. Computational Statistics & Data Analysis, 52 (12): Paul Walter 26 (26)

Strategies for Improving the Efficiency of Monte-Carlo Methods

Strategies for Improving the Efficiency of Monte-Carlo Methods Strategies for Improving the Efficiency of Monte-Carlo Methods Paul J. Atzberger General comments or corrections should be sent to: paulatz@cims.nyu.edu Introduction The Monte-Carlo method is a useful

More information

SMALL AREA ESTIMATES OF INCOME: MEANS, MEDIANS

SMALL AREA ESTIMATES OF INCOME: MEANS, MEDIANS SMALL AREA ESTIMATES OF INCOME: MEANS, MEDIANS AND PERCENTILES Alison Whitworth (alison.whitworth@ons.gsi.gov.uk) (1), Kieran Martin (2), Cruddas, Christine Sexton, Alan Taylor Nikos Tzavidis (3), Marie

More information

A New Hybrid Estimation Method for the Generalized Pareto Distribution

A New Hybrid Estimation Method for the Generalized Pareto Distribution A New Hybrid Estimation Method for the Generalized Pareto Distribution Chunlin Wang Department of Mathematics and Statistics University of Calgary May 18, 2011 A New Hybrid Estimation Method for the GPD

More information

Missing Data. EM Algorithm and Multiple Imputation. Aaron Molstad, Dootika Vats, Li Zhong. University of Minnesota School of Statistics

Missing Data. EM Algorithm and Multiple Imputation. Aaron Molstad, Dootika Vats, Li Zhong. University of Minnesota School of Statistics Missing Data EM Algorithm and Multiple Imputation Aaron Molstad, Dootika Vats, Li Zhong University of Minnesota School of Statistics December 4, 2013 Overview 1 EM Algorithm 2 Multiple Imputation Incomplete

More information

Introduction Dickey-Fuller Test Option Pricing Bootstrapping. Simulation Methods. Chapter 13 of Chris Brook s Book.

Introduction Dickey-Fuller Test Option Pricing Bootstrapping. Simulation Methods. Chapter 13 of Chris Brook s Book. Simulation Methods Chapter 13 of Chris Brook s Book Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 April 26, 2017 Christopher

More information

Week 7 Quantitative Analysis of Financial Markets Simulation Methods

Week 7 Quantitative Analysis of Financial Markets Simulation Methods Week 7 Quantitative Analysis of Financial Markets Simulation Methods Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 November

More information

Using Halton Sequences. in Random Parameters Logit Models

Using Halton Sequences. in Random Parameters Logit Models Journal of Statistical and Econometric Methods, vol.5, no.1, 2016, 59-86 ISSN: 1792-6602 (print), 1792-6939 (online) Scienpress Ltd, 2016 Using Halton Sequences in Random Parameters Logit Models Tong Zeng

More information

Monitoring Processes with Highly Censored Data

Monitoring Processes with Highly Censored Data Monitoring Processes with Highly Censored Data Stefan H. Steiner and R. Jock MacKay Dept. of Statistics and Actuarial Sciences University of Waterloo Waterloo, N2L 3G1 Canada The need for process monitoring

More information

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior

ST440/550: Applied Bayesian Analysis. (5) Multi-parameter models - Summarizing the posterior (5) Multi-parameter models - Summarizing the posterior Models with more than one parameter Thus far we have studied single-parameter models, but most analyses have several parameters For example, consider

More information

Experience with the Weighted Bootstrap in Testing for Unobserved Heterogeneity in Exponential and Weibull Duration Models

Experience with the Weighted Bootstrap in Testing for Unobserved Heterogeneity in Exponential and Weibull Duration Models Experience with the Weighted Bootstrap in Testing for Unobserved Heterogeneity in Exponential and Weibull Duration Models Jin Seo Cho, Ta Ul Cheong, Halbert White Abstract We study the properties of the

More information

arxiv: v1 [q-fin.rm] 13 Dec 2016

arxiv: v1 [q-fin.rm] 13 Dec 2016 arxiv:1612.04126v1 [q-fin.rm] 13 Dec 2016 The hierarchical generalized linear model and the bootstrap estimator of the error of prediction of loss reserves in a non-life insurance company Alicja Wolny-Dominiak

More information

Estimating Mixed Logit Models with Large Choice Sets. Roger H. von Haefen, NC State & NBER Adam Domanski, NOAA July 2013

Estimating Mixed Logit Models with Large Choice Sets. Roger H. von Haefen, NC State & NBER Adam Domanski, NOAA July 2013 Estimating Mixed Logit Models with Large Choice Sets Roger H. von Haefen, NC State & NBER Adam Domanski, NOAA July 2013 Motivation Bayer et al. (JPE, 2007) Sorting modeling / housing choice 250,000 individuals

More information

Omitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations

Omitted Variables Bias in Regime-Switching Models with Slope-Constrained Estimators: Evidence from Monte Carlo Simulations Journal of Statistical and Econometric Methods, vol. 2, no.3, 2013, 49-55 ISSN: 2051-5057 (print version), 2051-5065(online) Scienpress Ltd, 2013 Omitted Variables Bias in Regime-Switching Models with

More information

Market Risk Analysis Volume IV. Value-at-Risk Models

Market Risk Analysis Volume IV. Value-at-Risk Models Market Risk Analysis Volume IV Value-at-Risk Models Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume IV xiii xvi xxi xxv xxix IV.l Value

More information

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is:

**BEGINNING OF EXAMINATION** A random sample of five observations from a population is: **BEGINNING OF EXAMINATION** 1. You are given: (i) A random sample of five observations from a population is: 0.2 0.7 0.9 1.1 1.3 (ii) You use the Kolmogorov-Smirnov test for testing the null hypothesis,

More information

Comparison of design-based sample mean estimate with an estimate under re-sampling-based multiple imputations

Comparison of design-based sample mean estimate with an estimate under re-sampling-based multiple imputations Comparison of design-based sample mean estimate with an estimate under re-sampling-based multiple imputations Recai Yucel 1 Introduction This section introduces the general notation used throughout this

More information

Fast Convergence of Regress-later Series Estimators

Fast Convergence of Regress-later Series Estimators Fast Convergence of Regress-later Series Estimators New Thinking in Finance, London Eric Beutner, Antoon Pelsser, Janina Schweizer Maastricht University & Kleynen Consultants 12 February 2014 Beutner Pelsser

More information

A RIDGE REGRESSION ESTIMATION APPROACH WHEN MULTICOLLINEARITY IS PRESENT

A RIDGE REGRESSION ESTIMATION APPROACH WHEN MULTICOLLINEARITY IS PRESENT Fundamental Journal of Applied Sciences Vol. 1, Issue 1, 016, Pages 19-3 This paper is available online at http://www.frdint.com/ Published online February 18, 016 A RIDGE REGRESSION ESTIMATION APPROACH

More information

mme: An R package for small area estimation with multinomial mixed models

mme: An R package for small area estimation with multinomial mixed models mme: An R package for small area estimation with multinomial mixed models M.E. López-Vizcaíno 1, M.J. Lombardía 2 and D. Morales 3 1 Instituto Galego de Estatística 2 Universidade da Coruña 3 Universidad

More information

Small area estimation for poverty indicators

Small area estimation for poverty indicators Small area estimation for poverty indicators Risto Lehtonen (University of Helsinki) Ari Veijanen (Statistics Finland) Mikko Myrskylä (Max Planck Institute for Demographic Research) Maria Valaste (Social

More information

LOSS SEVERITY DISTRIBUTION ESTIMATION OF OPERATIONAL RISK USING GAUSSIAN MIXTURE MODEL FOR LOSS DISTRIBUTION APPROACH

LOSS SEVERITY DISTRIBUTION ESTIMATION OF OPERATIONAL RISK USING GAUSSIAN MIXTURE MODEL FOR LOSS DISTRIBUTION APPROACH LOSS SEVERITY DISTRIBUTION ESTIMATION OF OPERATIONAL RISK USING GAUSSIAN MIXTURE MODEL FOR LOSS DISTRIBUTION APPROACH Seli Siti Sholihat 1 Hendri Murfi 2 1 Department of Accounting, Faculty of Economics,

More information

Modelling Returns: the CER and the CAPM

Modelling Returns: the CER and the CAPM Modelling Returns: the CER and the CAPM Carlo Favero Favero () Modelling Returns: the CER and the CAPM 1 / 20 Econometric Modelling of Financial Returns Financial data are mostly observational data: they

More information

Chapter 7 - Lecture 1 General concepts and criteria

Chapter 7 - Lecture 1 General concepts and criteria Chapter 7 - Lecture 1 General concepts and criteria January 29th, 2010 Best estimator Mean Square error Unbiased estimators Example Unbiased estimators not unique Special case MVUE Bootstrap General Question

More information

Week 1 Quantitative Analysis of Financial Markets Distributions B

Week 1 Quantitative Analysis of Financial Markets Distributions B Week 1 Quantitative Analysis of Financial Markets Distributions B Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October

More information

A Two-Step Estimator for Missing Values in Probit Model Covariates

A Two-Step Estimator for Missing Values in Probit Model Covariates WORKING PAPER 3/2015 A Two-Step Estimator for Missing Values in Probit Model Covariates Lisha Wang and Thomas Laitila Statistics ISSN 1403-0586 http://www.oru.se/institutioner/handelshogskolan-vid-orebro-universitet/forskning/publikationer/working-papers/

More information

(5) Multi-parameter models - Summarizing the posterior

(5) Multi-parameter models - Summarizing the posterior (5) Multi-parameter models - Summarizing the posterior Spring, 2017 Models with more than one parameter Thus far we have studied single-parameter models, but most analyses have several parameters For example,

More information

Experience with the Weighted Bootstrap in Testing for Unobserved Heterogeneity in Exponential and Weibull Duration Models

Experience with the Weighted Bootstrap in Testing for Unobserved Heterogeneity in Exponential and Weibull Duration Models Experience with the Weighted Bootstrap in Testing for Unobserved Heterogeneity in Exponential and Weibull Duration Models Jin Seo Cho, Ta Ul Cheong, Halbert White Abstract We study the properties of the

More information

Consistent estimators for multilevel generalised linear models using an iterated bootstrap

Consistent estimators for multilevel generalised linear models using an iterated bootstrap Multilevel Models Project Working Paper December, 98 Consistent estimators for multilevel generalised linear models using an iterated bootstrap by Harvey Goldstein hgoldstn@ioe.ac.uk Introduction Several

More information

Chapter 2 Uncertainty Analysis and Sampling Techniques

Chapter 2 Uncertainty Analysis and Sampling Techniques Chapter 2 Uncertainty Analysis and Sampling Techniques The probabilistic or stochastic modeling (Fig. 2.) iterative loop in the stochastic optimization procedure (Fig..4 in Chap. ) involves:. Specifying

More information

STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved.

STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved. STAT 509: Statistics for Engineers Dr. Dewei Wang Applied Statistics and Probability for Engineers Sixth Edition Douglas C. Montgomery George C. Runger 7 Point CHAPTER OUTLINE 7-1 Point Estimation 7-2

More information

Multivariate Cox PH model with log-skew-normal frailties

Multivariate Cox PH model with log-skew-normal frailties Multivariate Cox PH model with log-skew-normal frailties Department of Statistical Sciences, University of Padua, 35121 Padua (IT) Multivariate Cox PH model A standard statistical approach to model clustered

More information

Lecture 17: More on Markov Decision Processes. Reinforcement learning

Lecture 17: More on Markov Decision Processes. Reinforcement learning Lecture 17: More on Markov Decision Processes. Reinforcement learning Learning a model: maximum likelihood Learning a value function directly Monte Carlo Temporal-difference (TD) learning COMP-424, Lecture

More information

Testing Out-of-Sample Portfolio Performance

Testing Out-of-Sample Portfolio Performance Testing Out-of-Sample Portfolio Performance Ekaterina Kazak 1 Winfried Pohlmeier 2 1 University of Konstanz, GSDS 2 University of Konstanz, CoFE, RCEA Econometric Research in Finance Workshop 2017 SGH

More information

Construction and behavior of Multinomial Markov random field models

Construction and behavior of Multinomial Markov random field models Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 2010 Construction and behavior of Multinomial Markov random field models Kim Mueller Iowa State University Follow

More information

Market Risk Analysis Volume I

Market Risk Analysis Volume I Market Risk Analysis Volume I Quantitative Methods in Finance Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume I xiii xvi xvii xix xxiii

More information

1. You are given the following information about a stationary AR(2) model:

1. You are given the following information about a stationary AR(2) model: Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4

More information

Using Monte Carlo Integration and Control Variates to Estimate π

Using Monte Carlo Integration and Control Variates to Estimate π Using Monte Carlo Integration and Control Variates to Estimate π N. Cannady, P. Faciane, D. Miksa LSU July 9, 2009 Abstract We will demonstrate the utility of Monte Carlo integration by using this algorithm

More information

Market Risk Analysis Volume II. Practical Financial Econometrics

Market Risk Analysis Volume II. Practical Financial Econometrics Market Risk Analysis Volume II Practical Financial Econometrics Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume II xiii xvii xx xxii xxvi

More information

A comment on Christoffersen, Jacobs and Ornthanalai (2012), Dynamic jump intensities and risk premiums: Evidence from S&P500 returns and options

A comment on Christoffersen, Jacobs and Ornthanalai (2012), Dynamic jump intensities and risk premiums: Evidence from S&P500 returns and options A comment on Christoffersen, Jacobs and Ornthanalai (2012), Dynamic jump intensities and risk premiums: Evidence from S&P500 returns and options Garland Durham 1 John Geweke 2 Pulak Ghosh 3 February 25,

More information

Effects of missing data in credit risk scoring. A comparative analysis of methods to gain robustness in presence of sparce data

Effects of missing data in credit risk scoring. A comparative analysis of methods to gain robustness in presence of sparce data Credit Research Centre Credit Scoring and Credit Control X 29-31 August 2007 The University of Edinburgh - Management School Effects of missing data in credit risk scoring. A comparative analysis of methods

More information

Statistical Models and Methods for Financial Markets

Statistical Models and Methods for Financial Markets Tze Leung Lai/ Haipeng Xing Statistical Models and Methods for Financial Markets B 374756 4Q Springer Preface \ vii Part I Basic Statistical Methods and Financial Applications 1 Linear Regression Models

More information

Optimal Window Selection for Forecasting in The Presence of Recent Structural Breaks

Optimal Window Selection for Forecasting in The Presence of Recent Structural Breaks Optimal Window Selection for Forecasting in The Presence of Recent Structural Breaks Yongli Wang University of Leicester Econometric Research in Finance Workshop on 15 September 2017 SGH Warsaw School

More information

Window Width Selection for L 2 Adjusted Quantile Regression

Window Width Selection for L 2 Adjusted Quantile Regression Window Width Selection for L 2 Adjusted Quantile Regression Yoonsuh Jung, The Ohio State University Steven N. MacEachern, The Ohio State University Yoonkyung Lee, The Ohio State University Technical Report

More information

Machine Learning for Quantitative Finance

Machine Learning for Quantitative Finance Machine Learning for Quantitative Finance Fast derivative pricing Sofie Reyners Joint work with Jan De Spiegeleer, Dilip Madan and Wim Schoutens Derivative pricing is time-consuming... Vanilla option pricing

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Review of previous lecture: Why confidence intervals? Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Suppose you want to know the

More information

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS

PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS PARAMETRIC AND NON-PARAMETRIC BOOTSTRAP: A SIMULATION STUDY FOR A LINEAR REGRESSION WITH RESIDUALS FROM A MIXTURE OF LAPLACE DISTRIBUTIONS Melfi Alrasheedi School of Business, King Faisal University, Saudi

More information

Linear Regression with One Regressor

Linear Regression with One Regressor Linear Regression with One Regressor Michael Ash Lecture 9 Linear Regression with One Regressor Review of Last Time 1. The Linear Regression Model The relationship between independent X and dependent Y

More information

Geostatistical Inference under Preferential Sampling

Geostatistical Inference under Preferential Sampling Geostatistical Inference under Preferential Sampling Marie Ozanne and Justin Strait Diggle, Menezes, and Su, 2010 October 12, 2015 Marie Ozanne and Justin Strait Preferential Sampling October 12, 2015

More information

Chapter 3. Dynamic discrete games and auctions: an introduction

Chapter 3. Dynamic discrete games and auctions: an introduction Chapter 3. Dynamic discrete games and auctions: an introduction Joan Llull Structural Micro. IDEA PhD Program I. Dynamic Discrete Games with Imperfect Information A. Motivating example: firm entry and

More information

Limit Theorems for the Empirical Distribution Function of Scaled Increments of Itô Semimartingales at high frequencies

Limit Theorems for the Empirical Distribution Function of Scaled Increments of Itô Semimartingales at high frequencies Limit Theorems for the Empirical Distribution Function of Scaled Increments of Itô Semimartingales at high frequencies George Tauchen Duke University Viktor Todorov Northwestern University 2013 Motivation

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections 1 / 40 Chapter 7: Estimation Sections 7.1 Statistical Inference Bayesian Methods: Chapter 7 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods:

More information

On modelling of electricity spot price

On modelling of electricity spot price , Rüdiger Kiesel and Fred Espen Benth Institute of Energy Trading and Financial Services University of Duisburg-Essen Centre of Mathematics for Applications, University of Oslo 25. August 2010 Introduction

More information

Analysis of truncated data with application to the operational risk estimation

Analysis of truncated data with application to the operational risk estimation Analysis of truncated data with application to the operational risk estimation Petr Volf 1 Abstract. Researchers interested in the estimation of operational risk often face problems arising from the structure

More information

Monte Carlo Simulations in the Teaching Process

Monte Carlo Simulations in the Teaching Process Monte Carlo Simulations in the Teaching Process Blanka Šedivá Department of Mathematics, Faculty of Applied Sciences University of West Bohemia, Plzeň, Czech Republic CADGME 2018 Conference on Digital

More information

Computational Statistics Handbook with MATLAB

Computational Statistics Handbook with MATLAB «H Computer Science and Data Analysis Series Computational Statistics Handbook with MATLAB Second Edition Wendy L. Martinez The Office of Naval Research Arlington, Virginia, U.S.A. Angel R. Martinez Naval

More information

Box-Cox Transforms for Realized Volatility

Box-Cox Transforms for Realized Volatility Box-Cox Transforms for Realized Volatility Sílvia Gonçalves and Nour Meddahi Université de Montréal and Imperial College London January 1, 8 Abstract The log transformation of realized volatility is often

More information

Chapter 7 Sampling Distributions and Point Estimation of Parameters

Chapter 7 Sampling Distributions and Point Estimation of Parameters Chapter 7 Sampling Distributions and Point Estimation of Parameters Part 1: Sampling Distributions, the Central Limit Theorem, Point Estimation & Estimators Sections 7-1 to 7-2 1 / 25 Statistical Inferences

More information

Computational Finance. Computational Finance p. 1

Computational Finance. Computational Finance p. 1 Computational Finance Computational Finance p. 1 Outline Binomial model: option pricing and optimal investment Monte Carlo techniques for pricing of options pricing of non-standard options improving accuracy

More information

King s College London

King s College London King s College London University Of London This paper is part of an examination of the College counting towards the award of a degree. Examinations are governed by the College Regulations under the authority

More information

Statistical analysis and bootstrapping

Statistical analysis and bootstrapping Statistical analysis and bootstrapping p. 1/15 Statistical analysis and bootstrapping Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility Laboratory Statistical analysis and bootstrapping

More information

Which GARCH Model for Option Valuation? By Peter Christoffersen and Kris Jacobs

Which GARCH Model for Option Valuation? By Peter Christoffersen and Kris Jacobs Online Appendix Sample Index Returns Which GARCH Model for Option Valuation? By Peter Christoffersen and Kris Jacobs In order to give an idea of the differences in returns over the sample, Figure A.1 plots

More information

Evaluation of a New Variance Components Estimation Method Modi ed Henderson s Method 3 With the Application of Two Way Mixed Model

Evaluation of a New Variance Components Estimation Method Modi ed Henderson s Method 3 With the Application of Two Way Mixed Model Evaluation of a New Variance Components Estimation Method Modi ed Henderson s Method 3 With the Application of Two Way Mixed Model Author: Weigang Qie; Chenfan Xu Supervisor: Lars Rönnegård June 0th, 009

More information

Fitting financial time series returns distributions: a mixture normality approach

Fitting financial time series returns distributions: a mixture normality approach Fitting financial time series returns distributions: a mixture normality approach Riccardo Bramante and Diego Zappa * Abstract Value at Risk has emerged as a useful tool to risk management. A relevant

More information

An iterative approach to minimize the mean squared error in ridge regression

An iterative approach to minimize the mean squared error in ridge regression Hong Kong Baptist University HKBU Institutional Repository HKBU Staff Publication 205 An iterative approach to minimize the mean squared error in ridge regression Ka Yiu Wong Department of Mathematics,

More information

Applied Statistics I

Applied Statistics I Applied Statistics I Liang Zhang Department of Mathematics, University of Utah July 14, 2008 Liang Zhang (UofU) Applied Statistics I July 14, 2008 1 / 18 Point Estimation Liang Zhang (UofU) Applied Statistics

More information

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS

Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS Chapter 7: SAMPLING DISTRIBUTIONS & POINT ESTIMATION OF PARAMETERS Part 1: Introduction Sampling Distributions & the Central Limit Theorem Point Estimation & Estimators Sections 7-1 to 7-2 Sample data

More information

Monte Carlo Methods in Financial Engineering

Monte Carlo Methods in Financial Engineering Paul Glassennan Monte Carlo Methods in Financial Engineering With 99 Figures

More information

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems

Interval estimation. September 29, Outline Basic ideas Sampling variation and CLT Interval estimation using X More general problems Interval estimation September 29, 2017 STAT 151 Class 7 Slide 1 Outline of Topics 1 Basic ideas 2 Sampling variation and CLT 3 Interval estimation using X 4 More general problems STAT 151 Class 7 Slide

More information

Putting the Econ into Econometrics

Putting the Econ into Econometrics Putting the Econ into Econometrics Jeffrey H. Dorfman and Christopher S. McIntosh Department of Agricultural & Applied Economics University of Georgia May 1998 Draft for presentation to the 1998 AAEA Meetings

More information

Chapter 8: Sampling distributions of estimators Sections

Chapter 8: Sampling distributions of estimators Sections Chapter 8 continued Chapter 8: Sampling distributions of estimators Sections 8.1 Sampling distribution of a statistic 8.2 The Chi-square distributions 8.3 Joint Distribution of the sample mean and sample

More information

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi

Chapter 4: Commonly Used Distributions. Statistics for Engineers and Scientists Fourth Edition William Navidi Chapter 4: Commonly Used Distributions Statistics for Engineers and Scientists Fourth Edition William Navidi 2014 by Education. This is proprietary material solely for authorized instructor use. Not authorized

More information

SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION

SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION Vol. 6, No. 1, Summer 2017 2012 Published by JSES. SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN Fadel Hamid Hadi ALHUSSEINI a Abstract The main focus of the paper is modelling

More information

Equity correlations implied by index options: estimation and model uncertainty analysis

Equity correlations implied by index options: estimation and model uncertainty analysis 1/18 : estimation and model analysis, EDHEC Business School (joint work with Rama COT) Modeling and managing financial risks Paris, 10 13 January 2011 2/18 Outline 1 2 of multi-asset models Solution to

More information

A Skewed Truncated Cauchy Uniform Distribution and Its Moments

A Skewed Truncated Cauchy Uniform Distribution and Its Moments Modern Applied Science; Vol. 0, No. 7; 206 ISSN 93-844 E-ISSN 93-852 Published by Canadian Center of Science and Education A Skewed Truncated Cauchy Uniform Distribution and Its Moments Zahra Nazemi Ashani,

More information

Simulation Wrap-up, Statistics COS 323

Simulation Wrap-up, Statistics COS 323 Simulation Wrap-up, Statistics COS 323 Today Simulation Re-cap Statistics Variance and confidence intervals for simulations Simulation wrap-up FYI: No class or office hours Thursday Simulation wrap-up

More information

Improving the accuracy of estimates for complex sampling in auditing 1.

Improving the accuracy of estimates for complex sampling in auditing 1. Improving the accuracy of estimates for complex sampling in auditing 1. Y. G. Berger 1 P. M. Chiodini 2 M. Zenga 2 1 University of Southampton (UK) 2 University of Milano-Bicocca (Italy) 14-06-2017 1 The

More information

Australian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model

Australian Journal of Basic and Applied Sciences. Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model AENSI Journals Australian Journal of Basic and Applied Sciences Journal home page: wwwajbaswebcom Conditional Maximum Likelihood Estimation For Survival Function Using Cox Model Khawla Mustafa Sadiq University

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #6 EPSY 905: Maximum Likelihood In This Lecture The basics of maximum likelihood estimation Ø The engine that

More information

Volatility Models and Their Applications

Volatility Models and Their Applications HANDBOOK OF Volatility Models and Their Applications Edited by Luc BAUWENS CHRISTIAN HAFNER SEBASTIEN LAURENT WILEY A John Wiley & Sons, Inc., Publication PREFACE CONTRIBUTORS XVII XIX [JQ VOLATILITY MODELS

More information

Non-informative Priors Multiparameter Models

Non-informative Priors Multiparameter Models Non-informative Priors Multiparameter Models Statistics 220 Spring 2005 Copyright c 2005 by Mark E. Irwin Prior Types Informative vs Non-informative There has been a desire for a prior distributions that

More information

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate

More information

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties Posterior Inference Example. Consider a binomial model where we have a posterior distribution for the probability term, θ. Suppose we want to make inferences about the log-odds γ = log ( θ 1 θ), where

More information

Unit 5: Sampling Distributions of Statistics

Unit 5: Sampling Distributions of Statistics Unit 5: Sampling Distributions of Statistics Statistics 571: Statistical Methods Ramón V. León 6/12/2004 Unit 5 - Stat 571 - Ramon V. Leon 1 Definitions and Key Concepts A sample statistic used to estimate

More information

Estimating log models: to transform or not to transform?

Estimating log models: to transform or not to transform? Journal of Health Economics 20 (2001) 461 494 Estimating log models: to transform or not to transform? Willard G. Manning a,, John Mullahy b a Department of Health Studies, Biological Sciences Division,

More information

Monte Carlo Methods in Finance

Monte Carlo Methods in Finance Monte Carlo Methods in Finance Peter Jackel JOHN WILEY & SONS, LTD Preface Acknowledgements Mathematical Notation xi xiii xv 1 Introduction 1 2 The Mathematics Behind Monte Carlo Methods 5 2.1 A Few Basic

More information

Financial Models with Levy Processes and Volatility Clustering

Financial Models with Levy Processes and Volatility Clustering Financial Models with Levy Processes and Volatility Clustering SVETLOZAR T. RACHEV # YOUNG SHIN ICIM MICHELE LEONARDO BIANCHI* FRANK J. FABOZZI WILEY John Wiley & Sons, Inc. Contents Preface About the

More information

STATISTICS and PROBABILITY

STATISTICS and PROBABILITY Introduction to Statistics Atatürk University STATISTICS and PROBABILITY LECTURE: SAMPLING DISTRIBUTIONS and POINT ESTIMATIONS Prof. Dr. İrfan KAYMAZ Atatürk University Engineering Faculty Department of

More information

Application of MCMC Algorithm in Interest Rate Modeling

Application of MCMC Algorithm in Interest Rate Modeling Application of MCMC Algorithm in Interest Rate Modeling Xiaoxia Feng and Dejun Xie Abstract Interest rate modeling is a challenging but important problem in financial econometrics. This work is concerned

More information

UPDATED IAA EDUCATION SYLLABUS

UPDATED IAA EDUCATION SYLLABUS II. UPDATED IAA EDUCATION SYLLABUS A. Supporting Learning Areas 1. STATISTICS Aim: To enable students to apply core statistical techniques to actuarial applications in insurance, pensions and emerging

More information

Brooks, Introductory Econometrics for Finance, 3rd Edition

Brooks, Introductory Econometrics for Finance, 3rd Edition P1.T2. Quantitative Analysis Brooks, Introductory Econometrics for Finance, 3rd Edition Bionic Turtle FRM Study Notes Sample By David Harper, CFA FRM CIPM and Deepa Raju www.bionicturtle.com Chris Brooks,

More information

GENERATION OF STANDARD NORMAL RANDOM NUMBERS. Naveen Kumar Boiroju and M. Krishna Reddy

GENERATION OF STANDARD NORMAL RANDOM NUMBERS. Naveen Kumar Boiroju and M. Krishna Reddy GENERATION OF STANDARD NORMAL RANDOM NUMBERS Naveen Kumar Boiroju and M. Krishna Reddy Department of Statistics, Osmania University, Hyderabad- 500 007, INDIA Email: nanibyrozu@gmail.com, reddymk54@gmail.com

More information

Calibration of Interest Rates

Calibration of Interest Rates WDS'12 Proceedings of Contributed Papers, Part I, 25 30, 2012. ISBN 978-80-7378-224-5 MATFYZPRESS Calibration of Interest Rates J. Černý Charles University, Faculty of Mathematics and Physics, Prague,

More information

A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims

A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims International Journal of Business and Economics, 007, Vol. 6, No. 3, 5-36 A Markov Chain Monte Carlo Approach to Estimate the Risks of Extremely Large Insurance Claims Wan-Kai Pang * Department of Applied

More information

EC316a: Advanced Scientific Computation, Fall Discrete time, continuous state dynamic models: solution methods

EC316a: Advanced Scientific Computation, Fall Discrete time, continuous state dynamic models: solution methods EC316a: Advanced Scientific Computation, Fall 2003 Notes Section 4 Discrete time, continuous state dynamic models: solution methods We consider now solution methods for discrete time models in which decisions

More information

Chapter 7: Estimation Sections

Chapter 7: Estimation Sections 1 / 31 : Estimation Sections 7.1 Statistical Inference Bayesian Methods: 7.2 Prior and Posterior Distributions 7.3 Conjugate Prior Distributions 7.4 Bayes Estimators Frequentist Methods: 7.5 Maximum Likelihood

More information

Chapter 4: Asymptotic Properties of MLE (Part 3)

Chapter 4: Asymptotic Properties of MLE (Part 3) Chapter 4: Asymptotic Properties of MLE (Part 3) Daniel O. Scharfstein 09/30/13 1 / 1 Breakdown of Assumptions Non-Existence of the MLE Multiple Solutions to Maximization Problem Multiple Solutions to

More information

A Non-Random Walk Down Wall Street

A Non-Random Walk Down Wall Street A Non-Random Walk Down Wall Street Andrew W. Lo A. Craig MacKinlay Princeton University Press Princeton, New Jersey list of Figures List of Tables Preface xiii xv xxi 1 Introduction 3 1.1 The Random Walk

More information

Generating Random Numbers

Generating Random Numbers Generating Random Numbers Aim: produce random variables for given distribution Inverse Method Let F be the distribution function of an univariate distribution and let F 1 (y) = inf{x F (x) y} (generalized

More information

Confidence Intervals Introduction

Confidence Intervals Introduction Confidence Intervals Introduction A point estimate provides no information about the precision and reliability of estimation. For example, the sample mean X is a point estimate of the population mean μ

More information

BIO5312 Biostatistics Lecture 5: Estimations

BIO5312 Biostatistics Lecture 5: Estimations BIO5312 Biostatistics Lecture 5: Estimations Yujin Chung September 27th, 2016 Fall 2016 Yujin Chung Lec5: Estimations Fall 2016 1/34 Recap Yujin Chung Lec5: Estimations Fall 2016 2/34 Today s lecture and

More information