RESEARCH ARTICLE. The Penalized Biclustering Model And Related Algorithms Supplemental Online Material

Size: px
Start display at page:

Download "RESEARCH ARTICLE. The Penalized Biclustering Model And Related Algorithms Supplemental Online Material"

Transcription

1 Journal of Applied Statistics Vol. 00, No. 00, Month 00x, 8 RESEARCH ARTICLE The Penalized Biclustering Model And Related Algorithms Supplemental Online Material Thierry Cheouo and Alejandro Murua Département de mathématiques et de statistique, Université de Montréal, CP 68, succ. centre-ville, Montréal, Québec H3C 3J7 Canada Received 00 Month 00x; in final form 00 Month 00x) A. Introduction In these sections we provide further details on the EM and Bayesian algorithms described in the main body of the paper. Section B provides the EM updating equations for the plaid model of Section. Section C displays the full conditional distributions of the labels and parameters associated with the penalized plaid model of Section 4. The procedure to initialize the parameters and labels for the Marov chain Monte Carlo implementation of the penalized plaid model is described in Section D. Section E gives the URL addresses of the biclustering pacages used in this study, including our own pacage implementing the penalized plaid model. B. The EM updating equations Note that the bicluster and combination bicluster probabilities pρ i κ j ) are constants depending only on the combination bicluster. We will denote them as π, = 0,,..., K. Observe that pρ, κ Y, θ) = pρ i, κ j y ij, θ). It is straightforward to verify that pρ i, κ j y ij, θ) = pρ i κ j y ij, θ) = ρ i κ j π cb) σρ φ i κ j) y ij µρ i κ j, θ))/σρ i κ j ) ) π cb ) σρ i κ j) φ y ij µρ i κ j, θ))/σρ ), i κ j ) where cb) and cb ) denote the combination biclusters associated to ρ i, κ j ) and ρ i, κ j ), respectively. The maximizer of Qθ θ) for the plaid model is obtained by Corresponding author: A. Murua. murua@dms.umontreal.ca ISSN: print/issn online c 00x Taylor & Francis DOI: 0.080/066476YYxxxxxxxx

2 T. Cheouo & A. Murua taing the derivatives with respect to θ. These yield: µ = E θρ i κ j ) E θρ i κ j )y ij E θρ i κ j ρ i κ j )µ + α i + β j ) α i = j E θρ i κ j ) E θρ i κ j )y ij E θρ i κ j ρ i κ j )µ + α i + β j ) µ j β j = i E θρ i κ j ) E θρ i κ j )y ij E θρ i κ j ρ i κ j )µ + α i + β j ) µ i π cb) pρ i κ j = ρ cb) κ cb) y ij, θ), where ρ cb) κ cb) denotes the corresponding -th combination bicluster, and σ = qp = qp E θ y ij ) ρ i κ j µ + α i + β j ) yij E θρ i κ j )y ij µ + α i + β j ) + E θρ i κ j ρ i κ j )µ + α i + β j )µ + α i + β j )., Note that the updating equations are recursive. The parameters can be estimated using a Gauss-Seidel relaxation scheme over =,..., K. For example, let the superscript t + ) denote the coefficients recently updated, and the superscript t), the coefficients not yet updated. Then in order to solve the system, say for α i s, we iterate for within the EM iterations α t+) i = j E θ ρ iκ j ) E θρ i κ j )y ij j < E θρ i κ j ρ i κ j )µ t+) + β t+) j ) E θρ i κ j ρ i κ j )µ t) + α t) i + β t) j ) > } µ t+). + α t+) i Also note that the expectation is intractable if the number of biclusters K is large. For example, one needs to compute E θρ i κ j ) = ρ i κ j pρ i κ j y ij, θ), which is ρ iκ j= a sum involving K terms. In the non-overlapping bicluster model, the sum reduces to one term E θρ i κ j ) = pρ i κ j =, ρ i κ j ) = y ij, θ). In this latter

3 case, the updating equations simplify to Journal of Applied Statistics 3 µ = E θ ρ iκ j ) E θρ i κ j )y ij α i = j E θ ρ iκ j ) E θρ i κ j )y ij µ β j = i E θ ρ iκ j ) E θρ i κ j )y ij µ σ = E θ ρ iκ j ) j i E θρ i κ j ) y ij µ α i β j ) π cb) pρ i κ j = y ij, θ). C. The penalized plaid model : the full conditionals The labels Note that the lielihood may be written as exp yij γ ij ) ρ } iκ j µ + α i + β j ) ) + log σρ i, κ j ) σρ i, κ j ) γ ij y )} ij µ 0 ) + log σ0. σ 0 Let the bicluster, be fixed. Define the variables z ij = y ij ρ i κ j µ α i β j ), α = α i ) i I R r, β = β j ) j J R c. To find the full conditional of the labels, say ρ i, we use the fact that K y ij ρ i κ j µ + α i + β j ) = z ij ρ i κ j µ + α i + β j ) = = ρ i κ j z ij µ α i β j ) + ρ i )z ij + ρ i κ j )z ij. Note that γ ij = K = ρ iκ j ). For a given, we will write γ ij = K =

4 4 T. Cheouo & A. Murua ρ i κ j ). Then ρ i κ j γ ij ) = ρ i κ j γ ij ρ i κ j )ρ i κ j = ρ i κ j. We have j γ ij ) z ij ρ i κ j µ + α i + β j )) σ ρ i κ j ) z ij µ α i β = ρ j) i σ ρ i κ j ) j J + ρ i ) j z ij µ α i β = ρ j) i σ ρ i κ j ) j J zij γ ij ) σ ρ i κ j ) + ρ zij i γ ij ) σ ρ i κ j ) j / J + ρ i ) zij γ ij ) σ ρ i κ j ) + zij γ ij ) σ ρ i κ j ). j J j / J As before, let θ denote the set of parameters of the model. Define A i = exp } z ij µ α i β j) σ ρ i κ j ) j J B i = exp σ0 C i = exp D i,ρi γ ij y ij µ 0 ) j J γ ij ) j J = exp γ ij ) j / J } σ 0 ) / σ ρ i κ j ), j J ) j J γ ij/, z ij σ ρ i κ j ) + log σ ρ i κ j ) )}, zij )} σ ρ i κ j ) + log σ ρ i κ j ). Also let ρ i) denote the set of all row labels except ρ i. From the above equation it is straightforward to verify that the full conditionals of ρ i satisfy pρ i y ij }, ρ i), κ), θ) A ρi i B ρi i C ρi i D i,ρi πρ i ), where πρ i ) = exp λ j K = ρ i κ j + γ ij + γ ij )κ j ρ i ) }. In particular, the ratio pρ i = y ij }, ρ i), κ), θ)/pρ i = 0 y ij }, ρ i), κ), θ) is given by A i B i C i D i,d i,0 exp λ j γ ij )κ j }. The term D i,ρi may be ignored for models whose variances do not depend on i, j). In particular, for the plaid model, the logarithm of this ratio is σ j J z ij µ α i β j ) + γ ij )zij +γ ijy ij µ 0 ) } λ j γ ij)κ j. The full conditional for κ j s are found in a similar way by symmetry.

5 The row and column effects Journal of Applied Statistics 5 Define the matrices R = diag j J σ ρ i, κ j )), and C = diag i I σ ρ i, κ j )). Let m denote the vector of all s in R m. Since the variance of α is given by σαv = σαi r r r r ), we may write α = V a for a random vector a N0, σαi r ). It is easy to verify that the full conditional of a is a multivariate normal with mean µ a, and variance Σ a, given by µ a, = V R V + σ α I r ) V z α,, Σ a, = V R V + σ α I r ), where z α, = j J z ij µ β j )/σ ρ i, κ j )) i I. Similarly, let U = I c c c c ). We may write β = U b for a random vector b N0, σ β I c ). It is easy to verify that the full conditional of b is a multivariate normal with mean µ b, and variance Σ b, given by µ b, = U C U + σ β I c ) U z β,, Σ b, = U C U + σ β I c ), where z β, = i I z ij µ α i )/σ ρ i, κ j )) j J. For the plaid model σρ i, κ j ) = σ, and for the model of Cheng and Church [], σρ i, κ j ) = σ. In both cases the variance is constant on each bicluster. Therefore, for these models, R = σ c I r and C = σ r I c. Hence the conditional means and variances for a and b become µ a, = Σ a, = µ b, = Σ b, = c σ c σ r σ r σ + σ α + σ α + σ β ) c σ z i z ) i I, ) I r + c σα r σ r r ), ) r z j z ) j J, σ + ) σα I c + r σβ c σ c c ), where z denotes the mean of the values of z ij in the bicluster, and z i = j J z ij /c, and z j = i I z ij /r. It can be easily shown that the full conditionals of the means µ, = 0,,..., K are also normal distributions with means and variances given by µ µ, = σ µ Σ µ, = σ µ + ) B + ) B ) z ij α i β j σ ρ i, κ j ) σ, ρ i, κ j ) ) B ). σ ρ i, κ j ) Again, for the plaid and Cheng and Church models, the means and variances

6 6 T. Cheouo & A. Murua simplify to µ µ, = σ µ + n ) n σ σ z, Σ µ, = σµ + n ) σ. Note that when σµ, σα, and σβ estimators. tend to infinity we obtain the hard-em or ICM) The variances Let z ij = y ij ρ i κ j µ + α i + β j ). The full conditionals of the variances are proportional to σ exp zij + ) νs σ + νs ) B σ } n logσ ) ν + ) log σ. If we suppose that there is no overlapping among the biclusters, we obtain the Cheng and Church model []. The corresponding full conditional of σ is an inverseχ distribution with scale νs + ) B zij )/ν + n ), and ν + n degrees of freedom. If instead we suppose that σρ i, κ j ) = σ independently of the cell i, j) i.e., σ = σ for all = 0,,..., K), then we obtain the full conditional of σ for the plaid model. This is also an inverse-χ distribution, but this time with scale νs + z ij )/ν + pq), and ν + pq degrees of freedom. D. The penalized plaid model: initial values Finding the initial membership labels ρ, κ) is a difficult tas. Several procedures have been suggested in the literature. We have adopted a technique similar to that of Turner et al. [3]. We run two independent -means algorithms [4] with = : once for the rows and once for the columns. Using the Cartesian product of the resulting -means row and column labels, we divide the data matrix into four groups. A single initial bicluster is chosen among these four groups according to a variance criterion explained a few lines below. The procedure is repeated as many times as the number of initial biclusters needed. A single initial bicluster is chosen after each application of the independent row and column -means algorithms. The elements of the biclusters already chosen are mased by replacing their original values y ij by random values. This is done so that in the next iteration a different group may be chosen. The masing procedure is not new. It has been used before by Sheng, Moreau and De Moor [] to determinate multiple biclusters. The criterion to choose an initial bicluster among the four groups yielded by each iteration is the following. Suppose that the cells of each group follow a random effect additive ANOVA model. That is, on each group g, y ij = µ g + α ig + β jg + ɛ ij, g =,, 3, 4. The standard moment estimates of the variances are ˆσ gα = MSS gα) MSS g e), ˆσ gβ c = MSS gβ) MSS g e), ˆσ ge = MSS g e), ) g r g where r g is the number of rows in the g-th group, c g is the number of columns, and MSS g e), MSS g α) and MSS g β) are the corresponding mean sum of squares for error, rows and columns, respectively. We select as an initial bicluster the group g

7 REFERENCES 7 that maximizes ˆσ gα + ˆσ gβ )/ˆσ ge. For each initial bicluster, the parameters µ g, α ig and β jg are initialized as ȳ.., ȳ i. ȳ.., ȳ.j ȳ.., respectively, where y.., ȳ i., ȳ.j stand for the overall bicluster mean, the bicluster i-th row mean, and the bicluster j-th column mean, respectively. The parameter µ 0 is estimated as the arithmetic mean of the zero-bicluster. The variance σ is initialized as the mean sum of squares of all the residuals. E. Biclustering algorithm pacages The following table gives the URLs associated with the biclustering pacages used in the comparison study presented in Section 5 of the main body of this paper. The pacage names are given in parenthesis under the associated algorithm names. Algorithm FABIA fabia) Description and Address R pacage: Factor Analysis for Bicluster Acquisition SAMBA EXPANDER) EXpression Analyzer and DisplayER: java-based tool for analysis of gene expression and NGS data acgt.cs.tau.ac.il/expander/overview.html TUR biclust) R pacage: the plaid model implementation of Turner and al. cran.r-project.org/web/pacages/biclust/ CC biclust) R pacage: original method suggested by Cheng and Church to fit a mixture model cran.r-project.org/web/pacages/biclust/ Spectral biclust) R pacage: uses a singular value decomposition cran.r-project.org/web/pacages/biclust/ Penalized plaid penalizedplaid) java pacage: The models are: ) the non-overlapping bicluster model; ) the plaid model; and 3) the penalized plaid model. The estimation methods are: a) the Gibbs sampler and b) the Metropolis-Hastings algorithm References [] Y. Cheng and G. Church, Biclustering of expression data, Int. Conf. Intelligent Systems for Molecular Biology 000), pp Next Generation Sequencing

8 8 REFERENCES [] Q. Sheng, Y. Moreau, and B. De Moor, Biclustering microarray data by Gibbs sampling, Bioinformatics 9 003), pp. ii96 ii05. [3] H. Turner, T. Bailey, and W. Krzanowsi, Improved biclustering of microarray data demonstrated through systematic performance tests, Computational Statistics & Data Analysis ), pp [4] J.H. Ward, Hierarchical groupings to optimize an objective function, J. American Statistical Association ), pp

A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples

A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples 1.3 Regime switching models A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples (or regimes). If the dates, the

More information

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29

Chapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29 Chapter 5 Univariate time-series analysis () Chapter 5 Univariate time-series analysis 1 / 29 Time-Series Time-series is a sequence fx 1, x 2,..., x T g or fx t g, t = 1,..., T, where t is an index denoting

More information

Bayesian Linear Model: Gory Details

Bayesian Linear Model: Gory Details Bayesian Linear Model: Gory Details Pubh7440 Notes By Sudipto Banerjee Let y y i ] n i be an n vector of independent observations on a dependent variable (or response) from n experimental units. Associated

More information

Evaluation of a New Variance Components Estimation Method Modi ed Henderson s Method 3 With the Application of Two Way Mixed Model

Evaluation of a New Variance Components Estimation Method Modi ed Henderson s Method 3 With the Application of Two Way Mixed Model Evaluation of a New Variance Components Estimation Method Modi ed Henderson s Method 3 With the Application of Two Way Mixed Model Author: Weigang Qie; Chenfan Xu Supervisor: Lars Rönnegård June 0th, 009

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 14th February 2006 Part VII Session 7: Volatility Modelling Session 7: Volatility Modelling

More information

COS 513: Gibbs Sampling

COS 513: Gibbs Sampling COS 513: Gibbs Sampling Matthew Salesi December 6, 2010 1 Overview Concluding the coverage of Markov chain Monte Carlo (MCMC) sampling methods, we look today at Gibbs sampling. Gibbs sampling is a simple

More information

A Practical Implementation of the Gibbs Sampler for Mixture of Distributions: Application to the Determination of Specifications in Food Industry

A Practical Implementation of the Gibbs Sampler for Mixture of Distributions: Application to the Determination of Specifications in Food Industry A Practical Implementation of the for Mixture of Distributions: Application to the Determination of Specifications in Food Industry Julien Cornebise 1 Myriam Maumy 2 Philippe Girard 3 1 Ecole Supérieure

More information

ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices

ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices Bachelier Finance Society Meeting Toronto 2010 Henley Business School at Reading Contact Author : d.ledermann@icmacentre.ac.uk Alexander

More information

Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty

Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty George Photiou Lincoln College University of Oxford A dissertation submitted in partial fulfilment for

More information

Technical Appendix: Policy Uncertainty and Aggregate Fluctuations.

Technical Appendix: Policy Uncertainty and Aggregate Fluctuations. Technical Appendix: Policy Uncertainty and Aggregate Fluctuations. Haroon Mumtaz Paolo Surico July 18, 2017 1 The Gibbs sampling algorithm Prior Distributions and starting values Consider the model to

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.

More information

Bivariate Birnbaum-Saunders Distribution

Bivariate Birnbaum-Saunders Distribution Department of Mathematics & Statistics Indian Institute of Technology Kanpur January 2nd. 2013 Outline 1 Collaborators 2 3 Birnbaum-Saunders Distribution: Introduction & Properties 4 5 Outline 1 Collaborators

More information

Risk Management and Time Series

Risk Management and Time Series IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Risk Management and Time Series Time series models are often employed in risk management applications. They can be used to estimate

More information

Course information FN3142 Quantitative finance

Course information FN3142 Quantitative finance Course information 015 16 FN314 Quantitative finance This course is aimed at students interested in obtaining a thorough grounding in market finance and related empirical methods. Prerequisite If taken

More information

STA258 Analysis of Variance

STA258 Analysis of Variance STA258 Analysis of Variance Al Nosedal. University of Toronto. Winter 2017 The Data Matrix The following table shows last year s sales data for a small business. The sample is put into a matrix format

More information

A Multivariate Analysis of Intercompany Loss Triangles

A Multivariate Analysis of Intercompany Loss Triangles A Multivariate Analysis of Intercompany Loss Triangles Peng Shi School of Business University of Wisconsin-Madison ASTIN Colloquium May 21-24, 2013 Peng Shi (Wisconsin School of Business) Intercompany

More information

Nonresponse Adjustment of Survey Estimates Based on. Auxiliary Variables Subject to Error. Brady T. West. University of Michigan, Ann Arbor, MI, USA

Nonresponse Adjustment of Survey Estimates Based on. Auxiliary Variables Subject to Error. Brady T. West. University of Michigan, Ann Arbor, MI, USA Nonresponse Adjustment of Survey Estimates Based on Auxiliary Variables Subject to Error Brady T West University of Michigan, Ann Arbor, MI, USA Roderick JA Little University of Michigan, Ann Arbor, MI,

More information

SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION

SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION Vol. 6, No. 1, Summer 2017 2012 Published by JSES. SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN Fadel Hamid Hadi ALHUSSEINI a Abstract The main focus of the paper is modelling

More information

Macroeconometric Modeling: 2018

Macroeconometric Modeling: 2018 Macroeconometric Modeling: 2018 Contents Ray C. Fair 2018 1 Macroeconomic Methodology 4 1.1 The Cowles Commission Approach................. 4 1.2 Macroeconomic Methodology.................... 5 1.3 The

More information

Definition 9.1 A point estimate is any function T (X 1,..., X n ) of a random sample. We often write an estimator of the parameter θ as ˆθ.

Definition 9.1 A point estimate is any function T (X 1,..., X n ) of a random sample. We often write an estimator of the parameter θ as ˆθ. 9 Point estimation 9.1 Rationale behind point estimation When sampling from a population described by a pdf f(x θ) or probability function P [X = x θ] knowledge of θ gives knowledge of the entire population.

More information

Introduction to the Maximum Likelihood Estimation Technique. September 24, 2015

Introduction to the Maximum Likelihood Estimation Technique. September 24, 2015 Introduction to the Maximum Likelihood Estimation Technique September 24, 2015 So far our Dependent Variable is Continuous That is, our outcome variable Y is assumed to follow a normal distribution having

More information

Supplementary online material to Information tradeoffs in dynamic financial markets

Supplementary online material to Information tradeoffs in dynamic financial markets Supplementary online material to Information tradeoffs in dynamic financial markets Efstathios Avdis University of Alberta, Canada 1. The value of information in continuous time In this document I address

More information

Lecture 3: Factor models in modern portfolio choice

Lecture 3: Factor models in modern portfolio choice Lecture 3: Factor models in modern portfolio choice Prof. Massimo Guidolin Portfolio Management Spring 2016 Overview The inputs of portfolio problems Using the single index model Multi-index models Portfolio

More information

Estimating a Dynamic Oligopolistic Game with Serially Correlated Unobserved Production Costs. SS223B-Empirical IO

Estimating a Dynamic Oligopolistic Game with Serially Correlated Unobserved Production Costs. SS223B-Empirical IO Estimating a Dynamic Oligopolistic Game with Serially Correlated Unobserved Production Costs SS223B-Empirical IO Motivation There have been substantial recent developments in the empirical literature on

More information

Calibration of Interest Rates

Calibration of Interest Rates WDS'12 Proceedings of Contributed Papers, Part I, 25 30, 2012. ISBN 978-80-7378-224-5 MATFYZPRESS Calibration of Interest Rates J. Černý Charles University, Faculty of Mathematics and Physics, Prague,

More information

Lattice (Binomial Trees) Version 1.2

Lattice (Binomial Trees) Version 1.2 Lattice (Binomial Trees) Version 1. 1 Introduction This plug-in implements different binomial trees approximations for pricing contingent claims and allows Fairmat to use some of the most popular binomial

More information

Financial Econometrics

Financial Econometrics Financial Econometrics Volatility Gerald P. Dwyer Trinity College, Dublin January 2013 GPD (TCD) Volatility 01/13 1 / 37 Squared log returns for CRSP daily GPD (TCD) Volatility 01/13 2 / 37 Absolute value

More information

1 Explaining Labor Market Volatility

1 Explaining Labor Market Volatility Christiano Economics 416 Advanced Macroeconomics Take home midterm exam. 1 Explaining Labor Market Volatility The purpose of this question is to explore a labor market puzzle that has bedeviled business

More information

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties

Posterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties Posterior Inference Example. Consider a binomial model where we have a posterior distribution for the probability term, θ. Suppose we want to make inferences about the log-odds γ = log ( θ 1 θ), where

More information

Portfolio Optimization. Prof. Daniel P. Palomar

Portfolio Optimization. Prof. Daniel P. Palomar Portfolio Optimization Prof. Daniel P. Palomar The Hong Kong University of Science and Technology (HKUST) MAFS6010R- Portfolio Optimization with R MSc in Financial Mathematics Fall 2018-19, HKUST, Hong

More information

Module 2: Monte Carlo Methods

Module 2: Monte Carlo Methods Module 2: Monte Carlo Methods Prof. Mike Giles mike.giles@maths.ox.ac.uk Oxford University Mathematical Institute MC Lecture 2 p. 1 Greeks In Monte Carlo applications we don t just want to know the expected

More information

Modelling strategies for bivariate circular data

Modelling strategies for bivariate circular data Modelling strategies for bivariate circular data John T. Kent*, Kanti V. Mardia, & Charles C. Taylor Department of Statistics, University of Leeds 1 Introduction On the torus there are two common approaches

More information

STA218 Analysis of Variance

STA218 Analysis of Variance STA218 Analysis of Variance Al Nosedal. University of Toronto. Fall 2017 November 27, 2017 The Data Matrix The following table shows last year s sales data for a small business. The sample is put into

More information

Missing Data. EM Algorithm and Multiple Imputation. Aaron Molstad, Dootika Vats, Li Zhong. University of Minnesota School of Statistics

Missing Data. EM Algorithm and Multiple Imputation. Aaron Molstad, Dootika Vats, Li Zhong. University of Minnesota School of Statistics Missing Data EM Algorithm and Multiple Imputation Aaron Molstad, Dootika Vats, Li Zhong University of Minnesota School of Statistics December 4, 2013 Overview 1 EM Algorithm 2 Multiple Imputation Incomplete

More information

Financial intermediaries in an estimated DSGE model for the UK

Financial intermediaries in an estimated DSGE model for the UK Financial intermediaries in an estimated DSGE model for the UK Stefania Villa a Jing Yang b a Birkbeck College b Bank of England Cambridge Conference - New Instruments of Monetary Policy: The Challenges

More information

Financial Risk Management

Financial Risk Management Financial Risk Management Professor: Thierry Roncalli Evry University Assistant: Enareta Kurtbegu Evry University Tutorial exercices #4 1 Correlation and copulas 1. The bivariate Gaussian copula is given

More information

Estimating Market Power in Differentiated Product Markets

Estimating Market Power in Differentiated Product Markets Estimating Market Power in Differentiated Product Markets Metin Cakir Purdue University December 6, 2010 Metin Cakir (Purdue) Market Equilibrium Models December 6, 2010 1 / 28 Outline Outline Estimating

More information

Risk Margin Quantile Function Via Parametric and Non-Parametric Bayesian Quantile Regression

Risk Margin Quantile Function Via Parametric and Non-Parametric Bayesian Quantile Regression Risk Margin Quantile Function Via Parametric and Non-Parametric Bayesian Quantile Regression Alice X.D. Dong 1 Jennifer S.K. Chan 1 Gareth W. Peters 2 Working paper, version from February 12, 2014 arxiv:1402.2492v1

More information

Modelling, Estimation and Hedging of Longevity Risk

Modelling, Estimation and Hedging of Longevity Risk IA BE Summer School 2016, K. Antonio, UvA 1 / 50 Modelling, Estimation and Hedging of Longevity Risk Katrien Antonio KU Leuven and University of Amsterdam IA BE Summer School 2016, Leuven Module II: Fitting

More information

ELEMENTS OF MATRIX MATHEMATICS

ELEMENTS OF MATRIX MATHEMATICS QRMC07 9/7/0 4:45 PM Page 5 CHAPTER SEVEN ELEMENTS OF MATRIX MATHEMATICS 7. AN INTRODUCTION TO MATRICES Investors frequently encounter situations involving numerous potential outcomes, many discrete periods

More information

What s New in Econometrics. Lecture 11

What s New in Econometrics. Lecture 11 What s New in Econometrics Lecture 11 Discrete Choice Models Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Multinomial and Conditional Logit Models 3. Independence of Irrelevant Alternatives

More information

Premia 14 HESTON MODEL CALIBRATION USING VARIANCE SWAPS PRICES

Premia 14 HESTON MODEL CALIBRATION USING VARIANCE SWAPS PRICES Premia 14 HESTON MODEL CALIBRATION USING VARIANCE SWAPS PRICES VADIM ZHERDER Premia Team INRIA E-mail: vzherder@mailru 1 Heston model Let the asset price process S t follows the Heston stochastic volatility

More information

Extended Model: Posterior Distributions

Extended Model: Posterior Distributions APPENDIX A Extended Model: Posterior Distributions A. Homoskedastic errors Consider the basic contingent claim model b extended by the vector of observables x : log C i = β log b σ, x i + β x i + i, i

More information

Final exam solutions

Final exam solutions EE365 Stochastic Control / MS&E251 Stochastic Decision Models Profs. S. Lall, S. Boyd June 5 6 or June 6 7, 2013 Final exam solutions This is a 24 hour take-home final. Please turn it in to one of the

More information

Oil and macroeconomic (in)stability

Oil and macroeconomic (in)stability Oil and macroeconomic (in)stability Hilde C. Bjørnland Vegard H. Larsen Centre for Applied Macro- and Petroleum Economics (CAMP) BI Norwegian Business School CFE-ERCIM December 07, 2014 Bjørnland and Larsen

More information

1. You are given the following information about a stationary AR(2) model:

1. You are given the following information about a stationary AR(2) model: Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4

More information

Bayesian Inference for Random Coefficient Dynamic Panel Data Models

Bayesian Inference for Random Coefficient Dynamic Panel Data Models Bayesian Inference for Random Coefficient Dynamic Panel Data Models By Peng Zhang and Dylan Small* 1 Department of Statistics, The Wharton School, University of Pennsylvania Abstract We develop a hierarchical

More information

Market Risk Analysis Volume I

Market Risk Analysis Volume I Market Risk Analysis Volume I Quantitative Methods in Finance Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume I xiii xvi xvii xix xxiii

More information

"Pricing Exotic Options using Strong Convergence Properties

Pricing Exotic Options using Strong Convergence Properties Fourth Oxford / Princeton Workshop on Financial Mathematics "Pricing Exotic Options using Strong Convergence Properties Klaus E. Schmitz Abe schmitz@maths.ox.ac.uk www.maths.ox.ac.uk/~schmitz Prof. Mike

More information

Solving dynamic portfolio choice problems by recursing on optimized portfolio weights or on the value function?

Solving dynamic portfolio choice problems by recursing on optimized portfolio weights or on the value function? DOI 0.007/s064-006-9073-z ORIGINAL PAPER Solving dynamic portfolio choice problems by recursing on optimized portfolio weights or on the value function? Jules H. van Binsbergen Michael W. Brandt Received:

More information

Modelling Returns: the CER and the CAPM

Modelling Returns: the CER and the CAPM Modelling Returns: the CER and the CAPM Carlo Favero Favero () Modelling Returns: the CER and the CAPM 1 / 20 Econometric Modelling of Financial Returns Financial data are mostly observational data: they

More information

ECON 815. A Basic New Keynesian Model II

ECON 815. A Basic New Keynesian Model II ECON 815 A Basic New Keynesian Model II Winter 2015 Queen s University ECON 815 1 Unemployment vs. Inflation 12 10 Unemployment 8 6 4 2 0 1 1.5 2 2.5 3 3.5 4 4.5 5 Core Inflation 14 12 10 Unemployment

More information

Online Appendix (Not intended for Publication): Federal Reserve Credibility and the Term Structure of Interest Rates

Online Appendix (Not intended for Publication): Federal Reserve Credibility and the Term Structure of Interest Rates Online Appendix Not intended for Publication): Federal Reserve Credibility and the Term Structure of Interest Rates Aeimit Lakdawala Michigan State University Shu Wu University of Kansas August 2017 1

More information

Applications of Good s Generalized Diversity Index. A. J. Baczkowski Department of Statistics, University of Leeds Leeds LS2 9JT, UK

Applications of Good s Generalized Diversity Index. A. J. Baczkowski Department of Statistics, University of Leeds Leeds LS2 9JT, UK Applications of Good s Generalized Diversity Index A. J. Baczkowski Department of Statistics, University of Leeds Leeds LS2 9JT, UK Internal Report STAT 98/11 September 1998 Applications of Good s Generalized

More information

induced by the Solvency II project

induced by the Solvency II project Asset Les normes allocation IFRS : new en constraints assurance induced by the Solvency II project 36 th International ASTIN Colloquium Zürich September 005 Frédéric PLANCHET Pierre THÉROND ISFA Université

More information

Application of MCMC Algorithm in Interest Rate Modeling

Application of MCMC Algorithm in Interest Rate Modeling Application of MCMC Algorithm in Interest Rate Modeling Xiaoxia Feng and Dejun Xie Abstract Interest rate modeling is a challenging but important problem in financial econometrics. This work is concerned

More information

Monetary Economics Final Exam

Monetary Economics Final Exam 316-466 Monetary Economics Final Exam 1. Flexible-price monetary economics (90 marks). Consider a stochastic flexibleprice money in the utility function model. Time is discrete and denoted t =0, 1,...

More information

Methods and Models of Loss Reserving Based on Run Off Triangles: A Unifying Survey

Methods and Models of Loss Reserving Based on Run Off Triangles: A Unifying Survey Methods and Models of Loss Reserving Based on Run Off Triangles: A Unifying Survey By Klaus D Schmidt Lehrstuhl für Versicherungsmathematik Technische Universität Dresden Abstract The present paper provides

More information

A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels

A Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels BMC Bioinformatics This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. A Bayesian model for classifying

More information

Equity correlations implied by index options: estimation and model uncertainty analysis

Equity correlations implied by index options: estimation and model uncertainty analysis 1/18 : estimation and model analysis, EDHEC Business School (joint work with Rama COT) Modeling and managing financial risks Paris, 10 13 January 2011 2/18 Outline 1 2 of multi-asset models Solution to

More information

Unobserved Heterogeneity Revisited

Unobserved Heterogeneity Revisited Unobserved Heterogeneity Revisited Robert A. Miller Dynamic Discrete Choice March 2018 Miller (Dynamic Discrete Choice) cemmap 7 March 2018 1 / 24 Distributional Assumptions about the Unobserved Variables

More information

Identifying Long-Run Risks: A Bayesian Mixed-Frequency Approach

Identifying Long-Run Risks: A Bayesian Mixed-Frequency Approach Identifying : A Bayesian Mixed-Frequency Approach Frank Schorfheide University of Pennsylvania CEPR and NBER Dongho Song University of Pennsylvania Amir Yaron University of Pennsylvania NBER February 12,

More information

Continuous Distributions

Continuous Distributions Quantitative Methods 2013 Continuous Distributions 1 The most important probability distribution in statistics is the normal distribution. Carl Friedrich Gauss (1777 1855) Normal curve A normal distribution

More information

may be of interest. That is, the average difference between the estimator and the truth. Estimators with Bias(ˆθ) = 0 are called unbiased.

may be of interest. That is, the average difference between the estimator and the truth. Estimators with Bias(ˆθ) = 0 are called unbiased. 1 Evaluating estimators Suppose you observe data X 1,..., X n that are iid observations with distribution F θ indexed by some parameter θ. When trying to estimate θ, one may be interested in determining

More information

4 Reinforcement Learning Basic Algorithms

4 Reinforcement Learning Basic Algorithms Learning in Complex Systems Spring 2011 Lecture Notes Nahum Shimkin 4 Reinforcement Learning Basic Algorithms 4.1 Introduction RL methods essentially deal with the solution of (optimal) control problems

More information

EE641 Digital Image Processing II: Purdue University VISE - October 29,

EE641 Digital Image Processing II: Purdue University VISE - October 29, EE64 Digital Image Processing II: Purdue University VISE - October 9, 004 The EM Algorithm. Suffient Statistics and Exponential Distributions Let p(y θ) be a family of density functions parameterized by

More information

Construction and behavior of Multinomial Markov random field models

Construction and behavior of Multinomial Markov random field models Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 2010 Construction and behavior of Multinomial Markov random field models Kim Mueller Iowa State University Follow

More information

# generate data num.obs <- 100 y <- rnorm(num.obs,mean = theta.true, sd = sqrt(sigma.sq.true))

# generate data num.obs <- 100 y <- rnorm(num.obs,mean = theta.true, sd = sqrt(sigma.sq.true)) Posterior Sampling from Normal Now we seek to create draws from the joint posterior distribution and the marginal posterior distributions and Note the marginal posterior distributions would be used to

More information

John Hull, Risk Management and Financial Institutions, 4th Edition

John Hull, Risk Management and Financial Institutions, 4th Edition P1.T2. Quantitative Analysis John Hull, Risk Management and Financial Institutions, 4th Edition Bionic Turtle FRM Video Tutorials By David Harper, CFA FRM 1 Chapter 10: Volatility (Learning objectives)

More information

Idiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective

Idiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective Idiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective Alisdair McKay Boston University June 2013 Microeconomic evidence on insurance - Consumption responds to idiosyncratic

More information

Point Estimators. STATISTICS Lecture no. 10. Department of Econometrics FEM UO Brno office 69a, tel

Point Estimators. STATISTICS Lecture no. 10. Department of Econometrics FEM UO Brno office 69a, tel STATISTICS Lecture no. 10 Department of Econometrics FEM UO Brno office 69a, tel. 973 442029 email:jiri.neubauer@unob.cz 8. 12. 2009 Introduction Suppose that we manufacture lightbulbs and we want to state

More information

The change of correlation structure across industries: an analysis in the regime-switching framework

The change of correlation structure across industries: an analysis in the regime-switching framework Kyoto University, Graduate School of Economics Research Project Center Discussion Paper Series The change of correlation structure across industries: an analysis in the regime-switching framework Masahiko

More information

Window Width Selection for L 2 Adjusted Quantile Regression

Window Width Selection for L 2 Adjusted Quantile Regression Window Width Selection for L 2 Adjusted Quantile Regression Yoonsuh Jung, The Ohio State University Steven N. MacEachern, The Ohio State University Yoonkyung Lee, The Ohio State University Technical Report

More information

Notes on the EM Algorithm Michael Collins, September 24th 2005

Notes on the EM Algorithm Michael Collins, September 24th 2005 Notes on the EM Algorithm Michael Collins, September 24th 2005 1 Hidden Markov Models A hidden Markov model (N, Σ, Θ) consists of the following elements: N is a positive integer specifying the number of

More information

Comparison of Pricing Approaches for Longevity Markets

Comparison of Pricing Approaches for Longevity Markets Comparison of Pricing Approaches for Longevity Markets Melvern Leung Simon Fung & Colin O hare Longevity 12 Conference, Chicago, The Drake Hotel, September 30 th 2016 1 / 29 Overview Introduction 1 Introduction

More information

CHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION

CHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION CHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION Szabolcs Sebestyén szabolcs.sebestyen@iscte.pt Master in Finance INVESTMENTS Sebestyén (ISCTE-IUL) Choice Theory Investments 1 / 65 Outline 1 An Introduction

More information

Chapter 8: CAPM. 1. Single Index Model. 2. Adding a Riskless Asset. 3. The Capital Market Line 4. CAPM. 5. The One-Fund Theorem

Chapter 8: CAPM. 1. Single Index Model. 2. Adding a Riskless Asset. 3. The Capital Market Line 4. CAPM. 5. The One-Fund Theorem Chapter 8: CAPM 1. Single Index Model 2. Adding a Riskless Asset 3. The Capital Market Line 4. CAPM 5. The One-Fund Theorem 6. The Characteristic Line 7. The Pricing Model Single Index Model 1 1. Covariance

More information

Portfolio selection with multiple risk measures

Portfolio selection with multiple risk measures Portfolio selection with multiple risk measures Garud Iyengar Columbia University Industrial Engineering and Operations Research Joint work with Carlos Abad Outline Portfolio selection and risk measures

More information

Stochastic Volatility (SV) Models

Stochastic Volatility (SV) Models 1 Motivations Stochastic Volatility (SV) Models Jun Yu Some stylised facts about financial asset return distributions: 1. Distribution is leptokurtic 2. Volatility clustering 3. Volatility responds to

More information

Microeconomic Theory August 2013 Applied Economics. Ph.D. PRELIMINARY EXAMINATION MICROECONOMIC THEORY. Applied Economics Graduate Program

Microeconomic Theory August 2013 Applied Economics. Ph.D. PRELIMINARY EXAMINATION MICROECONOMIC THEORY. Applied Economics Graduate Program Ph.D. PRELIMINARY EXAMINATION MICROECONOMIC THEORY Applied Economics Graduate Program August 2013 The time limit for this exam is four hours. The exam has four sections. Each section includes two questions.

More information

Assicurazioni Generali: An Option Pricing Case with NAGARCH

Assicurazioni Generali: An Option Pricing Case with NAGARCH Assicurazioni Generali: An Option Pricing Case with NAGARCH Assicurazioni Generali: Business Snapshot Find our latest analyses and trade ideas on bsic.it Assicurazioni Generali SpA is an Italy-based insurance

More information

Asymptotic methods in risk management. Advances in Financial Mathematics

Asymptotic methods in risk management. Advances in Financial Mathematics Asymptotic methods in risk management Peter Tankov Based on joint work with A. Gulisashvili Advances in Financial Mathematics Paris, January 7 10, 2014 Peter Tankov (Université Paris Diderot) Asymptotic

More information

A New Multivariate Kurtosis and Its Asymptotic Distribution

A New Multivariate Kurtosis and Its Asymptotic Distribution A ew Multivariate Kurtosis and Its Asymptotic Distribution Chiaki Miyagawa 1 and Takashi Seo 1 Department of Mathematical Information Science, Graduate School of Science, Tokyo University of Science, Tokyo,

More information

Lecture IV Portfolio management: Efficient portfolios. Introduction to Finance Mathematics Fall Financial mathematics

Lecture IV Portfolio management: Efficient portfolios. Introduction to Finance Mathematics Fall Financial mathematics Lecture IV Portfolio management: Efficient portfolios. Introduction to Finance Mathematics Fall 2014 Reduce the risk, one asset Let us warm up by doing an exercise. We consider an investment with σ 1 =

More information

A simple wealth model

A simple wealth model Quantitative Macroeconomics Raül Santaeulàlia-Llopis, MOVE-UAB and Barcelona GSE Homework 5, due Thu Nov 1 I A simple wealth model Consider the sequential problem of a household that maximizes over streams

More information

INTERTEMPORAL ASSET ALLOCATION: THEORY

INTERTEMPORAL ASSET ALLOCATION: THEORY INTERTEMPORAL ASSET ALLOCATION: THEORY Multi-Period Model The agent acts as a price-taker in asset markets and then chooses today s consumption and asset shares to maximise lifetime utility. This multi-period

More information

Operational Risk Aggregation

Operational Risk Aggregation Operational Risk Aggregation Professor Carol Alexander Chair of Risk Management and Director of Research, ISMA Centre, University of Reading, UK. Loss model approaches are currently a focus of operational

More information

Analyzing Oil Futures with a Dynamic Nelson-Siegel Model

Analyzing Oil Futures with a Dynamic Nelson-Siegel Model Analyzing Oil Futures with a Dynamic Nelson-Siegel Model NIELS STRANGE HANSEN & ASGER LUNDE DEPARTMENT OF ECONOMICS AND BUSINESS, BUSINESS AND SOCIAL SCIENCES, AARHUS UNIVERSITY AND CENTER FOR RESEARCH

More information

Semiparametric Modeling, Penalized Splines, and Mixed Models

Semiparametric Modeling, Penalized Splines, and Mixed Models Semi 1 Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University http://wwworiecornelledu/~davidr January 24 Joint work with Babette Brumback, Ray Carroll, Brent Coull,

More information

A way to improve incremental 2-norm condition estimation

A way to improve incremental 2-norm condition estimation A way to improve incremental 2-norm condition estimation Jurjen Duintjer Tebbens Institute of Computer Science Academy of Sciences of the Czech Republic duintjertebbens@cs.cas.cz Miroslav Tůma Institute

More information

Logit Models for Binary Data

Logit Models for Binary Data Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis These models are appropriate when the response

More information

ARCH and GARCH models

ARCH and GARCH models ARCH and GARCH models Fulvio Corsi SNS Pisa 5 Dic 2011 Fulvio Corsi ARCH and () GARCH models SNS Pisa 5 Dic 2011 1 / 21 Asset prices S&P 500 index from 1982 to 2009 1600 1400 1200 1000 800 600 400 200

More information

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Midterm

Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Midterm Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay Solutions to Midterm Problem A: (34 pts) Answer briefly the following questions. Each question has

More information

Research Memo: Adding Nonfarm Employment to the Mixed-Frequency VAR Model

Research Memo: Adding Nonfarm Employment to the Mixed-Frequency VAR Model Research Memo: Adding Nonfarm Employment to the Mixed-Frequency VAR Model Kenneth Beauchemin Federal Reserve Bank of Minneapolis January 2015 Abstract This memo describes a revision to the mixed-frequency

More information

Lecture Note of Bus 41202, Spring 2008: More Volatility Models. Mr. Ruey Tsay

Lecture Note of Bus 41202, Spring 2008: More Volatility Models. Mr. Ruey Tsay Lecture Note of Bus 41202, Spring 2008: More Volatility Models. Mr. Ruey Tsay The EGARCH model Asymmetry in responses to + & returns: g(ɛ t ) = θɛ t + γ[ ɛ t E( ɛ t )], with E[g(ɛ t )] = 0. To see asymmetry

More information

A RIDGE REGRESSION ESTIMATION APPROACH WHEN MULTICOLLINEARITY IS PRESENT

A RIDGE REGRESSION ESTIMATION APPROACH WHEN MULTICOLLINEARITY IS PRESENT Fundamental Journal of Applied Sciences Vol. 1, Issue 1, 016, Pages 19-3 This paper is available online at http://www.frdint.com/ Published online February 18, 016 A RIDGE REGRESSION ESTIMATION APPROACH

More information

Inferences on Correlation Coefficients of Bivariate Log-normal Distributions

Inferences on Correlation Coefficients of Bivariate Log-normal Distributions Inferences on Correlation Coefficients of Bivariate Log-normal Distributions Guoyi Zhang 1 and Zhongxue Chen 2 Abstract This article considers inference on correlation coefficients of bivariate log-normal

More information

Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University

Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University Possible Model SBMD i,j is spinal bone mineral density on ith subject at age equal to age i,j lide http://wwworiecornelledu/~davidr

More information

The mean-variance portfolio choice framework and its generalizations

The mean-variance portfolio choice framework and its generalizations The mean-variance portfolio choice framework and its generalizations Prof. Massimo Guidolin 20135 Theory of Finance, Part I (Sept. October) Fall 2014 Outline and objectives The backward, three-step solution

More information

MAFS Computational Methods for Pricing Structured Products

MAFS Computational Methods for Pricing Structured Products MAFS550 - Computational Methods for Pricing Structured Products Solution to Homework Two Course instructor: Prof YK Kwok 1 Expand f(x 0 ) and f(x 0 x) at x 0 into Taylor series, where f(x 0 ) = f(x 0 )

More information