RESEARCH ARTICLE. The Penalized Biclustering Model And Related Algorithms Supplemental Online Material
|
|
- Clyde Hoover
- 5 years ago
- Views:
Transcription
1 Journal of Applied Statistics Vol. 00, No. 00, Month 00x, 8 RESEARCH ARTICLE The Penalized Biclustering Model And Related Algorithms Supplemental Online Material Thierry Cheouo and Alejandro Murua Département de mathématiques et de statistique, Université de Montréal, CP 68, succ. centre-ville, Montréal, Québec H3C 3J7 Canada Received 00 Month 00x; in final form 00 Month 00x) A. Introduction In these sections we provide further details on the EM and Bayesian algorithms described in the main body of the paper. Section B provides the EM updating equations for the plaid model of Section. Section C displays the full conditional distributions of the labels and parameters associated with the penalized plaid model of Section 4. The procedure to initialize the parameters and labels for the Marov chain Monte Carlo implementation of the penalized plaid model is described in Section D. Section E gives the URL addresses of the biclustering pacages used in this study, including our own pacage implementing the penalized plaid model. B. The EM updating equations Note that the bicluster and combination bicluster probabilities pρ i κ j ) are constants depending only on the combination bicluster. We will denote them as π, = 0,,..., K. Observe that pρ, κ Y, θ) = pρ i, κ j y ij, θ). It is straightforward to verify that pρ i, κ j y ij, θ) = pρ i κ j y ij, θ) = ρ i κ j π cb) σρ φ i κ j) y ij µρ i κ j, θ))/σρ i κ j ) ) π cb ) σρ i κ j) φ y ij µρ i κ j, θ))/σρ ), i κ j ) where cb) and cb ) denote the combination biclusters associated to ρ i, κ j ) and ρ i, κ j ), respectively. The maximizer of Qθ θ) for the plaid model is obtained by Corresponding author: A. Murua. murua@dms.umontreal.ca ISSN: print/issn online c 00x Taylor & Francis DOI: 0.080/066476YYxxxxxxxx
2 T. Cheouo & A. Murua taing the derivatives with respect to θ. These yield: µ = E θρ i κ j ) E θρ i κ j )y ij E θρ i κ j ρ i κ j )µ + α i + β j ) α i = j E θρ i κ j ) E θρ i κ j )y ij E θρ i κ j ρ i κ j )µ + α i + β j ) µ j β j = i E θρ i κ j ) E θρ i κ j )y ij E θρ i κ j ρ i κ j )µ + α i + β j ) µ i π cb) pρ i κ j = ρ cb) κ cb) y ij, θ), where ρ cb) κ cb) denotes the corresponding -th combination bicluster, and σ = qp = qp E θ y ij ) ρ i κ j µ + α i + β j ) yij E θρ i κ j )y ij µ + α i + β j ) + E θρ i κ j ρ i κ j )µ + α i + β j )µ + α i + β j )., Note that the updating equations are recursive. The parameters can be estimated using a Gauss-Seidel relaxation scheme over =,..., K. For example, let the superscript t + ) denote the coefficients recently updated, and the superscript t), the coefficients not yet updated. Then in order to solve the system, say for α i s, we iterate for within the EM iterations α t+) i = j E θ ρ iκ j ) E θρ i κ j )y ij j < E θρ i κ j ρ i κ j )µ t+) + β t+) j ) E θρ i κ j ρ i κ j )µ t) + α t) i + β t) j ) > } µ t+). + α t+) i Also note that the expectation is intractable if the number of biclusters K is large. For example, one needs to compute E θρ i κ j ) = ρ i κ j pρ i κ j y ij, θ), which is ρ iκ j= a sum involving K terms. In the non-overlapping bicluster model, the sum reduces to one term E θρ i κ j ) = pρ i κ j =, ρ i κ j ) = y ij, θ). In this latter
3 case, the updating equations simplify to Journal of Applied Statistics 3 µ = E θ ρ iκ j ) E θρ i κ j )y ij α i = j E θ ρ iκ j ) E θρ i κ j )y ij µ β j = i E θ ρ iκ j ) E θρ i κ j )y ij µ σ = E θ ρ iκ j ) j i E θρ i κ j ) y ij µ α i β j ) π cb) pρ i κ j = y ij, θ). C. The penalized plaid model : the full conditionals The labels Note that the lielihood may be written as exp yij γ ij ) ρ } iκ j µ + α i + β j ) ) + log σρ i, κ j ) σρ i, κ j ) γ ij y )} ij µ 0 ) + log σ0. σ 0 Let the bicluster, be fixed. Define the variables z ij = y ij ρ i κ j µ α i β j ), α = α i ) i I R r, β = β j ) j J R c. To find the full conditional of the labels, say ρ i, we use the fact that K y ij ρ i κ j µ + α i + β j ) = z ij ρ i κ j µ + α i + β j ) = = ρ i κ j z ij µ α i β j ) + ρ i )z ij + ρ i κ j )z ij. Note that γ ij = K = ρ iκ j ). For a given, we will write γ ij = K =
4 4 T. Cheouo & A. Murua ρ i κ j ). Then ρ i κ j γ ij ) = ρ i κ j γ ij ρ i κ j )ρ i κ j = ρ i κ j. We have j γ ij ) z ij ρ i κ j µ + α i + β j )) σ ρ i κ j ) z ij µ α i β = ρ j) i σ ρ i κ j ) j J + ρ i ) j z ij µ α i β = ρ j) i σ ρ i κ j ) j J zij γ ij ) σ ρ i κ j ) + ρ zij i γ ij ) σ ρ i κ j ) j / J + ρ i ) zij γ ij ) σ ρ i κ j ) + zij γ ij ) σ ρ i κ j ). j J j / J As before, let θ denote the set of parameters of the model. Define A i = exp } z ij µ α i β j) σ ρ i κ j ) j J B i = exp σ0 C i = exp D i,ρi γ ij y ij µ 0 ) j J γ ij ) j J = exp γ ij ) j / J } σ 0 ) / σ ρ i κ j ), j J ) j J γ ij/, z ij σ ρ i κ j ) + log σ ρ i κ j ) )}, zij )} σ ρ i κ j ) + log σ ρ i κ j ). Also let ρ i) denote the set of all row labels except ρ i. From the above equation it is straightforward to verify that the full conditionals of ρ i satisfy pρ i y ij }, ρ i), κ), θ) A ρi i B ρi i C ρi i D i,ρi πρ i ), where πρ i ) = exp λ j K = ρ i κ j + γ ij + γ ij )κ j ρ i ) }. In particular, the ratio pρ i = y ij }, ρ i), κ), θ)/pρ i = 0 y ij }, ρ i), κ), θ) is given by A i B i C i D i,d i,0 exp λ j γ ij )κ j }. The term D i,ρi may be ignored for models whose variances do not depend on i, j). In particular, for the plaid model, the logarithm of this ratio is σ j J z ij µ α i β j ) + γ ij )zij +γ ijy ij µ 0 ) } λ j γ ij)κ j. The full conditional for κ j s are found in a similar way by symmetry.
5 The row and column effects Journal of Applied Statistics 5 Define the matrices R = diag j J σ ρ i, κ j )), and C = diag i I σ ρ i, κ j )). Let m denote the vector of all s in R m. Since the variance of α is given by σαv = σαi r r r r ), we may write α = V a for a random vector a N0, σαi r ). It is easy to verify that the full conditional of a is a multivariate normal with mean µ a, and variance Σ a, given by µ a, = V R V + σ α I r ) V z α,, Σ a, = V R V + σ α I r ), where z α, = j J z ij µ β j )/σ ρ i, κ j )) i I. Similarly, let U = I c c c c ). We may write β = U b for a random vector b N0, σ β I c ). It is easy to verify that the full conditional of b is a multivariate normal with mean µ b, and variance Σ b, given by µ b, = U C U + σ β I c ) U z β,, Σ b, = U C U + σ β I c ), where z β, = i I z ij µ α i )/σ ρ i, κ j )) j J. For the plaid model σρ i, κ j ) = σ, and for the model of Cheng and Church [], σρ i, κ j ) = σ. In both cases the variance is constant on each bicluster. Therefore, for these models, R = σ c I r and C = σ r I c. Hence the conditional means and variances for a and b become µ a, = Σ a, = µ b, = Σ b, = c σ c σ r σ r σ + σ α + σ α + σ β ) c σ z i z ) i I, ) I r + c σα r σ r r ), ) r z j z ) j J, σ + ) σα I c + r σβ c σ c c ), where z denotes the mean of the values of z ij in the bicluster, and z i = j J z ij /c, and z j = i I z ij /r. It can be easily shown that the full conditionals of the means µ, = 0,,..., K are also normal distributions with means and variances given by µ µ, = σ µ Σ µ, = σ µ + ) B + ) B ) z ij α i β j σ ρ i, κ j ) σ, ρ i, κ j ) ) B ). σ ρ i, κ j ) Again, for the plaid and Cheng and Church models, the means and variances
6 6 T. Cheouo & A. Murua simplify to µ µ, = σ µ + n ) n σ σ z, Σ µ, = σµ + n ) σ. Note that when σµ, σα, and σβ estimators. tend to infinity we obtain the hard-em or ICM) The variances Let z ij = y ij ρ i κ j µ + α i + β j ). The full conditionals of the variances are proportional to σ exp zij + ) νs σ + νs ) B σ } n logσ ) ν + ) log σ. If we suppose that there is no overlapping among the biclusters, we obtain the Cheng and Church model []. The corresponding full conditional of σ is an inverseχ distribution with scale νs + ) B zij )/ν + n ), and ν + n degrees of freedom. If instead we suppose that σρ i, κ j ) = σ independently of the cell i, j) i.e., σ = σ for all = 0,,..., K), then we obtain the full conditional of σ for the plaid model. This is also an inverse-χ distribution, but this time with scale νs + z ij )/ν + pq), and ν + pq degrees of freedom. D. The penalized plaid model: initial values Finding the initial membership labels ρ, κ) is a difficult tas. Several procedures have been suggested in the literature. We have adopted a technique similar to that of Turner et al. [3]. We run two independent -means algorithms [4] with = : once for the rows and once for the columns. Using the Cartesian product of the resulting -means row and column labels, we divide the data matrix into four groups. A single initial bicluster is chosen among these four groups according to a variance criterion explained a few lines below. The procedure is repeated as many times as the number of initial biclusters needed. A single initial bicluster is chosen after each application of the independent row and column -means algorithms. The elements of the biclusters already chosen are mased by replacing their original values y ij by random values. This is done so that in the next iteration a different group may be chosen. The masing procedure is not new. It has been used before by Sheng, Moreau and De Moor [] to determinate multiple biclusters. The criterion to choose an initial bicluster among the four groups yielded by each iteration is the following. Suppose that the cells of each group follow a random effect additive ANOVA model. That is, on each group g, y ij = µ g + α ig + β jg + ɛ ij, g =,, 3, 4. The standard moment estimates of the variances are ˆσ gα = MSS gα) MSS g e), ˆσ gβ c = MSS gβ) MSS g e), ˆσ ge = MSS g e), ) g r g where r g is the number of rows in the g-th group, c g is the number of columns, and MSS g e), MSS g α) and MSS g β) are the corresponding mean sum of squares for error, rows and columns, respectively. We select as an initial bicluster the group g
7 REFERENCES 7 that maximizes ˆσ gα + ˆσ gβ )/ˆσ ge. For each initial bicluster, the parameters µ g, α ig and β jg are initialized as ȳ.., ȳ i. ȳ.., ȳ.j ȳ.., respectively, where y.., ȳ i., ȳ.j stand for the overall bicluster mean, the bicluster i-th row mean, and the bicluster j-th column mean, respectively. The parameter µ 0 is estimated as the arithmetic mean of the zero-bicluster. The variance σ is initialized as the mean sum of squares of all the residuals. E. Biclustering algorithm pacages The following table gives the URLs associated with the biclustering pacages used in the comparison study presented in Section 5 of the main body of this paper. The pacage names are given in parenthesis under the associated algorithm names. Algorithm FABIA fabia) Description and Address R pacage: Factor Analysis for Bicluster Acquisition SAMBA EXPANDER) EXpression Analyzer and DisplayER: java-based tool for analysis of gene expression and NGS data acgt.cs.tau.ac.il/expander/overview.html TUR biclust) R pacage: the plaid model implementation of Turner and al. cran.r-project.org/web/pacages/biclust/ CC biclust) R pacage: original method suggested by Cheng and Church to fit a mixture model cran.r-project.org/web/pacages/biclust/ Spectral biclust) R pacage: uses a singular value decomposition cran.r-project.org/web/pacages/biclust/ Penalized plaid penalizedplaid) java pacage: The models are: ) the non-overlapping bicluster model; ) the plaid model; and 3) the penalized plaid model. The estimation methods are: a) the Gibbs sampler and b) the Metropolis-Hastings algorithm References [] Y. Cheng and G. Church, Biclustering of expression data, Int. Conf. Intelligent Systems for Molecular Biology 000), pp Next Generation Sequencing
8 8 REFERENCES [] Q. Sheng, Y. Moreau, and B. De Moor, Biclustering microarray data by Gibbs sampling, Bioinformatics 9 003), pp. ii96 ii05. [3] H. Turner, T. Bailey, and W. Krzanowsi, Improved biclustering of microarray data demonstrated through systematic performance tests, Computational Statistics & Data Analysis ), pp [4] J.H. Ward, Hierarchical groupings to optimize an objective function, J. American Statistical Association ), pp
A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples
1.3 Regime switching models A potentially useful approach to model nonlinearities in time series is to assume different behavior (structural break) in different subsamples (or regimes). If the dates, the
More informationChapter 5 Univariate time-series analysis. () Chapter 5 Univariate time-series analysis 1 / 29
Chapter 5 Univariate time-series analysis () Chapter 5 Univariate time-series analysis 1 / 29 Time-Series Time-series is a sequence fx 1, x 2,..., x T g or fx t g, t = 1,..., T, where t is an index denoting
More informationBayesian Linear Model: Gory Details
Bayesian Linear Model: Gory Details Pubh7440 Notes By Sudipto Banerjee Let y y i ] n i be an n vector of independent observations on a dependent variable (or response) from n experimental units. Associated
More informationEvaluation of a New Variance Components Estimation Method Modi ed Henderson s Method 3 With the Application of Two Way Mixed Model
Evaluation of a New Variance Components Estimation Method Modi ed Henderson s Method 3 With the Application of Two Way Mixed Model Author: Weigang Qie; Chenfan Xu Supervisor: Lars Rönnegård June 0th, 009
More informationStatistical Inference and Methods
Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 14th February 2006 Part VII Session 7: Volatility Modelling Session 7: Volatility Modelling
More informationCOS 513: Gibbs Sampling
COS 513: Gibbs Sampling Matthew Salesi December 6, 2010 1 Overview Concluding the coverage of Markov chain Monte Carlo (MCMC) sampling methods, we look today at Gibbs sampling. Gibbs sampling is a simple
More informationA Practical Implementation of the Gibbs Sampler for Mixture of Distributions: Application to the Determination of Specifications in Food Industry
A Practical Implementation of the for Mixture of Distributions: Application to the Determination of Specifications in Food Industry Julien Cornebise 1 Myriam Maumy 2 Philippe Girard 3 1 Ecole Supérieure
More informationROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices
ROM SIMULATION Exact Moment Simulation using Random Orthogonal Matrices Bachelier Finance Society Meeting Toronto 2010 Henley Business School at Reading Contact Author : d.ledermann@icmacentre.ac.uk Alexander
More informationExtend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty
Extend the ideas of Kan and Zhou paper on Optimal Portfolio Construction under parameter uncertainty George Photiou Lincoln College University of Oxford A dissertation submitted in partial fulfilment for
More informationTechnical Appendix: Policy Uncertainty and Aggregate Fluctuations.
Technical Appendix: Policy Uncertainty and Aggregate Fluctuations. Haroon Mumtaz Paolo Surico July 18, 2017 1 The Gibbs sampling algorithm Prior Distributions and starting values Consider the model to
More informationTwo hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER
Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS Answer any FOUR of the SIX questions.
More informationBivariate Birnbaum-Saunders Distribution
Department of Mathematics & Statistics Indian Institute of Technology Kanpur January 2nd. 2013 Outline 1 Collaborators 2 3 Birnbaum-Saunders Distribution: Introduction & Properties 4 5 Outline 1 Collaborators
More informationRisk Management and Time Series
IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Risk Management and Time Series Time series models are often employed in risk management applications. They can be used to estimate
More informationCourse information FN3142 Quantitative finance
Course information 015 16 FN314 Quantitative finance This course is aimed at students interested in obtaining a thorough grounding in market finance and related empirical methods. Prerequisite If taken
More informationSTA258 Analysis of Variance
STA258 Analysis of Variance Al Nosedal. University of Toronto. Winter 2017 The Data Matrix The following table shows last year s sales data for a small business. The sample is put into a matrix format
More informationA Multivariate Analysis of Intercompany Loss Triangles
A Multivariate Analysis of Intercompany Loss Triangles Peng Shi School of Business University of Wisconsin-Madison ASTIN Colloquium May 21-24, 2013 Peng Shi (Wisconsin School of Business) Intercompany
More informationNonresponse Adjustment of Survey Estimates Based on. Auxiliary Variables Subject to Error. Brady T. West. University of Michigan, Ann Arbor, MI, USA
Nonresponse Adjustment of Survey Estimates Based on Auxiliary Variables Subject to Error Brady T West University of Michigan, Ann Arbor, MI, USA Roderick JA Little University of Michigan, Ann Arbor, MI,
More informationSELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN LASSO QUANTILE REGRESSION
Vol. 6, No. 1, Summer 2017 2012 Published by JSES. SELECTION OF VARIABLES INFLUENCING IRAQI BANKS DEPOSITS BY USING NEW BAYESIAN Fadel Hamid Hadi ALHUSSEINI a Abstract The main focus of the paper is modelling
More informationMacroeconometric Modeling: 2018
Macroeconometric Modeling: 2018 Contents Ray C. Fair 2018 1 Macroeconomic Methodology 4 1.1 The Cowles Commission Approach................. 4 1.2 Macroeconomic Methodology.................... 5 1.3 The
More informationDefinition 9.1 A point estimate is any function T (X 1,..., X n ) of a random sample. We often write an estimator of the parameter θ as ˆθ.
9 Point estimation 9.1 Rationale behind point estimation When sampling from a population described by a pdf f(x θ) or probability function P [X = x θ] knowledge of θ gives knowledge of the entire population.
More informationIntroduction to the Maximum Likelihood Estimation Technique. September 24, 2015
Introduction to the Maximum Likelihood Estimation Technique September 24, 2015 So far our Dependent Variable is Continuous That is, our outcome variable Y is assumed to follow a normal distribution having
More informationSupplementary online material to Information tradeoffs in dynamic financial markets
Supplementary online material to Information tradeoffs in dynamic financial markets Efstathios Avdis University of Alberta, Canada 1. The value of information in continuous time In this document I address
More informationLecture 3: Factor models in modern portfolio choice
Lecture 3: Factor models in modern portfolio choice Prof. Massimo Guidolin Portfolio Management Spring 2016 Overview The inputs of portfolio problems Using the single index model Multi-index models Portfolio
More informationEstimating a Dynamic Oligopolistic Game with Serially Correlated Unobserved Production Costs. SS223B-Empirical IO
Estimating a Dynamic Oligopolistic Game with Serially Correlated Unobserved Production Costs SS223B-Empirical IO Motivation There have been substantial recent developments in the empirical literature on
More informationCalibration of Interest Rates
WDS'12 Proceedings of Contributed Papers, Part I, 25 30, 2012. ISBN 978-80-7378-224-5 MATFYZPRESS Calibration of Interest Rates J. Černý Charles University, Faculty of Mathematics and Physics, Prague,
More informationLattice (Binomial Trees) Version 1.2
Lattice (Binomial Trees) Version 1. 1 Introduction This plug-in implements different binomial trees approximations for pricing contingent claims and allows Fairmat to use some of the most popular binomial
More informationFinancial Econometrics
Financial Econometrics Volatility Gerald P. Dwyer Trinity College, Dublin January 2013 GPD (TCD) Volatility 01/13 1 / 37 Squared log returns for CRSP daily GPD (TCD) Volatility 01/13 2 / 37 Absolute value
More information1 Explaining Labor Market Volatility
Christiano Economics 416 Advanced Macroeconomics Take home midterm exam. 1 Explaining Labor Market Volatility The purpose of this question is to explore a labor market puzzle that has bedeviled business
More informationPosterior Inference. , where should we start? Consider the following computational procedure: 1. draw samples. 2. convert. 3. compute properties
Posterior Inference Example. Consider a binomial model where we have a posterior distribution for the probability term, θ. Suppose we want to make inferences about the log-odds γ = log ( θ 1 θ), where
More informationPortfolio Optimization. Prof. Daniel P. Palomar
Portfolio Optimization Prof. Daniel P. Palomar The Hong Kong University of Science and Technology (HKUST) MAFS6010R- Portfolio Optimization with R MSc in Financial Mathematics Fall 2018-19, HKUST, Hong
More informationModule 2: Monte Carlo Methods
Module 2: Monte Carlo Methods Prof. Mike Giles mike.giles@maths.ox.ac.uk Oxford University Mathematical Institute MC Lecture 2 p. 1 Greeks In Monte Carlo applications we don t just want to know the expected
More informationModelling strategies for bivariate circular data
Modelling strategies for bivariate circular data John T. Kent*, Kanti V. Mardia, & Charles C. Taylor Department of Statistics, University of Leeds 1 Introduction On the torus there are two common approaches
More informationSTA218 Analysis of Variance
STA218 Analysis of Variance Al Nosedal. University of Toronto. Fall 2017 November 27, 2017 The Data Matrix The following table shows last year s sales data for a small business. The sample is put into
More informationMissing Data. EM Algorithm and Multiple Imputation. Aaron Molstad, Dootika Vats, Li Zhong. University of Minnesota School of Statistics
Missing Data EM Algorithm and Multiple Imputation Aaron Molstad, Dootika Vats, Li Zhong University of Minnesota School of Statistics December 4, 2013 Overview 1 EM Algorithm 2 Multiple Imputation Incomplete
More informationFinancial intermediaries in an estimated DSGE model for the UK
Financial intermediaries in an estimated DSGE model for the UK Stefania Villa a Jing Yang b a Birkbeck College b Bank of England Cambridge Conference - New Instruments of Monetary Policy: The Challenges
More informationFinancial Risk Management
Financial Risk Management Professor: Thierry Roncalli Evry University Assistant: Enareta Kurtbegu Evry University Tutorial exercices #4 1 Correlation and copulas 1. The bivariate Gaussian copula is given
More informationEstimating Market Power in Differentiated Product Markets
Estimating Market Power in Differentiated Product Markets Metin Cakir Purdue University December 6, 2010 Metin Cakir (Purdue) Market Equilibrium Models December 6, 2010 1 / 28 Outline Outline Estimating
More informationRisk Margin Quantile Function Via Parametric and Non-Parametric Bayesian Quantile Regression
Risk Margin Quantile Function Via Parametric and Non-Parametric Bayesian Quantile Regression Alice X.D. Dong 1 Jennifer S.K. Chan 1 Gareth W. Peters 2 Working paper, version from February 12, 2014 arxiv:1402.2492v1
More informationModelling, Estimation and Hedging of Longevity Risk
IA BE Summer School 2016, K. Antonio, UvA 1 / 50 Modelling, Estimation and Hedging of Longevity Risk Katrien Antonio KU Leuven and University of Amsterdam IA BE Summer School 2016, Leuven Module II: Fitting
More informationELEMENTS OF MATRIX MATHEMATICS
QRMC07 9/7/0 4:45 PM Page 5 CHAPTER SEVEN ELEMENTS OF MATRIX MATHEMATICS 7. AN INTRODUCTION TO MATRICES Investors frequently encounter situations involving numerous potential outcomes, many discrete periods
More informationWhat s New in Econometrics. Lecture 11
What s New in Econometrics Lecture 11 Discrete Choice Models Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Multinomial and Conditional Logit Models 3. Independence of Irrelevant Alternatives
More informationPremia 14 HESTON MODEL CALIBRATION USING VARIANCE SWAPS PRICES
Premia 14 HESTON MODEL CALIBRATION USING VARIANCE SWAPS PRICES VADIM ZHERDER Premia Team INRIA E-mail: vzherder@mailru 1 Heston model Let the asset price process S t follows the Heston stochastic volatility
More informationExtended Model: Posterior Distributions
APPENDIX A Extended Model: Posterior Distributions A. Homoskedastic errors Consider the basic contingent claim model b extended by the vector of observables x : log C i = β log b σ, x i + β x i + i, i
More informationFinal exam solutions
EE365 Stochastic Control / MS&E251 Stochastic Decision Models Profs. S. Lall, S. Boyd June 5 6 or June 6 7, 2013 Final exam solutions This is a 24 hour take-home final. Please turn it in to one of the
More informationOil and macroeconomic (in)stability
Oil and macroeconomic (in)stability Hilde C. Bjørnland Vegard H. Larsen Centre for Applied Macro- and Petroleum Economics (CAMP) BI Norwegian Business School CFE-ERCIM December 07, 2014 Bjørnland and Larsen
More information1. You are given the following information about a stationary AR(2) model:
Fall 2003 Society of Actuaries **BEGINNING OF EXAMINATION** 1. You are given the following information about a stationary AR(2) model: (i) ρ 1 = 05. (ii) ρ 2 = 01. Determine φ 2. (A) 0.2 (B) 0.1 (C) 0.4
More informationBayesian Inference for Random Coefficient Dynamic Panel Data Models
Bayesian Inference for Random Coefficient Dynamic Panel Data Models By Peng Zhang and Dylan Small* 1 Department of Statistics, The Wharton School, University of Pennsylvania Abstract We develop a hierarchical
More informationMarket Risk Analysis Volume I
Market Risk Analysis Volume I Quantitative Methods in Finance Carol Alexander John Wiley & Sons, Ltd List of Figures List of Tables List of Examples Foreword Preface to Volume I xiii xvi xvii xix xxiii
More information"Pricing Exotic Options using Strong Convergence Properties
Fourth Oxford / Princeton Workshop on Financial Mathematics "Pricing Exotic Options using Strong Convergence Properties Klaus E. Schmitz Abe schmitz@maths.ox.ac.uk www.maths.ox.ac.uk/~schmitz Prof. Mike
More informationSolving dynamic portfolio choice problems by recursing on optimized portfolio weights or on the value function?
DOI 0.007/s064-006-9073-z ORIGINAL PAPER Solving dynamic portfolio choice problems by recursing on optimized portfolio weights or on the value function? Jules H. van Binsbergen Michael W. Brandt Received:
More informationModelling Returns: the CER and the CAPM
Modelling Returns: the CER and the CAPM Carlo Favero Favero () Modelling Returns: the CER and the CAPM 1 / 20 Econometric Modelling of Financial Returns Financial data are mostly observational data: they
More informationECON 815. A Basic New Keynesian Model II
ECON 815 A Basic New Keynesian Model II Winter 2015 Queen s University ECON 815 1 Unemployment vs. Inflation 12 10 Unemployment 8 6 4 2 0 1 1.5 2 2.5 3 3.5 4 4.5 5 Core Inflation 14 12 10 Unemployment
More informationOnline Appendix (Not intended for Publication): Federal Reserve Credibility and the Term Structure of Interest Rates
Online Appendix Not intended for Publication): Federal Reserve Credibility and the Term Structure of Interest Rates Aeimit Lakdawala Michigan State University Shu Wu University of Kansas August 2017 1
More informationApplications of Good s Generalized Diversity Index. A. J. Baczkowski Department of Statistics, University of Leeds Leeds LS2 9JT, UK
Applications of Good s Generalized Diversity Index A. J. Baczkowski Department of Statistics, University of Leeds Leeds LS2 9JT, UK Internal Report STAT 98/11 September 1998 Applications of Good s Generalized
More informationinduced by the Solvency II project
Asset Les normes allocation IFRS : new en constraints assurance induced by the Solvency II project 36 th International ASTIN Colloquium Zürich September 005 Frédéric PLANCHET Pierre THÉROND ISFA Université
More informationApplication of MCMC Algorithm in Interest Rate Modeling
Application of MCMC Algorithm in Interest Rate Modeling Xiaoxia Feng and Dejun Xie Abstract Interest rate modeling is a challenging but important problem in financial econometrics. This work is concerned
More informationMonetary Economics Final Exam
316-466 Monetary Economics Final Exam 1. Flexible-price monetary economics (90 marks). Consider a stochastic flexibleprice money in the utility function model. Time is discrete and denoted t =0, 1,...
More informationMethods and Models of Loss Reserving Based on Run Off Triangles: A Unifying Survey
Methods and Models of Loss Reserving Based on Run Off Triangles: A Unifying Survey By Klaus D Schmidt Lehrstuhl für Versicherungsmathematik Technische Universität Dresden Abstract The present paper provides
More informationA Bayesian model for classifying all differentially expressed proteins simultaneously in 2D PAGE gels
BMC Bioinformatics This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. A Bayesian model for classifying
More informationEquity correlations implied by index options: estimation and model uncertainty analysis
1/18 : estimation and model analysis, EDHEC Business School (joint work with Rama COT) Modeling and managing financial risks Paris, 10 13 January 2011 2/18 Outline 1 2 of multi-asset models Solution to
More informationUnobserved Heterogeneity Revisited
Unobserved Heterogeneity Revisited Robert A. Miller Dynamic Discrete Choice March 2018 Miller (Dynamic Discrete Choice) cemmap 7 March 2018 1 / 24 Distributional Assumptions about the Unobserved Variables
More informationIdentifying Long-Run Risks: A Bayesian Mixed-Frequency Approach
Identifying : A Bayesian Mixed-Frequency Approach Frank Schorfheide University of Pennsylvania CEPR and NBER Dongho Song University of Pennsylvania Amir Yaron University of Pennsylvania NBER February 12,
More informationContinuous Distributions
Quantitative Methods 2013 Continuous Distributions 1 The most important probability distribution in statistics is the normal distribution. Carl Friedrich Gauss (1777 1855) Normal curve A normal distribution
More informationmay be of interest. That is, the average difference between the estimator and the truth. Estimators with Bias(ˆθ) = 0 are called unbiased.
1 Evaluating estimators Suppose you observe data X 1,..., X n that are iid observations with distribution F θ indexed by some parameter θ. When trying to estimate θ, one may be interested in determining
More information4 Reinforcement Learning Basic Algorithms
Learning in Complex Systems Spring 2011 Lecture Notes Nahum Shimkin 4 Reinforcement Learning Basic Algorithms 4.1 Introduction RL methods essentially deal with the solution of (optimal) control problems
More informationEE641 Digital Image Processing II: Purdue University VISE - October 29,
EE64 Digital Image Processing II: Purdue University VISE - October 9, 004 The EM Algorithm. Suffient Statistics and Exponential Distributions Let p(y θ) be a family of density functions parameterized by
More informationConstruction and behavior of Multinomial Markov random field models
Graduate Theses and Dissertations Iowa State University Capstones, Theses and Dissertations 2010 Construction and behavior of Multinomial Markov random field models Kim Mueller Iowa State University Follow
More information# generate data num.obs <- 100 y <- rnorm(num.obs,mean = theta.true, sd = sqrt(sigma.sq.true))
Posterior Sampling from Normal Now we seek to create draws from the joint posterior distribution and the marginal posterior distributions and Note the marginal posterior distributions would be used to
More informationJohn Hull, Risk Management and Financial Institutions, 4th Edition
P1.T2. Quantitative Analysis John Hull, Risk Management and Financial Institutions, 4th Edition Bionic Turtle FRM Video Tutorials By David Harper, CFA FRM 1 Chapter 10: Volatility (Learning objectives)
More informationIdiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective
Idiosyncratic risk, insurance, and aggregate consumption dynamics: a likelihood perspective Alisdair McKay Boston University June 2013 Microeconomic evidence on insurance - Consumption responds to idiosyncratic
More informationPoint Estimators. STATISTICS Lecture no. 10. Department of Econometrics FEM UO Brno office 69a, tel
STATISTICS Lecture no. 10 Department of Econometrics FEM UO Brno office 69a, tel. 973 442029 email:jiri.neubauer@unob.cz 8. 12. 2009 Introduction Suppose that we manufacture lightbulbs and we want to state
More informationThe change of correlation structure across industries: an analysis in the regime-switching framework
Kyoto University, Graduate School of Economics Research Project Center Discussion Paper Series The change of correlation structure across industries: an analysis in the regime-switching framework Masahiko
More informationWindow Width Selection for L 2 Adjusted Quantile Regression
Window Width Selection for L 2 Adjusted Quantile Regression Yoonsuh Jung, The Ohio State University Steven N. MacEachern, The Ohio State University Yoonkyung Lee, The Ohio State University Technical Report
More informationNotes on the EM Algorithm Michael Collins, September 24th 2005
Notes on the EM Algorithm Michael Collins, September 24th 2005 1 Hidden Markov Models A hidden Markov model (N, Σ, Θ) consists of the following elements: N is a positive integer specifying the number of
More informationComparison of Pricing Approaches for Longevity Markets
Comparison of Pricing Approaches for Longevity Markets Melvern Leung Simon Fung & Colin O hare Longevity 12 Conference, Chicago, The Drake Hotel, September 30 th 2016 1 / 29 Overview Introduction 1 Introduction
More informationCHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION
CHOICE THEORY, UTILITY FUNCTIONS AND RISK AVERSION Szabolcs Sebestyén szabolcs.sebestyen@iscte.pt Master in Finance INVESTMENTS Sebestyén (ISCTE-IUL) Choice Theory Investments 1 / 65 Outline 1 An Introduction
More informationChapter 8: CAPM. 1. Single Index Model. 2. Adding a Riskless Asset. 3. The Capital Market Line 4. CAPM. 5. The One-Fund Theorem
Chapter 8: CAPM 1. Single Index Model 2. Adding a Riskless Asset 3. The Capital Market Line 4. CAPM 5. The One-Fund Theorem 6. The Characteristic Line 7. The Pricing Model Single Index Model 1 1. Covariance
More informationPortfolio selection with multiple risk measures
Portfolio selection with multiple risk measures Garud Iyengar Columbia University Industrial Engineering and Operations Research Joint work with Carlos Abad Outline Portfolio selection and risk measures
More informationStochastic Volatility (SV) Models
1 Motivations Stochastic Volatility (SV) Models Jun Yu Some stylised facts about financial asset return distributions: 1. Distribution is leptokurtic 2. Volatility clustering 3. Volatility responds to
More informationMicroeconomic Theory August 2013 Applied Economics. Ph.D. PRELIMINARY EXAMINATION MICROECONOMIC THEORY. Applied Economics Graduate Program
Ph.D. PRELIMINARY EXAMINATION MICROECONOMIC THEORY Applied Economics Graduate Program August 2013 The time limit for this exam is four hours. The exam has four sections. Each section includes two questions.
More informationAssicurazioni Generali: An Option Pricing Case with NAGARCH
Assicurazioni Generali: An Option Pricing Case with NAGARCH Assicurazioni Generali: Business Snapshot Find our latest analyses and trade ideas on bsic.it Assicurazioni Generali SpA is an Italy-based insurance
More informationAsymptotic methods in risk management. Advances in Financial Mathematics
Asymptotic methods in risk management Peter Tankov Based on joint work with A. Gulisashvili Advances in Financial Mathematics Paris, January 7 10, 2014 Peter Tankov (Université Paris Diderot) Asymptotic
More informationA New Multivariate Kurtosis and Its Asymptotic Distribution
A ew Multivariate Kurtosis and Its Asymptotic Distribution Chiaki Miyagawa 1 and Takashi Seo 1 Department of Mathematical Information Science, Graduate School of Science, Tokyo University of Science, Tokyo,
More informationLecture IV Portfolio management: Efficient portfolios. Introduction to Finance Mathematics Fall Financial mathematics
Lecture IV Portfolio management: Efficient portfolios. Introduction to Finance Mathematics Fall 2014 Reduce the risk, one asset Let us warm up by doing an exercise. We consider an investment with σ 1 =
More informationA simple wealth model
Quantitative Macroeconomics Raül Santaeulàlia-Llopis, MOVE-UAB and Barcelona GSE Homework 5, due Thu Nov 1 I A simple wealth model Consider the sequential problem of a household that maximizes over streams
More informationINTERTEMPORAL ASSET ALLOCATION: THEORY
INTERTEMPORAL ASSET ALLOCATION: THEORY Multi-Period Model The agent acts as a price-taker in asset markets and then chooses today s consumption and asset shares to maximise lifetime utility. This multi-period
More informationOperational Risk Aggregation
Operational Risk Aggregation Professor Carol Alexander Chair of Risk Management and Director of Research, ISMA Centre, University of Reading, UK. Loss model approaches are currently a focus of operational
More informationAnalyzing Oil Futures with a Dynamic Nelson-Siegel Model
Analyzing Oil Futures with a Dynamic Nelson-Siegel Model NIELS STRANGE HANSEN & ASGER LUNDE DEPARTMENT OF ECONOMICS AND BUSINESS, BUSINESS AND SOCIAL SCIENCES, AARHUS UNIVERSITY AND CENTER FOR RESEARCH
More informationSemiparametric Modeling, Penalized Splines, and Mixed Models
Semi 1 Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University http://wwworiecornelledu/~davidr January 24 Joint work with Babette Brumback, Ray Carroll, Brent Coull,
More informationA way to improve incremental 2-norm condition estimation
A way to improve incremental 2-norm condition estimation Jurjen Duintjer Tebbens Institute of Computer Science Academy of Sciences of the Czech Republic duintjertebbens@cs.cas.cz Miroslav Tůma Institute
More informationLogit Models for Binary Data
Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis These models are appropriate when the response
More informationARCH and GARCH models
ARCH and GARCH models Fulvio Corsi SNS Pisa 5 Dic 2011 Fulvio Corsi ARCH and () GARCH models SNS Pisa 5 Dic 2011 1 / 21 Asset prices S&P 500 index from 1982 to 2009 1600 1400 1200 1000 800 600 400 200
More informationBooth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay. Solutions to Midterm
Booth School of Business, University of Chicago Business 41202, Spring Quarter 2012, Mr. Ruey S. Tsay Solutions to Midterm Problem A: (34 pts) Answer briefly the following questions. Each question has
More informationResearch Memo: Adding Nonfarm Employment to the Mixed-Frequency VAR Model
Research Memo: Adding Nonfarm Employment to the Mixed-Frequency VAR Model Kenneth Beauchemin Federal Reserve Bank of Minneapolis January 2015 Abstract This memo describes a revision to the mixed-frequency
More informationLecture Note of Bus 41202, Spring 2008: More Volatility Models. Mr. Ruey Tsay
Lecture Note of Bus 41202, Spring 2008: More Volatility Models. Mr. Ruey Tsay The EGARCH model Asymmetry in responses to + & returns: g(ɛ t ) = θɛ t + γ[ ɛ t E( ɛ t )], with E[g(ɛ t )] = 0. To see asymmetry
More informationA RIDGE REGRESSION ESTIMATION APPROACH WHEN MULTICOLLINEARITY IS PRESENT
Fundamental Journal of Applied Sciences Vol. 1, Issue 1, 016, Pages 19-3 This paper is available online at http://www.frdint.com/ Published online February 18, 016 A RIDGE REGRESSION ESTIMATION APPROACH
More informationInferences on Correlation Coefficients of Bivariate Log-normal Distributions
Inferences on Correlation Coefficients of Bivariate Log-normal Distributions Guoyi Zhang 1 and Zhongxue Chen 2 Abstract This article considers inference on correlation coefficients of bivariate log-normal
More informationSemiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University
Semiparametric Modeling, Penalized Splines, and Mixed Models David Ruppert Cornell University Possible Model SBMD i,j is spinal bone mineral density on ith subject at age equal to age i,j lide http://wwworiecornelledu/~davidr
More informationThe mean-variance portfolio choice framework and its generalizations
The mean-variance portfolio choice framework and its generalizations Prof. Massimo Guidolin 20135 Theory of Finance, Part I (Sept. October) Fall 2014 Outline and objectives The backward, three-step solution
More informationMAFS Computational Methods for Pricing Structured Products
MAFS550 - Computational Methods for Pricing Structured Products Solution to Homework Two Course instructor: Prof YK Kwok 1 Expand f(x 0 ) and f(x 0 x) at x 0 into Taylor series, where f(x 0 ) = f(x 0 )
More information